BLASTX nr result
ID: Dioscorea21_contig00039476
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00039476 (473 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD27693.1| pentatricopeptide (PPR) repeat-containing protei... 202 3e-50 ref|XP_003572787.1| PREDICTED: pentatricopeptide repeat-containi... 202 3e-50 gb|EEC73741.1| hypothetical protein OsI_08374 [Oryza sativa Indi... 199 1e-49 ref|XP_002452788.1| hypothetical protein SORBIDRAFT_04g032540 [S... 190 9e-47 gb|AFW72694.1| hypothetical protein ZEAMMB73_533387 [Zea mays] 188 3e-46 >dbj|BAD27693.1| pentatricopeptide (PPR) repeat-containing protein-like [Oryza sativa Japonica Group] gi|222623393|gb|EEE57525.1| hypothetical protein OsJ_07836 [Oryza sativa Japonica Group] Length = 667 Score = 202 bits (513), Expect = 3e-50 Identities = 89/157 (56%), Positives = 123/157 (78%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D AT+AS+LPAY+ SAD+ + N+HCFL+ GFLRSTE TGL+D Y+K G LDAA LF Sbjct: 411 DSATMASILPAYAESADLKEGKNIHCFLLTLGFLRSTEIATGLIDVYSKAGDLDAAWALF 470 Query: 182 EGLNTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVD 361 + L KD VAW+ II+GYG HGHA+TAI L +RM+E+G +PN VT ++LY+CSHAG++D Sbjct: 471 QWLPEKDVVAWTTIIAGYGIHGHARTAILLYDRMVESGGKPNTVTIATLLYACSHAGMID 530 Query: 362 EGVHIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLEE 472 EG+ +F+ M H + P+ +HY+C+VD++GRAG++EE Sbjct: 531 EGIKVFKDMRNVHGLMPNGEHYSCLVDMLGRAGRIEE 567 Score = 62.8 bits (151), Expect = 3e-08 Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 2/116 (1%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D+AT+ SVLPA + + D+N +H + G L+D Y K L+ A+ +F Sbjct: 208 DRATVVSVLPACAQAKDLNTGRAVHRLVEDKGLGDYVAVKNALIDMYGKCRSLEDARRVF 267 Query: 182 EGL-NTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIE-PNEVTFTSVLYSCS 343 + + KD V+W+A+I Y + A AISL +M+ +G PN VT +L +C+ Sbjct: 268 DHCKHDKDVVSWTAMIGAYVLNDRAFEAISLGCQMLMSGAAWPNGVTMVYLLSACA 323 Score = 61.2 bits (147), Expect = 8e-08 Identities = 33/134 (24%), Positives = 67/134 (50%) Frame = +2 Query: 71 LHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGLNTKDYVAWSAIISGYGKHGH 250 +HC + GF T L+ Y G + AA+ +F + + V+W+A+I+G K+G+ Sbjct: 130 VHCRALAAGFGGDTYVQNALISMYMSCGDVGAAEAVFGAMRNRTVVSWNAVIAGCVKNGY 189 Query: 251 AKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGVHIFQQMIQAHQVKPSLDHYA 430 A+ A+ + M G+ + T SVL +C+ A ++ G ++++ + + Sbjct: 190 AERALEVFGEMAADGVGIDRATVVSVLPACAQAKDLNTG-RAVHRLVEDKGLGDYVAVKN 248 Query: 431 CVVDLIGRAGKLEE 472 ++D+ G+ LE+ Sbjct: 249 ALIDMYGKCRSLED 262 Score = 58.5 bits (140), Expect = 5e-07 Identities = 40/153 (26%), Positives = 72/153 (47%) Frame = +2 Query: 11 TIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGL 190 T+ +L A ++ A H I+ G T L+DAYA+ G + + E Sbjct: 314 TMVYLLSACASMPSGKHAKCTHALCIRLGLKSDIAVETALIDAYARCGKMKLMRLTLERG 373 Query: 191 NTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGV 370 + + W+A +SGY G K AI L +RMI + P+ T S+L + + + + EG Sbjct: 374 SWRAET-WNAALSGYTVSGREKKAIELFKRMIAESVRPDSATMASILPAYAESADLKEGK 432 Query: 371 HIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLE 469 +I ++ ++ S + ++D+ +AG L+ Sbjct: 433 NIHCFLLTLGFLR-STEIATGLIDVYSKAGDLD 464 >ref|XP_003572787.1| PREDICTED: pentatricopeptide repeat-containing protein At5g39350-like [Brachypodium distachyon] Length = 594 Score = 202 bits (513), Expect = 3e-50 Identities = 89/157 (56%), Positives = 122/157 (77%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D AT+AS+LPAY+ SAD+ QA N+HC+L+ GFLRSTE TTGL++ YAK G LD + LF Sbjct: 341 DSATMASILPAYAESADVRQATNIHCYLLTLGFLRSTEITTGLINVYAKAGDLDVSWSLF 400 Query: 182 EGLNTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVD 361 +GL KD VAW+ +I+GYG HG A+T+I L RM++ G++PN VTF S+LY+CSH G+VD Sbjct: 401 DGLPEKDVVAWTTVIAGYGMHGQAQTSILLYNRMVQLGVKPNTVTFASLLYACSHVGMVD 460 Query: 362 EGVHIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLEE 472 EG+ +F+ M H V P+ DHY+ +VD++GRAG++EE Sbjct: 461 EGLQLFEDMRGIHGVMPNADHYSSLVDMVGRAGRIEE 497 Score = 57.8 bits (138), Expect = 9e-07 Identities = 34/117 (29%), Positives = 59/117 (50%), Gaps = 3/117 (2%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D+AT+ SVLPA + + +++ +H + + G L+D Y K L+ A+++F Sbjct: 138 DRATVVSVLPACAQAKNLSIGRAVHQLVEERGLADYAAVKNALIDMYGKCRNLEGARKVF 197 Query: 182 EGLN-TKDYVAWSAIISGYGKHGHAKTAISLLERMI--EAGIEPNEVTFTSVLYSCS 343 + KD V+W+ +I Y + H + A +L M+ PN VT +L +CS Sbjct: 198 DDHKYDKDVVSWTVMIGAYVLNDHVEEAFALGHEMLMTSGAPWPNGVTMAYLLSACS 254 Score = 56.2 bits (134), Expect = 3e-06 Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 4/137 (2%) Frame = +2 Query: 71 LHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGLNTKDYVAWSAIISGYGKHGH 250 +HC + GF T L+ Y G + AA+ +F + + V+W+A+I+G K+ Sbjct: 60 VHCRALAAGFGDDTYVQNALISMYMGCGDVAAAEAVFCAMQNRTVVSWNAVIAGCVKNDC 119 Query: 251 AKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGVHIFQQMIQAHQVKPSLDHYA 430 A+ A+ + M G E + T SVL +C+ A + G + Q + + L YA Sbjct: 120 AERALEVFGEMAGDGTEIDRATVVSVLPACAQAKNLSIGRAVHQLVEER-----GLADYA 174 Query: 431 CV----VDLIGRAGKLE 469 V +D+ G+ LE Sbjct: 175 AVKNALIDMYGKCRNLE 191 >gb|EEC73741.1| hypothetical protein OsI_08374 [Oryza sativa Indica Group] Length = 667 Score = 199 bits (507), Expect = 1e-49 Identities = 88/157 (56%), Positives = 122/157 (77%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D AT+AS+LPAY+ SAD+ + N+HCFL+ GFLRSTE TGL+D Y+K G LDAA LF Sbjct: 411 DSATMASILPAYAESADLKEGKNIHCFLLTLGFLRSTEIATGLIDVYSKAGDLDAAWALF 470 Query: 182 EGLNTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVD 361 + L KD VAW+ II+GY HGHA+TAI L +RM+E+G +PN VT ++LY+CSHAG++D Sbjct: 471 QWLPEKDVVAWTTIIAGYSIHGHARTAILLYDRMVESGGKPNTVTIATLLYACSHAGMID 530 Query: 362 EGVHIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLEE 472 EG+ +F+ M H + P+ +HY+C+VD++GRAG++EE Sbjct: 531 EGIKVFKDMRNVHGLMPNGEHYSCLVDMLGRAGRIEE 567 Score = 62.8 bits (151), Expect = 3e-08 Identities = 39/116 (33%), Positives = 62/116 (53%), Gaps = 2/116 (1%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D+AT+ SVLPA + + D+N +H + G L+D Y K L+ A+ +F Sbjct: 208 DRATVVSVLPACAQAKDLNTGRAVHRLVEDKGLGDYVAVKNALIDMYGKCRSLEDARRVF 267 Query: 182 EGL-NTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIE-PNEVTFTSVLYSCS 343 + + KD V+W+A+I Y + A AISL +M+ +G PN VT +L +C+ Sbjct: 268 DHCKHDKDVVSWTAMIGAYVLNDRAFEAISLGCQMLMSGAAWPNGVTMVYLLSACA 323 Score = 61.2 bits (147), Expect = 8e-08 Identities = 33/134 (24%), Positives = 67/134 (50%) Frame = +2 Query: 71 LHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGLNTKDYVAWSAIISGYGKHGH 250 +HC + GF T L+ Y G + AA+ +F + + V+W+A+I+G K+G+ Sbjct: 130 VHCRALAAGFGGDTYVQNALISMYMSCGDVGAAEAVFGAMRNRTVVSWNAVIAGCVKNGY 189 Query: 251 AKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGVHIFQQMIQAHQVKPSLDHYA 430 A+ A+ + M G+ + T SVL +C+ A ++ G ++++ + + Sbjct: 190 AERALEVFGEMAADGVGIDRATVVSVLPACAQAKDLNTG-RAVHRLVEDKGLGDYVAVKN 248 Query: 431 CVVDLIGRAGKLEE 472 ++D+ G+ LE+ Sbjct: 249 ALIDMYGKCRSLED 262 Score = 58.5 bits (140), Expect = 5e-07 Identities = 40/153 (26%), Positives = 72/153 (47%) Frame = +2 Query: 11 TIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGL 190 T+ +L A ++ A H I+ G T L+DAYA+ G + + E Sbjct: 314 TMVYLLSACASMPSGKHAKCTHALCIRLGLKSDIAVETALIDAYARCGKMKLMRLTLERG 373 Query: 191 NTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGV 370 + + W+A +SGY G K AI L +RMI + P+ T S+L + + + + EG Sbjct: 374 SWRAET-WNAALSGYTVSGREKKAIELFKRMIAESVRPDSATMASILPAYAESADLKEGK 432 Query: 371 HIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLE 469 +I ++ ++ S + ++D+ +AG L+ Sbjct: 433 NIHCFLLTLGFLR-STEIATGLIDVYSKAGDLD 464 >ref|XP_002452788.1| hypothetical protein SORBIDRAFT_04g032540 [Sorghum bicolor] gi|241932619|gb|EES05764.1| hypothetical protein SORBIDRAFT_04g032540 [Sorghum bicolor] Length = 662 Score = 190 bits (483), Expect = 9e-47 Identities = 88/157 (56%), Positives = 116/157 (73%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D AT+ASV+PAY+ SAD+ QA N+HC L+ G L ST+ TGL++ YAK G L A ELF Sbjct: 411 DSATMASVIPAYAESADLVQAKNIHCCLLIRGCLGSTDIATGLINVYAKAGDLGVAWELF 470 Query: 182 EGLNTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVD 361 + L KD VAW+ +I+GYG HGHA+TAI L RMIE G+ PN VT S++YSCSHAG+VD Sbjct: 471 QCLPEKDVVAWTTVIAGYGMHGHAQTAILLYSRMIEMGVTPNTVTMASLMYSCSHAGMVD 530 Query: 362 EGVHIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLEE 472 EG+ +F M H + P+ +HY C+VD++GRAG++EE Sbjct: 531 EGLRLFNDMRGVHGLMPNAEHYLCLVDMLGRAGRIEE 567 Score = 67.8 bits (164), Expect = 9e-10 Identities = 38/116 (32%), Positives = 61/116 (52%), Gaps = 2/116 (1%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D+AT+ SVLPA + + D++ +H + G L+D Y K L+ AK +F Sbjct: 208 DRATVVSVLPACAQARDLHMGRAVHRLAVVRGLGNYAAVKNALIDMYGKCRSLEDAKRVF 267 Query: 182 -EGLNTKDYVAWSAIISGYGKHGHAKTAISL-LERMIEAGIEPNEVTFTSVLYSCS 343 E KD V+W+A+I Y + HA A +L E ++ + +PN VT +L +C+ Sbjct: 268 DEDSYDKDVVSWTAMIGAYVLNDHASKAFALGSEMLVTSEAQPNAVTMVHLLSACT 323 >gb|AFW72694.1| hypothetical protein ZEAMMB73_533387 [Zea mays] Length = 663 Score = 188 bits (478), Expect = 3e-46 Identities = 87/157 (55%), Positives = 116/157 (73%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D AT+ASV+PAY+ SAD+ QA N+HC L+ G L ST+ TGL+D YAK G L A ELF Sbjct: 412 DSATMASVIPAYAESADLVQANNIHCCLLVRGCLVSTDIATGLIDLYAKAGDLGVAWELF 471 Query: 182 EGLNTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVD 361 + L KD VAW+ +I+GYG HGHA+TA+ L RM+E G+ PN VT S+L+SCSHAG+VD Sbjct: 472 QCLPEKDVVAWTTVIAGYGMHGHAQTAMLLYSRMVELGVMPNTVTIASLLHSCSHAGMVD 531 Query: 362 EGVHIFQQMIQAHQVKPSLDHYACVVDLIGRAGKLEE 472 EG+ +F M H + P+ +HY C+VD++GRAG++EE Sbjct: 532 EGLRLFNDMHGVHGLMPNAEHYLCLVDMLGRAGRIEE 568 Score = 67.4 bits (163), Expect = 1e-09 Identities = 36/116 (31%), Positives = 62/116 (53%), Gaps = 2/116 (1%) Frame = +2 Query: 2 DQATIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELF 181 D+A++ SVLPA + + D++ +H + G + L+D Y K G L+ A+ +F Sbjct: 209 DRASVVSVLPACAQARDLHTGRAVHRLAVVRGLGKYVAVKNALIDMYGKCGSLEDARRVF 268 Query: 182 -EGLNTKDYVAWSAIISGYGKHGHAKTAISL-LERMIEAGIEPNEVTFTSVLYSCS 343 E KD V+W+ +I Y + HA A +L E ++ + +PN VT +L +C+ Sbjct: 269 DEDSYDKDVVSWTVMIGAYVLNDHASKAFALGSEMLVSSEAQPNAVTMAHLLSACA 324 Score = 55.1 bits (131), Expect = 6e-06 Identities = 35/138 (25%), Positives = 65/138 (47%), Gaps = 4/138 (2%) Frame = +2 Query: 71 LHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGLNTKDYVAWSAIISGYGKHGH 250 +H + GF +V Y + + AA+ +F L ++ V+W+ +I+G K G Sbjct: 131 VHGRALAAGFGSDAYVQNAIVSMYMRCRDVAAAEAVFVALPSRTTVSWNTVITGCVKDGR 190 Query: 251 AKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGVHIFQQMIQAHQVKPSLDHYA 430 A+ A+ + E M++ G+ + + SVL +C+ A + G + + V L Y Sbjct: 191 AERALEVFETMVDRGVCIDRASVVSVLPACAQARDLHTG-----RAVHRLAVVRGLGKYV 245 Query: 431 CV----VDLIGRAGKLEE 472 V +D+ G+ G LE+ Sbjct: 246 AVKNALIDMYGKCGSLED 263 Score = 54.7 bits (130), Expect = 8e-06 Identities = 40/152 (26%), Positives = 69/152 (45%) Frame = +2 Query: 11 TIASVLPAYSNSADMNQAMNLHCFLIKTGFLRSTETTTGLVDAYAKTGGLDAAKELFEGL 190 T+A +L A ++ A H I+ G T LVD YAK G + + E Sbjct: 315 TMAHLLSACASLLSGKHAKCTHALCIRLGLGSDIVVETALVDCYAKCGYMGVIDMVVEK- 373 Query: 191 NTKDYVAWSAIISGYGKHGHAKTAISLLERMIEAGIEPNEVTFTSVLYSCSHAGLVDEGV 370 ++ W+A ISGY + K A++L +RM+ + P+ T SV+ + + + + + Sbjct: 374 GSRRTETWNAAISGYTQRDQGKKALALFKRMLAESVRPDSATMASVIPAYAESADLVQAN 433 Query: 371 HIFQQMIQAHQVKPSLDHYACVVDLIGRAGKL 466 +I ++ S D ++DL +AG L Sbjct: 434 NIHCCLL-VRGCLVSTDIATGLIDLYAKAGDL 464