BLASTX nr result
ID: Dioscorea21_contig00012008
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00012008
         (1205 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score     E
Sequences producing significant alignments:                           (bits)  Value

ref|XP_002278451.1| PREDICTED: pentatricopeptide repeat-containi...    451    e-124
ref|XP_002519842.1| pentatricopeptide repeat-containing protein,...    437    e-120
ref|XP_004148730.1| PREDICTED: pentatricopeptide repeat-containi...    434    e-119
ref|XP_003608637.1| Pentatricopeptide repeat-containing protein ...    429    e-118
ref|NP_199470.1| pentatricopeptide repeat-containing protein [Ar...    428    e-117

>ref|XP_002278451.1| PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic [Vitis vinifera]
 Length = 723

 Score = 451 bits (1159), Expect = e-124
 Identities = 225/353 (63%), Positives = 271/353 (76%), Gaps = 5/353 (1%)
 Frame = +1

Query: 160  SLSDQLKPLAVTLL-KDSPKPSAIDDLHPPKPIWINPSRPKPSVLSLRRHERRPYSHNPN 336
            SLS+QLKPL+ T+L +D + + + PK WINP++PKPSVLSL+RH+R YS+NP
Sbjct: 74   SLSEQLKPLSKTILTRDHSGQTHL--VSKPKSTWINPTKPKPSVLSLQRHKRHNYSYNPQ 131

Query: 337  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX----VLLNSLRPWSKAHLFFLYLKA 504
            +LLNSL+PW K +LFF ++K
Sbjct: 132  IRDLKLFAKKINESESSDESEFLAVLEQIPHPPTRDNALLLLNSLKPWPKTYLFFNWIKT 191

Query: 505  NSSSPLETLFYNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFD 684
            + P+ET+FYNV MK+LR GRQF L+E+LA+EM+ GVELDN+TYSTIIT AKRC+ FD
Sbjct: 192  QNLFPMETIFYNVTMKSLRFGRQFQLIEELANEMISTGVELDNITYSTIITCAKRCNLFD 251

Query: 685  KAILWFERMYETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLA 864
            KA+ WFERMY+TG+MPDEVTYSA+LDVYAKLGK EEV+ LYER RA+GW PD +AF+VL
Sbjct: 252  KAVKWFERMYKTGLMPDEVTYSAILDVYAKLGKVEEVLSLYERGRASGWKPDPIAFAVLG 311

Query: 865  KMFGQAGDYDGIRYVLQEMDNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVS 1044
            KMFG+AGDYDGIRYVLQEM +LGV+PNL+VYNTLLEA+G AGKPGLARSLFE+MV +GV
Sbjct: 312  KMFGEAGDYDGIRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGVI 371

Query: 1045 PDEKTLTALIKIYGKARWSRDALQLWERMRSNKWPMDFILYNTLLSMCADVGL 1203
            PD KTLTAL+KIYGKARW+RDAL+LWERMRSN WPMDFILYNTLLSMCAD+GL
Sbjct: 372  PDAKTLTALVKIYGKARWARDALELWERMRSNGWPMDFILYNTLLSMCADLGL 424

 Score = 78.2 bits (191), Expect = 4e-12
 Identities = 49/182 (26%), Positives = 88/182 (48%), Gaps = 1/182 (0%)
 Frame = +1

Query: 583  VEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWFERMYETGVMPDEVTYSAVLD 762
            + + EM LGV+ + V Y+T++ + + A FE M +GV+PD T +A++
Sbjct: 323  IRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGLARSLFEEMVGSGVIPDAKTLTALVK 382

Query: 763  VYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLAKMFGQAGDYDGIRYVLQEMDNLG-VK 939
            +Y K + + L+ER R+ GW D + ++ L M G + + ++M +
Sbjct: 383  IYGKARWARDALELWERMRSNGWPMDFILYNTLLSMCADLGLEEEAEKLFEDMKKSEHCR 442

Query: 940  PNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKTLTALIKIYGKARWSRDALQL 1119
            P+ Y +L G+ G A LF++M GV + T L + G+AR D +++
Sbjct: 443  PDSWSYTAMLNIYGSGGNVDRAMQLFDEMSELGVQINVMGCTCLSQCLGRARRIDDLVKV 502

Query: 1120 WE 1125
            +E
Sbjct: 503  FE 504

 Score = 74.7 bits (182), Expect = 4e-11
 Identities = 50/193 (25%), Positives = 89/193 (46%), Gaps = 1/193 (0%)
 Frame = +1

Query: 535  YNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWFERMY 714
            YN ++A+ + L L EM+ GV D T + ++ + A+ +ERM
Sbjct: 342  YNTLLEAMGKAGKPGLARSLFEEMVGSGVIPDAKTLTALVKIYGKARWARDALELWERMR 401

Query: 715  ETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWI-PDQVAFSVLAKMFGQAGDY 891
            G D + Y+ +L + A LG +EE L+E + + PD +++ + ++G G+
Sbjct: 402  SNGWPMDFILYNTLLSMCADLGLEEEAEKLFEDMKKSEHCRPDSWSYTAMLNIYGSGGNV 461

Query: 892  DGIRYVLQEMDNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKTLTAL 1071
            D + EM LGV+ N++ L + LG A + +FE + GV PD++ L
Sbjct: 462  DRAMQLFDEMSELGVQINVMGCTCLSQCLGRARRIDDLVKVFEVSLERGVKPDDRLCGCL 521

Query: 1072 IKIYGKARWSRDA 1110
            + + + DA
Sbjct: 522  LSVVSFCEGAEDA 534

>ref|XP_002519842.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223540888|gb|EEF42446.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 Length = 677

 Score = 437 bits (1124), Expect = e-120
 Identities = 221/377 (58%), Positives = 272/377 (72%), Gaps = 8/377 (2%)
 Frame = +1

Query: 97   RATIVCNS---SRAXXXXXXXXXXSLSDQLKPLAVTLLKDSPKPSAIDD--LHPPKPIWI 261
            R TI CNS S+ SLSDQLKPL+ T L + + L P W+
Sbjct: 46   RLTISCNSRKTSKHLSESPNPKNPSLSDQLKPLSSTTLSTTTTTTTTKHSLLSKPNSTWV 105

Query: 262  NPSRPKPSVLSLRRHERRPYSHNPNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 441
            NP++PK SVLSL+R +R PYS NP
Sbjct: 106  NPTKPKRSVLSLQRQKRSPYSLNPKVKELRLFAQKLNDCDSTESSFLSLLEQIPYPLTRE 165

Query: 442  ---VLLNSLRPWSKAHLFFLYLKANSSSPLETLFYNVAMKALRAGRQFSLVEQLAHEMLD 612
            ++LNSLRPW KAHLFF ++K +S P+ET+FYNV MK+LR GRQF L+++LA+EM+
Sbjct: 166  NALLILNSLRPWQKAHLFFNWIKTQNSFPVETIFYNVTMKSLRFGRQFELIDKLANEMVS 225

Query: 613  LGVELDNVTYSTIITTAKRCHEFDKAILWFERMYETGVMPDEVTYSAVLDVYAKLGKKEE 792
            +ELDN+TYSTIIT AKRC+ FD A+ WFERMY+TG+MPDEVTYSA+LDVYAKLG+ EE
Sbjct: 226  NEIELDNITYSTIITCAKRCNRFDMALEWFERMYKTGLMPDEVTYSAILDVYAKLGRVEE 285

Query: 793  VIGLYERARAAGWIPDQVAFSVLAKMFGQAGDYDGIRYVLQEMDNLGVKPNLIVYNTLLE 972
            V+ LYER A+GW PD + FSVLAKMFG+AGDYDGIRYVLQEM +L V+PN++VYNTLLE
Sbjct: 286  VLSLYERGVASGWKPDPITFSVLAKMFGEAGDYDGIRYVLQEMKSLAVQPNVVVYNTLLE 345

Query: 973  ALGNAGKPGLARSLFEDMVSAGVSPDEKTLTALIKIYGKARWSRDALQLWERMRSNKWPM 1152
            A+G AGKPGLARSLF++MV +G++P+EKTLTAL KIYGKARW++DA++LWERMRSN WPM
Sbjct: 346  AMGKAGKPGLARSLFDEMVESGLTPNEKTLTALAKIYGKARWAKDAMELWERMRSNDWPM 405

Query: 1153 DFILYNTLLSMCADVGL 1203
            DFILYNTLLSMCAD+G+
Sbjct: 406  DFILYNTLLSMCADLGM 422

>ref|XP_004148730.1| PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like [Cucumis sativus]
 gi|449521148|ref|XP_004167592.1| PREDICTED: pentatricopeptide repeat-containing protein At5g46580, chloroplastic-like [Cucumis sativus]
 Length = 710

 Score = 434 bits (1117), Expect = e-119
 Identities = 217/382 (56%), Positives = 272/382 (71%), Gaps = 11/382 (2%)
 Frame = +1

Query: 91   SSRATIVCNSSRAXXXXXXXXXXS-------LSDQLKPLAVTLLKDSPKPSAIDDLHPPK 249
            + R T++C+SS++ S LS+QLK L+ T L ++P L PK
Sbjct: 30   TKRLTLLCSSSKSPRKPSSVSSQSVDNKNPSLSEQLKNLSTTTLSNAPNDET-RLLSKPK 88

Query: 250  PIWINPSRPKPSVLSLRRHERRPYSHNPNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 429
            W+NP++PK SVLSL+R +R YS+NP
Sbjct: 89   STWVNPTKPKRSVLSLQRQKRSSYSYNPKMRDLKSFAHKLNACDSSDDASFIAALEEIPH 148

Query: 430  XXXX----VLLNSLRPWSKAHLFFLYLKANSSSPLETLFYNVAMKALRAGRQFSLVEQLA 597
            ++LNSLRPW K HLFF ++K+ + P+ET+FYNVAMK+LR GRQF L+E LA
Sbjct: 149  PPTKENALLILNSLRPWQKTHLFFNWIKSQNLFPMETIFYNVAMKSLRYGRQFQLIEDLA 208

Query: 598  HEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWFERMYETGVMPDEVTYSAVLDVYAKL 777
            +EM+ G+ELDN+TYSTIIT AK+C FDKA+ WFERMY+TG+MPDEVTYSA+LDVYA L
Sbjct: 209  NEMISAGIELDNITYSTIITCAKKCSRFDKAMEWFERMYKTGLMPDEVTYSAILDVYANL 268

Query: 778  GKKEEVIGLYERARAAGWIPDQVAFSVLAKMFGQAGDYDGIRYVLQEMDNLGVKPNLIVY 957
            GK EEV+ LYER RA+GW PD FSVL KMFG+AGDYDGI YVLQEM ++ ++PNL+VY
Sbjct: 269  GKVEEVLSLYERGRASGWTPDPYTFSVLGKMFGEAGDYDGIMYVLQEMKSIEMQPNLVVY 328

Query: 958  NTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKTLTALIKIYGKARWSRDALQLWERMRS 1137
            NTLL+A+G AGKPG ARSLF++MV +G++P+EKTLTAL+KIYGKARW+RDAL LWERMRS
Sbjct: 329  NTLLDAMGKAGKPGFARSLFDEMVESGITPNEKTLTALVKIYGKARWARDALDLWERMRS 388

Query: 1138 NKWPMDFILYNTLLSMCADVGL 1203
            N WPMDFILYNTLL+MCAD+GL
Sbjct: 389  NGWPMDFILYNTLLNMCADLGL 410

 Score = 69.7 bits (169), Expect = 1e-09
 Identities = 45/192 (23%), Positives = 89/192 (46%), Gaps = 1/192 (0%)
 Frame = +1

Query: 535  YNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWFERMY 714
            YN + A+ + L EM++ G+ + T + ++ + A+ +ERM
Sbjct: 328  YNTLLDAMGKAGKPGFARSLFDEMVESGITPNEKTLTALVKIYGKARWARDALDLWERMR 387

Query: 715  ETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWI-PDQVAFSVLAKMFGQAGDY 891
            G D + Y+ +L++ A LG +EE L+E + + PD +++ + ++G G+
Sbjct: 388  SNGWPMDFILYNTLLNMCADLGLEEEAETLFEEMKKSKHSRPDSWSYTAMLNIYGSGGNV 447

Query: 892  DGIRYVLQEMDNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKTLTAL 1071
            + +EM LGV+ N++ L++ LG +G+ +F V G+ PD++ L
Sbjct: 448  KRSMELFEEMLELGVEINVMCCTCLIQCLGKSGRIDDLVRVFNVSVQKGIKPDDRLCGCL 507

Query: 1072 IKIYGKARWSRD 1107
            + + S D
Sbjct: 508  LSVLSLCYNSED 519

>ref|XP_003608637.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355509692|gb|AES90834.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 Length = 715

 Score = 429 bits (1104), Expect = e-118
 Identities = 210/353 (59%), Positives = 261/353 (73%), Gaps = 5/353 (1%)
 Frame = +1

Query: 160  SLSDQLKPLAVTLLKDSPKPSAIDDLHPPKPIWINPSRPKPSVLSLRRHERRPYSHNPNX 339
            SLSDQL LA T L P+ L PKP W+NP++ K VLS +RH+R S+NP
Sbjct: 64   SLSDQLASLANTTLSTVPENQP-KVLSKPKPTWVNPTKTKRPVLSHQRHKRSSVSYNPQL 122

Query: 340  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-----VLLNSLRPWSKAHLFFLYLKA 504
            ++LNSLRPW K H+FF ++K
Sbjct: 123  REFQRFAQRLNNCDVSSSDEEFMVCLEEIPSSLTRGNALLVLNSLRPWQKTHMFFNWIKT 182

Query: 505  NSSSPLETLFYNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFD 684
            + P+ET+FYNV MK+LR GRQF ++E+LAH+M+D GVELDN+TYSTII+ AK+C+ FD
Sbjct: 183  QNLLPMETIFYNVTMKSLRFGRQFGIIEELAHQMIDGGVELDNITYSTIISCAKKCNLFD 242

Query: 685  KAILWFERMYETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLA 864
            KA+ WFERMY+TG+MPDEVT+SA+LDVYA+LGK EEV+ L+ER RA GW PD + FSVL
Sbjct: 243  KAVYWFERMYKTGLMPDEVTFSAILDVYARLGKVEEVVNLFERGRATGWKPDPITFSVLG 302

Query: 865  KMFGQAGDYDGIRYVLQEMDNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVS 1044
            KMFG+AGDYDGIRYVLQEM +LGV+PNL+VYNTLLEA+G AGKPG ARSLFE+M+ +G++
Sbjct: 303  KMFGEAGDYDGIRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGFARSLFEEMIDSGIA 362

Query: 1045 PDEKTLTALIKIYGKARWSRDALQLWERMRSNKWPMDFILYNTLLSMCADVGL 1203
            P+EKTLTA+IKIYGKARWS+DAL+LW+RM+ N WPMDFILYNTLL+MCADVGL
Sbjct: 363  PNEKTLTAVIKIYGKARWSKDALELWKRMKENGWPMDFILYNTLLNMCADVGL 415

 Score = 80.1 bits (196), Expect = 1e-12
 Identities = 50/202 (24%), Positives = 97/202 (48%), Gaps = 1/202 (0%)
 Frame = +1

Query: 523  ETLFYNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWF 702
            + + ++V K + + + EM LGV+ + V Y+T++ + + A F
Sbjct: 294  DPITFSVLGKMFGEAGDYDGIRYVLQEMKSLGVQPNLVVYNTLLEAMGKAGKPGFARSLF 353

Query: 703  ERMYETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLAKMFGQA 882
            E M ++G+ P+E T +AV+ +Y K ++ + L++R + GW D + ++ L M
Sbjct: 354  EEMIDSGIAPNEKTLTAVIKIYGKARWSKDALELWKRMKENGWPMDFILYNTLLNMCADV 413

Query: 883  GDYDGIRYVLQEM-DNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKT 1059
            G + + ++M + KP+ Y +L G+ G A LFE+M G+ +
Sbjct: 414  GLIEEAETLFRDMKQSEHCKPDSWSYTAMLNIYGSEGAVDKAMKLFEEMSKFGIELNVMG 473

Query: 1060 LTALIKIYGKARWSRDALQLWE 1125
            T LI+ GKA D +++++
Sbjct: 474  CTCLIQCLGKAMEIDDLVKVFD 495

>ref|NP_199470.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 gi|75180372|sp|Q9LS25.1|PP420_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At5g46580, chloroplastic; Flags: Precursor
 gi|8885599|dbj|BAA97529.1| unnamed protein product [Arabidopsis thaliana]
 gi|332008017|gb|AED95400.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
 Length = 711

 Score = 428 bits (1100), Expect = e-117
 Identities = 213/352 (60%), Positives = 261/352 (74%), Gaps = 4/352 (1%)
 Frame = +1

Query: 160  SLSDQLKPLAVTLLKDSPKPSAIDDLHPPKPIWINPSRPKPSVLSLRRHERRPYSHNPNX 339
            SLS+QLKPL+ T L+ L PK +W+NP+RPK SVLSL+R +R YS+NP
Sbjct: 64   SLSEQLKPLSATTLRQEQTQI----LSKPKSVWVNPTRPKRSVLSLQRQKRSAYSYNPQI 119

Query: 340  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX----VLLNSLRPWSKAHLFFLYLKAN 507
            ++LNSLR W K H FF ++K+
Sbjct: 120  KDLRAFALKLNSSIFTEKSEFLSLLDEIPHPPNRDNALLVLNSLREWQKTHTFFNWVKSK 179

Query: 508  SSSPLETLFYNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDK 687
            S P+ET+FYNV MK+LR GRQF L+E++A EM+ GVELDN+TYSTIIT AKRC+ ++K
Sbjct: 180  SLFPMETIFYNVTMKSLRFGRQFQLIEEMALEMVKDGVELDNITYSTIITCAKRCNLYNK 239

Query: 688  AILWFERMYETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLAK 867
            AI WFERMY+TG+MPDEVTYSA+LDVY+K GK EEV+ LYERA A GW PD +AFSVL K
Sbjct: 240  AIEWFERMYKTGLMPDEVTYSAILDVYSKSGKVEEVLSLYERAVATGWKPDAIAFSVLGK 299

Query: 868  MFGQAGDYDGIRYVLQEMDNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSP 1047
            MFG+AGDYDGIRYVLQEM ++ VKPN++VYNTLLEA+G AGKPGLARSLF +M+ AG++P
Sbjct: 300  MFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLFNEMLEAGLTP 359

Query: 1048 DEKTLTALIKIYGKARWSRDALQLWERMRSNKWPMDFILYNTLLSMCADVGL 1203
            +EKTLTAL+KIYGKARW+RDALQLWE M++ KWPMDFILYNTLL+MCAD+GL
Sbjct: 360  NEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADIGL 411

 Score = 82.0 bits (201), Expect = 3e-13
 Identities = 55/224 (24%), Positives = 103/224 (45%), Gaps = 1/224 (0%)
 Frame = +1

Query: 523  ETLFYNVAMKALRAGRQFSLVEQLAHEMLDLGVELDNVTYSTIITTAKRCHEFDKAILWF 702
            + + ++V K + + + EM + V+ + V Y+T++ R + A F
Sbjct: 290  DAIAFSVLGKMFGEAGDYDGIRYVLQEMKSMDVKPNVVVYNTLLEAMGRAGKPGLARSLF 349

Query: 703  ERMYETGVMPDEVTYSAVLDVYAKLGKKEEVIGLYERARAAGWIPDQVAFSVLAKMFGQA 882
            M E G+ P+E T +A++ +Y K + + L+E +A W D + ++ L M
Sbjct: 350  NEMLEAGLTPNEKTLTALVKIYGKARWARDALQLWEEMKAKKWPMDFILYNTLLNMCADI 409

Query: 883  GDYDGIRYVLQEM-DNLGVKPNLIVYNTLLEALGNAGKPGLARSLFEDMVSAGVSPDEKT 1059
            G + + +M +++ +P+ Y +L G+ GK A LFE+M+ AGV +
Sbjct: 410  GLEEEAERLFNDMKESVQCRPDNFSYTAMLNIYGSGGKAEKAMELFEEMLKAGVQVNVMG 469

Query: 1060 LTALIKIYGKARWSRDALQLWERMRSNKWPMDFILYNTLLSMCA 1191
            T L++ GKA+ D + +++ D L LLS+ A
Sbjct: 470  CTCLVQCLGKAKRIDDVVYVFDLSIKRGVKPDDRLCGCLLSVMA 513