BLASTX nr result
ID: Dioscorea21_contig00029200
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00029200
         (701 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containi...   183   3e-44
ref|XP_002334407.1| predicted protein [Populus trichocarpa] gi|2...   181   1e-43
ref|XP_002874971.1| pentatricopeptide repeat-containing protein ...   181   2e-43
ref|XP_002302689.1| predicted protein [Populus trichocarpa] gi|2...   180   2e-43
ref|XP_002515124.1| pentatricopeptide repeat-containing protein,...   179   5e-43

>ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Cucumis sativus]
 gi|449523383|ref|XP_004168703.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570-like [Cucumis sativus]
          Length = 803

 Score = 183 bits (465), Expect = 3e-44
 Identities = 88/179 (49%), Positives = 121/179 (67%), Gaps = 10/179 (5%)
 Frame = +2

Query: 194 RKAGLQQDFCNLFDKLCQL-GFSFNAWTYNICIHAFGSWGHLALALKLFKEMK------- 349
           RK  ++ +F  +FDKL  +  F F+ + YNICI+AFG WG+L  AL LFKEMK
Sbjct: 220 RKLDMRVEFKKVFDKLRAIESFEFSVYGYNICIYAFGCWGYLDTALSLFKEMKEKSLVSE 279

Query: 350 --SPDICSFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEA 523
             SPD+C++NS++H LCL G+V+DA+ V++E+K SG EPD FTYR +I GCCK+ R+D+A
Sbjct: 280 SFSPDLCTYNSIIHVLCLVGKVKDALIVWEELKGSGHEPDAFTYRIIIQGCCKSCRMDDA 339

Query: 524 LRVFREMEYNNVKCDTLVYNNIXXXXXXXXXXXXXCQMFDKMVSGGIGVSPYSYNILID 700
             +F EMEYN +  DT+VYN++             CQ+FDKMV   +  SP++YNILID
Sbjct: 340 TMIFNEMEYNGLIPDTIVYNSLLNGLFKARKVTEACQLFDKMVQEDVRASPWTYNILID 398

 Score = 58.5 bits (140), Expect = 1e-06
 Identities = 32/95 (33%), Positives = 53/95 (55%), Gaps = 4/95 (4%)
 Frame = +2

Query: 272 TYNICIHAFGSWGHLALALKLFKEMKSP----DICSFNSVLHALCLAGRVQDAVEVFDEM 439
           TYN+ I   G  G   LA  + +++       DI  +N++++AL  AGR+ D  ++F +M
Sbjct: 670 TYNVIIQGLGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFGQM 729

Query: 440 KSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           ++SG  PD  T+ TLI    KA R+ +A +  + M
Sbjct: 730 RNSGINPDVVTFNTLIEVHSKAGRLKDAYKFLKMM 764

 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 35/127 (27%), Positives = 60/127 (47%), Gaps = 4/127 (3%)
 Frame = +2

Query: 221 CNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKS----PDICSFNSVLHA 388
           C LF+    +G +   +TYN  + +F   G+   A  +F EM       DI ++N ++
Sbjct: 618 CKLFEIFSDMGVNPVKYTYNSMLSSFVKKGYFHQAWGIFNEMGENVCPADIATYNVIIQG 677

Query: 389 LCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREMEYNNVKCD 568
           L   GR   A  V +++   G   D   Y TLI    KA R+D+  ++F +M  + +  D
Sbjct: 678 LGKMGRADLASSVLEKLMEQGGYLDIVMYNTLINALGKAGRMDDVNKLFGQMRNSGINPD 737

Query: 569 TLVYNNI 589
            + +N +
Sbjct: 738 VVTFNTL 744

>ref|XP_002334407.1| predicted protein [Populus trichocarpa]
 gi|222872045|gb|EEF09176.1| predicted protein [Populus trichocarpa]
          Length = 513

 Score = 181 bits (459), Expect = 1e-43
 Identities = 85/179 (47%), Positives = 121/179 (67%), Gaps = 10/179 (5%)
 Frame = +2

Query: 194 RKAGLQQDFCNLFDKLC-QLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKS------ 352
           R   ++ +F  +F KL  ++GF  N W YNICIHAFG WG L  +L+LFKEMK
Sbjct: 191 RNGEMKVEFKTVFAKLRGKVGFELNTWGYNICIHAFGCWGDLTTSLRLFKEMKEKSLASG 250

Query: 353 ---PDICSFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEA 523
              PD+C++NS++H LCLAG+V+DAV V++E+K SG EPD FTYR LI GCCK+++++++
Sbjct: 251 SLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVSGHEPDAFTYRILIQGCCKSYQMEDS 310

Query: 524 LRVFREMEYNNVKCDTLVYNNIXXXXXXXXXXXXXCQMFDKMVSGGIGVSPYSYNILID 700
            ++F EM+YN    DT+VYN++             CQ+F+KMV  G+  S ++YNILID
Sbjct: 311 TKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKMVQDGVRASCWTYNILID 369

 Score = 58.9 bits (141), Expect = 9e-07
 Identities = 37/125 (29%), Positives = 59/125 (47%), Gaps = 4/125 (3%)
 Frame = +2

Query: 197 KAGLQQDFCNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKSP----DIC 364
           KA    + C LF+K+ Q G   + WTYNI I      G       LF  +K      D
Sbjct: 338 KARKVMEACQLFEKMVQDGVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQFVDAV 397

Query: 365 SFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           +++ V+  LC  G ++DA+ + +EM+  GF  D  T  +L++   K  R D   R+ + +
Sbjct: 398 TYSIVVLLLCRKGHLEDALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHI 457

Query: 545 EYNNV 559
              N+
Sbjct: 458 RDVNL 462

>ref|XP_002874971.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297320808|gb|EFH51230.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 802

 Score = 181 bits (458), Expect = 2e-43
 Identities = 87/180 (48%), Positives = 120/180 (66%), Gaps = 11/180 (6%)
 Frame = +2

Query: 194 RKAGLQQDFCNLFDKLCQLG-FSFNAWTYNICIHAFGSWGHLALALKLFKEMK------- 349
           R+A ++ +F  +F+KL  +  F F+ W+YNICIH FG WG L  AL LFKEMK
Sbjct: 221 RRADMRSEFKTVFEKLKGMNRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVSG 280

Query: 350 ---SPDICSFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDE 520
              +PDIC++NS++H LCL G+ +DA+ V+DE+K SG EPD  TYR LI GCCK++R+D+
Sbjct: 281 SSFAPDICTYNSLIHVLCLFGKAKDALIVWDELKVSGHEPDNSTYRILIQGCCKSYRMDD 340

Query: 521 ALRVFREMEYNNVKCDTLVYNNIXXXXXXXXXXXXXCQMFDKMVSGGIGVSPYSYNILID 700
           A+R+F EM+YN    DT+VYN +             CQ+F+KMV  G+  S ++YNILID
Sbjct: 341 AMRIFGEMQYNGFVPDTVVYNCLLDGTLKARKVTEACQLFEKMVQEGVRASCWTYNILID 400

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 40/133 (30%), Positives = 68/133 (51%), Gaps = 4/133 (3%)
 Frame = +2

Query: 197 KAGLQQDFCNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKSP----DIC 364
           KA    + C LF+K+ Q G   + WTYNI I      G       LF ++K      D
Sbjct: 369 KARKVTEACQLFEKMVQEGVRASCWTYNILIDGLFRNGRAEAGFTLFCDLKKKGQFVDAI 428

Query: 365 SFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           +F+ V+  LC  G++++AV++ +EM++ GF  D  T  +L++G  K  R D   ++ + +
Sbjct: 429 TFSIVVLQLCREGKLEEAVKLVEEMETRGFTVDLVTISSLLIGFHKQGRWDWKEKLMKHV 488

Query: 545 EYNNVKCDTLVYN 583
              N+  + L +N
Sbjct: 489 REGNLVPNVLRWN 501

 Score = 57.4 bits (137), Expect = 3e-06
 Identities = 38/109 (34%), Positives = 59/109 (54%), Gaps = 7/109 (6%)
 Frame = +2

Query: 239 LCQLGFSFNAW---TYNICIHAFGSWGHLALAL----KLFKEMKSPDICSFNSVLHALCL 397
           L Q+G +F A    TYN+ I   G  G   LA     +L K+    DI  +N++++A+
Sbjct: 651 LDQMGENFCAADIATYNVIIQGLGKMGRADLAGAVLDRLTKQGGYLDIVMYNTLINAIGK 710

Query: 398 AGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           A R+  A ++FD MKS+G  PD  +Y T+I    KA ++ EA +  + M
Sbjct: 711 ANRLDAATQLFDHMKSNGINPDVVSYNTMIEVNSKAGKLKEAYKYLKAM 759

>ref|XP_002302689.1| predicted protein [Populus trichocarpa]
 gi|222844415|gb|EEE81962.1| predicted protein [Populus trichocarpa]
          Length = 640

 Score = 180 bits (457), Expect = 2e-43
 Identities = 86/179 (48%), Positives = 120/179 (67%), Gaps = 10/179 (5%)
 Frame = +2

Query: 194 RKAGLQQDFCNLFDKLC-QLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKS------ 352
           R   ++ +F  +F KL  + GF  N W YNICIHAFG WG L  +L+LFKEMK
Sbjct: 55  RNGEMKVEFKTVFAKLRGKGGFELNTWGYNICIHAFGCWGDLTTSLRLFKEMKEKSLASG 114

Query: 353 ---PDICSFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEA 523
              PD+C++NS++H LCLAG+V+DAV V++E+K SG EPD FTYR LI GCCK++++++A
Sbjct: 115 SLDPDLCTYNSLIHVLCLAGKVKDAVIVYEELKVSGHEPDAFTYRILIQGCCKSYQMEDA 174

Query: 524 LRVFREMEYNNVKCDTLVYNNIXXXXXXXXXXXXXCQMFDKMVSGGIGVSPYSYNILID 700
            ++F EM+YN    DT+VYN++             CQ+F+KMV  G+  S ++YNILID
Sbjct: 175 TKIFSEMQYNGFLPDTVVYNSLLDGMFKARKVMEACQLFEKMVQDGVRASCWTYNILID 233

 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 42/125 (33%), Positives = 61/125 (48%), Gaps = 4/125 (3%)
 Frame = +2

Query: 221 CNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKS----PDICSFNSVLHA 388
           C LF+    +G    ++TYN  + +F   G+   A  +F EM      PDI ++N V+
Sbjct: 454 CKLFEIFTDMGVDPVSYTYNSIMSSFVKKGYFNRAWDVFNEMGEKVCPPDIATYNLVIQG 513

Query: 389 LCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREMEYNNVKCD 568
           L   GR   A  V D++   G   D   Y TLI    KA RIDEA  +F +M+ + +  D
Sbjct: 514 LGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQMKISGLNPD 573

Query: 569 TLVYN 583
            + YN
Sbjct: 574 VVTYN 578

 Score = 57.4 bits (137), Expect = 3e-06
 Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 4/125 (3%)
 Frame = +2

Query: 197 KAGLQQDFCNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKSP----DIC 364
           KA    + C LF+K+ Q G   + WTYNI I      G       LF  +K      D
Sbjct: 202 KARKVMEACQLFEKMVQDGVRASCWTYNILIDGLCKNGRAEAGYNLFCGLKKKGQFVDAV 261

Query: 365 SFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           +++ V+  LC  G +++A+ + +EM+  GF  D  T  +L++   K  R D   R+ + +
Sbjct: 262 TYSIVVLLLCRKGHLEEALHLVEEMEERGFVVDLITITSLLIAFHKQGRWDCTERLMKHI 321

Query: 545 EYNNV 559
              N+
Sbjct: 322 RDVNL 326

 Score = 57.0 bits (136), Expect = 4e-06
 Identities = 34/95 (35%), Positives = 50/95 (52%), Gaps = 4/95 (4%)
 Frame = +2

Query: 272 TYNICIHAFGSWGHLALAL----KLFKEMKSPDICSFNSVLHALCLAGRVQDAVEVFDEM 439
           TYN+ I   G  G   LA     KL K+    DI  +N+++ AL  AGR+ +A  +F++M
Sbjct: 506 TYNLVIQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNTLIDALGKAGRIDEANNLFEQM 565

Query: 440 KSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           K SG  PD  TY  +I    K  R+ +A +  + M
Sbjct: 566 KISGLNPDVVTYNIMIEVHSKTGRLKDAYKFLKMM 600

>ref|XP_002515124.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223545604|gb|EEF47108.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 898

 Score = 179 bits (454), Expect = 5e-43
 Identities = 83/178 (46%), Positives = 118/178 (66%), Gaps = 9/178 (5%)
 Frame = +2

Query: 194 RKAGLQQDFCNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKS------- 352
           RKA ++ +F  +FDKL  +GF  + W YNICIHAFG W  L  AL+LFKEMK
Sbjct: 236 RKADMRVEFKKVFDKLKGMGFELDTWGYNICIHAFGCWSDLGTALRLFKEMKEKSKGFGS 295

Query: 353 --PDICSFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEAL 526
             PD+C++NS++  LC +G+V+DA+ V++E+K SG EPD FTYR +I GC K++R+++A
Sbjct: 296 CCPDLCTYNSLIRLLCFSGKVKDALVVYEELKISGHEPDAFTYRIIIEGCSKSYRMNDAT 355

Query: 527 RVFREMEYNNVKCDTLVYNNIXXXXXXXXXXXXXCQMFDKMVSGGIGVSPYSYNILID 700
           ++F EM+YN    DT VYN++             CQ+F+KMV  G+  S ++YNILID
Sbjct: 356 KIFSEMQYNGFVPDTTVYNSLLDGMFKARKVTEACQLFEKMVQDGVRASSWTYNILID 413

 Score = 63.2 bits (152), Expect = 5e-08
 Identities = 41/116 (35%), Positives = 62/116 (53%), Gaps = 7/116 (6%)
 Frame = +2

Query: 218 FCNLFDKLCQLGFSF---NAWTYNICIHAFGSWGHLALAL----KLFKEMKSPDICSFNS 376
           F   +D L Q+G      +  TYN+ I   G  G   LA     KL K+    DI  +N+
Sbjct: 668 FSEAWDVLNQMGEKVCPSDIATYNLIIQGLGKMGRADLASSVLDKLMKQGGYLDIVMYNT 727

Query: 377 VLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           +++AL  AGR+ +  ++F++MK+SG  PD  TY TLI    KA R+ +A +  + M
Sbjct: 728 LINALGKAGRIDEVRKLFEQMKTSGINPDVVTYNTLIEVHTKAGRLKDAYKFLKMM 783

 Score = 62.4 bits (150), Expect = 9e-08
 Identities = 37/127 (29%), Positives = 62/127 (48%), Gaps = 4/127 (3%)
 Frame = +2

Query: 221 CNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKSP----DICSFNSVLHA 388
           C LF+    +G +  ++TYN  + +F   G+ + A  +  +M       DI ++N ++
Sbjct: 637 CKLFEIFSDMGVNPVSYTYNSIMSSFVKKGYFSEAWDVLNQMGEKVCPSDIATYNLIIQG 696

Query: 389 LCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREMEYNNVKCD 568
           L   GR   A  V D++   G   D   Y TLI    KA RIDE  ++F +M+ + +  D
Sbjct: 697 LGKMGRADLASSVLDKLMKQGGYLDIVMYNTLINALGKAGRIDEVRKLFEQMKTSGINPD 756

Query: 569 TLVYNNI 589
            + YN +
Sbjct: 757 VVTYNTL 763

 Score = 60.1 bits (144), Expect = 4e-07
 Identities = 35/125 (28%), Positives = 63/125 (50%), Gaps = 4/125 (3%)
 Frame = +2

Query: 197 KAGLQQDFCNLFDKLCQLGFSFNAWTYNICIHAFGSWGHLALALKLFKEMKSP----DIC 364
           KA    + C LF+K+ Q G   ++WTYNI I      G  A    LF ++K      D
Sbjct: 382 KARKVTEACQLFEKMVQDGVRASSWTYNILIDGLCKNGRSAAGYSLFCDLKKKGKFVDAI 441

Query: 365 SFNSVLHALCLAGRVQDAVEVFDEMKSSGFEPDRFTYRTLILGCCKAFRIDEALRVFREM 544
           +++ ++  LC  G++++A+ + +EM+  GF  D  T  +L++   K  R D   ++ + +
Sbjct: 442 TYSIIVLLLCREGQLKEALSLVEEMEERGFVVDLVTITSLLIAFHKQGRWDWTEKLMKHV 501

Query: 545 EYNNV 559
              N+
Sbjct: 502 RDGNL 506