BLASTX nr result
ID: Dioscorea21_contig00017824
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Dioscorea21_contig00017824 (1950 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI32449.3| unnamed protein product [Vitis vinifera] 765 0.0 ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi... 765 0.0 ref|XP_002521838.1| pentatricopeptide repeat-containing protein,... 738 0.0 ref|XP_004171087.1| PREDICTED: pentatricopeptide repeat-containi... 716 0.0 ref|XP_004152074.1| PREDICTED: pentatricopeptide repeat-containi... 714 0.0 >emb|CBI32449.3| unnamed protein product [Vitis vinifera] Length = 790 Score = 765 bits (1976), Expect = 0.0 Identities = 366/570 (64%), Positives = 455/570 (79%) Frame = -3 Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769 KLAD++GK+RKF+KCRE+FD II G VP ESTFHIL +AYLSA VQGCLDEAC IYNRM Sbjct: 193 KLADYMGKERKFSKCREIFDDIIKQGLVPCESTFHILIIAYLSASVQGCLDEACGIYNRM 252 Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589 IQLGGY+PRLSLHNSLFRALV +PGG SK++LKQAEFI+H LVT E+HKD+Y GLIWL Sbjct: 253 IQLGGYQPRLSLHNSLFRALVGQPGGSSKYFLKQAEFIFHNLVTFGFEIHKDVYGGLIWL 312 Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409 HSYQD IDRERIA+LREEMQ AGIEESRDVL+S++RA SKEGDV+E E+ WLKL+ S Sbjct: 313 HSYQDTIDRERIASLREEMQLAGIEESRDVLLSILRACSKEGDVEEAEKTWLKLLHSDCA 372 Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229 P Q FVYR+E+YAK+GEPMKSLEIF+ M+E+ S +V Y+K Sbjct: 373 IPSQGFVYRMEVYAKVGEPMKSLEIFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESL 432 Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049 INSGMKPL +++DL+ MY NL +HDKLE +F CL++CRPNR+IY IY+ SLV Sbjct: 433 MTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQI 492 Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869 GNL+KA EIF +M++N AIG++ +SCNTIL YLS G+++KAEKIYDLMCQKKY I+ Sbjct: 493 GNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPL 552 Query: 868 MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689 MEKLDY+LSL RKV+KRPVS+KL EQRE ++SDEER+NH IYFEF+ NS Sbjct: 553 MEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGLLLGGLQMESDEERKNHVIYFEFNENS 612 Query: 688 NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509 HS L+ HIHE+++EWL SS++ ++ ++D+P +FSTI+HSYFGF+ADQFW +GRP+IPK Sbjct: 613 GAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPK 672 Query: 508 LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329 LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G++E +E++ + +A+S+ C+VKRKG Sbjct: 673 LIHRWLSPRVLAYWYMYGGHRTSSGDILLKLK-GSREGVEKVVRTLKAQSMDCRVKRKGT 731 Query: 328 VFWIGFQGDNAVWFWKLTEPYILENVREFL 239 VFWIG G N+ WFWKL EPYIL++V++F+ Sbjct: 732 VFWIGLLGSNSTWFWKLIEPYILDDVKDFV 761 >ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like [Vitis vinifera] Length = 823 Score = 765 bits (1976), Expect = 0.0 Identities = 366/570 (64%), Positives = 455/570 (79%) Frame = -3 Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769 KLAD++GK+RKF+KCRE+FD II G VP ESTFHIL +AYLSA VQGCLDEAC IYNRM Sbjct: 226 KLADYMGKERKFSKCREIFDDIIKQGLVPCESTFHILIIAYLSASVQGCLDEACGIYNRM 285 Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589 IQLGGY+PRLSLHNSLFRALV +PGG SK++LKQAEFI+H LVT E+HKD+Y GLIWL Sbjct: 286 IQLGGYQPRLSLHNSLFRALVGQPGGSSKYFLKQAEFIFHNLVTFGFEIHKDVYGGLIWL 345 Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409 HSYQD IDRERIA+LREEMQ AGIEESRDVL+S++RA SKEGDV+E E+ WLKL+ S Sbjct: 346 HSYQDTIDRERIASLREEMQLAGIEESRDVLLSILRACSKEGDVEEAEKTWLKLLHSDCA 405 Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229 P Q FVYR+E+YAK+GEPMKSLEIF+ M+E+ S +V Y+K Sbjct: 406 IPSQGFVYRMEVYAKVGEPMKSLEIFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESL 465 Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049 INSGMKPL +++DL+ MY NL +HDKLE +F CL++CRPNR+IY IY+ SLV Sbjct: 466 MTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQI 525 Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869 GNL+KA EIF +M++N AIG++ +SCNTIL YLS G+++KAEKIYDLMCQKKY I+ Sbjct: 526 GNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPL 585 Query: 868 MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689 MEKLDY+LSL RKV+KRPVS+KL EQRE ++SDEER+NH IYFEF+ NS Sbjct: 586 MEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGLLLGGLQMESDEERKNHVIYFEFNENS 645 Query: 688 NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509 HS L+ HIHE+++EWL SS++ ++ ++D+P +FSTI+HSYFGF+ADQFW +GRP+IPK Sbjct: 646 GAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPK 705 Query: 508 LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329 LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G++E +E++ + +A+S+ C+VKRKG Sbjct: 706 LIHRWLSPRVLAYWYMYGGHRTSSGDILLKLK-GSREGVEKVVRTLKAQSMDCRVKRKGT 764 Query: 328 VFWIGFQGDNAVWFWKLTEPYILENVREFL 239 VFWIG G N+ WFWKL EPYIL++V++F+ Sbjct: 765 VFWIGLLGSNSTWFWKLIEPYILDDVKDFV 794 >ref|XP_002521838.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538876|gb|EEF40474.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 835 Score = 738 bits (1905), Expect = 0.0 Identities = 361/596 (60%), Positives = 458/596 (76%) Frame = -3 Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769 KLAD++GK+RKFAKCRE+FD II+ GRVPSESTFHIL +AYLSAPVQGCL+EACTIYNRM Sbjct: 237 KLADYMGKERKFAKCREIFDDIINQGRVPSESTFHILIIAYLSAPVQGCLEEACTIYNRM 296 Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589 IQLGGY+PRLSLHNSLFRALVSKPGG +KHYLKQAEFIYH LVT LE+ DIY GLIWL Sbjct: 297 IQLGGYQPRLSLHNSLFRALVSKPGGFAKHYLKQAEFIYHNLVTSGLEIQNDIYGGLIWL 356 Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409 HSYQDNID+ RIA++REEM+ AGI E R++L+S+MRA SKEGDV+E ER WLKL+ G Sbjct: 357 HSYQDNIDKVRIASIREEMKQAGIMEGREILLSIMRACSKEGDVEEAERTWLKLLQVDGG 416 Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229 P QAFVYR+E++AK+GE MKSLE F+ M+E S ++A Y+K Sbjct: 417 LPTQAFVYRMEVFAKLGEHMKSLETFREMQELLGSSSIAAYHKIIEVVSQAQEVELAESL 476 Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049 I SG+KPL +F DL+ MYLNL +H+KLE +F CL+ CRPNR+IY +YL SLV Sbjct: 477 MQEFIKSGLKPLMPSFTDLMNMYLNLNLHEKLESTFFACLENCRPNRNIYNVYLDSLVKV 536 Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869 GNL+KA E F M +N A+G++ +SCNTIL YLSSG++VKAEKIYDLMCQKKYDIEP Sbjct: 537 GNLDKAEEAFNNMCSNEAVGVNIRSCNTILRGYLSSGDYVKAEKIYDLMCQKKYDIEPSL 596 Query: 868 MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689 MEKLDY+LSL RKV+K+P+S+KL +QRE ++SD+ R+ H I FEF+ NS Sbjct: 597 MEKLDYVLSLSRKVVKKPLSLKLSKDQREILVGLLLGGLRVESDDNRKKHMIRFEFNENS 656 Query: 688 NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509 + H+ L+ H++++++EWL S + ++ + RFSTI+HSYF F+A+QFW KG+P+IPK Sbjct: 657 STHAILRRHLYDKYHEWLHPSCKLSDGSDGASYRFSTISHSYFSFYAEQFWPKGQPMIPK 716 Query: 508 LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329 LIHRWLSP+VLA+WYMY G RTS+GDILLKLK G++E +E++ K ++KSL CKVKRKGR Sbjct: 717 LIHRWLSPQVLAFWYMYAGHRTSSGDILLKLK-GSREGVEKVFKTLKSKSLNCKVKRKGR 775 Query: 328 VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDND 161 VFWIGF G+++VWFWKL EPYIL++++ FL + + +FDS SD++ Sbjct: 776 VFWIGFLGNDSVWFWKLVEPYILDDLKLFLKAGDQTLEYSAEN---INFDSGSDSE 828 >ref|XP_004171087.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like, partial [Cucumis sativus] Length = 747 Score = 716 bits (1847), Expect = 0.0 Identities = 358/597 (59%), Positives = 449/597 (75%) Frame = -3 Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769 KLAD++GK+RKF+KCRE+FD II+ G VPSESTFHIL VAYLSAPVQGC++EA TIYNRM Sbjct: 149 KLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCVEEASTIYNRM 208 Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589 IQLGGY+PRLSLH+SLFRALVSKPG LSKH+LKQAEFIYH LVT LE+HKDIY GLIWL Sbjct: 209 IQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWL 268 Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409 HSYQD IDRERI +LR+EMQ AGI+E R+VL+S++RA SK GDV E E+ W +L G Sbjct: 269 HSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAEKLWQELKYLDGN 328 Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229 P QAFVY++E+YAK+G+PMK+LEIF+ M++ S N A Y Sbjct: 329 MPSQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STNAAAYQTIIGILCKFQVIELAESI 387 Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049 I S +KPLT A++DL+ M+ NL + DKLE++F+ CL++C+PNR+IY IYL SLV Sbjct: 388 MAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKCKPNRTIYSIYLDSLVKV 447 Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869 GNL++A EIF +M TN IGI+A+SCN IL YL G ++KAEKIYDLMCQK+YDI+P Sbjct: 448 GNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAEKIYDLMCQKRYDIDPPL 507 Query: 868 MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689 MEKL+YILSL RK +K+P+S+KL EQRE I+SDEER+NH I FEF N Sbjct: 508 MEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEERKNHRIQFEFHRNC 567 Query: 688 NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509 HS L+ HI+E++++WL S+++ + D DIP +F T++HSYFGF+ADQFW +GR IP Sbjct: 568 KTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYFGFYADQFWPRGRRAIPN 627 Query: 508 LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329 LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G+ E +E+I K + KS+ CKVKRKG Sbjct: 628 LIHRWLSPRVLAYWYMYGGCRTSSGDILLKLK-GSHEGVEKIVKSLREKSIHCKVKRKGN 686 Query: 328 VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDNDE 158 ++WIG G NA WFWKL EP+IL+ ++E +S + G+ +FDSESD+ E Sbjct: 687 MYWIGLLGTNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGSENINFDSESDSVE 743 >ref|XP_004152074.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like [Cucumis sativus] Length = 797 Score = 714 bits (1843), Expect = 0.0 Identities = 356/597 (59%), Positives = 449/597 (75%) Frame = -3 Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769 KLAD++GK+RKF+KCRE+FD II+ G VPSESTFHIL VAYLSAPVQGC++EA TIYNRM Sbjct: 199 KLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRM 258 Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589 IQLGGY+PRLSLH+SLFRALVSKPG LSKH+LKQAEFIYH LVT LE+HKD+Y GLIWL Sbjct: 259 IQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDMYGGLIWL 318 Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409 HSYQD IDRERI +LR+EMQ AGI+E R+VL+S++RA SK GDV E E+ W +L G Sbjct: 319 HSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAEKLWQELKYLDGN 378 Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229 P QAFVY++E+YAK+G+PMK+LEIF+ M++ S N A Y Sbjct: 379 MPSQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STNAAAYQTIIGILCKFQVIELAESI 437 Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049 I S +KPLT A++DL+ M+ NL + DKLE++F+ CL++C+PNR+IY IYL SLV Sbjct: 438 MAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKCKPNRTIYSIYLDSLVKV 497 Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869 GNL++A EIF +M TN IGI+A+SCN IL YL G ++KAEKIYDLMCQK+YDI+P Sbjct: 498 GNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAEKIYDLMCQKRYDIDPPL 557 Query: 868 MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689 MEKL+YILSL RK +K+P+S+KL EQRE I+SD+ER+NH I FEF N Sbjct: 558 MEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDDERKNHRIQFEFHRNC 617 Query: 688 NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509 HS L+ HI+E++++WL S+++ + D DIP +F T++HSYFGF+ADQFW +GR IP Sbjct: 618 KTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYFGFYADQFWPRGRRAIPN 677 Query: 508 LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329 LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G+ E +E+I K + KS+ CKVKRKG Sbjct: 678 LIHRWLSPRVLAYWYMYGGCRTSSGDILLKLK-GSHEGVEKIVKSLREKSIHCKVKRKGN 736 Query: 328 VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDNDE 158 ++WIG G NA WFWKL EP+IL+ ++E +S + G+ +FDSESD+ E Sbjct: 737 MYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGSENINFDSESDSVE 793