BLASTX nr result

ID: Dioscorea21_contig00017824 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Dioscorea21_contig00017824
         (1950 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI32449.3| unnamed protein product [Vitis vinifera]              765   0.0  
ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi...   765   0.0  
ref|XP_002521838.1| pentatricopeptide repeat-containing protein,...   738   0.0  
ref|XP_004171087.1| PREDICTED: pentatricopeptide repeat-containi...   716   0.0  
ref|XP_004152074.1| PREDICTED: pentatricopeptide repeat-containi...   714   0.0  

>emb|CBI32449.3| unnamed protein product [Vitis vinifera]
          Length = 790

 Score =  765 bits (1976), Expect = 0.0
 Identities = 366/570 (64%), Positives = 455/570 (79%)
 Frame = -3

Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769
            KLAD++GK+RKF+KCRE+FD II  G VP ESTFHIL +AYLSA VQGCLDEAC IYNRM
Sbjct: 193  KLADYMGKERKFSKCREIFDDIIKQGLVPCESTFHILIIAYLSASVQGCLDEACGIYNRM 252

Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589
            IQLGGY+PRLSLHNSLFRALV +PGG SK++LKQAEFI+H LVT   E+HKD+Y GLIWL
Sbjct: 253  IQLGGYQPRLSLHNSLFRALVGQPGGSSKYFLKQAEFIFHNLVTFGFEIHKDVYGGLIWL 312

Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409
            HSYQD IDRERIA+LREEMQ AGIEESRDVL+S++RA SKEGDV+E E+ WLKL+ S   
Sbjct: 313  HSYQDTIDRERIASLREEMQLAGIEESRDVLLSILRACSKEGDVEEAEKTWLKLLHSDCA 372

Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229
             P Q FVYR+E+YAK+GEPMKSLEIF+ M+E+  S +V  Y+K                 
Sbjct: 373  IPSQGFVYRMEVYAKVGEPMKSLEIFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESL 432

Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049
                INSGMKPL  +++DL+ MY NL +HDKLE +F  CL++CRPNR+IY IY+ SLV  
Sbjct: 433  MTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQI 492

Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869
            GNL+KA EIF +M++N AIG++ +SCNTIL  YLS G+++KAEKIYDLMCQKKY I+   
Sbjct: 493  GNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPL 552

Query: 868  MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689
            MEKLDY+LSL RKV+KRPVS+KL  EQRE           ++SDEER+NH IYFEF+ NS
Sbjct: 553  MEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGLLLGGLQMESDEERKNHVIYFEFNENS 612

Query: 688  NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509
              HS L+ HIHE+++EWL SS++ ++ ++D+P +FSTI+HSYFGF+ADQFW +GRP+IPK
Sbjct: 613  GAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPK 672

Query: 508  LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329
            LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G++E +E++ +  +A+S+ C+VKRKG 
Sbjct: 673  LIHRWLSPRVLAYWYMYGGHRTSSGDILLKLK-GSREGVEKVVRTLKAQSMDCRVKRKGT 731

Query: 328  VFWIGFQGDNAVWFWKLTEPYILENVREFL 239
            VFWIG  G N+ WFWKL EPYIL++V++F+
Sbjct: 732  VFWIGLLGSNSTWFWKLIEPYILDDVKDFV 761


>ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like
            [Vitis vinifera]
          Length = 823

 Score =  765 bits (1976), Expect = 0.0
 Identities = 366/570 (64%), Positives = 455/570 (79%)
 Frame = -3

Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769
            KLAD++GK+RKF+KCRE+FD II  G VP ESTFHIL +AYLSA VQGCLDEAC IYNRM
Sbjct: 226  KLADYMGKERKFSKCREIFDDIIKQGLVPCESTFHILIIAYLSASVQGCLDEACGIYNRM 285

Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589
            IQLGGY+PRLSLHNSLFRALV +PGG SK++LKQAEFI+H LVT   E+HKD+Y GLIWL
Sbjct: 286  IQLGGYQPRLSLHNSLFRALVGQPGGSSKYFLKQAEFIFHNLVTFGFEIHKDVYGGLIWL 345

Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409
            HSYQD IDRERIA+LREEMQ AGIEESRDVL+S++RA SKEGDV+E E+ WLKL+ S   
Sbjct: 346  HSYQDTIDRERIASLREEMQLAGIEESRDVLLSILRACSKEGDVEEAEKTWLKLLHSDCA 405

Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229
             P Q FVYR+E+YAK+GEPMKSLEIF+ M+E+  S +V  Y+K                 
Sbjct: 406  IPSQGFVYRMEVYAKVGEPMKSLEIFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESL 465

Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049
                INSGMKPL  +++DL+ MY NL +HDKLE +F  CL++CRPNR+IY IY+ SLV  
Sbjct: 466  MTEFINSGMKPLMPSYIDLMNMYFNLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQI 525

Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869
            GNL+KA EIF +M++N AIG++ +SCNTIL  YLS G+++KAEKIYDLMCQKKY I+   
Sbjct: 526  GNLDKAEEIFNQMYSNGAIGVNTKSCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPL 585

Query: 868  MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689
            MEKLDY+LSL RKV+KRPVS+KL  EQRE           ++SDEER+NH IYFEF+ NS
Sbjct: 586  MEKLDYVLSLSRKVVKRPVSLKLSKEQREILIGLLLGGLQMESDEERKNHVIYFEFNENS 645

Query: 688  NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509
              HS L+ HIHE+++EWL SS++ ++ ++D+P +FSTI+HSYFGF+ADQFW +GRP+IPK
Sbjct: 646  GAHSVLRRHIHEQYHEWLNSSSKLSDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPK 705

Query: 508  LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329
            LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G++E +E++ +  +A+S+ C+VKRKG 
Sbjct: 706  LIHRWLSPRVLAYWYMYGGHRTSSGDILLKLK-GSREGVEKVVRTLKAQSMDCRVKRKGT 764

Query: 328  VFWIGFQGDNAVWFWKLTEPYILENVREFL 239
            VFWIG  G N+ WFWKL EPYIL++V++F+
Sbjct: 765  VFWIGLLGSNSTWFWKLIEPYILDDVKDFV 794


>ref|XP_002521838.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538876|gb|EEF40474.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 835

 Score =  738 bits (1905), Expect = 0.0
 Identities = 361/596 (60%), Positives = 458/596 (76%)
 Frame = -3

Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769
            KLAD++GK+RKFAKCRE+FD II+ GRVPSESTFHIL +AYLSAPVQGCL+EACTIYNRM
Sbjct: 237  KLADYMGKERKFAKCREIFDDIINQGRVPSESTFHILIIAYLSAPVQGCLEEACTIYNRM 296

Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589
            IQLGGY+PRLSLHNSLFRALVSKPGG +KHYLKQAEFIYH LVT  LE+  DIY GLIWL
Sbjct: 297  IQLGGYQPRLSLHNSLFRALVSKPGGFAKHYLKQAEFIYHNLVTSGLEIQNDIYGGLIWL 356

Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409
            HSYQDNID+ RIA++REEM+ AGI E R++L+S+MRA SKEGDV+E ER WLKL+   G 
Sbjct: 357  HSYQDNIDKVRIASIREEMKQAGIMEGREILLSIMRACSKEGDVEEAERTWLKLLQVDGG 416

Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229
             P QAFVYR+E++AK+GE MKSLE F+ M+E   S ++A Y+K                 
Sbjct: 417  LPTQAFVYRMEVFAKLGEHMKSLETFREMQELLGSSSIAAYHKIIEVVSQAQEVELAESL 476

Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049
                I SG+KPL  +F DL+ MYLNL +H+KLE +F  CL+ CRPNR+IY +YL SLV  
Sbjct: 477  MQEFIKSGLKPLMPSFTDLMNMYLNLNLHEKLESTFFACLENCRPNRNIYNVYLDSLVKV 536

Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869
            GNL+KA E F  M +N A+G++ +SCNTIL  YLSSG++VKAEKIYDLMCQKKYDIEP  
Sbjct: 537  GNLDKAEEAFNNMCSNEAVGVNIRSCNTILRGYLSSGDYVKAEKIYDLMCQKKYDIEPSL 596

Query: 868  MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689
            MEKLDY+LSL RKV+K+P+S+KL  +QRE           ++SD+ R+ H I FEF+ NS
Sbjct: 597  MEKLDYVLSLSRKVVKKPLSLKLSKDQREILVGLLLGGLRVESDDNRKKHMIRFEFNENS 656

Query: 688  NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509
            + H+ L+ H++++++EWL  S + ++  +    RFSTI+HSYF F+A+QFW KG+P+IPK
Sbjct: 657  STHAILRRHLYDKYHEWLHPSCKLSDGSDGASYRFSTISHSYFSFYAEQFWPKGQPMIPK 716

Query: 508  LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329
            LIHRWLSP+VLA+WYMY G RTS+GDILLKLK G++E +E++ K  ++KSL CKVKRKGR
Sbjct: 717  LIHRWLSPQVLAFWYMYAGHRTSSGDILLKLK-GSREGVEKVFKTLKSKSLNCKVKRKGR 775

Query: 328  VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDND 161
            VFWIGF G+++VWFWKL EPYIL++++ FL      +    +     +FDS SD++
Sbjct: 776  VFWIGFLGNDSVWFWKLVEPYILDDLKLFLKAGDQTLEYSAEN---INFDSGSDSE 828


>ref|XP_004171087.1| PREDICTED: pentatricopeptide repeat-containing protein
            At2g15820-like, partial [Cucumis sativus]
          Length = 747

 Score =  716 bits (1847), Expect = 0.0
 Identities = 358/597 (59%), Positives = 449/597 (75%)
 Frame = -3

Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769
            KLAD++GK+RKF+KCRE+FD II+ G VPSESTFHIL VAYLSAPVQGC++EA TIYNRM
Sbjct: 149  KLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCVEEASTIYNRM 208

Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589
            IQLGGY+PRLSLH+SLFRALVSKPG LSKH+LKQAEFIYH LVT  LE+HKDIY GLIWL
Sbjct: 209  IQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDIYGGLIWL 268

Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409
            HSYQD IDRERI +LR+EMQ AGI+E R+VL+S++RA SK GDV E E+ W +L    G 
Sbjct: 269  HSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAEKLWQELKYLDGN 328

Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229
             P QAFVY++E+YAK+G+PMK+LEIF+ M++   S N A Y                   
Sbjct: 329  MPSQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STNAAAYQTIIGILCKFQVIELAESI 387

Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049
                I S +KPLT A++DL+ M+ NL + DKLE++F+ CL++C+PNR+IY IYL SLV  
Sbjct: 388  MAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKCKPNRTIYSIYLDSLVKV 447

Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869
            GNL++A EIF +M TN  IGI+A+SCN IL  YL  G ++KAEKIYDLMCQK+YDI+P  
Sbjct: 448  GNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAEKIYDLMCQKRYDIDPPL 507

Query: 868  MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689
            MEKL+YILSL RK +K+P+S+KL  EQRE           I+SDEER+NH I FEF  N 
Sbjct: 508  MEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDEERKNHRIQFEFHRNC 567

Query: 688  NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509
              HS L+ HI+E++++WL S+++  + D DIP +F T++HSYFGF+ADQFW +GR  IP 
Sbjct: 568  KTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYFGFYADQFWPRGRRAIPN 627

Query: 508  LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329
            LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G+ E +E+I K  + KS+ CKVKRKG 
Sbjct: 628  LIHRWLSPRVLAYWYMYGGCRTSSGDILLKLK-GSHEGVEKIVKSLREKSIHCKVKRKGN 686

Query: 328  VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDNDE 158
            ++WIG  G NA WFWKL EP+IL+ ++E    +S  +     G+   +FDSESD+ E
Sbjct: 687  MYWIGLLGTNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGSENINFDSESDSVE 743


>ref|XP_004152074.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820-like
            [Cucumis sativus]
          Length = 797

 Score =  714 bits (1843), Expect = 0.0
 Identities = 356/597 (59%), Positives = 449/597 (75%)
 Frame = -3

Query: 1948 KLADHLGKDRKFAKCREMFDAIISHGRVPSESTFHILTVAYLSAPVQGCLDEACTIYNRM 1769
            KLAD++GK+RKF+KCRE+FD II+ G VPSESTFHIL VAYLSAPVQGC++EA TIYNRM
Sbjct: 199  KLADYMGKERKFSKCREVFDDIINQGCVPSESTFHILIVAYLSAPVQGCIEEASTIYNRM 258

Query: 1768 IQLGGYRPRLSLHNSLFRALVSKPGGLSKHYLKQAEFIYHQLVTMELEVHKDIYAGLIWL 1589
            IQLGGY+PRLSLH+SLFRALVSKPG LSKH+LKQAEFIYH LVT  LE+HKD+Y GLIWL
Sbjct: 259  IQLGGYQPRLSLHSSLFRALVSKPGDLSKHHLKQAEFIYHNLVTSGLELHKDMYGGLIWL 318

Query: 1588 HSYQDNIDRERIAALREEMQCAGIEESRDVLISLMRAFSKEGDVDETERAWLKLIDSGGI 1409
            HSYQD IDRERI +LR+EMQ AGI+E R+VL+S++RA SK GDV E E+ W +L    G 
Sbjct: 319  HSYQDTIDRERIVSLRKEMQQAGIKEEREVLLSILRASSKMGDVMEAEKLWQELKYLDGN 378

Query: 1408 APFQAFVYRIELYAKIGEPMKSLEIFKGMKEKGISINVAVYNKXXXXXXXXXXXXXXXXX 1229
             P QAFVY++E+YAK+G+PMK+LEIF+ M++   S N A Y                   
Sbjct: 379  MPSQAFVYKMEVYAKMGKPMKALEIFREMEQLN-STNAAAYQTIIGILCKFQVIELAESI 437

Query: 1228 XXXXINSGMKPLTSAFLDLLKMYLNLGVHDKLEVSFANCLQRCRPNRSIYYIYLKSLVGN 1049
                I S +KPLT A++DL+ M+ NL + DKLE++F+ CL++C+PNR+IY IYL SLV  
Sbjct: 438  MAGFIESNLKPLTPAYVDLMNMFFNLNLDDKLELTFSQCLEKCKPNRTIYSIYLDSLVKV 497

Query: 1048 GNLEKAVEIFYEMHTNPAIGIHAQSCNTILGAYLSSGEFVKAEKIYDLMCQKKYDIEPQY 869
            GNL++A EIF +M TN  IGI+A+SCN IL  YL  G ++KAEKIYDLMCQK+YDI+P  
Sbjct: 498  GNLDRAEEIFSQMETNGEIGINARSCNIILRGYLLCGNYMKAEKIYDLMCQKRYDIDPPL 557

Query: 868  MEKLDYILSLKRKVIKRPVSMKLDPEQREXXXXXXXXXXXIQSDEERRNHAIYFEFSGNS 689
            MEKL+YILSL RK +K+P+S+KL  EQRE           I+SD+ER+NH I FEF  N 
Sbjct: 558  MEKLEYILSLSRKEVKKPMSLKLSKEQREILVGLLLGGLEIESDDERKNHRIQFEFHRNC 617

Query: 688  NVHSRLKIHIHERFYEWLKSSNESANRDNDIPDRFSTIAHSYFGFFADQFWLKGRPVIPK 509
              HS L+ HI+E++++WL S+++  + D DIP +F T++HSYFGF+ADQFW +GR  IP 
Sbjct: 618  KTHSVLRRHIYEQYHKWLHSASKLTDGDVDIPYKFCTVSHSYFGFYADQFWPRGRRAIPN 677

Query: 508  LIHRWLSPRVLAYWYMYGGIRTSAGDILLKLKGGNQEDLERIAKVFQAKSLTCKVKRKGR 329
            LIHRWLSPRVLAYWYMYGG RTS+GDILLKLK G+ E +E+I K  + KS+ CKVKRKG 
Sbjct: 678  LIHRWLSPRVLAYWYMYGGCRTSSGDILLKLK-GSHEGVEKIVKSLREKSIHCKVKRKGN 736

Query: 328  VFWIGFQGDNAVWFWKLTEPYILENVREFLTPESDAMRNEPKGNLFTDFDSESDNDE 158
            ++WIG  G NA WFWKL EP+IL+ ++E    +S  +     G+   +FDSESD+ E
Sbjct: 737  MYWIGLLGSNATWFWKLIEPFILDYLKESTQADSLNLVGVLNGSENINFDSESDSVE 793


Top