BLASTX nr result

ID: Forsythia21_contig00013551 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00013551
         (970 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI32449.3| unnamed protein product [Vitis vinifera]              463   e-128
ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containi...   463   e-128
ref|XP_002521838.1| pentatricopeptide repeat-containing protein,...   433   e-118
ref|XP_007042349.1| Endonucleases isoform 2 [Theobroma cacao] gi...   431   e-118
ref|XP_007042348.1| Pentatricopeptide repeat-containing protein ...   431   e-118
ref|XP_008236630.1| PREDICTED: pentatricopeptide repeat-containi...   420   e-115
ref|XP_012072457.1| PREDICTED: pentatricopeptide repeat-containi...   419   e-114
ref|XP_011088297.1| PREDICTED: pentatricopeptide repeat-containi...   416   e-113
ref|XP_011088296.1| PREDICTED: pentatricopeptide repeat-containi...   416   e-113
ref|XP_010258255.1| PREDICTED: pentatricopeptide repeat-containi...   411   e-112
ref|XP_009379614.1| PREDICTED: pentatricopeptide repeat-containi...   410   e-112
ref|XP_010999833.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-111
ref|XP_002313087.2| hypothetical protein POPTR_0009s11000g [Popu...   407   e-111
ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_010040589.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   401   e-109
gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus g...   401   e-109
gb|KCW45077.1| hypothetical protein EUGRSUZ_L01320 [Eucalyptus g...   401   e-109
emb|CDO97140.1| unnamed protein product [Coffea canephora]            400   e-109
gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum]   400   e-109
gb|KDO49164.1| hypothetical protein CISIN_1g003913mg [Citrus sin...   400   e-109

>emb|CBI32449.3| unnamed protein product [Vitis vinifera]
          Length = 790

 Score =  463 bits (1192), Expect = e-128
 Identities = 220/322 (68%), Positives = 268/322 (83%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+EQ+GS SV AY+KIIEVL KA++ EL ESLMTEFINSG+KPLMPS+I LM MY 
Sbjct: 397  IFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESLMTEFINSGMKPLMPSYIDLMNMYF 456

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+E+ F++CLE CRPNR +YN+Y+DSLV I NL+KAEEIFN+MY N AIGVNT+
Sbjct: 457  NLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQIGNLDKAEEIFNQMYSNGAIGVNTK 516

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+C   +KAEKIYDLMCQKK+ I++ LMEKLDY+LSL  +VVK+P+S+KLS
Sbjct: 517  SCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPLMEKLDYVLSLSRKVVKRPVSLKLS 576

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGLQ+E D                G HSVL+RHIH+Q+ EWL  + KL
Sbjct: 577  KEQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKL 636

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
            +DD DD+P +FSTISHS+F FYA+QF  +GRP+IPKLIHRWLSPRVLAYWYMYGGHRTS+
Sbjct: 637  SDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSS 696

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+EGV+++V+TLK
Sbjct: 697  GDILLKLKGSREGVEKVVRTLK 718


>ref|XP_002281969.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Vitis vinifera]
          Length = 823

 Score =  463 bits (1192), Expect = e-128
 Identities = 220/322 (68%), Positives = 268/322 (83%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+EQ+GS SV AY+KIIEVL KA++ EL ESLMTEFINSG+KPLMPS+I LM MY 
Sbjct: 430  IFREMQEQLGSTSVVAYHKIIEVLSKAQEIELVESLMTEFINSGMKPLMPSYIDLMNMYF 489

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+E+ F++CLE CRPNR +YN+Y+DSLV I NL+KAEEIFN+MY N AIGVNT+
Sbjct: 490  NLSLHDKLEAAFYECLEKCRPNRAIYNIYMDSLVQIGNLDKAEEIFNQMYSNGAIGVNTK 549

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+C   +KAEKIYDLMCQKK+ I++ LMEKLDY+LSL  +VVK+P+S+KLS
Sbjct: 550  SCNTILSGYLSCGDYLKAEKIYDLMCQKKYAIDAPLMEKLDYVLSLSRKVVKRPVSLKLS 609

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGLQ+E D                G HSVL+RHIH+Q+ EWL  + KL
Sbjct: 610  KEQREILIGLLLGGLQMESDEERKNHVIYFEFNENSGAHSVLRRHIHEQYHEWLNSSSKL 669

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
            +DD DD+P +FSTISHS+F FYA+QF  +GRP+IPKLIHRWLSPRVLAYWYMYGGHRTS+
Sbjct: 670  SDDNDDVPYKFSTISHSYFGFYADQFWPRGRPMIPKLIHRWLSPRVLAYWYMYGGHRTSS 729

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+EGV+++V+TLK
Sbjct: 730  GDILLKLKGSREGVEKVVRTLK 751


>ref|XP_002521838.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223538876|gb|EEF40474.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 835

 Score =  433 bits (1113), Expect = e-118
 Identities = 210/321 (65%), Positives = 259/321 (80%)
 Frame = -3

Query: 965  FRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYSD 786
            FR M+E +GS+S+ AY+KIIEV+ +A++ ELAESLM EFI SGLKPLMPSF  LM MY +
Sbjct: 442  FREMQELLGSSSIAAYHKIIEVVSQAQEVELAESLMQEFIKSGLKPLMPSFTDLMNMYLN 501

Query: 785  LDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTRS 606
            L+LH+K+ESTFF CLENCRPNR +YN+YLDSLV + NL+KAEE FN M  NEA+GVN RS
Sbjct: 502  LNLHEKLESTFFACLENCRPNRNIYNVYLDSLVKVGNLDKAEEAFNNMCSNEAVGVNIRS 561

Query: 605  CNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLSK 426
            CNTILRGYL+    VKAEKIYDLMCQKK+DIE SLMEKLDY+LSL  +VVKKP+S+KLSK
Sbjct: 562  CNTILRGYLSSGDYVKAEKIYDLMCQKKYDIEPSLMEKLDYVLSLSRKVVKKPLSLKLSK 621

Query: 425  EQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKLA 246
            +QRE LVGLLLGGL++E D                  H++L+RH++D++ EWL  + KL+
Sbjct: 622  DQREILVGLLLGGLRVESDDNRKKHMIRFEFNENSSTHAILRRHLYDKYHEWLHPSCKLS 681

Query: 245  DDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTSTG 66
            D  D    +FSTISHS+F+FYAEQF  +G+P+IPKLIHRWLSP+VLA+WYMY GHRTS+G
Sbjct: 682  DGSDGASYRFSTISHSYFSFYAEQFWPKGQPMIPKLIHRWLSPQVLAFWYMYAGHRTSSG 741

Query: 65   DILLKLKASKEGVQRIVKTLK 3
            DILLKLK S+EGV+++ KTLK
Sbjct: 742  DILLKLKGSREGVEKVFKTLK 762


>ref|XP_007042349.1| Endonucleases isoform 2 [Theobroma cacao] gi|508706284|gb|EOX98180.1|
            Endonucleases isoform 2 [Theobroma cacao]
          Length = 621

 Score =  431 bits (1108), Expect = e-118
 Identities = 211/322 (65%), Positives = 258/322 (80%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            VFR M++ +GSASV AY+KIIEVLCK+++ +LAESLM EF+ SG KPLMPS+I L +MY 
Sbjct: 220  VFRQMQKYLGSASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYL 279

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            ++ LHDK+ESTF +CLE CRPNRT+YN+YL+SLV + NLEKA EIF +M+GN  IGVN R
Sbjct: 280  NMSLHDKLESTFLECLEKCRPNRTIYNIYLNSLVKVGNLEKAGEIFGQMHGNSTIGVNAR 339

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+    +KAEKIYDLMCQKK++IES L+EKLDY+LSL  + VKKP+S+KLS
Sbjct: 340  SCNTILGGYLSSGDFLKAEKIYDLMCQKKYEIESLLIEKLDYVLSLSRKEVKKPVSLKLS 399

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQR+ LVGLLLGGL+I+ D                  HS+LKRHIHDQ+ EWL  + K 
Sbjct: 400  KEQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKP 459

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D  DDIP +FSTISHS+F FYA+QF  +G+PVIPKLIHRWLSP VLAYWYMYGG++TS 
Sbjct: 460  TDGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSY 519

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+EGV+++VKTLK
Sbjct: 520  GDILLKLKGSREGVEKVVKTLK 541


>ref|XP_007042348.1| Pentatricopeptide repeat-containing protein isoform 1 [Theobroma
            cacao] gi|508706283|gb|EOX98179.1| Pentatricopeptide
            repeat-containing protein isoform 1 [Theobroma cacao]
          Length = 823

 Score =  431 bits (1108), Expect = e-118
 Identities = 211/322 (65%), Positives = 258/322 (80%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            VFR M++ +GSASV AY+KIIEVLCK+++ +LAESLM EF+ SG KPLMPS+I L +MY 
Sbjct: 422  VFRQMQKYLGSASVAAYHKIIEVLCKSQQMDLAESLMKEFMESGKKPLMPSYIELTDMYL 481

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            ++ LHDK+ESTF +CLE CRPNRT+YN+YL+SLV + NLEKA EIF +M+GN  IGVN R
Sbjct: 482  NMSLHDKLESTFLECLEKCRPNRTIYNIYLNSLVKVGNLEKAGEIFGQMHGNSTIGVNAR 541

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+    +KAEKIYDLMCQKK++IES L+EKLDY+LSL  + VKKP+S+KLS
Sbjct: 542  SCNTILGGYLSSGDFLKAEKIYDLMCQKKYEIESLLIEKLDYVLSLSRKEVKKPVSLKLS 601

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQR+ LVGLLLGGL+I+ D                  HS+LKRHIHDQ+ EWL  + K 
Sbjct: 602  KEQRQILVGLLLGGLKIDSDGERKNHMIRFEFNQNSVTHSILKRHIHDQYHEWLHPSSKP 661

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D  DDIP +FSTISHS+F FYA+QF  +G+PVIPKLIHRWLSP VLAYWYMYGG++TS 
Sbjct: 662  TDGNDDIPHKFSTISHSYFGFYADQFWPRGQPVIPKLIHRWLSPLVLAYWYMYGGYKTSY 721

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+EGV+++VKTLK
Sbjct: 722  GDILLKLKGSREGVEKVVKTLK 743


>ref|XP_008236630.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Prunus mume]
          Length = 830

 Score =  420 bits (1079), Expect = e-115
 Identities = 202/322 (62%), Positives = 255/322 (79%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+EQ+GSA+  AY+K+IEVLCKA++ ELAESLMT+FIN GLK  MPS+I LM MY 
Sbjct: 426  IFREMQEQLGSANAVAYHKVIEVLCKAQEVELAESLMTDFINIGLKTFMPSYIDLMNMYF 485

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L  HDK+ES FF+CLE CRP+RT+Y++YLDSLV + NL+KAEEIF++M  N A G+N R
Sbjct: 486  NLGSHDKLESAFFQCLERCRPSRTIYSIYLDSLVKVGNLDKAEEIFDQMQRNGATGINAR 545

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+    VKAEKI+DLMCQKK+D++S LMEK+DY+LSL  +VVK+P+S+KLS
Sbjct: 546  SCNTILSGYLSSGDYVKAEKIFDLMCQKKYDVDSPLMEKIDYVLSLSRKVVKRPVSLKLS 605

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVG+LLGGLQIE D                  HS+L+RH++DQ+ EWL  + K 
Sbjct: 606  KEQREVLVGMLLGGLQIESDEDRKNHMIRFEFSENSSTHSLLRRHMYDQYHEWLHPSCKT 665

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
            ++  DDIP +FSTISHS   FYA+QF  +GR VIPKLIHRWLSP  LAYWYMYGGHR+S+
Sbjct: 666  SESTDDIPYKFSTISHSCLGFYADQFWPKGRQVIPKLIHRWLSPCALAYWYMYGGHRSSS 725

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLK+K ++EGV++IV+ LK
Sbjct: 726  GDILLKIKGNEEGVEKIVRALK 747


>ref|XP_012072457.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Jatropha curcas] gi|643730809|gb|KDP38241.1|
            hypothetical protein JCGZ_04884 [Jatropha curcas]
          Length = 846

 Score =  419 bits (1076), Expect = e-114
 Identities = 203/322 (63%), Positives = 251/322 (77%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR MKE++GS SVT Y+KIIEVLC+A++ +L+ESLM EFI SG+KPLMPSF  LM +Y 
Sbjct: 441  IFREMKERLGSVSVTGYHKIIEVLCRAQEMDLSESLMQEFIESGMKPLMPSFSELMNLYL 500

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L+LHDK+ES F  CL+ CRPNRT+YNMYLDSLV + NL+KAEEIF  +   E +GV  R
Sbjct: 501  NLNLHDKLESVFSACLKKCRPNRTIYNMYLDSLVKVGNLDKAEEIFTHICSGEGVGVTGR 560

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCN IL  YL+  ++VKAE +Y+LMCQKK+DIE SLM+KLDY+LSL  + VKKP+S+K+S
Sbjct: 561  SCNIILSAYLSSGEHVKAENVYNLMCQKKYDIEPSLMQKLDYVLSLSRKEVKKPVSLKMS 620

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            K QRE LVGLLLGGLQIE D                 VHSVL+RH++D++ EWL  + KL
Sbjct: 621  KNQREILVGLLLGGLQIESDEERKRHMIRFEFNENSSVHSVLRRHLYDEYHEWLHPSCKL 680

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D  DDI  +FSTISHS+F FYA+QF  +GR +IPKLIHRWLSP+VLAYWYMYGGHRTS+
Sbjct: 681  NDGSDDISYRFSTISHSYFGFYADQFWPKGRAIIPKLIHRWLSPQVLAYWYMYGGHRTSS 740

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+EGV ++VK  K
Sbjct: 741  GDILLKLKGSREGVAKVVKAFK 762


>ref|XP_011088297.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            isoform X2 [Sesamum indicum]
            gi|747081997|ref|XP_011088298.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g15820
            isoform X2 [Sesamum indicum]
            gi|747081999|ref|XP_011088299.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g15820
            isoform X2 [Sesamum indicum]
          Length = 711

 Score =  416 bits (1068), Expect = e-113
 Identities = 209/326 (64%), Positives = 256/326 (78%), Gaps = 4/326 (1%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E++ S++V +YYK++EVLCKA++ ELAESLM EFINSG+KPLMPSFI +M MY+
Sbjct: 304  IFRVMQERI-SSNVVSYYKVVEVLCKAQEIELAESLMIEFINSGMKPLMPSFIAMMNMYA 362

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDKVESTFF CLE C PNR VYN+YL+SLV   NL KAE+IFN+M+ +EAIGVNTR
Sbjct: 363  NLSLHDKVESTFFWCLETCCPNRNVYNLYLNSLVQTENLAKAEQIFNQMFTDEAIGVNTR 422

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTILRGYLAC Q  KA++IY+ MC+KK++ ESSLMEKL+ +LSL  E V+K IS+KLS
Sbjct: 423  SCNTILRGYLACGQYAKAKQIYNFMCEKKYETESSLMEKLESILSLSQEEVQKSISLKLS 482

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGL++++D                  HS LKRHI++QF EWLA  DKL
Sbjct: 483  KEQREILIGLLLGGLRVKIDEEKKSYAVHFVFRENSNTHSFLKRHIYNQFHEWLA--DKL 540

Query: 248  -ADDED---DIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGH 81
              DD D   DIPC+F TISHS+F FYA+QF  QG P IP LIHRWL+PR+LAYWYMYGG+
Sbjct: 541  PVDDNDRGNDIPCEFMTISHSYFKFYADQFWPQGIPSIPNLIHRWLTPRILAYWYMYGGY 600

Query: 80   RTSTGDILLKLKASKEGVQRIVKTLK 3
            RTS+ DILLKL++ KE V RI K  K
Sbjct: 601  RTSSRDILLKLQSRKEDVPRIAKAFK 626


>ref|XP_011088296.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            isoform X1 [Sesamum indicum]
          Length = 832

 Score =  416 bits (1068), Expect = e-113
 Identities = 209/326 (64%), Positives = 256/326 (78%), Gaps = 4/326 (1%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E++ S++V +YYK++EVLCKA++ ELAESLM EFINSG+KPLMPSFI +M MY+
Sbjct: 425  IFRVMQERI-SSNVVSYYKVVEVLCKAQEIELAESLMIEFINSGMKPLMPSFIAMMNMYA 483

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDKVESTFF CLE C PNR VYN+YL+SLV   NL KAE+IFN+M+ +EAIGVNTR
Sbjct: 484  NLSLHDKVESTFFWCLETCCPNRNVYNLYLNSLVQTENLAKAEQIFNQMFTDEAIGVNTR 543

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTILRGYLAC Q  KA++IY+ MC+KK++ ESSLMEKL+ +LSL  E V+K IS+KLS
Sbjct: 544  SCNTILRGYLACGQYAKAKQIYNFMCEKKYETESSLMEKLESILSLSQEEVQKSISLKLS 603

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGL++++D                  HS LKRHI++QF EWLA  DKL
Sbjct: 604  KEQREILIGLLLGGLRVKIDEEKKSYAVHFVFRENSNTHSFLKRHIYNQFHEWLA--DKL 661

Query: 248  -ADDED---DIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGH 81
              DD D   DIPC+F TISHS+F FYA+QF  QG P IP LIHRWL+PR+LAYWYMYGG+
Sbjct: 662  PVDDNDRGNDIPCEFMTISHSYFKFYADQFWPQGIPSIPNLIHRWLTPRILAYWYMYGGY 721

Query: 80   RTSTGDILLKLKASKEGVQRIVKTLK 3
            RTS+ DILLKL++ KE V RI K  K
Sbjct: 722  RTSSRDILLKLQSRKEDVPRIAKAFK 747


>ref|XP_010258255.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Nelumbo nucifera]
          Length = 838

 Score =  411 bits (1056), Expect = e-112
 Identities = 202/323 (62%), Positives = 262/323 (81%), Gaps = 1/323 (0%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FRGM+E + S SV AY+KIIEV+ KA++ E+AE++M+EF++SGLKPLMPSFI LM MY 
Sbjct: 437  IFRGMQECLESTSVVAYHKIIEVMSKAQEMEIAETIMSEFLDSGLKPLMPSFIDLMSMYF 496

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L+LH K+ESTF +C+E C PNRT+YN+YLDSLV   +++KAE+IF++M  +  IGVN+R
Sbjct: 497  NLNLHGKLESTFSQCIEKCHPNRTIYNIYLDSLVKSGHIDKAEDIFSKMNNDGTIGVNSR 556

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL G+L+    +KAEKIYDLMCQKK+DI+SSLMEKL+Y+LSL+ +V+KKPIS+KL+
Sbjct: 557  SCNTILAGHLSSGDYIKAEKIYDLMCQKKYDIDSSLMEKLEYILSLKRKVIKKPISLKLT 616

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVGLLLGGL IE D                 VHSVL+RHIHDQ+ EWL  +  +
Sbjct: 617  KEQREILVGLLLGGLCIETDEERRNHAIHFEFNENSDVHSVLRRHIHDQYHEWLN-SPGM 675

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             +DE+++P +FSTI HS+F FYA+QF  +G+PVIPKLIHRWLSPRVLAYWYM+GG R ++
Sbjct: 676  PNDEENLPFRFSTIRHSYFGFYADQFWPKGQPVIPKLIHRWLSPRVLAYWYMHGGQRMAS 735

Query: 68   GDILLKLK-ASKEGVQRIVKTLK 3
            GDILLKLK A++E V+R+VKTLK
Sbjct: 736  GDILLKLKSATREDVERVVKTLK 758


>ref|XP_009379614.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Pyrus x bretschneideri]
          Length = 830

 Score =  410 bits (1054), Expect = e-112
 Identities = 201/322 (62%), Positives = 250/322 (77%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+EQ+GS++  AY+K+IEVLCKA++ E  ESLMTEFIN+GLK  MPS+I LM MY 
Sbjct: 428  IFREMQEQLGSSNAVAYHKVIEVLCKAQEVERVESLMTEFINTGLKNFMPSYIDLMNMYF 487

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+ES F +CLE CRP+RT+ ++YLDSLV + NL +AEEIF  M  + AIGVN R
Sbjct: 488  NLGLHDKLESAFSQCLERCRPSRTICSIYLDSLVKVGNLARAEEIFGLMQSDAAIGVNAR 547

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+    VKAE I+DLMCQKK+D+E  LME +DY+LSL  +VVKKP+S+KLS
Sbjct: 548  SCNTILSGYLSSGDYVKAENIFDLMCQKKYDVEPPLMENIDYILSLSRKVVKKPVSLKLS 607

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVGLLLGGLQI+ D                  HS+LKRHI DQ+ EWL  + K 
Sbjct: 608  KEQREILVGLLLGGLQIDSDEDWKNHMIRFEFSENSSTHSLLKRHIFDQYHEWLHPSCKS 667

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
            ++  +DIP +FSTISHS+F FYAEQF  +GR +IPKLIHRWLSP  LAYWYMYGGH+TS+
Sbjct: 668  SESTEDIPYKFSTISHSYFAFYAEQFWPKGRRMIPKLIHRWLSPCALAYWYMYGGHKTSS 727

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GD+LLK+K S+EGV++IVK+LK
Sbjct: 728  GDVLLKIKGSEEGVEKIVKSLK 749


>ref|XP_010999833.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Populus euphratica]
          Length = 819

 Score =  407 bits (1046), Expect = e-111
 Identities = 205/323 (63%), Positives = 247/323 (76%), Gaps = 2/323 (0%)
 Frame = -3

Query: 965  FRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYSD 786
            FR M+E + S +V  Y+KIIEVLCKA + ELAESLM E + SG+KPL PSFI +M+MY +
Sbjct: 418  FREMQEVLSSYNVVPYHKIIEVLCKAGEVELAESLMQELVQSGMKPLTPSFISIMDMYFN 477

Query: 785  LDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTRS 606
            L+LHDK+ES F  CLE CRPNR VY +YLDSLV + N +KAEEIFN M  NEAIGVN RS
Sbjct: 478  LNLHDKLESAFSACLEKCRPNRIVYMIYLDSLVKVGNFDKAEEIFNHMRNNEAIGVNARS 537

Query: 605  CNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLSK 426
            CNTILR YL+   +VKAE+IYDLMCQKK+DI+SSLMEKLD +LS   +V ++PI +KLSK
Sbjct: 538  CNTILREYLSSGYHVKAERIYDLMCQKKYDIDSSLMEKLDSVLSSSRKVARRPIRLKLSK 597

Query: 425  EQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWL--ACTDK 252
            EQRE LVGL LGGLQIE D                 +HS+L+RH+HDQ+ EWL  +C   
Sbjct: 598  EQREILVGLFLGGLQIESDGKKHMIQFEFNQNSI--MHSILRRHLHDQYHEWLHPSCKPS 655

Query: 251  LADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTS 72
               D DDIP +F TISHS F+FYAEQF  +G+P +PKLIHRW+SP+VLAYWYMYGGHRTS
Sbjct: 656  DDSDSDDIPWRFCTISHSCFDFYAEQFWPRGQPQLPKLIHRWMSPQVLAYWYMYGGHRTS 715

Query: 71   TGDILLKLKASKEGVQRIVKTLK 3
            +GDI+LKLK S +GV R+VKTLK
Sbjct: 716  SGDIVLKLKGSVKGVGRVVKTLK 738


>ref|XP_002313087.2| hypothetical protein POPTR_0009s11000g [Populus trichocarpa]
            gi|550331483|gb|EEE87042.2| hypothetical protein
            POPTR_0009s11000g [Populus trichocarpa]
          Length = 622

 Score =  407 bits (1046), Expect = e-111
 Identities = 206/323 (63%), Positives = 250/323 (77%), Gaps = 2/323 (0%)
 Frame = -3

Query: 965  FRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYSD 786
            FR M+E + S +V  Y+KIIEVLCKA + ELAESLM E + SG+KPL PSFI +M+MY +
Sbjct: 221  FREMQEVLSSYNVAPYHKIIEVLCKAEEVELAESLMQELVQSGMKPLTPSFISIMDMYLN 280

Query: 785  LDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTRS 606
            L+LHDK+ES F  CLE CRPNR+VY +YLDSLV + N +KAEEIFN M  NEAIGVN RS
Sbjct: 281  LNLHDKLESAFSACLEKCRPNRSVYMIYLDSLVKVGNFDKAEEIFNHMRNNEAIGVNARS 340

Query: 605  CNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLSK 426
            CNTILR YL+   +VKAE+IYDLMCQKK+DI+SSLMEKLD +LS   +V ++ IS+KLSK
Sbjct: 341  CNTILREYLSSGYHVKAERIYDLMCQKKYDIDSSLMEKLDSVLSSSRKVARRRISLKLSK 400

Query: 425  EQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKLA 246
            EQRE LVGL LGGLQIE D                 +HS+L+RH+HDQ+ EWL  + K +
Sbjct: 401  EQREILVGLFLGGLQIESDGKKHMIQFEFNQNSI--MHSILRRHLHDQYHEWLHPSFKPS 458

Query: 245  D--DEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTS 72
            D  D DDIP +F TISHS F+FYAEQF  +G+P +PKLIHRW+SP+VLAYWYMYGGHRTS
Sbjct: 459  DDSDSDDIPWRFCTISHSCFDFYAEQFWPRGQPQLPKLIHRWMSPQVLAYWYMYGGHRTS 518

Query: 71   TGDILLKLKASKEGVQRIVKTLK 3
            +GDI+LKLK S +GV R+VKTLK
Sbjct: 519  SGDIVLKLKGSVKGVGRVVKTLK 541


>ref|XP_012450253.1| PREDICTED: pentatricopeptide repeat-containing protein At2g15820
            [Gossypium raimondii] gi|763744783|gb|KJB12222.1|
            hypothetical protein B456_002G007000 [Gossypium
            raimondii]
          Length = 835

 Score =  402 bits (1034), Expect = e-109
 Identities = 201/322 (62%), Positives = 244/322 (75%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            VFR MK+ +G  S+ +Y+KIIEVLC++ + +LAES M E I SG+KPLMPS+I L + Y 
Sbjct: 432  VFREMKKCLGYTSIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYL 491

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
             L+ HDK+ESTF +CLE CRPNRT+Y++YL SLV + NL KAEEIFN M  N  IGVN +
Sbjct: 492  RLNYHDKLESTFLECLEKCRPNRTIYSIYLSSLVKVGNLGKAEEIFNHMGENVTIGVNAK 551

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+   N KAEKIYDLMCQKKF+IES LMEKL+ +L    + VKKP+S+KLS
Sbjct: 552  SCNTILYGYLSSGDNSKAEKIYDLMCQKKFEIESPLMEKLESVLRSSRKEVKKPVSLKLS 611

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGL+I+ D                  HS+LKRHIHDQ+ EWL  + KL
Sbjct: 612  KEQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKL 671

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
                 DIP +F+TISHS+F FYA+QF  +G+PVIPKLIHRWLSP VLAYWYMYGG+RTS 
Sbjct: 672  TAGNGDIPHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSA 731

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S EGV+++VKTLK
Sbjct: 732  GDILLKLKGSSEGVKKVVKTLK 753


>ref|XP_010040589.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At2g15820-like [Eucalyptus grandis]
          Length = 821

 Score =  401 bits (1031), Expect = e-109
 Identities = 195/322 (60%), Positives = 251/322 (77%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E++GSA+V AY+KIIEV+CKA+  ELAESLM EF NSGLKPL PSFI +M MY 
Sbjct: 432  LFREMQERLGSATVAAYHKIIEVICKAQDVELAESLMKEFKNSGLKPLGPSFIDMMNMYF 491

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+ES F +C+E C+PNR +Y +YLDSLV I ++ KAEEIF+EM+ + AIG++ R
Sbjct: 492  NLGLHDKLESAFSQCVEKCQPNRVIYGIYLDSLVRIGDISKAEEIFSEMHSSGAIGISGR 551

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            +CN+IL GYL+    VKAEK+Y L C+KK++IE + MEKLD +LSLR + VKKP+S+KL+
Sbjct: 552  NCNSILGGYLSAGDYVKAEKVYHLKCEKKYEIERASMEKLDPILSLRGKAVKKPVSLKLT 611

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVG+LLGGLQI+ D                G+HS LKRHI++ + EWL  + KL
Sbjct: 612  KEQREILVGMLLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKL 671

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D+ ++IP  FSTI HS+F FYA+QF  +G+PVIPKLIHRWLSP  LAYWYMYGG+R S+
Sbjct: 672  DDNSNEIPNSFSTICHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSS 731

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKL+ S+EGV+R+VK LK
Sbjct: 732  GDILLKLRGSQEGVERVVKALK 753


>gb|KCW63832.1| hypothetical protein EUGRSUZ_G01504 [Eucalyptus grandis]
          Length = 708

 Score =  401 bits (1031), Expect = e-109
 Identities = 196/322 (60%), Positives = 249/322 (77%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E +GSA+V AY+KIIEV+CKA+  ELAESLM EF +SGLKPL PSFI +M MY 
Sbjct: 319  LFREMQEHLGSATVAAYHKIIEVICKAQDVELAESLMKEFKDSGLKPLGPSFIDMMNMYF 378

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
             L LHDK+ES F +C+E C+PNR +Y +YLDSLV I ++ KAEEIF+EM+ + AIG+  R
Sbjct: 379  KLGLHDKLESAFSQCVEKCQPNRVIYGIYLDSLVRIGDISKAEEIFSEMHSSGAIGIGGR 438

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            +CN+IL GYL+    VKAEK+Y LMCQKK++IE + MEKLD +LSLR + VKKP+S+KL+
Sbjct: 439  NCNSILGGYLSAGDYVKAEKVYHLMCQKKYEIEPASMEKLDPILSLRGKAVKKPVSLKLT 498

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVG+LLGGLQI+ D                G+HS LKRHI++ + EWL  + KL
Sbjct: 499  KEQREILVGMLLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKL 558

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D+ ++IP  FSTI HS+F FYA+QF  +G+PVIPKLIHRWLSP  LAYWYMYGG+R S+
Sbjct: 559  DDNSNEIPNSFSTIRHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSS 618

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKL+ S+EGV+R+VK LK
Sbjct: 619  GDILLKLRGSQEGVERVVKALK 640


>gb|KCW45077.1| hypothetical protein EUGRSUZ_L01320 [Eucalyptus grandis]
          Length = 806

 Score =  401 bits (1031), Expect = e-109
 Identities = 195/322 (60%), Positives = 251/322 (77%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E++GSA+V AY+KIIEV+CKA+  ELAESLM EF NSGLKPL PSFI +M MY 
Sbjct: 417  LFREMQERLGSATVAAYHKIIEVICKAQDVELAESLMKEFKNSGLKPLGPSFIDMMNMYF 476

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+ES F +C+E C+PNR +Y +YLDSLV I ++ KAEEIF+EM+ + AIG++ R
Sbjct: 477  NLGLHDKLESAFSQCVEKCQPNRVIYGIYLDSLVRIGDISKAEEIFSEMHSSGAIGISGR 536

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            +CN+IL GYL+    VKAEK+Y L C+KK++IE + MEKLD +LSLR + VKKP+S+KL+
Sbjct: 537  NCNSILGGYLSAGDYVKAEKVYHLKCEKKYEIERASMEKLDPILSLRGKAVKKPVSLKLT 596

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE LVG+LLGGLQI+ D                G+HS LKRHI++ + EWL  + KL
Sbjct: 597  KEQREILVGMLLGGLQIDSDEQRKNHMIKFKFNENSGMHSALKRHIYEHYHEWLHPSCKL 656

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D+ ++IP  FSTI HS+F FYA+QF  +G+PVIPKLIHRWLSP  LAYWYMYGG+R S+
Sbjct: 657  DDNSNEIPNSFSTICHSYFGFYADQFWPRGKPVIPKLIHRWLSPCALAYWYMYGGYRMSS 716

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKL+ S+EGV+R+VK LK
Sbjct: 717  GDILLKLRGSQEGVERVVKALK 738


>emb|CDO97140.1| unnamed protein product [Coffea canephora]
          Length = 467

 Score =  400 bits (1029), Expect = e-109
 Identities = 198/322 (61%), Positives = 241/322 (74%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            VFR M+E +GSAS+ AY+KI+EVL KAR TEL ESLM EFINSGLKPL PSFI LM MYS
Sbjct: 67   VFRSMQELLGSASIMAYHKIVEVLSKARNTELVESLMVEFINSGLKPLRPSFIHLMVMYS 126

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L LHDK+ES F +CLE C+PNRT+YN+YLDSLV + +LE+AEEIF++MYGN  +GVN R
Sbjct: 127  NLGLHDKLESAFIQCLEKCQPNRTMYNIYLDSLVQVGSLERAEEIFSQMYGNATVGVNAR 186

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNT+L+GYL+C   VK EK++DLM    ++IE +L +KLDY LSL  EVV+ P  +KL+
Sbjct: 187  SCNTMLKGYLSCGDYVKIEKMFDLMYCNNYEIEPALKKKLDYTLSLSREVVRTPPKLKLN 246

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
              QRE LVG+LLGGL++                   G+HSVLKRHIHD++ EWL C + +
Sbjct: 247  DVQREILVGMLLGGLRMASGQDNRKFAITFEFNEQSGIHSVLKRHIHDEYHEWLDCGNPV 306

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
             D  D  P  F+TISHS F FYAEQF    +P IPKLIHRWLSPRVLAYWYMYGGHRT  
Sbjct: 307  -DGVDGTPFHFTTISHSCFTFYAEQFWPNSQPAIPKLIHRWLSPRVLAYWYMYGGHRTPG 365

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S+E + R++K LK
Sbjct: 366  GDILLKLKGSQESIARVLKALK 387


>gb|KHG30621.1| hypothetical protein F383_13349 [Gossypium arboreum]
          Length = 836

 Score =  400 bits (1028), Expect = e-109
 Identities = 201/322 (62%), Positives = 243/322 (75%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            VFR MK+ +G  S+ +Y+KIIEVLC++ + +LAES M E I SG+KPLMPS+I L + Y 
Sbjct: 433  VFREMKKCLGYTSIASYHKIIEVLCESEQMDLAESFMKELIESGMKPLMPSYIKLTDTYL 492

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
             L+ HDK+ESTF +CLE CRPNRT+YN+YL SLV + NL KAEEIFN M  N  IGVN +
Sbjct: 493  RLNCHDKLESTFLECLEKCRPNRTIYNIYLSSLVKVGNLGKAEEIFNHMGENVTIGVNAK 552

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCNTIL GYL+   N KAEKIYDLMCQKKF+IES LMEKL+ +L    + VKKP+S+KLS
Sbjct: 553  SCNTILCGYLSSGDNSKAEKIYDLMCQKKFEIESPLMEKLENVLRSSRKEVKKPLSLKLS 612

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
            KEQRE L+GLLLGGL+I+ D                  HS+LKRHIHDQ+ EWL  + KL
Sbjct: 613  KEQREILMGLLLGGLRIDSDEERKNHMIRFEFNPSSIPHSILKRHIHDQYHEWLHPSSKL 672

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
                 DI  +F+TISHS+F FYA+QF  +G+PVIPKLIHRWLSP VLAYWYMYGG+RTS 
Sbjct: 673  TAGNGDILHKFNTISHSYFGFYADQFWPKGQPVIPKLIHRWLSPIVLAYWYMYGGYRTSA 732

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S EGV+++VKTLK
Sbjct: 733  GDILLKLKGSSEGVEKVVKTLK 754


>gb|KDO49164.1| hypothetical protein CISIN_1g003913mg [Citrus sinensis]
          Length = 787

 Score =  400 bits (1028), Expect = e-109
 Identities = 194/322 (60%), Positives = 246/322 (76%)
 Frame = -3

Query: 968  VFRGMKEQVGSASVTAYYKIIEVLCKARKTELAESLMTEFINSGLKPLMPSFIGLMEMYS 789
            +FR M+E++GSASV AY+KIIE+LCKA +TEL ESLM EF+ +G+KPLMPS+I L  MY 
Sbjct: 389  IFREMQERLGSASVPAYHKIIELLCKAEETELTESLMKEFVETGMKPLMPSYINLTNMYL 448

Query: 788  DLDLHDKVESTFFKCLENCRPNRTVYNMYLDSLVHISNLEKAEEIFNEMYGNEAIGVNTR 609
            +L +HD++   F +CLE CRPNRT+Y +YL+SL +  N+EKAEEIFN M+ ++ IGVNTR
Sbjct: 449  NLGMHDRLHLAFSECLEKCRPNRTIYGIYLESLKNAGNIEKAEEIFNHMHSDQTIGVNTR 508

Query: 608  SCNTILRGYLACEQNVKAEKIYDLMCQKKFDIESSLMEKLDYMLSLRWEVVKKPISMKLS 429
            SCN IL  YL+    VKAEKIYDLMC KK++IES+ MEKLDY+LSL  + VKKP+S+ LS
Sbjct: 509  SCNIILSAYLSSGDFVKAEKIYDLMCLKKYEIESAWMEKLDYVLSLNRKEVKKPVSLNLS 568

Query: 428  KEQRETLVGLLLGGLQIEVDXXXXXXXXXXXXXXXFGVHSVLKRHIHDQFREWLACTDKL 249
             EQRE L+GLLLGGL IE D                 +HSVL+R+++DQ+ EWL  + K+
Sbjct: 569  SEQRENLIGLLLGGLCIESDEKRKRHMIRFQFNENSRMHSVLRRYLYDQYHEWLHPSFKV 628

Query: 248  ADDEDDIPCQFSTISHSHFNFYAEQFRAQGRPVIPKLIHRWLSPRVLAYWYMYGGHRTST 69
            +D  DDIP ++STISH +F FYA++F  +GR VIPKLIHRWL+PR LAYW+MYGGHRTS 
Sbjct: 629  SDGNDDIPYKYSTISHPYFCFYADKFWPKGRLVIPKLIHRWLTPRALAYWFMYGGHRTSV 688

Query: 68   GDILLKLKASKEGVQRIVKTLK 3
            GDILLKLK S EG+  + KTLK
Sbjct: 689  GDILLKLKVSSEGIALVFKTLK 710


Top