BLASTX nr result

ID: Rehmannia29_contig00037805 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia29_contig00037805
         (761 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_020549796.1| uncharacterized protein LOC110012043 [Sesamu...   141   2e-34
ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977...   141   2e-34
ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966...   139   1e-33
ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949...   139   1e-33
ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964...   139   1e-33
ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949...   139   1e-33
ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967...   139   1e-33
ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974...   138   2e-33
ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964...   137   3e-33
ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972...   137   3e-33
ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969...   136   1e-32
ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969...   134   4e-32
ref|XP_012843177.1| PREDICTED: uncharacterized protein LOC105963...   134   5e-32
gb|PIM97453.1| hypothetical protein CDL12_30077 [Handroanthus im...   124   1e-28
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   108   6e-23
gb|KZV24446.1| hypothetical protein F511_25471 [Dorcoceras hygro...   103   1e-22
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   105   5e-22
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   105   8e-22
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   104   1e-21
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   103   2e-21

>ref|XP_020549796.1| uncharacterized protein LOC110012043 [Sesamum indicum]
          Length = 1116

 Score =  141 bits (355), Expect = 2e-34
 Identities = 74/196 (37%), Positives = 106/196 (54%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C C + H+ESL H+F+  + ++ +WEHFA+ FN  LP+T++I + L +W       
Sbjct: 559  LASKCSCYN-HVESLQHVFIEGNGIRCVWEHFAKKFNMHLPNTDNIVLLLNYWR--ISAL 615

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
              +HI  I+P LILWF WLERN  KH N  F+   I  +V  H+   ++S       WKG
Sbjct: 616  GQNHIRMIVPMLILWFGWLERNDVKHRNKNFNSERIKWKVHQHIVTTFKSKTTKRINWKG 675

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               +A             KI  V W KP    +K+N DGA+KG   +AG GG+ RD EG 
Sbjct: 676  DRFVAKSMGLELGSQYKPKIKIVKWTKPELGWIKINTDGASKGNPGRAGAGGIARDEEGA 735

Query: 217  ILWICYGFAEECDNSF 170
            ++   Y    E +N+F
Sbjct: 736  VILAFYEVLGETNNTF 751


>ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977287 [Erythranthe guttata]
          Length = 1237

 Score =  141 bits (355), Expect = 2e-34
 Identities = 71/202 (35%), Positives = 109/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 904  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 963

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 964  THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1023

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+K+L+Q+++L+A+ W G  H+A             ++   V+W  P P  VK+N DGA
Sbjct: 1024 QHIKILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLRPHRVVWLPPDPGWVKLNTDGA 1083

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1084 RRASTQIAAIGGIIRGSDAEAI 1105


>ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966658 [Erythranthe guttata]
          Length = 1233

 Score =  139 bits (349), Expect = 1e-33
 Identities = 70/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 900  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 959

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 960  THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1019

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1020 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1079

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1080 RRASTQIAAIGGIIRGSDAEAI 1101


>ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949732 [Erythranthe guttata]
          Length = 1237

 Score =  139 bits (349), Expect = 1e-33
 Identities = 70/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 904  SLASRCYCCPDSSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 963

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 964  THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1023

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1024 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1083

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1084 RRASTQIAAIGGIIRGSDAEAI 1105


>ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964855 [Erythranthe guttata]
          Length = 1237

 Score =  139 bits (349), Expect = 1e-33
 Identities = 70/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 904  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 963

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 964  THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1023

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1024 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1083

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1084 RRASTQIAAIGGIIRGSDAEAI 1105


>ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949758 [Erythranthe guttata]
          Length = 1245

 Score =  139 bits (349), Expect = 1e-33
 Identities = 70/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 912  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 971

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 972  THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1031

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1032 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1091

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1092 RRASTQIAAIGGIIRGSDAEAI 1113


>ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967783 [Erythranthe guttata]
          Length = 1298

 Score =  139 bits (349), Expect = 1e-33
 Identities = 70/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 965  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 1024

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 1025 THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1084

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1085 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1144

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1145 RRASTQIAAIGGIIRGSDAEAI 1166


>ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974867 [Erythranthe guttata]
          Length = 1393

 Score =  138 bits (348), Expect = 2e-33
 Identities = 71/202 (35%), Positives = 107/202 (52%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  IES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 1060 SLASRCYCCPDPSIPVSSLVSLSVESPSIESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 1119

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 1120 THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1179

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+ +L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1180 QHIRILHQTKLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1239

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1240 RRASTQIAAIGGIIRGSDAEAI 1261


>ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964144 [Erythranthe guttata]
          Length = 1237

 Score =  137 bits (346), Expect = 3e-33
 Identities = 69/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 904  SLASRCYCCPDPSIPVSSLVSQSVESFSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 963

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 964  THIPQILLYWQHFTSHALTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIINRVI 1023

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  +K+N DGA
Sbjct: 1024 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWMKLNTDGA 1083

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1084 RRASTQIAAIGGIIRGSDAEAI 1105


>ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972756 [Erythranthe guttata]
          Length = 1285

 Score =  137 bits (346), Expect = 3e-33
 Identities = 70/202 (34%), Positives = 107/202 (52%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 952  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 1011

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 1012 THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1071

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+ +L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1072 QHIRILHQTKLLSADSWTGIPHVAESLGLYYRVRTPTLTPYRVVWLPPDPGWVKLNTDGA 1131

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1132 RRASTQIAAIGGIIRGSDAEAI 1153


>ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969825 [Erythranthe guttata]
          Length = 1331

 Score =  136 bits (342), Expect = 1e-32
 Identities = 69/202 (34%), Positives = 108/202 (53%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 998  SLASRCYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 1057

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 1058 THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1117

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N +GA
Sbjct: 1118 QHIRILHQTNLLSADSWTGIPHMAESLGLYYRVGTPTLTPHRVVWLPPDPGWVKLNTNGA 1177

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1178 RRASTQIAAIGGIIRGSDAEAI 1199


>ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969824 [Erythranthe guttata]
          Length = 1805

 Score =  134 bits (338), Expect = 4e-32
 Identities = 69/202 (34%), Positives = 107/202 (52%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+ YCC                S  +ES+ H+F+ +   K++W HF  +F  T  HT
Sbjct: 1472 SLASRFYCCPDPSIPVSSLVSQSVESPSVESIDHIFVESPTAKRVWHHFFYLFGYTPAHT 1531

Query: 628  ESIHIFLQFWSNFTP--FTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILW+ W+ RN SKH+++      II +V 
Sbjct: 1532 THIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKHKDITVRASSIIYRVI 1591

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 1592 QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 1651

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  +   +
Sbjct: 1652 RRASTQIAAIGGIIRGSDAEAI 1673


>ref|XP_012843177.1| PREDICTED: uncharacterized protein LOC105963331 [Erythranthe guttata]
          Length = 1172

 Score =  134 bits (337), Expect = 5e-32
 Identities = 68/202 (33%), Positives = 106/202 (52%), Gaps = 19/202 (9%)
 Frame = -2

Query: 760  SLTSQCYCC----------------SCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHT 629
            SL S+CYCC                S  +E + H+F+ +   K++W HF  +F  T  HT
Sbjct: 738  SLASRCYCCPDPSIPVSSLVSQSVGSPFVELIDHIFVESPTAKRVWHHFFYLFGYTPAHT 797

Query: 628  ESIHIFLQFWSNFT--PFTHHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVE 455
              I   L +W +FT    THH+HIT I+PCLILWF W+ RN   H+++      II +V 
Sbjct: 798  THIPQILLYWQHFTLHTLTHHTHITTIVPCLILWFLWIARNDRNHKDIMVRASSIIYRVI 857

Query: 454  AHLKLLYQSHMLNANVWKGFLHIASGY-XXXXXXXXGIKISSVLWHKPPPHLVKVNVDGA 278
             H+++L+Q+++L+A+ W G  H+A             +    V+W  P P  VK+N DGA
Sbjct: 858  QHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVWLPPDPGWVKLNTDGA 917

Query: 277  TKGLINQAGLGGVLRDHEGNIL 212
             +     A +GG++R  + + +
Sbjct: 918  RRASTQIAAIGGIIRGSDADAI 939


>gb|PIM97453.1| hypothetical protein CDL12_30077 [Handroanthus impetiginosus]
          Length = 983

 Score =  124 bits (312), Expect = 1e-28
 Identities = 66/180 (36%), Positives = 91/180 (50%)
 Frame = -2

Query: 760  SLTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPF 581
            SL S+CYCC+  IES+ HLF+ ++    IW HF+  FN     T S+   L  W    PF
Sbjct: 673  SLASKCYCCNS-IESVSHLFVTSNFAHDIWGHFSEFFNIPQLSTGSLVAILSSWKYSMPF 731

Query: 580  THHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWK 401
              H HI  ++P LI W  W  RN +K+  + FS   II +V  H++ +  ++ L    W+
Sbjct: 732  VTHGHIRQVIPILICWHIWEARNDAKYRYICFSARRIIFKVRQHIQHIILTNKLTFRHWR 791

Query: 400  GFLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEG 221
            G   +A             K  +V W KP P   K+N DGA K    +AG GG+L DH G
Sbjct: 792  GDTAVAQALGQHIPLPPPRKSMAVWWSKPKPGEWKLNTDGAAKRSTCRAGAGGILHDHTG 851


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  108 bits (269), Expect = 6e-23
 Identities = 62/190 (32%), Positives = 97/190 (51%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC    ESL H+   +   +++W +F++ F   + + ++I   L  W     FT
Sbjct: 1029 LASKCLCCKSE-ESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFT 1087

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
               HI  ++   I WF W+ERN +KH ++      II ++   L+ L+Q  +L    WKG
Sbjct: 1088 KPGHIRTLILLFIFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKG 1147

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
             L IA  +          +   + W KP    +K+NVDG++K     A  GGVLRDH GN
Sbjct: 1148 DLDIAIHWGFNFAQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGN 1207

Query: 217  ILWICYGFAE 188
            ++   +GF+E
Sbjct: 1208 LI---FGFSE 1214


>gb|KZV24446.1| hypothetical protein F511_25471 [Dorcoceras hygrometricum]
          Length = 297

 Score =  103 bits (258), Expect = 1e-22
 Identities = 61/193 (31%), Positives = 92/193 (47%), Gaps = 2/193 (1%)
 Frame = -2

Query: 757 LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
           L S+C CC  H ESL HLF       ++WEHF R+F       ++ HI    W     + 
Sbjct: 26  LASKCQCCD-HEESLEHLFFSGSVAIRVWEHFGRIFGVQ----QASHI--SNWRISNSWR 78

Query: 577 HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
              HI   +P LILWF W+ RN SKH  +      II ++  ++   + S ++    W+G
Sbjct: 79  SRGHIRECMPFLILWFIWIGRNDSKHRGIMIRPAAIIRKIRYYITTAFTSGLMKHEHWQG 138

Query: 397 FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDG--ATKGLINQAGLGGVLRDHE 224
              +A  +         I +S++ W KPP    K+N DG  +  G+I+    GG++R   
Sbjct: 139 LQSLARNFDVVLRGFKRITVSTISWIKPPAPFYKLNSDGCRSNNGMIS---TGGLIRYTN 195

Query: 223 GNILWICYGFAEE 185
           G +L   +GF  E
Sbjct: 196 GLVLTAFHGFLGE 208


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  105 bits (262), Expect = 5e-22
 Identities = 59/190 (31%), Positives = 90/190 (47%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC+   ESL H+   N   K++W  FA++F   + +   +   +  W     + 
Sbjct: 826  LASKCVCCNSE-ESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQIIWAWYVSGDYV 884

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
               H   +LP  I WF WLERN +KH +       +I +   H + LY   +L    WKG
Sbjct: 885  RKGHFRVLLPLFICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKG 944

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               IA+                + W KP     K+NVDG+++  ++ A  GGVLRDH G 
Sbjct: 945  DTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLH-AATGGVLRDHTGK 1003

Query: 217  ILWICYGFAE 188
            ++   +GF+E
Sbjct: 1004 LI---FGFSE 1010


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  105 bits (261), Expect = 8e-22
 Identities = 58/190 (30%), Positives = 96/190 (50%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC    ESL H+   N    ++W +FA++F   + +  +I+  +  W     ++
Sbjct: 3196 LASRCRCCKSE-ESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISAWFYSGDYS 3254

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
               HI  ++P  ILWF W+ERN +KH N+      I+ ++   +  L+Q   L    W+G
Sbjct: 3255 KPGHIRTLVPLFILWFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQG 3314

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               IA  +              + W+KP     K+NVDG++K  +  A  GG+LRDH G+
Sbjct: 3315 DKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGS 3374

Query: 217  ILWICYGFAE 188
            ++   +GF+E
Sbjct: 3375 MI---FGFSE 3381



 Score =  103 bits (256), Expect = 3e-21
 Identities = 62/190 (32%), Positives = 91/190 (47%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC    ESL H+   N   K++W  FA+ F   +   + I   +  W     +T
Sbjct: 1402 LASKCVCCRSE-ESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYT 1460

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
             + HI  ++P  I WF WLERN +KH ++      +I ++   L  L+   +L    WKG
Sbjct: 1461 RNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKG 1520

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               IA+ +              + W KP     K+NVDG++K   N AG GGVLRDH G 
Sbjct: 1521 DTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGK 1579

Query: 217  ILWICYGFAE 188
               + + F+E
Sbjct: 1580 ---LAFAFSE 1586


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  104 bits (260), Expect = 1e-21
 Identities = 59/190 (31%), Positives = 90/190 (47%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC+   ESL H+   N   K++W  FA++F   + +   +   +  W     + 
Sbjct: 1822 LASKCVCCNSE-ESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYV 1880

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
               H   +LP  I WF WLERN +KH +       +I +   H + LY   +L    WKG
Sbjct: 1881 RKGHFRVLLPLFICWFLWLERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKG 1940

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               IA+                + W KP     K+NVDG+++  ++ A  GGVLRDH G 
Sbjct: 1941 DTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYKLNVDGSSRNGLH-AATGGVLRDHTGK 1999

Query: 217  ILWICYGFAE 188
            ++   +GF+E
Sbjct: 2000 LI---FGFSE 2006


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  103 bits (258), Expect = 2e-21
 Identities = 62/190 (32%), Positives = 90/190 (47%)
 Frame = -2

Query: 757  LTSQCYCCSCHIESLPHLFLLNDQVKKIWEHFARMFNCTLPHTESIHIFLQFWSNFTPFT 578
            L S+C CC    ESL H+   N    ++W  FA+ F   +     I   +  W     +T
Sbjct: 1645 LASKCVCCRSE-ESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYT 1703

Query: 577  HHSHITFILPCLILWFTWLERNKSKHENVKFSYWHIICQVEAHLKLLYQSHMLNANVWKG 398
             + HI  ++P  I WF WLERN +KH ++      +I ++   L  LY   +L    WKG
Sbjct: 1704 RNGHIRILIPLFICWFLWLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKG 1763

Query: 397  FLHIASGYXXXXXXXXGIKISSVLWHKPPPHLVKVNVDGATKGLINQAGLGGVLRDHEGN 218
               IA+ +              + W KP     K+NVDG++K  +N AG GGVLRDH G 
Sbjct: 1764 DTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG-GGVLRDHTGK 1822

Query: 217  ILWICYGFAE 188
               + + F+E
Sbjct: 1823 ---LAFAFSE 1829


Top