BLASTX nr result

ID: Rehmannia23_contig00003820 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00003820
         (955 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   129   2e-27
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   126   1e-26
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   125   2e-26
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   125   2e-26
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   125   2e-26
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   125   2e-26
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   122   2e-25
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   119   2e-24
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   119   2e-24
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   119   2e-24
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   118   3e-24
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   116   1e-23
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   112   3e-22
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   110   6e-22
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   109   2e-21
gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]   106   1e-20
gb|EOX93822.1| Uncharacterized protein TCM_002766 [Theobroma cacao]   106   1e-20
gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]          105   2e-20
gb|EOY32248.1| Uncharacterized protein TCM_039895 [Theobroma cacao]   104   4e-20
gb|EOX99990.1| Uncharacterized protein TCM_009188 [Theobroma cacao]   104   6e-20

>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  129 bits (323), Expect = 2e-27
 Identities = 78/229 (34%), Positives = 113/229 (49%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH+    + +IW++ + L  L +G  L    W G            PP+    P+
Sbjct: 1725 NDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQ 1784

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   + KLN DGS     L A GGG++RDH G+L  AF  +   + S  AEL A
Sbjct: 1785 IIYWIKPFIGEYKLNVDGS-SKSNLNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHA 1843

Query: 363  LEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +  ++  +WIE+DA   V  +     GS  +++ L  IR  LR   Y ISHI
Sbjct: 1844 LLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHI 1903

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
            +REGN+ AD L+  G          E  A   L+ ++++D+L  P  RF
Sbjct: 1904 YREGNQAADFLSNKGQTHQSLCVFSE--AQGELIGILKLDKLNLPYVRF 1950


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  126 bits (317), Expect = 1e-26
 Identities = 76/229 (33%), Positives = 111/229 (48%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  K+RH       I+W++ + L  L  G  L    W G        Q +   ++   P+
Sbjct: 1989 NDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQ 2048

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            +V WR P   + KLN DGS   G   A  GG++RDH G+L+  F  +    +S  AEL A
Sbjct: 2049 IVYWRKPSTGEYKLNVDGSSRHGQH-AASGGVLRDHTGKLIFGFSENIGTCNSLQAELRA 2107

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +   ++ +WIE+DA A +  +     GS  +++ L  IR  L  + Y ISHI
Sbjct: 2108 LLRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHI 2167

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
            HREGN+ AD L+  G      H   E     H   ++++D+L  P  RF
Sbjct: 2168 HREGNQVADFLSNEGHNHQNLHVFTEAQGKLH--GMLKLDRLNLPYVRF 2214


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  125 bits (315), Expect = 2e-26
 Identities = 72/229 (31%), Positives = 111/229 (48%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHR++    + I+W++ + +  L  G++L    W G                P  P+
Sbjct: 3276 NDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPK 3335

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   + KLN DGS       A GGGL+RDH G ++  F  +F +  S  AEL+A
Sbjct: 3336 LLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMA 3395

Query: 363  LEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL      ++ ++WIE+DA   V  I  G  GS + ++ L  I   L  + + ISHI
Sbjct: 3396 LHRGLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHI 3455

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ ADHL+  G        + +  A   L  ++R+D++     RF
Sbjct: 3456 FREGNQAADHLSNQGYTHQNLQVISQ--AEGQLRGILRLDKINLAYVRF 3502



 Score =  118 bits (296), Expect = 3e-24
 Identities = 69/195 (35%), Positives = 100/195 (51%), Gaps = 1/195 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH+    + +IW++ + L  L +G  L    W G            PP+  + P+
Sbjct: 1482 NDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQ 1541

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   + KLN DGS    +  A GGG++RDH G+L  AF  +   + S  AEL A
Sbjct: 1542 IISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHA 1600

Query: 363  LEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +  ++  +WIE+DA   V  +     GS  +++ L  IR  LR   Y ISHI
Sbjct: 1601 LLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHI 1660

Query: 540  HREGNRPADHLAELG 584
            +REGN+ AD L+  G
Sbjct: 1661 YREGNQAADFLSNKG 1675


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  125 bits (314), Expect = 2e-26
 Identities = 76/229 (33%), Positives = 113/229 (49%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH       ++W++ + L  L+ G  L    W G          +   +    P+
Sbjct: 653  NDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQ 712

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ WR P   + KLN DGS   G L A  GG++RDH G+L+  F  +    +S  AEL A
Sbjct: 713  IIYWRKPFTGEYKLNVDGSSRNGHL-AASGGILRDHTGKLIFGFSENIGLCNSLQAELRA 771

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +   ++ +WIE+DA AV+  I     GS  +++ L  IR  L  + Y ISHI
Sbjct: 772  LLRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHI 831

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ AD+LA  G +      + E     H   ++++D+L  P  RF
Sbjct: 832  FREGNQAADYLANEGHSHQNLCVITEAQGELH--GMLKLDRLNLPYVRF 878


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  125 bits (314), Expect = 2e-26
 Identities = 71/229 (31%), Positives = 111/229 (48%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHR++    + ++W++ + L  L  G++L    W G                P  P+
Sbjct: 1988 NDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPK 2047

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   ++KLN DGS       A GGGL+RDH G ++  F  +F    S  AEL+A
Sbjct: 2048 LLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMA 2107

Query: 363  LEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL      ++ ++WIE+DA   V  I  G  GS + ++ L  I   L  + + ISHI
Sbjct: 2108 LHRGLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHI 2167

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ ADHL+  G        + +  A   L  ++R++++     RF
Sbjct: 2168 FREGNQAADHLSNQGHTHQNLQVISQ--AEGQLRGILRLEKINLAYVRF 2214


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  125 bits (314), Expect = 2e-26
 Identities = 76/229 (33%), Positives = 109/229 (47%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH       +IW+  ++   L  G  L    W G       L  S PPQ    P+
Sbjct: 906  NDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFPPQQHASPQ 965

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W+ P   + KLN DGS   G L A  GG++RDH G+L+  F  +    +S  AEL A
Sbjct: 966  IIYWKKPSIGEYKLNVDGSSRNG-LHAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRA 1024

Query: 363  LEMGLRHAR-RFSLQIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   + R   ++WIE+DA A +  I     G + +++ L  IR  L    Y +SH 
Sbjct: 1025 LLRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHT 1084

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ AD+L+  G          E     H   ++++D+L  P  RF
Sbjct: 1085 FREGNKAADYLSNEGHKHQNLCVFTEAQGQLH--GMLKLDRLNLPYVRF 1131


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  122 bits (307), Expect = 2e-25
 Identities = 72/206 (34%), Positives = 105/206 (50%), Gaps = 1/206 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHR +      IIW++ + L  L  G  L    W G          +   +   RP+
Sbjct: 1109 NDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPK 1168

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   ++KLN DGS       A GGG++RDH G L+  F  +F   +S  AELLA
Sbjct: 1169 IINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLA 1228

Query: 363  LEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL     +++ ++WIE+DA  V+  I +   GS+++Q+ L  IR  L+ +   ISHI
Sbjct: 1229 LHRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHI 1288

Query: 540  HREGNRPADHLAELGAAALGSHFVDE 617
            HREGN+ AD L++ G      H   E
Sbjct: 1289 HREGNQAADFLSKHGHTHQNLHVFTE 1314


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  119 bits (298), Expect = 2e-24
 Identities = 71/229 (31%), Positives = 109/229 (47%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH+   +  ++W++ + L  L  G  L    W G          + P ++   P+
Sbjct: 788  NDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQ 847

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P   + KLN DGS       A  GGL+RDH G L+  F  +    +S  AEL A
Sbjct: 848  IIHWVKPVTGEYKLNVDGS-SRHNQSAATGGLLRDHTGTLVFGFSENIGPSNSLQAELRA 906

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +  +++ +WIE+DA  V+  I     GS  +++ L  IR  L    + ISHI
Sbjct: 907  LLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFFSFRISHI 966

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ AD L+  G        + E     H   ++++D+L  P  +F
Sbjct: 967  FREGNQAADFLSNKGHTHQNLQVISEAQGKLH--GMLKLDRLNLPYVKF 1013


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  119 bits (298), Expect = 2e-24
 Identities = 76/234 (32%), Positives = 116/234 (49%), Gaps = 6/234 (2%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPT-----VGFLQGSAPPQV 167
            N  KHR++    + I+W++ + +  L  G++L    W G         + F   S PP  
Sbjct: 2023 NDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLPP-- 2080

Query: 168  PRRPRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFH 347
               P++  W  P   + KLN DGS       A GGG++RDH G ++  F  +    +S  
Sbjct: 2081 ---PKVFPWHKPSIGEFKLNVDGSAKLSQN-AAGGGVLRDHAGVMVFGFSENLGIQNSLQ 2136

Query: 348  AELLALEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQY 524
            AELLAL  GL   R ++++ +WIE+DAA+V+  +     G   +++ L+ IR +L    +
Sbjct: 2137 AELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSF 2196

Query: 525  SISHIHREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             +SHI REGN+ AD LA  G        V    A   L  ++R+DQ   P  RF
Sbjct: 2197 RLSHIFREGNQAADFLANRGHEHQSLQVVT--VAQGKLRGMLRLDQTSLPYVRF 2248


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  119 bits (297), Expect = 2e-24
 Identities = 75/228 (32%), Positives = 109/228 (47%), Gaps = 1/228 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHR+       ++W++ + L  L  G  L    W G        + +   ++   P+
Sbjct: 701  NDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQ 760

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            +V WR P   + KLN DGS   G   A  GG++RDH G+L+  F  +    +S  AEL A
Sbjct: 761  IVYWRKPSTGEYKLNVDGSSRHGQH-AASGGVLRDHTGKLIFGFSENIGNCNSLQAELRA 819

Query: 363  LEMGLRHAR-RFSLQIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   + R   Q+WIE+DA AV+  I     GS  +++ L  IR  L  + Y ISHI
Sbjct: 820  LLRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHI 879

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFR 683
             REGN+ AD L+  G          E     H   ++++D+L  P  R
Sbjct: 880  LREGNQVADFLSNEGHNHQNLRVFTEAQGKLH--GMLKLDRLNLPYVR 925


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  118 bits (296), Expect = 3e-24
 Identities = 72/229 (31%), Positives = 110/229 (48%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH+   +  ++W++ + L  L  G  L    W G           +PP+    P+
Sbjct: 1728 NDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQ 1787

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W  P P + KLN DGS       A  GG++RDH G L+  F  +    +S  AEL A
Sbjct: 1788 ILHWVKPVPGEHKLNVDGSSRQNQT-AAIGGVLRDHTGTLVFDFSENIGPSNSLQAELRA 1846

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +  +++ +W+E+DA   +  I     GS  +++ L  IR  L    + ISHI
Sbjct: 1847 LLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNFFSFRISHI 1906

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ AD L+  G      H   E  A   L  ++++D+L  P  R+
Sbjct: 1907 FREGNQAADFLSNKGHTHQSLHVFTE--AQGKLYGMLKLDRLNLPYVRY 1953


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  116 bits (291), Expect = 1e-23
 Identities = 73/229 (31%), Positives = 108/229 (47%), Gaps = 1/229 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH    A  +IW+  ++   L  G  L    W G       L  S   +    P+
Sbjct: 1902 NDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQ 1961

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            ++ W+ P   + KLN DGS   G L A  GG++RDH G+L+  F  +    +S  AEL A
Sbjct: 1962 IIYWKKPSIGEYKLNVDGSSRNG-LHAATGGVLRDHTGKLIFGFSENIGPCNSLQAELRA 2020

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +   ++ +WIE+DA   +  I     G + +++ L  IR  L    Y +SHI
Sbjct: 2021 LLRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHI 2080

Query: 540  HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
             REGN+ AD+L+  G          E     H   ++++D+L  P  RF
Sbjct: 2081 LREGNQAADYLSNEGHKHQNLCVFTEAQGQLH--GMLKLDRLNLPYVRF 2127


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  112 bits (279), Expect = 3e-22
 Identities = 71/235 (30%), Positives = 114/235 (48%), Gaps = 7/235 (2%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGC------FPTVGFLQGSAPPQ 164
            N  KHR++    + ++W+V + +  L  G++L    W G       +  +   +  APP+
Sbjct: 684  NDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPK 743

Query: 165  VPRRPRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSF 344
            V        W  P   + KLN DGS       A GGG++RDH G ++  F  +    +S 
Sbjct: 744  V------FSWHKPTTGEFKLNVDGSAKHSHN-AAGGGILRDHAGVMVFGFSENLGIQNSL 796

Query: 345  HAELLALEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQ 521
             AELLAL  GL   R ++++ +WIE+DA +V+  +     G   +++ ++ +R +L    
Sbjct: 797  QAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFS 856

Query: 522  YSISHIHREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
            +  SHI REGN+ AD LA  G             A   L  ++R+DQ  +P  RF
Sbjct: 857  FRFSHIFREGNQAADFLANRGHEHQNLQVFT--VAQGKLRGMLRLDQTSFPYVRF 909


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  110 bits (276), Expect = 6e-22
 Identities = 70/235 (29%), Positives = 113/235 (48%), Gaps = 7/235 (2%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGC------FPTVGFLQGSAPPQ 164
            N  KHR++    + ++W+V + +  L  G++L    W G       +  +   +  APP+
Sbjct: 2025 NDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPK 2084

Query: 165  VPRRPRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSF 344
            V        W  P   + KLN DGS       A GGG++RDH G ++  F  +    +S 
Sbjct: 2085 V------FSWHKPSLGEFKLNVDGSAKQSHN-AAGGGILRDHAGEMVFGFSENLGTQNSL 2137

Query: 345  HAELLALEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQ 521
             AELLAL  GL   R ++++ +WIE+DA +V+  +     G   +++ ++ +R +L    
Sbjct: 2138 QAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFS 2197

Query: 522  YSISHIHREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFRF 686
            +  SHI REGN+ AD LA  G             A   L  ++ +DQ  +P  RF
Sbjct: 2198 FRFSHIFREGNQAADFLANRGHEHQNLQVFT--VAQGKLRGMLCLDQTSFPYVRF 2250


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  109 bits (272), Expect = 2e-21
 Identities = 65/195 (33%), Positives = 94/195 (48%), Gaps = 1/195 (0%)
 Frame = +3

Query: 3    NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
            N  KHRH       ++W++   L  L     L    W G        + +   +    P+
Sbjct: 905  NDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGDTDIAAMWRYNFQLKQRAPPQ 964

Query: 183  MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
            +V WR P   + KLN DGS   G   A  GG++RDH  +L+  F  +    +S  AEL A
Sbjct: 965  IVYWRKPFTGEYKLNVDGSSRNGQH-AASGGVLRDHTSKLIFCFSENIGTYNSLQAELRA 1023

Query: 363  LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
            L  GL   +   ++ +WIE+DA AV+  I     GS  +++ L  I+  L  + Y ISHI
Sbjct: 1024 LHRGLLLCKERHIEKLWIEMDALAVIQLIPHSQKGSHDIRYLLESIKKCLNSISYRISHI 1083

Query: 540  HREGNRPADHLAELG 584
             REGN+ AD L+  G
Sbjct: 1084 FREGNQAADFLSNEG 1098


>gb|EOY19103.1| Uncharacterized protein TCM_043836 [Theobroma cacao]
          Length = 228

 Score =  106 bits (265), Expect = 1e-20
 Identities = 58/144 (40%), Positives = 81/144 (56%), Gaps = 1/144 (0%)
 Frame = +3

Query: 156 PPQVPRRPRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAV 335
           P +V   P+++ W  P   + KLN DGS       AGGGGL+RDH   L+  F  +  A 
Sbjct: 38  PRKVISLPKVISWHKPSTGEFKLNVDGSSINNFQNAGGGGLLRDHTSTLVFVFSENLGAK 97

Query: 336 SSFHAELLALEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLR 512
           +S  AELLAL  GL   +  ++ ++WIE+DA  V+  +  G +GS   ++    IR  L+
Sbjct: 98  NSLQAELLALHRGLLLCQENNISRLWIEMDAMIVIQMLKEGHIGSHDSRYLWASIRQQLK 157

Query: 513 DLQYSISHIHREGNRPADHLAELG 584
              + ISHIHREGN+ AD LA  G
Sbjct: 158 LFSFRISHIHREGNQAADWLANRG 181


>gb|EOX93822.1| Uncharacterized protein TCM_002766 [Theobroma cacao]
          Length = 241

 Score =  106 bits (265), Expect = 1e-20
 Identities = 68/191 (35%), Positives = 93/191 (48%), Gaps = 1/191 (0%)
 Frame = +3

Query: 36  ASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPRMVIWRAPQPPQ 215
           A  IIWQ + +      G       W   F      Q ++PP     P++  W  P   +
Sbjct: 58  AWEIIWQKDLFKRWQWRGDLQIAQAWGLMF------QRASPPS----PKIFSWHKPLTGE 107

Query: 216 IKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLALEMGLRHARRF 395
            KLN D S       A G GL+RDH G ++  F  +F+   S  AEL+AL  GL     +
Sbjct: 108 FKLNVDDSSKHNCQNAAGSGLLRDHTGIVIFGFSKNFRLYISLQAELMALHRGLLLCIEY 167

Query: 396 SL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHIHREGNRPADHL 572
           ++ +IWIE++A  VV  I  G  GS Q ++ L  IR  L  + Y ISHIHREGN+  DHL
Sbjct: 168 NVSRIWIEMNAKVVVQMIHEGNKGSSQTRYLLASIRKCLNAISYCISHIHREGNQVVDHL 227

Query: 573 AELGAAALGSH 605
           +  G +    H
Sbjct: 228 SNQGHSDKNLH 238


>gb|EOY26529.1| Ribonuclease H-like protein [Theobroma cacao]
          Length = 458

 Score =  105 bits (263), Expect = 2e-20
 Identities = 68/225 (30%), Positives = 102/225 (45%), Gaps = 1/225 (0%)
 Frame = +3

Query: 3   NSVKHRHIPFLASHIIWQVERYLWILVSGRRLSPSDWSGCFPTVGFLQGSAPPQVPRRPR 182
           N  KHRH+      ++W+  + L  L  G  L    W              PP+    P+
Sbjct: 235 NDAKHRHLGMYPDRVVWETMKLLRQLHDGSPLKQWQWKVDKDIAAMWSFLFPPKHGTTPQ 294

Query: 183 MVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAELLA 362
           ++ W  P   + KLN DGS       A  GGL+RDH+G+L+  F  +    +S  AEL A
Sbjct: 295 IIHWVKPFTGEYKLNVDGS-SRNCQSATSGGLLRDHIGKLVFGFSENIGRCNSLQAELRA 353

Query: 363 LEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSISHI 539
           L   L   +   ++ +WIE+DA  V+  I     GS  +++ L  IR  L  + Y I HI
Sbjct: 354 LLRRLLLCKEQHIERLWIEMDALVVIQMIHQYQKGSHDIRYLLTSIRKGLSSISYRILHI 413

Query: 540 HREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYP 674
            REGN+ A  L+  G        + E     H   ++++D+L  P
Sbjct: 414 FREGNQAAYFLSNQGYTHQNLCLITEAQGELH--GMLKLDRLNLP 456


>gb|EOY32248.1| Uncharacterized protein TCM_039895 [Theobroma cacao]
          Length = 206

 Score =  104 bits (260), Expect = 4e-20
 Identities = 56/137 (40%), Positives = 76/137 (55%), Gaps = 1/137 (0%)
 Frame = +3

Query: 177 PRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAEL 356
           P+++ W  P   + KLN DGS       A GGGL+RDH G L+  F  +F   +   A+L
Sbjct: 40  PKLISWHKPLIGEFKLNADGSSKDAFQNAAGGGLLRDHTGNLIFGFSENFGPANLLQAKL 99

Query: 357 LALEMGLRHARRFSLQ-IWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSIS 533
           +AL  GL     +++  IWIE+DA  VV  I  G  GS+Q ++ L  IR  L    +  S
Sbjct: 100 MALHRGLFLCIEYNISSIWIEMDAKIVVQMIHEGHQGSYQTRYLLAFIRKCLSGFTFRFS 159

Query: 534 HIHREGNRPADHLAELG 584
           HIHREGN+ AD+L   G
Sbjct: 160 HIHREGNQAADYLFNQG 176


>gb|EOX99990.1| Uncharacterized protein TCM_009188 [Theobroma cacao]
          Length = 260

 Score =  104 bits (259), Expect = 6e-20
 Identities = 60/170 (35%), Positives = 91/170 (53%), Gaps = 1/170 (0%)
 Frame = +3

Query: 177 PRMVIWRAPQPPQIKLNTDGSFDPGTLIAGGGGLIRDHLGRLMLAFHSSFQAVSSFHAEL 356
           P+++ W  P   + KLN DGS       AGGGGL+RDH G L+ AF  + +A +S  AEL
Sbjct: 91  PKIISWHKPSIGEFKLNVDGSSINNFQNAGGGGLLRDHTGTLVFAFSENLEAKNSLQAEL 150

Query: 357 LALEMGLRHARRFSL-QIWIELDAAAVVTTITSGGLGSWQVQHTLIRIRNMLRDLQYSIS 533
           LAL  GL   +  ++ ++W E++A  V+  +  G +GS   ++    IR  L+   + IS
Sbjct: 151 LALHSGLLLCQENNISRLWTEMEAMIVIQMLKEGHIGSHDSRYLWASIRQQLKLFSFKIS 210

Query: 534 HIHREGNRPADHLAELGAAALGSHFVDEGTASHHLLALIRMDQLGYPSFR 683
           HIHR+GN+  + LA  G    G   + E  A   L  ++ +D+   P  R
Sbjct: 211 HIHRKGNQATNWLANHGHQHHGLQVLKE--AQGKLRGILTLDKSNLPYVR 258


Top