BLASTX nr result

ID: Forsythia21_contig00042192 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00042192
         (876 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobrom...    89   3e-15
ref|XP_007010390.1| Retrotransposon, unclassified-like protein [...    88   6e-15
ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom...    86   2e-14
ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobrom...    86   3e-14
ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobrom...    86   4e-14
ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom...    86   4e-14
ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom...    85   5e-14
ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom...    85   5e-14
ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobrom...    85   7e-14
ref|XP_007014629.1| Uncharacterized protein TCM_039895 [Theobrom...    84   1e-13
ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobrom...    84   1e-13
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...    61   2e-13
ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom...    84   2e-13
ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobrom...    83   2e-13
ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobrom...    83   3e-13
ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom...    83   3e-13
ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobrom...    81   1e-12
ref|XP_007022459.1| RNase H family protein [Theobroma cacao] gi|...    80   1e-12
ref|XP_007019605.1| Uncharacterized protein TCM_035716 [Theobrom...    80   2e-12
ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobrom...    80   2e-12

>ref|XP_007026457.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
            gi|508715062|gb|EOY06959.1| Uncharacterized protein
            TCM_021521 [Theobroma cacao]
          Length = 1951

 Score = 89.4 bits (220), Expect = 3e-15
 Identities = 58/164 (35%), Positives = 83/164 (50%), Gaps = 15/164 (9%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + W KP     KLN+D S KS+   A+GGGV RDH G + FA  E  G   SL+AE  A+
Sbjct: 1786 IYWIKPFIGEYKLNVDGSSKSNLN-AAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHAL 1844

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             G  D+++ L+ I    +  +   SHI+
Sbjct: 1845 LRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIY 1904

Query: 750  REGNGLADWLANTG--HREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN  AD+L+N G  H+ +      S+ QGE++G+L+LDK  L
Sbjct: 1905 REGNQAADFLSNKGQTHQSL---CVFSEAQGELIGILKLDKLNL 1945



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 27/83 (32%), Positives = 41/83 (49%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC+ EE++ HVL    VA +VW                   +   W ++G     G
Sbjct: 1647 SKCVCCRSEESLIHVLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNG 1706

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + + + PL   W+LW+ERN++KH
Sbjct: 1707 HIRILIPLFICWFLWLERNDAKH 1729


>ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
            gi|508727303|gb|EOY19200.1| Retrotransposon,
            unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score = 88.2 bits (217), Expect = 6e-15
 Identities = 50/135 (37%), Positives = 72/135 (53%), Gaps = 13/135 (9%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + W KP+   +KLN+D S K   + A+GGGV RDH GN++F   E +G  +SL+AE LA+
Sbjct: 1170 INWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLAL 1229

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
            +             VW EVD             G + +Q+ L+ I    + ++V  SHI 
Sbjct: 1230 HRGLCLCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIH 1289

Query: 750  REGNGLADWLANTGH 794
            REGN  AD+L+  GH
Sbjct: 1290 REGNQAADFLSKHGH 1304



 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 32/83 (38%), Positives = 45/83 (54%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CCK EE++ HVL    VAQ+VW                 + +   W Y+G    PG
Sbjct: 1031 SKCLCCKSEESLLHVLWESPVAQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPG 1090

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + +T+  L  FW++WVERN++KH
Sbjct: 1091 HIRTLILLFIFWFVWVERNDAKH 1113


>ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
            gi|508725617|gb|EOY17514.1| Uncharacterized protein
            TCM_042330 [Theobroma cacao]
          Length = 2249

 Score = 86.3 bits (212), Expect = 2e-14
 Identities = 59/160 (36%), Positives = 80/160 (50%), Gaps = 13/160 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP     KLN+D S K S   A+GGGV RDH G ++F   E  G  +SL+AE LA+Y 
Sbjct: 2086 WHKPSIGEFKLNVDGSAKLSQN-AAGGGVLRDHAGVMVFGFSENLGIQNSLQAELLALYR 2144

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             GP  +++ L  I       +   SHI+RE
Sbjct: 2145 GLILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFRE 2204

Query: 756  GNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            GN  AD+LAN GH E   +   +  QG++ G+LRLD+  L
Sbjct: 2205 GNQAADFLANRGH-EHQSLQVVTVAQGKLRGMLRLDQTSL 2243



 Score = 72.0 bits (175), Expect = 5e-10
 Identities = 31/83 (37%), Positives = 45/83 (54%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            S+C+CCK EE+I HV+    VA +VW                   +   W Y+G    PG
Sbjct: 1945 SRCRCCKSEESIMHVMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWFYSGDYCKPG 2004

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + +T+ P+ T W+LWVERN++KH
Sbjct: 2005 HIRTLVPIFTLWFLWVERNDAKH 2027


>ref|XP_007040950.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
            gi|508778195|gb|EOY25451.1| Uncharacterized protein
            TCM_016759 [Theobroma cacao]
          Length = 879

 Score = 85.9 bits (211), Expect = 3e-14
 Identities = 54/162 (33%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + WRKP     KLN+D S ++   LA+ GG+ RDH G ++F   E  G  +SL+AE  A+
Sbjct: 714  IYWRKPFTGEYKLNVDGSSRNGH-LAASGGILRDHTGKLIFGFSENIGLCNSLQAELRAL 772

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             G  D+++ L+ I      ++   SHI+
Sbjct: 773  LRGLLLCKERHIENLWIEMDALAVIQLIQHSQKGSHDIRYLLESIRKCLSCISYRISHIF 832

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN  AD+LAN GH     +   ++ QGE+ G+L+LD+  L
Sbjct: 833  REGNQAADYLANEGHSHQN-LCVITEAQGELHGMLKLDRLNL 873



 Score = 61.2 bits (147), Expect = 9e-07
 Identities = 26/83 (31%), Positives = 42/83 (50%)
 Frame = +1

Query: 58  SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
           SKC CC  EE++ HVL    VA++VW                   +   W ++G     G
Sbjct: 575 SKCVCCNSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKG 634

Query: 238 YSQTITPLVTFWYLWVERNNSKH 306
           + +++ P+   W+LW+ERN++KH
Sbjct: 635 HIRSLLPIFICWFLWLERNDAKH 657


>ref|XP_007022832.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
            gi|508778198|gb|EOY25454.1| Uncharacterized protein
            TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 85.5 bits (210), Expect = 4e-14
 Identities = 58/161 (36%), Positives = 82/161 (50%), Gaps = 14/161 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP     KLN+D S K S   A+GGGV RDH G ++F   E  G  +SL+AE LA+Y 
Sbjct: 2204 WHKPSNGEFKLNVDGSAKLSQN-AAGGGVLRDHAGVMIFGFSENLGIQNSLKAELLALYR 2262

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             GP  +++ L  I       +   +HI+RE
Sbjct: 2263 GLILCRDYNIRRLWIEMDATSVIRLLQGNHRGPHAIRYLLGSIRQLLSHFSFRLTHIFRE 2322

Query: 756  GNGLADWLANTGH-REII*VLPASKTQGEVVGLLRLDKWGL 875
            GN  AD+LAN GH  + + V+  +  QG++ G+LRLD+  L
Sbjct: 2323 GNQAADFLANRGHEHQSLQVITVA--QGKLRGMLRLDQTSL 2361


>ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
            gi|508715063|gb|EOY06960.1| Uncharacterized protein
            TCM_021522 [Theobroma cacao]
          Length = 3503

 Score = 85.5 bits (210), Expect = 4e-14
 Identities = 54/162 (33%), Positives = 83/162 (51%), Gaps = 15/162 (9%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP     KLN+D S K + + A+GGG+ RDH G+++F   E +G   SL+AE +A++ 
Sbjct: 3339 WNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHR 3398

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             G    ++ L  IH     ++   SHI+RE
Sbjct: 3399 GLLLCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFRE 3458

Query: 756  GNGLADWLANTG--HREII*VLPASKTQGEVVGLLRLDKWGL 875
            GN  AD L+N G  H+ +  +   S+ +G++ G+LRLDK  L
Sbjct: 3459 GNQAADHLSNQGYTHQNLQVI---SQAEGQLRGILRLDKINL 3497



 Score = 78.6 bits (192), Expect = 5e-12
 Identities = 52/140 (37%), Positives = 69/140 (49%), Gaps = 13/140 (9%)
 Frame = +3

Query: 411  SKQIITVQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLE 590
            S QII+  W KP     KLN+D S KSS   A+GGGV RDH G + FA  E  G   SL+
Sbjct: 1539 SPQIIS--WIKPFIGEYKLNVDGSSKSSQN-AAGGGVLRDHTGKLAFAFSENLGPLPSLQ 1595

Query: 591  AE*LAIY-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTV 731
            AE  A+              +W E+D             G  D+++ L+ I    +  + 
Sbjct: 1596 AELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSY 1655

Query: 732  YYSHIWREGNGLADWLANTG 791
              SHI+REGN  AD+L+N G
Sbjct: 1656 RISHIYREGNQAADFLSNKG 1675



 Score = 71.2 bits (173), Expect = 8e-10
 Identities = 30/83 (36%), Positives = 45/83 (54%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            S+C+CCK EE++ HV+    VA +VW+                  +   W Y+G    PG
Sbjct: 3198 SRCRCCKSEESLMHVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPG 3257

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + +T+ PL   W+LWVERN++KH
Sbjct: 3258 HIRTLVPLFILWFLWVERNDAKH 3280



 Score = 60.8 bits (146), Expect = 1e-06
 Identities = 27/83 (32%), Positives = 42/83 (50%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC+ EE++ HVL    VA++VW                   +   W ++G     G
Sbjct: 1404 SKCVCCRSEESLIHVLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNG 1463

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + + + PL   W+LW+ERN++KH
Sbjct: 1464 HIRILIPLFICWFLWLERNDAKH 1486


>ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
            gi|508787492|gb|EOY34748.1| Uncharacterized protein
            TCM_042328 [Theobroma cacao]
          Length = 910

 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 55/157 (35%), Positives = 80/157 (50%), Gaps = 13/157 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP     KLN+D S K S   A+GGG+ RDH G ++F   E  G  +SL+AE LA+Y 
Sbjct: 747  WHKPTTGEFKLNVDGSAKHSHN-AAGGGILRDHAGVMVFGFSENLGIQNSLQAELLALYR 805

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             GP  +++ +  +       +  +SHI+RE
Sbjct: 806  GLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFRE 865

Query: 756  GNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDK 866
            GN  AD+LAN GH E   +   +  QG++ G+LRLD+
Sbjct: 866  GNQAADFLANRGH-EHQNLQVFTVAQGKLRGMLRLDQ 901



 Score = 68.9 bits (167), Expect = 4e-09
 Identities = 30/83 (36%), Positives = 44/83 (53%)
 Frame = +1

Query: 58  SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
           S+C+CCK EE+I HV+    VA +VW                   +   W ++G    PG
Sbjct: 606 SRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGAWFHSGDYCKPG 665

Query: 238 YSQTITPLVTFWYLWVERNNSKH 306
           + +T+ PL   W+LWVERN++KH
Sbjct: 666 HIRTLVPLFILWFLWVERNDAKH 688


>ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
            gi|508725616|gb|EOY17513.1| Uncharacterized protein
            TCM_036737 [Theobroma cacao]
          Length = 2215

 Score = 85.1 bits (209), Expect = 5e-14
 Identities = 54/161 (33%), Positives = 85/161 (52%), Gaps = 14/161 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP    +KLN+D S K + + A+GGG+ RDH G+++F   E +G   SL+AE +A++ 
Sbjct: 2051 WLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHR 2110

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             G    ++ L  IH     ++   SHI+RE
Sbjct: 2111 GLLLCIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFRE 2170

Query: 756  GNGLADWLANTGH-REII*VLPASKTQGEVVGLLRLDKWGL 875
            GN  AD L+N GH  + + V+  S+ +G++ G+LRL+K  L
Sbjct: 2171 GNQAADHLSNQGHTHQNLQVI--SQAEGQLRGILRLEKINL 2209



 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 31/83 (37%), Positives = 46/83 (55%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            S+C+CCK EE++ HV+    VA +VW+                  +   W Y+G    PG
Sbjct: 1910 SRCRCCKSEESLMHVMWKNPVANQVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPG 1969

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + +T+ PL T W+LWVERN++KH
Sbjct: 1970 HIRTLVPLFTLWFLWVERNDAKH 1992


>ref|XP_007040952.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
            gi|508704887|gb|EOX96783.1| Uncharacterized protein
            TCM_005954 [Theobroma cacao]
          Length = 1134

 Score = 84.7 bits (208), Expect = 7e-14
 Identities = 51/162 (31%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + W+KP     KLN+D S ++    A+GG V RDH G ++F   E  G  +SL+AE  A+
Sbjct: 967  IYWKKPSIGEYKLNVDGSSRNGLHAATGG-VLRDHTGKLIFGFSENIGPCNSLQAELRAL 1025

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             GP+D+++ L+ I       +   SH +
Sbjct: 1026 LRGLLLCKERHIEKLWIEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLSSFSYRLSHTF 1085

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN  AD+L+N GH+    +   ++ QG++ G+L+LD+  L
Sbjct: 1086 REGNKAADYLSNEGHKHQN-LCVFTEAQGQLHGMLKLDRLNL 1126



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 27/83 (32%), Positives = 40/83 (48%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC  EE++ HVL    VA++VW                   +   W  +G     G
Sbjct: 828  SKCVCCNSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQIIWAWYVSGDYVRKG 887

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + + + PL   W+LW+ERN++KH
Sbjct: 888  HFRVLLPLFICWFLWLERNDAKH 910


>ref|XP_007014629.1| Uncharacterized protein TCM_039895 [Theobroma cacao]
           gi|508784992|gb|EOY32248.1| Uncharacterized protein
           TCM_039895 [Theobroma cacao]
          Length = 206

 Score = 84.0 bits (206), Expect = 1e-13
 Identities = 51/162 (31%), Positives = 80/162 (49%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429 VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
           + W KP+    KLN D S K + + A+GGG+ RDH GN++F   E +G  + L+A+ +A+
Sbjct: 43  ISWHKPLIGEFKLNADGSSKDAFQNAAGGGLLRDHTGNLIFGFSENFGPANLLQAKLMAL 102

Query: 609 Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
           +             +W E+D             G +  ++ L  I       T  +SHI 
Sbjct: 103 HRGLFLCIEYNISSIWIEMDAKIVVQMIHEGHQGSYQTRYLLAFIRKCLSGFTFRFSHIH 162

Query: 750 REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
           REGN  AD+L N GH     +   ++ +G++ G+LRL K  L
Sbjct: 163 REGNQAADYLFNQGHMHHN-LQVFAQAEGKLRGILRLGKLNL 203


>ref|XP_007046402.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
            gi|508710337|gb|EOY02234.1| Uncharacterized protein
            TCM_011921 [Theobroma cacao]
          Length = 926

 Score = 84.0 bits (206), Expect = 1e-13
 Identities = 54/162 (33%), Positives = 81/162 (50%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            V WRKP     KLN+D S +     ASGG V RDH G ++F   E  G+ +SL+AE  A+
Sbjct: 762  VYWRKPSTGEYKLNVDGSSRHGQHAASGG-VLRDHTGKLIFGFSENIGNCNSLQAELRAL 820

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             G  D+++ L+ I      ++   SHI 
Sbjct: 821  LRGLLLCKERHIEQLWIEMDALAVIQLIPHSQKGSHDIRYLLESIRKCLNSISYRISHIL 880

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN +AD+L+N GH     +   ++ QG++ G+L+LD+  L
Sbjct: 881  REGNQVADFLSNEGHNHQN-LRVFTEAQGKLHGMLKLDRLNL 921



 Score = 63.2 bits (152), Expect = 2e-07
 Identities = 28/83 (33%), Positives = 42/83 (50%)
 Frame = +1

Query: 58  SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
           SKC CC  EE++ HVL    VA++VW                   +   W Y+G     G
Sbjct: 623 SKCVCCNSEESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHILWAWFYSGDYVKRG 682

Query: 238 YSQTITPLVTFWYLWVERNNSKH 306
           + +T+ P+   W+LW+ERN++KH
Sbjct: 683 HIRTLLPIFICWFLWLERNDAKH 705


>ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            tuberosum]
          Length = 885

 Score = 61.2 bits (147), Expect(2) = 2e-13
 Identities = 50/179 (27%), Positives = 80/179 (44%), Gaps = 29/179 (16%)
 Frame = +3

Query: 336  KIVEHIIEVLRAL----------------ELIADSKQYTRMSKQIITVQWRKPMPWTVKL 467
            ++VE +IEV+R +                 +I    QY R    ++ V W+ P    VK 
Sbjct: 652  RMVEMVIEVVRKMVKSQFPWIKNMRWTWQAIIQRLNQYKRKI-HVLRVTWKPPDDHYVKS 710

Query: 468  NIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY------------ 611
            N D + + +  L+S G   RD KG++++A  +  G  +++EAE +AI             
Sbjct: 711  NTDGACRGNPGLSSFGFCIRDDKGDLIYAKAKGIGIATNMEAETVAILTALRECSNRKMQ 770

Query: 612  -VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWREGNGLADWLAN 785
             V  E D              PW +   +++I    +K+    +HI+REGN LAD LAN
Sbjct: 771  KVIIETDSLSLKKIIQQTWRVPWKIAEKVEEIREIMEKIKAKITHIFREGNSLADSLAN 829



 Score = 42.7 bits (99), Expect(2) = 2e-13
 Identities = 22/88 (25%), Positives = 36/88 (40%), Gaps = 2/88 (2%)
 Frame = +1

Query: 58  SKCQCC--KDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPS 231
           S+C CC  K EET+ H+  +  +  ++W                  ++  +W++      
Sbjct: 561 SRCWCCDRKKEETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQLIISWWKHEATPKL 620

Query: 232 PGYSQTITPLVTFWYLWVERNNSKHSGS 315
            G  + I P +  W LW  RN  KH  S
Sbjct: 621 QGIYKAI-PAIIMWTLWKRRNALKHDSS 647


>ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
            gi|508722459|gb|EOY14356.1| Uncharacterized protein
            TCM_033752 [Theobroma cacao]
          Length = 2251

 Score = 83.6 bits (205), Expect = 2e-13
 Identities = 54/157 (34%), Positives = 79/157 (50%), Gaps = 13/157 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W KP     KLN+D S K S   A+GGG+ RDH G ++F   E  G  +SL+AE LA+Y 
Sbjct: 2088 WHKPSLGEFKLNVDGSAKQSHN-AAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYR 2146

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             GP  +++ +  +       +  +SHI+RE
Sbjct: 2147 GLILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFRE 2206

Query: 756  GNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDK 866
            GN  AD+LAN GH E   +   +  QG++ G+L LD+
Sbjct: 2207 GNQAADFLANRGH-EHQNLQVFTVAQGKLRGMLCLDQ 2242



 Score = 70.9 bits (172), Expect = 1e-09
 Identities = 31/83 (37%), Positives = 44/83 (53%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            S+C+CCK EE+I HV+    VA +VW                   +   W Y+G    PG
Sbjct: 1947 SRCRCCKSEESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGAWFYSGDYCKPG 2006

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + +T+ PL   W+LWVERN++KH
Sbjct: 2007 HIRTLVPLFILWFLWVERNDAKH 2029


>ref|XP_007046404.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
            gi|508710339|gb|EOY02236.1| Uncharacterized protein
            TCM_011923 [Theobroma cacao]
          Length = 1954

 Score = 83.2 bits (204), Expect = 2e-13
 Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 14/172 (8%)
 Frame = +3

Query: 402  TRMSKQIITVQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFS 581
            TR + QI+   W KP+P   KLN+D S + + + A+ GGV RDH G ++F   E  G  +
Sbjct: 1782 TRAAPQIL--HWVKPVPGEHKLNVDGSSRQN-QTAAIGGVLRDHTGTLVFDFSENIGPSN 1838

Query: 582  SLEAE*LAIY-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKK 722
            SL+AE  A+              +W E+D             G  D+++ L  I  +   
Sbjct: 1839 SLQAELRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQSQKGSHDIRYLLASIRKYLNF 1898

Query: 723  MTVYYSHIWREGNGLADWLANTGH-REII*VLPASKTQGEVVGLLRLDKWGL 875
             +   SHI+REGN  AD+L+N GH  + + V   ++ QG++ G+L+LD+  L
Sbjct: 1899 FSFRISHIFREGNQAADFLSNKGHTHQSLHVF--TEAQGKLYGMLKLDRLNL 1948



 Score = 58.2 bits (139), Expect = 7e-06
 Identities = 26/83 (31%), Positives = 40/83 (48%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC  EE++ HVL    +A++VW                   +   W  +G     G
Sbjct: 1650 SKCICCNSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQILWTWYLSGDYVRKG 1709

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + + + PL   W+LW+ERN++KH
Sbjct: 1710 HIRILIPLFICWFLWLERNDAKH 1732


>ref|XP_007031313.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
            gi|508710342|gb|EOY02239.1| Uncharacterized protein
            TCM_016763 [Theobroma cacao]
          Length = 2127

 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 51/162 (31%), Positives = 82/162 (50%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + W+KP     KLN+D S ++    A+GG V RDH G ++F   E  G  +SL+AE  A+
Sbjct: 1963 IYWKKPSIGEYKLNVDGSSRNGLHAATGG-VLRDHTGKLIFGFSENIGPCNSLQAELRAL 2021

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             GP+++++ L+ I       +   SHI 
Sbjct: 2022 LRGLLLCKERHIEKLWIEMDALVAIQLIQPSKKGPYNLRYLLESIRMCLSSFSYRLSHIL 2081

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN  AD+L+N GH+    +   ++ QG++ G+L+LD+  L
Sbjct: 2082 REGNQAADYLSNEGHKHQN-LCVFTEAQGQLHGMLKLDRLNL 2122



 Score = 59.3 bits (142), Expect = 3e-06
 Identities = 27/83 (32%), Positives = 40/83 (48%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC  EE++ HVL    VA++VW                   +   W  +G     G
Sbjct: 1824 SKCVCCNSEESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKG 1883

Query: 238  YSQTITPLVTFWYLWVERNNSKH 306
            + + + PL   W+LW+ERN++KH
Sbjct: 1884 HFRVLLPLFICWFLWLERNDAKH 1906


>ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
            gi|508710341|gb|EOY02238.1| Uncharacterized protein
            TCM_016762 [Theobroma cacao]
          Length = 2214

 Score = 82.8 bits (203), Expect = 3e-13
 Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            V WRKP     KLN+D S +     ASGG V RDH G ++F   E  G  +SL+AE  A+
Sbjct: 2050 VYWRKPSTGEYKLNVDGSSRHGQHAASGG-VLRDHTGKLIFGFSENIGTCNSLQAELRAL 2108

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
                          +W E+D             G  D+++ L+ I      ++   SHI 
Sbjct: 2109 LRGLLLCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIH 2168

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            REGN +AD+L+N GH     +   ++ QG++ G+L+LD+  L
Sbjct: 2169 REGNQVADFLSNEGHNHQN-LHVFTEAQGKLHGMLKLDRLNL 2209



 Score = 62.0 bits (149), Expect = 5e-07
 Identities = 30/87 (34%), Positives = 44/87 (50%), Gaps = 2/87 (2%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC  EE++ HVL    VA++VW                   +   W Y+G     G
Sbjct: 1911 SKCVCCNSEESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRG 1970

Query: 238  YSQTITPLVTFWYLWVERNNSK--HSG 312
            + +T+ P+   W+LW+ERN++K  HSG
Sbjct: 1971 HIRTLLPIFICWFLWLERNDAKYRHSG 1997


>ref|XP_007036030.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
            gi|508715059|gb|EOY06956.1| Uncharacterized protein
            TCM_021518 [Theobroma cacao]
          Length = 1702

 Score = 80.9 bits (198), Expect = 1e-12
 Identities = 51/162 (31%), Positives = 78/162 (48%), Gaps = 13/162 (8%)
 Frame = +3

Query: 429  VQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAI 608
            + W +P+    KLN+D   K + + A+ GGV RDH   ++F   E +G ++S +AE +A+
Sbjct: 1536 IYWSRPLMGEFKLNVDGCSKEAFQNAASGGVPRDHTSTMIFGFSENFGPYNSTQAELMAL 1595

Query: 609  Y-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIW 749
            +             VW E+D             G    Q+ L  I      ++   SHI 
Sbjct: 1596 HRGLLLCNEYNISRVWIEIDAKAIVQMLHEGHKGYSRTQYLLSFICQCLSGISYRISHIH 1655

Query: 750  REGNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            RE N  AD+L+N GH     +   SK +GE+ G++RLDK  L
Sbjct: 1656 RESNQAADYLSNQGHTHQS-LQVFSKAEGELRGMIRLDKSNL 1696



 Score = 67.8 bits (164), Expect = 9e-09
 Identities = 50/160 (31%), Positives = 74/160 (46%), Gaps = 13/160 (8%)
 Frame = +3

Query: 435  WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
            W K +    KLN+D S + +   A GG + RDH G ++F   E  G  +SL+AE  A+  
Sbjct: 1371 WVKLVSGEHKLNVDGSSRQNQSAAIGG-LLRDHTGTLVFGFSENIGPSNSLQAELRALLR 1429

Query: 612  ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                        +W E+D             G  D+Q+ L  I       +   SHI+RE
Sbjct: 1430 GLLLCKERNIEKLWIEMDALVAIQMIQQSQKGSHDIQYLLASIRKCLSFFSFRISHIFRE 1489

Query: 756  GNGLADWLANTGHREII*VLPASKTQGEVVGLLRLDKWGL 875
            GN +AD+L+N GH +   +L  S+ +GE+        WGL
Sbjct: 1490 GNQVADFLSNKGHTQQN-LLVFSEAEGELHA-----HWGL 1523



 Score = 61.6 bits (148), Expect = 7e-07
 Identities = 28/82 (34%), Positives = 41/82 (50%)
 Frame = +1

Query: 58   SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
            SKC CC  EET+ HVL    VA++VW                   +   W ++G     G
Sbjct: 1230 SKCACCNSEETLIHVLWDNPVAKQVWNFFANFFQIYVSNPQNVSQILWAWYFSGDYVRKG 1289

Query: 238  YSQTITPLVTFWYLWVERNNSK 303
            + +T+ PL   W+LW+ERN++K
Sbjct: 1290 HIRTLIPLFICWFLWLERNDAK 1311


>ref|XP_007022459.1| RNase H family protein [Theobroma cacao]
           gi|508722087|gb|EOY13984.1| RNase H family protein
           [Theobroma cacao]
          Length = 429

 Score = 80.5 bits (197), Expect = 1e-12
 Identities = 50/158 (31%), Positives = 81/158 (51%), Gaps = 14/158 (8%)
 Frame = +3

Query: 435 WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
           W+KP+    KLN+D   K   + A+GG + RDH G ++F+  E +G ++SL+AE +A+Y 
Sbjct: 258 WQKPLTGEFKLNVDGGSKYDCQSAAGGRLLRDHTGTLIFSFVENFGPYNSLQAELMALYR 317

Query: 612 ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                       +W E+D             G   +++ L  I      ++   SHI RE
Sbjct: 318 GLLLCIEHNVRRLWIEMDAKVVIQMIHRGHKGSAQIRYLLASIRKCLSVISFRISHIHRE 377

Query: 756 GNGLADWLANTGH-REII*VLPASKTQGEVVGLLRLDK 866
           GN  AD L+N G+  + + V   S+ +G++ G+L LDK
Sbjct: 378 GNQAADLLSNQGYMHQNLHVF--SQVKGQLKGILGLDK 413


>ref|XP_007019605.1| Uncharacterized protein TCM_035716 [Theobroma cacao]
           gi|508724933|gb|EOY16830.1| Uncharacterized protein
           TCM_035716 [Theobroma cacao]
          Length = 165

 Score = 80.1 bits (196), Expect = 2e-12
 Identities = 52/160 (32%), Positives = 80/160 (50%), Gaps = 16/160 (10%)
 Frame = +3

Query: 435 WRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSSLEAE*LAIY- 611
           W+KP+    KLN+D S K + + A+ GG+ RD+ G+++F  +E +G  +S++AE LA+Y 
Sbjct: 4   WQKPVLGEFKLNVDGSSKCNFQNATSGGILRDYTGSLVFGFYENFGVKNSIQAELLALYK 63

Query: 612 ------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKMTVYYSHIWRE 755
                       +W E+D             G  D ++ L +I       +   SHI++E
Sbjct: 64  GLILCRDYGISHLWIEMDALVVIQMLTGRYRGSHDSRYLLANIQNLHNYFSYKLSHIFQE 123

Query: 756 GNGLADWLANTG---HREII*VLPASKTQGEVVGLLRLDK 866
           GN  AD L N G   H   +  +P  K Q    G+LRLDK
Sbjct: 124 GNQAADLLVNLGYEYHSLQVFTVPFGKLQ----GILRLDK 159


>ref|XP_007017128.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
            gi|508787491|gb|EOY34747.1| Uncharacterized protein
            TCM_042327 [Theobroma cacao]
          Length = 1014

 Score = 79.7 bits (195), Expect = 2e-12
 Identities = 57/171 (33%), Positives = 86/171 (50%), Gaps = 14/171 (8%)
 Frame = +3

Query: 405  RMSKQIITVQWRKPMPWTVKLNIDDSQKSSSKLASGGGVARDHKGNILFALHEFYGDFSS 584
            R S QII   W KP+    KLN+D S + +   A+GG + RDH G ++F   E  G  +S
Sbjct: 843  RESPQII--HWVKPVTGEYKLNVDGSSRHNQSAATGG-LLRDHTGTLVFGFSENIGPSNS 899

Query: 585  LEAE*LAIY-------------VWTEVDXXXXXXXXXXXXAGPWDVQHCLKDIHYWGKKM 725
            L+AE  A+              +W E+D             G  D+++ L  I       
Sbjct: 900  LQAELRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLSFF 959

Query: 726  TVYYSHIWREGNGLADWLANTGH-REII*VLPASKTQGEVVGLLRLDKWGL 875
            +   SHI+REGN  AD+L+N GH  + + V+  S+ QG++ G+L+LD+  L
Sbjct: 960  SFRISHIFREGNQAADFLSNKGHTHQNLQVI--SEAQGKLHGMLKLDRLNL 1008



 Score = 64.3 bits (155), Expect = 1e-07
 Identities = 29/83 (34%), Positives = 42/83 (50%)
 Frame = +1

Query: 58  SKCQCCKDEETIGHVLISGRVAQEVWTXXXXXXXXXXXXXXXXIMLFQYWRYAGYAPSPG 237
           SKC CC  EE++ HVL    VA++VW                   +   W Y+G     G
Sbjct: 710 SKCVCCNSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKG 769

Query: 238 YSQTITPLVTFWYLWVERNNSKH 306
           + +T+ PL   W+LW+ERN++KH
Sbjct: 770 HIRTLIPLFICWFLWLERNDAKH 792

