BLASTX nr result

ID: Rehmannia24_contig00012259 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00012259
         (790 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   179   1e-42
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   178   2e-42
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   176   8e-42
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   174   2e-41
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   172   9e-41
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   172   2e-40
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   171   2e-40
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   169   1e-39
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   168   2e-39
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   167   4e-39
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   167   5e-39
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   164   2e-38
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   159   1e-36
gb|EOX99578.1| Uncharacterized protein TCM_008287 [Theobroma cacao]   156   9e-36
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   155   1e-35
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   130   4e-28
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   129   1e-27
ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein A...   125   2e-26
gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]   124   5e-26
ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein A...   122   2e-25

>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  179 bits (454), Expect = 1e-42
 Identities = 91/260 (35%), Positives = 135/260 (51%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGGMPICIAPVHPTDSLGWKRVC 609
            GLG++S+  S  AFS KLWWR     SLW R+M +KY  G         P DS  WK + 
Sbjct: 819  GLGIRSLKDSFDAFSAKLWWRFDTCQSLWVRYMRLKYCTGQIHHNIAPKPHDSATWKPLL 878

Query: 608  KIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMN 429
              R      I W +G G+I FWHD W   EPL N  P     +    V YF+N+++WD++
Sbjct: 879  AGRATASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQSM--MKVNYFFNDDAWDVD 936

Query: 428  RLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQLW 249
            +L   +      ++ KIPIS  + D   W L+ANG+FSI SA+  L    +   + + +W
Sbjct: 937  KLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQRKQVNLVGQLIW 996

Query: 248  NPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVPH 69
            +  IP + S FLW    N +PV+ ++  +GI LASKC+CC    S +            H
Sbjct: 997  HKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSEESLL------------H 1044

Query: 68   LFLQNAQVVKVWNHFASWLR 9
            +  ++    +VWN+F+ + +
Sbjct: 1045 VLWESPVAQQVWNYFSKFFQ 1064


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  178 bits (451), Expect = 2e-42
 Identities = 92/258 (35%), Positives = 138/258 (53%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKR 615
            GL ++++     AFSMKLWWR R  +SLW +FM  KY GG +P  + P +H  DS  WKR
Sbjct: 1698 GLDIRNVEDVCEAFSMKLWWRFRTTNSLWTQFMRAKYCGGQLPTDVQPKLH--DSQTWKR 1755

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +  I +  + NI W +G G + FWHD W   EPL N      + +    V+ F+ NNSW+
Sbjct: 1756 MVTISSITEQNIRWRIGHGELFFWHDCWMGEEPLVNRNQAFASSMAQ--VSDFFLNNSWN 1813

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L  ++  E   ++ KIPI  +  D   W  + NG+FS  SA+  + +     P+F  
Sbjct: 1814 VEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNRKVENPVFNF 1873

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  +P + S FLW L  + IPV+ K+  +G  LAS+C CC    S M           
Sbjct: 1874 IWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSEESLM----------- 1922

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+  +N    +VW++FA
Sbjct: 1923 -HVMWKNPVANQVWSYFA 1939


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  176 bits (446), Expect = 8e-42
 Identities = 92/258 (35%), Positives = 137/258 (53%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKR 615
            GL ++++     AFSMKLWWR R  +SLW +FM  KY GG +P  + P +H  DS  WKR
Sbjct: 2986 GLDIRNLEDVFKAFSMKLWWRFRTTNSLWMQFMRAKYCGGQLPTHVQPKLH--DSQTWKR 3043

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +  I +  + NI W VG G + FWHD W   EPL  +I   +       V+ F+ NNSWD
Sbjct: 3044 MVTISSITEQNIRWRVGHGKLFFWHDCWMGEEPL--VIRNQEFASSMAQVSDFFLNNSWD 3101

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L +++  E   +++KIPI+ +  D   W  + NG+FS  SA+          P +  
Sbjct: 3102 IEKLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRERKVVNPTYNY 3161

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  +P + S FLW L  + +PV+ K+  +G  LAS+C CC    S M           
Sbjct: 3162 IWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSEESLM----------- 3210

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+   N    +VW++FA
Sbjct: 3211 -HVMWDNPVANQVWSYFA 3227



 Score =  169 bits (428), Expect = 1e-39
 Identities = 89/258 (34%), Positives = 136/258 (52%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GL ++++     AFS+KLWWR +  +SLW RF+  KY  G +P  + P +H  DS  WKR
Sbjct: 1192 GLDIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLRTKYCLGRIPHLVQPKLH--DSQVWKR 1249

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +   R+    NI W +G G + FWHD W   +PL  + P   N + +  V  F+N + WD
Sbjct: 1250 MIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLFPSFHNDMSH--VHKFYNGDEWD 1307

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L++ +     +++ +IP   +Q D   W L++NG FS  SA+  +        +   
Sbjct: 1308 IVKLNSYLPTSLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQTPNALLSF 1367

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
             W+  IP S S FLW +  N IPV+ ++ ++GI LASKCVCC    S +           
Sbjct: 1368 NWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLI----------- 1416

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+  +N    +VWN FA
Sbjct: 1417 -HVLWENPVAKQVWNFFA 1433


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  174 bits (442), Expect = 2e-41
 Identities = 91/258 (35%), Positives = 135/258 (52%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKR 615
            GL ++S+     AFSMKLWWR R   SLW RFM +KY  G +P+   P +H  DS  WKR
Sbjct: 1735 GLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMQTQPKLH--DSQTWKR 1792

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +       + ++ W VG GN+ FWHD W    PL +      + +    V  F+ NNSW+
Sbjct: 1793 MLTSSTITEQHMRWRVGQGNVFFWHDCWMGEAPLISSNQEFTSSM--VQVCDFFTNNSWN 1850

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L  ++  E  ++++KIPI     D   W  + NG+FS  SA+  +       P+F  
Sbjct: 1851 IEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNF 1910

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  +P + S FLW L  + IPV+ K+  +G+ LAS+C CC    S M           
Sbjct: 1911 IWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEESIM----------- 1959

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+   N   ++VWN+FA
Sbjct: 1960 -HVMWDNPVAMQVWNYFA 1976


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  172 bits (437), Expect = 9e-41
 Identities = 92/259 (35%), Positives = 138/259 (53%), Gaps = 2/259 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GL ++ +     AFS+KLWWR      LW +F+  KY  G +P  + P +H  DS  WKR
Sbjct: 1438 GLDIRRLTDMFDAFSLKLWWRFSTCEGLWTKFLKTKYCMGQIPHYVHPKLH--DSQVWKR 1495

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            + + R     N  W +G G++ FWHD W   +PL    P  +N +    V  F+N ++WD
Sbjct: 1496 MVRGREVAIQNTRWRIGKGSLFFWHDCWMGDQPLVTSFPHFRNDMST--VHNFFNGHNWD 1553

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            +++L+  + +   +++ +IPI  +Q D   W L++NG FS  SA+ A+        +   
Sbjct: 1554 VDKLNLYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRKSPNVLCSL 1613

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            LW+  IP S S FLW +F N IPVD +L E+G  LASKC+CC            N+ E +
Sbjct: 1614 LWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICC------------NSEESL 1661

Query: 74   PHLFLQNAQVVKVWNHFAS 18
             H+   N    +VWN FA+
Sbjct: 1662 IHVLWDNPIAKQVWNFFAN 1680


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  172 bits (435), Expect = 2e-40
 Identities = 89/258 (34%), Positives = 139/258 (53%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GL ++++     AFS+KLWWR +  +SLW +F+  KY  G +P  + P +H  DS  WKR
Sbjct: 1435 GLDIRNLRDVFEAFSLKLWWRFQTCNSLWTKFLRTKYCLGRIPHFVQPKLH--DSQVWKR 1492

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +   R+    NI W +G G + FWHD W   +PL  + P   N + +  V  F+N + WD
Sbjct: 1493 MIVGRDVALQNIRWRIGKGELFFWHDCWMGDQPLATLCPSFHNDMSH--VHKFYNGDVWD 1550

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L + +     +++ +IP   +Q D   W L++NG+FS+ SA+ A+        +F  
Sbjct: 1551 IEKLSSCLPTSLVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQRQTPNALFSL 1610

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  IP S S FLW +  N IPV+ ++ ++GI LASKCVCC    S +           
Sbjct: 1611 IWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLI----------- 1659

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+  +N    +VW  FA
Sbjct: 1660 -HVLWENPVATQVWFFFA 1676


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  171 bits (434), Expect = 2e-40
 Identities = 91/262 (34%), Positives = 133/262 (50%), Gaps = 2/262 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKR 615
            GL ++++     AFSMKLWWR R   SLW RFM +KY  G +P+   P +H  DS  WKR
Sbjct: 1733 GLDIRNLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMHTQPKLH--DSQTWKR 1790

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +       + N+ W VG G + FWHD W    PL +     +  L    V  F+ NNSWD
Sbjct: 1791 MVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTS--SNQELSLSMVQVCDFFMNNSWD 1848

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L  ++  E  ++++KIPI     D   W  + NG FS  SA+  +       P+F  
Sbjct: 1849 IEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNF 1908

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  +P + S FLW L  + IPV+ K+  +G  LAS+C CC    S M           
Sbjct: 1909 IWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKSEESIM----------- 1957

Query: 74   PHLFLQNAQVVKVWNHFASWLR 9
             H+   N    +VWN+F+ + +
Sbjct: 1958 -HVMWDNPVATQVWNYFSKFFQ 1978


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  169 bits (428), Expect = 1e-39
 Identities = 88/258 (34%), Positives = 140/258 (54%), Gaps = 2/258 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GLG++ +     AF++KLWWR +  +SLW +F+  KY  G +P  I P +H  DS  WKR
Sbjct: 1612 GLGIRKLEDVCAAFTLKLWWRFQTGNSLWTQFLRTKYCLGRIPHHIQPKLH--DSHVWKR 1669

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +   R     NI W +G G++ FWHD W   +PL    P  +N + +    +F+N ++WD
Sbjct: 1670 MISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQNDMSHG--YHFYNGDTWD 1727

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            +++L + +      ++ ++P   ++ D   W L++NG+FS  SA+  +     +  +   
Sbjct: 1728 VDKLRSFLPTILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQTSNALCSF 1787

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  IP S S FLW    N IPV+ ++ E+GI LASKCVCC            N+ E +
Sbjct: 1788 IWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCC------------NSEESL 1835

Query: 74   PHLFLQNAQVVKVWNHFA 21
             H+  +N    +VWN FA
Sbjct: 1836 IHVLWENPVAKQVWNFFA 1853


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  168 bits (425), Expect = 2e-39
 Identities = 89/241 (36%), Positives = 125/241 (51%), Gaps = 2/241 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKR 615
            GL ++S+     AFSMKLWWR R   SLW RFM +KY  G +P+   P +H  DS  WKR
Sbjct: 1905 GLDIRSLAEVFEAFSMKLWWRFRTTDSLWTRFMRMKYCRGQLPMHTQPKLH--DSQTWKR 1962

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +       + N+ W VG GN+ FWHD W    PL  I    +  L    V  F+ NNSWD
Sbjct: 1963 MVASSAITEQNMRWRVGQGNLFFWHDCWMGETPL--ISSNHEFSLSMVQVCDFFMNNSWD 2020

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            + +L  ++  E  ++++KIPI     D   W  + NG FS  SA+  +       P+F  
Sbjct: 2021 IEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSAWQLIRKREVVNPVFNF 2080

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  IP + S FLW L  + IPV+ ++  +G  LAS+C CC    S +     N   + 
Sbjct: 2081 IWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQLASRCRCCRSEESIIHVMWDNPVAVQ 2140

Query: 74   P 72
            P
Sbjct: 2141 P 2141


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  167 bits (423), Expect = 4e-39
 Identities = 90/255 (35%), Positives = 132/255 (51%), Gaps = 2/255 (0%)
 Frame = -3

Query: 779  VKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGG-MPICIAP-VHPTDSLGWKRVCK 606
            V S+     AFSMKLWWR R   SLW RFM +KY  G +P+   P +H  DS  WKR+  
Sbjct: 397  VNSLAEVFEAFSMKLWWRFRTIDSLWTRFMRMKYCRGQLPMQTQPKLH--DSQTWKRMLT 454

Query: 605  IRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMNR 426
                 + ++ W VG GN+ FWHD W    PL +      + +    V  F+ NNSW++ +
Sbjct: 455  SSATTEQHMRWRVGQGNLFFWHDCWMGDAPLISSNQEFTSSM--VQVCDFFMNNSWNVEK 512

Query: 425  LHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQLWN 246
            L  ++  E  ++++KIPI     D   W  + NG+FS  SA+  +       P+F  +W+
Sbjct: 513  LKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQLIRKRKVVNPVFNFIWH 572

Query: 245  PMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVPHL 66
              +P + S FLW L  + IPV+ K+  +G+ LAS+C CC    S M            H+
Sbjct: 573  KTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEESIM------------HV 620

Query: 65   FLQNAQVVKVWNHFA 21
               N   ++VWN+FA
Sbjct: 621  MWDNPVAMQVWNYFA 635


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  167 bits (422), Expect = 5e-39
 Identities = 88/262 (33%), Positives = 141/262 (53%), Gaps = 2/262 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GL +++++    AF++KLWWR +   SLW  F+  KY  G +P  + P +H  DSL WKR
Sbjct: 363  GLDIRNLNDVFEAFTLKLWWRFQTCDSLWTHFLKTKYCLGRIPHYVHPKLH--DSLVWKR 420

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            + + R     NI W +G G++ FWHD W  ++PL    P  +N +    V  F+N ++WD
Sbjct: 421  MIRGREVAFRNIRWKIGKGDLFFWHDCWMGNQPLVMSFPSLRNDMSL--VHNFYNGDTWD 478

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
            +++L   + +   +++  IP + TQ D   W L++NG F+  SA+  +     +  +   
Sbjct: 479  VDKLKAYLPMNLIDEILLIPFNRTQQDVAYWTLTSNGEFATWSAWETIRQRKSSNALCSF 538

Query: 254  LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
            +W+  IP S S FLW    N IPV+ ++ E+GI LASKCVCC            N+ E +
Sbjct: 539  IWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCC------------NSEESL 586

Query: 74   PHLFLQNAQVVKVWNHFASWLR 9
             H+   N+   +VW  F  + +
Sbjct: 587  MHVLWGNSVAKQVWAFFGKFFQ 608


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  164 bits (416), Expect = 2e-38
 Identities = 86/260 (33%), Positives = 132/260 (50%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGGMPICIAPVHPTDSLGWKRVC 609
            GL ++ ++    AF+MKLWWR +    LW  F+  KY  G           DS  WKR+ 
Sbjct: 498  GLDIRRLNDVSDAFTMKLWWRFQTCDGLWTNFLKTKYCMGQIPHYVQSKLHDSQVWKRMV 557

Query: 608  KIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMN 429
            + R+    N  W +G GN+ FWHD W  ++PL    P  +N +    V  F+N ++WD+N
Sbjct: 558  RGRDVAIQNTRWRIGKGNLFFWHDCWMGNKPLVTSFPSFRNDMTF--VHKFYNGDNWDVN 615

Query: 428  RLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQLW 249
             L   + +   +++ +IP   +Q D   W L+++G FS  SA+ A+        +   +W
Sbjct: 616  TLKLYLPMNLIDEILQIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQRQSPNTLCSFIW 675

Query: 248  NPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVPH 69
            +  IP + S FLW +  N IPV+ +L E+G  LASKCVCC            N+ E + H
Sbjct: 676  HKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCC------------NSEESLIH 723

Query: 68   LFLQNAQVVKVWNHFASWLR 9
            +   N    +VWN FA + +
Sbjct: 724  VLWDNPVAKQVWNFFADFFQ 743


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  159 bits (401), Expect = 1e-36
 Identities = 85/261 (32%), Positives = 132/261 (50%), Gaps = 1/261 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAPVHPTDSLGWKRV 612
            GL ++++     AF++KLWWR     SLW  F+  KY  G +P  + P     S+ WKR+
Sbjct: 411  GLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLGRIPHYVQPKLHNSSI-WKRI 469

Query: 611  CKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDM 432
               R+    N  W +G G + FWHD W   +PL    P  +N +    V  F+  +SWD+
Sbjct: 470  TGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSL--VHKFYKGDSWDV 527

Query: 431  NRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQL 252
            ++L   + +   +++  IP   TQ D   W L++NG FS  SA+  +        +   +
Sbjct: 528  DKLRLFLPVNLVDEILLIPFDRTQQDVAYWILTSNGEFSTRSAWETIRKRQPHNTLGSLI 587

Query: 251  WNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVP 72
            W+  IP S S F+W    N IPV+ ++ E+GI LASKCVCC            N+ E + 
Sbjct: 588  WHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCC------------NSEESLM 635

Query: 71   HLFLQNAQVVKVWNHFASWLR 9
            H+   N+   +VW  FA++ +
Sbjct: 636  HVLWGNSVAKQVWAFFANFFQ 656


>gb|EOX99578.1| Uncharacterized protein TCM_008287 [Theobroma cacao]
          Length = 499

 Score =  156 bits (394), Expect = 9e-36
 Identities = 75/220 (34%), Positives = 117/220 (53%)
 Frame = -3

Query: 788 GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGGMPICIAPVHPTDSLGWKRVC 609
           GL +K +     AFSMKLWW+ +  +++W++FM  KY  G           DS  WKR+ 
Sbjct: 280 GLDIKGLEDVFEAFSMKLWWKFQTCNNIWSKFMRAKYCYGRIPGYTQPKRHDSQMWKRML 339

Query: 608 KIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMN 429
                 + ++ W +G G + FW+D W   EPL N  P   + +    V YF+NNN WD++
Sbjct: 340 ACYLVTEQHMRWKIGKGELFFWYDCWMGDEPLINRFPVFSSSMTQ--VCYFFNNNEWDVD 397

Query: 428 RLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQLW 249
           +L+ ++  E   ++ KIP + +  D   W  +++G+F+  SA+  +        +F  +W
Sbjct: 398 KLNTMLPEEMVVEILKIPFNTSSTDVAYWVPTSDGDFTTKSAWEIIRQRDLVNSVFNLIW 457

Query: 248 NPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCC 129
           +  IP + S FLW L QN  PVD +L  +G  LASKC  C
Sbjct: 458 HRCIPLTTSFFLWRLLQNWSPVDLRLKIKGFQLASKCQYC 497


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  155 bits (392), Expect = 1e-35
 Identities = 84/261 (32%), Positives = 130/261 (49%), Gaps = 1/261 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAPVHPTDSLGWKRV 612
            GL ++++     AF++KLWWR     SLW  F+  KY  G +P  + P   + S+ WKR+
Sbjct: 1699 GLDIRNLKDVFDAFTLKLWWRFYTCDSLWTLFLKTKYCLGRIPHYVQPKIHSSSI-WKRI 1757

Query: 611  CKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDM 432
               R+    N  W +G G + FWHD W   +PL    P  +N +    V  F+  +SWD+
Sbjct: 1758 TGGRDVTIQNTRWKIGRGELFFWHDCWMGDQPLVISFPSFRNDMSF--VHKFYKGDSWDV 1815

Query: 431  NRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQL 252
            ++L   + +    ++  IP   TQ D   W L++NG FS  SA+  +        +   +
Sbjct: 1816 DKLRLFLPVNLIYEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQQSHNTLGSLI 1875

Query: 251  WNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVP 72
            W+  IP S S F+W    N IPV+ ++  +GI LASKCVCC            N+ E + 
Sbjct: 1876 WHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCC------------NSEESLM 1923

Query: 71   HLFLQNAQVVKVWNHFASWLR 9
            H+   N+   +VW  FA + +
Sbjct: 1924 HVLWGNSVAKQVWAFFAKFFQ 1944


>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  130 bits (328), Expect = 4e-28
 Identities = 80/269 (29%), Positives = 126/269 (46%), Gaps = 11/269 (4%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLR-DKSSLWARFMHVKYAGGMPICIAPVHPTDSLG---- 624
            GLG +S+H    AF  KLWW  R D SSLWA FM  KY   M       HPT + G    
Sbjct: 722  GLGFRSLHDVSKAFFAKLWWNFRTDTSSLWASFMWNKYCKKM-------HPTVARGQGAS 774

Query: 623  --WKRVCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLEN-FDVAYFW 453
              W+++  +R  ++ NI W + +GN SFW D W +   L+ +   + N +E   +V YF 
Sbjct: 775  HVWRKMITVREEVEHNIWWQIKAGNSSFWFDNWTKQGALWYVE--ENNAVEEKIEVKYFT 832

Query: 452  NNNSWDMNRLHNIVGLEWANKLS---KIPISHTQVDSMKWKLSANGNFSISSAYTALLSI 282
            +  +WD  +L N +  E  + +    K P+     D   W  S  G F++ SA+  +   
Sbjct: 833  HQGAWDREKLLNKISEEMTDYIMESIKPPLEEYINDVAWWMGSTQGIFTVKSAWELMRHK 892

Query: 281  SETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPN 102
             E +  ++ +W   +P   + FLW L++  I  D  L    I + S+C CC  +      
Sbjct: 893  QERRTDYQLIWTKDVPFKMNFFLWRLWKRRIATDDNLKRMKIQIVSRCWCCSETEE---- 948

Query: 101  FSPNTFEIVPHLFLQNAQVVKVWNHFASW 15
                  E + H+FL      ++W  F+++
Sbjct: 949  ------ETMTHIFLTAPIANRLWRQFSNF 971


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  129 bits (324), Expect = 1e-27
 Identities = 71/194 (36%), Positives = 100/194 (51%), Gaps = 2/194 (1%)
 Frame = -3

Query: 788 GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGGMPICIAPVHPTDSLGWKRVC 609
           GL + S+     AFS KLWWR     SLWAR+M +KY  G         P DS  WKR+ 
Sbjct: 389 GLDICSLKDFFDAFSTKLWWRFDTCQSLWARYMRLKYCTGQIHHNIAPKPHDSATWKRLI 448

Query: 608 KIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMN 429
             R      I W +G G+I FWHD W   EPL N  P     +    V YF+N+++WD++
Sbjct: 449 DGRVTASQQIRWRIGKGDIFFWHDAWMGDEPLVNSFPSFSQSM--MKVNYFFNDDAWDVD 506

Query: 428 RLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQLW 249
           +L  ++     +++ KIPIS    D   W L+ NG+FS  SA+  L    +   + + +W
Sbjct: 507 KLKTVIPNAIVDEILKIPISRENEDIAYWALTPNGDFSTKSAWELLRQRKQVNLVGQLIW 566

Query: 248 -NPMIPP-SASIFL 213
            NP   P S S+F+
Sbjct: 567 HNPNPDPISPSLFI 580



 Score =  123 bits (308), Expect = 8e-26
 Identities = 69/228 (30%), Positives = 106/228 (46%), Gaps = 1/228 (0%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAPVHPTDSLGWKRV 612
            GL ++++     AF++KLWWR     SLW  F+  KY  G +P  + P     S+ WKR+
Sbjct: 675  GLDIRNLKDVFDAFTLKLWWRFYTCDSLWTHFLKTKYCLGRIPQYMQPKLHNSSI-WKRM 733

Query: 611  CKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDM 432
               ++ +  NI W +G G +  WHD W   +PL    P  +N + +  V  F+  +SWD+
Sbjct: 734  TGGQDVVIQNIRWKIGKGELFSWHDCWMGDQPLVISFPSFRNDMSS--VHKFYKGDSWDV 791

Query: 431  NRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQL 252
            ++L   + +   N++  IP   TQ D   W L++NG FS  SA+  +             
Sbjct: 792  DKLRLFLPVNLINEILPIPFDRTQQDVAYWTLTSNGEFSTWSAWETIRQ----------- 840

Query: 251  WNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 108
                          W   N + +   + E+GI L SKCVCC    S M
Sbjct: 841  --------------WQSHNTLALSFGIEEKGIHLVSKCVCCNSEESLM 874


>ref|XP_004253442.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 775

 Score =  125 bits (313), Expect = 2e-26
 Identities = 77/259 (29%), Positives = 114/259 (44%), Gaps = 2/259 (0%)
 Frame = -3

Query: 788 GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAGGMPICIAPVHPTDSLGWKRVC 609
           G+G+++++    +F  K WW  R K +LW  F+  KY               SL WK + 
Sbjct: 243 GVGMRNLNDVCKSFQFKQWWTFRTKQTLWGDFLRAKYCQRSNPVSKKWDTGQSLTWKHML 302

Query: 608 KIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDMN 429
            IR  ++ +I W + +GN SFW D W  + PL     C+   L N  VA FW N  W+  
Sbjct: 303 AIRQQVEQHIQWQLQAGNCSFWWDNWMGTGPLAQHT-CNNIRLNNSKVADFWENGVWNYR 361

Query: 428 RL-HNIVGLEWANKLS-KIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFKQ 255
           +L       + AN ++  IP    Q D   WKL + G FS  SA+  + +          
Sbjct: 362 KLVEQAPASQLANIMAIAIPQQQYQQDQPVWKLHSQGKFSCHSAWEEIRNKKAKNRFLSF 421

Query: 254 LWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIV 75
           LW+  IP   S  LW + +  IP ++KL   GI   S C CC   +           + +
Sbjct: 422 LWHNFIPFKTSFLLWRILKGKIPTNEKLTNFGIE-PSPCYCCVDRAG---------MDSI 471

Query: 74  PHLFLQNAQVVKVWNHFAS 18
            H+F       +VW  FA+
Sbjct: 472 NHIFNTGNFAGRVWKSFAA 490


>gb|EOY34749.1| Uncharacterized protein TCM_042329 [Theobroma cacao]
          Length = 2606

 Score =  124 bits (310), Expect = 5e-26
 Identities = 63/169 (37%), Positives = 99/169 (58%), Gaps = 2/169 (1%)
 Frame = -3

Query: 788  GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYA-GGMPICIAP-VHPTDSLGWKR 615
            GLG++++     AFS+KLWWR +  +SLW RF+  KY  G +P  + P +H  DS  WKR
Sbjct: 1376 GLGIRNLRDVFEAFSLKLWWRFQTCNSLWTRFLKTKYCLGRIPHFVQPKLH--DSQVWKR 1433

Query: 614  VCKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWD 435
            +   R+    NI W +G G + FWHD W    PL N+ P   N + +  V  F+N + WD
Sbjct: 1434 MIFGRDVALQNIRWGIGKGELFFWHDCWMGDLPLSNLFPSFHNDMSH--VHKFYNGDGWD 1491

Query: 434  MNRLHNIVGLEWANKLSKIPISHTQVDSMKWKLSANGNFSISSAYTALL 288
            + +L++ + +   +++ +IP   +Q D   W L++NG+FS+ SA+ A L
Sbjct: 1492 IVKLNSCLPMSLIDEILQIPFDRSQEDIAYWALTSNGDFSLWSAWEAEL 1540


>ref|XP_004234855.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
           lycopersicum]
          Length = 440

 Score =  122 bits (305), Expect = 2e-25
 Identities = 74/260 (28%), Positives = 114/260 (43%), Gaps = 3/260 (1%)
 Frame = -3

Query: 788 GLGVKSIHHSITAFSMKLWWRLRDKSSLWARFMHVKYAG-GMPICIAPVHPTDSLGWKRV 612
           G+G++++     +F  K WW  R K +LW  F+  KY     P+C       +SL WK +
Sbjct: 153 GIGMRNLQDVCKSFQFKQWWVFRTKQTLWGEFLRAKYCQRSNPVC-KKWDTGESLTWKHM 211

Query: 611 CKIRNFMQDNIIWDVGSGNISFWHDIWFESEPLFNIIPCDKNGLENFDVAYFWNNNSWDM 432
              R  ++ +I W++ +GN SFW D W  + PL        N   N  VA F  N  W  
Sbjct: 212 LDTRQQVEQHIHWNLQAGNCSFWWDNWLGTGPLAQHTT-SSNRFNNITVAEFLENGEWKW 270

Query: 431 NRLHNIVGLEWANKL--SKIPISHTQVDSMKWKLSANGNFSISSAYTALLSISETQPIFK 258
           ++L     +   + +  ++IP    + D   WK + +G FS +SA+  + S         
Sbjct: 271 SKLMKHAPVTQLSSILATRIPQHQHRPDQAIWKPNTHGRFSCTSAWEEIRSKKAKNNFNS 330

Query: 257 QLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEI 78
            +W+  IP   S  LW   +  +P ++KL   GI   S C CC         F     + 
Sbjct: 331 LIWHKSIPFKTSFLLWRTLKGKLPTNEKLFNFGIE-PSPCFCC---------FDRAGMDT 380

Query: 77  VPHLFLQNAQVVKVWNHFAS 18
           V H+F       KVW  FA+
Sbjct: 381 VEHIFNSGPFAAKVWRFFAA 400


Top