BLASTX nr result

ID: Rehmannia25_contig00026754 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00026754
         (920 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   120   2e-29
gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   121   2e-29
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   119   6e-29
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   121   3e-28
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   124   4e-28
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   120   4e-28
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   117   6e-28
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   120   8e-28
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   126   4e-27
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   126   1e-26
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   119   1e-26
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   126   1e-26
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   124   4e-26
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   117   6e-25
gb|EOY13984.1| RNase H family protein [Theobroma cacao]               114   5e-23
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   110   1e-22
gb|EOY17470.1| Uncharacterized protein TCM_036655 [Theobroma cacao]   102   2e-19
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]    97   1e-17
gb|ABI34321.1| RNase H family protein [Solanum demissum]               90   1e-15
ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein A...    77   2e-15

>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  120 bits (302), Expect(2) = 2e-29
 Identities = 61/202 (30%), Positives = 100/202 (49%)
 Frame = +1

Query: 34   VAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALL 213
            V +F+N ++WD+++L + +      ++ ++P   ++ D   W L+ NG+FS  SA   + 
Sbjct: 721  VYHFYNGDTWDVDKLKSFLPTVLVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAGEMIR 780

Query: 214  SISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 393
                +  +   +W+  IP S S FLW    N IPV+ ++ E+GI LASKCVCC       
Sbjct: 781  QRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCC------- 833

Query: 394  PNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPH 573
                 N+ E + H+  +N    +VWN FA   +    +  H+     AW     Y    H
Sbjct: 834  -----NSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQIIWAWYVSGDYVRKGH 888

Query: 574  ISIMLPCLIMWKIWEERNHCRY 639
              ++LP  I W +W ERN  ++
Sbjct: 889  FRVLLPLFICWFLWLERNDAKH 910



 Score = 36.2 bits (82), Expect(2) = 2e-29
 Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 1/70 (1%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVL-WKPPDNPWL 835
            R+I              L     WKG  DI + LGFSF  P +H    ++ WK P     
Sbjct: 919  RVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSFP-PQQHASPQIIYWKKPSIGEY 977

Query: 836  KLNVDAAYKS 865
            KLNVD + ++
Sbjct: 978  KLNVDGSSRN 987


>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  121 bits (304), Expect(2) = 2e-29
 Identities = 60/200 (30%), Positives = 100/200 (50%)
 Frame = +1

Query: 40   YFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSI 219
            +F+N ++WD+++L + +      ++ ++P   ++ D   W L+ NG+FS  SA+  +   
Sbjct: 1719 HFYNGDTWDVDKLRSFLPTILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQR 1778

Query: 220  SETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPN 399
              +  +   +W+  IP S S FLW    N IPV+ ++ E+GI LASKCVCC         
Sbjct: 1779 QTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCC--------- 1829

Query: 400  FSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHIS 579
               N+ E + H+  +N    +VWN FA   +    +  H+     AW     Y    H  
Sbjct: 1830 ---NSEESLIHVLWENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFR 1886

Query: 580  IMLPCLIMWKIWEERNHCRY 639
            ++LP  I W +W ERN  ++
Sbjct: 1887 VLLPLFICWFLWLERNDAKH 1906



 Score = 35.0 bits (79), Expect(2) = 2e-29
 Identities = 21/69 (30%), Positives = 27/69 (39%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            R+I              L     WKG  DI + LGFSF          + WK P     K
Sbjct: 1915 RVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAPPQIIYWKKPSIGEYK 1974

Query: 839  LNVDAAYKS 865
            LNVD + ++
Sbjct: 1975 LNVDGSSRN 1983


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  119 bits (298), Expect(2) = 6e-29
 Identities = 61/199 (30%), Positives = 99/199 (49%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+N ++WD+++L   + +   +++  IP + TQ D   W L+ NG F+  SA+  +    
Sbjct: 471  FYNGDTWDVDKLKAYLPMNLIDEILLIPFNRTQQDVAYWTLTSNGEFATWSAWETIRQRK 530

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
             +  +   +W+  IP S S FLW    N IPV+ ++ E+GI LASKCVCC          
Sbjct: 531  SSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCC---------- 580

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
              N+ E + H+   N+   +VW  F  + +    + +H+     AW     Y    HI  
Sbjct: 581  --NSEESLMHVLWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKGHIRS 638

Query: 583  MLPCLIMWKIWEERNHCRY 639
            +LP  I W +W ERN  ++
Sbjct: 639  LLPIFICWFLWLERNDAKH 657



 Score = 35.8 bits (81), Expect(2) = 6e-29
 Identities = 20/74 (27%), Positives = 31/74 (41%)
 Frame = +2

Query: 659 RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
           R++  +           L H   WKG  DI S  G +F+   R     + W+ P     K
Sbjct: 666 RVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRAPPQIIYWRKPFTGEYK 725

Query: 839 LNVDAAYKSQTVTA 880
           LNVD + ++  + A
Sbjct: 726 LNVDGSSRNGHLAA 739


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  121 bits (304), Expect(2) = 3e-28
 Identities = 61/199 (30%), Positives = 99/199 (49%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+N ++WD+N L   + +   +++ +IP   +Q D   W L+ +G FS  SA+ A+    
Sbjct: 606  FYNGDNWDVNTLKLYLPMNLIDEILQIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQRQ 665

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +   +W+  IP + S FLW +  N IPV+ +L E+G  LASKCVCC          
Sbjct: 666  SPNTLCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCC---------- 715

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
              N+ E + H+   N    +VWN FA + +    + +H+     AW     +    HI  
Sbjct: 716  --NSEESLIHVLWDNPVAKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKGHIRT 773

Query: 583  MLPCLIMWKIWEERNHCRY 639
            ++P  I W +W ERN  ++
Sbjct: 774  LIPLFICWFLWLERNDAKH 792



 Score = 31.2 bits (69), Expect(2) = 3e-28
 Identities = 18/66 (27%), Positives = 25/66 (37%)
 Frame = +2

Query: 659 RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
           R++  +           L     WKG  DI +  GF+  L  R     + W  P     K
Sbjct: 801 RVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRESPQIIHWVKPVTGEYK 860

Query: 839 LNVDAA 856
           LNVD +
Sbjct: 861 LNVDGS 866


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  124 bits (310), Expect(2) = 4e-28
 Identities = 63/199 (31%), Positives = 98/199 (49%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+N + WD+ +L + +     +++ +IP   +Q D   W L+ NG+FS+ SA+ A+    
Sbjct: 1543 FYNGDVWDIEKLSSCLPTSLVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQRQ 1602

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +F  +W+  IP S S FLW +  N IPV+ ++ ++GI LASKCVCC    S +   
Sbjct: 1603 TPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLI--- 1659

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
                     H+  +N    +VW  FA   +       HI     AW     Y    HI I
Sbjct: 1660 ---------HVLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRI 1710

Query: 583  MLPCLIMWKIWEERNHCRY 639
            ++P  I W +W ERN  ++
Sbjct: 1711 LIPLFICWFLWLERNDAKH 1729



 Score = 28.5 bits (62), Expect(2) = 4e-28
 Identities = 20/69 (28%), Positives = 25/69 (36%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            R+I  +           L     WKG  DI +  GF F          + W  P     K
Sbjct: 1738 RVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYCTSPQIIYWIKPFIGEYK 1797

Query: 839  LNVDAAYKS 865
            LNVD + KS
Sbjct: 1798 LNVDGSSKS 1806


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  120 bits (301), Expect(2) = 4e-28
 Identities = 58/202 (28%), Positives = 104/202 (51%)
 Frame = +1

Query: 34   VAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALL 213
            V YF+N+++WD+++L   +      ++ KIPIS  + D   W L+ NG+FSI SA+  L 
Sbjct: 924  VNYFFNDDAWDVDKLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLR 983

Query: 214  SISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 393
               +   + + +W+  IP + S FLW    N +PV+ ++  +GI LASKC+CC    S +
Sbjct: 984  QRKQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSEESLL 1043

Query: 394  PNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPH 573
                        H+  ++    +VWN+F+ + +    + ++I    ++W     +    H
Sbjct: 1044 ------------HVLWESPVAQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGH 1091

Query: 574  ISIMLPCLIMWKIWEERNHCRY 639
            I  ++   I W +W ERN  ++
Sbjct: 1092 IRTLILLFIFWFVWVERNDAKH 1113



 Score = 32.0 bits (71), Expect(2) = 4e-28
 Identities = 22/70 (31%), Positives = 29/70 (41%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            RII  +        +  L     WKG LDI    GF+F    +     + W  P    LK
Sbjct: 1122 RIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNFAQERQARPKIINWIKPLIGELK 1181

Query: 839  LNVDAAYKSQ 868
            LNVD + K +
Sbjct: 1182 LNVDGSSKDE 1191


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  117 bits (294), Expect(2) = 6e-28
 Identities = 62/199 (31%), Positives = 95/199 (47%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+  +SWD+++L   + +    ++  IP   TQ D   W L+ NG FS  SA+  +    
Sbjct: 1807 FYKGDSWDVDKLRLFLPVNLIYEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQQ 1866

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +   +W+  IP S S F+W    N IPV+ ++  +GI LASKCVCC          
Sbjct: 1867 SHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCC---------- 1916

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
              N+ E + H+   N+   +VW  FA + +    + +H+     AW     Y    HI  
Sbjct: 1917 --NSEESLMHVLWGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRT 1974

Query: 583  MLPCLIMWKIWEERNHCRY 639
            +LP  I W +W ERN  +Y
Sbjct: 1975 LLPIFICWFLWLERNDAKY 1993



 Score = 33.9 bits (76), Expect(2) = 6e-28
 Identities = 19/66 (28%), Positives = 27/66 (40%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            RI+  +           L     WKG  DI +   ++F+L  R     V W+ P     K
Sbjct: 2002 RIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRAPPQIVYWRKPSTGEYK 2061

Query: 839  LNVDAA 856
            LNVD +
Sbjct: 2062 LNVDGS 2067


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  120 bits (300), Expect(2) = 8e-28
 Identities = 66/202 (32%), Positives = 99/202 (49%), Gaps = 3/202 (1%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+  +SWD+++L   + +   +++  IP   TQ D   W L+ NG FS  SA+  +    
Sbjct: 519  FYKGDSWDVDKLRLFLPVNLVDEILLIPFDRTQQDVAYWILTSNGEFSTRSAWETIRKRQ 578

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +   +W+  IP S S F+W    N IPV+ ++ E+GI LASKCVCC          
Sbjct: 579  PHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCC---------- 628

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLR---FTPPHTEHIHIFFSAWRNLTPYAHTPH 573
              N+ E + H+   N+   +VW  FA++ +   F P H  HI     AW     Y    H
Sbjct: 629  --NSEESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHI---LWAWFYSGDYVKRGH 683

Query: 574  ISIMLPCLIMWKIWEERNHCRY 639
            I  +LP  I W +W ERN  ++
Sbjct: 684  IRTLLPIFICWFLWLERNDAKH 705



 Score = 31.2 bits (69), Expect(2) = 8e-28
 Identities = 17/66 (25%), Positives = 26/66 (39%)
 Frame = +2

Query: 659 RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
           R++  +           L     WKG  DI +   ++ +L  R     V W+ P     K
Sbjct: 714 RVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNLQLKLRAPPQIVYWRKPSTGEYK 773

Query: 839 LNVDAA 856
           LNVD +
Sbjct: 774 LNVDGS 779


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  126 bits (317), Expect(2) = 4e-27
 Identities = 65/199 (32%), Positives = 102/199 (51%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+N ++WD+++L+  + +   +++ +IPI  +Q D   W L+ NG FS  SA+ A+    
Sbjct: 1546 FFNGHNWDVDKLNLYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLRK 1605

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +   LW+  IP S S FLW +F N IPVD +L E+G  LASKC+CC          
Sbjct: 1606 SPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICC---------- 1655

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
              N+ E + H+   N    +VWN FA+  +      +++      W     Y    HI I
Sbjct: 1656 --NSEESLIHVLWDNPIAKQVWNFFANSFQIYISKPQNVSQILWTWYLSGDYVRKGHIRI 1713

Query: 583  MLPCLIMWKIWEERNHCRY 639
            ++P  I W +W ERN  ++
Sbjct: 1714 LIPLFICWFLWLERNDAKH 1732



 Score = 22.3 bits (46), Expect(2) = 4e-27
 Identities = 17/74 (22%), Positives = 25/74 (33%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            R++  +           L  +  WKG  D  +  G      TR     + W  P     K
Sbjct: 1741 RVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRAAPQILHWVKPVPGEHK 1800

Query: 839  LNVDAAYKSQTVTA 880
            LNVD + +     A
Sbjct: 1801 LNVDGSSRQNQTAA 1814


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  126 bits (317), Expect = 1e-26
 Identities = 63/212 (29%), Positives = 106/212 (50%)
 Frame = +1

Query: 28   IDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTA 207
            + V  F+ NNSW++ +L  ++  E  ++++KIPI     D   W  + NG+FS  SA+  
Sbjct: 497  VQVCDFFMNNSWNVEKLKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAWQL 556

Query: 208  LLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSS 387
            +       P+F  +W+  +P + S FLW L  + IPV+ K+  +G+ LAS+C CC    S
Sbjct: 557  IRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEES 616

Query: 388  SMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHT 567
             M            H+   N   ++VWN+FA   +    +   I+    AW +   Y   
Sbjct: 617  IM------------HVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGAWFHSGDYCKP 664

Query: 568  PHISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
             HI  ++P  I+W +W ERN  ++ ++  + N
Sbjct: 665  GHIRTLVPLFILWFLWVERNDAKHRNLGMYPN 696


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  119 bits (299), Expect(2) = 1e-26
 Identities = 62/199 (31%), Positives = 96/199 (48%)
 Frame = +1

Query: 43   FWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSIS 222
            F+N + WD+ +L++ +     +++ +IP   +Q D   W L+ NG FS  SA+  +    
Sbjct: 1300 FYNGDEWDIVKLNSYLPTSLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQRQ 1359

Query: 223  ETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNF 402
                +    W+  IP S S FLW +  N IPV+ ++ ++GI LASKCVCC    S +   
Sbjct: 1360 TPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLI--- 1416

Query: 403  SPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISI 582
                     H+  +N    +VWN FA   +      +HI     AW     Y    HI I
Sbjct: 1417 ---------HVLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRI 1467

Query: 583  MLPCLIMWKIWEERNHCRY 639
            ++P  I W +W ERN  ++
Sbjct: 1468 LIPLFICWFLWLERNDAKH 1486



 Score = 27.7 bits (60), Expect(2) = 1e-26
 Identities = 19/69 (27%), Positives = 25/69 (36%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            R+I  +           L     WKG  DI +  GF +          + W  P     K
Sbjct: 1495 RVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSPQIISWIKPFIGEYK 1554

Query: 839  LNVDAAYKS 865
            LNVD + KS
Sbjct: 1555 LNVDGSSKS 1563



 Score =  119 bits (299), Expect(2) = 9e-26
 Identities = 62/210 (29%), Positives = 105/210 (50%)
 Frame = +1

Query: 34   VAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALL 213
            V+ F+ NNSWD+ +L +++  E   +++KIPI+ +  D   W  + NG+FS  SA+    
Sbjct: 3091 VSDFFLNNSWDIEKLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSR 3150

Query: 214  SISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 393
                  P +  +W+  +P + S FLW L  + +PV+ K+  +G  LAS+C CC    S M
Sbjct: 3151 ERKVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSEESLM 3210

Query: 394  PNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPH 573
                        H+   N    +VW++FA   +    +   I+   SAW     Y+   H
Sbjct: 3211 ------------HVMWDNPVANQVWSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPGH 3258

Query: 574  ISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
            I  ++P  I+W +W ERN  ++ ++  + N
Sbjct: 3259 IRTLVPLFILWFLWVERNDAKHRNLGMYPN 3288



 Score = 24.6 bits (52), Expect(2) = 9e-26
 Identities = 17/74 (22%), Positives = 24/74 (32%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            RI+  +        + K      W+G   I    G   +         + W  P     K
Sbjct: 3289 RIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAVAPSPPKLLFWNKPSIGEFK 3348

Query: 839  LNVDAAYKSQTVTA 880
            LNVD + K    TA
Sbjct: 3349 LNVDGSSKYNLQTA 3362


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  126 bits (316), Expect = 1e-26
 Identities = 63/212 (29%), Positives = 105/212 (49%)
 Frame = +1

Query: 28   IDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTA 207
            + V  F+ NNSW++ +L  ++  E  ++++KIPI     D   W  + NG+FS  SA+  
Sbjct: 1838 VQVCDFFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAWQL 1897

Query: 208  LLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSS 387
            +       P+F  +W+  +P + S FLW L  + IPV+ K+  +G+ LAS+C CC    S
Sbjct: 1898 IRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSEES 1957

Query: 388  SMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHT 567
             M            H+   N   ++VWN+FA   +    +   I+    AW     Y   
Sbjct: 1958 IM------------HVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGAWFYSGDYCKP 2005

Query: 568  PHISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
             HI  ++P  I+W +W ERN  ++ ++  + N
Sbjct: 2006 GHIRTLVPLFILWFLWVERNDAKHRNLGMYPN 2037


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  124 bits (312), Expect = 4e-26
 Identities = 63/215 (29%), Positives = 103/215 (47%)
 Frame = +1

Query: 19   LENIDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSA 198
            L  + V  F+ NNSWD+ +L  ++  E  ++++KIPI     D   W  + NG FS  SA
Sbjct: 1833 LSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSA 1892

Query: 199  YTALLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGH 378
            +  +       P+F  +W+  +P + S FLW L  + IPV+ K+  +G  LAS+C CC  
Sbjct: 1893 WQLIRKREVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKS 1952

Query: 379  SSSSMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPY 558
              S M            H+   N    +VWN+F+ + +    +   I+    AW     Y
Sbjct: 1953 EESIM------------HVMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWFYSGDY 2000

Query: 559  AHTPHISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
                HI  ++P   +W +W ERN  ++ ++  + N
Sbjct: 2001 CKPGHIRTLVPIFTLWFLWVERNDAKHRNLGMYPN 2035


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  117 bits (293), Expect(2) = 6e-25
 Identities = 61/210 (29%), Positives = 104/210 (49%)
 Frame = +1

Query: 34   VAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALL 213
            V+ F+ NNSW++ +L  ++  E   ++ KIPI  +  D   W  + NG+FS  SA+  + 
Sbjct: 1803 VSDFFLNNSWNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIR 1862

Query: 214  SISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 393
            +     P+F  +W+  +P + S FLW L  + IPV+ K+  +G  LAS+C CC    S M
Sbjct: 1863 NRKVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSEESLM 1922

Query: 394  PNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPH 573
                        H+  +N    +VW++FA   +    +   I+    AW     Y+   H
Sbjct: 1923 ------------HVMWKNPVANQVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGH 1970

Query: 574  ISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
            I  ++P   +W +W ERN  ++ ++  + N
Sbjct: 1971 IRTLVPLFTLWFLWVERNDAKHRNLGMYPN 2000



 Score = 24.3 bits (51), Expect(2) = 6e-25
 Identities = 16/74 (21%), Positives = 25/74 (33%)
 Frame = +2

Query: 659  RIITNVXXXXXXXXKAKLFHATTWKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLK 838
            R++  +        + K      W+G   I    G   +         + W  P    LK
Sbjct: 2001 RVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKPSIGELK 2060

Query: 839  LNVDAAYKSQTVTA 880
            LNVD + K    +A
Sbjct: 2061 LNVDGSCKHNPQSA 2074


>gb|EOY13984.1| RNase H family protein [Theobroma cacao]
          Length = 429

 Score =  114 bits (285), Expect = 5e-23
 Identities = 61/210 (29%), Positives = 104/210 (49%)
 Frame = +1

Query: 34  VAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALL 213
           V+ F+ N SW + +L++ +  +   ++ KIPI  +++    W  + +G F+  SA+  + 
Sbjct: 48  VSNFYQNGSWHIGKLNDALLEDVVTEIMKIPIDESRIYEAYWAPTSDGKFTTKSAWEIVR 107

Query: 214 SISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSM 393
                  +F  +W+  IP S S FLW LFQ+ IPVD +L  +G  L  KC  C       
Sbjct: 108 QRHSINFVFYSIWHRSIPLSISFFLWRLFQDWIPVDLRLKSKGFQLVFKCQHC------- 160

Query: 394 PNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPH 573
                N+ E + H+  +     +VWN+FA + +    H + I+    AW   + Y    H
Sbjct: 161 -----NSKESLFHVMWECPLASQVWNYFAKFFQIYIIHRKSIYQIIWAWLFSSDYTKKGH 215

Query: 574 ISIMLPCLIMWKIWEERNHCRYGSVFSF*N 663
           I I++P  I W +W ERN  ++ ++  + N
Sbjct: 216 IHILIPLFIFWFLWVERNDAKHRNLGMYPN 245


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  110 bits (274), Expect(2) = 1e-22
 Identities = 59/179 (32%), Positives = 83/179 (46%)
 Frame = +1

Query: 100  WANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTALLSISETQPIFKQLWNPMIPPSAS 279
            W      IP   +Q D   W L+ NG FS  SA+ AL        +    W+  IP S S
Sbjct: 1145 WMGDQPLIPFDRSQDDIAYWALTSNGEFSTWSAWEALRLRQSPNVLCSLFWHKSIPLSIS 1204

Query: 280  IFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSSSMPNFSPNTFEIVPHLFLQNAQVV 459
             FLW +F N IPVD +L ++G  LASKC CC            N+ E + H+   N    
Sbjct: 1205 FFLWRVFHNWIPVDLRLKDKGFHLASKCACC------------NSEETLIHVLWDNPVAK 1252

Query: 460  KVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHTPHISIMLPCLIMWKIWEERNHCR 636
            +VWN FA++ +    + +++     AW     Y    HI  ++P  I W +W ERN  +
Sbjct: 1253 QVWNFFANFFQIYVSNPQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAK 1311



 Score = 23.5 bits (49), Expect(2) = 1e-22
 Identities = 15/51 (29%), Positives = 22/51 (43%)
 Frame = +2

Query: 728  WKGFLDIISTLGFSFRLPTRHHHLHVLWKPPDNPWLKLNVDAAYKSQTVTA 880
            WKG +DI +  GF+F    +       W    +   KLNVD + +     A
Sbjct: 1344 WKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSRQNQSAA 1394


>gb|EOY17470.1| Uncharacterized protein TCM_036655 [Theobroma cacao]
          Length = 270

 Score =  102 bits (255), Expect = 2e-19
 Identities = 51/155 (32%), Positives = 85/155 (54%)
 Frame = +1

Query: 28  IDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSAYTA 207
           I V YF+++N WD+++L  ++     N++ K+PIS TQ +   W L++NG+F+  SA+  
Sbjct: 2   IKVNYFFHDNEWDVDKLKVVLPAVIINEILKVPISCTQENLAYWALTLNGDFTTKSAWEL 61

Query: 208 LLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSS 387
           L        + K +W+  IP + S FLW L  N IPV+ ++  +G  LASKC+CC     
Sbjct: 62  LRQRQLIHALGKFIWHTSIPLTVSFFLWCLVHNWIPVELRMKSKGFQLASKCLCC----- 116

Query: 388 SMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLR 492
                   + E + H+  +     +VWN+FA + +
Sbjct: 117 -------QSKETIMHVLWEGPIAQQVWNYFAKFFQ 144


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score = 96.7 bits (239), Expect = 1e-17
 Identities = 58/208 (27%), Positives = 88/208 (42%), Gaps = 1/208 (0%)
 Frame = +1

Query: 19   LENIDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHTQVDSMMWKLSVNGNFSISSA 198
            L  + V  F+ NNSWD+ +L  ++  E  ++++KIPI     D   W  + NG FS  SA
Sbjct: 2005 LSMVQVCDFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKDEAYWAPTPNGEFSTKSA 2064

Query: 199  YTALLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGH 378
            +  +       P+F  +W+  IP + S FLW L  + IPV+ ++  +G  LAS+C CC  
Sbjct: 2065 WQLIRKREVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQLASRCRCCRS 2124

Query: 379  SSSSMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPY 558
              S                                           IH+    W N  P 
Sbjct: 2125 EESI------------------------------------------IHVM---WDN--PV 2137

Query: 559  AHTP-HISIMLPCLIMWKIWEERNHCRY 639
            A  P HI  ++P   +W +W ERN  ++
Sbjct: 2138 AVQPGHIRTLIPIFTLWFLWVERNDAKH 2165


>gb|ABI34321.1| RNase H family protein [Solanum demissum]
          Length = 945

 Score = 89.7 bits (221), Expect = 1e-15
 Identities = 56/205 (27%), Positives = 92/205 (44%), Gaps = 1/205 (0%)
 Frame = +1

Query: 31   DVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISH-TQVDSMMWKLSVNGNFSISSAYTA 207
            +V  F +   WD ++L +I+  +  N++  IPI    Q D  +W  S NG+F+  SAY  
Sbjct: 497  NVKDFIHKREWDFDKLSDILPPQVVNQIVSIPIGDPNQSDYAIWIPSENGHFTTKSAYVD 556

Query: 208  LLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCGHSSS 387
              +  E   +  ++W+   P   S   W L QN +P    + +   ++ S CVCC +  +
Sbjct: 557  CSNTREKNDMRNKIWHGKFPFKMSFLTWRLVQNKLPFYDTVGKFVDNIDSNCVCCKNMKT 616

Query: 388  SMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTPYAHT 567
                      E + H+FL +     +W  F   L      +  I++  + W   T  +  
Sbjct: 617  ----------ETINHVFLNSDVASYLWKKFGGTLGIDTRASSTINLLKTWWNVQTHNSIH 666

Query: 568  PHISIMLPCLIMWKIWEERNHCRYG 642
              I   LP LI W+IW+ R  C+YG
Sbjct: 667  NVIIHTLPILIFWEIWKRRCACKYG 691


>ref|XP_004296004.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Fragaria
           vesca subsp. vesca]
          Length = 751

 Score = 77.4 bits (189), Expect(2) = 2e-15
 Identities = 54/211 (25%), Positives = 94/211 (44%), Gaps = 4/211 (1%)
 Frame = +1

Query: 19  LENIDVAYFWNNNSWDMNRLHNIVGLEWANKLSKIPISHT-QVDSMMWKLSVNGNFSISS 195
           L N  VA F  +  W +    + +  + A ++ +IP+ +T + D ++W+ S +G FS S 
Sbjct: 397 LLNSRVADFIWDQQWALPSHFSNLFPDCAKQILEIPLPNTPESDILIWEHSSSGIFSFSD 456

Query: 196 AYTALLSISETQPIFKQLWNPMIPPSASIFLWWLFQNMIPVDKKLHERGISLASKCVCCG 375
            Y  +    E       +W+  IPP  S+  W +F   +P D +L  RGI   S C  C 
Sbjct: 457 GYELVRPYFEKLDWASSVWHSFIPPRYSVLAWRIFHLKLPTDDQLQRRGIPFVSVCQLCS 516

Query: 376 HSSSSMPNFSPNTFEIVPHLFLQNAQVVKVWNHFASWLRFTPPHTEHIHIFFSAWRNLTP 555
            S +          E +PHLF+  +    +W   A +   + P +  ++     W ++T 
Sbjct: 517 FSHT----------EDIPHLFVNCSFAQHIWQWLAYYFGTSLPSSGSLN---DLWSSVTG 563

Query: 556 YAHTPHISIM--LPCLI-MWKIWEERNHCRY 639
            A +P +  +    CL  +  IW+  N  R+
Sbjct: 564 KAFSPQLKNIWFASCLFALMAIWKSHNKLRF 594



 Score = 32.3 bits (72), Expect(2) = 2e-15
 Identities = 18/46 (39%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
 Frame = +2

Query: 731 KGFLD--IISTLGFSFRLPTRHHHLHVLWKPPDNPWLKLNVDAAYK 862
           +G LD  ++S++G    L  +     VLW PP  PWLKLN +   K
Sbjct: 624 RGVLDSKVLSSMGVILVLKCQSALRIVLWHPPLIPWLKLNTNGFSK 669

