BLASTX nr result

ID: Zingiber24_contig00030042 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber24_contig00030042
         (1110 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]   177   5e-42
gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]   174   7e-41
gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]   168   3e-39
gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]   168   4e-39
gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]   167   7e-39
gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]   162   2e-37
gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]   161   4e-37
gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]   161   5e-37
gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]   157   7e-36
gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]   157   7e-36
gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]   157   9e-36
gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]   154   5e-35
gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob...   153   1e-34
gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]   153   1e-34
gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]   151   4e-34
gb|EOY13984.1| RNase H family protein [Theobroma cacao]               134   9e-29
gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]   128   4e-27
gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]   105   2e-20
ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein A...   105   3e-20
ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596...   104   7e-20

>gb|EOY02239.1| Uncharacterized protein TCM_016763 [Theobroma cacao]
          Length = 2127

 Score =  177 bits (450), Expect = 5e-42
 Identities = 119/394 (30%), Positives = 190/394 (48%), Gaps = 31/394 (7%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTAL-----------------MGGL 975
            K + S  WKR+I  R +A   I W  GKG + FW+D  +                  G  
Sbjct: 1660 KLHDSHVWKRMISGREMALQNIRWKIGKGDLFFWHDCWMGDKPLAASFPEFQNDMSHGYH 1719

Query: 974  FF*G*IM*IAECSQFLSSWMLVDGFDVS*KLC*KPAG------DGKFSLKSA*NQVKQKY 813
            F+ G    + +   FL + ++ +   V      +         +G FS +SA   ++Q+ 
Sbjct: 1720 FYNGDTWDVDKLRSFLPTILVEEILQVPFDKSREDVAYWTLTSNGDFSTRSAWEMIRQRQ 1779

Query: 812  HAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHFF 633
             +  +   +  R +  +I+ F+W+ L   + V+  ++ +G+ L SKC CC   ES  H  
Sbjct: 1780 TSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQLASKCVCCNSEESLIHVL 1839

Query: 632  FYGPVAKEVWVFFAKMFCVSKW--RHFEN----WKNGRDW-SSGQVREIIPFLIIWFLWK 474
            +  PVAK+VW FFA++F +  W  RH       W    D+   G  R ++P  I WFLW 
Sbjct: 1840 WENPVAKQVWNFFAQLFQIYIWNPRHVSQIIWAWYVSGDYVRKGHFRVLLPLFICWFLWL 1899

Query: 473  ARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXXX 294
             RNDAKHR     A  +    +++         +Q    KG   +A++LG          
Sbjct: 1900 ERNDAKHRHTGLYADRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIATMLGFSFTHKQHAP 1959

Query: 293  XXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAEL 117
                  +KP  G +KLN +GS+ RN   ++ G ++RDH GK+IF     IG  + L+AEL
Sbjct: 1960 PQIIYWKKPSIGEYKLNVDGSS-RNGLHAATGGVLRDHTGKLIFGFSENIGPCNSLQAEL 2018

Query: 116  FGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
              +L+GL  C ++H+  LW+E D+LVA++++Q S
Sbjct: 2019 RALLRGLLLCKERHIEKLWIEMDALVAIQLIQPS 2052


>gb|EOY06959.1| Uncharacterized protein TCM_021521 [Theobroma cacao]
          Length = 1951

 Score =  174 bits (440), Expect = 7e-41
 Identities = 128/396 (32%), Positives = 195/396 (49%), Gaps = 33/396 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGL----------------- 975
            K + S  WKR+I  R VA   I W  GKG++ FW+D   MG                   
Sbjct: 1483 KLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHD-CWMGDQPLATLCPSFHNDMSHVH 1541

Query: 974  -FF*G*IM*IAECSQFLSSWMLVDG-----FDVS*KLC*KPA--GDGKFSLKSA*NQVKQ 819
             F+ G +  I + S  L +  LVD      FD S +     A   +G FSL SA   ++Q
Sbjct: 1542 KFYNGDVWDIEKLSSCLPT-SLVDEILQIPFDRSQEDVAYWALTSNGDFSLWSAWEAIRQ 1600

Query: 818  KYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEH 639
            +     ++  +  R +  +I+ F+WR L   + V+  ++ +G++L SKC CC   ES  H
Sbjct: 1601 RQTPNALFSLIWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLIH 1660

Query: 638  FFFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDWS-SGQVREIIPFLIIWFL 480
              +  PVA +VW FFAK F   VSK  H       W    D++ +G +R +IP  I WFL
Sbjct: 1661 VLWENPVATQVWFFFAKSFQIYVSKPNHISQIIWAWFFSGDYTRNGHIRILIPLFICWFL 1720

Query: 479  WKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXX 300
            W  RNDAKHR +      +   +++ L        ++    KG   +A++ G        
Sbjct: 1721 WLERNDAKHRHMGMYPNRVIWRIMKLLNQLYAGSLLKQWQWKGDTDIATMWGFKFPPKYC 1780

Query: 299  XXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRA 123
                     KP  G +KLN +GS+K N  ++  G ++RDH GK+ FA    +G    L+A
Sbjct: 1781 TSPQIIYWIKPFIGEYKLNVDGSSKSNLNAAG-GGVLRDHTGKLAFAFSENLGPLPSLQA 1839

Query: 122  ELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            EL  +L+GL  C ++++  LW+E D+LVA++++Q S
Sbjct: 1840 ELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQS 1875


>gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao]
          Length = 1014

 Score =  168 bits (426), Expect = 3e-39
 Identities = 118/395 (29%), Positives = 183/395 (46%), Gaps = 32/395 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR+++ R VA     W  GKG + FW+D   MG                F+ 
Sbjct: 546  KLHDSQVWKRMVRGRDVAIQNTRWRIGKGNLFFWHD-CWMGNKPLVTSFPSFRNDMTFVH 604

Query: 923  SWMLVDGFDVS*KLC*KP------------------------AGDGKFSLKSA*NQVKQK 816
             +   D +DV+      P                          DG+FS  SA   V+Q+
Sbjct: 605  KFYNGDNWDVNTLKLYLPMNLIDEILQIPFDRSQDDIAYWALTSDGEFSTWSAWEAVRQR 664

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                 +   +  + +  TI+ F+WR L   + V+  L+ +G +L SKC CC   ES  H 
Sbjct: 665  QSPNTLCSFIWHKSIPLTISFFLWRVLNNWIPVELRLKEKGFHLASKCVCCNSEESLIHV 724

Query: 635  FFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDW-SSGQVREIIPFLIIWFLW 477
             +  PVAK+VW FFA  F   +S  +H       W    D+   G +R +IP  I WFLW
Sbjct: 725  LWDNPVAKQVWNFFADFFQINISNPQHVSQIIWAWYYSGDFVRKGHIRTLIPLFICWFLW 784

Query: 476  KARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXX 297
              RNDAKHR +   +  +   +++ L        ++    KG   +A++ G         
Sbjct: 785  LERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTLPLKIRE 844

Query: 296  XXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAE 120
                    KP  G +KLN +GS++ N  S++ G ++RDH G ++F     IG  + L+AE
Sbjct: 845  SPQIIHWVKPVTGEYKLNVDGSSRHNQ-SAATGGLLRDHTGTLVFGFSENIGPSNSLQAE 903

Query: 119  LFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            L  +L+GL  C D+++  LW+E D+LV ++++Q S
Sbjct: 904  LRALLRGLLLCKDRNIEKLWIEMDALVVIQMIQQS 938


>gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao]
          Length = 3503

 Score =  168 bits (425), Expect = 4e-39
 Identities = 126/396 (31%), Positives = 195/396 (49%), Gaps = 33/396 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGL----------------- 975
            K + S  WKR+I  R VA   I W  GKG++ FW+D   MG                   
Sbjct: 1240 KLHDSQVWKRMIVGRDVALQNIRWRIGKGELFFWHD-CWMGDQPLATLFPSFHNDMSHVH 1298

Query: 974  -FF*G*IM*IAECSQFLSSWMLVDG-----FDVS*KLC*KPA--GDGKFSLKSA*NQVKQ 819
             F+ G    I + + +L +  LVD      FD S +     A   +G+FS  SA   ++Q
Sbjct: 1299 KFYNGDEWDIVKLNSYLPT-SLVDEILQIPFDRSQEDVAYWALTSNGEFSFWSAWEIIRQ 1357

Query: 818  KYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEH 639
            +     +      R +  +I+ F+WR L   + V+  ++ +G++L SKC CC   ES  H
Sbjct: 1358 RQTPNALLSFNWHRSIPLSISFFLWRVLNNWIPVELRMKDKGIHLASKCVCCRSEESLIH 1417

Query: 638  FFFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDWS-SGQVREIIPFLIIWFL 480
              +  PVAK+VW FFAK F   VSK +H       W    D++ +G +R +IP  I WFL
Sbjct: 1418 VLWENPVAKQVWNFFAKSFQIYVSKPKHISQIIWAWFFSGDYTRNGHIRILIPLFICWFL 1477

Query: 479  WKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXX 300
            W  RNDAKHR +      +   +++ L        ++    KG   +A++ G        
Sbjct: 1478 WLERNDAKHRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYC 1537

Query: 299  XXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRA 123
                     KP  G +KLN +GS+K +  ++  G ++RDH GK+ FA    +G    L+A
Sbjct: 1538 QSPQIISWIKPFIGEYKLNVDGSSKSSQNAAG-GGVLRDHTGKLAFAFSENLGPLPSLQA 1596

Query: 122  ELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            EL  +L+GL  C ++++  LW+E D+LVA++++Q S
Sbjct: 1597 ELHALLRGLLLCKERNITNLWIEMDALVAVQMVQQS 1632



 Score =  159 bits (401), Expect = 2e-36
 Identities = 109/396 (27%), Positives = 180/396 (45%), Gaps = 36/396 (9%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR++   ++ E  I W  G G++ FW+D   MG           A     +S
Sbjct: 3034 KLHDSQTWKRMVTISSITEQNIRWRVGHGKLFFWHD-CWMGEEPLVIRNQEFASSMAQVS 3092

Query: 923  SWMLVDGFDV------------------------S*KLC*KPAGDGKFSLKSA*NQVKQK 816
             + L + +D+                        + +    P  +G FS KSA    +++
Sbjct: 3093 DFFLNNSWDIEKLKSVLQQEVVEEIAKIPINASSNDRAYWTPTPNGDFSTKSAWQLSRER 3152

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                  +  +  + +  T + F+WR L   + V+  ++ +G  L S+C+CC   ES  H 
Sbjct: 3153 KVVNPTYNYIWHKSVPLTTSFFLWRLLHDWVPVELKMKSKGFQLASRCRCCKSEESLMHV 3212

Query: 635  FFYGPVAKEVWVFFAKMFCVSKWRHFEN----------WKNGRDWSS-GQVREIIPFLII 489
             +  PVA +VW +FAK+F +    H  N          W    D+S  G +R ++P  I+
Sbjct: 3213 MWDNPVANQVWSYFAKVFQI----HIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFIL 3268

Query: 488  WFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXX 309
            WFLW  RNDAKHR++      I   +++ +        +Q    +G + +A   G     
Sbjct: 3269 WFLWVERNDAKHRNLGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKA 3328

Query: 308  XXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSD 132
                        KP  G FKLN +GS+K N  +++ G ++RDH G +IF      G    
Sbjct: 3329 VAPSPPKLLFWNKPSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDS 3388

Query: 131  LRAELFGILKGLEFCIDKHMFPLWLESDSLVALKIL 24
            L+AEL  + +GL  CID ++  LW+E D+ VA++++
Sbjct: 3389 LQAELMALHRGLLLCIDHNVTRLWIEMDAKVAVQMI 3424


>gb|EOY25451.1| Uncharacterized protein TCM_016759 [Theobroma cacao]
          Length = 879

 Score =  167 bits (423), Expect = 7e-39
 Identities = 117/395 (29%), Positives = 183/395 (46%), Gaps = 32/395 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR+I+ R VA   I W  GKG + FW+D   MG          +      + 
Sbjct: 411  KLHDSLVWKRMIRGREVAFRNIRWKIGKGDLFFWHD-CWMGNQPLVMSFPSLRNDMSLVH 469

Query: 923  SWMLVDGFDVS*KLC*KP------------------------AGDGKFSLKSA*NQVKQK 816
            ++   D +DV       P                          +G+F+  SA   ++Q+
Sbjct: 470  NFYNGDTWDVDKLKAYLPMNLIDEILLIPFNRTQQDVAYWTLTSNGEFATWSAWETIRQR 529

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
              +  +   +  R +  +I+ F+WR L   + V+  ++ +G+ L SKC CC   ES  H 
Sbjct: 530  KSSNALCSFIWHRSIPLSISFFLWRALNNWIPVELRMKEKGIQLASKCVCCNSEESLMHV 589

Query: 635  FFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDW-SSGQVREIIPFLIIWFLW 477
             +   VAK+VW FF K F   V   +H       W    D+   G +R ++P  I WFLW
Sbjct: 590  LWGNSVAKQVWAFFGKFFQIYVLNPQHVSQILWAWFFSGDYVKKGHIRSLLPIFICWFLW 649

Query: 476  KARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXX 297
              RNDAKHR  +     +   +++ L   +    +     KG   +AS+ G         
Sbjct: 650  LERNDAKHRHTRLNPDRVVWRIMKLLRQLLDGSLLHQWQWKGDTDIASMWGHTFQSKHRA 709

Query: 296  XXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAE 120
                   RKP  G +KLN +GS+ RN   ++ G I+RDH GK+IF     IG+ + L+AE
Sbjct: 710  PPQIIYWRKPFTGEYKLNVDGSS-RNGHLAASGGILRDHTGKLIFGFSENIGLCNSLQAE 768

Query: 119  LFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            L  +L+GL  C ++H+  LW+E D+L  ++++Q S
Sbjct: 769  LRALLRGLLLCKERHIENLWIEMDALAVIQLIQHS 803


>gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao]
          Length = 2214

 Score =  162 bits (410), Expect = 2e-37
 Identities = 116/395 (29%), Positives = 181/395 (45%), Gaps = 32/395 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K ++SS WKR+   R V      W  G+G++ FW+D   MG                F+ 
Sbjct: 1747 KIHSSSIWKRITGGRDVTIQNTRWKIGRGELFFWHD-CWMGDQPLVISFPSFRNDMSFVH 1805

Query: 923  SWMLVDGFDVS*KLC*KPAG------------------------DGKFSLKSA*NQVKQK 816
             +   D +DV       P                          +G+FS KSA   ++Q+
Sbjct: 1806 KFYKGDSWDVDKLRLFLPVNLIYEILLIPFDRTQQDVAYWTLTSNGEFSTKSAWETIRQQ 1865

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                 +   +  R +  +I+ F+WR L   + V+  ++ +G++L SKC CC   ES  H 
Sbjct: 1866 QSHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKGKGIHLASKCVCCNSEESLMHV 1925

Query: 635  FFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDW-SSGQVREIIPFLIIWFLW 477
             +   VAK+VW FFAK F   V   +H  +    W    D+   G +R ++P  I WFLW
Sbjct: 1926 LWGNSVAKQVWAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLW 1985

Query: 476  KARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXX 297
              RNDAK+R        I   +++ L        +Q    KG   +A++           
Sbjct: 1986 LERNDAKYRHSGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRA 2045

Query: 296  XXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAE 120
                   RKP  G +KLN +GS++    ++S G ++RDH GK+IF     IG  + L+AE
Sbjct: 2046 PPQIVYWRKPSTGEYKLNVDGSSRHGQHAAS-GGVLRDHTGKLIFGFSENIGTCNSLQAE 2104

Query: 119  LFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            L  +L+GL  C ++H+  LW+E D+L A+++L  S
Sbjct: 2105 LRALLRGLLLCKERHIEKLWIEMDALAAIQLLPHS 2139


>gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao]
          Length = 1134

 Score =  161 bits (408), Expect = 4e-37
 Identities = 103/295 (34%), Positives = 159/295 (53%), Gaps = 12/295 (4%)
 Frame = -2

Query: 863  DGKFSLKSA*NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNL 684
            +G FS +SA   ++Q+  +  +   +  R +  +I+ F+W+ L   + V+  ++ +G+ L
Sbjct: 767  NGDFSTRSAGEMIRQRQTSNALCSFIWHRSIPLSISFFLWKTLHNWIPVELRMKEKGIQL 826

Query: 683  VSKCQCCAEVESWEHFFFYGPVAKEVWVFFAKMF--CVSKWRHFEN----WKNGRDW-SS 525
             SKC CC   ES  H  +  PVAK+VW FFAK+F   +   RH       W    D+   
Sbjct: 827  ASKCVCCNSEESLIHVLWENPVAKQVWNFFAKLFQIYILNPRHVSQIIWAWYVSGDYVRK 886

Query: 524  GQVREIIPFLIIWFLWKARNDAKHR--DIKPEARLICRNV--IRYLGDGMTACTIQNKN* 357
            G  R ++P  I WFLW  RNDAKHR   + P+ R+I R +   R L DG     +Q    
Sbjct: 887  GHFRVLLPLFICWFLWLERNDAKHRHTGLYPD-RVIWRTMKHCRQLYDG---SLLQQWQW 942

Query: 356  KGSRLVASLLGXXXXXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHE 180
            KG   +A++LG                +KP  G +KLN +GS+ RN   ++ G ++RDH 
Sbjct: 943  KGDTDIAAMLGFSFPPQQHASPQIIYWKKPSIGEYKLNVDGSS-RNGLHAATGGVLRDHT 1001

Query: 179  GKVIFALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            GK+IF     IG  + L+AEL  +L+GL  C ++H+  LW+E D+L A++++Q S
Sbjct: 1002 GKLIFGFSENIGPCNSLQAELRALLRGLLLCKERHIEKLWIEMDALAAIQLIQPS 1056


>gb|EOY06956.1| Uncharacterized protein TCM_021518 [Theobroma cacao]
          Length = 1702

 Score =  161 bits (407), Expect = 5e-37
 Identities = 112/371 (30%), Positives = 178/371 (47%), Gaps = 8/371 (2%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR++KSR VA     W  GKG + FWYD   MG       ++        ++
Sbjct: 1108 KLHDSQVWKRMVKSREVAIQNTRWRIGKGNLFFWYD-CWMGDQP----LIPFDRSQDDIA 1162

Query: 923  SWMLVDGFDVS*KLC*KPAGDGKFSLKSA*NQVKQKYHAGGIWVTV*SRMLSPTIAVFVW 744
             W L                +G+FS  SA   ++ +     +      + +  +I+ F+W
Sbjct: 1163 YWALTS--------------NGEFSTWSAWEALRLRQSPNVLCSLFWHKSIPLSISFFLW 1208

Query: 743  RFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHFFFYGPVAKEVWVFFAKMF--CVSK 570
            R     + VD  L+ +G +L SKC CC   E+  H  +  PVAK+VW FFA  F   VS 
Sbjct: 1209 RVFHNWIPVDLRLKDKGFHLASKCACCNSEETLIHVLWDNPVAKQVWNFFANFFQIYVSN 1268

Query: 569  WRHFEN----WKNGRDW-SSGQVREIIPFLIIWFLWKARNDAKHRDIKPEARLICRNVIR 405
             ++       W    D+   G +R +IP  I WFLW  RNDAK R +   +  +   +++
Sbjct: 1269 PQNVSQILWAWYFSGDYVRKGHIRTLIPLFICWFLWLERNDAKQRHLGMYSDRVVWKIMK 1328

Query: 404  YLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXXXXXXXX*RKPKYG-FKLNTNGSTK 228
             L        ++N   KG   +A++ G                 K   G  KLN +GS++
Sbjct: 1329 LLRQLQDGYVLKNWQWKGDMDIAAMWGFNFSPKIQATPQIFHWVKLVSGEHKLNVDGSSR 1388

Query: 227  RNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESD 48
            +N  S++ G ++RDH G ++F     IG  + L+AEL  +L+GL  C ++++  LW+E D
Sbjct: 1389 QNQ-SAAIGGLLRDHTGTLVFGFSENIGPSNSLQAELRALLRGLLLCKERNIEKLWIEMD 1447

Query: 47   SLVALKILQSS 15
            +LVA++++Q S
Sbjct: 1448 ALVAIQMIQQS 1458


>gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao]
          Length = 910

 Score =  157 bits (397), Expect = 7e-36
 Identities = 107/406 (26%), Positives = 182/406 (44%), Gaps = 43/406 (10%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR++ S A  E  + W  G+G + FW+D  +       G    I+   +F S
Sbjct: 442  KLHDSQTWKRMLTSSATTEQHMRWRVGQGNLFFWHDCWM-------GDAPLISSNQEFTS 494

Query: 923  SWMLVDGFDVS*------------------------------KLC*KPAGDGKFSLKSA* 834
            S + V  F ++                               +    P  +G FS KSA 
Sbjct: 495  SMVQVCDFFMNNSWNVEKLKTVLQQEVVDEIAKIPIDTMSKDEAYWTPTPNGDFSTKSAW 554

Query: 833  NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEV 654
              ++++     ++  +  + +  T + F+WR L   + V+  ++ +G+ L S+C+CC   
Sbjct: 555  QLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE 614

Query: 653  ESWEHFFFYGPVAKEVWVFFAKMF------------CVSKWRHFENWKNGRDWSSGQVRE 510
            ES  H  +  PVA +VW +FAK+F             +  W H     +G     G +R 
Sbjct: 615  ESIMHVMWDNPVAMQVWNYFAKLFQICIINPCTINQIIGAWFH-----SGDYCKPGHIRT 669

Query: 509  IIPFLIIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASL 330
            ++P  I+WFLW  RNDAKHR++      +   V++ +        +     KG + +A  
Sbjct: 670  LVPLFILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQE 729

Query: 329  LGXXXXXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHG 153
             G                 KP  G FKLN +GS K +  ++  G I+RDH G ++F    
Sbjct: 730  WGIILQAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAG-GGILRDHAGVMVFGFSE 788

Query: 152  LIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
             +G+ + L+AEL  + +GL  C D ++  LW+E D++  +++LQ +
Sbjct: 789  NLGIQNSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGN 834


>gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao]
          Length = 2215

 Score =  157 bits (397), Expect = 7e-36
 Identities = 106/393 (26%), Positives = 177/393 (45%), Gaps = 32/393 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR++   ++ E  I W  G G++ FW+D   MG           A     +S
Sbjct: 1746 KLHDSQTWKRMVTISSITEQNIRWRIGHGELFFWHD-CWMGEEPLVNRNQAFASSMAQVS 1804

Query: 923  SWMLVDGFDV------------------------S*KLC*KPAGDGKFSLKSA*NQVKQK 816
             + L + ++V                        + K       +G FS KSA   ++ +
Sbjct: 1805 DFFLNNSWNVEKLKTVLQQEVVEEIVKIPIDTSSNDKAYWTTTPNGDFSTKSAWQLIRNR 1864

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                 ++  +  + +  T + F+WR L   + V+  ++ +G  L S+C+CC   ES  H 
Sbjct: 1865 KVENPVFNFIWHKSVPLTTSFFLWRLLHDWIPVELKMKTKGFQLASRCRCCKSEESLMHV 1924

Query: 635  FFYGPVAKEVWVFFAKMFCVSKWRHFE------NWKNGRDWSS-GQVREIIPFLIIWFLW 477
             +  PVA +VW +FAK+F +              W    D+S  G +R ++P   +WFLW
Sbjct: 1925 MWKNPVANQVWSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLW 1984

Query: 476  KARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXX 297
              RNDAKHR++      +   +++ L        +Q    +G + +A   G         
Sbjct: 1985 VERNDAKHRNLGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPS 2044

Query: 296  XXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAE 120
                    KP  G  KLN +GS K NP S++ G ++RDH G +IF      G    L+AE
Sbjct: 2045 PPKLLFWLKPSIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAE 2104

Query: 119  LFGILKGLEFCIDKHMFPLWLESDSLVALKILQ 21
            L  + +GL  CI+ ++  LW+E D+ VA+++++
Sbjct: 2105 LMALHRGLLLCIEHNISRLWIEMDAKVAVQMIK 2137


>gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao]
          Length = 2251

 Score =  157 bits (396), Expect = 9e-36
 Identities = 106/401 (26%), Positives = 181/401 (45%), Gaps = 38/401 (9%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S  WKR++ S  + E  + W  G+G V FW+D  +       G    I+   +F S
Sbjct: 1783 KLHDSQTWKRMLTSSTITEQHMRWRVGQGNVFFWHDCWM-------GEAPLISSNQEFTS 1835

Query: 923  SWMLVDGFDVS*------------------------------KLC*KPAGDGKFSLKSA* 834
            S + V  F  +                               +    P  +G FS KSA 
Sbjct: 1836 SMVQVCDFFTNNSWNIEKLKTVLQQEVVDEIAKIPIDTMNKDEAYWTPTPNGDFSTKSAW 1895

Query: 833  NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEV 654
              ++++     ++  +  + +  T + F+WR L   + V+  ++ +G+ L S+C+CC   
Sbjct: 1896 QLIRKRKVVNPVFNFIWHKTVPLTTSFFLWRLLHDWIPVELKMKSKGLQLASRCRCCKSE 1955

Query: 653  ESWEHFFFYGPVAKEVWVFFAKMF-------CVSKWRHFENWKNGRDWSSGQVREIIPFL 495
            ES  H  +  PVA +VW +FAK+F       C         + +G     G +R ++P  
Sbjct: 1956 ESIMHVMWDNPVAMQVWNYFAKLFQILIINPCTINQIIGAWFYSGDYCKPGHIRTLVPLF 2015

Query: 494  IIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXX 315
            I+WFLW  RNDAKHR++      +   V++ +        +     KG + +A   G   
Sbjct: 2016 ILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIF 2075

Query: 314  XXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMG 138
                          KP  G FKLN +GS K++  ++  G I+RDH G+++F     +G  
Sbjct: 2076 QAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAG-GGILRDHAGEMVFGFSENLGTQ 2134

Query: 137  SDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            + L+AEL  + +GL  C D ++  LW+E D++  +++LQ +
Sbjct: 2135 NSLQAELLALYRGLILCRDYNIRRLWIEMDAISVIRLLQGN 2175


>gb|EOY02236.1| Uncharacterized protein TCM_011923 [Theobroma cacao]
          Length = 1954

 Score =  154 bits (390), Expect = 5e-35
 Identities = 111/395 (28%), Positives = 188/395 (47%), Gaps = 32/395 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGL----------------- 975
            K + S  WKR+++ R VA     W  GKG + FW+D   MG                   
Sbjct: 1486 KLHDSQVWKRMVRGREVAIQNTRWRIGKGSLFFWHD-CWMGDQPLVTSFPHFRNDMSTVH 1544

Query: 974  -FF*G*IM*IAECSQFLSSWMLVDGFDVS*KLC*KPAG------DGKFSLKSA*NQVKQK 816
             FF G    + + + +L   ++ +   +                +G+FS +SA   ++ +
Sbjct: 1545 NFFNGHNWDVDKLNLYLPMNLVDEILQIPIDRSQDDVAYWSLTSNGEFSTRSAWEAIRLR 1604

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                 +   +  + +  +I+ F+WR     + VD  L+ +G +L SKC CC   ES  H 
Sbjct: 1605 KSPNVLCSLLWHKSIPLSISFFLWRVFHNWIPVDIRLKEKGFHLASKCICCNSEESLIHV 1664

Query: 635  FFYGPVAKEVWVFFAKMF--CVSKWRHFE----NWKNGRDW-SSGQVREIIPFLIIWFLW 477
             +  P+AK+VW FFA  F   +SK ++       W    D+   G +R +IP  I WFLW
Sbjct: 1665 LWDNPIAKQVWNFFANSFQIYISKPQNVSQILWTWYLSGDYVRKGHIRILIPLFICWFLW 1724

Query: 476  KARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXX 297
              RNDAKHR +   +  +   +++ L        +++   KG +  A++ G         
Sbjct: 1725 LERNDAKHRHLGMYSDRVVWKIMKLLRQLQDGYLLKSWQWKGDKDFATMWGLFSPPKTRA 1784

Query: 296  XXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAE 120
                    KP  G  KLN +GS+++N  +++ G ++RDH G ++F     IG  + L+AE
Sbjct: 1785 APQILHWVKPVPGEHKLNVDGSSRQNQ-TAAIGGVLRDHTGTLVFDFSENIGPSNSLQAE 1843

Query: 119  LFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            L  +L+GL  C ++++  LW+E D+LVA++++Q S
Sbjct: 1844 LRALLRGLLLCKERNIEKLWVEMDALVAIQMIQQS 1878


>gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao]
          Length = 1368

 Score =  153 bits (387), Expect = 1e-34
 Identities = 114/398 (28%), Positives = 186/398 (46%), Gaps = 36/398 (9%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + S+ WK L+  RA A   I W  GKG + FW+D A MG           ++    ++
Sbjct: 867  KPHDSATWKPLLAGRATASQQIRWRIGKGDIFFWHD-AWMGDEPLVNSFPSFSQSMMKVN 925

Query: 923  SWMLVDGFDVS*KLC*KP------------------------AGDGKFSLKSA*NQVKQK 816
             +   D +DV       P                          +G FS+KSA   ++Q+
Sbjct: 926  YFFNDDAWDVDKLKTFIPNAIVEEILKIPISREKEDIAYWALTANGDFSIKSAWELLRQR 985

Query: 815  YHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEHF 636
                 +   +  + +  T++ F+WR L   L V+  ++ +G+ L SKC CC   ES  H 
Sbjct: 986  KQVNLVGQLIWHKSIPLTVSFFLWRTLHNWLPVEVRMKAKGIQLASKCLCCKSEESLLHV 1045

Query: 635  FFYGPVAKEVWVFFAKMFCV------SKWRHFENWKNGRDWSS-GQVREIIPFLIIWFLW 477
             +  PVA++VW +F+K F +      +  +   +W    D++  G +R +I   I WF+W
Sbjct: 1046 LWESPVAQQVWNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVW 1105

Query: 476  KARNDAKHRDI--KPEARLICR--NVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXX 309
              RNDAKHRD+   P+ R+I R   ++R L  G   C  Q    KG   +A   G     
Sbjct: 1106 VERNDAKHRDLGMYPD-RIIWRIMKILRKLFQGGLLCKWQ---WKGDLDIAIHWGFNFAQ 1161

Query: 308  XXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSD 132
                        KP  G  KLN +GS+K    +++ G ++RDH G +IF      G  + 
Sbjct: 1162 ERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNS 1221

Query: 131  LRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQS 18
            L+AEL  + +GL  C++ ++  +W+E D+ V ++++Q+
Sbjct: 1222 LQAELLALHRGLCLCMEYNVSRVWIEVDAQVVIQMIQN 1259


>gb|EOY02234.1| Uncharacterized protein TCM_011921 [Theobroma cacao]
          Length = 926

 Score =  153 bits (387), Expect = 1e-34
 Identities = 111/401 (27%), Positives = 181/401 (45%), Gaps = 38/401 (9%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYDTALMGGLFF*G*IM*IAECSQFLS 924
            K + SS WKR+   R V      W  G+G++ FW+D  +       G    +     F +
Sbjct: 459  KLHNSSIWKRITGGRDVTIQNTRWKIGRGELFFWHDCWM-------GDQPLVISFPSFRN 511

Query: 923  SWMLV------DGFDVS*KLC*KPAG------------------------DGKFSLKSA* 834
               LV      D +DV       P                          +G+FS +SA 
Sbjct: 512  DMSLVHKFYKGDSWDVDKLRLFLPVNLVDEILLIPFDRTQQDVAYWILTSNGEFSTRSAW 571

Query: 833  NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEV 654
              ++++     +   +  R +  +I+ F+WR L   + V+  ++ +G++L SKC CC   
Sbjct: 572  ETIRKRQPHNTLGSLIWHRSIPLSISFFIWRALNNWIPVELRMKEKGIHLASKCVCCNSE 631

Query: 653  ESWEHFFFYGPVAKEVWVFFAKMFCVSKW--RHFEN----WKNGRDW-SSGQVREIIPFL 495
            ES  H  +   VAK+VW FFA  F +  +  +H  +    W    D+   G +R ++P  
Sbjct: 632  ESLMHVLWGNSVAKQVWAFFANFFQIYIFNPQHVSHILWAWFYSGDYVKRGHIRTLLPIF 691

Query: 494  IIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXX 315
            I WFLW  RNDAKHR        +   +++ L        +Q    KG   +A++     
Sbjct: 692  ICWFLWLERNDAKHRYSGLYTDRVVWRIMKLLRQLHDGSLLQQWQWKGDTDIAAMWKYNL 751

Query: 314  XXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMG 138
                         RKP  G +KLN +GS++    ++S G ++RDH GK+IF     IG  
Sbjct: 752  QLKLRAPPQIVYWRKPSTGEYKLNVDGSSRHGQHAAS-GGVLRDHTGKLIFGFSENIGNC 810

Query: 137  SDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            + L+AEL  +L+GL  C ++H+  LW+E D+L  ++++  S
Sbjct: 811  NSLQAELRALLRGLLLCKERHIEQLWIEMDALAVIQLIPHS 851


>gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao]
          Length = 2249

 Score =  151 bits (382), Expect = 4e-34
 Identities = 104/396 (26%), Positives = 181/396 (45%), Gaps = 33/396 (8%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYD-----TALMGGLFF*G*IM*IAEC 939
            K + S  WKR++ + A+ E  + W  G+G++ FW+D     T L          M +  C
Sbjct: 1781 KLHDSQTWKRMVANSAITEQNMRWRVGQGKLFFWHDCWMGETPLTSSNQELSLSM-VQVC 1839

Query: 938  SQFLS-SWML-------------------VDGFDVS*KLC*KPAGDGKFSLKSA*NQVKQ 819
              F++ SW +                   +D      +    P  +G+FS KSA   +++
Sbjct: 1840 DFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKD-EAYWAPTPNGEFSTKSAWQLIRK 1898

Query: 818  KYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEH 639
            +     ++  +  + +  TI+ F+WR L   + V+  ++ +G  L S+C+CC   ES  H
Sbjct: 1899 REVVNPVFNFIWHKTVPLTISFFLWRLLHDWIPVELKMKSKGFQLASRCRCCKSEESIMH 1958

Query: 638  FFFYGPVAKEVWVFFAKMF-------CVSKWRHFENWKNGRDWSSGQVREIIPFLIIWFL 480
              +  PVA +VW +F+K F       C         + +G     G +R ++P   +WFL
Sbjct: 1959 VMWDNPVATQVWNYFSKFFQILVINPCTINQILGAWFYSGDYCKPGHIRTLVPIFTLWFL 2018

Query: 479  WKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXX 300
            W  RNDAKHR++      I   +++ +        +     KG + +A   G        
Sbjct: 2019 WVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESL 2078

Query: 299  XXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRA 123
                     KP  G FKLN +GS K +  ++  G ++RDH G ++F     +G+ + L+A
Sbjct: 2079 PPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMVFGFSENLGIQNSLQA 2137

Query: 122  ELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            EL  + +GL  C D ++  LW+E D+   +++LQ +
Sbjct: 2138 ELLALYRGLILCRDYNIRRLWIEMDAASVIRLLQGN 2173


>gb|EOY13984.1| RNase H family protein [Theobroma cacao]
          Length = 429

 Score =  134 bits (336), Expect = 9e-29
 Identities = 90/291 (30%), Positives = 143/291 (49%), Gaps = 8/291 (2%)
 Frame = -2

Query: 872 PAGDGKFSLKSA*NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRG 693
           P  DGKF+ KSA   V+Q++    ++ ++  R +  +I+ F+WR  +  + VD  L+ +G
Sbjct: 91  PTSDGKFTTKSAWEIVRQRHSINFVFYSIWHRSIPLSISFFLWRLFQDWIPVDLRLKSKG 150

Query: 692 MNLVSKCQCCAEVESWEHFFFYGPVAKEVWVFFAKMFCV------SKWRHFENWKNGRDW 531
             LV KCQ C   ES  H  +  P+A +VW +FAK F +      S ++    W    D+
Sbjct: 151 FQLVFKCQHCNSKESLFHVMWECPLASQVWNYFAKFFQIYIIHRKSIYQIIWAWLFSSDY 210

Query: 530 S-SGQVREIIPFLIIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*K 354
           +  G +  +IP  I WFLW  RNDAKHR++                 GM        N K
Sbjct: 211 TKKGHIHILIPLFIFWFLWVERNDAKHRNL-----------------GM------YPNRK 247

Query: 353 GSRLVASLLGXXXXXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEG 177
            S     +                  +KP  G FKLN +G +K +  S++ G ++RDH G
Sbjct: 248 PSLPKPKVFS---------------WQKPLTGEFKLNVDGGSKYDCQSAAGGRLLRDHTG 292

Query: 176 KVIFALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKIL 24
            +IF+     G  + L+AEL  + +GL  CI+ ++  LW+E D+ V ++++
Sbjct: 293 TLIFSFVENFGPYNSLQAELMALYRGLLLCIEHNVRRLWIEMDAKVVIQMI 343


>gb|EOY25454.1| Uncharacterized protein TCM_026877 [Theobroma cacao]
          Length = 2367

 Score =  128 bits (322), Expect = 4e-27
 Identities = 98/389 (25%), Positives = 165/389 (42%), Gaps = 26/389 (6%)
 Frame = -2

Query: 1103 KSNASSWWKRLIKSRAVAEILIGWSFGKGQVDFWYD-----TALMGGLFF*G*IM*IAEC 939
            K + S  WKR++ S A+ E  + W  G+G + FW+D     T L+         M +  C
Sbjct: 1953 KLHDSQTWKRMVASSAITEQNMRWRVGQGNLFFWHDCWMGETPLISSNHEFSLSM-VQVC 2011

Query: 938  SQFLS-SWML-------------------VDGFDVS*KLC*KPAGDGKFSLKSA*NQVKQ 819
              F++ SW +                   +D      +    P  +G+FS KSA   +++
Sbjct: 2012 DFFMNNSWDIEKLKTVLQQEVVDEIAKIPIDAMSKD-EAYWAPTPNGEFSTKSAWQLIRK 2070

Query: 818  KYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLVSKCQCCAEVESWEH 639
            +     ++  +  + +  T + F+WR L   + V+  ++ +G  L S+C+CC   ES  H
Sbjct: 2071 REVVNPVFNFIWHKAIPLTTSFFLWRLLHDWIPVELRMKSKGFQLASRCRCCRSEESIIH 2130

Query: 638  FFFYGPVAKEVWVFFAKMFCVSKWRHFENWKNGRDWSSGQVREIIPFLIIWFLWKARNDA 459
              +  PVA +                            G +R +IP   +WFLW  RNDA
Sbjct: 2131 VMWDNPVAVQ---------------------------PGHIRTLIPIFTLWFLWVERNDA 2163

Query: 458  KHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSRLVASLLGXXXXXXXXXXXXXXX 279
            KHR++  +                    +     KG + +A   G               
Sbjct: 2164 KHRNLGQQ--------------------LLEWQWKGDKQIAQEWGITFQAKSLPPPKVFC 2203

Query: 278  *RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKVIFALHGLIGMGSDLRAELFGILK 102
              KP  G FKLN +GS K +  ++  G ++RDH G +IF     +G+ + L+AEL  + +
Sbjct: 2204 WHKPSNGEFKLNVDGSAKLSQNAAG-GGVLRDHAGVMIFGFSENLGIQNSLKAELLALYR 2262

Query: 101  GLEFCIDKHMFPLWLESDSLVALKILQSS 15
            GL  C D ++  LW+E D+   +++LQ +
Sbjct: 2263 GLILCRDYNIRRLWIEMDATSVIRLLQGN 2291


>gb|EOY25447.1| Uncharacterized protein TCM_016753 [Theobroma cacao]
          Length = 1275

 Score =  105 bits (263), Expect = 2e-20
 Identities = 68/232 (29%), Positives = 107/232 (46%), Gaps = 1/232 (0%)
 Frame = -2

Query: 707  LQRRGMNLVSKCQCCAEVESWEHFFFYGPVAKEVWVFFAKMFCVSKWRHFENWKNGRDWS 528
            ++ +G++LVSKC CC   ES  H  +   VAK+                           
Sbjct: 853  IEEKGIHLVSKCVCCNSEESLMHVLWGNSVAKQ--------------------------- 885

Query: 527  SGQVREIIPFLIIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGS 348
             G++R ++P  I WFLW  RNDAKHR        +   ++  L        +Q    KG 
Sbjct: 886  -GRIRTLLPIFICWFLWLERNDAKHRHSGLYTDRVVWRIMTLLRQLQDDSLLQQWQWKGD 944

Query: 347  RLVASLLGXXXXXXXXXXXXXXX*RKPKYG-FKLNTNGSTKRNPGSSSYGAIVRDHEGKV 171
              +A++                  RKP  G +KLN +GS+ RN   ++ G ++RDH  K+
Sbjct: 945  TDIAAMWRYNFQLKQRAPPQIVYWRKPFTGEYKLNVDGSS-RNGQHAASGGVLRDHTSKL 1003

Query: 170  IFALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            IF     IG  + L+AEL  + +GL  C ++H+  LW+E D+L  ++++  S
Sbjct: 1004 IFCFSENIGTYNSLQAELRALHRGLLLCKERHIEKLWIEMDALAVIQLIPHS 1055


>ref|XP_006356603.1| PREDICTED: putative ribonuclease H protein At1g65750-like [Solanum
            tuberosum]
          Length = 885

 Score =  105 bits (262), Expect = 3e-20
 Identities = 79/292 (27%), Positives = 136/292 (46%), Gaps = 10/292 (3%)
 Frame = -2

Query: 860  GKFSLKSA*NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLV 681
            G F++KSA    + K         + ++ L   I  F+WR  K+R++ D+ L++  +N+V
Sbjct: 501  GIFTVKSAWQITRNKQEVRRDCEVIWNKELPFKINFFMWRVWKRRIATDDNLKKMRINIV 560

Query: 680  SKCQCC--AEVESWEHFFFYGPVAKEVWVFFAKMFCVS-KWRHFEN-----WKNGRDWSS 525
            S+C CC   + E+  H F   P+  ++W +FA    ++    H +      WK+      
Sbjct: 561  SRCWCCDRKKEETMTHLFPTAPITYKLWRYFAHFAGINIDGMHLQQLIISWWKHEATPKL 620

Query: 524  GQVREIIPFLIIWFLWKARNDAKHRDIKPEARLI--CRNVIRYLGDGMTACTIQNKN*KG 351
              + + IP +I+W LWK RN  KH       R++     V+R +        I+N     
Sbjct: 621  QGIYKAIPAIIMWTLWKRRNALKHDSSISWERMVEMVIEVVRKMVKSQFP-WIKNMRWTW 679

Query: 350  SRLVASLLGXXXXXXXXXXXXXXX*RKPKYGFKLNTNGSTKRNPGSSSYGAIVRDHEGKV 171
              ++  L                      +  K NT+G+ + NPG SS+G  +RD +G +
Sbjct: 680  QAIIQRL---NQYKRKIHVLRVTWKPPDDHYVKSNTDGACRGNPGLSSFGFCIRDDKGDL 736

Query: 170  IFALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQSS 15
            I+A    IG+ +++ AE   IL  L  C ++ M  + +E+DSL   KI+Q +
Sbjct: 737  IYAKAKGIGIATNMEAETVAILTALRECSNRKMQKVIIETDSLSLKKIIQQT 788


>ref|XP_006358721.1| PREDICTED: uncharacterized protein LOC102596481 [Solanum tuberosum]
          Length = 1135

 Score =  104 bits (259), Expect = 7e-20
 Identities = 71/289 (24%), Positives = 134/289 (46%), Gaps = 8/289 (2%)
 Frame = -2

Query: 860  GKFSLKSA*NQVKQKYHAGGIWVTV*SRMLSPTIAVFVWRFLKKRLSVDELLQRRGMNLV 681
            G F++KSA   ++ K      +  + ++ +   +  F+WR  K+R++ D+ L+R  + +V
Sbjct: 878  GIFTVKSAWELMRHKQERRTDYQLIWTKDVPFKMNFFLWRLWKRRIATDDNLKRMKIQIV 937

Query: 680  SKCQCCAEV--ESWEHFFFYGPVAKEVWVFFAKMFCVS-KWRHFEN-----WKNGRDWSS 525
            S+C CC+E   E+  H F   P+A  +W  F+    +  +  H +      WK+  +   
Sbjct: 938  SRCWCCSETEEETMTHIFLTAPIANRLWRQFSNFAGIQIESMHLQQLIINWWKHSDNAKL 997

Query: 524  GQVREIIPFLIIWFLWKARNDAKHRDIKPEARLICRNVIRYLGDGMTACTIQNKN*KGSR 345
              V   +P +I+W LWK RN+ KHR     + ++ +              +Q +      
Sbjct: 998  KVVMRAMPTIIMWTLWKRRNNFKHRGTTTYSEVVMQ--------------VQEE------ 1037

Query: 344  LVASLLGXXXXXXXXXXXXXXX*RKPKYGFKLNTNGSTKRNPGSSSYGAIVRDHEGKVIF 165
                                   +  +   K NT+G+ + N G+SS   +VRD EG +I+
Sbjct: 1038 -----------------------KPGRNKVKCNTDGAARGNSGASSTSFVVRDEEGDLIY 1074

Query: 164  ALHGLIGMGSDLRAELFGILKGLEFCIDKHMFPLWLESDSLVALKILQS 18
            A    IG+ +++ AE   +L+ + +C +K +    +E+DSLV  K++ +
Sbjct: 1075 ARAKGIGIATNMEAEALALLEAVWYCQEKDLKEPIIETDSLVLKKMVDN 1123


Top