BLASTX nr result

ID: Atropa21_contig00029508 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00029508
         (1010 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   200   1e-84
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   180   6e-70
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           146   2e-68
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         164   4e-68
gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]    141   6e-65
gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus pe...   152   2e-64
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...   182   2e-64
gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]    175   2e-63
gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]   178   3e-63
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...   178   4e-63
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...   177   8e-63
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   179   1e-62
gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]   173   7e-61
emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]   144   9e-61
emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]   144   1e-60
gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                 150   2e-60
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                   141   2e-59
emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]   137   2e-58
gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gy...   140   3e-58
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]   163   3e-58

>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  200 bits (508), Expect(3) = 1e-84
 Identities = 107/191 (56%), Positives = 134/191 (70%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GPDLV Q +EKVK+IQE   T  S+QKSY+D +R  LEF + +WV+L
Sbjct: 1405 PIGWFEVGEARLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYTDVRRRALEFEVDDWVYL 1464

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  Y+IVQR+  V+Y LELP EL AVHP+F++ +L+KCI
Sbjct: 1465 KVSPMKGVMRFGKKGKLSPRYIGPYRIVQRVGSVAYELELPQELAAVHPVFHISMLKKCI 1524

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS I P E V+I ++LSY E PV ILDRQV +L TK+VASVKVLWR   V+E T EA+
Sbjct: 1525 GDPSLILPTESVKIKDNLSYEEVPVQILDRQVRRLRTKDVASVKVLWRNQFVEEATWEAE 1584

Query: 975  VDIKFKYPHLF 1007
             D+K +YPHLF
Sbjct: 1585 EDMKKRYPHLF 1595



 Score =  110 bits (274), Expect(3) = 1e-84
 Identities = 59/105 (56%), Positives = 76/105 (72%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   +SIIS+R   FT  F +SF K LG++V+LST F+  TD Q  RTIQ L+D+L  
Sbjct: 1302 RLHGVPISIISDRGAQFTAQFWKSFQKGLGSKVSLSTAFHPQTDGQAERTIQTLEDMLRA 1361

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
            CV++FK + +D+L LIEFA NN+YH+SI+MAP EA Y R CRS I
Sbjct: 1362 CVIDFKSNWDDHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPI 1406



 Score = 52.8 bits (125), Expect(3) = 1e-84
 Identities = 26/42 (61%), Positives = 34/42 (80%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            S R+ +SI VI DR+TKS +FL V+TT+  EDYAKLY++EIV
Sbjct: 1260 SRRQHDSIWVIVDRMTKSAHFLPVKTTHSAEDYAKLYIQEIV 1301


>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score =  180 bits (456), Expect(2) = 6e-70
 Identities = 94/189 (49%), Positives = 132/189 (69%), Gaps = 1/189 (0%)
 Frame = +3

Query: 444  LFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVS 623
            LFEVGE  LLGPDLV + +E+V++I+E      S++KSY+D +R  LEF +G+WV+LKVS
Sbjct: 1753 LFEVGEVALLGPDLVMEALEEVRMIRERLKMAQSRRKSYADVRRRALEFRVGDWVYLKVS 1812

Query: 624  PXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYP 803
            P            LS RYV  YK+++RI +V+Y LELP E++ VHP+F++ +LRKC+  P
Sbjct: 1813 PMKGVVRFGKKGKLSPRYVGPYKVMRRIGKVAYELELPSEMDLVHPVFHVSMLRKCVGDP 1872

Query: 804  SCITPIEDVQITED-LSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVD 980
            + I  ++ V + ED L+Y E PV ILDRQV +L  KEVASVKVLWR   V+  T EA+ D
Sbjct: 1873 NAIVSLDVVGVVEDNLTYEEVPVQILDRQVKRLRNKEVASVKVLWRNQQVESATWEAEAD 1932

Query: 981  IKFKYPHLF 1007
            ++ +YP++F
Sbjct: 1933 MQRRYPYIF 1941



 Score =  112 bits (280), Expect(2) = 6e-70
 Identities = 61/105 (58%), Positives = 76/105 (72%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   +SIIS+R   FT++F +SF + LGTRV L+T F+  TD Q  RTIQ L+D+L  
Sbjct: 1647 RLHGIPLSIISDRGTQFTSHFWKSFQRGLGTRVKLTTAFHPQTDGQAERTIQTLEDMLRA 1706

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
            CVLE KGS ED+L LIEF+ NN+YH+SI MAP EA Y R CRS +
Sbjct: 1707 CVLELKGSWEDHLPLIEFSYNNSYHSSIGMAPFEALYGRRCRSSV 1751


>gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]
          Length = 1554

 Score =  146 bits (369), Expect(3) = 2e-68
 Identities = 78/150 (52%), Positives = 100/150 (66%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GPDLV Q +EKVK+IQE   T  S+QKSY D +   LEF + +WV+L
Sbjct: 1405 PIGWFEVGEAQLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYIDVRTRALEFEVDDWVYL 1464

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS +Y+  Y+I +RI  V+Y LELP ELEAVHP+F++ +L+KCI
Sbjct: 1465 KVSPMKGVMRFGKKGKLSPQYIGPYRIAKRIGNVAYELELPQELEAVHPVFHISMLKKCI 1524

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDR 884
              PS I P E ++I ++LSY E PV ILDR
Sbjct: 1525 GDPSLILPTESIRIKDNLSYEEIPVQILDR 1554



 Score =  107 bits (268), Expect(3) = 2e-68
 Identities = 59/105 (56%), Positives = 73/105 (69%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   +SIIS+R   FT  F +SF K LG++VNLST F   TD Q  RTI  L+D+L  
Sbjct: 1302 RLHGIPISIISDRGAQFTAQFWKSFKKGLGSKVNLSTAFYPQTDGQAERTIHTLEDMLRA 1361

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
            CV++FKG+ +D+L LIEFA NN+YH+SI MAP EA Y R C S I
Sbjct: 1362 CVIDFKGNWDDHLPLIEFAYNNSYHSSIHMAPYEALYGRRCISPI 1406



 Score = 54.3 bits (129), Expect(3) = 2e-68
 Identities = 27/42 (64%), Positives = 35/42 (83%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            S+R+ +SI VI D++TKS +FL VRTT + EDYAKLYV+EIV
Sbjct: 1260 SHRQHDSIWVIVDQMTKSAHFLPVRTTNIAEDYAKLYVQEIV 1301


>gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]
          Length = 4543

 Score =  164 bits (415), Expect(3) = 4e-68
 Identities = 88/164 (53%), Positives = 113/164 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GPDLV Q +EKVK+I+E   T  S+QKSY+D +R  LEF + +WV+L
Sbjct: 1154 PIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKSYTDVRRRALEFEVDDWVYL 1213

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  Y+I +RI  V+Y LELP EL AVHP+F++ +L+KCI
Sbjct: 1214 KVSPMKGVMRFGKKGKLSPRYIGPYRIAKRIGNVAYELELPQELAAVHPVFHISMLKKCI 1273

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVK 926
              PS I P E ++I ++LSY E PV ILDRQV +L TK+VASVK
Sbjct: 1274 GDPSLILPTESIKINDNLSYEEVPVQILDRQVRRLRTKDVASVK 1317



 Score =  164 bits (415), Expect(3) = 4e-68
 Identities = 88/164 (53%), Positives = 113/164 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GPDLV Q +EKVK+I+E   T  S+QKSY+D +R  LEF + +WV+L
Sbjct: 2664 PIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKSYTDVRRRALEFEVDDWVYL 2723

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  Y+I +RI  V+Y LELP EL AVHP+F++ +L+KCI
Sbjct: 2724 KVSPMKGVMRFGKKGKLSPRYIGPYRIAKRIGNVAYELELPQELAAVHPVFHISMLKKCI 2783

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVK 926
              PS I P E ++I ++LSY E PV ILDRQV +L TK+VASVK
Sbjct: 2784 GDPSLILPTESIKINDNLSYEEVPVQILDRQVRRLRTKDVASVK 2827



 Score =  164 bits (415), Expect(3) = 4e-68
 Identities = 88/164 (53%), Positives = 113/164 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GPDLV Q +EKVK+I+E   T  S+QKSY+D +R  LEF + +WV+L
Sbjct: 4174 PIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKSYTDVRRRALEFEVDDWVYL 4233

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  Y+I +RI  V+Y LELP EL AVHP+F++ +L+KCI
Sbjct: 4234 KVSPMKGVMRFGKKGKLSPRYIGPYRIAKRIGNVAYELELPQELAAVHPVFHISMLKKCI 4293

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVK 926
              PS I P E ++I ++LSY E PV ILDRQV +L TK+VASVK
Sbjct: 4294 GDPSLILPTESIKINDNLSYEEVPVQILDRQVRRLRTKDVASVK 4337



 Score = 91.3 bits (225), Expect(3) = 4e-68
 Identities = 47/79 (59%), Positives = 60/79 (75%)
 Frame = +1

Query: 205  KSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*PCVLEFKGSLEDYLLLIEFACNNNYHA 384
            K LG++VNLST F+  TD Q   TIQ+L+D+L  CV++FKG+ +D+L LIEFA NN+YH 
Sbjct: 1077 KGLGSKVNLSTAFHPQTDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSYHP 1136

Query: 385  SIKMAPNEAPYERTCRSLI 441
            SI+MAP EA Y R CRS I
Sbjct: 1137 SIQMAPYEALYGRRCRSPI 1155



 Score = 91.3 bits (225), Expect(3) = 4e-68
 Identities = 47/79 (59%), Positives = 60/79 (75%)
 Frame = +1

Query: 205  KSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*PCVLEFKGSLEDYLLLIEFACNNNYHA 384
            K LG++VNLST F+  TD Q   TIQ+L+D+L  CV++FKG+ +D+L LIEFA NN+YH 
Sbjct: 2587 KGLGSKVNLSTAFHPQTDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSYHP 2646

Query: 385  SIKMAPNEAPYERTCRSLI 441
            SI+MAP EA Y R CRS I
Sbjct: 2647 SIQMAPYEALYGRRCRSPI 2665



 Score = 91.3 bits (225), Expect(3) = 4e-68
 Identities = 47/79 (59%), Positives = 60/79 (75%)
 Frame = +1

Query: 205  KSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*PCVLEFKGSLEDYLLLIEFACNNNYHA 384
            K LG++VNLST F+  TD Q   TIQ+L+D+L  CV++FKG+ +D+L LIEFA NN+YH 
Sbjct: 4097 KGLGSKVNLSTAFHPQTDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSYHP 4156

Query: 385  SIKMAPNEAPYERTCRSLI 441
            SI+MAP EA Y R CRS I
Sbjct: 4157 SIQMAPYEALYGRRCRSPI 4175



 Score = 52.0 bits (123), Expect(3) = 4e-68
 Identities = 27/43 (62%), Positives = 33/43 (76%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIVG 130
            S R+ +SI VI DR+TKS +FL V+TT   EDYAKLYV+EI G
Sbjct: 1036 SRRQHDSIWVIVDRMTKSAHFLPVKTTNTTEDYAKLYVQEIKG 1078



 Score = 52.0 bits (123), Expect(3) = 4e-68
 Identities = 27/43 (62%), Positives = 33/43 (76%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIVG 130
            S R+ +SI VI DR+TKS +FL V+TT   EDYAKLYV+EI G
Sbjct: 2546 SRRQHDSIWVIVDRMTKSAHFLPVKTTNTTEDYAKLYVQEIKG 2588



 Score = 52.0 bits (123), Expect(3) = 4e-68
 Identities = 27/43 (62%), Positives = 33/43 (76%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIVG 130
            S R+ +SI VI DR+TKS +FL V+TT   EDYAKLYV+EI G
Sbjct: 4056 SRRQHDSIWVIVDRMTKSAHFLPVKTTNTTEDYAKLYVQEIKG 4098


>gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]
          Length = 1487

 Score =  141 bits (355), Expect(3) = 6e-65
 Identities = 86/191 (45%), Positives = 106/191 (55%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P   FEVGE  L+GP+LV Q +EKVK+IQE   T  S+QKSY+D +R  LEF + NWV+L
Sbjct: 1286 PIGWFEVGEAGLIGPNLVHQAMEKVKVIQERLKTAQSRQKSYTDVRRRALEFEVDNWVYL 1345

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  Y+I +RI  ++Y LELP EL AV+P           
Sbjct: 1346 KVSPMKGVMRVGKKGKLSPRYIGPYRIAKRIGNIAYELELPQELAAVYP----------- 1394

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
                                      ILDRQV +L TKEVASVKVLWR   V+E T E +
Sbjct: 1395 --------------------------ILDRQVRRLRTKEVASVKVLWRNQFVEEATWEDE 1428

Query: 975  VDIKFKYPHLF 1007
             D+K +YPHLF
Sbjct: 1429 EDMKKRYPHLF 1439



 Score =  104 bits (260), Expect(3) = 6e-65
 Identities = 59/105 (56%), Positives = 72/105 (68%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   +SIISNR       F + F K LG  VNLST F+  TD Q  RTIQ L+D+L  
Sbjct: 1187 RLHGVPISIISNRG----AQFWKFFQKGLGLNVNLSTAFHPQTDGQAERTIQTLEDMLRA 1242

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
            CV++FKG+ +D+L LIEFA NN+YH+SI+MAP EA Y R CRS I
Sbjct: 1243 CVIDFKGNWDDHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPI 1287



 Score = 50.8 bits (120), Expect(3) = 6e-65
 Identities = 25/42 (59%), Positives = 33/42 (78%)
 Frame = +2

Query: 2    SYRKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            S R+ +SI VI DR+TKS +FL V+TT   EDYAKLY++E+V
Sbjct: 1145 SRRQHDSIWVIVDRMTKSAHFLPVKTTNSAEDYAKLYIQEVV 1186


>gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica]
          Length = 1493

 Score =  152 bits (384), Expect(3) = 2e-64
 Identities = 83/186 (44%), Positives = 116/186 (62%)
 Frame = +3

Query: 450  EVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVSPX 629
            EVG++ L   D +Q   EKVK+I+E       +QKSY+DN+  +LEF +G+WVFLK+SP 
Sbjct: 1306 EVGDKKLEKVDSIQATTEKVKMIKEKLKIAQDRQKSYADNRSKDLEFAVGDWVFLKLSPW 1365

Query: 630  XXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYPSC 809
                       LS RY+  Y+I +RI  V+YRL LP EL  VH +F++ +LRK +  PS 
Sbjct: 1366 KGVMRFGKRGKLSPRYIGPYEITERIGPVAYRLALPAELSQVHDVFHVSMLRKYMSDPSH 1425

Query: 810  ITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVDIKF 989
            I   + V++ EDLSY E PV ILDR+   L ++ +  VKVLWR   V+E T E +  ++ 
Sbjct: 1426 ILEYQPVEVEEDLSYEEQPVQILDRKEQMLRSRFIPVVKVLWRSQTVEEATWEPEAQMRV 1485

Query: 990  KYPHLF 1007
            KYP+LF
Sbjct: 1486 KYPYLF 1491



 Score =  102 bits (253), Expect(3) = 2e-64
 Identities = 55/105 (52%), Positives = 72/105 (68%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   VSI+S+R   FT+ F +   +++GTR+  ST F+  TD Q  RTIQ L+D+L  
Sbjct: 1198 RLHGAPVSIVSDRDARFTSRFWKCLQEAMGTRLQFSTAFHPQTDGQSERTIQTLEDMLRS 1257

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
            CVL+ K S + +L L+EFA NN+YHASIKMAP EA Y R CR+ I
Sbjct: 1258 CVLQMKDSWDTHLALVEFAYNNSYHASIKMAPYEALYGRQCRTPI 1302



 Score = 40.8 bits (94), Expect(3) = 2e-64
 Identities = 21/37 (56%), Positives = 27/37 (72%)
 Frame = +2

Query: 17   NSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            + I VI DRLTKST+FL ++ TY +   AKL+V EIV
Sbjct: 1161 DGIWVIVDRLTKSTHFLPIKETYSLTKLAKLFVDEIV 1197


>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score =  182 bits (462), Expect(2) = 2e-64
 Identities = 94/191 (49%), Positives = 133/191 (69%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+QKSY+DN+R +LEF +G+ VFL
Sbjct: 487  PIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFL 546

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            K SP            LS RY+  +KI++++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 547  KFSPTKGVMRFGKKGKLSPRYIGPFKILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 606

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
            + PS +   E +Q+ +DLSY E PVAILDRQV KL +K+VASVKVLWR +  +E+T EA+
Sbjct: 607  LDPSHVIRYETIQLQDDLSYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAE 666

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 667  DEMRTKHPHLF 677



 Score = 91.7 bits (226), Expect(2) = 2e-64
 Identities = 51/109 (46%), Positives = 70/109 (64%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+D+L  
Sbjct: 384 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 443

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS I  L+
Sbjct: 444 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 492


>gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1263

 Score =  175 bits (444), Expect(3) = 2e-63
 Identities = 93/191 (48%), Positives = 130/191 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP LVQ   EK+ +I++  LT  S+QKSY DN+R +LEF +G+ VFL
Sbjct: 1071 PIGWLEVGERKLLGPKLVQDATEKIHMIRQRMLTAQSRQKSYVDNRRRDLEFQVGDHVFL 1130

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  ++I++R+ +V+YRL LPP+L  +HP+F + +LRK  
Sbjct: 1131 KVSPTKGVMRFGKKGKLSPRYIGPFEILERVGEVAYRLALPPDLSNIHPVFQVSMLRKYN 1190

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLWR +  +E+T EA+
Sbjct: 1191 PDPSHVIWYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAE 1250

Query: 975  VDIKFKYPHLF 1007
             +++ K+PH F
Sbjct: 1251 DEMRTKHPHQF 1261



 Score = 75.1 bits (183), Expect(3) = 2e-63
 Identities = 41/81 (50%), Positives = 55/81 (67%)
 Frame = +1

Query: 211  LGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*PCVLEFKGSLEDYLLLIEFACNNNYHASI 390
            LGT+++ STTF+  TD Q  +TIQ L+D+L  CV++     E YL L+EFA NN++  SI
Sbjct: 996  LGTKLDFSTTFHPQTDGQSEQTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSI 1055

Query: 391  KMAPNEAPYERTCRSLIDCLK 453
            +MAP EA Y R CRS I  L+
Sbjct: 1056 QMAPFEALYGRRCRSPIGWLE 1076



 Score = 41.6 bits (96), Expect(3) = 2e-63
 Identities = 19/38 (50%), Positives = 28/38 (73%)
 Frame = +2

Query: 14   FNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            ++SI ++ DRLTKS +FL V+ TY    YA++YV EI+
Sbjct: 955  YDSIWIVVDRLTKSAHFLPVKITYGAAQYARVYVDEIL 992


>gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
          Length = 415

 Score =  178 bits (451), Expect(2) = 3e-63
 Identities = 93/191 (48%), Positives = 132/191 (69%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LTT S+QKSY+DN+R +LEF +G+ VFL
Sbjct: 223  PIGWLEVGERKLLGPELVQDATEKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFL 282

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  + I++++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 283  KVSPTKGVMRFGKKGKLSPRYIRPFDILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 342

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+  DL+Y E PVAILDRQV KL +K+VASVKVLW+ +  +E+T EA+
Sbjct: 343  PDPSHVIRYETIQLQNDLTYEEQPVAILDRQVKKLRSKDVASVKVLWQNHTSEEVTWEAE 402

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 403  DEMRTKHPHLF 413



 Score = 92.0 bits (227), Expect(2) = 3e-63
 Identities = 51/109 (46%), Positives = 70/109 (64%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+D+L  
Sbjct: 120 RLHGIPISIVSDREAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 179

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS I  L+
Sbjct: 180 CVIDLGVKWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 228


>gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 448

 Score =  178 bits (451), Expect(2) = 4e-63
 Identities = 93/191 (48%), Positives = 132/191 (69%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+QKSY+DN+R  LEF +G+ VFL
Sbjct: 256  PIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFL 315

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  ++I++++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 316  KVSPTKGIMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 375

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLWR +  +E+T EA+
Sbjct: 376  PDPSHVIRYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAE 435

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 436  DEMRTKHPHLF 446



 Score = 91.7 bits (226), Expect(2) = 4e-63
 Identities = 51/109 (46%), Positives = 70/109 (64%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+D+L  
Sbjct: 153 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 212

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS I  L+
Sbjct: 213 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 261


>gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
            cacao]
          Length = 521

 Score =  177 bits (448), Expect(2) = 8e-63
 Identities = 92/191 (48%), Positives = 131/191 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+ KSY+DN+R +LEF +G+ VFL
Sbjct: 329  PIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFL 388

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  ++I+ ++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 389  KVSPTKGVMRFGKKGKLSPRYIGPFEILDKVGTVAYRLALPPDLSNIHPVFHVSMLRKYN 448

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLWR +  +E+T EA+
Sbjct: 449  PDPSHVIRYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAE 508

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 509  DEMRTKHPHLF 519



 Score = 91.7 bits (226), Expect(2) = 8e-63
 Identities = 51/109 (46%), Positives = 70/109 (64%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+D+L  
Sbjct: 226 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 285

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS I  L+
Sbjct: 286 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 334


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  179 bits (455), Expect(2) = 1e-62
 Identities = 93/191 (48%), Positives = 133/191 (69%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+QKSY+DN+R +LEF +G+ VFL
Sbjct: 1255 PIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFL 1314

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            LS RY+  ++I++++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 1315 KVSPTKGVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 1374

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLWR +  +E+T EA+
Sbjct: 1375 PDPSHVIRYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEVTWEAE 1434

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 1435 DEMRTKHPHLF 1445



 Score = 88.6 bits (218), Expect(2) = 1e-62
 Identities = 50/109 (45%), Positives = 69/109 (63%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+ +L  
Sbjct: 1152 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRA 1211

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
            CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS I  L+
Sbjct: 1212 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 1260


>gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]
          Length = 403

 Score =  173 bits (439), Expect(2) = 7e-61
 Identities = 91/191 (47%), Positives = 130/191 (68%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+QKSY+DN+R +LEF +G+ VFL
Sbjct: 211  PVGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFL 270

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KV P            LS RY+  ++I+ ++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 271  KVLPTKGVMRFGKKGKLSPRYIGPFEILDKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 330

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLW  +  +E+T EA+
Sbjct: 331  PDPSHVIRYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWWNHTSEEVTWEAE 390

Query: 975  VDIKFKYPHLF 1007
             +++ K+PHLF
Sbjct: 391  DEMRTKHPHLF 401



 Score = 88.6 bits (218), Expect(2) = 7e-61
 Identities = 49/109 (44%), Positives = 69/109 (63%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  T  Q  RTIQ L+D+L  
Sbjct: 108 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTGGQSERTIQTLEDMLRA 167

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP EA Y R CRS +  L+
Sbjct: 168 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPVGWLE 216


>emb|CAN61139.1| hypothetical protein VITISV_009489 [Vitis vinifera]
          Length = 984

 Score =  144 bits (362), Expect(3) = 9e-61
 Identities = 79/186 (42%), Positives = 116/186 (62%)
 Frame = +3

Query: 450  EVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVSPX 629
            +VGE  LLGP+LVQ  +EKV LI+E      S+ KSY D++R +LEF +G+ VFLKVSP 
Sbjct: 789  DVGERKLLGPELVQLTVEKVALIKERLKAAQSRHKSYVDHRRRDLEFEVGDHVFLKVSPM 848

Query: 630  XXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYPSC 809
                       LS R+V L++I++R+  ++Y++ LPP L  VH +F++  LRK I  PS 
Sbjct: 849  KSVMRFGRKGKLSPRFVGLFEILERVGTLAYKVALPPSLSKVHNVFHVSTLRKYIYDPSH 908

Query: 810  ITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVDIKF 989
            +  +E +QI EDL+Y E PV I+D     L    V  VKV W  ++++E T E + +++ 
Sbjct: 909  VVDLEPIQIFEDLTYEEVPVQIVDMMDKVLRHAVVKLVKVQWSNHSIREATWELEEEMRE 968

Query: 990  KYPHLF 1007
            K+P LF
Sbjct: 969  KHPQLF 974



 Score = 99.0 bits (245), Expect(3) = 9e-61
 Identities = 53/105 (50%), Positives = 72/105 (68%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           R+H   VSI+S+R   FT+ F  S  KSLGT+++ ST F+  TD Q  R IQ+L+DL   
Sbjct: 681 RMHGVPVSIVSDRDPRFTSRFWHSLQKSLGTKLSFSTAFHPQTDGQSERVIQVLEDLFRA 740

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLI 441
           C+L+ +G+ +D+L L+EFA NN++ ASI MAP EA Y R CRS I
Sbjct: 741 CILDLQGNWDDHLPLVEFAYNNSFQASIGMAPFEALYGRKCRSPI 785



 Score = 40.0 bits (92), Expect(3) = 9e-61
 Identities = 20/37 (54%), Positives = 27/37 (72%)
 Frame = +2

Query: 17  NSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
           N+I VI DRLTKS +FL ++  + ++  A LYVKEIV
Sbjct: 644 NAIWVIVDRLTKSAHFLPMKVNFSLDRLASLYVKEIV 680


>emb|CAN77191.1| hypothetical protein VITISV_006389 [Vitis vinifera]
          Length = 1387

 Score =  144 bits (364), Expect(3) = 1e-60
 Identities = 76/191 (39%), Positives = 118/191 (61%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    E+GE  LLGP++VQ+  EK++LI+E   T   +QKSY+D +R  LEF  G+WVF+
Sbjct: 1194 PLCWIEMGESRLLGPEIVQETXEKIQLIKEKLKTAQDRQKSYADKRRRPLEFEEGDWVFV 1253

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            L+ R+V  ++I +R+  V+Y+L LP +L  VH +F++ +LRKC 
Sbjct: 1254 KVSPRRGIFRFGKKGKLAPRFVGPFQIDKRVGPVAYKLILPQQLSLVHDVFHVSMLRKCT 1313

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              P+ +  ++DVQI+ED SY E P+ IL+    +   K +  VKV W+ + ++E T E +
Sbjct: 1314 PDPTWVVDMQDVQISEDTSYVEEPLRILEVGEHRFRNKVIPXVKVXWQHHGIEEATWELE 1373

Query: 975  VDIKFKYPHLF 1007
             +++  YP LF
Sbjct: 1374 EEMRRHYPQLF 1384



 Score = 96.7 bits (239), Expect(3) = 1e-60
 Identities = 53/103 (51%), Positives = 69/103 (66%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   VSI+S+R   FT+ F +S  ++LGT++N ST F+  TD Q  R IQ+L+D+L  
Sbjct: 1091 RLHGIPVSIVSDRDPKFTSQFWQSLQRTLGTQLNFSTAFHPQTDGQSERVIQILEDMLRA 1150

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRS 435
            CVL+F G+  DYL L EFA NN+Y +SI M   EA Y R CRS
Sbjct: 1151 CVLDFGGNWADYLPLAEFAYNNSYQSSIGMXTYEALYGRPCRS 1193



 Score = 40.8 bits (94), Expect(3) = 1e-60
 Identities = 20/39 (51%), Positives = 28/39 (71%)
 Frame = +2

Query: 11   KFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            K N + +I DRLTKST+FL ++T   +   AKLY++EIV
Sbjct: 1052 KKNGVWMIVDRLTKSTHFLAMKTIDSMNSLAKLYIQEIV 1090


>gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]
          Length = 923

 Score =  150 bits (378), Expect(3) = 2e-60
 Identities = 82/186 (44%), Positives = 119/186 (63%)
 Frame = +3

Query: 450  EVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVSPX 629
            EVGE+ L+GP+LVQ   E ++ I+    T  S+QKSY+D +R +LEF +G+ VFLKV+P 
Sbjct: 736  EVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPM 795

Query: 630  XXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYPSC 809
                       LS R+V  ++I++RI  V+YRL LPP L  VH +F++ +LRK +  PS 
Sbjct: 796  KGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSH 855

Query: 810  ITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVDIKF 989
            +   E ++I E+LSY E PV +L R V  L  K++  VKVLWR + V+E T E + D++ 
Sbjct: 856  VVDYEPLEIDENLSYVEQPVEVLARGVKTLRNKQIPLVKVLWRNHRVEEATWEREDDMRS 915

Query: 990  KYPHLF 1007
            +YP LF
Sbjct: 916  RYPELF 921



 Score = 92.8 bits (229), Expect(3) = 2e-60
 Identities = 51/103 (49%), Positives = 68/103 (66%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   VSI+S+R   FT+ F +    ++GTR++ ST F+  TD Q  R  Q+L+D+L  
Sbjct: 628 RLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRA 687

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRS 435
           C LEF GS + +L L+EFA NN+Y A+I MAP EA Y R CRS
Sbjct: 688 CALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGRCCRS 730



 Score = 38.9 bits (89), Expect(3) = 2e-60
 Identities = 19/40 (47%), Positives = 27/40 (67%)
 Frame = +2

Query: 8   RKFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
           R F  I V+ DRLTKS +F+  ++TY    +A+LY+ EIV
Sbjct: 588 RGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIV 627


>gb|AEV42258.1| hypothetical protein [Beta vulgaris]
          Length = 1553

 Score =  141 bits (355), Expect(3) = 2e-59
 Identities = 76/186 (40%), Positives = 116/186 (62%)
 Frame = +3

Query: 450  EVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVSPX 629
            ++ E  +LGPD++Q+ +++V++IQE   T   +QKSY+D +R +  F +G  V LKVSP 
Sbjct: 1336 DISETVVLGPDMIQETMDQVRVIQEKIKTAQDRQKSYADQKRRDENFEVGEKVLLKVSPM 1395

Query: 630  XXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYPSC 809
                       LS +++  Y+I+ R+ +V+YRL+LP +LE VH +F++  LR+ +   S 
Sbjct: 1396 KGVMRFGKKGKLSPKFIGPYEILARVGKVAYRLDLPNDLERVHNVFHVSQLRRYVPDASH 1455

Query: 810  ITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVDIKF 989
            +   E+V+I E LSY E PV ILDR+V     K+V  VKVLWR    +E T EA+  ++ 
Sbjct: 1456 VLEPENVEIDETLSYEEKPVQILDRKVRSTRNKDVRIVKVLWRNQTTEEATWEAEDAMRL 1515

Query: 990  KYPHLF 1007
            KYP LF
Sbjct: 1516 KYPELF 1521



 Score = 99.8 bits (247), Expect(3) = 2e-59
 Identities = 53/103 (51%), Positives = 71/103 (68%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH    SI+S++   F +NF +   ++ G+ + +ST F+  TD Q  RTIQ L+D+L  
Sbjct: 1228 RLHGVPTSIVSDQDSRFLSNFWKKVQEAFGSELLMSTAFHPATDGQTERTIQTLEDMLRA 1287

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRS 435
            C LE++GS ED+L LIEF+ NN+YHASIKMAP EA Y R CRS
Sbjct: 1288 CALEYQGSWEDHLDLIEFSYNNSYHASIKMAPFEALYGRKCRS 1330



 Score = 37.4 bits (85), Expect(3) = 2e-59
 Identities = 17/37 (45%), Positives = 26/37 (70%)
 Frame = +2

Query: 17   NSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            N+I VI DRLTK+  F+ ++ T+ +E  AK YVK ++
Sbjct: 1191 NTIWVIVDRLTKTARFIPMKDTWSMEALAKAYVKNVI 1227


>emb|CAN61694.1| hypothetical protein VITISV_026655 [Vitis vinifera]
          Length = 1313

 Score =  137 bits (345), Expect(3) = 2e-58
 Identities = 74/191 (38%), Positives = 115/191 (60%)
 Frame = +3

Query: 435  PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
            P    E+GE  LLGP++V +  EK++LI+E       +QKSY+D +R  LEF  G+WVF+
Sbjct: 1120 PLCWIEMGESRLLGPEIVXETTEKIQLIKEKLKXAQDRQKSYADKRRRPLEFEEGDWVFV 1179

Query: 615  KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
            KVSP            L  R V  ++I +R+  V+Y+L LP +L  VH +F++ +LRKC 
Sbjct: 1180 KVSPRRXIFRFGKKGKLXPRXVGPFQIDKRVGPVAYKLILPQQLSLVHDVFHVSMLRKCX 1239

Query: 795  IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAK 974
              P+ +  ++DVQI+E+ SY E P+ IL+    +   K + +VKV W+ + + E T E +
Sbjct: 1240 PXPTWVVDLQDVQISENTSYVEEPLRILEVGEHRFRNKVIPAVKVWWQHHGIXEATWEPE 1299

Query: 975  VDIKFKYPHLF 1007
             +++  YP LF
Sbjct: 1300 EEMRXHYPQLF 1310



 Score = 98.2 bits (243), Expect(3) = 2e-58
 Identities = 53/103 (51%), Positives = 70/103 (67%)
 Frame = +1

Query: 127  RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
            RLH   VSI+S+R   FT+ F +S  ++LGT++N +T F+  TD Q  R IQ+L+D+L  
Sbjct: 1017 RLHGILVSIVSDRDPKFTSQFWQSLQRALGTQLNFNTAFHPQTDGQSERVIQILEDMLRA 1076

Query: 307  CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRS 435
            CVL+F G+  DYL L EFA NN+Y +SI  AP EA Y R CRS
Sbjct: 1077 CVLDFGGNWADYLPLAEFAYNNSYQSSIXXAPYEALYGRPCRS 1119



 Score = 39.3 bits (90), Expect(3) = 2e-58
 Identities = 20/39 (51%), Positives = 27/39 (69%)
 Frame = +2

Query: 11   KFNSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
            K N + VI D LTKS +FL ++TT  +   AKLY++EIV
Sbjct: 978  KKNGVWVIVDCLTKSAHFLAMKTTDSMNSLAKLYIQEIV 1016


>gb|AAL79340.1|AC099402_4 Putative 22 kDa kafirin cluster; Ty3-Gypsy type [Oryza sativa]
            gi|21327374|gb|AAM48279.1|AC122148_32 Putative 22 kDa
            kafirin cluster; Ty3-Gypsy type [Oryza sativa Japonica
            Group] gi|31431495|gb|AAP53268.1| retrotransposon
            protein, putative, Ty3-gypsy subclass [Oryza sativa
            Japonica Group]
          Length = 1230

 Score =  140 bits (354), Expect(3) = 3e-58
 Identities = 76/186 (40%), Positives = 112/186 (60%)
 Frame = +3

Query: 450  EVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFLKVSPX 629
            EVGE  LLGPD++QQ  E ++LI++   T  ++QKSY DN+R +L F IG+WV+LKVSP 
Sbjct: 1027 EVGERKLLGPDIIQQTKETIRLIRKRLQTAQNRQKSYVDNRRRDLRFDIGDWVYLKVSPM 1086

Query: 630  XXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCIIYPSC 809
                       LS RYV  + IV+RI +V+Y+++LP  L  VH +F++ ++RKC+  PS 
Sbjct: 1087 KGVKRFGLGKKLSPRYVGPFAIVKRIGEVAYKVKLPDALIGVHDVFHISMIRKCLRRPSD 1146

Query: 810  ITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEMT*EAKVDIKF 989
               I   ++  DL+Y E PV ILD +  K   + +  +KV W  +   E T E + D++ 
Sbjct: 1147 QVEIPMAELRNDLTYQEYPVCILDTKDGKTRNRNIRFLKVQWSHHTQDEATWEKEDDLQK 1206

Query: 990  KYPHLF 1007
             YP  F
Sbjct: 1207 NYPQFF 1212



 Score = 89.7 bits (221), Expect(3) = 3e-58
 Identities = 46/102 (45%), Positives = 69/102 (67%)
 Frame = +1

Query: 130  LHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*PC 309
            LH   V I+S+R   F + F +S  ++ GT+++ ST ++  TD Q  R  Q+++D+L  C
Sbjct: 920  LHGVPVRIVSDRDTRFLSKFWKSLHRAPGTKLDFSTAYHPQTDGQTERVNQIIEDMLRSC 979

Query: 310  VLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRS 435
            +LEFKGS E+++ L EFA NN+Y +SI+MAP EA Y R CR+
Sbjct: 980  ILEFKGSWEEFMPLAEFAYNNSYQSSIRMAPYEALYGRKCRT 1021



 Score = 43.9 bits (102), Expect(3) = 3e-58
 Identities = 23/37 (62%), Positives = 29/37 (78%)
 Frame = +2

Query: 17  NSI*VIFDRLTKSTNFLLVRTTYLVEDYAKLYVKEIV 127
           +SI VI DRLTKST+FL V+  + ++  AKLYVKEIV
Sbjct: 882 DSIWVIVDRLTKSTHFLPVKRNFSLKKLAKLYVKEIV 918


>gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]
          Length = 421

 Score =  163 bits (412), Expect(2) = 3e-58
 Identities = 85/175 (48%), Positives = 120/175 (68%)
 Frame = +3

Query: 435 PYRLFEVGEETLLGPDLVQQVIEKVKLIQEWWLTT*SKQKSYSDNQR*ELEFVIGNWVFL 614
           P    EVGE  LLGP+LVQ   EK+ +I++  LT  S+QKSY+DN+R +LEF +G+ VFL
Sbjct: 215 PIGWLEVGERKLLGPELVQDATEKIHIIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFL 274

Query: 615 KVSPXXXXXXXXXXXXLSLRYVALYKIVQRIDQVSYRLELPPELEAVHPMFYMFILRKCI 794
           KVSP            LS RY+  ++I++++  V+YRL LPP+L  +HP+F++ +LRK  
Sbjct: 275 KVSPTKGVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYN 334

Query: 795 IYPSCITPIEDVQITEDLSY*ETPVAILDRQVCKL*TKEVASVKVLWRRNNVKEM 959
             PS +   E +Q+ +DL+Y E PVAILDRQV KL +K+VASVKVLWR +  +E+
Sbjct: 335 PDPSHVIRYETIQLQDDLTYEEQPVAILDRQVKKLRSKDVASVKVLWRNHTSEEI 389



 Score = 90.1 bits (222), Expect(2) = 3e-58
 Identities = 50/109 (45%), Positives = 70/109 (64%)
 Frame = +1

Query: 127 RLHMFQVSIISNRSD*FTTNF*RSFPKSLGTRVNLSTTFNL*TD*QFGRTIQMLKDLL*P 306
           RLH   +SI+S+R   FT+ F     ++LGT+++ ST F+  TD Q  RTIQ L+D+L  
Sbjct: 112 RLHGIPISIVSDRGAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRA 171

Query: 307 CVLEFKGSLEDYLLLIEFACNNNYHASIKMAPNEAPYERTCRSLIDCLK 453
           CV++     E YL L+EFA NN++  SI+MAP +A Y R CRS I  L+
Sbjct: 172 CVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFKALYGRRCRSPIGWLE 220

