BLASTX nr result

ID: Atropa21_contig00020033

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00020033
         (2157 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]             219   2e-57
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]     175   9e-46
gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]              176   4e-45
gb|AAT39954.1| Putative integrase, identical [Solanum demissum]       168   1e-43
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           117   7e-42
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   100   9e-42
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                   117   3e-41
ref|XP_006341875.1| PREDICTED: uncharacterized protein LOC102587...   143   3e-41
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   169   1e-40
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...   169   1e-40
ref|XP_004515382.1| PREDICTED: uncharacterized protein LOC101499...   106   2e-40
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...   167   2e-40
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...   167   3e-40
ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...   150   2e-39
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]   169   3e-39
gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]   168   1e-38
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   156   3e-38
gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobrom...   159   7e-38
gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]   160   1e-37
ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263...   162   4e-37

>gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 367

 Score =  219 bits (559), Expect(2) = 2e-57
 Identities = 138/308 (44%), Positives = 179/308 (58%), Gaps = 16/308 (5%)
 Frame = -2

Query: 2084 MDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LNRVPIFI 1911
            MDF+V   +T+G +  IW+IVDRL KS+HF+P++  YN+E L KIYI +IV L+ VP+ I
Sbjct: 1    MDFVVGLPKTMGKYSSIWVIVDRLTKSAHFIPVKVTYNAEKLAKIYISEIVRLHGVPLSI 60

Query: 1910 ILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDXXXXXX 1734
            I D GT           + W ++   L     L   F P    +S   + VL D      
Sbjct: 61   ISDRGTQF-------TSKFWKILHAELGTRLDLSTAFHPQTDGQSERTIQVLEDMICACV 113

Query: 1733 XXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSR 1593
                          EF+        I+M PF+ALY  R  S I WF AFEVRPWG +L R
Sbjct: 114  IEFGGHWDSFLPLAEFSYNNSYHSSIDMAPFEALYGRRCRSPIGWFDAFEVRPWGTDLLR 173

Query: 1592 DSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLIL 1413
            DS++KVK I E+++  QS Q  Y D+KVRDLEF    +VLLKVSP           KLI 
Sbjct: 174  DSIEKVKSIQEKLLAAQSRQKEYADRKVRDLEFMEGEQVLLKVSPMKAVMRFGKRGKLIP 233

Query: 1412 SCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLT 1233
              IG FE++  + E+AYEL LP GL GVHPVFH+ MLK+YH DG+Y+I+ DSV LD NL+
Sbjct: 234  RYIGPFEVLKRVGEVAYELALPPGLSGVHPVFHVSMLKRYHGDGNYIIRWDSVLLDENLS 293

Query: 1232 FEEDPVEV 1209
            +EE PV +
Sbjct: 294  YEEKPVVI 301



 Score = 32.7 bits (73), Expect(2) = 2e-57
 Identities = 12/30 (40%), Positives = 21/30 (70%)
 Frame = -1

Query: 1197 IFLQWRHSPIEKVTWDTKSDMQYRYPQFST 1108
            I +QW++ P+E+ T + ++DM+ RYP   T
Sbjct: 317  IKVQWKNRPVEEATSEKEADMRERYPHLFT 346


>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score =  175 bits (444), Expect(2) = 9e-46
 Identities = 122/314 (38%), Positives = 168/314 (53%), Gaps = 16/314 (5%)
 Frame = -2

Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929
            KW  I MDF+V   +T    D IW+IVDRL KS+HF+P+ T +++E L +IYIR++V L+
Sbjct: 1415 KWERITMDFVVGLPRTSRGVDSIWVIVDRLTKSAHFLPVHTTFSAERLARIYIREVVRLH 1474

Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCD 1752
             VP+ II D G+    +        W   +  L     L   F P    +S   + VL D
Sbjct: 1475 GVPVSIISDRGSQFTSSF-------WRAFQEELGTRVHLSTSFHPQTDGQSERTIQVLED 1527

Query: 1751 XXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPW 1611
                                EFA        I+M PF+ALY  R  S + WF + E RP 
Sbjct: 1528 MLRACVMDFGGQWEQFLPLAEFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFESTEPRPR 1587

Query: 1610 GINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXX 1431
            G +L +++LD+V++I +R+ T QS   SY DQ+ R L F V  RV L+VSP         
Sbjct: 1588 GTDLLQEALDQVRVIQDRLRTAQSRHQSYADQRRRPLRFSVGDRVFLRVSPMKGVMRFGR 1647

Query: 1430 XXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVS 1251
              KL    IG FEI+  + E+AYEL LP     +HPVFH+ ML++Y  D S+V+Q D+V 
Sbjct: 1648 RGKLSPRYIGPFEILRTVGEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQYDAVE 1707

Query: 1250 LD*NLTFEEDPVEV 1209
            LD  LTF E+PV +
Sbjct: 1708 LDDRLTFVEEPVAI 1721



 Score = 37.7 bits (86), Expect(2) = 9e-46
 Identities = 11/28 (39%), Positives = 22/28 (78%)
 Frame = -1

Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120
            P + ++WRH P+E+ TW+T+ +M+ ++P
Sbjct: 1735 PVVKVRWRHRPVEEATWETEQEMREQFP 1762


>gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]
          Length = 545

 Score =  176 bits (446), Expect(2) = 4e-45
 Identities = 123/313 (39%), Positives = 169/313 (53%), Gaps = 15/313 (4%)
 Frame = -2

Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929
            KW  I MDF+V   +T    D IW+IVDRL KS+HF+P+ T +++E L +IYIR++V L+
Sbjct: 21   KWERITMDFVVGLPRTSRGVDSIWVIVDRLTKSAHFLPVHTTFSAERLARIYIREVVRLH 80

Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749
             VP+ II D G+     S +L       +  L     L   F P    +S   + VL D 
Sbjct: 81   GVPVSIISDRGSQF--TSSFLR----AFQEELGTRVHLSTAFHPQTDGQSERTIQVLEDM 134

Query: 1748 XXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608
                               EFA        I+M PF+ALY  R HS + WF + E R  G
Sbjct: 135  LRACVMDFGGQWDQFLPLAEFAYNNSYHSSIQMAPFEALYGRRCHSPVGWFESTEPRLRG 194

Query: 1607 INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428
             +L +++LD+V++I +R+ T QS   SY DQ+ R L F V  RV L+VSP          
Sbjct: 195  TDLLQEALDQVRVIQDRLRTAQSRHQSYADQRRRPLRFSVGDRVFLRVSPMKGVMRFGRR 254

Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSL 1248
             KL    IG FEI+  + E+AYEL LP     +HPVFH+ ML++Y  D S+V+Q D+V L
Sbjct: 255  GKLSPRYIGPFEILRTVGEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQYDAVEL 314

Query: 1247 D*NLTFEEDPVEV 1209
            D  LTF E+PV +
Sbjct: 315  DDRLTFVEEPVAI 327



 Score = 34.7 bits (78), Expect(2) = 4e-45
 Identities = 10/28 (35%), Positives = 21/28 (75%)
 Frame = -1

Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120
            P + ++WRH  +E+ TW+T+ +M+ ++P
Sbjct: 341  PVVKVRWRHCSVEEATWETEQEMREQFP 368


>gb|AAT39954.1| Putative integrase, identical [Solanum demissum]
          Length = 1609

 Score =  168 bits (426), Expect(2) = 1e-43
 Identities = 119/313 (38%), Positives = 167/313 (53%), Gaps = 15/313 (4%)
 Frame = -2

Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929
            KW  I MDFIV   +T    D IW+IVDRL KSSHF+ +Q+ +++E L +IYIR++V L+
Sbjct: 1112 KWERITMDFIVGLPRTSRGVDNIWVIVDRLTKSSHFLHVQSSFSTERLARIYIREVVRLH 1171

Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749
             VP+ II D G+   P +    R     +  L     L   F P    +S   + VL D 
Sbjct: 1172 GVPVSIISDRGS---PFTSSFWR---TFQDDLGTRVDLSTTFHPQTDGQSERTIQVLEDM 1225

Query: 1748 XXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608
                               EFA        I+M PF+ALY  R  S + WF + E RP G
Sbjct: 1226 LQACVMDFGGQWDQFLPLAEFAYNNNYYSSIQMAPFEALYGRRCRSPVGWFESTEARPRG 1285

Query: 1607 INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428
             +L +++LD+V++I +R+   QS   +Y D++ R L F V  RV  +VSP          
Sbjct: 1286 TDLLQEALDQVRVIQDRLRMAQSRHQNYADRRRRPLRFSVGDRVFFRVSPMKGVMRFGRR 1345

Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSL 1248
             KL    IG FEI+  + E+AYEL LP     +HPVFH+ ML++Y  D S+V+Q D+V L
Sbjct: 1346 DKLSPRYIGPFEILRTVGEVAYELALPPAFSAIHPVFHVPMLRRYVPDESHVLQYDAVEL 1405

Query: 1247 D*NLTFEEDPVEV 1209
            D  LTF E+P+ +
Sbjct: 1406 DDRLTFVEEPIAI 1418



 Score = 37.4 bits (85), Expect(2) = 1e-43
 Identities = 11/28 (39%), Positives = 21/28 (75%)
 Frame = -1

Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120
            P + + WRH P+E+ TW+T+ +M+ ++P
Sbjct: 1432 PVVKVHWRHRPVEEATWETEQEMREQFP 1459


>gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]
          Length = 1554

 Score =  117 bits (294), Expect(4) = 7e-42
 Identities = 72/163 (44%), Positives = 95/163 (58%)
 Frame = -2

Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518
            I M P++ALY  R  S I WF   E +  G +L   +++KVK+I ER+ T QS Q SYID
Sbjct: 1389 IHMAPYEALYGRRCISPIGWFEVGEAQLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYID 1448

Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338
             + R LEF+V   V LKVSP           KL    IG + I   I  +AYEL+LP  L
Sbjct: 1449 VRTRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPQYIGPYRIAKRIGNVAYELELPQEL 1508

Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEV 1209
              VHPVFHI MLKK   D S ++  +S+ +  NL++EE PV++
Sbjct: 1509 EAVHPVFHISMLKKCIGDPSLILPTESIRIKDNLSYEEIPVQI 1551



 Score = 58.2 bits (139), Expect(4) = 7e-42
 Identities = 32/77 (41%), Positives = 51/77 (66%), Gaps = 2/77 (2%)
 Frame = -2

Query: 2120 ISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIR 1947
            I + + KW  I MDFI  + ++    D IW+IVD++ KS+HF+P++T   +E   K+Y++
Sbjct: 1239 IKLPEWKWEMINMDFITGLPKSHRQHDSIWVIVDQMTKSAHFLPVRTTNIAEDYAKLYVQ 1298

Query: 1946 KIV*LNRVPIFIILD*G 1896
            +IV L+ +PI II D G
Sbjct: 1299 EIVRLHGIPISIISDRG 1315



 Score = 41.2 bits (95), Expect(4) = 7e-42
 Identities = 20/38 (52%), Positives = 24/38 (63%)
 Frame = -1

Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777
            FT+ FW S  K LG++V LS  FYP  D Q+E TI  L
Sbjct: 1318 FTAQFWKSFKKGLGSKVNLSTAFYPQTDGQAERTIHTL 1355



 Score = 23.9 bits (50), Expect(4) = 7e-42
 Identities = 10/18 (55%), Positives = 14/18 (77%)
 Frame = -3

Query: 1783 SLEDMLWSYVINFDCHWD 1730
            +LEDML + VI+F  +WD
Sbjct: 1354 TLEDMLRACVIDFKGNWD 1371


>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score =  100 bits (250), Expect(4) = 9e-42
 Identities = 64/164 (39%), Positives = 96/164 (58%), Gaps = 1/164 (0%)
 Frame = -2

Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518
            I M PF+ALY  R  S +  F   EV   G +L  ++L++V++I ER+   QS + SY D
Sbjct: 1734 IGMAPFEALYGRRCRSSVGLFEVGEVALLGPDLVMEALEEVRMIRERLKMAQSRRKSYAD 1793

Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338
             + R LEF+V   V LKVSP           KL    +G ++++  I ++AYEL+LP  +
Sbjct: 1794 VRRRALEFRVGDWVYLKVSPMKGVVRFGKKGKLSPRYVGPYKVMRRIGKVAYELELPSEM 1853

Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVS-LD*NLTFEEDPVEV 1209
              VHPVFH+ ML+K   D + ++  D V  ++ NLT+EE PV++
Sbjct: 1854 DLVHPVFHVSMLRKCVGDPNAIVSLDVVGVVEDNLTYEEVPVQI 1897



 Score = 64.7 bits (156), Expect(4) = 9e-42
 Identities = 32/72 (44%), Positives = 48/72 (66%), Gaps = 2/72 (2%)
 Frame = -2

Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929
            KW  I MDF+V   +T   F  IW++VDR+ KS+HF+P++T Y +E   ++YI  +V L+
Sbjct: 1590 KWEEINMDFVVGLPKTRKGFGSIWVVVDRMTKSAHFLPVKTTYGAEDYARLYIHDLVRLH 1649

Query: 1928 RVPIFIILD*GT 1893
             +P+ II D GT
Sbjct: 1650 GIPLSIISDRGT 1661



 Score = 40.8 bits (94), Expect(4) = 9e-42
 Identities = 20/38 (52%), Positives = 25/38 (65%)
 Frame = -1

Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777
            FTS FW S  + LG RV+L+  F+P  D Q+E TIQ L
Sbjct: 1663 FTSHFWKSFQRGLGTRVKLTTAFHPQTDGQAERTIQTL 1700



 Score = 34.3 bits (77), Expect(4) = 9e-42
 Identities = 11/22 (50%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+  +E  TW+ ++DMQ RYP
Sbjct: 1917 WRNQQVESATWEAEADMQRRYP 1938


>gb|AEV42258.1| hypothetical protein [Beta vulgaris]
          Length = 1553

 Score =  117 bits (293), Expect(4) = 3e-41
 Identities = 64/163 (39%), Positives = 99/163 (60%)
 Frame = -2

Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518
            I+M PF+ALY  +  S + W    E    G ++ ++++D+V++I E++ T Q  Q SY D
Sbjct: 1315 IKMAPFEALYGRKCRSPLCWNDISETVVLGPDMIQETMDQVRVIQEKIKTAQDRQKSYAD 1374

Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338
            QK RD  F+V  +VLLKVSP           KL    IG +EI+  + ++AY LDLP  L
Sbjct: 1375 QKRRDENFEVGEKVLLKVSPMKGVMRFGKKGKLSPKFIGPYEILARVGKVAYRLDLPNDL 1434

Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEV 1209
              VH VFH+  L++Y  D S+V++ ++V +D  L++EE PV++
Sbjct: 1435 ERVHNVFHVSQLRRYVPDASHVLEPENVEIDETLSYEEKPVQI 1477



 Score = 60.5 bits (145), Expect(4) = 3e-41
 Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 2/76 (2%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFIVV--QTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ I   KW  I+MDF+V   ++ G  + IW+IVDRL K++ F+P++  ++ E L K Y+
Sbjct: 1164 PLDIPTWKWDSISMDFVVALPRSRGGNNTIWVIVDRLTKTARFIPMKDTWSMEALAKAYV 1223

Query: 1949 RKIV*LNRVPIFIILD 1902
            + ++ L+ VP  I+ D
Sbjct: 1224 KNVIRLHGVPTSIVSD 1239



 Score = 32.0 bits (71), Expect(4) = 3e-41
 Identities = 15/38 (39%), Positives = 23/38 (60%)
 Frame = -1

Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777
            F S+FW  + +  G+ + +S  F+P  D Q+E TIQ L
Sbjct: 1244 FLSNFWKKVQEAFGSELLMSTAFHPATDGQTERTIQTL 1281



 Score = 28.9 bits (63), Expect(4) = 3e-41
 Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
 Frame = -1

Query: 1224 RSS*S*DPRIF-LQWRHSPIEKVTWDTKSDMQYRYPQ 1117
            RS+ + D RI  + WR+   E+ TW+ +  M+ +YP+
Sbjct: 1483 RSTRNKDVRIVKVLWRNQTTEEATWEAEDAMRLKYPE 1519


>ref|XP_006341875.1| PREDICTED: uncharacterized protein LOC102587225 [Solanum tuberosum]
          Length = 819

 Score =  143 bits (360), Expect(4) = 3e-41
 Identities = 81/143 (56%), Positives = 94/143 (65%)
 Frame = -2

Query: 1637 FFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSP 1458
            F    VRPW  +L R+SLDKVKLI +R++  QS + SY D+KVRDLEF V  RVLLKVSP
Sbjct: 648  FLPLAVRPWDTDLLRESLDKVKLIQDRLLMAQSRKKSYADRKVRDLEFMVGERVLLKVSP 707

Query: 1457 XXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGS 1278
                       KL    IG FEI+  I E+AY+L LP  L  VH VFHI MLKKYH  G+
Sbjct: 708  MKGVMRFGKKGKLSPRYIGPFEIVERIGEVAYQLALPPRLSRVHSVFHISMLKKYHQGGA 767

Query: 1277 YVIQ*DSVSLD*NLTFEEDPVEV 1209
             VIQ DSV LD NLTFEE+PV +
Sbjct: 768  TVIQWDSVLLDQNLTFEEEPVTI 790



 Score = 45.8 bits (107), Expect(4) = 3e-41
 Identities = 24/38 (63%), Positives = 26/38 (68%)
 Frame = -1

Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777
            FTS FW SM KELG RV+LS  F+P  D QS   IQVL
Sbjct: 593  FTSHFWQSMQKELGTRVDLSTAFHPQTDGQSGRIIQVL 630



 Score = 26.9 bits (58), Expect(4) = 3e-41
 Identities = 12/19 (63%), Positives = 14/19 (73%)
 Frame = -3

Query: 1780 LEDMLWSYVINFDCHWD*F 1724
            LEDML + VI+F  HWD F
Sbjct: 630  LEDMLRACVIDFGGHWDQF 648



 Score = 22.7 bits (47), Expect(4) = 3e-41
 Identities = 6/12 (50%), Positives = 10/12 (83%)
 Frame = -1

Query: 1191 LQWRHSPIEKVT 1156
            +QW+H P+E+ T
Sbjct: 808  VQWKHRPVEEAT 819


>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score =  169 bits (427), Expect(2) = 1e-40
 Identities = 116/321 (36%), Positives = 169/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+P++T Y +    ++Y+
Sbjct: 1088 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 1147

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 1148 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 1200

Query: 1772 YVVVL-------CDXXXXXXXXXXXLVEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L                    LVEFA        I+M PF+ALY  R  S I W  
Sbjct: 1201 TIQTLEAMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 1260

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V   V LKVSP  
Sbjct: 1261 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 1320

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 1321 GVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 1380

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 1381 IRYETIQLQDDLTYEEQPVAI 1401



 Score = 27.3 bits (59), Expect(2) = 1e-40
 Identities = 8/22 (36%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+   E+VTW+ + +M+ ++P
Sbjct: 1421 WRNHTSEEVTWEAEDEMRTKHP 1442


>gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma
            cacao]
          Length = 521

 Score =  169 bits (427), Expect(2) = 1e-40
 Identities = 115/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+P++T Y +    ++Y+
Sbjct: 162  PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 221

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 222  DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 274

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PF+ALY  R  S I W  
Sbjct: 275  TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 334

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS   SY D + RDLEF+V   V LKVSP  
Sbjct: 335  VGERKLLGPELVQDATEKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTK 394

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 395  GVMRFGKKGKLSPRYIGPFEILDKVGTVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 454

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 455  IRYETIQLQDDLTYEEQPVAI 475



 Score = 27.3 bits (59), Expect(2) = 1e-40
 Identities = 8/22 (36%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+   E+VTW+ + +M+ ++P
Sbjct: 495  WRNHTSEEVTWEAEDEMRTKHP 516


>ref|XP_004515382.1| PREDICTED: uncharacterized protein LOC101499978 [Cicer arietinum]
          Length = 1352

 Score =  106 bits (265), Expect(4) = 2e-40
 Identities = 60/164 (36%), Positives = 92/164 (56%)
 Frame = -2

Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518
            I M PF+ALY  R  + + WF   +    G  + + + DKVK+I E++   QS Q SY D
Sbjct: 979  IGMAPFEALYGRRCRTPLCWFETGDNLVLGPEIVQQTTDKVKMIQEKMRASQSRQKSYHD 1038

Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338
            ++ + LEF+    V L+V+P           KL    IG ++I+  +  +AY++ LP  L
Sbjct: 1039 KRRKSLEFQEGDHVFLRVTPTTGVGRALKMRKLTPRFIGPYQILKRVGNVAYQIALPPSL 1098

Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEVE 1206
              +H VFH+  L+KY  D S+VI+ D V +  NLTFE  P+++E
Sbjct: 1099 SNLHSVFHVSQLRKYIFDPSHVIESDKVQIKENLTFETLPLQIE 1142



 Score = 71.2 bits (173), Expect(4) = 2e-40
 Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 2/76 (2%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+SI + KW  I+MDF+V   +T   +D IW+IVDRL KS+HF+PI   Y+ E L +IYI
Sbjct: 828  PLSIPEWKWDSISMDFVVGLPRTPKRYDSIWVIVDRLTKSAHFIPINITYSMERLAEIYI 887

Query: 1949 RKIV*LNRVPIFIILD 1902
            ++IV L+ +P  I+ D
Sbjct: 888  KEIVKLHGIPSSIVSD 903



 Score = 33.1 bits (74), Expect(4) = 2e-40
 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 4/45 (8%)
 Frame = -1

Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQ----VLRIC 1768
            FTS FW  + + LG  + +S  ++P  D Q+E T Q    +LR C
Sbjct: 908  FTSKFWQGLQRALGTNLRMSSAYHPQTDGQTERTNQSLEDLLRAC 952



 Score = 25.0 bits (53), Expect(4) = 2e-40
 Identities = 9/29 (31%), Positives = 16/29 (55%)
 Frame = -1

Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYPQ 1117
            P + + W  +  E  TW+ +S M+  YP+
Sbjct: 1155 PLVKVVWGGATGESATWEVESQMRDSYPE 1183


>gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 448

 Score =  167 bits (424), Expect(2) = 2e-40
 Identities = 115/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+P++T Y +    ++Y+
Sbjct: 89   PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 148

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 149  DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 201

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PF+ALY  R  S I W  
Sbjct: 202  TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 261

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS Q SY D + R LEF+V   V LKVSP  
Sbjct: 262  VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFLKVSPTK 321

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 322  GIMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 381

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 382  IRYETIQLQDDLTYEEQPVAI 402



 Score = 27.3 bits (59), Expect(2) = 2e-40
 Identities = 8/22 (36%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+   E+VTW+ + +M+ ++P
Sbjct: 422  WRNHTSEEVTWEAEDEMRTKHP 443


>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score =  167 bits (423), Expect(2) = 3e-40
 Identities = 113/321 (35%), Positives = 169/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VD+L KS+HF+P++T Y +    ++Y+
Sbjct: 320  PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVYV 379

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 380  DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 432

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PF+ALY  R  S I W  
Sbjct: 433  TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 492

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V   V LK SP  
Sbjct: 493  VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKFSPTK 552

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG F+I+  +  +AY L LP  L  +HPVFH+ ML+KY+LD S+V
Sbjct: 553  GVMRFGKKGKLSPRYIGPFKILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNLDPSHV 612

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +L++EE PV +
Sbjct: 613  IRYETIQLQDDLSYEEQPVAI 633



 Score = 27.3 bits (59), Expect(2) = 3e-40
 Identities = 8/22 (36%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+   E+VTW+ + +M+ ++P
Sbjct: 653  WRNHTSEEVTWEAEDEMRTKHP 674


>ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum]
          Length = 823

 Score =  150 bits (379), Expect(2) = 2e-39
 Identities = 81/138 (58%), Positives = 97/138 (70%)
 Frame = -2

Query: 1622 VRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXX 1443
            VRPWG +L R+SLDKVK+I +R++  QS Q SY D+KVR+LEF V  RVLLKVSP     
Sbjct: 532  VRPWGTDLLRESLDKVKMIQDRLLMAQSRQKSYADRKVRNLEFMVGERVLLKVSPMKGVM 591

Query: 1442 XXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ* 1263
                  KL    IG FEI+  I E+AY+L LP GL GVH VFHI MLKKYH  G++VIQ 
Sbjct: 592  RFGRKGKLSPRYIGPFEIVERIGEVAYQLALPPGLSGVHSVFHISMLKKYHQGGAHVIQW 651

Query: 1262 DSVSLD*NLTFEEDPVEV 1209
            DSV LD NLTFEE+P+ +
Sbjct: 652  DSVLLDQNLTFEEEPITI 669



 Score = 41.2 bits (95), Expect(2) = 2e-39
 Identities = 16/45 (35%), Positives = 30/45 (66%), Gaps = 3/45 (6%)
 Frame = -1

Query: 1191 LQWRHSPIEKVTWDTKSDMQYRYPQF---STIQVLSFSFMLKDEI 1066
            +QW+H P+++ TW+ +SDM+ +YPQ    S +  L+ + +  DE+
Sbjct: 687  VQWKHRPVDEATWEIESDMRSKYPQLFECSVVHELNRAKIQSDEL 731


>gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]
          Length = 421

 Score =  169 bits (429), Expect = 3e-39
 Identities = 116/321 (36%), Positives = 168/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+ ++T Y +    ++Y+
Sbjct: 48   PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVYV 107

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 108  DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 160

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PFKALY  R  S I W  
Sbjct: 161  TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFKALYGRRCRSPIGWLE 220

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V   V LKVSP  
Sbjct: 221  VGERKLLGPELVQDATEKIHIIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 280

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 281  GVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 340

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 341  IRYETIQLQDDLTYEEQPVAI 361


>gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]
          Length = 403

 Score =  168 bits (425), Expect = 1e-38
 Identities = 114/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+P++T Y +    ++Y+
Sbjct: 44   PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 103

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D G            R W  ++ +L         F P    +S  
Sbjct: 104  DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTGGQSER 156

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PF+ALY  R  S + W  
Sbjct: 157  TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPVGWLE 216

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V   V LKV P  
Sbjct: 217  VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVLPTK 276

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 277  GVMRFGKKGKLSPRYIGPFEILDKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 336

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 337  IRYETIQLQDDLTYEEQPVAI 357


>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score =  156 bits (395), Expect(2) = 3e-38
 Identities = 117/320 (36%), Positives = 166/320 (51%), Gaps = 16/320 (5%)
 Frame = -2

Query: 2120 ISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIR 1947
            I + + KW  I MDFI  + ++    D IW+IVDR+ KS+HF+P++T +++E   K+YI+
Sbjct: 1239 IELPEWKWEMINMDFITGLPRSRRQHDSIWVIVDRMTKSAHFLPVKTTHSAEDYAKLYIQ 1298

Query: 1946 KIV*LNRVPIFIILD*GTSLHPASGYLCRRSWV-IEWSLV*YFTLIWIFSPN*LSKS*GY 1770
            +IV L+ VPI II D G            + W   +  L    +L   F P    ++   
Sbjct: 1299 EIVRLHGVPISIISDRGAQF-------TAQFWKSFQKGLGSKVSLSTAFHPQTDGQAERT 1351

Query: 1769 VVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFA 1629
            +  L D                   +EFA        I+M P++ALY  R  S I WF  
Sbjct: 1352 IQTLEDMLRACVIDFKSNWDDHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPIGWFEV 1411

Query: 1628 FEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXX 1449
             E R  G +L   +++KVK+I ER+ T QS Q SY D + R LEF+V   V LKVSP   
Sbjct: 1412 GEARLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYTDVRRRALEFEVDDWVYLKVSPMKG 1471

Query: 1448 XXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVI 1269
                    KL    IG + I+  +  +AYEL+LP  L  VHPVFHI MLKK   D S ++
Sbjct: 1472 VMRFGKKGKLSPRYIGPYRIVQRVGSVAYELELPQELAAVHPVFHISMLKKCIGDPSLIL 1531

Query: 1268 Q*DSVSLD*NLTFEEDPVEV 1209
              +SV +  NL++EE PV++
Sbjct: 1532 PTESVKIKDNLSYEEVPVQI 1551



 Score = 31.6 bits (70), Expect(2) = 3e-38
 Identities = 10/22 (45%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+  +E+ TW+ + DM+ RYP
Sbjct: 1571 WRNQFVEEATWEAEEDMKKRYP 1592


>gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1401

 Score =  159 bits (402), Expect(2) = 7e-38
 Identities = 105/307 (34%), Positives = 156/307 (50%), Gaps = 2/307 (0%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ +   KW  IAMDF+    +T G +D IWI+VDRL KS+HF+P++T Y +    ++Y+
Sbjct: 1085 PLPVPKWKWEHIAMDFVTGFPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 1144

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GY 1770
             +IV L+ +PI I L+            C     + W    Y  L+     N    S   
Sbjct: 1145 DEIVRLHGIPISITLEDMLRA-------CVIDLGVRWEQ--YLPLVEFAYNNSFQTS--- 1192

Query: 1769 VVVLCDXXXXXXXXXXXLVEFA*IIEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRD 1590
                                    I+M PF+ALY     S I W    E + +G  L +D
Sbjct: 1193 ------------------------IQMAPFEALYGRICRSPIGWLEVGERKLFGPELVQD 1228

Query: 1589 SLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILS 1410
            + +K+ +I ++++T QS + SY D + RDLEF+V   V LKVSP           KL   
Sbjct: 1229 ATEKIHMIRQKMLTAQSREKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLNPR 1288

Query: 1409 CIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTF 1230
             IG FEI+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+VI+ +++    +LT+
Sbjct: 1289 YIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQSQNDLTY 1348

Query: 1229 EEDPVEV 1209
            EE PV +
Sbjct: 1349 EEQPVAI 1355



 Score = 27.3 bits (59), Expect(2) = 7e-38
 Identities = 8/22 (36%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            WR+   E+VTW+ + +M+ ++P
Sbjct: 1375 WRNHTSEEVTWEAEDEMRTKHP 1396


>gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
          Length = 415

 Score =  160 bits (405), Expect(2) = 1e-37
 Identities = 111/321 (34%), Positives = 166/321 (51%), Gaps = 16/321 (4%)
 Frame = -2

Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950
            P+ + + KW  IAMDF+  + +T G +D IWI+VDRL KS+HF+ ++T Y +    ++Y+
Sbjct: 56   PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLLVKTTYGAAQYARVYV 115

Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773
             +IV L+ +PI I+ D              R W  ++ +L         F P    +S  
Sbjct: 116  DEIVRLHGIPISIVSD-------REAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 168

Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632
             +  L D                   VEFA        I+M PF+ALY  R  S I W  
Sbjct: 169  TIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 228

Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452
              E +  G  L +D+ +K+ +I ++++T QS Q SY D + RDLEF+V   V LKVSP  
Sbjct: 229  VGERKLLGPELVQDATEKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 288

Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272
                     KL    I  F+I+  +  +AY L LP  L  +HPVFH+ ML+KY+ D S+V
Sbjct: 289  GVMRFGKKGKLSPRYIRPFDILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 348

Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209
            I+ +++ L  +LT+EE PV +
Sbjct: 349  IRYETIQLQNDLTYEEQPVAI 369



 Score = 25.8 bits (55), Expect(2) = 1e-37
 Identities = 7/22 (31%), Positives = 16/22 (72%)
 Frame = -1

Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120
            W++   E+VTW+ + +M+ ++P
Sbjct: 389  WQNHTSEEVTWEAEDEMRTKHP 410


>ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263838, partial [Solanum
            lycopersicum]
          Length = 609

 Score =  162 bits (411), Expect = 4e-37
 Identities = 113/282 (40%), Positives = 148/282 (52%), Gaps = 16/282 (5%)
 Frame = -2

Query: 2099 WG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LNR 1926
            W  IAMDF+V   +TLG FD IW+IVDR+   +H +P++  YN+E L ++YI +IV L+ 
Sbjct: 335  WERIAMDFVVGLPKTLGKFDYIWVIVDRITMFAHLIPVKVTYNAEKLARLYISEIVRLHG 394

Query: 1925 VPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749
            V + II   GT             W  +   L     L   F P    +S   + VL D 
Sbjct: 395  VALSIISYRGTQF-------TSMFWRTLHAKLGTRLDLSTAFHPQTDGQSERTIQVLEDM 447

Query: 1748 XXXXXXXXXXL-------VEFA*I------IEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608
                              +EF+        I+M PF ALY  R  S I WF A+EV PWG
Sbjct: 448  LCACVIEFGGHWDNFLPLLEFSYNNSYHSGIDMAPFVALYGRRCGSPIGWFDAYEVTPWG 507

Query: 1607 INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428
             ++ RDSL+KVK I E+++  QS Q  Y D+KVRDLEF    +VLLKVSP          
Sbjct: 508  TDILRDSLEKVKSIQEKLLVAQSRQKEYADRKVRDLEFMEGDQVLLKVSPMKGVMRFGKR 567

Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFML 1302
             KL    IG F+++  + E+AYEL LP  L GVHPVFH+ ML
Sbjct: 568  CKLSPRYIGPFDVLKRVGEVAYELALPPALSGVHPVFHVSML 609

