BLASTX nr result
ID: Atropa21_contig00020033
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00020033
         (2157 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]            219   2e-57
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]    175   9e-46
gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]             176   4e-45
gb|AAT39954.1| Putative integrase, identical [Solanum demissum]      168   1e-43
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]          117   7e-42
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...  100   9e-42
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                  117   3e-41
ref|XP_006341875.1| PREDICTED: uncharacterized protein LOC102587...  143   3e-41
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...  169   1e-40
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...  169   1e-40
ref|XP_004515382.1| PREDICTED: uncharacterized protein LOC101499...  106   2e-40
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...  167   2e-40
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...  167   3e-40
ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...  150   2e-39
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]  169   3e-39
gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]  168   1e-38
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...  156   3e-38
gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobrom...
159 7e-38 gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] 160 1e-37 ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263... 162 4e-37 >gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum] Length = 367 Score = 219 bits (559), Expect(2) = 2e-57 Identities = 138/308 (44%), Positives = 179/308 (58%), Gaps = 16/308 (5%) Frame = -2 Query: 2084 MDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LNRVPIFI 1911 MDF+V +T+G + IW+IVDRL KS+HF+P++ YN+E L KIYI +IV L+ VP+ I Sbjct: 1 MDFVVGLPKTMGKYSSIWVIVDRLTKSAHFIPVKVTYNAEKLAKIYISEIVRLHGVPLSI 60 Query: 1910 ILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDXXXXXX 1734 I D GT + W ++ L L F P +S + VL D Sbjct: 61 ISDRGTQF-------TSKFWKILHAELGTRLDLSTAFHPQTDGQSERTIQVLEDMICACV 113 Query: 1733 XXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSR 1593 EF+ I+M PF+ALY R S I WF AFEVRPWG +L R Sbjct: 114 IEFGGHWDSFLPLAEFSYNNSYHSSIDMAPFEALYGRRCRSPIGWFDAFEVRPWGTDLLR 173 Query: 1592 DSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLIL 1413 DS++KVK I E+++ QS Q Y D+KVRDLEF +VLLKVSP KLI Sbjct: 174 DSIEKVKSIQEKLLAAQSRQKEYADRKVRDLEFMEGEQVLLKVSPMKAVMRFGKRGKLIP 233 Query: 1412 SCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLT 1233 IG FE++ + E+AYEL LP GL GVHPVFH+ MLK+YH DG+Y+I+ DSV LD NL+ Sbjct: 234 RYIGPFEVLKRVGEVAYELALPPGLSGVHPVFHVSMLKRYHGDGNYIIRWDSVLLDENLS 293 Query: 1232 FEEDPVEV 1209 +EE PV + Sbjct: 294 YEEKPVVI 301 Score = 32.7 bits (73), Expect(2) = 2e-57 Identities = 12/30 (40%), Positives = 21/30 (70%) Frame = -1 Query: 1197 IFLQWRHSPIEKVTWDTKSDMQYRYPQFST 1108 I +QW++ P+E+ T + ++DM+ RYP T Sbjct: 317 IKVQWKNRPVEEATSEKEADMRERYPHLFT 346 >gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum] Length = 1771 Score = 175 bits (444), Expect(2) = 9e-46 Identities = 122/314 (38%), Positives = 168/314 (53%), Gaps = 16/314 (5%) Frame = -2 Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929 KW I MDF+V +T D IW+IVDRL KS+HF+P+ T +++E L +IYIR++V L+ Sbjct: 1415 
KWERITMDFVVGLPRTSRGVDSIWVIVDRLTKSAHFLPVHTTFSAERLARIYIREVVRLH 1474 Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCD 1752 VP+ II D G+ + W + L L F P +S + VL D Sbjct: 1475 GVPVSIISDRGSQFTSSF-------WRAFQEELGTRVHLSTSFHPQTDGQSERTIQVLED 1527 Query: 1751 XXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPW 1611 EFA I+M PF+ALY R S + WF + E RP Sbjct: 1528 MLRACVMDFGGQWEQFLPLAEFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFESTEPRPR 1587 Query: 1610 GINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXX 1431 G +L +++LD+V++I +R+ T QS SY DQ+ R L F V RV L+VSP Sbjct: 1588 GTDLLQEALDQVRVIQDRLRTAQSRHQSYADQRRRPLRFSVGDRVFLRVSPMKGVMRFGR 1647 Query: 1430 XXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVS 1251 KL IG FEI+ + E+AYEL LP +HPVFH+ ML++Y D S+V+Q D+V Sbjct: 1648 RGKLSPRYIGPFEILRTVGEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQYDAVE 1707 Query: 1250 LD*NLTFEEDPVEV 1209 LD LTF E+PV + Sbjct: 1708 LDDRLTFVEEPVAI 1721 Score = 37.7 bits (86), Expect(2) = 9e-46 Identities = 11/28 (39%), Positives = 22/28 (78%) Frame = -1 Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120 P + ++WRH P+E+ TW+T+ +M+ ++P Sbjct: 1735 PVVKVRWRHRPVEEATWETEQEMREQFP 1762 >gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum] Length = 545 Score = 176 bits (446), Expect(2) = 4e-45 Identities = 123/313 (39%), Positives = 169/313 (53%), Gaps = 15/313 (4%) Frame = -2 Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929 KW I MDF+V +T D IW+IVDRL KS+HF+P+ T +++E L +IYIR++V L+ Sbjct: 21 KWERITMDFVVGLPRTSRGVDSIWVIVDRLTKSAHFLPVHTTFSAERLARIYIREVVRLH 80 Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749 VP+ II D G+ S +L + L L F P +S + VL D Sbjct: 81 GVPVSIISDRGSQF--TSSFLR----AFQEELGTRVHLSTAFHPQTDGQSERTIQVLEDM 134 Query: 1748 XXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608 EFA I+M PF+ALY R HS + WF + E R G Sbjct: 135 LRACVMDFGGQWDQFLPLAEFAYNNSYHSSIQMAPFEALYGRRCHSPVGWFESTEPRLRG 194 Query: 1607 
INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428 +L +++LD+V++I +R+ T QS SY DQ+ R L F V RV L+VSP Sbjct: 195 TDLLQEALDQVRVIQDRLRTAQSRHQSYADQRRRPLRFSVGDRVFLRVSPMKGVMRFGRR 254 Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSL 1248 KL IG FEI+ + E+AYEL LP +HPVFH+ ML++Y D S+V+Q D+V L Sbjct: 255 GKLSPRYIGPFEILRTVGEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQYDAVEL 314 Query: 1247 D*NLTFEEDPVEV 1209 D LTF E+PV + Sbjct: 315 DDRLTFVEEPVAI 327 Score = 34.7 bits (78), Expect(2) = 4e-45 Identities = 10/28 (35%), Positives = 21/28 (75%) Frame = -1 Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120 P + ++WRH +E+ TW+T+ +M+ ++P Sbjct: 341 PVVKVRWRHCSVEEATWETEQEMREQFP 368 >gb|AAT39954.1| Putative integrase, identical [Solanum demissum] Length = 1609 Score = 168 bits (426), Expect(2) = 1e-43 Identities = 119/313 (38%), Positives = 167/313 (53%), Gaps = 15/313 (4%) Frame = -2 Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929 KW I MDFIV +T D IW+IVDRL KSSHF+ +Q+ +++E L +IYIR++V L+ Sbjct: 1112 KWERITMDFIVGLPRTSRGVDNIWVIVDRLTKSSHFLHVQSSFSTERLARIYIREVVRLH 1171 Query: 1928 RVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749 VP+ II D G+ P + R + L L F P +S + VL D Sbjct: 1172 GVPVSIISDRGS---PFTSSFWR---TFQDDLGTRVDLSTTFHPQTDGQSERTIQVLEDM 1225 Query: 1748 XXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608 EFA I+M PF+ALY R S + WF + E RP G Sbjct: 1226 LQACVMDFGGQWDQFLPLAEFAYNNNYYSSIQMAPFEALYGRRCRSPVGWFESTEARPRG 1285 Query: 1607 INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428 +L +++LD+V++I +R+ QS +Y D++ R L F V RV +VSP Sbjct: 1286 TDLLQEALDQVRVIQDRLRMAQSRHQNYADRRRRPLRFSVGDRVFFRVSPMKGVMRFGRR 1345 Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSL 1248 KL IG FEI+ + E+AYEL LP +HPVFH+ ML++Y D S+V+Q D+V L Sbjct: 1346 DKLSPRYIGPFEILRTVGEVAYELALPPAFSAIHPVFHVPMLRRYVPDESHVLQYDAVEL 1405 Query: 1247 D*NLTFEEDPVEV 1209 D LTF E+P+ + Sbjct: 1406 DDRLTFVEEPIAI 1418 Score = 37.4 bits (85), 
Expect(2) = 1e-43 Identities = 11/28 (39%), Positives = 21/28 (75%) Frame = -1 Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYP 1120 P + + WRH P+E+ TW+T+ +M+ ++P Sbjct: 1432 PVVKVHWRHRPVEEATWETEQEMREQFP 1459 >gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum] Length = 1554 Score = 117 bits (294), Expect(4) = 7e-42 Identities = 72/163 (44%), Positives = 95/163 (58%) Frame = -2 Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518 I M P++ALY R S I WF E + G +L +++KVK+I ER+ T QS Q SYID Sbjct: 1389 IHMAPYEALYGRRCISPIGWFEVGEAQLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYID 1448 Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338 + R LEF+V V LKVSP KL IG + I I +AYEL+LP L Sbjct: 1449 VRTRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPQYIGPYRIAKRIGNVAYELELPQEL 1508 Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEV 1209 VHPVFHI MLKK D S ++ +S+ + NL++EE PV++ Sbjct: 1509 EAVHPVFHISMLKKCIGDPSLILPTESIRIKDNLSYEEIPVQI 1551 Score = 58.2 bits (139), Expect(4) = 7e-42 Identities = 32/77 (41%), Positives = 51/77 (66%), Gaps = 2/77 (2%) Frame = -2 Query: 2120 ISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIR 1947 I + + KW I MDFI + ++ D IW+IVD++ KS+HF+P++T +E K+Y++ Sbjct: 1239 IKLPEWKWEMINMDFITGLPKSHRQHDSIWVIVDQMTKSAHFLPVRTTNIAEDYAKLYVQ 1298 Query: 1946 KIV*LNRVPIFIILD*G 1896 +IV L+ +PI II D G Sbjct: 1299 EIVRLHGIPISIISDRG 1315 Score = 41.2 bits (95), Expect(4) = 7e-42 Identities = 20/38 (52%), Positives = 24/38 (63%) Frame = -1 Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777 FT+ FW S K LG++V LS FYP D Q+E TI L Sbjct: 1318 FTAQFWKSFKKGLGSKVNLSTAFYPQTDGQAERTIHTL 1355 Score = 23.9 bits (50), Expect(4) = 7e-42 Identities = 10/18 (55%), Positives = 14/18 (77%) Frame = -3 Query: 1783 SLEDMLWSYVINFDCHWD 1730 +LEDML + VI+F +WD Sbjct: 1354 TLEDMLRACVIDFKGNWD 1371 >ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum] Length = 1946 Score = 100 bits (250), Expect(4) = 9e-42 Identities = 64/164 (39%), Positives = 96/164 
(58%), Gaps = 1/164 (0%) Frame = -2 Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518 I M PF+ALY R S + F EV G +L ++L++V++I ER+ QS + SY D Sbjct: 1734 IGMAPFEALYGRRCRSSVGLFEVGEVALLGPDLVMEALEEVRMIRERLKMAQSRRKSYAD 1793 Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338 + R LEF+V V LKVSP KL +G ++++ I ++AYEL+LP + Sbjct: 1794 VRRRALEFRVGDWVYLKVSPMKGVVRFGKKGKLSPRYVGPYKVMRRIGKVAYELELPSEM 1853 Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVS-LD*NLTFEEDPVEV 1209 VHPVFH+ ML+K D + ++ D V ++ NLT+EE PV++ Sbjct: 1854 DLVHPVFHVSMLRKCVGDPNAIVSLDVVGVVEDNLTYEEVPVQI 1897 Score = 64.7 bits (156), Expect(4) = 9e-42 Identities = 32/72 (44%), Positives = 48/72 (66%), Gaps = 2/72 (2%) Frame = -2 Query: 2102 KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LN 1929 KW I MDF+V +T F IW++VDR+ KS+HF+P++T Y +E ++YI +V L+ Sbjct: 1590 KWEEINMDFVVGLPKTRKGFGSIWVVVDRMTKSAHFLPVKTTYGAEDYARLYIHDLVRLH 1649 Query: 1928 RVPIFIILD*GT 1893 +P+ II D GT Sbjct: 1650 GIPLSIISDRGT 1661 Score = 40.8 bits (94), Expect(4) = 9e-42 Identities = 20/38 (52%), Positives = 25/38 (65%) Frame = -1 Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777 FTS FW S + LG RV+L+ F+P D Q+E TIQ L Sbjct: 1663 FTSHFWKSFQRGLGTRVKLTTAFHPQTDGQAERTIQTL 1700 Score = 34.3 bits (77), Expect(4) = 9e-42 Identities = 11/22 (50%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ +E TW+ ++DMQ RYP Sbjct: 1917 WRNQQVESATWEAEADMQRRYP 1938 >gb|AEV42258.1| hypothetical protein [Beta vulgaris] Length = 1553 Score = 117 bits (293), Expect(4) = 3e-41 Identities = 64/163 (39%), Positives = 99/163 (60%) Frame = -2 Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518 I+M PF+ALY + S + W E G ++ ++++D+V++I E++ T Q Q SY D Sbjct: 1315 IKMAPFEALYGRKCRSPLCWNDISETVVLGPDMIQETMDQVRVIQEKIKTAQDRQKSYAD 1374 Query: 1517 QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338 QK RD F+V +VLLKVSP KL IG +EI+ + ++AY LDLP L Sbjct: 1375 
QKRRDENFEVGEKVLLKVSPMKGVMRFGKKGKLSPKFIGPYEILARVGKVAYRLDLPNDL 1434 Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEV 1209 VH VFH+ L++Y D S+V++ ++V +D L++EE PV++ Sbjct: 1435 ERVHNVFHVSQLRRYVPDASHVLEPENVEIDETLSYEEKPVQI 1477 Score = 60.5 bits (145), Expect(4) = 3e-41 Identities = 29/76 (38%), Positives = 49/76 (64%), Gaps = 2/76 (2%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFIVV--QTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ I KW I+MDF+V ++ G + IW+IVDRL K++ F+P++ ++ E L K Y+ Sbjct: 1164 PLDIPTWKWDSISMDFVVALPRSRGGNNTIWVIVDRLTKTARFIPMKDTWSMEALAKAYV 1223 Query: 1949 RKIV*LNRVPIFIILD 1902 + ++ L+ VP I+ D Sbjct: 1224 KNVIRLHGVPTSIVSD 1239 Score = 32.0 bits (71), Expect(4) = 3e-41 Identities = 15/38 (39%), Positives = 23/38 (60%) Frame = -1 Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777 F S+FW + + G+ + +S F+P D Q+E TIQ L Sbjct: 1244 FLSNFWKKVQEAFGSELLMSTAFHPATDGQTERTIQTL 1281 Score = 28.9 bits (63), Expect(4) = 3e-41 Identities = 13/37 (35%), Positives = 23/37 (62%), Gaps = 1/37 (2%) Frame = -1 Query: 1224 RSS*S*DPRIF-LQWRHSPIEKVTWDTKSDMQYRYPQ 1117 RS+ + D RI + WR+ E+ TW+ + M+ +YP+ Sbjct: 1483 RSTRNKDVRIVKVLWRNQTTEEATWEAEDAMRLKYPE 1519 >ref|XP_006341875.1| PREDICTED: uncharacterized protein LOC102587225 [Solanum tuberosum] Length = 819 Score = 143 bits (360), Expect(4) = 3e-41 Identities = 81/143 (56%), Positives = 94/143 (65%) Frame = -2 Query: 1637 FFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSP 1458 F VRPW +L R+SLDKVKLI +R++ QS + SY D+KVRDLEF V RVLLKVSP Sbjct: 648 FLPLAVRPWDTDLLRESLDKVKLIQDRLLMAQSRKKSYADRKVRDLEFMVGERVLLKVSP 707 Query: 1457 XXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGS 1278 KL IG FEI+ I E+AY+L LP L VH VFHI MLKKYH G+ Sbjct: 708 MKGVMRFGKKGKLSPRYIGPFEIVERIGEVAYQLALPPRLSRVHSVFHISMLKKYHQGGA 767 Query: 1277 YVIQ*DSVSLD*NLTFEEDPVEV 1209 VIQ DSV LD NLTFEE+PV + Sbjct: 768 TVIQWDSVLLDQNLTFEEEPVTI 790 Score = 45.8 bits (107), Expect(4) = 3e-41 Identities = 24/38 (63%), Positives = 26/38 (68%) Frame = -1 Query: 1890 
FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQVL 1777 FTS FW SM KELG RV+LS F+P D QS IQVL Sbjct: 593 FTSHFWQSMQKELGTRVDLSTAFHPQTDGQSGRIIQVL 630 Score = 26.9 bits (58), Expect(4) = 3e-41 Identities = 12/19 (63%), Positives = 14/19 (73%) Frame = -3 Query: 1780 LEDMLWSYVINFDCHWD*F 1724 LEDML + VI+F HWD F Sbjct: 630 LEDMLRACVIDFGGHWDQF 648 Score = 22.7 bits (47), Expect(4) = 3e-41 Identities = 6/12 (50%), Positives = 10/12 (83%) Frame = -1 Query: 1191 LQWRHSPIEKVT 1156 +QW+H P+E+ T Sbjct: 808 VQWKHRPVEEAT 819 >gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 169 bits (427), Expect(2) = 1e-40 Identities = 116/321 (36%), Positives = 169/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+P++T Y + ++Y+ Sbjct: 1088 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 1147 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 1148 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 1200 Query: 1772 YVVVL-------CDXXXXXXXXXXXLVEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L LVEFA I+M PF+ALY R S I W Sbjct: 1201 TIQTLEAMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 1260 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V V LKVSP Sbjct: 1261 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 1320 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 1321 GVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 1380 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 1381 IRYETIQLQDDLTYEEQPVAI 1401 Score = 27.3 bits (59), Expect(2) = 1e-40 Identities = 8/22 (36%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ E+VTW+ + +M+ ++P Sbjct: 1421 
WRNHTSEEVTWEAEDEMRTKHP 1442 >gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 169 bits (427), Expect(2) = 1e-40 Identities = 115/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+P++T Y + ++Y+ Sbjct: 162 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 221 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 222 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 274 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PF+ALY R S I W Sbjct: 275 TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 334 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS SY D + RDLEF+V V LKVSP Sbjct: 335 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRHKSYADNRRRDLEFQVGDHVFLKVSPTK 394 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 395 GVMRFGKKGKLSPRYIGPFEILDKVGTVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 454 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 455 IRYETIQLQDDLTYEEQPVAI 475 Score = 27.3 bits (59), Expect(2) = 1e-40 Identities = 8/22 (36%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ E+VTW+ + +M+ ++P Sbjct: 495 WRNHTSEEVTWEAEDEMRTKHP 516 >ref|XP_004515382.1| PREDICTED: uncharacterized protein LOC101499978 [Cicer arietinum] Length = 1352 Score = 106 bits (265), Expect(4) = 2e-40 Identities = 60/164 (36%), Positives = 92/164 (56%) Frame = -2 Query: 1697 IEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYID 1518 I M PF+ALY R + + WF + G + + + DKVK+I E++ QS Q SY D Sbjct: 979 IGMAPFEALYGRRCRTPLCWFETGDNLVLGPEIVQQTTDKVKMIQEKMRASQSRQKSYHD 1038 Query: 1517 
QKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGL 1338 ++ + LEF+ V L+V+P KL IG ++I+ + +AY++ LP L Sbjct: 1039 KRRKSLEFQEGDHVFLRVTPTTGVGRALKMRKLTPRFIGPYQILKRVGNVAYQIALPPSL 1098 Query: 1337 LGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTFEEDPVEVE 1206 +H VFH+ L+KY D S+VI+ D V + NLTFE P+++E Sbjct: 1099 SNLHSVFHVSQLRKYIFDPSHVIESDKVQIKENLTFETLPLQIE 1142 Score = 71.2 bits (173), Expect(4) = 2e-40 Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 2/76 (2%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+SI + KW I+MDF+V +T +D IW+IVDRL KS+HF+PI Y+ E L +IYI Sbjct: 828 PLSIPEWKWDSISMDFVVGLPRTPKRYDSIWVIVDRLTKSAHFIPINITYSMERLAEIYI 887 Query: 1949 RKIV*LNRVPIFIILD 1902 ++IV L+ +P I+ D Sbjct: 888 KEIVKLHGIPSSIVSD 903 Score = 33.1 bits (74), Expect(4) = 2e-40 Identities = 17/45 (37%), Positives = 25/45 (55%), Gaps = 4/45 (8%) Frame = -1 Query: 1890 FTSSFWISM*KELGNRVELSIIFYPHMDIQSELTIQ----VLRIC 1768 FTS FW + + LG + +S ++P D Q+E T Q +LR C Sbjct: 908 FTSKFWQGLQRALGTNLRMSSAYHPQTDGQTERTNQSLEDLLRAC 952 Score = 25.0 bits (53), Expect(4) = 2e-40 Identities = 9/29 (31%), Positives = 16/29 (55%) Frame = -1 Query: 1203 PRIFLQWRHSPIEKVTWDTKSDMQYRYPQ 1117 P + + W + E TW+ +S M+ YP+ Sbjct: 1155 PLVKVVWGGATGESATWEVESQMRDSYPE 1183 >gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 167 bits (424), Expect(2) = 2e-40 Identities = 115/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+P++T Y + ++Y+ Sbjct: 89 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 148 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 149 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 201 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PF+ALY R S I W Sbjct: 202 
TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 261 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS Q SY D + R LEF+V V LKVSP Sbjct: 262 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRYLEFQVGDHVFLKVSPTK 321 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 322 GIMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 381 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 382 IRYETIQLQDDLTYEEQPVAI 402 Score = 27.3 bits (59), Expect(2) = 2e-40 Identities = 8/22 (36%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ E+VTW+ + +M+ ++P Sbjct: 422 WRNHTSEEVTWEAEDEMRTKHP 443 >gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 167 bits (423), Expect(2) = 3e-40 Identities = 113/321 (35%), Positives = 169/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VD+L KS+HF+P++T Y + ++Y+ Sbjct: 320 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDQLTKSAHFLPVKTTYGAAHYARVYV 379 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 380 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 432 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PF+ALY R S I W Sbjct: 433 TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 492 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V V LK SP Sbjct: 493 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKFSPTK 552 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG F+I+ + +AY L LP L +HPVFH+ ML+KY+LD S+V Sbjct: 553 GVMRFGKKGKLSPRYIGPFKILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNLDPSHV 612 Query: 1271 
IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +L++EE PV + Sbjct: 613 IRYETIQLQDDLSYEEQPVAI 633 Score = 27.3 bits (59), Expect(2) = 3e-40 Identities = 8/22 (36%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ E+VTW+ + +M+ ++P Sbjct: 653 WRNHTSEEVTWEAEDEMRTKHP 674 >ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum] Length = 823 Score = 150 bits (379), Expect(2) = 2e-39 Identities = 81/138 (58%), Positives = 97/138 (70%) Frame = -2 Query: 1622 VRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXX 1443 VRPWG +L R+SLDKVK+I +R++ QS Q SY D+KVR+LEF V RVLLKVSP Sbjct: 532 VRPWGTDLLRESLDKVKMIQDRLLMAQSRQKSYADRKVRNLEFMVGERVLLKVSPMKGVM 591 Query: 1442 XXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ* 1263 KL IG FEI+ I E+AY+L LP GL GVH VFHI MLKKYH G++VIQ Sbjct: 592 RFGRKGKLSPRYIGPFEIVERIGEVAYQLALPPGLSGVHSVFHISMLKKYHQGGAHVIQW 651 Query: 1262 DSVSLD*NLTFEEDPVEV 1209 DSV LD NLTFEE+P+ + Sbjct: 652 DSVLLDQNLTFEEEPITI 669 Score = 41.2 bits (95), Expect(2) = 2e-39 Identities = 16/45 (35%), Positives = 30/45 (66%), Gaps = 3/45 (6%) Frame = -1 Query: 1191 LQWRHSPIEKVTWDTKSDMQYRYPQF---STIQVLSFSFMLKDEI 1066 +QW+H P+++ TW+ +SDM+ +YPQ S + L+ + + DE+ Sbjct: 687 VQWKHRPVDEATWEIESDMRSKYPQLFECSVVHELNRAKIQSDEL 731 >gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao] Length = 421 Score = 169 bits (429), Expect = 3e-39 Identities = 116/321 (36%), Positives = 168/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+ ++T Y + ++Y+ Sbjct: 48 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLSVKTTYGAAQYARVYV 107 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 108 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 160 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PFKALY R S I W Sbjct: 
161 TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFKALYGRRCRSPIGWLE 220 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V V LKVSP Sbjct: 221 VGERKLLGPELVQDATEKIHIIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 280 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 281 GVMRFGKKGKLSPRYIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 340 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 341 IRYETIQLQDDLTYEEQPVAI 361 >gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao] Length = 403 Score = 168 bits (425), Expect = 1e-38 Identities = 114/321 (35%), Positives = 168/321 (52%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+P++T Y + ++Y+ Sbjct: 44 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 103 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D G R W ++ +L F P +S Sbjct: 104 DEIVRLHGIPISIVSDRGAQF-------TSRFWGKLQEALGTKLDFSTAFHPQTGGQSER 156 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PF+ALY R S + W Sbjct: 157 TIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPVGWLE 216 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I +R++T QS Q SY D + RDLEF+V V LKV P Sbjct: 217 VGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSYADNRRRDLEFQVGDHVFLKVLPTK 276 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 277 GVMRFGKKGKLSPRYIGPFEILDKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 336 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 337 IRYETIQLQDDLTYEEQPVAI 357 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 156 bits (395), Expect(2) = 3e-38 
Identities = 117/320 (36%), Positives = 166/320 (51%), Gaps = 16/320 (5%) Frame = -2 Query: 2120 ISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIR 1947 I + + KW I MDFI + ++ D IW+IVDR+ KS+HF+P++T +++E K+YI+ Sbjct: 1239 IELPEWKWEMINMDFITGLPRSRRQHDSIWVIVDRMTKSAHFLPVKTTHSAEDYAKLYIQ 1298 Query: 1946 KIV*LNRVPIFIILD*GTSLHPASGYLCRRSWV-IEWSLV*YFTLIWIFSPN*LSKS*GY 1770 +IV L+ VPI II D G + W + L +L F P ++ Sbjct: 1299 EIVRLHGVPISIISDRGAQF-------TAQFWKSFQKGLGSKVSLSTAFHPQTDGQAERT 1351 Query: 1769 VVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFFA 1629 + L D +EFA I+M P++ALY R S I WF Sbjct: 1352 IQTLEDMLRACVIDFKSNWDDHLPLIEFAYNNSYHSSIQMAPYEALYGRRCRSPIGWFEV 1411 Query: 1628 FEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXX 1449 E R G +L +++KVK+I ER+ T QS Q SY D + R LEF+V V LKVSP Sbjct: 1412 GEARLIGPDLVHQAMEKVKVIQERLKTAQSRQKSYTDVRRRALEFEVDDWVYLKVSPMKG 1471 Query: 1448 XXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVI 1269 KL IG + I+ + +AYEL+LP L VHPVFHI MLKK D S ++ Sbjct: 1472 VMRFGKKGKLSPRYIGPYRIVQRVGSVAYELELPQELAAVHPVFHISMLKKCIGDPSLIL 1531 Query: 1268 Q*DSVSLD*NLTFEEDPVEV 1209 +SV + NL++EE PV++ Sbjct: 1532 PTESVKIKDNLSYEEVPVQI 1551 Score = 31.6 bits (70), Expect(2) = 3e-38 Identities = 10/22 (45%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ +E+ TW+ + DM+ RYP Sbjct: 1571 WRNQFVEEATWEAEEDMKKRYP 1592 >gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1401 Score = 159 bits (402), Expect(2) = 7e-38 Identities = 105/307 (34%), Positives = 156/307 (50%), Gaps = 2/307 (0%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + KW IAMDF+ +T G +D IWI+VDRL KS+HF+P++T Y + ++Y+ Sbjct: 1085 PLPVPKWKWEHIAMDFVTGFPRTSGGYDSIWIVVDRLTKSAHFLPVKTTYGAAQYARVYV 1144 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSWVIEWSLV*YFTLIWIFSPN*LSKS*GY 1770 +IV L+ +PI I L+ C + W Y L+ N S Sbjct: 1145 DEIVRLHGIPISITLEDMLRA-------CVIDLGVRWEQ--YLPLVEFAYNNSFQTS--- 1192 Query: 
1769 VVVLCDXXXXXXXXXXXLVEFA*IIEMLPFKALYSGRFHSYIVWFFAFEVRPWGINLSRD 1590 I+M PF+ALY S I W E + +G L +D Sbjct: 1193 ------------------------IQMAPFEALYGRICRSPIGWLEVGERKLFGPELVQD 1228 Query: 1589 SLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXXXKLILS 1410 + +K+ +I ++++T QS + SY D + RDLEF+V V LKVSP KL Sbjct: 1229 ATEKIHMIRQKMLTAQSREKSYADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLNPR 1288 Query: 1409 CIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYVIQ*DSVSLD*NLTF 1230 IG FEI+ + +AY L LP L +HPVFH+ ML+KY+ D S+VI+ +++ +LT+ Sbjct: 1289 YIGPFEILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHVIRYETIQSQNDLTY 1348 Query: 1229 EEDPVEV 1209 EE PV + Sbjct: 1349 EEQPVAI 1355 Score = 27.3 bits (59), Expect(2) = 7e-38 Identities = 8/22 (36%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 WR+ E+VTW+ + +M+ ++P Sbjct: 1375 WRNHTSEEVTWEAEDEMRTKHP 1396 >gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] Length = 415 Score = 160 bits (405), Expect(2) = 1e-37 Identities = 111/321 (34%), Positives = 166/321 (51%), Gaps = 16/321 (4%) Frame = -2 Query: 2123 PISILD*KWG*IAMDFI--VVQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYI 1950 P+ + + KW IAMDF+ + +T G +D IWI+VDRL KS+HF+ ++T Y + ++Y+ Sbjct: 56 PLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVDRLTKSAHFLLVKTTYGAAQYARVYV 115 Query: 1949 RKIV*LNRVPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*G 1773 +IV L+ +PI I+ D R W ++ +L F P +S Sbjct: 116 DEIVRLHGIPISIVSD-------REAQFTSRFWGKLQEALGTKLDFSTAFHPQTDGQSER 168 Query: 1772 YVVVLCDXXXXXXXXXXXL-------VEFA*------IIEMLPFKALYSGRFHSYIVWFF 1632 + L D VEFA I+M PF+ALY R S I W Sbjct: 169 TIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 228 Query: 1631 AFEVRPWGINLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXX 1452 E + G L +D+ +K+ +I ++++T QS Q SY D + RDLEF+V V LKVSP Sbjct: 229 VGERKLLGPELVQDATEKIHMIRQKMLTTQSRQKSYADNRRRDLEFQVGDHVFLKVSPTK 288 Query: 1451 XXXXXXXXXKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFMLKKYHLDGSYV 1272 KL I F+I+ + +AY L LP L +HPVFH+ ML+KY+ D S+V Sbjct: 289 
GVMRFGKKGKLSPRYIRPFDILEKVGAVAYRLALPPDLSNIHPVFHVSMLRKYNPDPSHV 348 Query: 1271 IQ*DSVSLD*NLTFEEDPVEV 1209 I+ +++ L +LT+EE PV + Sbjct: 349 IRYETIQLQNDLTYEEQPVAI 369 Score = 25.8 bits (55), Expect(2) = 1e-37 Identities = 7/22 (31%), Positives = 16/22 (72%) Frame = -1 Query: 1185 WRHSPIEKVTWDTKSDMQYRYP 1120 W++ E+VTW+ + +M+ ++P Sbjct: 389 WQNHTSEEVTWEAEDEMRTKHP 410 >ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263838, partial [Solanum lycopersicum] Length = 609 Score = 162 bits (411), Expect = 4e-37 Identities = 113/282 (40%), Positives = 148/282 (52%), Gaps = 16/282 (5%) Frame = -2 Query: 2099 WG*IAMDFIV--VQTLG*FDVIWIIVDRLMKSSHFVPIQTIYNSEILVKIYIRKIV*LNR 1926 W IAMDF+V +TLG FD IW+IVDR+ +H +P++ YN+E L ++YI +IV L+ Sbjct: 335 WERIAMDFVVGLPKTLGKFDYIWVIVDRITMFAHLIPVKVTYNAEKLARLYISEIVRLHG 394 Query: 1925 VPIFIILD*GTSLHPASGYLCRRSW-VIEWSLV*YFTLIWIFSPN*LSKS*GYVVVLCDX 1749 V + II GT W + L L F P +S + VL D Sbjct: 395 VALSIISYRGTQF-------TSMFWRTLHAKLGTRLDLSTAFHPQTDGQSERTIQVLEDM 447 Query: 1748 XXXXXXXXXXL-------VEFA*I------IEMLPFKALYSGRFHSYIVWFFAFEVRPWG 1608 +EF+ I+M PF ALY R S I WF A+EV PWG Sbjct: 448 LCACVIEFGGHWDNFLPLLEFSYNNSYHSGIDMAPFVALYGRRCGSPIGWFDAYEVTPWG 507 Query: 1607 INLSRDSLDKVKLI*ERVITIQSWQNSYIDQKVRDLEFKVSVRVLLKVSPXXXXXXXXXX 1428 ++ RDSL+KVK I E+++ QS Q Y D+KVRDLEF +VLLKVSP Sbjct: 508 TDILRDSLEKVKSIQEKLLVAQSRQKEYADRKVRDLEFMEGDQVLLKVSPMKGVMRFGKR 567 Query: 1427 XKLILSCIGQFEIIHHISEIAYELDLPYGLLGVHPVFHIFML 1302 KL IG F+++ + E+AYEL LP L GVHPVFH+ ML Sbjct: 568 CKLSPRYIGPFDVLKRVGEVAYELALPPALSGVHPVFHVSML 609