BLASTX nr result
ID: Atropa21_contig00037334
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00037334
         (604 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]             116   3e-43
ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263...   116   5e-39
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]      99   2e-37
gb|AAT39954.1| Putative integrase, identical [Solanum demissum]        97   1e-36
gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]               95   1e-36
ref|XP_004506381.1| PREDICTED: enzymatic polyprotein-like [Cicer...    93   1e-32
gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao]    91   6e-31
ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599...    94   9e-31
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...    87   1e-30
gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobrom...    89   2e-30
gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom...    88   3e-30
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]    88   3e-30
gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao]     87   5e-30
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    87   6e-30
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...    89   1e-29
gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]    86   1e-29
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]          89   1e-29
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...    86   2e-29
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...
91 3e-29 gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao] 90 2e-28 >gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum] Length = 367 Score = 116 bits (291), Expect(4) = 3e-43 Identities = 63/102 (61%), Positives = 70/102 (68%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +SSI+M PFE LYGRRC S IGWFDAF V P G DLLRDS++KVK A SRQ+ Sbjct: 136 HSSIDMAPFEALYGRRCRSPIGWFDAFEVRPWGTDLLRDSIEKVKSIQEKLLAAQSRQKE 195 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIG 138 YAD+KV D EFM E VLLKV MK MR K+ KL RYIG Sbjct: 196 YADRKVRDLEFMEGEQVLLKVSPMKAVMRFGKRGKLIPRYIG 237 Score = 63.9 bits (154), Expect(4) = 3e-43 Identities = 31/42 (73%), Positives = 36/42 (85%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQVL+DM+ ACVI+FGGH D FLPLAEF+YNNSY Sbjct: 94 TDGQSERTIQVLEDMICACVIEFGGHWDSFLPLAEFSYNNSY 135 Score = 38.9 bits (89), Expect(4) = 3e-43 Identities = 16/32 (50%), Positives = 24/32 (75%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 E+AL GL HP+FHVSM K+Y +G+++I+ Sbjct: 251 ELALPPGLSGVHPVFHVSMLKRYHGDGNYIIR 282 Score = 23.1 bits (48), Expect(4) = 3e-43 Identities = 8/12 (66%), Positives = 11/12 (91%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TR++LST FHP+ Sbjct: 82 TRLDLSTAFHPQ 93 >ref|XP_004228792.1| PREDICTED: uncharacterized protein LOC101263838, partial [Solanum lycopersicum] Length = 609 Score = 116 bits (291), Expect(4) = 5e-39 Identities = 62/106 (58%), Positives = 74/106 (69%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +S I+M PF LYGRRC S IGWFDA+ V+P G D+LRDSL+KVK +A SRQ+ Sbjct: 475 HSGIDMAPFVALYGRRCGSPIGWFDAYEVTPWGTDILRDSLEKVKSIQEKLLVAQSRQKE 534 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 YAD+KV D EFM + VLLKV MKG MR K+ KLS RYIG D+ Sbjct: 535 YADRKVRDLEFMEGDQVLLKVSPMKGVMRFGKRCKLSPRYIGPFDV 580 Score = 62.8 bits (151), Expect(4) = 5e-39 Identities = 31/42 (73%), Positives = 35/42 (83%) Frame = -1 Query: 568 
TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQVL+DML ACVI+FGGH D FLPL EF+YNNSY Sbjct: 433 TDGQSERTIQVLEDMLCACVIEFGGHWDNFLPLLEFSYNNSY 474 Score = 25.8 bits (55), Expect(4) = 5e-39 Identities = 11/19 (57%), Positives = 13/19 (68%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSM 55 E+AL L HP+FHVSM Sbjct: 590 ELALPPALSGVHPVFHVSM 608 Score = 23.1 bits (48), Expect(4) = 5e-39 Identities = 8/12 (66%), Positives = 11/12 (91%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TR++LST FHP+ Sbjct: 421 TRLDLSTAFHPQ 432 >gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum] Length = 1771 Score = 99.0 bits (245), Expect(4) = 2e-37 Identities = 52/106 (49%), Positives = 69/106 (65%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +SSI+M PFE LYGRRC S +GWF++ PRG DLL+++LD+V+ A SR +S Sbjct: 1556 HSSIQMAPFEALYGRRCRSPVGWFESTEPRPRGTDLLQEALDQVRVIQDRLRTAQSRHQS 1615 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 YADQ+ F V + V L+V MKG MR ++ KLS RYIG +I Sbjct: 1616 YADQRRRPLRFSVGDRVFLRVSPMKGVMRFGRRGKLSPRYIGPFEI 1661 Score = 65.1 bits (157), Expect(4) = 2e-37 Identities = 33/47 (70%), Positives = 38/47 (80%) Frame = -1 Query: 583 NFSS*TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 +F T+ QSE IQVL+DMLRACV+DFGG + FLPLAEFAYNNSY Sbjct: 1509 SFHPQTDGQSERTIQVLEDMLRACVMDFGGQWEQFLPLAEFAYNNSY 1555 Score = 35.0 bits (79), Expect(4) = 2e-37 Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 4/37 (10%) Frame = -2 Query: 114 GEVALTLGL----LSEHPIFHVSMFKKYDLEGSHVIQ 16 GEVA L L + HP+FHVSM ++Y + SHV+Q Sbjct: 1666 GEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQ 1702 Score = 23.9 bits (50), Expect(4) = 2e-37 Identities = 9/12 (75%), Positives = 11/12 (91%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TRV LST+FHP+ Sbjct: 1502 TRVHLSTSFHPQ 1513 >gb|AAT39954.1| Putative integrase, identical [Solanum demissum] Length = 1609 Score = 97.4 bits (241), Expect(4) = 1e-36 Identities = 50/106 (47%), Positives = 69/106 (65%) Frame = -3 Query: 443 
YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 YSSI+M PFE LYGRRC S +GWF++ PRG DLL+++LD+V+ +A SR ++ Sbjct: 1253 YSSIQMAPFEALYGRRCRSPVGWFESTEARPRGTDLLQEALDQVRVIQDRLRMAQSRHQN 1312 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 YAD++ F V + V +V MKG MR ++ KLS RYIG +I Sbjct: 1313 YADRRRRPLRFSVGDRVFFRVSPMKGVMRFGRRDKLSPRYIGPFEI 1358 Score = 63.5 bits (153), Expect(4) = 1e-36 Identities = 31/42 (73%), Positives = 36/42 (85%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQVL+DML+ACV+DFGG D FLPLAEFAYNN+Y Sbjct: 1211 TDGQSERTIQVLEDMLQACVMDFGGQWDQFLPLAEFAYNNNY 1252 Score = 33.1 bits (74), Expect(4) = 1e-36 Identities = 14/32 (43%), Positives = 21/32 (65%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 E+AL + HP+FHV M ++Y + SHV+Q Sbjct: 1368 ELALPPAFSAIHPVFHVPMLRRYVPDESHVLQ 1399 Score = 26.2 bits (56), Expect(4) = 1e-36 Identities = 10/12 (83%), Positives = 12/12 (100%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TRV+LSTTFHP+ Sbjct: 1199 TRVDLSTTFHPQ 1210 >gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum] Length = 545 Score = 95.1 bits (235), Expect(4) = 1e-36 Identities = 51/106 (48%), Positives = 68/106 (64%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +SSI+M PFE LYGRRC S +GWF++ RG DLL+++LD+V+ A SR +S Sbjct: 162 HSSIQMAPFEALYGRRCHSPVGWFESTEPRLRGTDLLQEALDQVRVIQDRLRTAQSRHQS 221 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 YADQ+ F V + V L+V MKG MR ++ KLS RYIG +I Sbjct: 222 YADQRRRPLRFSVGDRVFLRVSPMKGVMRFGRRGKLSPRYIGPFEI 267 Score = 66.2 bits (160), Expect(4) = 1e-36 Identities = 33/42 (78%), Positives = 36/42 (85%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQVL+DMLRACV+DFGG D FLPLAEFAYNNSY Sbjct: 120 TDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLAEFAYNNSY 161 Score = 35.0 bits (79), Expect(4) = 1e-36 Identities = 18/37 (48%), Positives = 24/37 (64%), Gaps = 4/37 (10%) Frame = -2 Query: 114 
GEVALTLGL----LSEHPIFHVSMFKKYDLEGSHVIQ 16 GEVA L L + HP+FHVSM ++Y + SHV+Q Sbjct: 272 GEVAYELALPPVFSAIHPVFHVSMLRRYVPDESHVLQ 308 Score = 23.5 bits (49), Expect(4) = 1e-36 Identities = 9/12 (75%), Positives = 10/12 (83%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TRV LST FHP+ Sbjct: 108 TRVHLSTAFHPQ 119 >ref|XP_004506381.1| PREDICTED: enzymatic polyprotein-like [Cicer arietinum] Length = 690 Score = 93.2 bits (230), Expect(4) = 1e-32 Identities = 50/107 (46%), Positives = 68/107 (63%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 SSI+M PFE LYGRRC S IGWF+ G +L++D+++KVK A SRQ+SY Sbjct: 482 SSIQMAPFEALYGRRCRSPIGWFEVREAKLVGPELIQDAIEKVKLIRDRLVTAQSRQKSY 541 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 +D++ EF V E V L+V MKG +R KK KLS R+IG ++ + Sbjct: 542 SDKRRRPLEFSVGEHVFLRVSPMKGVLRFGKKGKLSPRFIGPFEVLE 588 Score = 56.6 bits (135), Expect(4) = 1e-32 Identities = 27/39 (69%), Positives = 31/39 (79%) Frame = -1 Query: 559 QSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 QSE IQ+L+DMLRACV+D GG D LPL EFAYNN+Y Sbjct: 442 QSERTIQILEDMLRACVLDLGGSWDQHLPLMEFAYNNTY 480 Score = 33.5 bits (75), Expect(4) = 1e-32 Identities = 15/30 (50%), Positives = 20/30 (66%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVI 19 +AL L HP+FH+SM +KY + SHVI Sbjct: 597 LALPPDLSGVHPVFHISMLRKYLHDPSHVI 626 Score = 23.5 bits (49), Expect(4) = 1e-32 Identities = 8/13 (61%), Positives = 12/13 (92%) Frame = -2 Query: 603 TRVELSTTFHPRL 565 TR++LST FHP++ Sbjct: 427 TRLKLSTAFHPQI 439 >gb|EOY19683.1| Uncharacterized protein TCM_044868 [Theobroma cacao] Length = 403 Score = 90.9 bits (224), Expect(3) = 6e-31 Identities = 50/105 (47%), Positives = 64/105 (60%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S +GW + G +L++D+ +K+ A SRQ+SY Sbjct: 193 TSIQMAPFEALYGRRCRSPVGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSY 252 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 AD + D EF V + V LKVL KG MR KK KLS RYIG +I 
Sbjct: 253 ADNRRRDLEFQVGDHVFLKVLPTKGVMRFGKKGKLSPRYIGPFEI 297 Score = 53.9 bits (128), Expect(3) = 6e-31 Identities = 27/42 (64%), Positives = 31/42 (73%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 150 TGGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 191 Score = 36.2 bits (82), Expect(3) = 6e-31 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 308 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 338 >ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599406 [Solanum tuberosum] Length = 859 Score = 94.0 bits (232), Expect(3) = 9e-31 Identities = 49/106 (46%), Positives = 69/106 (65%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +SSI+M PFE LYGRRC S +GWF++ PRG DL+R++LD V+ A SR +S Sbjct: 612 HSSIQMAPFEALYGRRCRSPVGWFESTKRRPRGTDLMREALDHVRVIQDRLRTAQSRHQS 671 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 YA+++ +F V V L+V MKG +R ++ KLS+RYIG +I Sbjct: 672 YANRRRRPLKFAVGNRVFLRVSPMKGVIRFGRRGKLSARYIGPFEI 717 Score = 62.0 bits (149), Expect(3) = 9e-31 Identities = 31/42 (73%), Positives = 35/42 (83%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE I+VL+DMLRACV+DFGG D LPLAEFAYNNSY Sbjct: 570 TDGQSERTIKVLEDMLRACVMDFGGQWDQHLPLAEFAYNNSY 611 Score = 24.3 bits (51), Expect(3) = 9e-31 Identities = 9/12 (75%), Positives = 11/12 (91%) Frame = -2 Query: 603 TRVELSTTFHPR 568 TRV+L TTFHP+ Sbjct: 558 TRVDLCTTFHPQ 569 >gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 86.7 bits (213), Expect(3) = 1e-30 Identities = 48/101 (47%), Positives = 60/101 (59%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ A SRQ+SY Sbjct: 469 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSY 528 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIG 138 
AD + D EF V + V LK KG MR KK KLS RYIG Sbjct: 529 ADNRRRDLEFQVGDHVFLKFSPTKGVMRFGKKGKLSPRYIG 569 Score = 54.3 bits (129), Expect(3) = 1e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 426 TDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 467 Score = 38.9 bits (89), Expect(3) = 1e-30 Identities = 17/31 (54%), Positives = 24/31 (77%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+L+ SHVI+ Sbjct: 584 LALPPDLSNIHPVFHVSMLRKYNLDPSHVIR 614 >gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 562 Score = 89.4 bits (220), Expect(3) = 2e-30 Identities = 50/107 (46%), Positives = 64/107 (59%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ A SRQ+SY Sbjct: 418 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMISQKMLTAQSRQKSY 477 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + D EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 478 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILE 524 Score = 54.3 bits (129), Expect(3) = 2e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 375 TDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 416 Score = 35.8 bits (81), Expect(3) = 2e-30 Identities = 16/30 (53%), Positives = 22/30 (73%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVI 19 +AL L + HP+FHVSM +KY+ + SHVI Sbjct: 533 LALPPDLSNIHPVFHVSMLRKYNPDPSHVI 562 >gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 937 Score = 87.8 bits (216), Expect(3) = 3e-30 Identities = 49/107 (45%), Positives = 63/107 (58%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G + ++D+ +K+ A SRQ+SY Sbjct: 699 
TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPEFVQDATEKIHMIRQRMLTAQSRQKSY 758 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + D EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 759 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILE 805 Score = 54.3 bits (129), Expect(3) = 3e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 656 TDGQSEWTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 697 Score = 36.2 bits (82), Expect(3) = 3e-30 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 814 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 844 >gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao] Length = 421 Score = 87.8 bits (216), Expect(3) = 3e-30 Identities = 49/107 (45%), Positives = 64/107 (59%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PF+ LYGRRC S IGW + G +L++D+ +K+ A SRQ+SY Sbjct: 197 TSIQMAPFKALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHIIRQRMLTAQSRQKSY 256 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + D EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 257 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILE 303 Score = 54.3 bits (129), Expect(3) = 3e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 154 TDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 195 Score = 36.2 bits (82), Expect(3) = 3e-30 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 312 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 342 >gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1263 Score = 86.7 bits (213), Expect(4) = 5e-30 Identities = 49/107 (45%), Positives = 62/107 (57%) Frame = -3 Query: 440 
SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G L++D+ +K+ A SRQ+SY Sbjct: 1053 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPKLVQDATEKIHMIRQRMLTAQSRQKSY 1112 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 D + D EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 1113 VDNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILE 1159 Score = 54.3 bits (129), Expect(4) = 5e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 1010 TDGQSEQTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 1051 Score = 34.3 bits (77), Expect(4) = 5e-30 Identities = 18/36 (50%), Positives = 23/36 (63%), Gaps = 4/36 (11%) Frame = -2 Query: 114 GEVALTLGLLSE----HPIFHVSMFKKYDLEGSHVI 19 GEVA L L + HP+F VSM +KY+ + SHVI Sbjct: 1162 GEVAYRLALPPDLSNIHPVFQVSMLRKYNPDPSHVI 1197 Score = 22.3 bits (46), Expect(4) = 5e-30 Identities = 7/12 (58%), Positives = 11/12 (91%) Frame = -2 Query: 603 TRVELSTTFHPR 568 T+++ STTFHP+ Sbjct: 998 TKLDFSTTFHPQ 1009 >gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 87.0 bits (214), Expect(3) = 6e-30 Identities = 49/105 (46%), Positives = 62/105 (59%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ A SR +SY Sbjct: 311 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRHKSY 370 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 AD + D EF V + V LKV KG MR KK KLS RYIG +I Sbjct: 371 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEI 415 Score = 54.3 bits (129), Expect(3) = 6e-30 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 268 TDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 309 Score = 36.2 bits (82), Expect(3) = 6e-30 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 
VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 426 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 456 >gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 89.4 bits (220), Expect(3) = 1e-29 Identities = 50/107 (46%), Positives = 64/107 (59%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ A SRQ+SY Sbjct: 1237 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSY 1296 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + D EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 1297 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEILE 1343 Score = 51.2 bits (121), Expect(3) = 1e-29 Identities = 26/42 (61%), Positives = 31/42 (73%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+ MLRACVID G + +LPL EFAYNNS+ Sbjct: 1194 TDGQSERTIQTLEAMLRACVIDLGVRWEQYLPLVEFAYNNSF 1235 Score = 36.2 bits (82), Expect(3) = 1e-29 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 1352 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 1382 >gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao] Length = 415 Score = 86.3 bits (212), Expect(3) = 1e-29 Identities = 49/107 (45%), Positives = 62/107 (57%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ SRQ+SY Sbjct: 205 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQKMLTTQSRQKSY 264 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + D EF V + V LKV KG MR KK KLS RYI DI + Sbjct: 265 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIRPFDILE 311 Score = 53.9 bits (128), Expect(3) = 1e-29 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 162 TDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNNSF 203 Score = 36.2 bits 
(82), Expect(3) = 1e-29 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 320 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 350 >gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum] Length = 4543 Score = 89.0 bits (219), Expect(4) = 1e-29 Identities = 50/102 (49%), Positives = 61/102 (59%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 + SI+M P+E LYGRRC S IGWF+ G DL+ +++KVK A SRQ+S Sbjct: 1135 HPSIQMAPYEALYGRRCRSPIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKS 1194 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIG 138 Y D + EF V + V LKV MKG MR KK KLS RYIG Sbjct: 1195 YTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIG 1236 Score = 89.0 bits (219), Expect(4) = 1e-29 Identities = 50/102 (49%), Positives = 61/102 (59%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 + SI+M P+E LYGRRC S IGWF+ G DL+ +++KVK A SRQ+S Sbjct: 2645 HPSIQMAPYEALYGRRCRSPIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKS 2704 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIG 138 Y D + EF V + V LKV MKG MR KK KLS RYIG Sbjct: 2705 YTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIG 2746 Score = 89.0 bits (219), Expect(4) = 1e-29 Identities = 50/102 (49%), Positives = 61/102 (59%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 + SI+M P+E LYGRRC S IGWF+ G DL+ +++KVK A SRQ+S Sbjct: 4155 HPSIQMAPYEALYGRRCRSPIGWFEVGEAQLIGPDLVHQAMEKVKVIKERLKTAQSRQKS 4214 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIG 138 Y D + EF V + V LKV MKG MR KK KLS RYIG Sbjct: 4215 YTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIG 4256 Score = 58.5 bits (140), Expect(4) = 1e-29 Identities = 29/42 (69%), Positives = 34/42 (80%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ Q+E IQ+L+DMLRACVIDF G+ D LPL EFAYNNSY Sbjct: 1093 TDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSY 1134 Score = 58.5 bits (140), Expect(4) = 1e-29 Identities = 29/42 
(69%), Positives = 34/42 (80%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ Q+E IQ+L+DMLRACVIDF G+ D LPL EFAYNNSY Sbjct: 2603 TDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSY 2644 Score = 58.5 bits (140), Expect(4) = 1e-29 Identities = 29/42 (69%), Positives = 34/42 (80%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ Q+E IQ+L+DMLRACVIDF G+ D LPL EFAYNNSY Sbjct: 4113 TDGQAEHTIQILEDMLRACVIDFKGNWDDHLPLIEFAYNNSY 4154 Score = 27.7 bits (60), Expect(4) = 1e-29 Identities = 11/22 (50%), Positives = 15/22 (68%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKK 46 E+ L L + HP+FH+SM KK Sbjct: 1250 ELELPQELAAVHPVFHISMLKK 1271 Score = 27.7 bits (60), Expect(4) = 1e-29 Identities = 11/22 (50%), Positives = 15/22 (68%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKK 46 E+ L L + HP+FH+SM KK Sbjct: 2760 ELELPQELAAVHPVFHISMLKK 2781 Score = 27.7 bits (60), Expect(4) = 1e-29 Identities = 11/22 (50%), Positives = 15/22 (68%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKK 46 E+ L L + HP+FH+SM KK Sbjct: 4270 ELELPQELAAVHPVFHISMLKK 4291 Score = 20.8 bits (42), Expect(4) = 1e-29 Identities = 7/12 (58%), Positives = 10/12 (83%) Frame = -2 Query: 603 TRVELSTTFHPR 568 ++V LST FHP+ Sbjct: 1081 SKVNLSTAFHPQ 1092 Score = 20.8 bits (42), Expect(4) = 1e-29 Identities = 7/12 (58%), Positives = 10/12 (83%) Frame = -2 Query: 603 TRVELSTTFHPR 568 ++V LST FHP+ Sbjct: 2591 SKVNLSTAFHPQ 2602 Score = 20.8 bits (42), Expect(4) = 1e-29 Identities = 7/12 (58%), Positives = 10/12 (83%) Frame = -2 Query: 603 TRVELSTTFHPR 568 ++V LST FHP+ Sbjct: 4101 SKVNLSTAFHPQ 4112 >gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 448 Score = 85.5 bits (210), Expect(3) = 2e-29 Identities = 49/107 (45%), Positives = 63/107 (58%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K+ A SRQ+SY Sbjct: 238 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIHMIRQRMLTAQSRQKSY 297 Query: 260 
ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 AD + EF V + V LKV KG MR KK KLS RYIG +I + Sbjct: 298 ADNRRRYLEFQVGDHVFLKVSPTKGIMRFGKKGKLSPRYIGPFEILE 344 Score = 54.3 bits (129), Expect(3) = 2e-29 Identities = 27/42 (64%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ QSE IQ L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 195 TDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 236 Score = 36.2 bits (82), Expect(3) = 2e-29 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 353 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 383 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 91.3 bits (225), Expect(4) = 3e-29 Identities = 53/108 (49%), Positives = 64/108 (59%) Frame = -3 Query: 443 YSSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRS 264 +SSI+M P+E LYGRRC S IGWF+ G DL+ +++KVK A SRQ+S Sbjct: 1386 HSSIQMAPYEALYGRRCRSPIGWFEVGEARLIGPDLVHQAMEKVKVIQERLKTAQSRQKS 1445 Query: 263 YADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDITQ 120 Y D + EF V + V LKV MKG MR KK KLS RYIG I Q Sbjct: 1446 YTDVRRRALEFEVDDWVYLKVSPMKGVMRFGKKGKLSPRYIGPYRIVQ 1493 Score = 55.1 bits (131), Expect(4) = 3e-29 Identities = 28/42 (66%), Positives = 32/42 (76%) Frame = -1 Query: 568 TNVQSE*IIQVLKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 T+ Q+E IQ L+DMLRACVIDF + D LPL EFAYNNSY Sbjct: 1344 TDGQAERTIQTLEDMLRACVIDFKSNWDDHLPLIEFAYNNSY 1385 Score = 27.7 bits (60), Expect(4) = 3e-29 Identities = 11/22 (50%), Positives = 15/22 (68%) Frame = -2 Query: 111 EVALTLGLLSEHPIFHVSMFKK 46 E+ L L + HP+FH+SM KK Sbjct: 1501 ELELPQELAAVHPVFHISMLKK 1522 Score = 20.8 bits (42), Expect(4) = 3e-29 Identities = 7/12 (58%), Positives = 10/12 (83%) Frame = -2 Query: 603 TRVELSTTFHPR 568 ++V LST FHP+ Sbjct: 1332 SKVSLSTAFHPQ 1343 >gb|EOY26390.1| Uncharacterized protein TCM_027940 [Theobroma cacao] Length = 1052 Score = 90.1 bits (222), Expect(3) = 2e-28 Identities = 50/105 
(47%), Positives = 64/105 (60%) Frame = -3 Query: 440 SSIEMEPFETLYGRRCCSLIGWFDAFGVSPRGIDLLRDSLDKVKXXXXXXXIAYSRQRSY 261 +SI+M PFE LYGRRC S IGW + G +L++D+ +K++ A SRQ+SY Sbjct: 842 TSIQMAPFEALYGRRCRSPIGWLEVGERKLLGPELVQDATEKIRMIRQKMLTAQSRQKSY 901 Query: 260 ADQKV*DFEFMVCE*VLLKVLLMKGEMRLRKKSKLSSRYIGLSDI 126 AD + D EF V + V LKV KG MR KK KLS RYIG +I Sbjct: 902 ADNRRRDLEFQVGDHVFLKVSPTKGVMRFGKKGKLSPRYIGPFEI 946 Score = 46.2 bits (108), Expect(3) = 2e-28 Identities = 21/31 (67%), Positives = 25/31 (80%) Frame = -1 Query: 535 LKDMLRACVIDFGGH*D*FLPLAEFAYNNSY 443 L+DMLRACVID G + +LPL EFAYNNS+ Sbjct: 810 LEDMLRACVIDLGVRWEQYLPLVEFAYNNSF 840 Score = 36.2 bits (82), Expect(3) = 2e-28 Identities = 16/31 (51%), Positives = 23/31 (74%) Frame = -2 Query: 108 VALTLGLLSEHPIFHVSMFKKYDLEGSHVIQ 16 +AL L + HP+FHVSM +KY+ + SHVI+ Sbjct: 957 LALPPDLSNIHPVFHVSMLRKYNPDPSHVIR 987