BLASTX nr result
ID: Atropa21_contig00037125
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00037125
         (641 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]   167   2e-50
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]     163   3e-49
gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]             145   2e-40
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   130   8e-39
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   136   7e-38
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   136   7e-38
ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244...   162   8e-38
gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]              129   2e-37
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...   134   7e-37
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]   132   9e-37
ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256...   125   2e-36
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         120   2e-36
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    134   4e-36
gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobrom...   131   2e-35
gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobrom...   121   4e-35
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   119   5e-35
gb|EOY31685.1| Uncharacterized protein TCM_038736 [Theobroma cacao]   118   8e-35
gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]        125   8e-35
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...
124 1e-34 gb|AAT38724.1| Putative retrotransposon protein, identical [Sola... 117 1e-34 >gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum] Length = 1475 Score = 167 bits (423), Expect(2) = 2e-50 Identities = 83/140 (59%), Positives = 106/140 (75%) Frame = -1 Query: 422 KEKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN* 243 +++VIAY SRQLK HE+NYPTH+LELAA++F LKI HY+YGV CE+YTD+ SLQY+ + Sbjct: 990 QDRVIAYASRQLKIHERNYPTHDLELAAVVFALKIWRHYLYGVRCEIYTDHRSLQYIMSQ 1049 Query: 242 KDXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQ 63 +D LK+YD++ILY+P K N+VADALSRKAVSMGSL L +EE P MD+Q Sbjct: 1050 RDLNSRQRRWIELLKDYDLSILYHPGKANVVADALSRKAVSMGSLAFLSVEERPLAMDIQ 1109 Query: 62 SLVNRFVRLNISEPDRVLAY 3 L N VRL+IS+ RVLA+ Sbjct: 1110 FLANSMVRLDISDSRRVLAH 1129 Score = 58.5 bits (140), Expect(2) = 2e-50 Identities = 29/75 (38%), Positives = 42/75 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+E + ++A PLTRLT+ V WSE+CE F +E E F +Y Sbjct: 917 RRFVESFSTLATPLTRLTRVDVPFVWSEECEASFLRLKELLTTAPILTLPVEGEGFTVYC 976 Query: 459 DASGVSLVCIDVQRE 415 DASGV L C+ +Q++ Sbjct: 977 DASGVGLGCVLMQQD 991 >gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum] Length = 1771 Score = 163 bits (413), Expect(2) = 3e-49 Identities = 82/138 (59%), Positives = 103/138 (74%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 +VIAY SRQLK HE NYPTH+LELAA++F LKI HY+YGV CE+YTD+ SLQY+ + +D Sbjct: 1148 RVIAYASRQLKIHEHNYPTHDLELAAVVFALKIWRHYLYGVRCEIYTDHRSLQYIMSQRD 1207 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD++ILY+P K N+VADALSRKAVSMGSL L +EE P +D+QSL Sbjct: 1208 LNSRQRRWIELLKDYDLSILYHPGKANVVADALSRKAVSMGSLAFLSVEERPLALDIQSL 1267 Query: 56 VNRFVRLNISEPDRVLAY 3 N VRL+IS+ VLA+ Sbjct: 1268 ANSMVRLDISDSRCVLAF 1285 Score = 58.5 bits (140), Expect(2) = 3e-49 Identities = 31/74 (41%), Positives = 42/74 (56%) Frame = -3 Query: 639 
RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ +IAA LTRLT+ V WSE+CE F +E E F +Y Sbjct: 1073 RRFVEGFSTIAALLTRLTRVDVPFVWSEECEASFLRLKELLTTAPILTLPVEGEGFTVYC 1132 Query: 459 DASGVSLVCIDVQR 418 DASGV L C+ +Q+ Sbjct: 1133 DASGVGLGCVLMQQ 1146 >gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum] Length = 624 Score = 145 bits (365), Expect(2) = 2e-40 Identities = 77/139 (55%), Positives = 94/139 (67%) Frame = -1 Query: 422 KEKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN* 243 ++ VIAY SRQLK HE+NYPTH+LELAA++F LK HY+YGV CEVYTD+ SLQY+F Sbjct: 355 EKNVIAYASRQLKVHERNYPTHDLELAAVVFALKQWRHYLYGVKCEVYTDHRSLQYVFTQ 414 Query: 242 KDXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQ 63 KD LK+YD+TILY+P K N+VA ALSRKA SMGSL L P +VQ Sbjct: 415 KDLNLRQRRWMELLKDYDITILYHPGKANVVAVALSRKAGSMGSLAHLQASRHPLAREVQ 474 Query: 62 SLVNRFVRLNISEPDRVLA 6 L N +RL ++E LA Sbjct: 475 ILANDLMRLEVNEKGGFLA 493 Score = 47.4 bits (111), Expect(2) = 2e-40 Identities = 26/76 (34%), Positives = 41/76 (53%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 RQF++G+ SIA+ LT LT++ V WS +CE FQ +E ++F +Y Sbjct: 282 RQFVKGFSSIASQLTNLTKQNVPFGWSAECEESFQKLKTLLTTAPILTLPVEGKNFIVYC 341 Query: 459 DASGVSLVCIDVQRES 412 DAS L + +Q ++ Sbjct: 342 DASYSGLGAVLMQEKN 357 >ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum] Length = 1946 Score = 130 bits (326), Expect(2) = 8e-39 Identities = 71/127 (55%), Positives = 87/127 (68%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HEKNYPTH+LELAA++F LKI HY+YGVH +V+TD+ SLQY+F KD Sbjct: 1325 KVIAYASRQLKVHEKNYPTHDLELAAVVFALKIWRHYLYGVHVDVFTDHKSLQYVFTQKD 1384 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YDM++ Y+P K N+VADALSR VSMGSL + I + +V L Sbjct: 1385 LNLRQRRWLEFLKDYDMSVHYHPGKANVVADALSR--VSMGSLAHVDIGDREMAREVHRL 1442 Query: 56 VNRFVRL 36 VRL Sbjct: 1443 ARLGVRL 1449 Score = 57.0 bits (136), 
Expect(2) = 8e-39 Identities = 28/73 (38%), Positives = 41/73 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+ G+ SIA+P+T+LTQKK +W+++CE FQ E F +Y Sbjct: 1250 RRFVNGFSSIASPMTKLTQKKAKFEWTDECERSFQTLKDKLVSAPILSLPDGLEGFVVYC 1309 Query: 459 DASGVSLVCIDVQ 421 DAS V L C+ +Q Sbjct: 1310 DASRVGLGCVLMQ 1322 >gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 136 bits (343), Expect(2) = 7e-38 Identities = 72/138 (52%), Positives = 96/138 (69%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 439 KVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 498 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL ++I + ++ SL Sbjct: 499 LNLRQRRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHIFIGRRSLVREIHSL 556 Query: 56 VNRFVRLNISEPDRVLAY 3 + VRL ++E + +LA+ Sbjct: 557 GDIGVRLEVAETNALLAH 574 Score = 47.4 bits (111), Expect(2) = 7e-38 Identities = 23/73 (31%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 364 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYTVFC 423 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 424 DASGVGLGCVLMQ 436 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 136 bits (343), Expect(2) = 7e-38 Identities = 73/138 (52%), Positives = 95/138 (68%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP HNLE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 156 KVIAYASRQLKRHEQNYPIHNLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 215 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL + I + ++ SL Sbjct: 216 LNLRQRRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSL 273 Query: 56 
VNRFVRLNISEPDRVLAY 3 + VRL ++E + +LA+ Sbjct: 274 GDIGVRLEVAETNALLAH 291 Score = 47.4 bits (111), Expect(2) = 7e-38 Identities = 23/73 (31%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 81 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYTVFC 140 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 141 DASGVGLGCVLMQ 153 >ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244956 [Solanum lycopersicum] Length = 933 Score = 162 bits (410), Expect = 8e-38 Identities = 85/140 (60%), Positives = 101/140 (72%) Frame = -1 Query: 422 KEKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN* 243 K KVIAY SRQLK HEKNYP H+L+LA ++F LKI SHY+Y VHCEV+TD+ SL Y+FN Sbjct: 616 KGKVIAYASRQLKVHEKNYPIHDLKLATVVFALKIWSHYLYDVHCEVFTDHRSLHYIFNK 675 Query: 242 KDXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQ 63 +D L +Y+MTILY+P K+N+VADA S KA SMGSL ML E P DVQ Sbjct: 676 RDLNLRQWRWLELLNDYEMTILYHPGKENVVADASSWKAASMGSLAMLQGSEHPLAKDVQ 735 Query: 62 SLVNRFVRLNISEPDRVLAY 3 SL NRFVRL+ SE +VLAY Sbjct: 736 SLANRFVRLDYSEFCKVLAY 755 >gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa] Length = 2037 Score = 129 bits (325), Expect(2) = 2e-37 Identities = 69/138 (50%), Positives = 92/138 (66%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYPTH+LE+ A+IF LKI HY+YG CE++TD+ SL+Y+F +D Sbjct: 1808 KVIAYASRQLKKHEQNYPTHDLEMTAVIFALKIWRHYLYGETCEIFTDHKSLKYIFQQRD 1867 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TI Y+P K N+VADALSRK S GSL + P I ++ L Sbjct: 1868 LNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRK--SSGSLAHIQEVRRPLIRELHEL 1925 Query: 56 VNRFVRLNISEPDRVLAY 3 V+ VR ++SE ++A+ Sbjct: 1926 VDEGVRFDLSEAGAMIAH 1943 Score = 52.4 bits (124), Expect(2) = 2e-37 Identities = 28/73 (38%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+E + 
I+APLT+LTQK V QWSE CE F + +Y Sbjct: 1733 RRFVENFSRISAPLTKLTQKNVKFQWSEACEKSFLELKERLTTAPVLAVPSGSGGYTVYC 1792 Query: 459 DASGVSLVCIDVQ 421 DAS V L C+ +Q Sbjct: 1793 DASRVGLGCVLMQ 1805 >gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1502 Score = 134 bits (336), Expect(2) = 7e-37 Identities = 72/138 (52%), Positives = 94/138 (68%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE NYP H+LE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 954 KVIAYASRQLKRHEHNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 1013 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL + I + ++ SL Sbjct: 1014 LNLRQRRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSL 1071 Query: 56 VNRFVRLNISEPDRVLAY 3 + VRL ++E + +LA+ Sbjct: 1072 GDIGVRLEVAETNALLAH 1089 Score = 46.6 bits (109), Expect(2) = 7e-37 Identities = 23/73 (31%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 879 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYMVFC 938 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 939 DASGVGLGCVLMQ 951 >gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 132 bits (331), Expect(2) = 9e-37 Identities = 71/138 (51%), Positives = 93/138 (67%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+AAI+F LKI HY+YG CE+Y D+ SL+Y+F +D Sbjct: 273 KVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYMDHKSLKYIFQQRD 332 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL + I + ++ SL Sbjct: 333 LNLRQRRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSL 390 Query: 56 VNRFVRLNISEPDRVLAY 3 + VRL ++E +LA+ Sbjct: 391 GDIGVRLEVAETSALLAH 408 Score = 48.1 bits (113), Expect(2) = 9e-37 Identities = 23/73 (31%), Positives = 37/73 (50%) 
Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 198 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTRGYTVFC 257 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 258 DASGVGLGCVLMQ 270 >ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256304 [Solanum lycopersicum] Length = 647 Score = 125 bits (313), Expect(2) = 2e-36 Identities = 69/131 (52%), Positives = 88/131 (67%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQL+ HEKNY TH+LELA +I +KI HY+YGVH ++YTD+ SLQY+F K+ Sbjct: 439 KVIAYASRQLRKHEKNYRTHDLELAVVIHAMKIWMHYLYGVHVDIYTDHKSLQYIFKQKE 498 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD+ ILY+P K N+VADALSRK SMGSL + E + ++Q L Sbjct: 499 LNLRQRRWLELLKDYDIDILYHPGKANIVADALSRK--SMGSLTDVQPERRDMVWEIQWL 556 Query: 56 VNRFVRLNISE 24 + VRL SE Sbjct: 557 SSLGVRLANSE 567 Score = 54.3 bits (129), Expect(2) = 2e-36 Identities = 28/73 (38%), Positives = 39/73 (53%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+E + SI+APL RLTQK QW++ CE FQ + + +Y Sbjct: 364 RRFVEKFASISAPLKRLTQKAAKLQWTDACERSFQLLKDKLTTAPVLTLPEGPDGYVIYC 423 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 424 DASGVGLGCVLMQ 436 >gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum] Length = 4543 Score = 120 bits (301), Expect(2) = 2e-36 Identities = 63/111 (56%), Positives = 82/111 (73%) Frame = -1 Query: 419 EKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*K 240 +KVIAY SRQLKAHEKNYPTH+LELAA++F LKI HY+YGVH +++TD+ SLQY+ K Sbjct: 755 DKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYLYGVHVDIFTDHKSLQYVLTQK 814 Query: 239 DXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEE 87 + LK+Y ++ILY+P K N+VAD+LSR +SMGS +IEE Sbjct: 815 ELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSR--LSMGS--TAHIEE 861 Score = 120 bits (301), Expect(2) = 2e-36 Identities = 63/111 (56%), Positives = 82/111 (73%) Frame = -1 Query: 419 
EKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*K 240 +KVIAY SRQLKAHEKNYPTH+LELAA++F LKI HY+YGVH +++TD+ SLQY+ K Sbjct: 2265 DKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYLYGVHVDIFTDHKSLQYVLTQK 2324 Query: 239 DXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEE 87 + LK+Y ++ILY+P K N+VAD+LSR +SMGS +IEE Sbjct: 2325 ELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSR--LSMGS--TAHIEE 2371 Score = 120 bits (301), Expect(2) = 2e-36 Identities = 63/111 (56%), Positives = 82/111 (73%) Frame = -1 Query: 419 EKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*K 240 +KVIAY SRQLKAHEKNYPTH+LELAA++F LKI HY+YGVH +++TD+ SLQY+ K Sbjct: 3775 DKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYLYGVHVDIFTDHKSLQYVLTQK 3834 Query: 239 DXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEE 87 + LK+Y ++ILY+P K N+VAD+LSR +SMGS +IEE Sbjct: 3835 ELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSR--LSMGS--TAHIEE 3881 Score = 58.5 bits (140), Expect(2) = 2e-36 Identities = 30/75 (40%), Positives = 42/75 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ +IA+PLT+LTQK V QWSE CE FQ + F ++ Sbjct: 681 RRFVEGFSTIASPLTKLTQKTVKLQWSEACEKSFQELKKRLTTAPVLTLPEGTQGFVVHC 740 Query: 459 DASGVSLVCIDVQRE 415 DAS V L C+ +Q + Sbjct: 741 DASRVGLGCVLMQND 755 Score = 58.5 bits (140), Expect(2) = 2e-36 Identities = 30/75 (40%), Positives = 42/75 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ +IA+PLT+LTQK V QWSE CE FQ + F ++ Sbjct: 2191 RRFVEGFSTIASPLTKLTQKTVKLQWSEACEKSFQELKKRLTTAPVLTLPEGTQGFVVHC 2250 Query: 459 DASGVSLVCIDVQRE 415 DAS V L C+ +Q + Sbjct: 2251 DASRVGLGCVLMQND 2265 Score = 58.5 bits (140), Expect(2) = 2e-36 Identities = 30/75 (40%), Positives = 42/75 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ +IA+PLT+LTQK V QWSE CE FQ + F ++ Sbjct: 3701 RRFVEGFSTIASPLTKLTQKTVKLQWSEACEKSFQELKKRLTTAPVLTLPEGTQGFVVHC 3760 Query: 459 DASGVSLVCIDVQRE 415 DAS V L C+ +Q + Sbjct: 3761 DASRVGLGCVLMQND 3775 
>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 134 bits (338), Expect(2) = 4e-36 Identities = 72/138 (52%), Positives = 95/138 (68%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 959 KVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 1018 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL + I + ++ SL Sbjct: 1019 LNLRQHRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSL 1076 Query: 56 VNRFVRLNISEPDRVLAY 3 + VRL ++E + +LA+ Sbjct: 1077 GDIGVRLEVAETNALLAH 1094 Score = 43.5 bits (101), Expect(2) = 4e-36 Identities = 22/73 (30%), Positives = 36/73 (49%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I A LT+LT+K +WS+ CE F+ + ++ Sbjct: 884 RRFVKDFSKIVALLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLLQGTGGYTVFC 943 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 944 DASGVGLGCVLMQ 956 >gb|EOX94130.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1401 Score = 131 bits (330), Expect(2) = 2e-35 Identities = 70/138 (50%), Positives = 93/138 (67%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+A I+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 827 KVIAYASRQLKRHEQNYPIHDLEMATIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 886 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VAD LSRK SMGSL + I + ++ SL Sbjct: 887 LNLRQRRWMELLKDYDCTILYHPGKANVVADVLSRK--SMGSLAHISIGRRSLVREIHSL 944 Query: 56 VNRFVRLNISEPDRVLAY 3 + VRL ++E + +LA+ Sbjct: 945 GDIGVRLEVAETNALLAH 962 Score = 44.3 bits (103), Expect(2) = 2e-35 Identities = 22/73 (30%), Positives = 36/73 (49%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 752 
RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYTVFC 811 Query: 459 DASGVSLVCIDVQ 421 DAS V L C+ +Q Sbjct: 812 DASRVGLGCVLMQ 824 >gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1336 Score = 121 bits (303), Expect(2) = 4e-35 Identities = 61/122 (50%), Positives = 85/122 (69%), Gaps = 1/122 (0%) Frame = -1 Query: 419 EKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*K 240 EKV+AY SRQLK HE NYPTH+LELAA++F LKI HY+YG HC+++TD+ SL+YL K Sbjct: 855 EKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYLYGEHCQIFTDHKSLKYLLTQK 914 Query: 239 DXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRK-AVSMGSLDMLYIEECPNIMDVQ 63 + +K+YD+ I Y+P K N+VADALSRK + S+ +L Y P +++++ Sbjct: 915 EINLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSSSSLAALQNCYF---PALIEMK 971 Query: 62 SL 57 SL Sbjct: 972 SL 973 Score = 53.5 bits (127), Expect(2) = 4e-35 Identities = 29/75 (38%), Positives = 41/75 (54%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++G+ IAAPLTRLT+K V W + CE RFQ + + F +Y Sbjct: 781 RRFVQGFSLIAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTFAPVLTLPVNGKGFVVYS 840 Query: 459 DASGVSLVCIDVQRE 415 DAS + L C+ +Q E Sbjct: 841 DASKLGLGCVLMQDE 855 >gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 119 bits (298), Expect(2) = 5e-35 Identities = 61/110 (55%), Positives = 80/110 (72%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HEKNYPTH+LELA ++F LK+ HY+YGVH +++TD+ SLQY+ K+ Sbjct: 974 KVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYLYGVHVDIFTDHKSLQYVLTQKE 1033 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEE 87 LK+YD++ILY+P K N+VAD+LSR +SMGS +IEE Sbjct: 1034 LNLRQRRWLELLKDYDLSILYHPGKANVVADSLSR--LSMGS--TTHIEE 1079 Score = 55.1 bits (131), Expect(2) = 5e-35 Identities = 29/73 (39%), Positives = 39/73 (53%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ SIA+PLT+LTQK QWSE CE FQ + +Y Sbjct: 899 
RRFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTLPEGTQGLVVYC 958 Query: 459 DASGVSLVCIDVQ 421 DAS + L C+ +Q Sbjct: 959 DASRIGLGCVLMQ 971 >gb|EOY31685.1| Uncharacterized protein TCM_038736 [Theobroma cacao] Length = 1486 Score = 118 bits (296), Expect(2) = 8e-35 Identities = 65/138 (47%), Positives = 89/138 (64%) Frame = -1 Query: 419 EKVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*K 240 EKVIAY SRQLK HE NYPTH+LELAA++F LKI HY+YG C +++D+ SL+YL K Sbjct: 906 EKVIAYASRQLKKHETNYPTHDLELAAVVFALKIWRHYLYGERCRIFSDHKSLKYLLTHK 965 Query: 239 DXXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQS 60 + +K+YD+ I Y+P K N+VADALSRK S SL L P +++++S Sbjct: 966 ELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRK--SSSSLATLRSSYFPMLLEMKS 1023 Query: 59 LVNRFVRLNISEPDRVLA 6 L ++LN E +L+ Sbjct: 1024 L---GIQLNNGEDGTLLS 1038 Score = 55.1 bits (131), Expect(2) = 8e-35 Identities = 29/75 (38%), Positives = 42/75 (56%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++G+ IAAPLTRLT+K V +W + CE RFQ + + F +Y Sbjct: 832 RRFVQGFSLIAAPLTRLTRKGVKYEWDDVCENRFQELKNRLTSAPVLTLPVSGKEFVVYS 891 Query: 459 DASGVSLVCIDVQRE 415 DAS + L C+ +Q E Sbjct: 892 DASKLGLGCVLMQDE 906 >gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao] Length = 508 Score = 125 bits (314), Expect(2) = 8e-35 Identities = 66/120 (55%), Positives = 83/120 (69%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 376 KVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 435 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMDVQSL 57 LK+YD TILY+P K N+VADALSRK SMGSL + I + ++ SL Sbjct: 436 LNLRQRRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSL 493 Score = 48.1 bits (113), Expect(2) = 8e-35 Identities = 23/73 (31%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + ++ Sbjct: 
301 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTRGYTVFC 360 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 361 DASGVGLGCVLMQ 373 >gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 124 bits (311), Expect(2) = 1e-34 Identities = 67/116 (57%), Positives = 82/116 (70%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HE+NYP H+LE+AAI+F LKI HY+YG CE+YTD+ SL+Y+F +D Sbjct: 862 KVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRD 921 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEECPNIMD 69 LK+YD TILY+P K N+VADALSRK SMGSL + I P +MD Sbjct: 922 LNLRQCRWMELLKDYDCTILYHPGKANVVADALSRK--SMGSLAHISIVR-PILMD 974 Score = 48.9 bits (115), Expect(2) = 1e-34 Identities = 24/73 (32%), Positives = 37/73 (50%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F++ + I APLT+LT+K +WS+ CE F+ + M+ Sbjct: 787 RRFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYTMFC 846 Query: 459 DASGVSLVCIDVQ 421 DASGV L C+ +Q Sbjct: 847 DASGVGLGCVLMQ 859 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 117 bits (294), Expect(2) = 1e-34 Identities = 61/110 (55%), Positives = 79/110 (71%) Frame = -1 Query: 416 KVIAYGSRQLKAHEKNYPTHNLELAAIIFILKI*SHYVYGVHCEVYTDYHSLQYLFN*KD 237 KVIAY SRQLK HEKNYPTH+LELA ++F LK+ HY+YGVH +++TD+ SLQY+ K Sbjct: 980 KVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYLYGVHVDIFTDHKSLQYVLTQKA 1039 Query: 236 XXXXXXXXXXXLKNYDMTILYYPSKKNMVADALSRKAVSMGSLDMLYIEE 87 LK+YD++ILY+P K N+VAD+LSR +SMGS +IEE Sbjct: 1040 LNLRQRRWLELLKDYDLSILYHPGKANVVADSLSR--LSMGS--TTHIEE 1085 Score = 55.1 bits (131), Expect(2) = 1e-34 Identities = 29/73 (39%), Positives = 39/73 (53%) Frame = -3 Query: 639 RQFIEGYLSIAAPLTRLTQKKVTCQWSEDCEVRFQXXXXXXXXXXXXXXXIERESFNMYF 460 R+F+EG+ SIA+PLT+LTQK QWSE CE FQ + +Y Sbjct: 905 RRFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTLPEGTQGLVVYC 964 Query: 459 DASGVSLVCIDVQ 421 DAS + L 
C+ +Q Sbjct: 965 DASRIGLGCVLMQ 977