BLASTX nr result
ID: Atropa21_contig00029413
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00029413 (719 letters)

Database: nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...   190   4e-46
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         190   4e-46
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   182   1e-43
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   181   2e-43
ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256...   176   9e-42
emb|CAH66066.1| OSIGBa0092O07.1 [Oryza sativa Indica Group] gi|1...   125   2e-41
gb|ABA91307.1| retrotransposon protein, putative, Ty3-gypsy subc...   128   9e-41
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    171   2e-40
gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]                  171   3e-40
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...   171   3e-40
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   170   5e-40
gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa]              169   6e-40
gb|EOY32249.1| CCHC-type integrase [Theobroma cacao]                  169   8e-40
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   169   8e-40
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]   169   1e-39
gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]        166   7e-39
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...   151   2e-34
gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative...   149   1e-33
gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom...   147   3e-33
emb|CAA73042.1| polyprotein [Ananas comosus]                          146   7e-33

>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
Length = 1946

Score = 190 bits (482), Expect = 4e-46
Identities = 103/147 (70%), Positives = 113/147 (76%), Gaps = 1/147 (0%)
Frame = -3

Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
EGFVVYCDAS VGLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 1303 EGFVVYCDASRVGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAAVVFALKIWRHYL 1362

Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
YGVHV++FT+HKSL+Y+F Q LNL L+ G ANVVADALSR SM
Sbjct: 1363 YGVHVDVFTDHKSLQYVFTQKDLNLRQRRWLEFLKDYDMSVHYHPGKANVVADALSRVSM 1422

Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRL 280
SL HV+ R MA+EVHRLA LGVRL
Sbjct: 1423 GSLAHVDIGDREMAREVHRLARLGVRL 1449

Score = 104 bits (259), Expect = 3e-20
Identities = 70/170 (41%), Positives = 91/170 (53%), Gaps = 8/170 (4%)
Frame = -2

Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GE 317
F ++ R RRWLE LK+YD+ V HPG +V+ + L + H G+
Sbjct: 1380 FTQKDLNLRQRRWLEFLKDYDMSVHYHPGKA-------NVVADALSRVSMGSLAHVDIGD 1432

Query: 316 GSTPISQFRSSTFGLE-----DSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHK 152
R + G+ + VV + SSLV E KQ D++LL+LK + + K
Sbjct: 1433 REMAREVHRLARLGVRLEEVGNGGVVVVDGARSSLVDEVIAKQDLDSSLLELKALVKEGK 1492

Query: 151 TTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2
F QGGD G LRY+GRLCVP VDGLRE+I+ E H S YSIH GS KMY
Sbjct: 1493 VEVFSQGGD-GALRYQGRLCVPCVDGLREKILEEAHNSSYSIHPGSTKMY 1541

>gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]
Length = 4543

Score = 190 bits (482), Expect = 4e-46
Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%)
Frame = -3

Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538
+GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL
Sbjct: 734 QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 793

Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361
YGVHV+IFT+HKSL+Y+ Q +LNL L+ + G ANVVAD+LSR SM Sbjct: 794 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 853 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274 S H+EE R + K+VHRLA LGVR D Sbjct: 854 GSTAHIEEGRRELTKDVHRLACLGVRFTD 882 Score = 190 bits (482), Expect = 4e-46 Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL Sbjct: 2244 QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 2303 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YGVHV+IFT+HKSL+Y+ Q +LNL L+ + G ANVVAD+LSR SM Sbjct: 2304 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 2363 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274 S H+EE R + K+VHRLA LGVR D Sbjct: 2364 GSTAHIEEGRRELTKDVHRLACLGVRFTD 2392 Score = 190 bits (482), Expect = 4e-46 Identities = 99/149 (66%), Positives = 114/149 (76%), Gaps = 1/149 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +GFVV+CDAS VGLGCVLMQ+DKVIAYASRQLK HEKNYPTHDLELAAV+FALKIW HYL Sbjct: 3754 QGFVVHCDASRVGLGCVLMQNDKVIAYASRQLKAHEKNYPTHDLELAAVVFALKIWRHYL 3813 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YGVHV+IFT+HKSL+Y+ Q +LNL L+ + G ANVVAD+LSR SM Sbjct: 3814 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYVLSILYHPGKANVVADSLSRLSM 3873 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274 S H+EE R + K+VHRLA LGVR D Sbjct: 3874 GSTAHIEEGRRELTKDVHRLACLGVRFTD 3902 Score = 122 bits (306), Expect = 1e-25 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299 E+ R RRWLELLK+Y + +L HPG +V+ + L + H EG ++ Sbjct: 815 ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 867 Query: 298 Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140 + R + G+ +D + NR ESSLV E K+KQ D LL+LK + K + AF Sbjct: 868 
KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 927 Query: 139 EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY Sbjct: 928 EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 972 Score = 122 bits (306), Expect = 1e-25 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299 E+ R RRWLELLK+Y + +L HPG +V+ + L + H EG ++ Sbjct: 2325 ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 2377 Query: 298 Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140 + R + G+ +D + NR ESSLV E K+KQ D LL+LK + K + AF Sbjct: 2378 KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 2437 Query: 139 EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY Sbjct: 2438 EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 2482 Score = 122 bits (306), Expect = 1e-25 Identities = 75/166 (45%), Positives = 98/166 (59%), Gaps = 8/166 (4%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH-GEGSTPIS 299 E+ R RRWLELLK+Y + +L HPG +V+ + L + H EG ++ Sbjct: 3835 ELNLRQRRWLELLKDYVLSILYHPGKA-------NVVADSLSRLSMGSTAHIEEGRRELT 3887 Query: 298 Q--FRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140 + R + G+ +D + NR ESSLV E K+KQ D LL+LK + K + AF Sbjct: 3888 KDVHRLACLGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAF 3947 Query: 139 EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 EQGGD G LRY+GRLCVP VDGL+E+IM E H S YS+H GS KMY Sbjct: 3948 EQGGD-GALRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMY 3992 >gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 182 bits (461), Expect = 1e-43 Identities = 95/149 (63%), Positives = 111/149 (74%), Gaps = 1/149 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +G VVYCDAS +GLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELA V+FALK+W HYL Sbjct: 952 
QGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYL 1011 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YGVHV+IFT+HKSL+Y+ Q +LNL L+ G ANVVAD+LSR SM Sbjct: 1012 YGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELLKDYDLSILYHPGKANVVADSLSRLSM 1071 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274 S H+EE R +AK++HRLA LGVR D Sbjct: 1072 GSTTHIEEGRRELAKDMHRLACLGVRFTD 1100 Score = 125 bits (314), Expect = 1e-26 Identities = 73/166 (43%), Positives = 96/166 (57%), Gaps = 8/166 (4%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GEGSTP 305 E+ R RRWLELLK+YD+ +L HPG +V+ + L + H G Sbjct: 1033 ELNLRQRRWLELLKDYDLSILYHPGKA-------NVVADSLSRLSMGSTTHIEEGRRELA 1085 Query: 304 ISQFRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAF 140 R + G+ +D + ++ ESSL++E KEKQ D LL+LK + K + AF Sbjct: 1086 KDMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAF 1145 Query: 139 EQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 EQGGD G LRY+GRLCVP VDGL+ER+M E H S YS+H GS KMY Sbjct: 1146 EQGGD-GVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMY 1190 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 181 bits (459), Expect = 2e-43 Identities = 95/149 (63%), Positives = 110/149 (73%), Gaps = 1/149 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +G VVYCDAS +GLGCVLMQ+ KVIAYASRQLK HEKNYPTHDLELA V+FALK+W HYL Sbjct: 958 QGLVVYCDASRIGLGCVLMQNGKVIAYASRQLKVHEKNYPTHDLELAVVVFALKLWRHYL 1017 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YGVHV+IFT+HKSL+Y+ Q LNL L+ G ANVVAD+LSR SM Sbjct: 1018 YGVHVDIFTDHKSLQYVLTQKALNLRQRRWLELLKDYDLSILYHPGKANVVADSLSRLSM 1077 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRLLD 274 S H+EE R +AK++HRLA LGVR D Sbjct: 1078 GSTTHIEEGRRELAKDMHRLACLGVRFTD 1106 Score = 123 bits (309), Expect = 5e-26 Identities = 72/165 (43%), Positives = 95/165 (57%), Gaps = 8/165 (4%) Frame = -2 Query: 472 
VEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENH---GEGSTPI 302 + R RRWLELLK+YD+ +L HPG +V+ + L + H G Sbjct: 1040 LNLRQRRWLELLKDYDLSILYHPGKA-------NVVADSLSRLSMGSTTHIEEGRRELAK 1092 Query: 301 SQFRSSTFGLEDSD-----VVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFE 137 R + G+ +D + ++ ESSL++E KEKQ D LL+LK + K + AFE Sbjct: 1093 DMHRLACLGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAFE 1152 Query: 136 QGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 QGGD G LRY+GRLCVP VDGL+ER+M E H S YS+H GS KMY Sbjct: 1153 QGGD-GVLRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMY 1196 >ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256304 [Solanum lycopersicum] Length = 647 Score = 176 bits (445), Expect = 9e-42 Identities = 94/147 (63%), Positives = 111/147 (75%), Gaps = 1/147 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +G+V+YCDASGVGLGCVLMQH KVIAYASRQL+ HEKNY THDLELA VI A+KIW HYL Sbjct: 417 DGYVIYCDASGVGLGCVLMQHGKVIAYASRQLRKHEKNYRTHDLELAVVIHAMKIWMHYL 476 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTM-I*MFSAIQGANVVADALSR*SM 361 YGVHV+I+T+HKSL+YIFKQ +LNL L+ I + AN+VADALSR SM Sbjct: 477 YGVHVDIYTDHKSLQYIFKQKELNLRQRRWLELLKDYDIDILYHPGKANIVADALSRKSM 536 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRL 280 SL V+ + R M E+ L+SLGVRL Sbjct: 537 GSLTDVQPERRDMVWEIQWLSSLGVRL 563 >emb|CAH66066.1| OSIGBa0092O07.1 [Oryza sativa Indica Group] gi|116309115|emb|CAH66220.1| OSIGBa0157N01.6 [Oryza sativa Indica Group] Length = 1445 Score = 125 bits (314), Expect(2) = 2e-41 Identities = 59/83 (71%), Positives = 66/83 (79%) Frame = -3 Query: 711 FVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLYG 532 F VYCDAS GLGCVLMQ +V+AYASRQL+PH NYPTHDLELAAV+ ALKIW HYL G Sbjct: 875 FQVYCDASRQGLGCVLMQEGRVVAYASRQLRPHVTNYPTHDLELAAVVHALKIWRHYLIG 934 Query: 531 VHVNIFTNHKSLKYIFKQWKLNL 463 ++T+HKSLKYIF Q LNL Sbjct: 935 NRCEVYTDHKSLKYIFTQPDLNL 957 Score = 70.5 bits (171), Expect(2) = 2e-41 Identities = 45/162 (27%), Positives = 69/162 (42%) Frame = -2 Query: 487 
FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308 F ++ R RRWLEL+K+YD+ + HPG Sbjct: 950 FTQPDLNLRQRRWLELIKDYDMGIHYHPGK-----------------------------A 980 Query: 307 PISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQGG 128 + G+ + V +LV + + Q ND + +LK+ + K F + Sbjct: 981 NVQDLEHLNLGIVEHGYVAALEARPTLVDQVRAAQVNDPKIAELKKNMRVGKAREFHED- 1039 Query: 127 DDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 + GT+ RLCVP+ L++ I+ E H + YSIH GS KMY Sbjct: 1040 EHGTIWLGERLCVPDDKELKDLILTEAHQTQYSIHPGSTKMY 1081 >gb|ABA91307.1| retrotransposon protein, putative, Ty3-gypsy subclass [Oryza sativa Japonica Group] Length = 1284 Score = 128 bits (321), Expect(2) = 9e-41 Identities = 60/83 (72%), Positives = 69/83 (83%) Frame = -3 Query: 711 FVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLYG 532 FV+YCDAS GLG VLMQ KV+AYASRQL+PHE+NYPTHDLELAAV+ AL IW HYL G Sbjct: 838 FVIYCDASRQGLGGVLMQDGKVVAYASRQLRPHEENYPTHDLELAAVVHALNIWRHYLIG 897 Query: 531 VHVNIFTNHKSLKYIFKQWKLNL 463 H +I+T+HK+LKYIF Q LNL Sbjct: 898 NHCDIYTDHKNLKYIFTQSDLNL 920 Score = 65.9 bits (159), Expect(2) = 9e-41 Identities = 50/164 (30%), Positives = 74/164 (45%), Gaps = 2/164 (1%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308 F ++ R RRWLEL+K+YD++V HPG +V+ + L N EG Sbjct: 959 FIQSDLNLRQRRWLELIKDYDLEVHYHPGKA-------NVVADALSRKSHCNHLEMEGMA 1011 Query: 307 PISQFRSSTFGLE--DSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQ 134 P + + L V + L + +E Q ++ + ++KE + F Sbjct: 1012 PELKEELAQLNLHIVPRGQVNTLDIQPLLRTQIEEAQKDNEEIREVKERLAAGFAKEFST 1071 Query: 133 GGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 D L Y+ R+ VP GLR I+ E H S YS+H GS KMY Sbjct: 1072 DEKD-VLWYKKRIYVPKQGGLRGLILKEAHESAYSLHPGSTKMY 1114 >gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 171 bits (433), Expect = 2e-40 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ 
V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 938 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 997 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 998 GETCEIYTDHKSLKYIFQQRDLNLRQHRWMELLKDYDCTILYHPGKANVVADALSRKSMG 1057 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 1058 SLAHISIGRRSLVREIHSLGDIGVRL 1083 Score = 78.6 bits (192), Expect = 2e-12 Identities = 57/168 (33%), Positives = 81/168 (48%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 1014 FQQRDLNLRQHRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 1073 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ KE Q D +++ E K Sbjct: 1074 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 1128 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G++KMY Sbjct: 1129 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGALKMY 1175 >gb|EOY31663.1| CCHC-type integrase [Theobroma cacao] Length = 395 Score = 171 bits (432), Expect = 3e-40 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASG+GLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 44 GYTVFCDASGIGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 103 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 104 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 163 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 164 SLAHISIGRRSLVREIHSLGDIGVRL 189 Score = 79.3 bits (194), Expect = 1e-12 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%) Frame = -2 Query: 487 
FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 120 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 179 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ KE Q D +++ E K Sbjct: 180 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 234 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 235 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 281 >gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1502 Score = 171 bits (432), Expect = 3e-40 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G++V+CDASGVGLGCVLMQH KVIAYASRQLK HE NYP HDLE+AA++FALKIW HYLY Sbjct: 933 GYMVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEHNYPIHDLEMAAIVFALKIWRHYLY 992 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 993 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 1052 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 1053 SLAHISIGRRSLVREIHSLGDIGVRL 1078 Score = 79.3 bits (194), Expect = 1e-12 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 1009 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 1068 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ KE Q D +++ E K Sbjct: 1069 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 1123 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 1124 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 1170 >gb|EOY21478.1| DNA/RNA polymerases superfamily 
protein [Theobroma cacao] Length = 878 Score = 170 bits (430), Expect = 5e-40 Identities = 89/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 418 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 477 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 478 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 537 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 538 SLAHIFIGRRSLVREIHSLGDIGVRL 563 Score = 78.2 bits (191), Expect = 2e-12 Identities = 58/168 (34%), Positives = 81/168 (48%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 494 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHIFIGRRSLVREI 553 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ KE Q D +++ E K Sbjct: 554 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGK 608 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 609 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 655 >gb|ABG37663.1| CCHC-type integrase [Populus trichocarpa] Length = 2037 Score = 169 bits (429), Expect = 6e-40 Identities = 91/145 (62%), Positives = 102/145 (70%), Gaps = 1/145 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ VYCDAS VGLGCVLMQH KVIAYASRQLK HE+NYPTHDLE+ AVIFALKIW HYLY Sbjct: 1787 GYTVYCDASRVGLGCVLMQHGKVIAYASRQLKKHEQNYPTHDLEMTAVIFALKIWRHYLY 1846 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G IFT+HKSLKYIF+Q LNL L+ G ANVVADALSR S Sbjct: 1847 GETCEIFTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRKSSG 1906 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVR 283 SL H++E 
R + +E+H L GVR Sbjct: 1907 SLAHIQEVRRPLIRELHELVDEGVR 1931 Score = 86.3 bits (212), Expect = 9e-15 Identities = 61/171 (35%), Positives = 88/171 (51%), Gaps = 9/171 (5%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPGSE------CGSRCS*SVIH--EKLRTC*RT- 335 FQ ++ R RRW+ELLK+YD + HPG + S S+ H E R R Sbjct: 1863 FQQRDLNLRQRRWMELLKDYDCTIHYHPGKANVVADALSRKSSGSLAHIQEVRRPLIREL 1922 Query: 334 NENHGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKH 155 +E EG +F S G ++ + +S L + K Q D +LL+++ + + Sbjct: 1923 HELVDEGV----RFDLSEAGA----MIAHFQVKSDLFDKIKAAQKKDDSLLRIRNEVEQG 1974 Query: 154 KTTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 K F G DD LRY+ RLCVP+VD LR +M E H ++Y++H GS KMY Sbjct: 1975 KAAGFVIGDDD-VLRYKDRLCVPDVDDLRRELMVEAHQTVYTVHPGSTKMY 2024 >gb|EOY32249.1| CCHC-type integrase [Theobroma cacao] Length = 282 Score = 169 bits (428), Expect = 8e-40 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 22 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 81 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q L+L L+ G ANVVADALSR SM Sbjct: 82 GETCEIYTDHKSLKYIFQQRDLDLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 141 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 142 SLAHISIGRRSLVREIHSLGDIGVRL 167 Score = 80.5 bits (197), Expect = 5e-13 Identities = 58/168 (34%), Positives = 82/168 (48%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ +++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 98 FQQRDLDLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 157 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ KE Q D +++ E K Sbjct: 158 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDRIKEAQSKDEFMIKALEDPQGRKGK 212 Query: 145 
AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 213 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 259 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 169 bits (428), Expect = 8e-40 Identities = 88/146 (60%), Positives = 105/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP H+LE+AA++FALKIW HYLY Sbjct: 135 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHNLEMAAIVFALKIWRHYLY 194 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 195 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 254 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 255 SLAHISIGRRSLVREIHSLGDIGVRL 280 Score = 80.5 bits (197), Expect = 5e-13 Identities = 58/168 (34%), Positives = 83/168 (49%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 211 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 270 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T L + R L+ + KE Q D +++ E K Sbjct: 271 HSLGDIGVRLEVAETNAL-----LAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGK 325 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR +I+ E H++ Y +H G+ KMY Sbjct: 326 MFTKG-TDGVLRYGTRLYVPDGDGLRRKILEEAHMAAYVVHPGATKMY 372 >gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 169 bits (427), Expect = 1e-39 Identities = 88/146 (60%), Positives = 104/146 (71%), Gaps = 1/146 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 252 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 311 Query: 534 
GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+ +HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 312 GETCEIYMDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 371 Query: 357 SLEHVEEQTRTMAKEVHRLASLGVRL 280 SL H+ R++ +E+H L +GVRL Sbjct: 372 SLAHISIGRRSLVREIHSLGDIGVRL 397 Score = 80.5 bits (197), Expect = 5e-13 Identities = 58/168 (34%), Positives = 83/168 (49%), Gaps = 6/168 (3%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*-SVIHEKLRTC*RTNEN 326 FQ ++ R RRW+ELLK+YD +L HPG ++ SR S S+ H + E Sbjct: 328 FQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSLVREI 387 Query: 325 HGEGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTT 146 H G + + T S ++ R L+ + KE Q D +++ E K Sbjct: 388 HSLGDIGVRLEVAET-----SALLAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGK 442 Query: 145 AFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 F +G DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 443 MFTKG-TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 489 >gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao] Length = 508 Score = 166 bits (420), Expect = 7e-39 Identities = 86/143 (60%), Positives = 102/143 (71%), Gaps = 1/143 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ V+CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP HDLE+AA++FALKIW HYLY Sbjct: 355 GYTVFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 414 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 415 GETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADALSRKSMG 474 Query: 357 SLEHVEEQTRTMAKEVHRLASLG 289 SL H+ R++ +E+H L +G Sbjct: 475 SLAHISIGRRSLVREIHSLGDIG 497 >gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1447 Score = 151 bits (382), Expect = 2e-34 Identities = 80/125 (64%), Positives = 92/125 (73%), Gaps = 1/125 (0%) Frame = -3 Query: 714 GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+ ++CDASGVGLGCVLMQH KVIAYASRQLK HE+NYP 
HDLE+AA++FALKIW HYLY Sbjct: 841 GYTMFCDASGVGLGCVLMQHGKVIAYASRQLKRHEQNYPIHDLEMAAIVFALKIWRHYLY 900 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G I+T+HKSLKYIF+Q LNL L+ G ANVVADALSR SM Sbjct: 901 GETCEIYTDHKSLKYIFQQRDLNLRQCRWMELLKDYDCTILYHPGKANVVADALSRKSMG 960 Query: 357 SLEHV 343 SL H+ Sbjct: 961 SLAHI 965 Score = 70.5 bits (171), Expect = 5e-10 Identities = 52/162 (32%), Positives = 73/162 (45%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPGSECGSRCS*SVIHEKLRTC*RTNENHGEGST 308 FQ ++ R RW+ELLK+YD +L HPG +V+ + L Sbjct: 917 FQQRDLNLRQCRWMELLKDYDCTILYHPGKA-------NVVADALS-------------- 955 Query: 307 PISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQGG 128 R S L +V L+ + KE Q D +++ E K F +G Sbjct: 956 -----RKSMGSLAHISIV-----RPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKG- 1004 Query: 127 DDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 DG LRY RL VP+ DGLR I+ E H++ Y +H G+ KMY Sbjct: 1005 TDGVLRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMY 1046 >gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao] Length = 1347 Score = 149 bits (375), Expect = 1e-33 Identities = 84/147 (57%), Positives = 100/147 (68%), Gaps = 1/147 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +GFVVY DAS +GLGCVLMQ +KV+AYASRQLK HE NYPTHDLELAAV+FALKIW HYL Sbjct: 725 KGFVVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYL 784 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YG H IFT+HKSLKY+ Q +LNL ++ + G ANVVADALSR S Sbjct: 785 YGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSS 844 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRL 280 SL ++ + + SLGV+L Sbjct: 845 SSLAALQS---CYFSALIEMKSLGVQL 868 Score = 67.8 bits (164), Expect = 3e-09 Identities = 51/163 (31%), Positives = 81/163 (49%), Gaps = 5/163 (3%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENHGEGS 311 E+ R RRWLEL+K+YD+ + HPG ++ SR S S + L++C + Sbjct: 806 
ELNLRQRRWLELIKDYDLVIDYHPGKANVVADALSRKSSSSL-AALQSC------YFSAL 858 Query: 310 TPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQG 131 + ED V+ SL+ + K+ Q +D L + + + + F + Sbjct: 859 IEMKSLGVQLRNGEDGSVLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVSEF-RF 917 Query: 130 GDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 G+D L +R R+CVP + LR+ IM E H S Y+++ GS KMY Sbjct: 918 GEDNVLMFRDRVCVPEGNQLRQTIMEEAHSSAYALNPGSTKMY 960 >gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 147 bits (372), Expect = 3e-33 Identities = 83/147 (56%), Positives = 100/147 (68%), Gaps = 1/147 (0%) Frame = -3 Query: 717 EGFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYL 538 +GF+VY DAS +GLGCVLMQ +KV+AYASRQLK HE NYPTHDLELAAV+FALKIW HYL Sbjct: 871 KGFIVYSDASKLGLGCVLMQDEKVVAYASRQLKRHEANYPTHDLELAAVVFALKIWRHYL 930 Query: 537 YGVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SM 361 YG H IFT+HKSLKY+ Q +LNL ++ + G ANVVADALSR S Sbjct: 931 YGEHCRIFTDHKSLKYLLTQKELNLRQRRWLELIKDYDLVIDYHLGKANVVADALSRKSS 990 Query: 360 KSLEHVEEQTRTMAKEVHRLASLGVRL 280 SL ++ + + SLGV+L Sbjct: 991 SSLAALQS---CYFPALIEMKSLGVQL 1014 Score = 63.9 bits (154), Expect = 5e-08 Identities = 49/163 (30%), Positives = 80/163 (49%), Gaps = 5/163 (3%) Frame = -2 Query: 475 EVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENHGEGS 311 E+ R RRWLEL+K+YD+ + H G ++ SR S S + L++C + Sbjct: 952 ELNLRQRRWLELIKDYDLVIDYHLGKANVVADALSRKSSSSL-AALQSC------YFPAL 1004 Query: 310 TPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKHKTTAFEQG 131 + ED ++ SL+ + K+ Q +D L + + + + F + Sbjct: 1005 IEMKSLGVQLRNGEDGSLLANFIVRPSLLNQIKDIQRSDDELRKEIQKLTDGGVSEF-RF 1063 Query: 130 GDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 G+D L ++ R+CVP + LR+ IM E H S Y++H GS KMY Sbjct: 1064 GEDNVLMFKDRVCVPEGNQLRQAIMEEAHSSAYALHPGSTKMY 1106 >emb|CAA73042.1| polyprotein [Ananas comosus] Length = 871 Score = 146 bits (368), Expect = 7e-33 Identities = 84/140 (60%), Positives = 100/140 (71%), Gaps = 2/140 (1%) Frame = -3 Query: 714 
GFVVYCDASGVGLGCVLMQHDKVIAYASRQLKPHEKNYPTHDLELAAVIFALKIWHHYLY 535 G+VVY DAS GLGCVLMQ DKVIAYASRQLK +EKNYPTHDLELAAV+FALK+W HYLY Sbjct: 312 GYVVYSDASLNGLGCVLMQDDKVIAYASRQLKEYEKNYPTHDLELAAVVFALKLWRHYLY 371 Query: 534 GVHVNIFTNHKSLKYIFKQWKLNLG*EDGSNCLRTMI*MFSAIQG-ANVVADALSR*SMK 358 G ++T+HKSLKY+F Q +LNL L+ G ANVVADALSR SM+ Sbjct: 372 GERCEVYTDHKSLKYLFTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALSRKSME 431 Query: 357 SLE-HVEEQTRTMAKEVHRL 301 +L HV Q R + +++ RL Sbjct: 432 NLAMHVVTQPR-LIEQMKRL 450 Score = 79.7 bits (195), Expect = 9e-13 Identities = 60/171 (35%), Positives = 85/171 (49%), Gaps = 9/171 (5%) Frame = -2 Query: 487 FQAMEVEFRLRRWLELLKNYDIDVLCHPG-----SECGSRCS*SVIHEKLRTC*RTNENH 323 F E+ R RRWLELLK+YD+ +L HPG ++ SR S + + T R E Sbjct: 388 FTQKELNLRQRRWLELLKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQM 447 Query: 322 G----EGSTPISQFRSSTFGLEDSDVVGQNRFESSLVAEEKEKQFNDTNLLQLKEGIHKH 155 E TP + R T ++ + L+ KEKQ +D L ++K + Sbjct: 448 KRLELEIVTPDTPMRLMTLVVQPT-----------LLDRIKEKQASDVELQKIKGKMVDG 496 Query: 154 KTTAFEQGGDDGTLRYRGRLCVPNVDGLRERIMAETHISMYSIHLGSMKMY 2 T F GD G +R+RGR+CVP G++E I+ E H + Y+IH G KMY Sbjct: 497 CTGDFTLDGD-GLMRFRGRICVPADSGIKEDILQEAHRAPYAIHPGGTKMY 546