BLASTX nr result
ID: Atropa21_contig00037394
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00037394
         (551 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]            158   1e-38
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]    162   4e-38
ref|XP_004248966.1| PREDICTED: uncharacterized protein LOC101245...  153   1e-37
ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244...  159   4e-37
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]  159   6e-37
ref|XP_006340520.1| PREDICTED: uncharacterized protein LOC102599...  155   5e-36
ref|XP_004234875.1| PREDICTED: uncharacterized protein LOC101266...  124   2e-26
ref|XP_004228987.1| PREDICTED: uncharacterized protein LOC101252...  115   5e-24
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...  106   2e-23
ref|XP_004515018.1| PREDICTED: uncharacterized protein LOC101498...  109   5e-22
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...  101   2e-21
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...  101   2e-21
ref|XP_004229115.1| PREDICTED: uncharacterized protein LOC101247...  105   9e-21
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         97   3e-20
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   97   3e-18
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   96   4e-18
gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom...   96   4e-18
emb|CAJ65807.1| polyprotein [Citrus sinensis]                         96   4e-18
gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, puta...   96   6e-18
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...   96   8e-18

>gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 624

 Score = 158 bits (400), Expect(2) = 1e-38
 Identities = 88/166 (53%), Positives = 111/166 (66%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV ALSRKA S+ SLAHL LA E Q LAN +RL+++ LA +E
Sbjct: 439 PGKANVVAVALSRKAGSMGSLAHLQASRHPLAREVQILANDLMRLEVNEKGGFLACVEAR 498

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           SS L++IKG + D KL +I+ K++ EA EA ID EGVLRIKGRVCVP V L+ I+
Sbjct: 499 SSSLDKIKGKQFNDEKLIRIRDKVLRGEAKEAQIDEEGVLRIKGRVCVPRVDDLINTILT 558

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A++ RYSI+ YRDLKQH+WW RMKR IV F++ NCQ++
Sbjct: 559 EAHSSRYSIHPGATKMYRDLKQHFWWSRMKRDIVNFIAKCPNCQQV 604

 Score = 27.3 bits (59), Expect(2) = 1e-38
 Identities = 11/18 (61%), Positives = 14/18 (77%)
 Frame = -3

Query: 60  KVKYEHQRPCGLTQKISI 7
           +VKYEHQRP G Q++ I
Sbjct: 603 QVKYEHQRPGGTLQRMPI 620

>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score = 162 bits (411), Expect = 4e-38
 Identities = 86/166 (51%), Positives = 118/166 (71%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV DALSRKA S+ SLA L + ER LAL+ QSLANS VRLDIS+ R +LA++ V
Sbjct: 1230 PGKANVVADALSRKAVSMGSLAFLSVEERPLALDIQSLANSMVRLDISDSRCVLAFMRVQ 1289

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSLL++I+G +ED L ++ +++ + +A +D +GVL+ GR+CVP VG L++LI+
Sbjct: 1290 SSLLDRIRGCQFEDDTLVALRDRVLADDGGQATLDPDGVLKFAGRICVPRVGDLIQLILS 1349

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
            A+ RYSI+ YRDL+QHYWW M+R I +FVS CQ++
Sbjct: 1350 EAHESRYSIHPGTAKMYRDLRQHYWWSGMRRDIADFVSRCLCCQQV 1395

>ref|XP_004248966.1| PREDICTED: uncharacterized protein LOC101245305 [Solanum lycopersicum]
          Length = 256

 Score = 153 bits (386), Expect(2) = 1e-37
 Identities = 82/147 (55%), Positives = 107/147 (72%)
 Frame = -1

Query: 494 SLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVMSSLLEQIKG**YEDMKLCK 315
           SLA L + ER LA + QSLANSF+R DIS P ++LAY+E S+LLEQI+ ++D L K
Sbjct: 3   SLAMLQVDERPLARDVQSLANSFLRPDISEPGKVLAYMEARSTLLEQIRAQQFDDGDLFK 62

Query: 314 IKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE*AYTFRYSINLSVMMTYRD 135
           I YK++ EA A++D+EGVLRIKGR+CVP G L +LI+E A++ RYSI Y D
Sbjct: 63  IIYKVLKGEAKAAILDNEGVLRIKGRICVPRTGDLTKLIMEEAHSSRYSIPPRATKMYHD 122

Query: 134 LKQHYWWGRMKRGIVEFVS**YNCQRL 54
           LKQ+YWW RMKR IV+FVS +NCQ++
Sbjct: 123 LKQYYWWCRMKRDIVDFVSRCFNCQQV 149

 Score = 29.6 bits (65), Expect(2) = 1e-37
 Identities = 11/18 (61%), Positives = 16/18 (88%)
 Frame = -3

Query: 60  KVKYEHQRPCGLTQKISI 7
           +VKYEHQ+P G+TQ++ I
Sbjct: 148 QVKYEHQKPGGITQRMPI 165

>ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244956 [Solanum lycopersicum]
          Length = 933

 Score = 159 bits (402), Expect = 4e-37
 Identities = 91/166 (54%), Positives = 113/166 (68%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGK NVV DA S KA S+ SLA L E LA + QSLAN FVRLD S ++LAY+E
Sbjct: 700 PGKENVVADASSWKAASMGSLAMLQGSEHPLAKDVQSLANRFVRLDYSEFCKVLAYMEAR 759

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           SS+LE I+ ++D LCKI+ K++ E N +++DSEGVLRIK +CVPC L RLI+E
Sbjct: 760 SSMLEHIRAQQFDDGDLCKIRDKVLKGETNASILDSEGVLRIKCHICVPCTSDLTRLIME 819

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ R SI+L YRDLKQHYWW RMKR IV+ VS NCQ++
Sbjct: 820 EAH--RVSIHLGDTNLYRDLKQHYWWCRMKRDIVDIVSHCLNCQQV 863

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score = 159 bits (401), Expect = 6e-37
 Identities = 84/166 (50%), Positives = 116/166 (69%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV DALSRKA S+ SLA L + ER LA++ Q LANS VRLDIS+ RR+LA++ V
Sbjct: 1074 PGKANVVADALSRKAVSMGSLAFLSVEERPLAMDIQFLANSMVRLDISDSRRVLAHMGVQ 1133

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSLL++I+G +ED L ++ +++ + +A + +GVLR GR+CVP VG L++LI+
Sbjct: 1134 SSLLDRIRGCQFEDEALVALRDRVLAGDGGQASLYPDGVLRFAGRICVPRVGDLIQLILS 1193

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
            + RYSI+ YRDL+QHYWW M+R I +FVS CQ++
Sbjct: 1194 EGHESRYSIHPGTTKMYRDLRQHYWWSGMRRDIADFVSRCLCCQQV 1239

>ref|XP_006340520.1| PREDICTED: uncharacterized protein LOC102599426 [Solanum tuberosum]
          Length = 1228

 Score = 155 bits (393), Expect = 5e-36
 Identities = 87/154 (56%), Positives = 109/154 (70%)
 Frame = -1

Query: 545 KANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVMSS 366
           KAN+V DALS +A S+ SLA L + E LA + +SLANSFV LD S ++LAY+E SS
Sbjct: 429 KANMVADALSLEAVSMGSLAMLQVGECLLARDFKSLANSFVSLDNSESGKVLAYVEARSS 488

Query: 365 LLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE*A 186
           LLEQI ++D LCKI+ K+ EA +A++DSE VLRIKGR+CVP G L RLI+E A
Sbjct: 489 LLEQIWTQQFDDGDLCKIRDKVSKGEAMDAILDSERVLRIKGRICVPRTGDLTRLIMEEA 548

Query: 185 YTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEF 84
           Y+ RYSI+ Y DLKQHYWW RMK+ IV+F
Sbjct: 549 YSLRYSIHHEATKMYCDLKQHYWWSRMKKEIVDF 582

>ref|XP_004234875.1| PREDICTED: uncharacterized protein LOC101266912 [Solanum lycopersicum]
          Length = 150

 Score = 124 bits (310), Expect = 2e-26
 Identities = 80/157 (50%), Positives = 98/157 (62%)
 Frame = -1

Query: 548 GKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVMS 369
           GKANVV +ALS+KA S+ SLA L E LA + QSLANSFVRLDIS +I A
Sbjct: 8   GKANVVAEALSQKAMSMGSLAMLQGREHPLARDVQSLANSFVRLDISESGKIRAQQ---- 63

Query: 368 SLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE* 189
           ++D L KI+ ++ EA A++DSEGVLRIKGR+ VP L RLI+E
Sbjct: 64  ----------FDDCDLFKIRDTVLKGEAKTAILDSEGVLRIKGRISVPHTSYLTRLIMEE 113

Query: 188 AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS 78
           A++ R SI+ Y DLKQHYW RMKR IV+FVS
Sbjct: 114 AHSSRNSIHPGSTKIYLDLKQHYWLCRMKRDIVDFVS 150

>ref|XP_004228987.1| PREDICTED: uncharacterized protein LOC101252110 [Solanum lycopersicum]
          Length = 193

 Score = 115 bits (289), Expect = 5e-24
 Identities = 57/124 (45%), Positives = 85/124 (68%)
 Frame = -1

Query: 425 VRLDISNPRRILAYIEVMSSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRI 246
           VRLDIS+ RR+LAY+ V SSL ++I+G +ED L ++ +++ ++A +D +GVLR+
Sbjct: 2   VRLDISDSRRVLAYMGVQSSLYDRIRGCKFEDKALGSLRDRVLAGNGDQATLDHDGVLRL 61

Query: 245 KGRVCVPCVGGLMRLIIE*AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YN 66
           GR+CVP VG L++ I+ A+ RYSI+ YRDL+QHYWW M+R I +FVS
Sbjct: 62  AGRICVPRVGDLIQFILSEAHESRYSIHPGTAKMYRDLRQHYWWSGMRRDIADFVSRCLC 121

Query: 65  CQRL 54
           CQ++
Sbjct: 122 CQQV 125

>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score = 106 bits (265), Expect(2) = 2e-23
 Identities = 66/166 (39%), Positives = 94/166 (56%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV DALSR S+ SLAH+ I +R +A E LA VRL+ ++
Sbjct: 1407 PGKANVVADALSR--VSMGSLAHVDIGDREMAREVHRLARLGVRLEEVGNGGVVVVDGAR 1464

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSL++++ D L ++K + + +G LR +GR+CVPCV GL I+E
Sbjct: 1465 SSLVDEVIAKQDLDSSLLELKALVKEGKVEVFSQGGDGALRYQGRLCVPCVDGLREKILE 1524

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
            A+ YSI+ YRDL+ YWWG MK+ I +FVS ++CQ++
Sbjct: 1525 EAHNSSYSIHPGSTKMYRDLRDVYWWGGMKKDIAKFVSGCHSCQQV 1570

 Score = 28.5 bits (62), Expect(2) = 2e-23
 Identities = 13/18 (72%), Positives = 14/18 (77%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GLTQ I I
Sbjct: 1569 QVKAEHQRPGGLTQDIEI 1586

>ref|XP_004515018.1| PREDICTED: uncharacterized protein LOC101498380 [Cicer arietinum]
          Length = 524

 Score = 109 bits (272), Expect = 5e-22
 Identities = 57/156 (36%), Positives = 97/156 (62%)
 Frame = -1

Query: 545 KANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVMSS 366
           K NVV DALSRK + +LAH+ ++R + E Q + S ++ ++ + R LA+++++ +
Sbjct: 257 KENVVADALSRKF--MGNLAHIAEVKRQIVKEFQEIVESGIQFELGHSRLFLAHVQILPT 314

Query: 365 LLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE*A 186
           +++ IK +D L + + + ++ +DS+GVLR+K R+CVP VGGL R I++ A
Sbjct: 315 IVDDIKKAQSQDSHLVNMVNNVQNGKISDFSVDSDGVLRLKSRLCVPNVGGLRRKILDEA 374

Query: 185 YTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS 78
           + Y I+ Y+DL++ YWW RMKR + +FVS
Sbjct: 375 HHSSYIIHPVSNKMYQDLRELYWWERMKRDVADFVS 410

>gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum]
          Length = 1602

 Score = 101 bits (252), Expect(2) = 2e-21
 Identities = 64/168 (38%), Positives = 92/168 (54%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV D+LSR S+ S H+ R LA + LA VR S I +
Sbjct: 1062 PGKANVVADSLSR--LSMGSTTHIEEGRRELAKDMHRLACLGVRFTDSTEGGIAVTSKAE 1119

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSL+ ++K +D L ++K + +GVLR +GR+CVP V GL ++E
Sbjct: 1120 SSLMSEVKEKQDQDPILLELKANVQKQRVLAFEQGGDGVLRYQGRLCVPMVDGLQERVME 1179

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRLSM 48
            A++ RYS++ YRDL++ YWW MK+GI EFV+ NCQ++ +
Sbjct: 1180 EAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEFVAKCPNCQQVKV 1227

 Score = 26.6 bits (57), Expect(2) = 2e-21
 Identities = 11/18 (61%), Positives = 13/18 (72%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GL Q I +
Sbjct: 1224 QVKVEHQRPGGLAQNIEL 1241

>gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum]
          Length = 1515

 Score = 101 bits (252), Expect(2) = 2e-21
 Identities = 64/168 (38%), Positives = 92/168 (54%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV D+LSR S+ S H+ R LA + LA VR S I +
Sbjct: 1056 PGKANVVADSLSR--LSMGSTTHIEEGRRELAKDMHRLACLGVRFTDSTEGGIAVTSKAE 1113

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSL+ ++K +D L ++K + +GVLR +GR+CVP V GL ++E
Sbjct: 1114 SSLMSEVKEKQDQDPILLELKANVQKQRVLAFEQGGDGVLRYQGRLCVPMVDGLQERVME 1173

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRLSM 48
            A++ RYS++ YRDL++ YWW MK+GI EFV+ NCQ++ +
Sbjct: 1174 EAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEFVAKCPNCQQVKV 1221

 Score = 26.6 bits (57), Expect(2) = 2e-21
 Identities = 11/18 (61%), Positives = 13/18 (72%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GL Q I +
Sbjct: 1218 QVKVEHQRPGGLAQNIEL 1235

>ref|XP_004229115.1| PREDICTED: uncharacterized protein LOC101247457 [Solanum lycopersicum]
          Length = 173

 Score = 105 bits (261), Expect = 9e-21
 Identities = 54/113 (47%), Positives = 73/113 (64%)
 Frame = -1

Query: 395 ILAYIEVMSSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVG 216
           ++ +IE SSL+EQI+ ++D KLC I+ K++ WEA E ++D GVLRI GR+CVP G
Sbjct: 29  MIVFIEARSSLVEQIRAHHFDDEKLCIIRDKVLRWEAKEVVLDPNGVLRIGGRICVPKTG 88

Query: 215 GLMRLIIE*AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQR 57
           L+RLI+E A+ R+SI+ Y DL QHYW M R I FVS CQ+
Sbjct: 89  DLIRLILEEAHCSRFSIHPRAAKMYHDLSQHYWLCGMTRDISNFVSRCLTCQQ 141

>gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]
          Length = 4543

 Score = 96.7 bits (239), Expect(2) = 3e-20
 Identities = 64/168 (38%), Positives = 90/168 (53%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV D+LSR S+ S AH+ R L + LA VR S I
Sbjct: 838 PGKANVVADSLSR--LSMGSTAHIEEGRRELTKDVHRLACLGVRFTDSAKGGIAVANRAE 895

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           SSL+ ++K +D L ++K + +G LR +GR+CVP V GL I+E
Sbjct: 896 SSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQGGDGALRYQGRLCVPMVDGLQEKIME 955

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRLSM 48
           A++ RYS++ YRDL++ YWW MK+GI EFV+ NCQ++ +
Sbjct: 956 EAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEFVAKCPNCQQVKV 1003

 Score = 96.7 bits (239), Expect(2) = 3e-20
 Identities = 64/168 (38%), Positives = 90/168 (53%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV D+LSR S+ S AH+ R L + LA VR S I
Sbjct: 2348 PGKANVVADSLSR--LSMGSTAHIEEGRRELTKDVHRLACLGVRFTDSAKGGIAVANRAE 2405

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSL+ ++K +D L ++K + +G LR +GR+CVP V GL I+E
Sbjct: 2406 SSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQGGDGALRYQGRLCVPMVDGLQEKIME 2465

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRLSM 48
            A++ RYS++ YRDL++ YWW MK+GI EFV+ NCQ++ +
Sbjct: 2466 EAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEFVAKCPNCQQVKV 2513

 Score = 96.7 bits (239), Expect(2) = 3e-20
 Identities = 64/168 (38%), Positives = 90/168 (53%)
 Frame = -1

Query: 551  PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
            PGKANVV D+LSR S+ S AH+ R L + LA VR S I
Sbjct: 3858 PGKANVVADSLSR--LSMGSTAHIEEGRRELTKDVHRLACLGVRFTDSAKGGIAVANRAE 3915

Query: 371  SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
            SSL+ ++K +D L ++K + +G LR +GR+CVP V GL I+E
Sbjct: 3916 SSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQGGDGALRYQGRLCVPMVDGLQEKIME 3975

Query: 191  *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRLSM 48
            A++ RYS++ YRDL++ YWW MK+GI EFV+ NCQ++ +
Sbjct: 3976 EAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEFVAKCPNCQQVKV 4023

 Score = 27.3 bits (59), Expect(2) = 3e-20
 Identities = 11/18 (61%), Positives = 14/18 (77%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GL Q+I +
Sbjct: 1000 QVKVEHQRPGGLAQRIEL 1017

 Score = 27.3 bits (59), Expect(2) = 3e-20
 Identities = 11/18 (61%), Positives = 14/18 (77%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GL Q+I +
Sbjct: 2510 QVKVEHQRPGGLAQRIEL 2527

 Score = 27.3 bits (59), Expect(2) = 3e-20
 Identities = 11/18 (61%), Positives = 14/18 (77%)
 Frame = -3

Query: 60   KVKYEHQRPCGLTQKISI 7
            +VK EHQRP GL Q+I +
Sbjct: 4020 QVKVEHQRPGGLAQRIEL 4037

>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 64/166 (38%), Positives = 91/166 (54%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK S+ SLAH+ I R L E SL + VRL+++ +LA+ V
Sbjct: 521 PGKANVVADALSRK--SMGSLAHIFIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 578

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           L+++IK +D + K + ++GVLR R+ VP GL R I+E
Sbjct: 579 PILMDRIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILE 638

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y ++ Y+DLK+ YWW +KR + EFVS CQ++
Sbjct: 639 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQV 684

>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 64/166 (38%), Positives = 91/166 (54%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK S+ SLAH+ I R L E SL + VRL+++ +LA+ V
Sbjct: 238 PGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 295

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           L+++IK +D + K + ++GVLR R+ VP GL R I+E
Sbjct: 296 PILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRRKILE 355

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y ++ Y+DLK+ YWW +KR + EFVS CQ++
Sbjct: 356 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQV 401

>gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 937

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 65/166 (39%), Positives = 91/166 (54%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK S+ SLAH+ I R L E SL + VRL+++ +LA+ V
Sbjct: 374 PGKANVVADALSRK--SMGSLAHISIGRRSLVKEIHSLGDIGVRLEVAETNALLAHFRVR 431

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           L+++IK +D + K + ++GVLR R+ VP GL R I+E
Sbjct: 432 PILMDRIKEAQSKDEFVIKALEDPRGKKGKMFTKGTDGVLRYGTRLYVPDSDGLRREILE 491

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y I+ Y+DLK+ YWW +KR + EFVS CQ++
Sbjct: 492 EAHMAAYVIHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQV 537

>emb|CAJ65807.1| polyprotein [Citrus sinensis]
          Length = 533

 Score = 96.3 bits (238), Expect = 4e-18
 Identities = 62/166 (37%), Positives = 94/166 (56%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK+ S S+AHL L +E +SL V L++ N R ++A V
Sbjct: 248 PGKANVVADALSRKSFS--SIAHLRGTYMPLLIELRSLG---VELEVDNCRALIANFRVR 302

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           +L++++ +D++L K+K + + + GVL + R+CVP + L + I+E
Sbjct: 303 PTLIDKVHQMQDQDLQLLKLKENVQKDLRTDFAVRDNGVLVMGNRLCVPDIKELKKEIME 362

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y+++ YR L+ HYWW MKR I EFVS CQ++
Sbjct: 363 EAHCSAYAMHPGSTKMYRTLRDHYWWQGMKREIAEFVSRCLVCQQI 408

>gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 460

 Score = 95.9 bits (237), Expect = 6e-18
 Identities = 64/166 (38%), Positives = 91/166 (54%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK S+ SLAH+ I R L E SL + VRL+++ +LA+ V
Sbjct: 107 PGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 164

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           L+++IK +D + K + ++GVLR R+ VP GL R I+E
Sbjct: 165 PILMDRIKEAQSKDEFVIKALEDPQGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILE 224

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y ++ Y+DLK+ YWW +KR + EFVS CQ++
Sbjct: 225 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQV 270

>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score = 95.5 bits (236), Expect = 8e-18
 Identities = 64/166 (38%), Positives = 91/166 (54%)
 Frame = -1

Query: 551 PGKANVVEDALSRKATSLWSLAHLIIIERHLALEDQSLANSFVRLDISNPRRILAYIEVM 372
           PGKANVV DALSRK S+ SLAH+ I R L E SL + VRL+++ +LA+ V
Sbjct: 144 PGKANVVADALSRK--SMGSLAHISIGRRSLVREIHSLGDIGVRLEVAETNALLAHFRVR 201

Query: 371 SSLLEQIKG**YEDMKLCKIKYKMM*WEANEAMIDSEGVLRIKGRVCVPCVGGLMRLIIE 192
           L+++IK +D + K + ++GVLR R+ VP GL R I+E
Sbjct: 202 PILMDRIKEAQSKDEFVIKALEDPRGRKGKMFTKGTDGVLRYGTRLYVPDGDGLRREILE 261

Query: 191 *AYTFRYSINLSVMMTYRDLKQHYWWGRMKRGIVEFVS**YNCQRL 54
           A+ Y ++ Y+DLK+ YWW +KR + EFVS CQ++
Sbjct: 262 EAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEFVSKCLVCQQV 307