BLASTX nr result
ID: Atropa21_contig00036934
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00036934
         (663 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   112   3e-36
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   112   3e-36
gb|ABI34354.1| Retrotransposon gag protein [Solanum demissum]         108   2e-34
gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum]           102   1e-31
ref|XP_006358737.1| PREDICTED: uncharacterized protein LOC102605...   101   6e-30
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    94   2e-28
gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum]     92   8e-28
ref|XP_004248998.1| PREDICTED: uncharacterized protein LOC101264...   103   2e-24
ref|XP_006352152.1| PREDICTED: uncharacterized protein LOC102604...    78   3e-24
dbj|BAL46523.1| hypothetical protein [Gentiana scabra x Gentiana...    71   9e-20
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]    70   1e-19
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...    70   1e-19
gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom...    70   2e-19
gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobrom...    69   3e-19
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...    69   3e-19
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...    69   3e-19
gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    69   3e-19
ref|XP_004510585.1| PREDICTED: uncharacterized protein LOC101494...
69 4e-19 gb|AAT38734.2| Polyprotein, putative [Solanum demissum] 66 5e-19 gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta... 67 9e-19 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 112 bits (280), Expect(2) = 3e-36 Identities = 58/117 (49%), Positives = 76/117 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVR DS + G + SK +SSL++EV EK DP LL L+ + K + F QG DG Sbjct: 1100 LGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAFEQG-GDGV 1158 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L+ER+M E H+SRY + STKMY DL+E YW N +KK + F Sbjct: 1159 LRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEF 1215 Score = 66.2 bits (160), Expect(2) = 3e-36 Identities = 32/52 (61%), Positives = 39/52 (75%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RPGGLAQNI++ WK EMIN+DFIT L +++DSIWV +D Sbjct: 1221 NCQQVKVEHQRPGGLAQNIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVD 1272 >gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 112 bits (280), Expect(2) = 3e-36 Identities = 58/117 (49%), Positives = 76/117 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVR DS + G + SK +SSL++EV EK DP LL L+ + K + F QG DG Sbjct: 1094 LGVRFTDSTEGGIAVTSKAESSLMSEVKEKQDQDPILLELKANVQKQRVLAFEQG-GDGV 1152 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L+ER+M E H+SRY + STKMY DL+E YW N +KK + F Sbjct: 1153 LRYQGRLCVPMVDGLQERVMEEAHSSRYSVHPGSTKMYRDLREFYWWNGMKKGIAEF 1209 Score = 66.2 bits (160), Expect(2) = 3e-36 Identities = 32/52 (61%), Positives = 39/52 (75%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RPGGLAQNI++ WK EMIN+DFIT L +++DSIWV +D Sbjct: 1215 NCQQVKVEHQRPGGLAQNIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVD 1266 >gb|ABI34354.1| Retrotransposon gag protein [Solanum 
demissum] Length = 4543 Score = 108 bits (269), Expect(3) = 2e-34 Identities = 56/117 (47%), Positives = 75/117 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVR DS G + ++ +SSLV EV +K DP LL L+ + K + F QG DG Sbjct: 876 LGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQG-GDGA 934 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L+E+IM E H+SRY + STKMY DL+E+YW N +KK + F Sbjct: 935 LRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEF 991 Score = 108 bits (269), Expect(3) = 2e-34 Identities = 56/117 (47%), Positives = 75/117 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVR DS G + ++ +SSLV EV +K DP LL L+ + K + F QG DG Sbjct: 2386 LGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQG-GDGA 2444 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L+E+IM E H+SRY + STKMY DL+E+YW N +KK + F Sbjct: 2445 LRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEF 2501 Score = 108 bits (269), Expect(3) = 2e-34 Identities = 56/117 (47%), Positives = 75/117 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVR DS G + ++ +SSLV EV +K DP LL L+ + K + F QG DG Sbjct: 3896 LGVRFTDSAKGGIAVANRAESSLVLEVKKKQDQDPILLELKANVQKQRVLAFEQG-GDGA 3954 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L+E+IM E H+SRY + STKMY DL+E+YW N +KK + F Sbjct: 3955 LRYQGRLCVPMVDGLQEKIMEEAHSSRYSVHPGSTKMYRDLREVYWWNGMKKGIAEF 4011 Score = 63.9 bits (154), Expect(3) = 2e-34 Identities = 31/52 (59%), Positives = 38/52 (73%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RPGGLAQ I++ WK EMIN+DFIT L +++DSIWV +D Sbjct: 997 NCQQVKVEHQRPGGLAQRIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVD 1048 Score = 63.9 bits (154), Expect(3) = 2e-34 Identities = 31/52 (59%), Positives = 38/52 (73%) Frame = -3 Query: 157 
N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RPGGLAQ I++ WK EMIN+DFIT L +++DSIWV +D Sbjct: 2507 NCQQVKVEHQRPGGLAQRIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVD 2558 Score = 63.9 bits (154), Expect(3) = 2e-34 Identities = 31/52 (59%), Positives = 38/52 (73%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RPGGLAQ I++ WK EMIN+DFIT L +++DSIWV +D Sbjct: 4017 NCQQVKVEHQRPGGLAQRIELPEWKWEMINMDFITGLPRSRRQHDSIWVIVD 4068 Score = 21.2 bits (43), Expect(3) = 2e-34 Identities = 7/11 (63%), Positives = 10/11 (90%) Frame = -2 Query: 194 KKDVADFMAKC 162 KK +A+F+AKC Sbjct: 985 KKGIAEFVAKC 995 Score = 21.2 bits (43), Expect(3) = 2e-34 Identities = 7/11 (63%), Positives = 10/11 (90%) Frame = -2 Query: 194 KKDVADFMAKC 162 KK +A+F+AKC Sbjct: 2495 KKGIAEFVAKC 2505 Score = 21.2 bits (43), Expect(3) = 2e-34 Identities = 7/11 (63%), Positives = 10/11 (90%) Frame = -2 Query: 194 KKDVADFMAKC 162 KK +A+F+AKC Sbjct: 4005 KKGIAEFVAKC 4015 >gb|AAT39297.2| Gag-pol protein, putative [Solanum demissum] Length = 1554 Score = 102 bits (253), Expect(3) = 1e-31 Identities = 52/106 (49%), Positives = 69/106 (65%) Frame = -1 Query: 492 GGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGTLRYRDRLCIPD 313 G + ++ +SSLV+EV EK DP L + + K + F QG DG LRY+ RLC+P Sbjct: 1111 GIAVANRAESSLVSEVKEKQDQDPIFLEFKANVQKQRVLAFEQG-GDGVLRYQGRLCVPM 1169 Query: 312 IDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 +D L+ERIM E H+SRY I STKMYHDL+E+YW N +KK + F Sbjct: 1170 VDGLQERIMEEAHSSRYSIHPGSTKMYHDLREVYWWNGMKKGIAEF 1215 Score = 60.1 bits (144), Expect(3) = 1e-31 Identities = 30/52 (57%), Positives = 36/52 (69%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH RP GLAQ I + WK EMIN+DFIT L +++DSIWV +D Sbjct: 1221 NCQQVKVEHQRPVGLAQRIKLPEWKWEMINMDFITGLPKSHRQHDSIWVIVD 1272 Score = 21.2 bits (43), Expect(3) = 1e-31 Identities = 7/11 (63%), Positives = 10/11 (90%) Frame = -2 Query: 194 KKDVADFMAKC 162 KK +A+F+AKC Sbjct: 1209 KKGIAEFVAKC 1219 
>ref|XP_006358737.1| PREDICTED: uncharacterized protein LOC102605124 [Solanum tuberosum] Length = 780 Score = 101 bits (251), Expect(2) = 6e-30 Identities = 56/120 (46%), Positives = 77/120 (64%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVRLL++ G V+ + V+ SLV EV EK F D + L+E +++ T F + DG Sbjct: 401 LGVRLLETPKEGIVVHNAVEFSLVVEVKEKQFKDLKIQRLKENVNQGTTKGF-ELTQDGV 459 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRFYGQ 166 L ++RLC+P+++ LR+RIM E H+SRY I STKMYHDLK +YW D+KK F Q Sbjct: 460 LCCQNRLCVPNVNGLRKRIMTEAHHSRYSIHPGSTKMYHDLKGVYWWRDMKKYIVEFVAQ 519 Score = 56.2 bits (134), Expect(2) = 6e-30 Identities = 25/52 (48%), Positives = 35/52 (67%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK +H +PGG Q +++ WK +MIN+DF L F+K+DSIWV +D Sbjct: 522 NCQRVKVKHQKPGGYMQCMELPTWKWDMINMDFFIGLPRSFRKFDSIWVIVD 573 >ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum] Length = 1946 Score = 93.6 bits (231), Expect(2) = 2e-28 Identities = 55/117 (47%), Positives = 72/117 (61%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVRL + + G V+ SSLV EV K D LL L+ + + K VF QG DG Sbjct: 1445 LGVRLEEVGNGGVVVVDGARSSLVDEVIAKQDLDSSLLELKALVKEGKVEVFSQG-GDGA 1503 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D LRE+I+ E HNS Y I STKMY DL+++YW +KK ++F Sbjct: 1504 LRYQGRLCVPCVDGLREKILEEAHNSSYSIHPGSTKMYRDLRDVYWWGGMKKDIAKF 1560 Score = 58.5 bits (140), Expect(2) = 2e-28 Identities = 28/53 (52%), Positives = 36/53 (67%) Frame = -3 Query: 160 HN*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 H+ Q VKAEH RPGGL Q+I+I WK E IN+DF+ L + + SIWV +D Sbjct: 1565 HSCQQVKAEHQRPGGLTQDIEIPTWKWEEINMDFVVGLPKTRKGFGSIWVVVD 1617 >gb|AAV31171.1| Putative polyprotein, identical [Solanum tuberosum] Length = 1487 Score = 91.7 bits (226), Expect(3) = 8e-28 Identities = 51/117 (43%), Positives = 69/117 (58%) Frame = -1 Query: 
525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVRL+DS G + ++ +SSLV+E N + K + F QG DG Sbjct: 1000 LGVRLIDSAKGGISVTNEAESSLVSEAN---------------VQKQRVLAFEQG-GDGV 1043 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY+ RLC+P +D L++RIM E H+SRY I TKMY DL+E+YW N +KK + F Sbjct: 1044 LRYQGRLCVPMVDGLQKRIMEEAHSSRYSIHPGFTKMYRDLREVYWWNGMKKGIAEF 1100 Score = 57.8 bits (138), Expect(3) = 8e-28 Identities = 30/52 (57%), Positives = 37/52 (71%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH R GGLAQ I++L K EMIN+DFIT L +++DSIWV +D Sbjct: 1106 NCQQVKVEHQRLGGLAQRIELLELKWEMINMDFITGLPRSRRQHDSIWVIVD 1157 Score = 21.2 bits (43), Expect(3) = 8e-28 Identities = 7/11 (63%), Positives = 10/11 (90%) Frame = -2 Query: 194 KKDVADFMAKC 162 KK +A+F+AKC Sbjct: 1094 KKGIAEFVAKC 1104 >ref|XP_004248998.1| PREDICTED: uncharacterized protein LOC101264383 [Solanum lycopersicum] Length = 256 Score = 103 bits (256), Expect(2) = 2e-24 Identities = 54/117 (46%), Positives = 72/117 (61%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 LGVRL +S G +++ DS LV V K DP + L+E + F QGED G Sbjct: 45 LGVRLEESSKGGFMVRHNSDSCLVLYVKSKQHLDPLFMELKESVLNKNNESFSQGED-GV 103 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY++RLC+ D+D LR++IM E H SRY I TKMYHD ++IYW N IK+ ++F Sbjct: 104 LRYQERLCVLDVDGLRDKIMDEAHGSRYSIHPGDTKMYHDFRDIYWWNGIKREVAKF 160 Score = 35.8 bits (81), Expect(2) = 2e-24 Identities = 16/34 (47%), Positives = 23/34 (67%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFI 56 N +KA+H PG L Q+IDI WK E +N++F+ Sbjct: 166 NWHQIKAKHQGPGVLTQDIDIPTWKWEDVNMEFL 199 >ref|XP_006352152.1| PREDICTED: uncharacterized protein LOC102604634 [Solanum tuberosum] Length = 427 Score = 78.2 bits (191), Expect(3) = 3e-24 Identities = 38/84 (45%), Positives = 55/84 (65%), Gaps = 1/84 (1%) Frame = -1 Query: 423 PYLL*LEEGIHKYKTTVFIQGEDDGTLRYRDRLCIPDIDRLRERIMFETHNSRYFIDT-C 247 P L L+ +H+ + 
V QG DG L Y+ RLC+P +D LR++I+ E HNSRY I Sbjct: 179 PEFLGLQGAVHQQRVEVISQG-GDGVLHYQGRLCVPKVDELRQQILAEAHNSRYSIHPGG 237 Query: 246 STKMYHDLKEIYW*NDIKKRRSRF 175 +TKMY DL+E++W ND+K+ + F Sbjct: 238 TTKMYRDLREVFWWNDMKRDIANF 261 Score = 57.8 bits (138), Expect(3) = 3e-24 Identities = 27/52 (51%), Positives = 36/52 (69%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VK EH +PGG+ Q I+I W E+IN+DFIT L +++DSIWV +D Sbjct: 267 NSQQVKVEHHKPGGICQEINIPTWNWEVINMDFITALPHTRRQHDSIWVIVD 318 Score = 22.3 bits (46), Expect(3) = 3e-24 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+D+A+F+AKC Sbjct: 255 KRDIANFVAKC 265 >dbj|BAL46523.1| hypothetical protein [Gentiana scabra x Gentiana triflora] Length = 1152 Score = 71.2 bits (173), Expect(2) = 9e-20 Identities = 40/103 (38%), Positives = 59/103 (57%) Frame = -1 Query: 465 SSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGTLRYRDRLCIPDIDRLRERIM 286 S L+ ++ K DP L+ L+ + + K TV Q + +G L Y DRLC+PD+D LR+++M Sbjct: 663 SELLDDIRAKQDEDPVLVDLKR-VAREKPTVGYQLDKNGHLWYGDRLCVPDVDGLRQQVM 721 Query: 285 FETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRFYGQVFT 157 E H + + STKMY DLKE YW +K + F + T Sbjct: 722 DEAHKIAFAVHPGSTKMYRDLKERYWWLGMKLNIAEFVAKCDT 764 Score = 52.0 bits (123), Expect(2) = 9e-20 Identities = 25/50 (50%), Positives = 32/50 (64%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH RPGGL + +++ WK E I +DFIT L +D IWV +D Sbjct: 766 QRVKAEHRRPGGLLKPLEVPEWKWENITMDFITGLPRTKSGHDMIWVIVD 815 >gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao] Length = 860 Score = 70.5 bits (171), Expect(3) = 1e-19 Identities = 42/117 (35%), Positives = 63/117 (53%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E S + +V L+ ++ E D +++ E K +F +G D G Sbjct: 393 IGVRLEVAETSALLAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD-GV 451 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y + +TKMY 
DLKE+YW +K+ + F Sbjct: 452 LRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 508 Score = 50.4 bits (119), Expect(3) = 1e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 516 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 565 Score = 21.9 bits (45), Expect(3) = 1e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 502 KRDVAEFVSKC 512 >gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 666 Score = 70.1 bits (170), Expect(3) = 1e-19 Identities = 41/117 (35%), Positives = 64/117 (54%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ ++ E D +++ E K +F +G D G Sbjct: 276 IGVRLEVAETNALLAHFRVRPILMDKIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD-GV 334 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR +I+ E H + Y + +TKMY DLKE+YW +K+ + F Sbjct: 335 LRYGTRLYVPDGDGLRRKILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 391 Score = 50.4 bits (119), Expect(3) = 1e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 399 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 448 Score = 21.9 bits (45), Expect(3) = 1e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 385 KRDVAEFVSKC 395 >gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 937 Score = 70.1 bits (170), Expect(3) = 2e-19 Identities = 42/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ + E D +++ E K +F +G D G Sbjct: 412 IGVRLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVIKALEDPRGKKGKMFTKGTD-GV 470 Query: 345 
LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y I +TKMY DLKE+YW +K+ + F Sbjct: 471 LRYGTRLYVPDSDGLRREILEEAHMAAYVIHPGATKMYQDLKEVYWWEGLKRDVAEF 527 Score = 50.1 bits (118), Expect(3) = 2e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 535 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTNGGYDSIWIVVD 584 Score = 21.9 bits (45), Expect(3) = 2e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 521 KRDVAEFVSKC 531 >gb|EOY08454.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1400 Score = 68.9 bits (167), Expect(3) = 3e-19 Identities = 41/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ + E D +++ E K +F +G D G Sbjct: 953 MGVRLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVIKALEDPRGRKGKMFTKGTD-GV 1011 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H Y + +TKMY DLKE+YW ++K+ + F Sbjct: 1012 LRYGTRLYVPDGDGLRREILEEAHMVAYVVHPGATKMYQDLKEVYWWEELKRDVAEF 1068 Score = 50.4 bits (119), Expect(3) = 3e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 1076 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 1125 Score = 21.9 bits (45), Expect(3) = 3e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 1062 KRDVAEFVSKC 1072 >gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 878 Score = 68.9 bits (167), Expect(3) = 3e-19 Identities = 41/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ + E D +++ E K +F +G D G Sbjct: 559 
IGVRLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD-GV 617 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y + +TKMY DLKE+YW +K+ + F Sbjct: 618 LRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 674 Score = 50.4 bits (119), Expect(3) = 3e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 682 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 731 Score = 21.9 bits (45), Expect(3) = 3e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 668 KRDVAEFVSKC 678 >gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 679 Score = 68.9 bits (167), Expect(3) = 3e-19 Identities = 41/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ + E D +++ E K +F +G D G Sbjct: 182 IGVRLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVIKALEDPRGRKGKMFTKGTD-GV 240 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y + +TKMY DLKE+YW +K+ + F Sbjct: 241 LRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 297 Score = 50.4 bits (119), Expect(3) = 3e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 305 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 354 Score = 21.9 bits (45), Expect(3) = 3e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 291 KRDVAEFVSKC 301 >gb|EOY20325.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 460 Score = 68.9 bits (167), Expect(3) = 3e-19 Identities = 41/117 (35%), Positives = 62/117 (52%) Frame = -1 Query: 525 
LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ + E D +++ E K +F +G D G Sbjct: 145 IGVRLEVAETNALLAHFRVRPILMDRIKEAQSKDEFVIKALEDPQGRKGKMFTKGTD-GV 203 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y + +TKMY DLKE+YW +K+ + F Sbjct: 204 LRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 260 Score = 50.4 bits (119), Expect(3) = 3e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 268 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 317 Score = 21.9 bits (45), Expect(3) = 3e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 254 KRDVAEFVSKC 264 >ref|XP_004510585.1| PREDICTED: uncharacterized protein LOC101494113 [Cicer arietinum] Length = 1414 Score = 68.6 bits (166), Expect(3) = 4e-19 Identities = 33/95 (34%), Positives = 57/95 (60%) Frame = -1 Query: 474 KVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGTLRYRDRLCIPDIDRLRE 295 ++ ++V ++ E DPYL+ + + K + F+ + DG LR + RLC+P++ LR Sbjct: 1212 QIRPTIVDDIKEAQSQDPYLVNMVNNVQNGKISYFLV-DFDGVLRLKARLCVPNVCGLRR 1270 Query: 294 RIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKK 190 +I+ E H+S Y I S KMY DL+++YW +K+ Sbjct: 1271 KILEEAHHSSYTIHPGSNKMYQDLRKLYWWEGMKR 1305 Score = 50.1 bits (118), Expect(3) = 4e-19 Identities = 24/49 (48%), Positives = 33/49 (67%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFI 5 Q VKAEH +P GL Q ++I WK E+I +DF+T L + YDS+W+ I Sbjct: 1318 QQVKAEHQKPVGLLQPVEIPEWKWEVIAMDFVTGLPRTQRGYDSVWIKI 1366 Score = 22.3 bits (46), Expect(3) = 4e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVADF+++C Sbjct: 1304 KRDVADFVSRC 1314 >gb|AAT38734.2| Polyprotein, putative [Solanum demissum] Length = 513 Score = 65.9 bits (159), Expect(2) = 5e-19 Identities = 29/55 (52%), Positives = 39/55 (70%) Frame = -1 
Query: 339 YRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 Y+ RLC+ DID LRE ++ E H SRY + +TKMYHDL E+YW N +KK+ + F Sbjct: 103 YKGRLCVSDIDGLREYVLEEAHGSRYSTHSGATKMYHDLWEVYWWNGMKKKIAGF 157 Score = 55.1 bits (131), Expect(2) = 5e-19 Identities = 27/52 (51%), Positives = 36/52 (69%) Frame = -3 Query: 157 N*QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 N Q VKA+H RPGGL+Q IDI WK E +N+DF+ L +++D IWV I+ Sbjct: 163 NCQQVKAKHLRPGGLSQYIDIPTWKWEDMNMDFVVGLPRTRRQHDFIWVIIN 214 >gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao] Length = 521 Score = 67.4 bits (163), Expect(3) = 9e-19 Identities = 40/117 (34%), Positives = 63/117 (53%) Frame = -1 Query: 525 LGVRLLDSEDSGGVIQSKVDSSLVAEVNEKWFGDPYLL*LEEGIHKYKTTVFIQGEDDGT 346 +GVRL +E + + +V L+ ++ E + +++ E K +F +G D G Sbjct: 24 IGVRLEVAETNALLAHFRVRPILMDKIKEAQSKNEFVIKALEDPQGRKGKMFTKGTD-GV 82 Query: 345 LRYRDRLCIPDIDRLRERIMFETHNSRYFIDTCSTKMYHDLKEIYW*NDIKKRRSRF 175 LRY RL +PD D LR I+ E H + Y + +TKMY DLKE+YW +K+ + F Sbjct: 83 LRYGTRLYVPDGDGLRREILEEAHMAAYVVHPGATKMYQDLKEVYWWEGLKRDVAEF 139 Score = 50.4 bits (119), Expect(3) = 9e-19 Identities = 24/50 (48%), Positives = 31/50 (62%) Frame = -3 Query: 151 QHVKAEH*RPGGLAQNIDILIWK*EMINIDFITNLHCLFQKYDSIWVFID 2 Q VKAEH +P GL Q + + WK E I +DF+T L YDSIW+ +D Sbjct: 147 QQVKAEHQKPAGLLQPLPVPEWKWEHIAMDFVTGLPRTSGGYDSIWIVVD 196 Score = 21.9 bits (45), Expect(3) = 9e-19 Identities = 7/11 (63%), Positives = 11/11 (100%) Frame = -2 Query: 194 KKDVADFMAKC 162 K+DVA+F++KC Sbjct: 133 KRDVAEFVSKC 143