BLASTX nr result
ID: Atropa21_contig00034058
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00034058
         (926 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]    91   6e-16
gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]      89   2e-15
gb|AAT39954.1| Putative integrase, identical [Solanum demissum]        89   2e-15
gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]    86   2e-14
ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581...    85   3e-14
gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobrom...    85   4e-14
gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]              85   4e-14
ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599...    84   8e-14
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]    84   8e-14
gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    84   8e-14
gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobrom...    84   8e-14
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]     83   1e-13
gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]               83   1e-13
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...    83   2e-13
gb|EOY26288.1| Retrotransposon protein, Ty3-gypsy subclass, puta...    82   2e-13
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...    82   2e-13

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score = 90.9 bits (224), Expect = 6e-16
 Identities = 48/93 (51%), Positives = 57/93 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS+FWR  Q ELGTRV+LSTAFHPQ D Q +  I  L        +        F
Sbjct: 1332 FTSNFWRTFQDELGTRVDLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLA 1391

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 1392 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 1424

>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 47/93 (50%), Positives = 56/93 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FWR  Q+ELGTRV LST+FHPQ D Q +  I  L        +        F
Sbjct: 1488 FTSSFWRAFQEELGTRVHLSTSFHPQTDGQSERTIQVLEDMLRACVMDFGGQWEQFLPLA 1547

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 1548 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 1580

>gb|AAT39954.1| Putative integrase, identical [Solanum demissum]
          Length = 1609

 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 46/93 (49%), Positives = 56/93 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FWR  Q +LGTRV+LST FHPQ D Q +  I  L    +   +        F
Sbjct: 1185 FTSSFWRTFQDDLGTRVDLSTTFHPQTDGQSERTIQVLEDMLQACVMDFGGQWDQFLPLA 1244

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
             F YNN+YYS+I+MAPF+ +YGRR RS VGW E
Sbjct: 1245 EFAYNNNYYSSIQMAPFEALYGRRCRSPVGWFE 1277

>gb|EOX99963.1| Uncharacterized protein TCM_009073 [Theobroma cacao]
          Length = 421

 Score = 85.9 bits (211), Expect = 2e-14
 Identities = 52/139 (37%), Positives = 75/139 (53%), Gaps = 5/139 (3%)
 Frame = +1

Query: 1   LSREGNFISVQNLK*V-----FYGDYLISVISKKKSCLMDHNCM*FTSHFWRYIQKELGT 165
           L++  +F+SV+           Y D ++ +     S + D     FTS FW  +Q+ LGT
Sbjct: 84  LTKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQ-FTSRFWGKLQEALGT 142

Query: 166 RVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWRSFFYNNSYYSNIEMA 345
           +++ STAFHPQ D Q +  I  L        + L V    +     F YNNS+ ++I+MA
Sbjct: 143 KLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLVEFAYNNSFQTSIQMA 202

Query: 346 PFKVMYGRRPRSLVGWLEV 402
           PFK +YGRR RS +GWLEV
Sbjct: 203 PFKALYGRRCRSPIGWLEV 221

>ref|XP_006364939.1| PREDICTED: uncharacterized protein LOC102581051 [Solanum tuberosum]
          Length = 1946

 Score = 85.1 bits (209), Expect = 3e-14
 Identities = 48/100 (48%), Positives = 59/100 (59%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTSHFW+  Q+ LGTRV+L+TAFHPQ D Q +  I  L        L L     +
Sbjct: 1663 FTSHFWKSFQRGLGTRVKLTTAFHPQTDGQAERTIQTLEDMLRACVLELKGSWEDHLPLI 1722

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEVVFIVFL 420
             F YNNSY+S+I MAPF+ +YGRR RS VG  EV  +  L
Sbjct: 1723 EFSYNNSYHSSIGMAPFEALYGRRCRSSVGLFEVGEVALL 1762

>gb|EOY03326.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1447

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 1168 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEAMLRACVIDLGVRWEQYLPLV 1227

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1228 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1261

>gb|ADU56211.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 367

 Score = 84.7 bits (208), Expect = 4e-14
 Identities = 46/98 (46%), Positives = 63/98 (64%), Gaps = 5/98 (5%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLR--ICCEPA*LTLLVIGINFYS 294
           FTS FW+ +  ELGTR++LSTAFHPQ D Q +  I  L   IC       ++  G ++ S
Sbjct: 68  FTSKFWKILHAELGTRLDLSTAFHPQTDGQSERTIQVLEDMICA-----CVIEFGGHWDS 122

Query: 295 W---RSFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
           +     F YNNSY+S+I+MAPF+ +YGRR RS +GW +
Sbjct: 123 FLPLAEFSYNNSYHSSIDMAPFEALYGRRCRSPIGWFD 160

>ref|XP_006356454.1| PREDICTED: uncharacterized protein LOC102599406 [Solanum tuberosum]
          Length = 859

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 45/93 (48%), Positives = 53/93 (56%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FWR  Q ELGTRV+L T FHPQ D Q +  I  L        +
Sbjct: 544 FTSSFWRTFQDELGTRVDLCTTFHPQTDGQSERTIKVLEDMLRACVMDFGGQWDQHLPLA 603

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
            F YNNSY+S+I+MAPF+ +YGRR RS VGW E
Sbjct: 604 EFAYNNSYHSSIQMAPFEALYGRRCRSPVGWFE 636

>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 400 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 459

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 460 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 493

>gb|EOY21678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 448

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 169 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 228

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 229 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 262

>gb|EOY20280.1| Uncharacterized protein TCM_045699 [Theobroma cacao]
          Length = 415

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 136 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLV 195

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 196 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 229

>gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 562

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 349 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 408

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 409 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 442

>gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1502

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 1270 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 1329

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1330 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1363

>gb|EOY08667.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 521

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 242 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 301

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 302 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 335

>gb|EOY08659.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 937

 Score = 84.0 bits (206), Expect = 8e-14
 Identities = 43/94 (45%), Positives = 58/94 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 630 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSEWTIQTLEDMLRACVIDLGVRWEQYLPLV 689

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 690 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 723

>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 43/94 (45%), Positives = 57/94 (60%)
 Frame = +1

Query: 121  FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
            FTS FW  +Q+ LGT+ + STAFHPQ D Q +  I  L        + L V    +
Sbjct: 1239 FTSRFWGKLQEALGTKFDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 1298

Query: 301  SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
             F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 1299 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 1332

>gb|ABI34389.1| Polyprotein, putative [Solanum tuberosum]
          Length = 545

 Score = 83.2 bits (204), Expect = 1e-13
 Identities = 46/93 (49%), Positives = 54/93 (58%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS F R  Q+ELGTRV LSTAFHPQ D Q +  I  L        +        F
Sbjct: 94  FTSSFLRAFQEELGTRVHLSTAFHPQTDGQSERTIQVLEDMLRACVMDFGGQWDQFLPLA 153

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
            F YNNSY+S+I+MAPF+ +YGRR  S VGW E
Sbjct: 154 EFAYNNSYHSSIQMAPFEALYGRRCHSPVGWFE 186

>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score = 82.8 bits (203), Expect = 2e-13
 Identities = 50/139 (35%), Positives = 74/139 (53%), Gaps = 5/139 (3%)
 Frame = +1

Query: 1   LSREGNFISVQNLK*V-----FYGDYLISVISKKKSCLMDHNCM*FTSHFWRYIQKELGT 165
           L++  +F+SV+           Y D ++ +     S + D     FTS FW  +Q+ LGT
Sbjct: 450 LTKSAHFLSVKTTYGAAQYARVYVDEIVRLHGIPISIVSDRGAQ-FTSRFWGKLQEALGT 508

Query: 166 RVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWRSFFYNNSYYSNIEMA 345
           +++ ST FHPQ D Q +  I  L        + L V    +     F YNNS+ ++I+MA
Sbjct: 509 KLDFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGVKWEQYLPLVEFAYNNSFQTSIQMA 568

Query: 346 PFKVMYGRRPRSLVGWLEV 402
           PF+ +YGRR RS +GWLEV
Sbjct: 569 PFEALYGRRCRSPIGWLEV 587

>gb|EOY26288.1| Retrotransposon protein, Ty3-gypsy subclass, putative [Theobroma cacao]
          Length = 308

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 42/94 (44%), Positives = 57/94 (60%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ ST FHPQ D Q +  I  L        + L V    +
Sbjct: 128 FTSRFWGKLQEALGTKLDFSTTFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 187

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLEV 402
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLEV
Sbjct: 188 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLEV 221

>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score = 82.4 bits (202), Expect = 2e-13
 Identities = 42/93 (45%), Positives = 57/93 (61%)
 Frame = +1

Query: 121 FTSHFWRYIQKELGTRVELSTAFHPQNDNQFK*AI*FLRICCEPA*LTLLVIGINFYSWR 300
           FTS FW  +Q+ LGT+++ STAFHPQ D Q +  I  L        + L V    +
Sbjct: 777 FTSRFWGKLQEALGTKLDFSTAFHPQTDGQSERTIQTLEDMLRACVIDLGVRWEQYLPLV 836

Query: 301 SFFYNNSYYSNIEMAPFKVMYGRRPRSLVGWLE 399
            F YNNS+ ++I+MAPF+ +YGRR RS +GWLE
Sbjct: 837 EFAYNNSFQTSIQMAPFEALYGRRCRSPIGWLE 869