BLASTX nr result
ID: Atropa21_contig00038886
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00038886
         (1160 letters)

Database: nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]    105   3e-20
gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]  104   6e-20
ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244...  102   4e-19
gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]            102   4e-19
ref|XP_004239560.1| PREDICTED: uncharacterized protein LOC101255...   98   7e-18
gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]    95   5e-17
ref|WP_006199319.1| hypothetical protein, partial [Nodularia spu...   94   1e-16
ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...   93   2e-16
gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]   92   3e-16
gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]                  92   4e-16
gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]        92   4e-16
gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobrom...   92   4e-16
gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobrom...   92   4e-16
gb|EOY17292.1| CCHC-type integrase [Theobroma cacao]                  92   4e-16
gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobrom...   92   4e-16
gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobrom...   92   4e-16
gb|EOY03103.1| CCHC-type integrase [Theobroma cacao]                  92   4e-16
gb|EOX99807.1| CCHC-type integrase [Theobroma cacao]                  92   4e-16
ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256...   91   7e-16
gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobrom...   91   1e-15

>gb|AAT66771.2| Putative polyprotein, identical [Solanum demissum]
          Length = 1771

 Score = 105 bits (263), Expect = 3e-20
 Identities = 51/90 (56%), Positives = 67/90 (74%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +LQYI +Q+DLN R   W+ELLKDYD++ +YHP KAN++ADA
Sbjct: 1180 IWRHYLYGVRCEIYTDHRSLQYIMSQRDLNSRQRRWIELLKDYDLSILYHPGKANVVADA 1239

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRKA+SMGSLA L +     A+DIQSL+N
Sbjct: 1240 LSRKAVSMGSLAFLSVEERPLALDIQSLAN 1269

>gb|ABI34339.1| Polyprotein, 3'-partial, putative [Solanum demissum]
          Length = 1475

 Score = 104 bits (260), Expect = 6e-20
 Identities = 51/90 (56%), Positives = 66/90 (73%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +LQYI +Q+DLN R   W+ELLKDYD++ +YHP KAN++ADA
Sbjct: 1024 IWRHYLYGVRCEIYTDHRSLQYIMSQRDLNSRQRRWIELLKDYDLSILYHPGKANVVADA 1083

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRKA+SMGSLA L +     AMDIQ L+N
Sbjct: 1084 LSRKAVSMGSLAFLSVEERPLAMDIQFLAN 1113

>ref|XP_004239522.1| PREDICTED: uncharacterized protein LOC101244956 [Solanum lycopersicum]
          Length = 933

 Score = 102 bits (253), Expect = 4e-19
 Identities = 52/90 (57%), Positives = 63/90 (70%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      EVF DH +L YIFN++DLNLR   WLELL DY+MT +YHP K N++ADA
Sbjct: 650  IWSHYLYDVHCEVFTDHRSLHYIFNKRDLNLRQWRWLELLNDYEMTILYHPGKENVVADA 709

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
             S KA SMGSLA+L    H  A D+QSL+N
Sbjct: 710  SSWKAASMGSLAMLQGSEHPLAKDVQSLAN 739

>gb|ADU56212.1| gag-pol polyprotein [Solanum lycopersicum]
          Length = 624

 Score = 102 bits (253), Expect = 4e-19
 Identities = 51/89 (57%), Positives = 63/89 (70%), Gaps = 2/89 (2%)
 Frame = +2

Query: 899  WLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADAL 1072
            W H+      EV+ DH +LQY+F QKDLNLR   W+ELLKDYD+T +YHP KAN++A AL
Sbjct: 390  WRHYLYGVKCEVYTDHRSLQYVFTQKDLNLRQRRWMELLKDYDITILYHPGKANVVAVAL 449

Query: 1073 SRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            SRKA SMGSLA L    H  A ++Q L+N
Sbjct: 450  SRKAGSMGSLAHLQASRHPLAREVQILAN 478

>ref|XP_004239560.1| PREDICTED: uncharacterized protein LOC101255493 [Solanum lycopersicum]
          Length = 326

 Score = 97.8 bits (242), Expect = 7e-18
 Identities = 49/90 (54%), Positives = 64/90 (71%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+ C    EV   H +LQYIFN +DLNLR   WL+ LKDY MT +YH  K  I+A+A
Sbjct: 42   IWRHYICGLHCEVLTYHRSLQYIFNSRDLNLRKRRWLDFLKDYYMTILYHLGKTYIVANA 101

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LS+KA+SMGSLA+L +  HL + D+QSL++
Sbjct: 102  LSQKAVSMGSLAMLKVIEHLLSRDVQSLTS 131

>gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao]
          Length = 1480

 Score = 95.1 bits (235), Expect = 5e-17
 Identities = 48/90 (53%), Positives = 62/90 (68%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR H W+ELLKDYD T +YHP KAN++ADA
Sbjct: 991  IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQHRWMELLKDYDCTILYHPGKANVVADA 1050

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 1051 LSRK--SMGSLAHISIGRRSLVREIHSLGD 1078

>ref|WP_006199319.1| hypothetical protein, partial [Nodularia spumigena]
 gi|119461524|gb|EAW42594.1| hypothetical protein N9414_02846 [Nodularia spumigena CCY9414]
          Length = 68

 Score = 93.6 bits (231), Expect = 1e-16
 Identities = 42/63 (66%), Positives = 53/63 (84%)
 Frame = +2

Query: 923  EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADALSRKAMSMGSL 1102
            E+F +H+ LQ++F QKD+NLR   W+ELLKDYD+T  YHP KAN++ADALSRKA+SMGSL
Sbjct: 2    EIFTNHHILQHVFTQKDMNLRQRRWMELLKDYDVTIQYHPGKANVVADALSRKAVSMGSL 61

Query: 1103 ALL 1111
            A L
Sbjct: 62   ACL 64

>ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum]
          Length = 823

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 44/66 (66%), Positives = 52/66 (78%), Gaps = 2/66 (3%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      EVF DH +LQYIF+Q+DLNLR   WLELLKDYDMT +YHP KAN++ADA
Sbjct: 467  IWRHYLYGVHCEVFTDHRSLQYIFDQRDLNLRQRRWLELLKDYDMTILYHPGKANVVADA 526

Query: 1070 LSRKAM 1087
            LSRKA+
Sbjct: 527  LSRKAV 532

>gb|EOY19264.1| Uncharacterized protein TCM_044274 [Theobroma cacao]
          Length = 860

 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 47/90 (52%), Positives = 62/90 (68%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E+++DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 305  IWRHYLYGETCEIYMDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 364

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 365  LSRK--SMGSLAHISIGRRSLVREIHSLGD 392

>gb|EOY31663.1| CCHC-type integrase [Theobroma cacao]
          Length = 395

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 97   IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 156

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 157  LSRK--SMGSLAHISIGRRSLVREIHSLGD 184

>gb|EOY21520.1| CCHC-type integrase, putative [Theobroma cacao]
          Length = 508

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 408  IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 467

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 468  LSRK--SMGSLAHISIGRRSLVREIHSLGD 495

>gb|EOY21478.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 878

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 471  IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 530

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 531  LSRK--SMGSLAHIFIGRRSLVREIHSLGD 558

>gb|EOY20275.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 562

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 10   IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 69

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 70   LSRK--SMGSLAHISIGRRSLVREIHSLGD 97

>gb|EOY17292.1| CCHC-type integrase [Theobroma cacao]
          Length = 136

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 37   IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 96

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 97   LSRK--SMGSLAHISIGRRSLVREIHSLGD 124

>gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 1502

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 986  IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 1045

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 1046 LSRK--SMGSLAHISIGRRSLVREIHSLGD 1073

>gb|EOY08678.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 666

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 188  IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 247

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 248  LSRK--SMGSLAHISIGRRSLVREIHSLGD 275

>gb|EOY03103.1| CCHC-type integrase [Theobroma cacao]
          Length = 214

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 37   IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 96

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 97   LSRK--SMGSLAHISIGRRSLVREIHSLGD 124

>gb|EOX99807.1| CCHC-type integrase [Theobroma cacao]
          Length = 165

 Score = 92.0 bits (227), Expect = 4e-16
 Identities = 47/90 (52%), Positives = 61/90 (67%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+DLNLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 37   IWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMELLKDYDCTILYHPGKANVVADA 96

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 97   LSRK--SMGSLAHISIGRRSLVREIHSLGD 124

>ref|XP_004243106.1| PREDICTED: uncharacterized protein LOC101256304 [Solanum lycopersicum]
          Length = 647

 Score = 91.3 bits (225), Expect = 7e-16
 Identities = 49/92 (53%), Positives = 62/92 (67%), Gaps = 2/92 (2%)
 Frame = +2

Query: 890  MVIWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILA 1063
            M IW+H+      +++ DH +LQYIF QK+LNLR   WLELLKDYD+  +YHP KANI+A
Sbjct: 469  MKIWMHYLYGVHVDIYTDHKSLQYIFKQKELNLRQRRWLELLKDYDIDILYHPGKANIVA 528

Query: 1064 DALSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            DALSRK  SMGSL  +         +IQ LS+
Sbjct: 529  DALSRK--SMGSLTDVQPERRDMVWEIQWLSS 558

>gb|EOY26451.1| DNA/RNA polymerases superfamily protein [Theobroma cacao]
          Length = 679

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 46/90 (51%), Positives = 60/90 (66%), Gaps = 2/90 (2%)
 Frame = +2

Query: 896  IWLHFWCAY--EVFVDHYNLQYIFNQKDLNLR*HTWLELLKDYDMTFIYHPSKANILADA 1069
            IW H+      E++ DH +L+YIF Q+D NLR   W+ELLKDYD T +YHP KAN++ADA
Sbjct: 94   IWRHYLYGETCEIYTDHKSLKYIFQQRDFNLRQRRWMELLKDYDCTILYHPGKANVVADA 153

Query: 1070 LSRKAMSMGSLALLHMWIHLFAMDIQSLSN 1159
            LSRK  SMGSLA + +       +I SL +
Sbjct: 154  LSRK--SMGSLAHISIGRRSLVREIHSLGD 181