BLASTX nr result
ID: Bupleurum21_contig00036406
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00036406
         (591 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum]          300   9e-80
gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]        295   4e-78
gb|AER13172.1| putative gag/pol polyprotein [Phaseolus vulgaris]     282   3e-74
sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol poly...  267   1e-69
gb|AAW22873.1| putative polyprotein [Solanum lycopersicum]           260   1e-67

>dbj|BAA11674.1| unnamed protein product [Nicotiana tabacum]
          Length = 1338

 Score = 300 bits (769), Expect = 9e-80
 Identities = 145/196 (73%), Positives = 171/196 (87%)
 Frame = -3

Query: 589  KHGYQKTSTDHCVFIKRFSSTDFIILLLYVDDMLIVGPNKERIASLKQLMNGTFSMKELG 410
            +HGY+KT++DHCVF ++FS  DFIILLLYVDDMLIVG N  RI SLK+ ++  F+MK+LG
Sbjct: 981  QHGYKKTTSDHCVFAQKFSDDDFIILLLYVDDMLIVGRNVSRINSLKEQLSKFFAMKDLG 1040

Query: 409  PAKHILGMQIIRNRKAKRLWLSQEAYINKVLQRFNMEGAKPVSTPLALHFKLSTKQSPST 230
            PAK ILGM+I+R+R+AK+LWLSQE YI KVLQRFNME  K VS PLA HF+LSTKQSPST
Sbjct: 1041 PAKQILGMRIMRDREAKKLWLSQEKYIEKVLQRFNMEKTKAVSCPLANHFRLSTKQSPST 1100

Query: 229  FEEKKNMQRIPYASAVGSLMYAMVCTRPDISHAVGLVSRFLSNPGKEHWNAVKWIMRYLR 50
             +E++ M+RIPYASAVGSLMYAMVCTRPDI+HAVG+VSRFLSNPGKEHW+AVKWI+RYLR
Sbjct: 1101 DDERRKMERIPYASAVGSLMYAMVCTRPDIAHAVGVVSRFLSNPGKEHWDAVKWILRYLR 1160

Query: 49   GTADLKLCFGSDNASL 2
            GT+ L LCFG DN  L
Sbjct: 1161 GTSKLCLCFGEDNPVL 1176

>gb|AAV88069.1| hypothetical retrotransposon [Ipomoea batatas]
          Length = 1415

 Score = 295 bits (755), Expect = 4e-78
 Identities = 138/191 (72%), Positives = 167/191 (87%)
 Frame = -3

Query: 589  KHGYQKTSTDHCVFIKRFSSTDFIILLLYVDDMLIVGPNKERIASLKQLMNGTFSMKELG 410
            KHGY+KTS+DHCVF+ R+S  DF+ILLLYVDDMLIVG N  RI  LKQ ++ +FSMK++G
Sbjct: 978  KHGYKKTSSDHCVFVNRYSDDDFVILLLYVDDMLIVGRNASRIQELKQELSKSFSMKDMG 1037

Query: 409  PAKHILGMQIIRNRKAKRLWLSQEAYINKVLQRFNMEGAKPVSTPLALHFKLSTKQSPST 230
            PAK ILGM+IIR+R+ K+LWLSQE YI KVL+RF+M  AKPVSTPL +HFKL  KQ PS+
Sbjct: 1038 PAKQILGMKIIRDRQNKKLWLSQEKYIEKVLERFHMNEAKPVSTPLDMHFKLCKKQCPSS 1097

Query: 229  FEEKKNMQRIPYASAVGSLMYAMVCTRPDISHAVGLVSRFLSNPGKEHWNAVKWIMRYLR 50
             +EK+ MQR+PY+SAVGSLMYAMVCTRPDI+HAVG+VSRFLSNPG+EHW+AVKWI+RYLR
Sbjct: 1098 EKEKEEMQRVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLSNPGREHWDAVKWILRYLR 1157

Query: 49   GTADLKLCFGS 17
            GT+ L LCFG+
Sbjct: 1158 GTSSLSLCFGT 1168

>gb|AER13172.1| putative gag/pol polyprotein [Phaseolus vulgaris]
          Length = 1556

 Score = 282 bits (722), Expect = 3e-74
 Identities = 134/196 (68%), Positives = 166/196 (84%)
 Frame = -3

Query: 589  KHGYQKTSTDHCVFIKRFSSTDFIILLLYVDDMLIVGPNKERIASLKQLMNGTFSMKELG 410
            + GY+KT++DHCVF+K+F++ DFIILLLYVDD+LIVG +   I  LK+ ++ +F+MK++G
Sbjct: 924  EQGYKKTTSDHCVFVKKFANDDFIILLLYVDDILIVGKDISMINRLKKQLSESFAMKDMG 983

Query: 409  PAKHILGMQIIRNRKAKRLWLSQEAYINKVLQRFNMEGAKPVSTPLALHFKLSTKQSPST 230
             AK ILG++I+R+R+ K+LWLSQE Y+ +VLQRF ME AK VSTPLA HFKLSTKQSPS
Sbjct: 984  AAKQILGIRIMRDRQEKKLWLSQENYVKRVLQRFQMENAKVVSTPLATHFKLSTKQSPSY 1043

Query: 229  FEEKKNMQRIPYASAVGSLMYAMVCTRPDISHAVGLVSRFLSNPGKEHWNAVKWIMRYLR 50
              EK +MQRIPYASAVGSLMYAMVCTRPDI+H VG VSRF+SNPG+EHWNAVKWI+RYLR
Sbjct: 1044 EYEKSDMQRIPYASAVGSLMYAMVCTRPDIAHVVGTVSRFMSNPGREHWNAVKWILRYLR 1103

Query: 49   GTADLKLCFGSDNASL 2
            GT  L+LCFG D  +L
Sbjct: 1104 GTTCLRLCFGGDKPTL 1119

>sp|P10978.1|POLX_TOBAC RecName: Full=Retrovirus-related Pol polyprotein from
           transposon TNT 1-94; Includes: RecName: Full=Protease;
           Includes: RecName: Full=Reverse transcriptase; Includes:
           RecName: Full=Endonuclease
 gi|20045|emb|CAA32025.1| unnamed protein product [Nicotiana tabacum]
          Length = 1328

 Score = 267 bits (682), Expect = 1e-69
 Identities = 130/193 (67%), Positives = 155/193 (80%)
 Frame = -3

Query: 580  YQKTSTDHCVFIKRFSSTDFIILLLYVDDMLIVGPNKERIASLKQLMNGTFSMKELGPAK 401
            Y KT +D CV+ KRFS  +FIILLLYVDDMLIVG +K  IA LK  ++ +F MK+LGPA+
Sbjct: 983  YLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAKLKGDLSKSFDMKDLGPAQ 1042

Query: 400  HILGMQIIRNRKAKRLWLSQEAYINKVLQRFNMEGAKPVSTPLALHFKLSTKQSPSTFEE 221
             ILGM+I+R R +++LWLSQE YI +VL+RFNM+ AKPVSTPLA H KLS K  P+T EE
Sbjct: 1043 QILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTPLAGHLKLSKKMCPTTVEE 1102

Query: 220  KKNMQRIPYASAVGSLMYAMVCTRPDISHAVGLVSRFLSNPGKEHWNAVKWIMRYLRGTA 41
            K NM ++PY+SAVGSLMYAMVCTRPDI+HAVG+VSRFL NPGKEHW AVKWI+RYLRGT
Sbjct: 1103 KGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENPGKEHWEAVKWILRYLRGTT 1162

Query: 40   DLKLCFGSDNASL 2
               LCFG  +  L
Sbjct: 1163 GDCLCFGGSDPIL 1175

>gb|AAW22873.1| putative polyprotein [Solanum lycopersicum]
          Length = 687

 Score = 260 bits (665), Expect = 1e-67
 Identities = 123/187 (65%), Positives = 155/187 (82%)
 Frame = -3

Query: 580  YQKTSTDHCVFIKRFSSTDFIILLLYVDDMLIVGPNKERIASLKQLMNGTFSMKELGPAK 401
            Y++T+ D CV+ ++FS  +FIIL LYVDDMLIVG + E I  LK+ ++ +F MK+LGPAK
Sbjct: 343  YKRTTADPCVYFRKFSEGNFIILCLYVDDMLIVGQDVEMICRLKEDLSKSFDMKDLGPAK 402

Query: 400  HILGMQIIRNRKAKRLWLSQEAYINKVLQRFNMEGAKPVSTPLALHFKLSTKQSPSTFEE 221
             ILGM+I R+RKA +LWLSQE YI +VL+RFNM+ AKPV+TPLA HFKLS +  P+T +E
Sbjct: 403  QILGMEIARDRKAGKLWLSQENYIERVLERFNMKNAKPVNTPLAAHFKLSKRCCPTTEKE 462

Query: 220  KKNMQRIPYASAVGSLMYAMVCTRPDISHAVGLVSRFLSNPGKEHWNAVKWIMRYLRGTA 41
            K++M  IPY+S VGSLMYAMVCTRPDI+HAVGLVSR+L+NP K HW AVKWI+RYLRGT+
Sbjct: 463  KESMSHIPYSSVVGSLMYAMVCTRPDIAHAVGLVSRYLANPSKVHWEAVKWILRYLRGTS 522

Query: 40   DLKLCFG 20
            +L LCFG
Sbjct: 523  NLSLCFG 529