BLASTX nr result
ID: Lithospermum22_contig00026763
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Lithospermum22_contig00026763 (1538 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|AAC64917.1| gag-pol polyprotein [Glycine max] 214 5e-53 gb|AAO73527.1| gag-pol polyprotein [Glycine max] 214 5e-53 ref|NP_001235160.1| gag-protease polyprotein [Glycine max] gi|74... 212 2e-52 gb|AAO73529.1| gag-pol polyprotein [Glycine max] 209 2e-51 gb|AAO73521.1| gag-pol polyprotein [Glycine max] 208 3e-51 >gb|AAC64917.1| gag-pol polyprotein [Glycine max] Length = 1550 Score = 214 bits (545), Expect = 5e-53 Identities = 155/444 (34%), Positives = 216/444 (48%), Gaps = 50/444 (11%) Frame = +2 Query: 11 AKEAWETLACVFEGTHKVRVCRLQFLTTKFENLRMKEDETIIEFIIKLKEISNEFYDLGE 190 AK+AWE L EGT KV++ RLQ L TKFENL+MKE+E I EF + + EI+N LGE Sbjct: 78 AKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHEFHMNILEIANACTALGE 137 Query: 191 IMSEEKIVRKILRSLPNRFDSKVVVIEEANDISIMKVDELFGSLITFEMSLDDRRCK--- 361 M++EK+VRKILRSLP RFD KV IEEA DI M+VDEL GSL TFE+ L DR K Sbjct: 138 RMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSK 197 Query: 362 -------DKKYGNTVKPDFSDSLSNSRFLLDTNVGK---RMDQ----------------- 460 D+ + D + L+N+ LL K RMD+ Sbjct: 198 NLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGS 257 Query: 461 -----------*ATEIQCKGCEGYGHNQIQCPTFSQKK---CNIILSRE-KSKKLKDGVG 595 + IQC GCEGYGH + +CPT +K+ ++ S + +S++ D Sbjct: 258 EYQKRSDEKPSHSKGIQCHGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDR 317 Query: 596 KIVAFTAK--ATESDCMIXXXXXXXXXXXXXXXXXXXXXKLLAHCNKLTGLYMKQEKENK 769 + A T + + E K+L +L + E E + Sbjct: 318 DVNALTGRFESAEDSSDTDSEITFDELAISYRELCIKSEKILQQEAQLKKVIANLEAEKE 377 Query: 770 RLLDENNEMLIVNSGLNEKVKGLNAELEAMKKSVHMLNSGTSKLDEILSLGQSPNDHTGV 949 DE +E L ++ LN++LE M KS+ MLN G+ LDE+L LG++ + G+ Sbjct: 378 AHEDEISE-------LKGEIGFLNSKLENMTKSIKMLNKGSDLLDEVLQLGKNVGNQRGL 430 Query: 950 GY---VAGECSKESKFVPAAKIKESQLLYQXXXXXXXXXXXXXXRPHIEALPKNKRKKKW 1120 G+ AG + ++FVPA + + H K ++KKW Sbjct: 431 GFNHKSAGRTTM-TEFVPAKNSTGATMSQHRSR-------------HHGTQQKKSKRKKW 476 Query: 1121 ICHHCGKKGHIRPYCYKLYGRNIH 1192 CH+CGK GHI+P+CY L+G H Sbjct: 477 RCHYCGKYGHIKPFCYHLHGHPHH 500 >gb|AAO73527.1| gag-pol polyprotein [Glycine max] Length = 1576 Score = 214 bits (545), Expect = 5e-53 Identities = 156/443 (35%), Positives = 215/443 (48%), Gaps = 49/443 (11%) Frame = +2 Query: 11 AKEAWETLACVFEGTHKVRVCRLQFLTTKFENLRMKEDETIIEFIIKLKEISNEFYDLGE 190 AK+AWE L EGT KV++ RLQ L TKFENL+MKE+E I +F + + EI+N LGE Sbjct: 105 AKDAWEILKITHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGE 164 Query: 191 IMSEEKIVRKILRSLPNRFDSKVVVIEEANDISIMKVDELFGSLITFEMSLDDRRCK--- 361 +++EK+VRKILRSLP RFD KV IEEA DI M+VDEL GSL TFE+ L DR K Sbjct: 165 RITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSK 224 Query: 362 -------DKKYGNTVKPDFSDSLSNSRFLLDTNVGK---RMDQ----------------- 460 D+ + D + L+N+ LL K RMD+ Sbjct: 225 NLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGS 284 Query: 461 -----------*ATEIQCKGCEGYGHNQIQCPTF---SQKKCNIILSREKSKKLKDGVGK 598 + IQC GCEGYGH +CPT +K ++ S +S++ D Sbjct: 285 KYQKRSDVKPSHSKGIQCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRD 344 Query: 599 IVAFTA--KATESDCMIXXXXXXXXXXXXXXXXXXXXXKLLAHCNKLTGLYMKQEKENKR 772 + A T + E K+L +L + E E + Sbjct: 345 VNALTGIFETAEDSSDTDSEITFDELAASYRKLCIKSEKILQQEAQLKKVIADLEAEKEA 404 Query: 773 LLDENNEMLIVNSGLNEKVKGLNAELEAMKKSVHMLNSGTSKLDEILSLGQSPNDHTGVG 952 +E +E L +V LN++LE MKKS+ MLN G+ LDE+L LG++ + G+G Sbjct: 405 HEEEISE-------LKGEVGFLNSKLETMKKSIKMLNKGSDTLDEVLLLGKNAGNQRGLG 457 Query: 953 Y---VAGECSKESKFVPAAKIKESQLLYQXXXXXXXXXXXXXXRPHIEALPKNKRKKKWI 1123 + AG + ++FVP AK + + Q H K ++KKW Sbjct: 458 FNPKFAGRTTM-TEFVP-AKNRTGTTMSQHLSR------------HHGTQQKKSKRKKWR 503 Query: 1124 CHHCGKKGHIRPYCYKLYGRNIH 1192 CH+CGK GHI+P+CY L+G H Sbjct: 504 CHYCGKYGHIKPFCYHLHGHPHH 526 >ref|NP_001235160.1| gag-protease polyprotein [Glycine max] gi|7488678|pir||T06419 gag-proteinase polyprotein - soybean retrovirus-like element gi|905361|gb|AAC18777.1| gag-protease polyprotein [Glycine max] Length = 640 Score = 212 bits (539), Expect = 2e-52 Identities = 155/444 (34%), Positives = 217/444 (48%), Gaps = 50/444 (11%) Frame = +2 Query: 11 AKEAWETLACVFEGTHKVRVCRLQFLTTKFENLRMKEDETIIEFIIKLKEISNEFYDLGE 190 AK+AWE L EGT KV++ RLQ L TKFENL+MKE+E I +F + + EI+N LGE Sbjct: 105 AKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGE 164 Query: 191 IMSEEKIVRKILRSLPNRFDSKVVVIEEANDISIMKVDELFGSLITFEMSLDDRRCK--- 361 M++EK+VRKILRSLP RFD KV IEEA DI ++VDEL GSL TFE+ L DR K Sbjct: 165 RMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNLRVDELIGSLQTFELGLSDRTEKKSK 224 Query: 362 -------DKKYGNTVKPDFSDSLSNSRFLLDTNVGK---RMDQ----------------- 460 D+ + D + L+N+ LL K RMD+ Sbjct: 225 NLAFVSNDEGEEDEYDLDTDEGLTNAVVLLGKQFNKVLNRMDRRQKPHVRNIPFDIRKGS 284 Query: 461 -----------*ATEIQCKGCEGYGHNQIQCPTFSQKK---CNIILSRE-KSKKLKDGVG 595 + QC GCEGYGH + +CPT +K+ ++ S + +S++ D Sbjct: 285 EYQKRSDEKPSHSKGFQCHGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDR 344 Query: 596 KIVAFTAK--ATESDCMIXXXXXXXXXXXXXXXXXXXXXKLLAHCNKLTGLYMKQEKENK 769 + A T + + E K+L +L + E E + Sbjct: 345 DVNALTGRFESAEDSSDTDSEITFDELATSYRELCIKSEKILQQEAQLKKVIANLEAEKE 404 Query: 770 RLLDENNEMLIVNSGLNEKVKGLNAELEAMKKSVHMLNSGTSKLDEILSLGQSPNDHTGV 949 +E +E L +V LN++LE M KS+ MLN G+ LDE+L LG++ + G+ Sbjct: 405 AHEEEISE-------LKGEVGFLNSKLENMTKSIKMLNKGSDMLDEVLQLGKNVGNQRGL 457 Query: 950 GY---VAGECSKESKFVPAAKIKESQLLYQXXXXXXXXXXXXXXRPHIEALPKNKRKKKW 1120 G+ AG + ++FVP AKI + Q H K ++KKW Sbjct: 458 GFNHKSAGRITM-TEFVP-AKISTGATMSQHRSR------------HHGTQQKKSKRKKW 503 Query: 1121 ICHHCGKKGHIRPYCYKLYGRNIH 1192 CH+CGK GHI+P+CY L+G H Sbjct: 504 RCHYCGKYGHIKPFCYHLHGHPHH 527 >gb|AAO73529.1| gag-pol polyprotein [Glycine max] Length = 1577 Score = 209 bits (532), Expect = 2e-51 Identities = 152/444 (34%), Positives = 214/444 (48%), Gaps = 50/444 (11%) Frame = +2 Query: 11 AKEAWETLACVFEGTHKVRVCRLQFLTTKFENLRMKEDETIIEFIIKLKEISNEFYDLGE 190 AK+AWE L EGT KV++ RLQ L TKFENL+MKE+E I +F + + EI+N LGE Sbjct: 105 AKDAWEILKTTHEGTSKVKMSRLQLLATKFENLKMKEEECIHDFHMTILEIANACTALGE 164 Query: 191 IMSEEKIVRKILRSLPNRFDSKVVVIEEANDISIMKVDELFGSLITFEMSLDDRRCK--- 361 M++EK+VRKILRSLP RFD KV IEEA DI M+VDEL GSL TFE+ L DR K Sbjct: 165 RMTDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRTEKKSK 224 Query: 362 -------DKKYGNTVKPDFSDSLSNSRFLLDTNVGK---RMDQ----------------- 460 D+ + D + L+N+ L K RMD+ Sbjct: 225 NLAFVSNDEGEEDEYDLDTDEGLTNAVVFLGKQFNKVLNRMDRRQKPHVRNISLDIRKGS 284 Query: 461 -----------*ATEIQCKGCEGYGHNQIQCPTFSQKK---CNIILSRE-KSKKLKDGVG 595 + IQC+GCEGYGH + +CPT +K+ ++ S + +S++ D Sbjct: 285 EYQRKSDEKPSHSKGIQCRGCEGYGHIKAECPTHLKKQRKGLSVCRSDDTESEQESDSDR 344 Query: 596 KIVAFTAK--ATESDCMIXXXXXXXXXXXXXXXXXXXXXKLLAHCNKLTGLYMKQEKENK 769 + A T + + E K+L +L + E E + Sbjct: 345 DVNALTGRFESAEDSSDTDSEITFDELAIFYRELCIKSEKILQQEAQLKKVIANLEAEKE 404 Query: 770 RLLDENNEMLIVNSGLNEKVKGLNAELEAMKKSVHMLNSGTSKLDEILSLGQSPNDHTGV 949 +E S L +V LN++LE M KS+ MLN G+ LD++L LG+ + G+ Sbjct: 405 AHEEE-------ISKLKGEVGFLNSKLENMTKSIKMLNKGSDMLDZVLQLGKKVGNQRGL 457 Query: 950 GY---VAGECSKESKFVPAAKIKESQLLYQXXXXXXXXXXXXXXRPHIEALPKNKRKKKW 1120 G+ AG + ++FVPA + + H K ++KKW Sbjct: 458 GFNHKSAGRTTM-TEFVPAKNSTGATMSQHRSR-------------HHGTQQKRSKRKKW 503 Query: 1121 ICHHCGKKGHIRPYCYKLYGRNIH 1192 CH+CGK GHI+P+CY L+G H Sbjct: 504 RCHYCGKYGHIKPFCYHLHGHPHH 527 >gb|AAO73521.1| gag-pol polyprotein [Glycine max] Length = 1574 Score = 208 bits (530), Expect = 3e-51 Identities = 152/437 (34%), Positives = 209/437 (47%), Gaps = 48/437 (10%) Frame = +2 Query: 11 AKEAWETLACVFEGTHKVRVCRLQFLTTKFENLRMKEDETIIEFIIKLKEISNEFYDLGE 190 AK+AWE L EGT KV++ RLQ L TKFENL+MKE+E I +F + + EI+N LGE Sbjct: 105 AKDAWEILKITHEGTSKVKISRLQLLATKFENLKMKEEECIHDFHMNILEIANACTALGE 164 Query: 191 IMSEEKIVRKILRSLPNRFDSKVVVIEEANDISIMKVDELFGSLITFEMSLDDRRCKDKK 370 +++EK+VRKILRSLP RFD KV IEEA DI M+VDEL GSL TFE+ L DR K K Sbjct: 165 RITDEKLVRKILRSLPKRFDMKVTAIEEAQDICNMRVDELIGSLQTFELGLSDRAEKKSK 224 Query: 371 YGNTVKPD----------FSDSLSNSRFLLDTNVGK---RMDQ----------------- 460 V D + L+N+ LL K RMD+ Sbjct: 225 NLAFVSNDEGEEDEYDLNTDEGLTNAVVLLGKQFNKVLNRMDKRQKPHVQNIPFDIRKGS 284 Query: 461 -----------*ATEIQCKGCEGYGHNQIQCPTF---SQKKCNIILSREKSKKLKDGVGK 598 + IQC GCEGYGH +CPT +K ++ S +S++ D Sbjct: 285 KYQKKSDVKPSHSKGIQCHGCEGYGHIIAECPTHLKKHRKGLSVCQSDTESEQESDSDRD 344 Query: 599 IVAFTA--KATESDCMIXXXXXXXXXXXXXXXXXXXXXKLLAHCNKLTGLYMKQEKENKR 772 + A T + E K+L +L + E E + Sbjct: 345 VNALTGIFETAEDSSDTDSEITFDELATSYRKLCIKSEKILQQEAQLKKVIADLEAEKEA 404 Query: 773 LLDENNEMLIVNSGLNEKVKGLNAELEAMKKSVHMLNSGTSKLDEILSLGQSPNDHTGVG 952 +E +E L +V LN++LE M KS+ MLN G+ LDE+L LG++ + G+G Sbjct: 405 HKEEISE-------LKGEVGFLNSKLENMTKSIKMLNKGSDTLDEVLLLGKNAGNQRGLG 457 Query: 953 YVAGECSKE--SKFVPAAKIKESQLLYQXXXXXXXXXXXXXXRPHIEALPKNKRKKKWIC 1126 + + ++FVP AK + + Q H K ++KKW C Sbjct: 458 FNPKSAGRTTMTEFVP-AKNRTGATMSQHRSR------------HHGMQQKKSKRKKWRC 504 Query: 1127 HHCGKKGHIRPYCYKLY 1177 H+CGK GHI+P+CY L+ Sbjct: 505 HYCGKYGHIKPFCYHLH 521