BLASTX nr result
ID: Bupleurum21_contig00014051
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00014051 (1477 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2... 226 2e-56 gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub... 197 8e-48 dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] 193 8e-47 dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ... 193 8e-47 gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] 192 2e-46 >ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1| predicted protein [Populus trichocarpa] Length = 517 Score = 226 bits (575), Expect = 2e-56 Identities = 126/405 (31%), Positives = 208/405 (51%), Gaps = 15/405 (3%) Frame = -2 Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291 LS+AGRVQLINS+L I+ YW+ LP ++K ++ I FLW G ++ KVAW Sbjct: 92 LSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQ 151 Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILT-SKQSLWVSWFYSQYLRTKSIWSIKVQP 1114 C+P EGGLG++ I WN A+L +W + S S+W +W S LR ++ W+IK Sbjct: 152 VCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQ 211 Query: 1113 SASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW-PGDSLVNHFGTQIISIMVSSHD 937 + SW +IL +S A +KY +G W+D W P L + +G + I + + Sbjct: 212 NCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKN 271 Query: 936 VRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIP------IHREDSIVWDDSPVNK--VSD 781 + ++ + W T AI ++ IP + ++D +VW DSP ++ V Sbjct: 272 AKVNVLIQNSEWKTPTTQ---AIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKV 328 Query: 780 IWQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCN 601 W+ +R RQ W VW + + SF+LW+A++ +L ++D++ RFG+ C LC Sbjct: 329 AWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCL 388 Query: 600 SADESISHLFVECTYSAAI----LNACPF-QLSNQWDDFLQGRFFQTPATCIQKEIAYLY 436 +E +HLF EC+Y+ AI + C +++ WD++++ L Sbjct: 389 RNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVNFSCKLS 448 Query: 435 YAVAVYSIWK*RNARLHQQQTPMLPSAITMNVKAMVREKLASCRN 301 +A VY +W+ RNAR+ + P+ + ++ ++R+KL RN Sbjct: 449 FAATVYHVWQERNARIFAGMS-RTPNLVLNQIECIIRDKLDLMRN 492 >gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata] Length = 441 Score = 197 bits (500), Expect = 8e-48 Identities = 127/406 (31%), Positives = 195/406 (48%), Gaps = 16/406 (3%) Frame = -2 Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291 LSFAGR+QLI+S++ + +W LP++ +K I + S FLW G L + KV+W D Sbjct: 25 LSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEIDGLCSAFLWSGPELNRKKAKVSWND 84 Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIK-VQP 1114 C+P EGGLGLR + N L +WR+L+S SLWV W +R S WS++ Sbjct: 85 VCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSS-SSLWVQWLRQYVIRKGSFWSLRDTST 143 Query: 1113 SASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW-PGDSLVNHFGTQ-IISIMVSSH 940 SW +++L + A + +Y + FW+D W P L+ GT+ I + + H Sbjct: 144 LGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLIAISGTRGCIDMGIDIH 203 Query: 939 DVRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIPIHREDSIVWDD-----SPVNKVSDIW 775 A + +Q+ L L + ED ++W P + W Sbjct: 204 ATVAEALTHRRRRHRADHLNQMEAQLEELRTKGLVETEDVVLWKGKGGRFKPSFSTKETW 263 Query: 774 QTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNSA 595 R ++ W+ +W + K SFI WLA K+RL + DRM+ + +N +C+ C Sbjct: 264 ADTREQKPRNEWYQGIWFSHATPKYSFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQ 323 Query: 594 DESISHLFVECTYSAAILNACPFQL-----SNQWDDFLQGRFFQTPATCIQKEIAYLYYA 430 E+ +HLF C YS + + +L S W L+ T T + L YA Sbjct: 324 TETRNHLFFTCRYSREVWSGLTSKLLTRHYSTDWTTILK---LLTDKTLGNNRLFLLRYA 380 Query: 429 --VAVYSIWK*RNARLHQQQTPMLPSAITM-NVKAMVREKLASCRN 301 + VYSIWK RN+R H ++ LPSA+ + + VR KL++ R+ Sbjct: 381 FQILVYSIWKERNSRRHGEEP--LPSALLLKRLDKEVRNKLSTIRD 424 >dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana] Length = 1072 Score = 193 bits (491), Expect = 8e-47 Identities = 117/379 (30%), Positives = 176/379 (46%), Gaps = 17/379 (4%) Frame = -2 Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291 LSFAGR QLI+S++ G+ +W LP +K I+S+ SKFLW G + + KV+W D Sbjct: 658 LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717 Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111 CC+P EGGLG R WN +L +W + SLW W L S W + + Sbjct: 718 CCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQT 777 Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPWPG-DSLVNHFGTQIISIMVSSHDV 934 W K +LN + A ++IK VG FW+D W L+ + G + Sbjct: 778 DPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSA 837 Query: 933 RAGSFLRGNSW----STGLLNHQLAIDLRHLLHTIPIHREDSIVWDDSPVN----KVSDI 778 + + G+ W S L + L L P+ DS W V+ + Sbjct: 838 KVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKT 897 Query: 777 WQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNS 598 W+ +RPRR W +VW + K +F W A +RL ++ R++ +G+V + C LC+ Sbjct: 898 WEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSF 957 Query: 597 ADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPA--TCIQKEIAY 442 E+ HL + C +S+ + L CP Q L W + L T A + ++K +A Sbjct: 958 DTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQ 1017 Query: 441 LYYAVAVYSIWK*RNARLH 385 L VY++W+ RN LH Sbjct: 1018 L----VVYNLWRQRNLVLH 1032 >dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis thaliana] Length = 1072 Score = 193 bits (491), Expect = 8e-47 Identities = 117/379 (30%), Positives = 176/379 (46%), Gaps = 17/379 (4%) Frame = -2 Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291 LSFAGR QLI+S++ G+ +W LP +K I+S+ SKFLW G + + KV+W D Sbjct: 658 LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717 Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111 CC+P EGGLG R WN +L +W + SLW W L S W + + Sbjct: 718 CCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQT 777 Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPWPG-DSLVNHFGTQIISIMVSSHDV 934 W K +LN + A ++IK VG FW+D W L+ + G + Sbjct: 778 DPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSA 837 Query: 933 RAGSFLRGNSW----STGLLNHQLAIDLRHLLHTIPIHREDSIVWDDSPVN----KVSDI 778 + + G+ W S L + L L P+ DS W V+ + Sbjct: 838 KVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKT 897 Query: 777 WQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNS 598 W+ +RPRR W +VW + K +F W A +RL ++ R++ +G+V + C LC+ Sbjct: 898 WEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSF 957 Query: 597 ADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPA--TCIQKEIAY 442 E+ HL + C +S+ + L CP Q L W + L T A + ++K +A Sbjct: 958 DTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQ 1017 Query: 441 LYYAVAVYSIWK*RNARLH 385 L VY++W+ RN LH Sbjct: 1018 L----VVYNLWRQRNLVLH 1032 >gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana] Length = 1213 Score = 192 bits (488), Expect = 2e-46 Identities = 118/412 (28%), Positives = 199/412 (48%), Gaps = 19/412 (4%) Frame = -2 Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291 LSFAGR+QLI+S++ G +W LP +K I+S+ S+FLW G + Q KV+WA Sbjct: 798 LSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAA 857 Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111 C+P EGGLGLR + WN + +WR+ +K SLW W + +L S W+++ S Sbjct: 858 LCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQS 917 Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW----PGDSLVNHFGTQIISIMVSS 943 SW K++L+ + A Q++ VG +WYD W P ++ G + + + + Sbjct: 918 DSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLA 977 Query: 942 HDVRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIPI--HREDSIVWDDSPVN-------K 790 + S + W + A + L T+P+ ++ + + VN Sbjct: 978 ---KVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFS 1034 Query: 789 VSDIWQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCL 610 + W+ IRP+ W ++W + K +F +W++ +RLL++ R+ +G + + C+ Sbjct: 1035 AAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACV 1094 Query: 609 LCNSADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPATCIQKEI 448 LC+ A ES HL + C +SA + CP Q L + W + L +P + Sbjct: 1095 LCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEA--PPLL 1152 Query: 447 AYLYYAVAVYSIWK*RNARLHQQQTPMLPSAITMNVKAMVREKLASCRNFKK 292 + V VY++W+ RN LH + P+ I V +R ++S R K+ Sbjct: 1153 RKIVSQVVVYNLWRQRNNLLH-NSLRLAPAVIFKLVDREIRNIISSRRLRKR 1203