BLASTX nr result

ID: Bupleurum21_contig00014051

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00014051
         (1477 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   226   2e-56
gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata sub...   197   8e-48
dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]           193   8e-47
dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like ...   193   8e-47
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       192   2e-46

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  226 bits (575), Expect = 2e-56
 Identities = 126/405 (31%), Positives = 208/405 (51%), Gaps = 15/405 (3%)
 Frame = -2

Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291
            LS+AGRVQLINS+L  I+ YW+    LP  ++K ++ I   FLW G  ++    KVAW  
Sbjct: 92   LSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQ 151

Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILT-SKQSLWVSWFYSQYLRTKSIWSIKVQP 1114
             C+P  EGGLG++ I  WN  A+L  +W +   S  S+W +W  S  LR ++ W+IK   
Sbjct: 152  VCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQ 211

Query: 1113 SASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW-PGDSLVNHFGTQIISIMVSSHD 937
            + SW   +IL  +S A   +KY +G       W+D W P   L + +G + I     + +
Sbjct: 212  NCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKN 271

Query: 936  VRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIP------IHREDSIVWDDSPVNK--VSD 781
             +    ++ + W T       AI    ++  IP      + ++D +VW DSP ++  V  
Sbjct: 272  AKVNVLIQNSEWKTPTTQ---AIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKV 328

Query: 780  IWQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCN 601
             W+ +R  RQ   W   VW    + + SF+LW+A++ +L ++D++ RFG+     C LC 
Sbjct: 329  AWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCL 388

Query: 600  SADESISHLFVECTYSAAI----LNACPF-QLSNQWDDFLQGRFFQTPATCIQKEIAYLY 436
              +E  +HLF EC+Y+ AI     + C   +++  WD++++                 L 
Sbjct: 389  RNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVNFSCKLS 448

Query: 435  YAVAVYSIWK*RNARLHQQQTPMLPSAITMNVKAMVREKLASCRN 301
            +A  VY +W+ RNAR+    +   P+ +   ++ ++R+KL   RN
Sbjct: 449  FAATVYHVWQERNARIFAGMS-RTPNLVLNQIECIIRDKLDLMRN 492


>gb|ABW81051.1| tn7 reverse transcriptase [Arabidopsis lyrata subsp. lyrata]
          Length = 441

 Score =  197 bits (500), Expect = 8e-48
 Identities = 127/406 (31%), Positives = 195/406 (48%), Gaps = 16/406 (3%)
 Frame = -2

Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291
            LSFAGR+QLI+S++  +  +W     LP++ +K I  + S FLW G  L +   KV+W D
Sbjct: 25   LSFAGRLQLISSVIHSLTNFWMSAFRLPNACIKEIDGLCSAFLWSGPELNRKKAKVSWND 84

Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIK-VQP 1114
             C+P  EGGLGLR +   N    L  +WR+L+S  SLWV W     +R  S WS++    
Sbjct: 85   VCMPKEEGGLGLRSLTEANKVCCLKLIWRLLSS-SSLWVQWLRQYVIRKGSFWSLRDTST 143

Query: 1113 SASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW-PGDSLVNHFGTQ-IISIMVSSH 940
              SW  +++L  +  A  + +Y +       FW+D W P   L+   GT+  I + +  H
Sbjct: 144  LGSWMWRKLLKYRHLASGFTQYEIRNGKGVSFWHDNWSPLGPLIAISGTRGCIDMGIDIH 203

Query: 939  DVRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIPIHREDSIVWDD-----SPVNKVSDIW 775
               A +             +Q+   L  L     +  ED ++W        P     + W
Sbjct: 204  ATVAEALTHRRRRHRADHLNQMEAQLEELRTKGLVETEDVVLWKGKGGRFKPSFSTKETW 263

Query: 774  QTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNSA 595
               R ++    W+  +W +    K SFI WLA K+RL + DRM+ +   +N +C+ C   
Sbjct: 264  ADTREQKPRNEWYQGIWFSHATPKYSFITWLATKNRLSTGDRMMSWNAGVNLSCVFCQEQ 323

Query: 594  DESISHLFVECTYSAAILNACPFQL-----SNQWDDFLQGRFFQTPATCIQKEIAYLYYA 430
             E+ +HLF  C YS  + +    +L     S  W   L+     T  T     +  L YA
Sbjct: 324  TETRNHLFFTCRYSREVWSGLTSKLLTRHYSTDWTTILK---LLTDKTLGNNRLFLLRYA 380

Query: 429  --VAVYSIWK*RNARLHQQQTPMLPSAITM-NVKAMVREKLASCRN 301
              + VYSIWK RN+R H ++   LPSA+ +  +   VR KL++ R+
Sbjct: 381  FQILVYSIWKERNSRRHGEEP--LPSALLLKRLDKEVRNKLSTIRD 424


>dbj|BAF01687.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1072

 Score =  193 bits (491), Expect = 8e-47
 Identities = 117/379 (30%), Positives = 176/379 (46%), Gaps = 17/379 (4%)
 Frame = -2

Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291
            LSFAGR QLI+S++ G+  +W     LP   +K I+S+ SKFLW G +  +   KV+W D
Sbjct: 658  LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717

Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111
            CC+P  EGGLG R    WN   +L  +W +     SLW  W     L   S W +    +
Sbjct: 718  CCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQT 777

Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPWPG-DSLVNHFGTQIISIMVSSHDV 934
              W  K +LN +  A ++IK  VG      FW+D W     L+ + G      +      
Sbjct: 778  DPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSA 837

Query: 933  RAGSFLRGNSW----STGLLNHQLAIDLRHLLHTIPIHREDSIVWDDSPVN----KVSDI 778
            +    + G+ W    S  L    +   L  L    P+   DS  W    V+      +  
Sbjct: 838  KVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKT 897

Query: 777  WQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNS 598
            W+ +RPRR    W  +VW    + K +F  W A  +RL ++ R++ +G+V +  C LC+ 
Sbjct: 898  WEVLRPRRPVKRWARSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSF 957

Query: 597  ADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPA--TCIQKEIAY 442
              E+  HL + C +S+ +     L  CP Q L   W + L      T A  + ++K +A 
Sbjct: 958  DTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQ 1017

Query: 441  LYYAVAVYSIWK*RNARLH 385
            L     VY++W+ RN  LH
Sbjct: 1018 L----VVYNLWRQRNLVLH 1032


>dbj|BAA97290.1| non-LTR retroelement reverse transcriptase-like [Arabidopsis
            thaliana]
          Length = 1072

 Score =  193 bits (491), Expect = 8e-47
 Identities = 117/379 (30%), Positives = 176/379 (46%), Gaps = 17/379 (4%)
 Frame = -2

Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291
            LSFAGR QLI+S++ G+  +W     LP   +K I+S+ SKFLW G +  +   KV+W D
Sbjct: 658  LSFAGRTQLISSVIFGLINFWMSTFLLPKGCIKKIESLCSKFLWAGSIDGRKSSKVSWVD 717

Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111
            CC+P  EGGLG R    WN   +L  +W +     SLW  W     L   S W +    +
Sbjct: 718  CCLPKSEGGLGFRSFGEWNKTLLLRLIWVLFDRDTSLWAQWQRHHRLGHASFWQVNALQT 777

Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPWPG-DSLVNHFGTQIISIMVSSHDV 934
              W  K +LN +  A ++IK  VG      FW+D W     L+ + G      +      
Sbjct: 778  DPWTWKMLLNLRPLAEKFIKAKVGNGGTVSFWFDCWTSLGPLIKYLGDVGSRPLRIPFSA 837

Query: 933  RAGSFLRGNSW----STGLLNHQLAIDLRHLLHTIPIHREDSIVWDDSPVN----KVSDI 778
            +    + G+ W    S  L    +   L  L    P+   DS  W    V+      +  
Sbjct: 838  KVADAIDGSGWRLPLSRSLTADSILSHLASLPPPSPLMVSDSYSWCVDDVDCQGFSAAKT 897

Query: 777  WQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCLLCNS 598
            W+ +RPRR    W  +VW    + K +F  W A  +RL ++ R++ +G+V +  C LC+ 
Sbjct: 898  WEVLRPRRPVKRWAKSVWFKGAVPKHAFNFWTAQLNRLPTRQRLVSWGLVSSAECCLCSF 957

Query: 597  ADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPA--TCIQKEIAY 442
              E+  HL + C +S+ +     L  CP Q L   W + L      T A  + ++K +A 
Sbjct: 958  DTETRDHLLLLCDFSSQVWRMVFLRLCPRQRLLCTWAELLSWTRQSTAAAPSLLRKVVAQ 1017

Query: 441  LYYAVAVYSIWK*RNARLH 385
            L     VY++W+ RN  LH
Sbjct: 1018 L----VVYNLWRQRNLVLH 1032


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  192 bits (488), Expect = 2e-46
 Identities = 118/412 (28%), Positives = 199/412 (48%), Gaps = 19/412 (4%)
 Frame = -2

Query: 1470 LSFAGRVQLINSILMGIEGYWSMYIFLPHSILKTIQSIFSKFLWGGCLLQKPHYKVAWAD 1291
            LSFAGR+QLI+S++ G   +W     LP   +K I+S+ S+FLW G + Q    KV+WA 
Sbjct: 798  LSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAA 857

Query: 1290 CCVPVFEGGLGLRDIYSWNCAAILFQLWRILTSKQSLWVSWFYSQYLRTKSIWSIKVQPS 1111
             C+P  EGGLGLR +  WN    +  +WR+  +K SLW  W +  +L   S W+++   S
Sbjct: 858  LCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQS 917

Query: 1110 ASWCVKQILNSQSSALQYIKYNVGRNSNFWFWYDPW----PGDSLVNHFGTQIISIMVSS 943
             SW  K++L+ +  A Q++   VG      +WYD W    P   ++   G   + + + +
Sbjct: 918  DSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLA 977

Query: 942  HDVRAGSFLRGNSWSTGLLNHQLAIDLRHLLHTIPI--HREDSIVWDDSPVN-------K 790
               +  S    + W   +     A  +   L T+P+    ++ +   +  VN        
Sbjct: 978  ---KVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFS 1034

Query: 789  VSDIWQTIRPRRQSCPWFFAVWSAWCIKKCSFILWLALKHRLLSKDRMIRFGMVINPTCL 610
             +  W+ IRP+     W  ++W    + K +F +W++  +RLL++ R+  +G + +  C+
Sbjct: 1035 AAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACV 1094

Query: 609  LCNSADESISHLFVECTYSAAI-----LNACPFQ-LSNQWDDFLQGRFFQTPATCIQKEI 448
            LC+ A ES  HL + C +SA +        CP Q L + W + L      +P       +
Sbjct: 1095 LCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELLSWVRQSSPEA--PPLL 1152

Query: 447  AYLYYAVAVYSIWK*RNARLHQQQTPMLPSAITMNVKAMVREKLASCRNFKK 292
              +   V VY++W+ RN  LH     + P+ I   V   +R  ++S R  K+
Sbjct: 1153 RKIVSQVVVYNLWRQRNNLLH-NSLRLAPAVIFKLVDREIRNIISSRRLRKR 1203

