BLASTX nr result

ID: Bupleurum21_contig00019016 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00019016
         (1576 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   214   7e-53
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       196   1e-47
gb|AAC33226.1| putative non-LTR retroelement reverse transcripta...   194   7e-47
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           189   1e-45
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   187   5e-45

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  214 bits (544), Expect = 7e-53
 Identities = 125/397 (31%), Positives = 192/397 (48%), Gaps = 15/397 (3%)
 Frame = +3

Query: 3    RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182
            R+QLI SVL  IQ +W     LP  V++ ++ I+  FLW G    T   KVAW+  CLP 
Sbjct: 97   RVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQVCLPK 156

Query: 183  SEGGLGIRNMFIWNRASILFNLWRLLRPSE-SPWIRWFHSYVLKSKSIWEASVPANSSWA 359
             EGGLGI+++  WN+ ++L ++W L   S+ S W  W  S +L+ ++ W    P N SWA
Sbjct: 157  KEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWA 216

Query: 360  VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPW-PRPSLVDRFGRXXXXXXXXXXXARVST 536
              KI+  RS A   ++Y +G     SLWFD W P   L D +G            A+V+ 
Sbjct: 217  WGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNV 276

Query: 537  LQRSDQWAPFHSNHSLVIELRHLLQSI------TIASEDRITWDGITN--VKITNIWDSI 692
            L ++ +W    +  +  I    ++++I       +  +D + W    N    +   W+ +
Sbjct: 277  LIQNSEW---KTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQL 333

Query: 693  RPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAES 872
            R H     W   +W   A+PR +F  W+A+Q +L T+D++ RFG +    C LC  + E 
Sbjct: 334  RRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCLRNNED 393

Query: 873  AKHLFAECSFSS----QVLAACPF-QLLGNWNEYCNGNFVQAHFTNVEKQFGILFFAVAV 1037
              HLF ECS++      V   C   ++   W+E+     V  H  +       L FA  V
Sbjct: 394  HNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVNFSCKLSFAATV 453

Query: 1038 HSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKL 1148
            + +W+ERN R     +  P  V   +  I+  IR KL
Sbjct: 454  YHVWQERNARIFAGMSRTPNLV---LNQIECIIRDKL 487


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  196 bits (498), Expect = 1e-47
 Identities = 117/387 (30%), Positives = 182/387 (47%), Gaps = 16/387 (4%)
 Frame = +3

Query: 3    RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182
            R+QLI SV+ G   FW     LP G +++I+S+ S+FLW G        KV+W   CLP 
Sbjct: 803  RIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIEQAKGIKVSWAALCLPK 862

Query: 183  SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPANSSWAV 362
            SEGGLG+R +  WN+   +  +WRL    +S W  W H + L   S W      + SW  
Sbjct: 863  SEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWAVEGGQSDSWTW 922

Query: 363  RKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPRPSLVDR-FGRXXXXXXXXXXXARVSTL 539
            +++++ R  A QFL   VG   +   W+D W     + R  G            A+V++ 
Sbjct: 923  KRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSLRVPLLAKVASA 982

Query: 540  QRSDQWAPFHSNHSLVIELRHLLQSITIASE-----DRITWDG----ITNVKITNIWDSI 692
               D W    S  +    +   L ++ + S      DR  W               W++I
Sbjct: 983  FSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQGFSAAKTWEAI 1042

Query: 693  RPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAES 872
            RP A+  +WA ++W   A+P+  F  W++  +RLLTR R++ +G      CVLC+  +ES
Sbjct: 1043 RPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHIQSDACVLCSFASES 1102

Query: 873  AKHLFAECSFSSQV-----LAACPFQ-LLGNWNEYCNGNFVQAHFTNVEKQFGILFFAVA 1034
              HL   C FS+QV        CP Q L  +W+E    ++V+            +   V 
Sbjct: 1103 RDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELL--SWVRQSSPEAPPLLRKIVSQVV 1160

Query: 1035 VHSIWKERNVRTHPSSAVAPKNVARLI 1115
            V+++W++RN   H S  +AP  + +L+
Sbjct: 1161 VYNLWRQRNNLLHNSLRLAPAVIFKLV 1187


>gb|AAC33226.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1529

 Score =  194 bits (492), Expect = 7e-47
 Identities = 122/406 (30%), Positives = 195/406 (48%), Gaps = 19/406 (4%)
 Frame = +3

Query: 3    RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182
            RL L+ SV++ I  FW     LP G +++I+ + S FLW GP  +    K+AW++ C P 
Sbjct: 1107 RLALLNSVIVSIANFWMSAYRLPAGCIREIEKLCSAFLWSGPVLNPKKAKIAWSSICQPK 1166

Query: 183  SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPAN-SSWA 359
             EGGLGI+++   N+ S L  +WRLL    S W+ W  +++++  + W A+  ++  SW 
Sbjct: 1167 KEGGLGIKSLAEANKVSCLKLIWRLLSTQPSLWVTWIWTFIIRKGTFWSANERSSLGSWM 1226

Query: 360  VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVST 536
             +K++  R  A    +  V   S  S W+D W     L+D  G              + T
Sbjct: 1227 WKKLLKYRELAKSMHKVEVRNGSSTSFWYDHWSHLGRLLDITGTRRVIDLGIPLETNLET 1286

Query: 537  LQRSDQWAPFHSNHSLVI------ELRHLLQSITIASEDRITWDGITN-------VKITN 677
            + R+ Q    H  H   I      E++ L Q    A  D   W  + N        K+T 
Sbjct: 1287 VLRTHQ----HRQHRAAIYNRINAEIQRLQQQEREAGPDISLWRSLKNDFNKRFITKVT- 1341

Query: 678  IWDSIRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCN 857
             W+++R H     W   +W  ++ P+ +F  WL +Q+RL T DR+  +     + C LCN
Sbjct: 1342 -WNNVRTHQPQQNWYKGVWFPYSTPKYSFLLWLTVQNRLSTGDRIKAWNSGQLVTCTLCN 1400

Query: 858  SDAESAKHLFAECSFSSQVLAACPFQLLG-NWNEYCNGNFVQAHFTNVEKQFGILF---F 1025
            +  E+  HLF  C ++S V  A   +LL  N++   N  F     +N+ +    LF   F
Sbjct: 1401 NAEETRDHLFFSCQYTSYVWEALTQRLLSTNYSRDWNRLFTLLCTSNLPRDHLFLFRYVF 1460

Query: 1026 AVAVHSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKLATRGD 1163
              +++ IW+ERN R H     +P N  RLI  I  T+R ++++  D
Sbjct: 1461 QASIYHIWRERNARRH-GEISSPTN--RLIKLIDKTVRNRISSIRD 1503


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score =  189 bits (481), Expect = 1e-45
 Identities = 123/411 (29%), Positives = 193/411 (46%), Gaps = 27/411 (6%)
 Frame = +3

Query: 3    RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182
            RLQLI SV+  +  FW     LP+  +++I SI S FLW GP  +T   KVAW+  C P 
Sbjct: 66   RLQLISSVIHSLTNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPK 125

Query: 183  SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPAN-SSWA 359
             EGGLGIR++   N+ S+L  +WR+L  S S W++W   Y+L+  S W  S      SW 
Sbjct: 126  DEGGLGIRSLKEANKVSLLKLIWRML-SSTSLWVQWLRLYLLRKGSFWSISGNTTLGSWM 184

Query: 360  VRKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVST 536
             +KI+  R+ A  F+++ +   S  S WFD W +   L+D  G            A V+ 
Sbjct: 185  WKKILKHRALASGFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAE 244

Query: 537  LQRSDQWAPFHSNHSLVIELRHLLQSI----TIASEDRITWDGITNV-----KITNIWDS 689
               + +  P    H  ++ +  ++  +      + ED + W G  ++          W +
Sbjct: 245  AVVNHR--PRRHRHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFNTKETWAA 302

Query: 690  IRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAE 869
             R       W   +W S A P+ +  AW+A+++RL T DRM  +    D  CVLC+   E
Sbjct: 303  TREPKLKVNWYKGVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCHHLVE 362

Query: 870  SAKHLFAECSFSSQVLAACPFQLLGNWNEYCNGNFVQAHFTN---------VEKQFG--I 1016
            +  HLF  C +S++V +    +LL              HFTN           K  G  +
Sbjct: 363  TRDHLFFTCPYSAEVWSTLTRKLLSQ------------HFTNRWEAILKLLTNKSLGHEV 410

Query: 1017 LF-----FAVAVHSIWKERNVRTHPSSAVAPKNVARLIFDIKSTIRAKLAT 1154
             F     F + +HS+WKERN R H      P+  A+++  +   +R ++++
Sbjct: 411  PFLTRYTFQLTLHSLWKERNGRRH---GEVPQAAAQMVRFLDKQVRNRISS 458


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  187 bits (476), Expect = 5e-45
 Identities = 124/414 (29%), Positives = 192/414 (46%), Gaps = 20/414 (4%)
 Frame = +3

Query: 3    RLQLIKSVLLGIQGFWCMYLFLPNGVLQKIQSILSKFLWGGPSADTVHYKVAWNTCCLPL 182
            RLQLI SV+     FW     LP   L+ I+ + ++FLWG         KV+W   CLP 
Sbjct: 802  RLQLISSVIYSTVNFWLSSFILPKCCLKTIEQMCNRFLWGNDITRRGDIKVSWQNSCLPK 861

Query: 183  SEGGLGIRNMFIWNRASILFNLWRLLRPSESPWIRWFHSYVLKSKSIWEASVPANSSWAV 362
            +EGGLG+RN + WN+   L  +W L    +S W+ W H+  L+  + W A   ++ SW  
Sbjct: 862  AEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWNAEAASHHSWIW 921

Query: 363  RKIMNSRSQALQFLQYTVGKDSQFSLWFDPWPR-PSLVDRFGRXXXXXXXXXXXARVSTL 539
            + I+  R  A +FL+  VG     S W+D W     L++  G            A V+  
Sbjct: 922  KAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLTGIHESAVVTEA 981

Query: 540  QRSDQW--APFHSNHSLVIELRHLLQSITIAS----EDRITW--DGITNVKITN--IWDS 689
              S  W      + ++ +  LR  L +    S    ED  TW  +G ++   ++   W+ 
Sbjct: 982  SSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSSTSFSSKLTWEC 1041

Query: 690  IRPHASTSAWAIALWHSWAIPRCTFTAWLALQDRLLTRDRMSRFGFNNDLCCVLCNSDAE 869
            +R   +T  WA A+W+   IP+  F  W+A  +RL  R R + +  N    C +C  + E
Sbjct: 1042 LRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRPSLCCVCQRETE 1101

Query: 870  SAKHLFAECSFSS----QVLAAC-PFQLLGNWN---EYCNGNFVQAHFTNVEKQFGILFF 1025
            +  HLF  C+  S    QVLA     Q+   W    E+   N  Q  F+   K+  +   
Sbjct: 1102 TRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKDIIEWMLSN--QGSFSGTLKKLAV--- 1156

Query: 1026 AVAVHSIWKERNVRTHPSSAVAPKNVARLI-FDIKSTIRAKLATRGDFKKAASR 1184
              A+  IWKERN R H + + +   + + I   I+ +I A++ TR +FK   S+
Sbjct: 1157 QTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSILARI-TRRNFKDLLSQ 1209


Top