BLASTX nr result

ID: Bupleurum21_contig00025929 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00025929
         (1305 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   168   3e-39
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       167   5e-39
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   157   7e-36
emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|72678...   154   5e-35
gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transc...   151   3e-34

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  168 bits (425), Expect = 3e-39
 Identities = 99/342 (28%), Positives = 165/342 (48%), Gaps = 12/342 (3%)
 Frame = -3

Query: 1303 KVSWADCCLPLNEGGLGLRDLFSWNRAAVFFQLWRIIKAYD-SLWVTWFHSNYARNGSIW 1127
            KV+W   CLP  EGGLG++ +  WN+ A+   +W +    D S+W TW  SN  R  + W
Sbjct: 146  KVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFW 205

Query: 1126 HVKIKQSTPWCVRKILNSRRDAMQFITYRPGRNSCFSFWYDPW-PGESISNRFGEQIISV 950
             +K  Q+  W   KIL  R  A   + Y  G     S W+D W P   +++ +GE+ I  
Sbjct: 206  TIKTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGERFIYD 265

Query: 949  MESNPAAKVSSFIRDNTWCTGLSND---YLVIELRHLLSSISLAREDGIFWDGLPSA--N 785
                  AKV+  I+++ W T  +     + +IE     S+  + ++D + W   P+   +
Sbjct: 266  SGMAKNAKVNVLIQNSEWKTPTTQAIGWHPIIEAIPSNSNPKMGQKDELVWLDSPNHRFS 325

Query: 784  IATIWQSIRPRGSIVVWAPSVWNSWSIKKCSFFLWLAIKQRLLTKDRMLRFGMNVHPSCV 605
            +   W+ +R    +V W   VW   ++ + SF LW+A++Q+L T+D++ RFG++    C 
Sbjct: 326  VKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCS 385

Query: 604  LCNSAPESVPHLMVHCSYGAT----VLNSCPF-NLSSTWEDFCIGNFAATGTSAIKRQIV 440
            LC    E   HL   CSY       V + C    ++  W+++      +    +      
Sbjct: 386  LCLRNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEWIRWATVSWHGKSFVNFSC 445

Query: 439  NLYMAVAFYLVWKERNNRIHRSSNMTSGGLIQQVKAMVREKL 314
             L  A   Y VW+ERN RI    + T   ++ Q++ ++R+KL
Sbjct: 446  KLSFAATVYHVWQERNARIFAGMSRTPNLVLNQIECIIRDKL 487


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  167 bits (423), Expect = 5e-39
 Identities = 104/351 (29%), Positives = 168/351 (47%), Gaps = 19/351 (5%)
 Frame = -3

Query: 1303 KVSWADCCLPLNEGGLGLRDLFSWNRAAVFFQLWRIIKAYDSLWVTWFHSNYARNGSIWH 1124
            KVSWA  CLP +EGGLGLR L  WN+      +WR+  A DSLW  W H ++   GS W 
Sbjct: 852  KVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSRGSFWA 911

Query: 1123 VKIKQSTPWCVRKILNSRRDAMQFITYRPGRNSCFSFWYDPWPG-ESISNRFGEQIISVM 947
            V+  QS  W  +++L+ R  A QF+  + G      +WYD W     +    G+   S +
Sbjct: 912  VEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDIGPSSL 971

Query: 946  ESNPAAKVSSFIRDNTWCTGLSNDYLVIELRHLLSSI---SLARED---------GIFWD 803
                 AKV+S   ++ W   +S       +   L ++   S A+ED         G    
Sbjct: 972  RVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVNGFLCQ 1031

Query: 802  GLPSANIATIWQSIRPRGSIVVWAPSVWNSWSIKKCSFFLWLAIKQRLLTKDRMLRFGMN 623
            G  +A     W++IRP+ ++  WA S+W   ++ K +F +W++   RLLT+ R+  +G  
Sbjct: 1032 GFSAAK---TWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWGHI 1088

Query: 622  VHPSCVLCNSAPESVPHLMVHCSYGATV-----LNSCP-FNLSSTWEDFCIGNFAATGTS 461
               +CVLC+ A ES  HL++ C + A V        CP   L S+W +    ++    + 
Sbjct: 1089 QSDACVLCSFASESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSELL--SWVRQSSP 1146

Query: 460  AIKRQIVNLYMAVAFYLVWKERNNRIHRSSNMTSGGLIQQVKAMVREKLTT 308
                 +  +   V  Y +W++RNN +H S  +    + + V   +R  +++
Sbjct: 1147 EAPPLLRKIVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRNIISS 1197


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  157 bits (396), Expect = 7e-36
 Identities = 102/346 (29%), Positives = 150/346 (43%), Gaps = 15/346 (4%)
 Frame = -3

Query: 1303 KVSWADCCLPLNEGGLGLRDLFSWNRAAVFFQLWRIIKAYDSLWVTWFHSNYARNGSIWH 1124
            KVSW D C P  EGGLGLR L   N  +V   +WR+    DSLWV W   N  +  S W 
Sbjct: 264  KVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESFWS 323

Query: 1123 VKIKQST-PWCVRKILNSRRDAMQFITYRPGRNSCFSFWYDPWPG----ESISNRFGEQI 959
            +    S   W  +K+L  R  A  F        +  SFW+D W G      ++ + G+  
Sbjct: 324  LTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQID 383

Query: 958  ISVMESNPAAKVSSFIRDNTWCTGLSNDYLVIELRHLLSSISLAREDGIFWDG-----LP 794
            + +  +   A+  S  R     T   ND +   L     + +L RED   W G       
Sbjct: 384  LGISRNKTVAEAWSNRRRRKHRTEQLND-IEAALNQKYQTRNLLREDATLWRGKGDVFKT 442

Query: 793  SANIATIWQSIRPRGSIVVWAPSVWNSWSIKKCSFFLWLAIKQRLLTKDRMLRFGMNVHP 614
            S +    W  +R + + V W   VW S S  K  F  WLA++ RL T  RM  +      
Sbjct: 443  SFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDV 502

Query: 613  SCVLCNSAPESVPHLMVHCSYGATVLNSCPFNL-----SSTWEDFCIGNFAATGTSAIKR 449
             C  C+++ E+  HL   CSY + +  +   N+     S+ W+   +   + T T  I+ 
Sbjct: 503  KCTFCSTSIETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTI-VNYISETQTDRIRS 561

Query: 448  QIVNLYMAVAFYLVWKERNNRIHRSSNMTSGGLIQQVKAMVREKLT 311
             +      +  + VWKERN+R H     TS  LI  +   +R +L+
Sbjct: 562  FLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQLS 607


>emb|CAB39942.1| putative protein [Arabidopsis thaliana] gi|7267871|emb|CAB78214.1|
            putative protein [Arabidopsis thaliana]
          Length = 473

 Score =  154 bits (389), Expect = 5e-35
 Identities = 113/354 (31%), Positives = 154/354 (43%), Gaps = 19/354 (5%)
 Frame = -3

Query: 1303 KVSWADCCLPLNEGGLGLRDLFSWNRAAVFFQLWRIIKAYDSLWVTWFHSNYARNGSIWH 1124
            K++WA  C P  EGGLGLR L   N       +WRII   DSLWV W  S+  +  S W 
Sbjct: 108  KITWAFVCKPKEEGGLGLRSLKEANDVCCLKLIWRIISHADSLWVKWIQSSLLKKVSFWA 167

Query: 1123 VKIKQST-PWCVRKILNSRRDAMQFITYRPGRNSCFSFWYDPWP--GESISNRFGEQIIS 953
            V+   S   W  RKIL  R  A           +  SFWYD W   G  I +      I 
Sbjct: 168  VRENTSLGSWMWRKILKFRDIARTLCKVEINNGARTSFWYDDWSDLGRLIDSAGDRGAID 227

Query: 952  VMESNPAAKVSSF--IRDNTWCTGLSNDYLVIELRHLLSSISLAR-EDGIFWDGLPSA-- 788
            +  +  A  V ++   R     T   N    +E R +LS  S  + ED   W G  +   
Sbjct: 228  LGINKHATVVEAWGNRRRRRHRTNFLNR---VEERLILSWNSRNQAEDRALWKGKENRFR 284

Query: 787  ---NIATIWQSIRPRGSIVVWAPSVWNSWSIKKCSFFLWLAIKQRLLTKDRMLRFGMNVH 617
               +    W  IR   + V W   VW + +I K +F +WLA+  RL T DRM  + M V 
Sbjct: 285  SIFSTKDTWNHIRTVSNKVAWYKGVWFAQAIPKHAFCMWLAVHNRLSTGDRMTLWNMGVD 344

Query: 616  PSCVLCNSAPESVPHLMVHCSYGA--------TVLNSCPFNLSSTWEDFCIGNFAATGTS 461
             +C+LCN A ES  HL   C +          T+ N+C +   + W+   I N +     
Sbjct: 345  ATCILCNKALESRDHLFFSCPFATEIWEPLAKTIYNTCFY---TDWQTI-INNVSRNWPD 400

Query: 460  AIKRQIVNLYMAVAFYLVWKERNNRIHRSSNMTSGGLIQQVKAMVREKLTTCKQ 299
             I   +    + V  Y +W+ERN R H +S  +S  LI  +   +R  L   KQ
Sbjct: 401  RIAGFLARCILQVTIYTLWRERNERKHGASPNSSSRLISWIDKHIRNHLMAIKQ 454


>gb|AAM82604.1|AF525305_2 putative AP endonuclease/reverse transcriptase [Brassica napus]
          Length = 1214

 Score =  151 bits (382), Expect = 3e-34
 Identities = 99/361 (27%), Positives = 162/361 (44%), Gaps = 19/361 (5%)
 Frame = -3

Query: 1303 KVSWADCCLPLNEGGLGLRDLFSWNRAAVFFQLWRIIKAYDSLWVTWFHSNYARNGSIWH 1124
            KVSW + CLP  EGGLGLR+ ++WN+      +W +    DSLWV W H+N  R+ + W+
Sbjct: 851  KVSWQNSCLPKAEGGLGLRNFWTWNKTLNLRLIWMLFARRDSLWVAWNHANRLRHVNFWN 910

Query: 1123 VKIKQSTPWCVRKILNSRRDAMQFITYRPGRNSCFSFWYDPW----PGESISNRFGEQII 956
             +      W  + IL  R  A +F+    G     S+WYD W    P        G Q+ 
Sbjct: 911  AEAASHHSWIWKAILGLRPLAKRFLRGAVGNGQLLSYWYDHWSNLGPLIEAIGASGPQLT 970

Query: 955  SVMESNPAAKVSS---FIRDNTWCTGLSNDYLVIELRHLLSSISLAREDGIFW--DGLPS 791
             + ES    + SS   +I  +      S   L   L +  +      ED   W  +G  S
Sbjct: 971  GIHESAVVTEASSSTGWILPSARTRNASLANLRSTLLNSPAPSGDRGEDTYTWYIEGSSS 1030

Query: 790  ANIAT--IWQSIRPRGSIVVWAPSVWNSWSIKKCSFFLWLAIKQRLLTKDRMLRFGMNVH 617
             + ++   W+ +R R +  +WA +VW    I K +F  W+A   RL  + R   +  N  
Sbjct: 1031 TSFSSKLTWECLRQRDTTKLWAAAVWYKGCIPKYAFNFWVAHLNRLPVRARTTHWSTNRP 1090

Query: 616  PSCVLCNSAPESVPHLMVHCSYGATVLNS--CPFNLSS---TWEDFCIGNFAATGTSAIK 452
              C +C    E+  HL +HC+ G+ +       F  S     W+D  I  +  +   +  
Sbjct: 1091 SLCCVCQRETETRDHLFIHCTLGSLIWQQVLARFGRSQMFREWKD--IIEWMLSNQGSFS 1148

Query: 451  RQIVNLYMAVAFYLVWKERNNRIHRSSNMTSGGLIQQVKAMVREKL---TTCKQFKQAAS 281
              +  L +  A + +WKERN+R+H + + +   + +Q+   +R+ +    T + FK   S
Sbjct: 1149 GTLKKLAVQTAIFHIWKERNSRLHSAMSASHTAIFKQIDRSIRDSILARITRRNFKDLLS 1208

Query: 280  R 278
            +
Sbjct: 1209 Q 1209

