BLASTX nr result

ID: Bupleurum21_contig00023301

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00023301
         (1491 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   269   1e-69
gb|AAC63678.1| putative non-LTR retroelement reverse transcripta...   225   2e-56
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   224   6e-56
gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]               221   5e-55
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   219   1e-54

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  269 bits (688), Expect = 1e-69
 Identities = 153/450 (34%), Positives = 241/450 (53%), Gaps = 10/450 (2%)
 Frame = -3

Query: 1417 LPIKFLGLPLLSNSPSDNDCQPLIDKVCKRIQSWTSRFLSFAGRITLINSILISIFGYWA 1238
            LP+K+LG+PLLS+      C+ L+D++  +++ WT R LS+AGR+ LINS+L SI  YWA
Sbjct: 54   LPMKYLGVPLLSSRLKAIYCKGLVDRITSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWA 113

Query: 1237 MFLFLPVKVLKQLQSIFSKFLWNGNLGEKCTYKVAWDDCTRPKCCGGLGIKNLREWNKAA 1058
                LP +V+K ++ I   FLW+G+       KVAWD    PK  GGLGIK+++EWNK A
Sbjct: 114  SLFLLPGQVIKNVEQIMKSFLWSGSDMRTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIA 173

Query: 1057 IKFQLWRIISNQENSLWIIWVRNSFLKGKAFWTMKIPRACPWSIRKILKCRSESMPLISY 878
            +   +W + ++ + S+W  W+R++ L+G+ FWT+K P+ C W+  KILK RS + P + Y
Sbjct: 174  LLKHIWNLCNDSDGSIWSTWIRSNLLRGRNFWTIKTPQNCSWAWGKILKLRSLAWPKMKY 233

Query: 877  QIHHGENLLLWHDPWINNKPLIDILGCEIFNEVGSHSLATVSSIIHDGCWSLSSSNHY-L 701
             I  G    LW D W  + PL D  G     + G    A V+ +I +  W   ++     
Sbjct: 234  IIGDGMTTSLWFDNWHPHSPLADSYGERFIYDSGMAKNAKVNVLIQNSEWKTPTTQAIGW 293

Query: 700  ATVMRHIITNVN--IDRSDKNFWDGLTSDQITMSSIYKSSIQH-SCTNWSSFVWFSEAVP 530
              ++  I +N N  + + D+  W    + + ++   ++   +H     W   VWF  AVP
Sbjct: 294  HPIIEAIPSNSNPKMGQKDELVWLDSPNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVP 353

Query: 529  RHSFCTWLACLNKLNTRVKLVSYGLMDSAVCGLCCQHDESVEHLFFNCAYSSQI-ISACP 353
            RHSF  W+A   KL T+ KL  +G+     C LC +++E   HLFF C+Y+  I    C 
Sbjct: 354  RHSFLLWMAVQQKLTTQDKLHRFGIHGPNRCSLCLRNNEDHNHLFFECSYTKAIWWDVCD 413

Query: 352  -YSIPHNLQS----IISMAVNLSRNDFKLLYIRLYTTAAVYFVWQQRNCRIWNPRQAKSA 188
               IP   +     I    V+     F     +L   A VY VWQ+RN RI+    +++ 
Sbjct: 414  RCDIPRMTKGWDEWIRWATVSWHGKSFVNFSCKLSFAATVYHVWQERNARIF-AGMSRTP 472

Query: 187  ISVIPLIKEMLRQKIYGCNRIRKILSRNKH 98
              V+  I+ ++R K+   + +R ++  N++
Sbjct: 473  NLVLNQIECIIRDKL---DLMRNVVPTNEN 499


>gb|AAC63678.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1216

 Score =  225 bits (574), Expect = 2e-56
 Identities = 141/453 (31%), Positives = 235/453 (51%), Gaps = 20/453 (4%)
 Frame = -3

Query: 1441 RCKFVWDSLPIKFLGLPLLSNSPSDNDCQPLIDKVCKRIQSWTSRFLSFAGRITLINSIL 1262
            R  F    LP+++LGLPL++   + +D  PLID++ +RI  WTSR+LSFAGR++LINS+L
Sbjct: 480  RYSFGVGKLPVRYLGLPLVTKRLTTSDYSPLIDQIRRRIGMWTSRYLSFAGRLSLINSVL 539

Query: 1261 ISIFGYWAMFLFLPVKVLKQLQSIFSKFLWNGNLGEKCTYKVAWDDCTRPKCCGGLGIKN 1082
             SI  +W     LP + + ++  I S  LW+G        KV+WD+  +PK  GGLG+++
Sbjct: 540  WSITNFWMNAFRLPRECINEINRISSALLWSGPELNPKKAKVSWDEICKPKKEGGLGLQS 599

Query: 1081 LREWNKAAIKFQLWRIISNQENSLWIIWVRNSFLKGKAFWTMKIPRAC-PWSIRKILKCR 905
            LRE NK +    +WR++S Q+ SLW+ W R + LK ++FW++        W  R++LK R
Sbjct: 600  LREANKVSSLKLIWRLLSCQD-SLWVKWTRMNLLKKESFWSIGTHSTLGSWIWRRLLKHR 658

Query: 904  SESMPLISYQIHHGENLLLWHDPWINNKPLIDILGCEIFNEVGSHSLATVSSIIHDGCWS 725
              +      ++++G N   W D W    PLI++ G     ++G     T++       WS
Sbjct: 659  EVAKSFCKIEVNNGVNTSFWFDNWSEKGPLINLTGARGAIDMGISRHMTLAE-----AWS 713

Query: 724  LSSSNHYLATVMRHI-------ITNVNIDRSDKNFWDGLT-------SDQITMSSIYKSS 587
                  +   ++            + NI+  D   W G         S + T + I  SS
Sbjct: 714  RRRRKRHRVEILNEFEEILLQKYQHRNIELEDAILWRGKEDVFKARFSTKDTWNHIRTSS 773

Query: 586  IQHSCTNWSSFVWFSEAVPRHSFCTWLACLNKLNTRVKLVSYGLMDSAVCGLCCQHDESV 407
             Q +   W   VWF+ A P+ SFC WLA  N+L+T  +++++       C  C    E+ 
Sbjct: 774  NQRA---WHKGVWFAHATPKFSFCAWLAIRNRLSTGDRMMTWNNGTPTTCVFCSSPMETR 830

Query: 406  EHLFFNCAYSSQIISACPYSI-PHNLQSIISMAVNL---SRNDFKLLYIRLYT-TAAVYF 242
            +HLFF C YSS+I ++   ++      +  S  VN    S+ D    ++  YT   +++ 
Sbjct: 831  DHLFFQCCYSSEIWTSIAKNVYKDRFSTKWSAVVNYISDSQPDRIQSFLSRYTFQVSIHS 890

Query: 241  VWQQRNCRIWNPRQAKSAISVIPLIKEMLRQKI 143
            +W++RN R  +  +++SA ++I  I + +R ++
Sbjct: 891  IWRERNSR-RHGEKSRSASNLIRQIDKTIRNQL 922


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  224 bits (570), Expect = 6e-56
 Identities = 143/451 (31%), Positives = 229/451 (50%), Gaps = 17/451 (3%)
 Frame = -3

Query: 1465 NVIQSTLRRCKFVWDSLPIKFLGLPLLSNSPSDNDCQPLIDKVCKRIQSWTSRFLSFAGR 1286
            N   S L++  F   +LP+K+LGLPLL+   + +D  PL++K+  RI SWT+RFLSFAGR
Sbjct: 898  NAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITSWTNRFLSFAGR 957

Query: 1285 ITLINSILISIFGYWAMFLFLPVKVLKQLQSIFSKFLWNGNLGEKCTYKVAWDDCTRPKC 1106
            + LI S+L SI  +W     LP   L++++ +FS FLW+G        K+AW +  + K 
Sbjct: 958  LQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAKIAWSEVCKLKE 1017

Query: 1105 CGGLGIKNLREWNKAAIKFQLWRIISNQENSLWIIWVRNSFLKGKAFWTMKIPRAC-PWS 929
             GGLG+K L+E N+ ++   +WRI+S ++ SLW+ WV    ++ + FW++K       W 
Sbjct: 1018 EGGLGLKPLKEANEVSLLKLIWRILSARD-SLWVKWVNKHLIRKETFWSVKENTGLGSWL 1076

Query: 928  IRKILKCRSESMPLISYQIHHGENLLLWHDPWINNKPLIDILGCEIFNEVGSHSLATVSS 749
             RKILK R ++      ++  G     WHD W     L   +G     ++G  + ATV+ 
Sbjct: 1077 WRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDLGIPNNATVAE 1136

Query: 748  IIHDGCWSLSSSNHYLATVMRHIITNVNIDRSDKNFWDGLTSDQITMSSIYKSSIQHSCT 569
            ++     +      + A  +  I + + + R D++  DG  S        +KSS   S T
Sbjct: 1137 VM-----NTHRRKRHRADFLNQIKSQIELARQDRS-TDGDRSLWKQKEDTFKSSFSSSKT 1190

Query: 568  -----------NWSSFVWFSEAVPRHSFCTWLACLNKLNTRVKLVSYGLMDSAVCGLCCQ 422
                       +W   VWFS + P++SF TWLA  N+L T  K+  +       C  C +
Sbjct: 1191 WQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSGARYDCVFCGE 1250

Query: 421  HDESVEHLFFNCAYSSQIISACPYSIPH-----NLQSIISMAVNLSRNDFKLLYIRLYTT 257
              E+ +HLFF+C YSS +  +    + +     N   I    ++ SR    +  +R    
Sbjct: 1251 ELETRDHLFFSCPYSSHVWFSLTKGLLNGRNILNWNLITPHLLDSSRPYLHVFTLRYAFQ 1310

Query: 256  AAVYFVWQQRNCRIWNPRQAKSAISVIPLIK 164
            A+++ +W++RNCR    R  ++AI    L K
Sbjct: 1311 ASIHSLWRERNCR----RHGETAIPAAKLAK 1337


>gb|AAF98181.1|AC000107_4 F17F8.5 [Arabidopsis thaliana]
          Length = 872

 Score =  221 bits (562), Expect = 5e-55
 Identities = 139/471 (29%), Positives = 235/471 (49%), Gaps = 33/471 (7%)
 Frame = -3

Query: 1489 TLRLRSICNVIQSTLRRCKFVWD--SLPIKFLGLPLLSNSPSDNDCQPLIDKVCKRIQSW 1316
            TL +  +  +I+  +   KF++D   LP+++LGLPL++   +  D  PL++++ KRI +W
Sbjct: 389  TLYMAGVSPIIKQEIA-AKFLFDVGQLPVRYLGLPLVTKRLTSADYSPLLEQIKKRIATW 447

Query: 1315 TSRFLSFAGRITLINSILISIFGYWAMFLFLPVKVLKQLQSIFSKFLWNGNLGEKCTYKV 1136
            T RF SFAGR  LI S+L SI  +W     LP + ++++  + S FLW+G+       K+
Sbjct: 448  TFRFFSFAGRFNLIKSVLWSICNFWLAAFRLPRQCIREIDKLCSSFLWSGSEMSSHKAKI 507

Query: 1135 AWDDCTRPKCCGGLGIKNLREWNKAAIKFQLWRIISNQENSLWIIWVRNSFLKGKAFWTM 956
            +WD   +PK  GGLG++NL+E N  +    +WRIISN  NSLW  WV    ++ K+ W++
Sbjct: 508  SWDIVCKPKAEGGLGLRNLKEANDVSCLKLVWRIISN-SNSLWTKWVAEYLIRKKSIWSL 566

Query: 955  KIPRAC-PWSIRKILKCRSESMPLISYQIHHGENLLLWHDPWINNKPLIDILGCEIFNEV 779
            K   +   W  RKILK R  +      ++ +GE+   W+D W  +  LID +G +   ++
Sbjct: 567  KQSTSMGSWIWRKILKIRDVAKSFSRVEVGNGESASFWYDHWSAHGRLIDTVGDKGTIDL 626

Query: 778  GSHSLATVSSIIHDGCWSLSSSNHYLATVMRHIITNV------NIDRSDKNFWDG----L 629
            G    A+V+       W+  S   +  +++  I   +      + D  D   W G     
Sbjct: 627  GIPREASVAD-----AWTRRSRRRHRTSLLNEIEEMMAYQRIHHSDAEDTVLWRGKNDVF 681

Query: 628  TSDQITMSSIYKSSIQHSCTNWSSFVWFSEAVPRHSFCTWLACLNKLNTRVKLVSYGLMD 449
                 T  + +      S  +W   VWF  A P+++ CTWLA  N+L T  +++ +    
Sbjct: 682  KPHFSTRDTWHLIKATSSTVSWHKGVWFRHATPKYALCTWLAIHNRLPTGDRMLKWNSSG 741

Query: 448  SAV--CGLCCQHDESVEHLFFNCAYSSQIISACPYSIPHNLQS-----IISMAVNLSRND 290
            S    C LC  + +++EHLFF+C+Y+S + +A    I     S     +++      ++ 
Sbjct: 742  SVSGNCVLCTNNSKTLEHLFFSCSYASTVWAALAKGIWKTRYSTRWSHLLTHISTHFQDR 801

Query: 289  FKLLYIRLYTTAAVYFVWQQRNCRI-------------WNPRQAKSAISVI 176
             +    R    A +Y VW++RN R              W  +Q ++ I++I
Sbjct: 802  VEGFLTRYIFQATIYHVWRERNGRRHDAAPNTPATVIGWIDKQTRNQITII 852


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  219 bits (558), Expect = 1e-54
 Identities = 136/458 (29%), Positives = 225/458 (49%), Gaps = 34/458 (7%)
 Frame = -3

Query: 1447 LRRCKFVWDSLPIKFLGLPLLSNSPSDNDCQPLIDKVCKRIQSWTSRFLSFAGRITLINS 1268
            + R  F    LP+++LGLPL++   +  D  PL +++  RI +WTSR+LSFAGR+ LI+S
Sbjct: 162  ISRYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNRIGTWTSRYLSFAGRLNLISS 221

Query: 1267 ILISIFGYWAMFLFLPVKVLKQLQSIFSKFLWNGNLGEKCTYKVAWDDCTRPKCCGGLGI 1088
            +L S   +W     LP   LK++ SI S FLW+G    +   KV+WDD  +PK  GGLG+
Sbjct: 222  VLWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRRKAKVSWDDICKPKQEGGLGL 281

Query: 1087 KNLREWNKAAIKFQLWRIISNQENSLWIIWVRNSFLKGKAFWTMKIPRAC--PWSIRKIL 914
            ++L E N  ++   +WR+ SN ++SLW+ W + + LK ++FW++  P +    W  +K+L
Sbjct: 282  RSLTEANVVSVLKLIWRVTSN-DDSLWVKWSKMNLLKQESFWSL-TPNSSLGSWMWKKML 339

Query: 913  KCRSESMPLISYQIHHGENLLLWHDPWINNKPLIDILGCEIFNEVGSHSLATVSSIIHDG 734
            K R  + P    ++++G     W D W     L+D+ G     ++G     TV+      
Sbjct: 340  KYRETAKPFSRVEVNNGARTSFWFDNWSGMGHLMDVTGQRGQIDLGISRNKTVAE----- 394

Query: 733  CWSLSSSNHYLATVMRHIITNV-------NIDRSDKNFWDG-------LTSDQITMSSIY 596
             WS      +    +  I   +       N+ R D   W G         S + T + + 
Sbjct: 395  AWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDATLWRGKGDVFKTSFSTKDTWNQVR 454

Query: 595  KSSIQHSCTNWSSFVWFSEAVPRHSFCTWLACLNKLNTRVKLVSYGLMDSAVCGLCCQHD 416
            K S +     W   VWFS + P++ FCTWLA  N+L+T  ++  +       C  C    
Sbjct: 455  KKSNE---VAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQLWNNGSDVKCTFCSTSI 511

Query: 415  ESVEHLFFNCAYSSQIISACPYSI-----PHNLQSIISMAVNLSRNDFKLLYIRLYTTAA 251
            E+ +HLFF+C+Y+S I +A   ++       + Q+I++       +  +    R      
Sbjct: 512  ETRDHLFFSCSYASAIWTAIAKNVLQHRFSTDWQTIVNYISETQTDRIRSFLSRYIFQLT 571

Query: 250  VYFVWQQRNCR-------------IWNPRQAKSAISVI 176
            V+ VW++RN R              W  +Q ++ +S+I
Sbjct: 572  VHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQLSII 609

