BLASTX nr result

ID: Bupleurum21_contig00002617 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00002617
         (1447 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|2...   263   1e-67
gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thal...   239   1e-60
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       234   4e-59
gb|AAC95175.1| putative non-LTR retroelement reverse transcripta...   234   6e-59
dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]           233   1e-58

>ref|XP_002331075.1| predicted protein [Populus trichocarpa] gi|222873039|gb|EEF10170.1|
            predicted protein [Populus trichocarpa]
          Length = 517

 Score =  263 bits (671), Expect = 1e-67
 Identities = 156/473 (32%), Positives = 242/473 (51%), Gaps = 17/473 (3%)
 Frame = +2

Query: 2    LAPNKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRL 181
            L PN  KS I+   V    + Q I++ GF+ G LP  YLG+PL++SRL    C  LV R+
Sbjct: 21   LYPNPNKSDIFLSGVLNAEREQIIHILGFREGELPMKYLGVPLLSSRLKAIYCKGLVDRI 80

Query: 182  CQRVNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSN 361
              +V  WT R LS AGR+QL+ S+LF IQ YWA+   LP  V+  ++ I+  FLW G+  
Sbjct: 81   TSKVRHWTCRTLSYAGRVQLINSVLFSIQVYWASLFLLPGQVIKNVEQIMKSFLWSGSDM 140

Query: 362  RKPHYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSD-SLWIAWFKNCILR 538
            R    KVAW   CLP +EGGLG++ ++ WN  A+L  +W L   SD S+W  W ++ +LR
Sbjct: 141  RTTGAKVAWDQVCLPKKEGGLGIKSIKEWNKIALLKHIWNLCNDSDGSIWSTWIRSNLLR 200

Query: 539  NKALWTVKCSYSHSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPW-AGEPLITQMSD 715
             +  WT+K   + SW   KIL +R +A   +KY +G+G    LW D W    PL     +
Sbjct: 201  GRNFWTIKTPQNCSWAWGKILKLRSLAWPKMKYIIGDGMTTSLWFDNWHPHSPLADSYGE 260

Query: 716  HIISVMESTSLAKVSSIMNNTSWSSGGSN----HPLAIELRHMISTVQIRRHDRVSWDGY 883
              I        AKV+ ++ N+ W +  +     HP+ IE     S  ++ + D + W   
Sbjct: 261  RFIYDSGMAKNAKVNVLIQNSEWKTPTTQAIGWHPI-IEAIPSNSNPKMGQKDELVWLDS 319

Query: 884  CN--VKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMG 1057
             N    +K  W  +R+   +  W+ +VW    + + S  +W+A + +L T++++ RFG+ 
Sbjct: 320  PNHRFSVKVAWEQLRRHRQMVEWHDIVWFKNAVPRHSFLLWMAVQQKLTTQDKLHRFGIH 379

Query: 1058 TSLFCTLCDTQQVETVAHIFTTCPYAV----EIMSASSFP-LNGCWARYALGDIACVALS 1222
                C+LC  +  E   H+F  C Y      ++      P +   W  +    I    +S
Sbjct: 380  GPNRCSLC-LRNNEDHNHLFFECSYTKAIWWDVCDRCDIPRMTKGWDEW----IRWATVS 434

Query: 1223 QDEKRMAS----LYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRDKL 1369
               K   +    L  A  ++ +W ERN RI +  SR   +++ +I+ I+RDKL
Sbjct: 435  WHGKSFVNFSCKLSFAATVYHVWQERNARIFAGMSRTPNLVLNQIECIIRDKL 487


>gb|AAG50886.1|AC025294_24 hypothetical protein [Arabidopsis thaliana]
          Length = 629

 Score =  239 bits (611), Expect = 1e-60
 Identities = 149/475 (31%), Positives = 224/475 (47%), Gaps = 22/475 (4%)
 Frame = +2

Query: 11   NKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQR 190
            N  K+++Y   V    +   I+   F  G LP  YLGLPL+T RL  +   PL  ++  R
Sbjct: 142  NMEKTTLYTAGVSDHNRYMMISRYPFGLGQLPVRYLGLPLVTKRLTKEDLSPLFEQIRNR 201

Query: 191  VNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKP 370
            + +WT+R+LS AGRL L+ S+L+    +W +   LP   L +I SI S FLW G    + 
Sbjct: 202  IGTWTSRYLSFAGRLNLISSVLWSTMNFWMSAFRLPSACLKEINSICSAFLWSGPELHRR 261

Query: 371  HYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKAL 550
              KV+W D C P +EGGLGLR L   N  +VL  +WR+  + DSLW+ W K  +L+ ++ 
Sbjct: 262  KAKVSWDDICKPKQEGGLGLRSLTEANVVSVLKLIWRVTSNDDSLWVKWSKMNLLKQESF 321

Query: 551  WTVKCSYS-HSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAGEPLITQMSDHIIS 727
            W++  + S  SW  +K+L  R  A  + + EV  G+    W D W+G         H++ 
Sbjct: 322  WSLTPNSSLGSWMWKKMLKYRETAKPFSRVEVNNGARTSFWFDNWSG-------MGHLMD 374

Query: 728  VMESTSLAKVSSIMNNT---SWSSGGSN-------HPLAIELRHMISTVQIRRHDRVSWD 877
            V        +    N T   +WS+           + +   L     T  + R D   W 
Sbjct: 375  VTGQRGQIDLGISRNKTVAEAWSNRRRRKHRTEQLNDIEAALNQKYQTRNLLREDATLWR 434

Query: 878  GYCNV-----KLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMS 1042
            G  +V       K  WN +R+     AWY  VW S    K     WLA +NRL T  RM 
Sbjct: 435  GKGDVFKTSFSTKDTWNQVRKKSNEVAWYKGVWFSHSTPKYQFCTWLALRNRLSTGYRMQ 494

Query: 1043 RFGMGTSLFCTLCDTQQVETVAHIFTTCPYAVEIMSASSFPLNGCWARYA------LGDI 1204
             +  G+ + CT C T  +ET  H+F +C YA  I +A     N    R++      +  I
Sbjct: 495  LWNNGSDVKCTFCST-SIETRDHLFFSCSYASAIWTA--IAKNVLQHRFSTDWQTIVNYI 551

Query: 1205 ACVALSQDEKRMASLYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRDKL 1369
            +     +    ++     + +H +W ERN R H    R +A LI  + + +R++L
Sbjct: 552  SETQTDRIRSFLSRYIFQLTVHTVWKERNDRRHGEEPRTSANLISWMDKQIRNQL 606


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  234 bits (597), Expect = 4e-59
 Identities = 144/472 (30%), Positives = 228/472 (48%), Gaps = 18/472 (3%)
 Frame = +2

Query: 2    LAPNKLKSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRL 181
            L  NK KS +Y   ++   +  A    GF  G+LP  YLGLPL+  +L   +  PL+ ++
Sbjct: 728  LKVNKDKSHLYLAGLN-QLESNANAAYGFPIGTLPIRYLGLPLMNRKLRIAEYEPLLEKI 786

Query: 182  CQRVNSWTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSN 361
              R  SW N+ LS AGR+QL+ S++FG   +W +   LP+G + +I+S+ SRFLW GN  
Sbjct: 787  TARFRSWVNKCLSFAGRIQLISSVIFGSINFWMSTFLLPKGCIKRIESLCSRFLWSGNIE 846

Query: 362  RKPHYKVAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRN 541
            +    KV+W   CLP  EGGLGLR L  WN    +  +WRL  + DSLW  W     L  
Sbjct: 847  QAKGIKVSWAALCLPKSEGGLGLRRLLEWNKTLSMRLIWRLFVAKDSLWADWQHLHHLSR 906

Query: 542  KALWTVKCSYSHSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAG-EPLITQMSDH 718
             + W V+   S SW  +++L++RP+A +++  +VG G     W+D W    PL   + D 
Sbjct: 907  GSFWAVEGGQSDSWTWKRLLSLRPLAHQFLVCKVGNGLKADYWYDNWTSLGPLFRIIGDI 966

Query: 719  IISVMESTSLAKVSSIMNNTSWSSGGSNHPLAIELRHMISTVQI-----RRHDRVSW--D 877
              S +    LAKV+S  +   W    S    A  +   + TV +        DR  W  +
Sbjct: 967  GPSSLRVPLLAKVASAFSEDGWRLPVSRSAPAKGIHDHLCTVPVPSTAQEDVDRYEWSVN 1026

Query: 878  GYC--NVKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFG 1051
            G+          W  IR   T+ +W   +W    + K + +MW++  NRLLTR+R++ +G
Sbjct: 1027 GFLCQGFSAAKTWEAIRPKATVKSWASSIWFKGAVPKYAFNMWVSHLNRLLTRQRLASWG 1086

Query: 1052 MGTSLFCTLCDTQQVETVAHIFTTCPYAVEIMS------ASSFPLNGCWARYALGDIACV 1213
               S  C LC     E+  H+   C ++ ++             L   W+      ++ V
Sbjct: 1087 HIQSDACVLCSFAS-ESRDHLLLICEFSAQVWRLVFRRICPRQRLFSSWSEL----LSWV 1141

Query: 1214 ALSQDE--KRMASLYLAVAMHLIWNERNLRIHSATSRPAAVLIMEIKRIVRD 1363
              S  E    +  +   V ++ +W +RN  +H++     AV+   + R +R+
Sbjct: 1142 RQSSPEAPPLLRKIVSQVVVYNLWRQRNNLLHNSLRLAPAVIFKLVDREIRN 1193


>gb|AAC95175.1| putative non-LTR retroelement reverse transcriptase [Arabidopsis
            thaliana]
          Length = 1352

 Score =  234 bits (596), Expect = 6e-59
 Identities = 149/459 (32%), Positives = 224/459 (48%), Gaps = 21/459 (4%)
 Frame = +2

Query: 20   KSSIYFCNVDLDTQLQAINLSGFQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQRVNS 199
            KS+I+   +  + +   +    F+ G+LP  YLGLPL+T R+     +PLV ++  R+ S
Sbjct: 887  KSTIFMAGISPNAKTSILQQFPFELGTLPVKYLGLPLLTKRMTQSDYLPLVEKIRARITS 946

Query: 200  WTNRFLSLAGRLQLLKSILFGIQGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKPHYK 379
            WTNRFLS AGRLQL+KS+L  I  +W +   LP+  L +I+ + S FLW G        K
Sbjct: 947  WTNRFLSFAGRLQLIKSVLSSITNFWLSVFRLPKACLQEIEKMFSAFLWSGPDLNTKKAK 1006

Query: 380  VAWVDCCLPAEEGGLGLRDLESWNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKALWTV 559
            +AW + C   EEGGLGL+ L+  N  ++L  +WR++ + DSLW+ W    ++R +  W+V
Sbjct: 1007 IAWSEVCKLKEEGGLGLKPLKEANEVSLLKLIWRILSARDSLWVKWVNKHLIRKETFWSV 1066

Query: 560  KCSYS-HSWCVRKILNIRPMALRYIKYEVGEGSNFLLWHDPWAGEPLITQ-MSDHIISVM 733
            K +    SW  RKIL  R  A  + + EV  G+    WHD W     + Q M       +
Sbjct: 1067 KENTGLGSWLWRKILKQRDKARLFHRMEVRSGTFTSFWHDHWCPLGRLHQHMGSRGTIDL 1126

Query: 734  ESTSLAKVSSIMNNTSWSSGGSNHPLAIELRHMISTVQIRRHDRVSWDG----------- 880
               + A V+ +MN     +       A  L  + S +++ R DR S DG           
Sbjct: 1127 GIPNNATVAEVMN-----THRRKRHRADFLNQIKSQIELARQDR-STDGDRSLWKQKEDT 1180

Query: 881  -YCNVKLKHIWNTIRQVGTLPAWYPVVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMG 1057
               +      W  IR +     WY  VW S    K S   WLAF NRL T +++ ++  G
Sbjct: 1181 FKSSFSSSKTWQQIRSISLRCDWYRGVWFSASTPKYSFVTWLAFHNRLTTSDKICKWNSG 1240

Query: 1058 TSLFCTLCDTQQVETVAHIFTTCPYAVEI-MSASSFPLNGCWARYALG-DIACVALSQDE 1231
                C  C  +++ET  H+F +CPY+  +  S +   LNG   R  L  ++    L    
Sbjct: 1241 ARYDCVFCG-EELETRDHLFFSCPYSSHVWFSLTKGLLNG---RNILNWNLITPHLLDSS 1296

Query: 1232 KRMASLY-----LAVAMHLIWNERNLRIHSATSRPAAVL 1333
            +    ++        ++H +W ERN R H  T+ PAA L
Sbjct: 1297 RPYLHVFTLRYAFQASIHSLWRERNCRRHGETAIPAAKL 1335


>dbj|BAD95408.1| hypothetical protein [Arabidopsis thaliana]
          Length = 478

 Score =  233 bits (593), Expect = 1e-58
 Identities = 142/444 (31%), Positives = 218/444 (49%), Gaps = 16/444 (3%)
 Frame = +2

Query: 86   FQPGSLPFTYLGLPLITSRLNTQQCMPLVMRLCQRVNSWTNRFLSLAGRLQLLKSILFGI 265
            F  G+LP  YLGLPL+T ++ T    PLV ++  R+  WT R LS AGRLQL+ S++  +
Sbjct: 18   FASGALPVRYLGLPLLTKKMTTSDYGPLVEKIRVRIGKWTARHLSFAGRLQLISSVIHSL 77

Query: 266  QGYWAAHIFLPQGVLAKIQSILSRFLWGGNSNRKPHYKVAWVDCCLPAEEGGLGLRDLES 445
              +W +   LP   + +I SI S FLW G        KVAW D C P +EGGLG+R L+ 
Sbjct: 78   TNFWMSAFRLPSACIKEIDSICSSFLWSGPELNTKKAKVAWSDVCTPKDEGGLGIRSLKE 137

Query: 446  WNTAAVLYQLWRLIKSSDSLWIAWFKNCILRNKALWTVKCSYS-HSWCVRKILNIRPMAL 622
             N  ++L  +WR++ SS SLW+ W +  +LR  + W++  + +  SW  +KIL  R +A 
Sbjct: 138  ANKVSLLKLIWRML-SSTSLWVQWLRLYLLRKGSFWSISGNTTLGSWMWKKILKHRALAS 196

Query: 623  RYIKYEVGEGSNFLLWHDPWAGEPLITQMSDHIISV-MESTSLAKVSSIMNNTSWSSGGS 799
             ++K+++  GSN   W D W+    +  ++ H   + M  T  A V+  + N        
Sbjct: 197  GFVKHDIHNGSNTSFWFDNWSKIGRLIDVTGHRGCIDMGITLHASVAEAVVN--HRPRRH 254

Query: 800  NHPLAIELRHMISTVQ----IRRHDRVSWDGYCNV-----KLKHIWNTIRQVGTLPAWYP 952
             H   + +  +I+ V+        D V W G  ++       K  W   R+      WY 
Sbjct: 255  RHDTLLRIEDVIAEVRHQGLTSGEDTVRWKGNGDIFKPCFNTKETWAATREPKLKVNWYK 314

Query: 953  VVWHSWMIRKCSLHMWLAFKNRLLTRERMSRFGMGTSLFCTLCDTQQVETVAHIFTTCPY 1132
             VW S    K S+  W+A KNRL T +RM  +  G    C LC    VET  H+F TCPY
Sbjct: 315  GVWFSHATPKYSVLAWIAIKNRLTTGDRMLSWNAGADSSCVLCH-HLVETRDHLFFTCPY 373

Query: 1133 AVEIMSASSFPL-----NGCWARYALGDIACVALSQDEKRMASLYLAVAMHLIWNERNLR 1297
            + E+ S  +  L        W    L  +   +L  +   +      + +H +W ERN R
Sbjct: 374  SAEVWSTLTRKLLSQHFTNRW-EAILKLLTNKSLGHEVPFLTRYTFQLTLHSLWKERNGR 432

Query: 1298 IHSATSRPAAVLIMEIKRIVRDKL 1369
             H    + AA ++  + + VR+++
Sbjct: 433  RHGEVPQAAAQMVRFLDKQVRNRI 456


Top