BLASTX nr result

ID: Bupleurum21_contig00005681 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00005681
         (1377 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAD29960.1| cysteine protease [Daucus carota]                     652   0.0  
dbj|BAD29957.1| cysteine protease [Daucus carota]                     641   0.0  
dbj|BAD29958.1| cysteine protease [Daucus carota]                     640   0.0  
dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]         624   e-176
gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] gi|19548...   609   e-172

>dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  652 bits (1681), Expect = 0.0
 Identities = 311/447 (69%), Positives = 357/447 (79%), Gaps = 9/447 (2%)
 Frame = -2

Query: 1334 ATDMSIITY-------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNNLRYID 1176
            A DMSIITY        +DD + + YESWL KHGKSY NALG EKE RFQIFK+N  YID
Sbjct: 19   AADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSY-NALG-EKEQRFQIFKDNFLYID 76

Query: 1175 SHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKM--RSERYASVAGESLPEFV 1002
              N+A D+S+KLGLN+FAD+TN+EYRSKY G + KD RKK+  +S+RYAS+AGESLPE V
Sbjct: 77   EQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESV 136

Query: 1001 DWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNEGCE 822
            DWR+ GAVA+VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNEGC 
Sbjct: 137  DWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCN 196

Query: 821  GGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNEKALQK 642
            GGLMD AF+FIINNGGIDSD DYPYTG DG CDQ RKNAKVVTIDSYEDVP Y+EKALQK
Sbjct: 197  GGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQK 256

Query: 641  AVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVRNSWGA 462
            A ANQP+SVAIEA G DFQ YDSGIF+G CGT +DHGVVV+GYG+  GKDYWIVRNSWGA
Sbjct: 257  AAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGA 316

Query: 461  EWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPNVCDDF 282
            +WGE GYLRMER +   AG+CGI  EPSYPVK+G N                  +VCD++
Sbjct: 317  DWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVN---PPNPGPSPPSPKSPESVCDEY 373

Query: 281  NTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCSVSTNS 102
             TCP  TTCCCM+ Y   CF+WGCCPLE ASCC+DGYSCCPHDYPVC+V +GTCS+S N+
Sbjct: 374  YTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNN 433

Query: 101  PFGVKSLKRIPATPTMPQSIKGKKVSA 21
            P GVK+++RI ATP      KGKKV+A
Sbjct: 434  PLGVKAIQRILATPNWQHGSKGKKVTA 460


>dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  641 bits (1654), Expect = 0.0
 Identities = 302/412 (73%), Positives = 343/412 (83%), Gaps = 12/412 (2%)
 Frame = -2

Query: 1340 SLATDMSIITY---------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNNL 1188
            +LA+DMSII Y         R+DDEV +MY SWL KHGKSY NALG EKETRFQIFK+NL
Sbjct: 20   ALASDMSIINYDQTHTNSLIRTDDEVMTMYNSWLVKHGKSY-NALG-EKETRFQIFKDNL 77

Query: 1187 RYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMR---SERYASVAGES 1017
            RYID+HN+  D+SY+LGLN+FAD+TN+EYR+KYLG K+++ R K+    S+RYA V GE 
Sbjct: 78   RYIDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEE 137

Query: 1016 LPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSY 837
            LP+ +DWR+KGAVAAVKDQG CGSCWAFSAI +VEGINQI+TGELITLSEQELVDCDRSY
Sbjct: 138  LPDSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSY 197

Query: 836  NEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNE 657
            NEGCEGGLMDYAF FII NGGIDSD DYPYTG DGTC+QN++NAKVVTIDSYEDVP Y+E
Sbjct: 198  NEGCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDE 257

Query: 656  KALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVR 477
            KALQKA ANQP+SVAIEAGG+DFQLY SGIF+G CGTAVDHGVVV+GYGS  G DYWIVR
Sbjct: 258  KALQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVR 317

Query: 476  NSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPN 297
            NSWGA WGEAGYL+M+RNV KS+GLCGI IEPSYPVKNG N                  N
Sbjct: 318  NSWGAAWGEAGYLKMQRNVGKSSGLCGITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDN 377

Query: 296  VCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVC 141
            VCD + +CP+HTTCCC++T+   CF WGCCPLEAASCC+DGYSCCPHDYPVC
Sbjct: 378  VCDAYTSCPAHTTCCCLYTFGKQCFYWGCCPLEAASCCDDGYSCCPHDYPVC 429


>dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  640 bits (1650), Expect = 0.0
 Identities = 302/430 (70%), Positives = 347/430 (80%), Gaps = 10/430 (2%)
 Frame = -2

Query: 1346 TLSLATDMSIITY--------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNN 1191
            T++ ATDMSIITY        ++DDE T+++ESWL  HGKSY NALG+E E RFQIFKNN
Sbjct: 15   TVAAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSY-NALGEE-EKRFQIFKNN 72

Query: 1190 LRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKM--RSERYASVAGES 1017
            LRYID  N   D+ +KLGLNKFAD+TN+EYRSKY G K+KDLRKK+  +S RYA+++GES
Sbjct: 73   LRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGES 132

Query: 1016 LPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSY 837
            LPE VDWR+ GAVA VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSY
Sbjct: 133  LPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSY 192

Query: 836  NEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNE 657
            NEGC GGLMDYAFEFIINNGGID+D DYPYTG DG CDQ RKNAKVVTIDSYEDVPAY+E
Sbjct: 193  NEGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDE 252

Query: 656  KALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVR 477
             AL+KA ANQP+SVAIEA G DFQ YDSGIF+G CG A+DHGVVV+GYG+  GKDYWIVR
Sbjct: 253  LALKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVR 312

Query: 476  NSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPN 297
            NSWGA+WGE GYLRMER +    G+CGIAIEPSYPVK G N                  +
Sbjct: 313  NSWGADWGENGYLRMERGISSKTGICGIAIEPSYPVKTGVN---PPNPGPSPPTPKTPES 369

Query: 296  VCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCS 117
            VCD++ TCP  TTCCCM+ Y   CF+WGCCPLE ASCC+DGYSCCPHDYPVC+V +GTCS
Sbjct: 370  VCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCS 429

Query: 116  VSTNSPFGVK 87
            +  N+P GV+
Sbjct: 430  MKYNNPLGVR 439


>dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  624 bits (1608), Expect = e-176
 Identities = 299/438 (68%), Positives = 346/438 (78%), Gaps = 11/438 (2%)
 Frame = -2

Query: 1340 SLATDMSIITY----------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNN 1191
            S A DMSII Y          R+DDEV +MYESWL KHGKSY NALG EKE RFQIFK+N
Sbjct: 20   SSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSY-NALG-EKEKRFQIFKDN 77

Query: 1190 LRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMRSERYASVAGESLP 1011
            LR+ID HN+  + SYK+GLN+FAD+TN+EYRS YLG K+K    K++S+RYA   G+SLP
Sbjct: 78   LRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDSLP 137

Query: 1010 EFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 831
            E VDWR KGAVA +KDQG CGSCWAFS + +VEGINQI TGELITLSEQELVDCD+SYNE
Sbjct: 138  ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197

Query: 830  GCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNEKA 651
            GC+GGLMDY FEFIINNGGID+DKDYPY G D  CDQ RKNAKVVTIDSYEDVP  NE+A
Sbjct: 198  GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257

Query: 650  LQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVRNS 471
            L+KAVA+QPVSV IE GG  FQ YDSGIF+G CGTA+DHGV V+GYG+  GKDYWIVRNS
Sbjct: 258  LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317

Query: 470  WGAEWGEAGYLRMERNVR-KSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPNV 294
            WG+ WGEAGY+RMERN+   S G CGIA+EPSYP+KNGQN                 P V
Sbjct: 318  WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQN---PPNPGPSPPTPVRPPTV 374

Query: 293  CDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCSV 114
            CDD+ TCP  +TCCC++ Y   CFSWGCCPL+ A+CC+D YSCCPHDYPVC+V +GTCS+
Sbjct: 375  CDDYYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSM 434

Query: 113  STNSPFGVKSLKRIPATP 60
            S N+P GVK+++RI ATP
Sbjct: 435  SKNNPLGVKAIQRILATP 452


>gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] gi|19548039|gb|AAL87383.1|
            F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  609 bits (1570), Expect = e-172
 Identities = 290/447 (64%), Positives = 339/447 (75%), Gaps = 11/447 (2%)
 Frame = -2

Query: 1346 TLSLATDMSIITY-----------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIF 1200
            T+S A DMSII+Y           RS+ EV S+YE+WL KHGK+       EK+ RF+IF
Sbjct: 17   TVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIF 76

Query: 1199 KNNLRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMRSERYASVAGE 1020
            K+NLR++D HN   + SY+LGL +FAD+TNDEYRSKYLG K +   ++  S RY +  G+
Sbjct: 77   KDNLRFVDEHNEK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGD 135

Query: 1019 SLPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRS 840
             LPE +DWR KGAVA VKDQGGCGSCWAFS I +VEGINQI TG+LITLSEQELVDCD S
Sbjct: 136  ELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS 195

Query: 839  YNEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYN 660
            YNEGC GGLMDYAFEFII NGGID+DKDYPY G+DGTCDQ RKNAKVVTIDSYEDVP Y+
Sbjct: 196  YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYS 255

Query: 659  EKALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIV 480
            E++L+KAVA+QP+S+AIEAGG  FQLYDSGIF GSCGT +DHGVV +GYG+  GKDYWIV
Sbjct: 256  EESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIV 315

Query: 479  RNSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXP 300
            RNSWG  WGE+GYLRM RN+  S+G CGIAIEPSYP+KNG+N                 P
Sbjct: 316  RNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKNGEN---PPNPGPSPPSPIKPP 372

Query: 299  NVCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTC 120
              CD + TCP   TCCC+F Y   CF+WGCCPLEAA+CC+D YSCCPH+YPVC +  GTC
Sbjct: 373  TQCDSYYTCPESNTCCCLFEYGKYCFAWGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTC 432

Query: 119  SVSTNSPFGVKSLKRIPATPTMPQSIK 39
             +S NSPF VK+LKR PATP   Q  K
Sbjct: 433  LLSKNSPFSVKALKRKPATPFWSQGRK 459

