BLASTX nr result
ID: Bupleurum21_contig00005681
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Bupleurum21_contig00005681 (1377 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|BAD29960.1| cysteine protease [Daucus carota] 652 0.0 dbj|BAD29957.1| cysteine protease [Daucus carota] 641 0.0 dbj|BAD29958.1| cysteine protease [Daucus carota] 640 0.0 dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] 624 e-176 gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] gi|19548... 609 e-172 >dbj|BAD29960.1| cysteine protease [Daucus carota] Length = 460 Score = 652 bits (1681), Expect = 0.0 Identities = 311/447 (69%), Positives = 357/447 (79%), Gaps = 9/447 (2%) Frame = -2 Query: 1334 ATDMSIITY-------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNNLRYID 1176 A DMSIITY +DD + + YESWL KHGKSY NALG EKE RFQIFK+N YID Sbjct: 19 AADMSIITYDQTHAVGSTDDVIMAAYESWLVKHGKSY-NALG-EKEQRFQIFKDNFLYID 76 Query: 1175 SHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKM--RSERYASVAGESLPEFV 1002 N+A D+S+KLGLN+FAD+TN+EYRSKY G + KD RKK+ +S+RYAS+AGESLPE V Sbjct: 77 EQNAAKDRSFKLGLNRFADLTNEEYRSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESV 136 Query: 1001 DWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNEGCE 822 DWR+ GAVA+VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSYNEGC Sbjct: 137 DWREHGAVASVKDQGQCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCN 196 Query: 821 GGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNEKALQK 642 GGLMD AF+FIINNGGIDSD DYPYTG DG CDQ RKNAKVVTIDSYEDVP Y+EKALQK Sbjct: 197 GGLMDDAFQFIINNGGIDSDADYPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQK 256 Query: 641 AVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVRNSWGA 462 A ANQP+SVAIEA G DFQ YDSGIF+G CGT +DHGVVV+GYG+ GKDYWIVRNSWGA Sbjct: 257 AAANQPISVAIEASGRDFQFYDSGIFTGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGA 316 Query: 461 EWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPNVCDDF 282 +WGE GYLRMER + AG+CGI EPSYPVK+G N +VCD++ Sbjct: 317 DWGEKGYLRMERGISSKAGICGITSEPSYPVKSGVN---PPNPGPSPPSPKSPESVCDEY 373 Query: 281 NTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCSVSTNS 102 TCP TTCCCM+ Y CF+WGCCPLE ASCC+DGYSCCPHDYPVC+V +GTCS+S N+ Sbjct: 374 YTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCSMSNNN 433 Query: 101 PFGVKSLKRIPATPTMPQSIKGKKVSA 21 P GVK+++RI ATP KGKKV+A Sbjct: 434 PLGVKAIQRILATPNWQHGSKGKKVTA 460 >dbj|BAD29957.1| cysteine protease [Daucus carota] Length = 437 Score = 641 bits (1654), Expect = 0.0 Identities = 302/412 (73%), Positives = 343/412 (83%), Gaps = 12/412 (2%) Frame = -2 Query: 1340 SLATDMSIITY---------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNNL 1188 +LA+DMSII Y R+DDEV +MY SWL KHGKSY NALG EKETRFQIFK+NL Sbjct: 20 ALASDMSIINYDQTHTNSLIRTDDEVMTMYNSWLVKHGKSY-NALG-EKETRFQIFKDNL 77 Query: 1187 RYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMR---SERYASVAGES 1017 RYID+HN+ D+SY+LGLN+FAD+TN+EYR+KYLG K+++ R K+ S+RYA V GE Sbjct: 78 RYIDNHNADPDRSYELGLNRFADLTNEEYRAKYLGTKSRESRPKLSKGPSDRYAPVEGEE 137 Query: 1016 LPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSY 837 LP+ +DWR+KGAVAAVKDQG CGSCWAFSAI +VEGINQI+TGELITLSEQELVDCDRSY Sbjct: 138 LPDSIDWREKGAVAAVKDQGSCGSCWAFSAIGAVEGINQITTGELITLSEQELVDCDRSY 197 Query: 836 NEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNE 657 NEGCEGGLMDYAF FII NGGIDSD DYPYTG DGTC+QN++NAKVVTIDSYEDVP Y+E Sbjct: 198 NEGCEGGLMDYAFNFIIKNGGIDSDLDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDE 257 Query: 656 KALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVR 477 KALQKA ANQP+SVAIEAGG+DFQLY SGIF+G CGTAVDHGVVV+GYGS G DYWIVR Sbjct: 258 KALQKAAANQPISVAIEAGGMDFQLYVSGIFTGKCGTAVDHGVVVVGYGSEEGMDYWIVR 317 Query: 476 NSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPN 297 NSWGA WGEAGYL+M+RNV KS+GLCGI IEPSYPVKNG N N Sbjct: 318 NSWGAAWGEAGYLKMQRNVGKSSGLCGITIEPSYPVKNGDNPPNPGPTPPSPPSPSLPDN 377 Query: 296 VCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVC 141 VCD + +CP+HTTCCC++T+ CF WGCCPLEAASCC+DGYSCCPHDYPVC Sbjct: 378 VCDAYTSCPAHTTCCCLYTFGKQCFYWGCCPLEAASCCDDGYSCCPHDYPVC 429 >dbj|BAD29958.1| cysteine protease [Daucus carota] Length = 496 Score = 640 bits (1650), Expect = 0.0 Identities = 302/430 (70%), Positives = 347/430 (80%), Gaps = 10/430 (2%) Frame = -2 Query: 1346 TLSLATDMSIITY--------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNN 1191 T++ ATDMSIITY ++DDE T+++ESWL HGKSY NALG+E E RFQIFKNN Sbjct: 15 TVAAATDMSIITYDETHAVGFKTDDEATTLFESWLVTHGKSY-NALGEE-EKRFQIFKNN 72 Query: 1190 LRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKM--RSERYASVAGES 1017 LRYID N D+ +KLGLNKFAD+TN+EYRSKY G K+KDLRKK+ +S RYA+++GES Sbjct: 73 LRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSKYTGIKSKDLRKKVSAKSGRYATLSGES 132 Query: 1016 LPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSY 837 LPE VDWR+ GAVA VKDQG CGSCWAFS I++VEGINQI+TG+LITLSEQELVDCDRSY Sbjct: 133 LPESVDWRESGAVATVKDQGSCGSCWAFSTISAVEGINQIATGKLITLSEQELVDCDRSY 192 Query: 836 NEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNE 657 NEGC GGLMDYAFEFIINNGGID+D DYPYTG DG CDQ RKNAKVVTIDSYEDVPAY+E Sbjct: 193 NEGCNGGLMDYAFEFIINNGGIDTDVDYPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDE 252 Query: 656 KALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVR 477 AL+KA ANQP+SVAIEA G DFQ YDSGIF+G CG A+DHGVVV+GYG+ GKDYWIVR Sbjct: 253 LALKKAAANQPISVAIEASGRDFQFYDSGIFTGKCGIALDHGVVVVGYGTENGKDYWIVR 312 Query: 476 NSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPN 297 NSWGA+WGE GYLRMER + G+CGIAIEPSYPVK G N + Sbjct: 313 NSWGADWGENGYLRMERGISSKTGICGIAIEPSYPVKTGVN---PPNPGPSPPTPKTPES 369 Query: 296 VCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCS 117 VCD++ TCP TTCCCM+ Y CF+WGCCPLE ASCC+DGYSCCPHDYPVC+V +GTCS Sbjct: 370 VCDEYYTCPMSTTCCCMYEYYGYCFAWGCCPLEGASCCDDGYSCCPHDYPVCNVRAGTCS 429 Query: 116 VSTNSPFGVK 87 + N+P GV+ Sbjct: 430 MKYNNPLGVR 439 >dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus] Length = 462 Score = 624 bits (1608), Expect = e-176 Identities = 299/438 (68%), Positives = 346/438 (78%), Gaps = 11/438 (2%) Frame = -2 Query: 1340 SLATDMSIITY----------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIFKNN 1191 S A DMSII Y R+DDEV +MYESWL KHGKSY NALG EKE RFQIFK+N Sbjct: 20 SSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSY-NALG-EKEKRFQIFKDN 77 Query: 1190 LRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMRSERYASVAGESLP 1011 LR+ID HN+ + SYK+GLN+FAD+TN+EYRS YLG K+K K++S+RYA G+SLP Sbjct: 78 LRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDSLP 137 Query: 1010 EFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRSYNE 831 E VDWR KGAVA +KDQG CGSCWAFS + +VEGINQI TGELITLSEQELVDCD+SYNE Sbjct: 138 ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197 Query: 830 GCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYNEKA 651 GC+GGLMDY FEFIINNGGID+DKDYPY G D CDQ RKNAKVVTIDSYEDVP NE+A Sbjct: 198 GCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEA 257 Query: 650 LQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIVRNS 471 L+KAVA+QPVSV IE GG FQ YDSGIF+G CGTA+DHGV V+GYG+ GKDYWIVRNS Sbjct: 258 LKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGTEKGKDYWIVRNS 317 Query: 470 WGAEWGEAGYLRMERNVR-KSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXPNV 294 WG+ WGEAGY+RMERN+ S G CGIA+EPSYP+KNGQN P V Sbjct: 318 WGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLKNGQN---PPNPGPSPPTPVRPPTV 374 Query: 293 CDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTCSV 114 CDD+ TCP +TCCC++ Y CFSWGCCPL+ A+CC+D YSCCPHDYPVC+V +GTCS+ Sbjct: 375 CDDYYTCPESSTCCCVYEYYGYCFSWGCCPLDGATCCDDHYSCCPHDYPVCNVQAGTCSM 434 Query: 113 STNSPFGVKSLKRIPATP 60 S N+P GVK+++RI ATP Sbjct: 435 SKNNPLGVKAIQRILATP 452 >gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana] Length = 462 Score = 609 bits (1570), Expect = e-172 Identities = 290/447 (64%), Positives = 339/447 (75%), Gaps = 11/447 (2%) Frame = -2 Query: 1346 TLSLATDMSIITY-----------RSDDEVTSMYESWLSKHGKSYDNALGDEKETRFQIF 1200 T+S A DMSII+Y RS+ EV S+YE+WL KHGK+ EK+ RF+IF Sbjct: 17 TVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIF 76 Query: 1199 KNNLRYIDSHNSAADKSYKLGLNKFADMTNDEYRSKYLGFKNKDLRKKMRSERYASVAGE 1020 K+NLR++D HN + SY+LGL +FAD+TNDEYRSKYLG K + ++ S RY + G+ Sbjct: 77 KDNLRFVDEHNEK-NLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGD 135 Query: 1019 SLPEFVDWRDKGAVAAVKDQGGCGSCWAFSAIASVEGINQISTGELITLSEQELVDCDRS 840 LPE +DWR KGAVA VKDQGGCGSCWAFS I +VEGINQI TG+LITLSEQELVDCD S Sbjct: 136 ELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS 195 Query: 839 YNEGCEGGLMDYAFEFIINNGGIDSDKDYPYTGMDGTCDQNRKNAKVVTIDSYEDVPAYN 660 YNEGC GGLMDYAFEFII NGGID+DKDYPY G+DGTCDQ RKNAKVVTIDSYEDVP Y+ Sbjct: 196 YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYS 255 Query: 659 EKALQKAVANQPVSVAIEAGGLDFQLYDSGIFSGSCGTAVDHGVVVIGYGSAGGKDYWIV 480 E++L+KAVA+QP+S+AIEAGG FQLYDSGIF GSCGT +DHGVV +GYG+ GKDYWIV Sbjct: 256 EESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIV 315 Query: 479 RNSWGAEWGEAGYLRMERNVRKSAGLCGIAIEPSYPVKNGQNXXXXXXXXXXXXXXXXXP 300 RNSWG WGE+GYLRM RN+ S+G CGIAIEPSYP+KNG+N P Sbjct: 316 RNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKNGEN---PPNPGPSPPSPIKPP 372 Query: 299 NVCDDFNTCPSHTTCCCMFTYLNDCFSWGCCPLEAASCCEDGYSCCPHDYPVCHVYSGTC 120 CD + TCP TCCC+F Y CF+WGCCPLEAA+CC+D YSCCPH+YPVC + GTC Sbjct: 373 TQCDSYYTCPESNTCCCLFEYGKYCFAWGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTC 432 Query: 119 SVSTNSPFGVKSLKRIPATPTMPQSIK 39 +S NSPF VK+LKR PATP Q K Sbjct: 433 LLSKNSPFSVKALKRKPATPFWSQGRK 459