BLASTX nr result

ID: Cnidium21_contig00017427 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00017427
         (1674 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          569   e-160
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   558   e-156
ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   557   e-156
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   544   e-152
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   541   e-151

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  569 bits (1467), Expect = e-160
 Identities = 273/405 (67%), Positives = 317/405 (78%), Gaps = 4/405 (0%)
 Frame = +1

Query: 145  IYSSTTSD---LFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLS 315
            ++SS++S+   LFETWC  HGKTY+SQEEKL+RLK+F++NY +VT+HN      +  + S
Sbjct: 18   LFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHN------SQGNSS 71

Query: 316  YTLSIDNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEG-SDGVTNVPTSLDWRDKGAV 492
            YTLS+ NAFADLTH EFKASRLGLSS     +N+  S+    D V +VP S+DWR  GAV
Sbjct: 72   YTLSL-NAFADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAV 130

Query: 493  TNVKDQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAY 672
            T VKDQG+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCD+SYN+GCEGG+MDYA+
Sbjct: 131  TQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAF 190

Query: 673  QFVVKNKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVS 852
            QFV+ N GIDTE+DYPYQ RD +CNK KL RHVVTIDGY+DV +N+EK+LL AVA QPVS
Sbjct: 191  QFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVS 250

Query: 853  VGICGSERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYM 1032
            VGICGSER FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GYM
Sbjct: 251  VGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYM 310

Query: 1033 HMQRNSGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFG 1212
            HMQRNSG+S+G+CGINM+ASY                 +C L T C EGETCCC   +FG
Sbjct: 311  HMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFG 370

Query: 1213 ICLSWKCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347
            ICLSWKCCEL+SAV            YP+CDT RN+CLK  GN T
Sbjct: 371  ICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNAT 415


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  558 bits (1437), Expect = e-156
 Identities = 270/393 (68%), Positives = 305/393 (77%)
 Frame = +1

Query: 151  SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330
            SS  S LFE+W   HGKTY+S+E+KLYR KIFEENY +V +HN      +  + SYTLS+
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHN------SQGNSSYTLSL 78

Query: 331  DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVKDQ 510
             NAFADLTH EFKASRLGLS+          +    D V +VP S+DWR KGAV+ VKDQ
Sbjct: 79   -NAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQ 137

Query: 511  GSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 690
            G+CGACWSFSATGAIEGIN+IVTGSL SLSEQELVDCDRSYN+GCEGGLMDYAYQFV++N
Sbjct: 138  GNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIEN 197

Query: 691  KGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 870
             GIDTE+DYPYQ+R+ TCNK KL RHVVTIDGY DV +N+EK+LL AVAAQPVSVGICGS
Sbjct: 198  NGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGS 257

Query: 871  ERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRNS 1050
            ER FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WG+NGYM+M RNS
Sbjct: 258  ERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNS 317

Query: 1051 GNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSWK 1230
            GNSQG+CGINM+AS+                 KC L T C EGETCCC R +FG+C SWK
Sbjct: 318  GNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWK 377

Query: 1231 CCELNSAVXXXXXXXXXXXXYPICDTKRNMCLK 1329
            CCEL+SAV            YP+CDTKRNMCLK
Sbjct: 378  CCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|222857137|gb|EEE94684.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  557 bits (1435), Expect = e-156
 Identities = 269/400 (67%), Positives = 309/400 (77%), Gaps = 1/400 (0%)
 Frame = +1

Query: 151  SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330
            SS  S LFETWC  HGK+Y+SQEE+ +RLK+FE+NY +VT+HN       NSS S  L  
Sbjct: 22   SSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKG----NSSYSLAL-- 75

Query: 331  DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVT-NVPTSLDWRDKGAVTNVKD 507
             NAFADLTH EFK SRLGLS+  +   NL   +    GV  ++P S+DWR+KG VTNVKD
Sbjct: 76   -NAFADLTHHEFKTSRLGLSAAPL---NLAHRNLEITGVVGDIPASIDWRNKGVVTNVKD 131

Query: 508  QGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVK 687
            QGSCGACWSFSATGAIEGIN+IVTGSL SLSEQEL++CD+SYNDGC GGLMDYA+QFV+ 
Sbjct: 132  QGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVIN 191

Query: 688  NKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICG 867
            N GIDTE+DYPY++RD TCNK+++ R VVTID Y+DV EN+EKQLL AVAAQPVSVGICG
Sbjct: 192  NHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICG 251

Query: 868  SERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRN 1047
            SER FQ+YSKGIF GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM GYMHMQRN
Sbjct: 252  SERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRN 311

Query: 1048 SGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSW 1227
            SGNSQG+CGINM+ASY                 KC+LLT C+ GETCCCAR  FGIC+SW
Sbjct: 312  SGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISW 371

Query: 1228 KCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347
            KCC L+SAV            YP+CDT +NMC K+ GN T
Sbjct: 372  KCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNAT 411


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  544 bits (1402), Expect = e-152
 Identities = 266/399 (66%), Positives = 303/399 (75%)
 Frame = +1

Query: 151  SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330
            +S  S+LFE WC  HGK+YSS EEKLYRL +F +NY +VT HN  ++       SYTLS+
Sbjct: 22   TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNS------SYTLSL 75

Query: 331  DNAFADLTHQEFKASRLGLSSTGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVKDQ 510
             N++ADLTH EFK SRLG S    +R       +      +VP SLDWR KGAVT VKDQ
Sbjct: 76   -NSYADLTHHEFKVSRLGFSPA--LRNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQ 132

Query: 511  GSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVVKN 690
            GSCGACWSFSATGA+EGINQI+TGSL SLSEQEL+DCDRSYN GC GGLMDYAYQFV+ N
Sbjct: 133  GSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISN 192

Query: 691  KGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGICGS 870
             GIDTE+DYPYQ+RD +C K+KL R+VVTIDGY D+  NDE +LL AVAAQPVSVGICGS
Sbjct: 193  HGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGS 252

Query: 871  ERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQRNS 1050
            ER FQLYSKGIF+GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWGK WGM+GYMHMQRNS
Sbjct: 253  ERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNS 312

Query: 1051 GNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLSWK 1230
            GNS+G+CGIN +ASY                 KCS+LTSC+ GETCCCA+   G+CLSWK
Sbjct: 313  GNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWK 372

Query: 1231 CCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347
            CC L+SAV            YPICDT RN+CLKQT N T
Sbjct: 373  CCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGT 411


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  541 bits (1393), Expect = e-151
 Identities = 260/401 (64%), Positives = 306/401 (76%), Gaps = 2/401 (0%)
 Frame = +1

Query: 151  SSTTSDLFETWCISHGKTYSSQEEKLYRLKIFEENYMYVTQHNKNNDIAANSSLSYTLSI 330
            S   S+LF+ WC  HGKTY S+EE+  R++IF++N+ +VTQHN        ++ +Y+LS+
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN------LITNATYSLSL 78

Query: 331  DNAFADLTHQEFKASRLGLS--STGIIRMNLGGSSEGSDGVTNVPTSLDWRDKGAVTNVK 504
             NAFADLTH EFKASRLGLS  +  +I  + G S  GS     VP S+DWR KGAVTNVK
Sbjct: 79   -NAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLGGS---VKVPDSVDWRKKGAVTNVK 134

Query: 505  DQGSCGACWSFSATGAIEGINQIVTGSLTSLSEQELVDCDRSYNDGCEGGLMDYAYQFVV 684
            DQGSCGACWSFSATGA+EGINQIVTG L SLSEQEL+DCD+SYN GC GGLMDYA++FV+
Sbjct: 135  DQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVI 194

Query: 685  KNKGIDTEDDYPYQSRDMTCNKNKLNRHVVTIDGYIDVRENDEKQLLAAVAAQPVSVGIC 864
            KN GIDTE DYPYQ RD TC K+KL + VVTID Y  V+ NDEK L+ AVAAQPVSVGIC
Sbjct: 195  KNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGIC 254

Query: 865  GSERNFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGKQWGMNGYMHMQR 1044
            GSER FQLYS+GIF+GPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWGK WGM+G+MHMQR
Sbjct: 255  GSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQR 314

Query: 1045 NSGNSQGICGINMMASYXXXXXXXXXXXXXXXXXKCSLLTSCSEGETCCCARTLFGICLS 1224
            N+ NS G+CGINM+ASY                 KC+L T CS GETCCCAR LFG+C S
Sbjct: 315  NTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFS 374

Query: 1225 WKCCELNSAVXXXXXXXXXXXXYPICDTKRNMCLKQTGNYT 1347
            WKCCE+ SAV            YP+CDT R++CLK+TGN+T
Sbjct: 375  WKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFT 415

