BLASTX nr result
ID: Coptis23_contig00002443
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00002443
         (1332 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002307688.1| predicted protein [Populus trichocarpa] gi|2...   552   e-155
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   527   e-147
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          525   e-146
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   511   e-142
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   509   e-142

>ref|XP_002307688.1| predicted protein [Populus trichocarpa]
 gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 552 bits (1423), Expect = e-155
 Identities = 263/404 (65%), Positives = 308/404 (76%), Gaps = 2/404 (0%)
 Frame = +3

Query: 123  SVSSLTTNDVFNSWCTQHGKTYTSQEEKLYRLKVFEDNLAYVTQHNSFINSSYTLDLNAF 302
            S SS + +F +WC +HGK+YTSQEE+ +RLKVFEDN +VT+HNS NSSY+L LNAF
Sbjct: 19   STSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAF 78

Query: 303  ADLTHHEFKSLRLGALKKPVLNTHRRSRGGVCVVRDVPASLDWRTKGAVTPVKDQASCGA 482
            ADLTHHEFK+ RLG P LN R+ VV D+PAS+DWR KG VT VKDQ SCGA
Sbjct: 79   ADLTHHEFKTSRLGLSAAP-LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGA 137

Query: 483  CWAFSSTGSIEGINQIVTGSLVSLSEQELVDCDQSYNSGCGGGLMDYAFQWVIKNRGIDT 662
            CW+FS+TG+IEGIN+IVTGSLVSLSEQEL++CD+SYN GCGGGLMDYAFQ+VI N GIDT
Sbjct: 138  CWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDT 197

Query: 663  ENDYPYQAGDSACNKNKMTRRVVTIDSYTDVPESDEEKLLQAVAIQPVSVGLCGSERSFQ 842
            E DYPY+A D CNK++M RRVVTID Y DVPE++E++LLQAVA QPVSVG+CGSER+FQ
Sbjct: 198  EEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQ 257

Query: 843  LYSKGIFSSPCSKSLDHAVLIVGYGSENGVDYWILKNSWGTNWGMNGYMHMQRNSGDKQG 1022
            +YSKGIF+ PCS SLDHAVLIVGYGSENGVDYWI+KNSWGT WGM GYMHMQRNSG+ QG
Sbjct: 258  MYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQG 317

Query: 1023 VCGINMLA--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRLLGICFSWKCCGLD 1196
            VCGINMLA R+ GIC SWKCCGLD
Sbjct: 318  VCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLD 377

Query: 1197 SAVCCKDHRSCCPHDYPICDTNEQKCYKRAGNSTMVKGLEQRES 1328
            SAVCCKD CCPHDYP+CDT++ C+KRAGN+T ++ +E + S
Sbjct: 378  SAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKTS 421

>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
 gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score = 527 bits (1357), Expect = e-147
 Identities = 259/408 (63%), Positives = 300/408 (73%), Gaps = 9/408 (2%)
 Frame = +3

Query: 132  SLTTN--DVFNSWCTQHGKTYTSQEEKLYRLKVFEDNLAYVTQHNSFINSSYTLDLNAFA 305
            S T+N ++F WCT+HGK+Y+S EEKLYRL VF DN +VT HN+ NSSYTL LN++A
Sbjct: 20   SATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYA 79

Query: 306  DLTHHEFKSLRLG---ALK--KPVLNTHRRSRGGVCVVRDVPASLDWRTKGAVTPVKDQA 470
            DLTHHEFK RLG AL+ +PVL + RDVP SLDWR KGAVT VKDQ
Sbjct: 80   DLTHHEFKVSRLGFSPALRNFRPVLPQEPS------LPRDVPDSLDWRKKGAVTAVKDQG 133

Query: 471  SCGACWAFSSTGSIEGINQIVTGSLVSLSEQELVDCDQSYNSGCGGGLMDYAFQWVIKNR 650
            SCGACW+FS+TG++EGINQI+TGSL+SLSEQEL+DCD+SYNSGCGGGLMDYA+Q+VI N
Sbjct: 134  SCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNH 193

Query: 651  GIDTENDYPYQAGDSACNKNKMTRRVVTIDSYTDVPESDEEKLLQAVAIQPVSVGLCGSE 830
            GIDTENDYPYQA D +C K+K+ R VVTID Y D+P +DE KLLQAVA QPVSVG+CGSE
Sbjct: 194  GIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSE 253

Query: 831  RSFQLYSKGIFSSPCSKSLDHAVLIVGYGSENGVDYWILKNSWGTNWGMNGYMHMQRNSG 1010
            R+FQLYSKGIFS PCS SLDHAVLIVGYGSENGVDYWI+KNSWG +WGM+GYMHMQRNSG
Sbjct: 254  RAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSG 313

Query: 1011 DKQGVCGINMLA--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRLLGICFSWKC 1184
            + +GVCGIN LA ++ LG+C SWKC
Sbjct: 314  NSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKC 373

Query: 1185 CGLDSAVCCKDHRSCCPHDYPICDTNEQKCYKRAGNSTMVKGLEQRES 1328
            CGL SAVCCKD R CCP DYPICDT+ C K+ N T + LE R S
Sbjct: 374  CGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSS 421

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score = 525 bits (1352), Expect = e-146
 Identities = 255/406 (62%), Positives = 296/406 (72%), Gaps = 4/406 (0%)
 Frame = +3

Query: 123  SVSSLTTNDVFNSWCTQHGKTYTSQEEKLYRLKVFEDNLAYVTQHNSFINSSYTLDLNAF 302
            S SS +F +WC QHGKTY SQEEKL+RLKVF+DN +VT+HNS NSSYTL LNAF
Sbjct: 20   SSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAF 79

Query: 303  ADLTHHEFKSLRLG--ALKKPVLNTHRRSRGGVCVVRDVPASLDWRTKGAVTPVKDQASC 476
            ADLTHHEFK+ RLG + LN R +R V DVPAS+DWR GAVT VKDQ +C
Sbjct: 80   ADLTHHEFKASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNC 139

Query: 477  GACWAFSSTGSIEGINQIVTGSLVSLSEQELVDCDQSYNSGCGGGLMDYAFQWVIKNRGI 656
            GACW+FS+TG+IEGIN+IVTGSLVSLSEQELVDCD+SYN+GC GG+MDYAFQ+VI N GI
Sbjct: 140  GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGI 199

Query: 657  DTENDYPYQAGDSACNKNKMTRRVVTIDSYTDVPESDEEKLLQAVAIQPVSVGLCGSERS 836
            DTE DYPYQ D +CNK K+ R VVTID Y DVP+++E++LL+AVA QPVSVG+CGSER+
Sbjct: 200  DTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERA 259

Query: 837  FQLYSKGIFSSPCSKSLDHAVLIVGYGSENGVDYWILKNSWGTNWGMNGYMHMQRNSGDK 1016
            FQLYSKGIF+ PCS SLDHAVLIVGYGSENGVDYWI+KNSWG+ WGM+GYMHMQRNSG
Sbjct: 260  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSS 319

Query: 1017 QGVCGINMLA--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRLLGICFSWKCCG 1190
            +G+CGINMLA + GIC SWKCC
Sbjct: 320  RGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCE 379

Query: 1191 LDSAVCCKDHRSCCPHDYPICDTNEQKCYKRAGNSTMVKGLEQRES 1328
            LDSAVCCKD R CCP DYP+CDT C K GN+T ++ + S
Sbjct: 380  LDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSS 425

>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
 gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
          Length = 422

 Score = 511 bits (1315), Expect = e-142
 Identities = 247/395 (62%), Positives = 295/395 (74%), Gaps = 7/395 (1%)
 Frame = +3

Query: 117  HVSVSSLTTND----VFNSWCTQHGKTYTSQEEKLYRLKVFEDNLAYVTQHNSFINSSYT 284
            ++S+SS +++ +F SW +HGKTYTS+E+KLYR K+FE+N +V +HNS NSSYT
Sbjct: 16   NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYT 75

Query: 285  LDLNAFADLTHHEFKSLRLGALKKPVLNT-HRRSRGGVCVVRDVPASLDWRTKGAVTPVK 461
            L LNAFADLTHHEFK+ RLG RR+ V DVP S+DWR KGAV+ VK
Sbjct: 76   LSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVK 135

Query: 462  DQASCGACWAFSSTGSIEGINQIVTGSLVSLSEQELVDCDQSYNSGCGGGLMDYAFQWVI 641
            DQ +CGACW+FS+TG+IEGIN+IVTGSLVSLSEQELVDCD+SYN+GC GGLMDYA+Q+VI
Sbjct: 136  DQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVI 195

Query: 642  KNRGIDTENDYPYQAGDSACNKNKMTRRVVTIDSYTDVPESDEEKLLQAVAIQPVSVGLC 821
            +N GIDTE DYPYQA + CNK K+ R VVTID YTDVP+++E++LL+AVA QPVSVG+C
Sbjct: 196  ENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGIC 255

Query: 822  GSERSFQLYSKGIFSSPCSKSLDHAVLIVGYGSENGVDYWILKNSWGTNWGMNGYMHMQR 1001
            GSER+FQLYSKGIF+ PCS SLDHAVLIVGYGSENGVDYWI+KNSWGT+WG+NGYM+M R
Sbjct: 256  GSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLR 315

Query: 1002 NSGDKQGVCGINMLA--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRLLGICFS 1175
            NSG+ QG+CGINMLA RR+ G+CFS
Sbjct: 316  NSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFS 375

Query: 1176 WKCCGLDSAVCCKDHRSCCPHDYPICDTNEQKCYK 1280
            WKCC LDSAVCCKD CCPHDYP+CDT C K
Sbjct: 376  WKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410

>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score = 509 bits (1312), Expect = e-142
 Identities = 249/398 (62%), Positives = 290/398 (72%), Gaps = 7/398 (1%)
 Frame = +3

Query: 126  VSSLTTNDVFNSWCTQHGKTYTSQEEKLYRLKVFEDNLAYVTQHNSFIN-----SSYTLD 290
            +S+ T+++F WC +H KTY+S+EEKLYRLKVFEDN A+V QHN N SSYTL
Sbjct: 24   LSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLS 83

Query: 291  LNAFADLTHHEFKSLRLGALKKPVLNTHRRSRGGVCVVRDVPASLDWRTKGAVTPVKDQA 470
            LNAFADLTHHEFK+ RLG L +L R + +P+ +DWR GAVTPVKDQA
Sbjct: 84   LNAFADLTHHEFKTTRLG-LPLTLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQA 142

Query: 471  SCGACWAFSSTGSIEGINQIVTGSLVSLSEQELVDCDQSYNSGCGGGLMDYAFQWVIKNR 650
            SCGACWAFS+TG+IEGIN+IVTGSLVSLSEQEL+DCD SYNSGCGGGLMD+A+Q+VI N+
Sbjct: 143  SCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNK 202

Query: 651  GIDTENDYPYQAGDSACNKNKMTRRVVTIDSYTDVPESDEEKLLQAVAIQPVSVGLCGSE 830
            GIDTE+DYPYQA +C+K+K+ RR VTI+ Y DVP S+EE +L+AVA QPVSVG+CGSE
Sbjct: 203  GIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSE 261

Query: 831  RSFQLYSKGIFSSPCSKSLDHAVLIVGYGSENGVDYWILKNSWGTNWGMNGYMHMQRNSG 1010
            R FQLYSKGIF+ PCS LDHAVLIVGYGSENGVDYWI+KNSWG WGMNGY+HM RNSG
Sbjct: 262  REFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSG 321

Query: 1011 DKQGVCGINMLA--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRLLGICFSWKC 1184
            + +G+CGIN LA + LGICFSWKC
Sbjct: 322  NSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWKC 381

Query: 1185 CGLDSAVCCKDHRSCCPHDYPICDTNEQKCYKRAGNST 1298
            CGL SAVCCKD R CCP DYPICDT +C KR N T
Sbjct: 382  CGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGT 419
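
The per-hit statistics in a report like this one (bit score, E-value, identities) can be pulled out programmatically. The following is only a minimal sketch, not part of BLAST or of any BLAST library: the names `STATS`, `parse_evalue` and `hit_stats` are hypothetical, and the regular expression assumes the legacy text layout shown in this report ("Score = ... bits (...), Expect = ... Identities = n/m").

```python
import re

# Hypothetical helper, written for the legacy BLAST text layout seen above.
# It tolerates both line-broken and whitespace-flattened reports.
STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s+bits\s+\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    # Legacy BLAST prints very small E-values as "e-155"; float() needs "1e-155".
    return float("1" + text if text.startswith("e") else text)


def hit_stats(report):
    """Return one dict per 'Score = ...' block found in the report text."""
    hits = []
    for m in STATS.finditer(report):
        ident, length = int(m["ident"]), int(m["length"])
        hits.append({
            "bits": float(m["bits"]),
            "raw_score": int(m["raw"]),
            "evalue": parse_evalue(m["evalue"]),
            "identity": ident / length,  # fraction of identical positions
        })
    return hits


# Statistics lines copied from the first hit in the report above.
sample = (
    " Score = 552 bits (1423), Expect = e-155\n"
    " Identities = 263/404 (65%), Positives = 308/404 (76%), Gaps = 2/404 (0%)\n"
)

if __name__ == "__main__":
    for hit in hit_stats(sample):
        print(hit)
```

Applied to the full report text, `hit_stats` would be expected to return one entry per aligned subject sequence, in report order. For anything beyond a quick look, a maintained BLAST parser is likely a better choice than a hand-written regular expression.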