BLASTX nr result
ID: Rehmannia22_contig00018645
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia22_contig00018645 (2170 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise... 598 e-168 ref|XP_002307688.2| cysteine protease family protein [Populus tr... 598 e-168 dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] 589 e-165 ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S... 587 e-165 ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha... 582 e-163 gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A... 582 e-163 ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab... 580 e-163 ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr... 578 e-162 ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S... 578 e-162 ref|XP_002510459.1| cysteine protease, putative [Ricinus communi... 572 e-160 ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr... 569 e-159 ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C... 568 e-159 ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps... 567 e-159 gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] 566 e-158 ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C... 561 e-157 gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase... 560 e-157 ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ... 560 e-157 ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a... 558 e-156 ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F... 547 e-153 ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine... 543 e-152 >gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea] Length = 424 Score = 598 bits (1543), Expect = e-168 Identities = 278/410 (67%), Positives = 325/410 (79%), Gaps = 1/410 (0%) Frame = +1 Query: 544 QVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSL 723 QV SSSISDLFDSWC+E+GKTY SE+EREHRL VF +NY+++ HN RAN SYTLSL Sbjct: 17 QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSL 76 Query: 724 NAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVT 903 NAFADLT EF +YLG SPS DLLIR N G + N+ S +PSS+DWR KGAVT Sbjct: 77 NAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVT 133 Query: 904 GVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYE 1083 G+KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD SYN GC GGLMDYAYE Sbjct: 134 GIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYE 193 Query: 1084 FIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSV 1263 FI+KNKGIDTEEDY Y+GRD +C+++KL + VVTIDSY DIP KNE+ LLEAVA+QPVSV Sbjct: 194 FILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSV 253 Query: 1264 GICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMH 1443 GI G D+ FQ YS GIFTGPCSTSLDHAVLIVGYDSK+GKDYWI+KNSWG+ WGM+GYM+ Sbjct: 254 GISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMY 313 Query: 1444 MLRNTGTAEGVCGINMLA-XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1620 + RNTG G+C INM+A T+C+LF+YCS GETCCCAR FLG Sbjct: 314 VQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLG 373 Query: 1621 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 1770 +C++++CC A SAVCC+D+ HCCP DYP+CDT +++C K GNST P+ Sbjct: 374 LCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPV 423 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 598 bits (1542), Expect = e-168 Identities = 275/411 (66%), Positives = 325/411 (79%) Frame = +1 Query: 550 PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 729 P+ SS IS LF++WCKE+GK+YTS++ER HRLKVFE NY++V +HN + NSSY+L+LNA Sbjct: 18 PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77 Query: 730 FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 909 FADLT+HEFK LGLS + +L R N + G DIP+S+DWRNKG VT V Sbjct: 78 FADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASIDWRNKGVVTNV 129 Query: 910 KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1089 KDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CDKSYNDGCGGGLMDYA++F+ Sbjct: 130 KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189 Query: 1090 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 1269 I N GIDTEEDYPYR RDGTCNKD++KR VVTID Y D+P NEK+LL+AVA QPVSVGI Sbjct: 190 INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249 Query: 1270 CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1449 CGS+ +FQ+YS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG WGM GYMHM Sbjct: 250 CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309 Query: 1450 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629 RN+G ++GVCGINMLA T+CNL TYC++GETCCCAR F GIC+ Sbjct: 310 RNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICI 369 Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKS 1782 W+CC SAVCCKD HCCPHDYPVCDT +N+C KR GN+T ++ I K+ Sbjct: 370 SWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKT 420 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 589 bits (1518), Expect = e-165 Identities = 272/409 (66%), Positives = 325/409 (79%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 SS I+ LF++WC+++GKTY S++E+ RLKVF+ NY++V +HN + NSSYTLSLNAFADL Sbjct: 23 SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS +AS LN + P+FV +D+P+S+DWR GAVT VKDQG Sbjct: 83 THHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVDWRKNGAVTQVKDQG 137 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDKSYN+GC GG+MDYA++F+I N Sbjct: 138 NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTEEDYPY+GRD +CNK+KLKRHVVTID Y D+P NEK+LL+AVA QPVSVGICGS+ Sbjct: 198 GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257 Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461 +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG YWGM+GYMHM RN+G Sbjct: 258 RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317 Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641 ++ G+CGINMLA TRC+LFT+C GETCCC H GICL W+C Sbjct: 318 SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377 Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 CE SAVCCKD HCCP DYPVCDT RNICLK GN+T ++ AK S S Sbjct: 378 CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSS 426 >ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum] Length = 439 Score = 587 bits (1512), Expect = e-165 Identities = 272/430 (63%), Positives = 326/430 (75%) Frame = +1 Query: 499 MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 678 M W Q P C SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+ Sbjct: 1 MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60 Query: 679 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 858 +HN + NSSYTL LNA++DLT+HEF+ +LGLS SA+D IRL + + + Sbjct: 61 TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDV 119 Query: 859 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1038 D PSSLDWR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S Sbjct: 120 DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179 Query: 1039 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 1218 YN+GCGGGLMDYA+EF+IKN GIDTE+DYP+R R+GTCNK+KL+RHVVTID Y DIP + Sbjct: 180 YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239 Query: 1219 EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1398 E KLL+AVATQPVSVGICGS +FQ YS GIFTGPCST+LDHAVLIVGY S++G DYWII Sbjct: 240 EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299 Query: 1399 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1578 KNSWG WG+NGY+HM RN+G EG+CGIN LA ++C++FT C Sbjct: 300 KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359 Query: 1579 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1758 GETCCC FLGICL W+CC SAVCCKD HCCP DYP+CDT RN+CLKR+ N+T Sbjct: 360 GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419 Query: 1759 VKPIAKKSFS 1788 V+ K++F+ Sbjct: 420 VQQPQKEAFT 429 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 582 bits (1501), Expect = e-163 Identities = 267/409 (65%), Positives = 321/409 (78%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 S IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+ N++Y+LSLNAFADL Sbjct: 25 SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS SA +++ A G + +P S+DWR KGAVT VKDQG Sbjct: 85 THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN Sbjct: 138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+ Sbjct: 198 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257 Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461 +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT Sbjct: 258 RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317 Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641 ++GVCGINMLA T+CNLFTYCSSGETCCCAR G+C W+C Sbjct: 318 NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377 Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 CE SAVCCKD HCCPHDYPVCDT R++CLK+ GN T +KP KK+ S Sbjct: 378 CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 582 bits (1499), Expect = e-163 Identities = 267/409 (65%), Positives = 321/409 (78%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 S IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+ N++Y+LSLNAFADL Sbjct: 25 SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS SA +++ A G + +P S+DWR KGAVT VKDQG Sbjct: 85 THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN Sbjct: 138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+ Sbjct: 198 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257 Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461 +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT Sbjct: 258 RAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317 Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641 ++GVCGINMLA T+CNLFTYCSSGETCCCAR G+C W+C Sbjct: 318 NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377 Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 CE SAVCCKD HCCPHDYPVCDT R++CLK+ GN T +KP KK+ S Sbjct: 378 CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 580 bits (1496), Expect = e-163 Identities = 268/411 (65%), Positives = 321/411 (78%), Gaps = 2/411 (0%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 S IS+LFD WC+ +GKTY SE+ER+ R+++F+ N+++V QHN+ N++Y+LSLNAFADL Sbjct: 25 SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS SAS L++ A G + + +P S+DWR KGAVT VKDQG Sbjct: 85 THHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQG 137 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN Sbjct: 138 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L EAVA QPVSVGICGS+ Sbjct: 198 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257 Query: 1282 SSFQLYS--GGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRN 1455 +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RN Sbjct: 258 RAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 317 Query: 1456 TGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKW 1635 TG +EG+CGINMLA T+CNLFTYCS+GETCCCAR+ G+C W Sbjct: 318 TGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSW 377 Query: 1636 RCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 +CCE SAVCC D HCCPHDYPVCDT R++CLK+ GN T +KP KK S Sbjct: 378 KCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSS 428 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 578 bits (1491), Expect = e-162 Identities = 266/414 (64%), Positives = 321/414 (77%) Frame = +1 Query: 547 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726 + + S I++LFD WC +GKTY SE+ER+HR+++F N+++V QHN +NS+Y+LSLN Sbjct: 25 ISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLN 84 Query: 727 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906 AFADLT+HEFKA LGLS + L+ + + G + +P S+DWR KGAVT Sbjct: 85 AFADLTHHEFKASRLGLSAPSPSLMAKEQSL-----GVSERVRVKVPDSVDWRKKGAVTN 139 Query: 907 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086 VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF Sbjct: 140 VKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEF 199 Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266 +IKN GIDTE+DYPY+ +DGTC KDKLK+ VVTIDSYA + + NEK L+EAVA+QPVSVG Sbjct: 200 VIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVG 259 Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446 ICGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM Sbjct: 260 ICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHM 319 Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626 RNTG +EGVCGINMLA T+CNLFTYCSSGETCCCAR G+C Sbjct: 320 QRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLC 379 Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 W+CCE SAVCCKD HCCP DYPVCDT +++CLK+ GN T +KP KK+ S Sbjct: 380 FSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS 433 >ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum] Length = 439 Score = 578 bits (1491), Expect = e-162 Identities = 268/430 (62%), Positives = 321/430 (74%) Frame = +1 Query: 499 MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 678 M W Q P C SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+ Sbjct: 1 MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60 Query: 679 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 858 +HN + NSSYTL LNA++DLT+HEF+ +LGLS SA+D IRL + + + Sbjct: 61 TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDV 119 Query: 859 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1038 D PSSLDWR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S Sbjct: 120 DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179 Query: 1039 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 1218 YN GCGGGLMDYA+EF+IKN GIDTE+DYP+R ++GTCNK+KL+R VVTID Y DIP + Sbjct: 180 YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239 Query: 1219 EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1398 E KLL+AVATQPVSVGICGS +FQ YS GIFTGPC T LDHAVLIVGY S++G DYWII Sbjct: 240 EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299 Query: 1399 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1578 KNSWG WG+NGY+HM RN+G EG+CG+N LA ++C+ FT C Sbjct: 300 KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359 Query: 1579 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1758 GETCCC FLGICL W+CC SAVCCKD HCCP DYP+CDT RN+CLKR+ N+T Sbjct: 360 GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419 Query: 1759 VKPIAKKSFS 1788 V+ K+ F+ Sbjct: 420 VQQPQKEPFT 429 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 572 bits (1474), Expect = e-160 Identities = 262/392 (66%), Positives = 314/392 (80%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 SS IS LF+SW KE+GKTYTS++++ +R K+FE+NYE+V +HN + NSSYTLSLNAFADL Sbjct: 25 SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS ++ + N P +FV D+P S+DWR KGAV+ VKDQG Sbjct: 85 THHEFKASRLGLSAFSTSGKLSRRNFPLH----DFV--GDVPISIDWRKKGAVSQVKDQG 138 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD+SYN+GC GGLMDYAY+F+I+N Sbjct: 139 NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENN 198 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTEEDYPY+ R+ TCNK+KLKRHVVTID Y D+P NEK+LL+AVA QPVSVGICGS+ Sbjct: 199 GIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSE 258 Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461 +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG +WG+NGYM+MLRN+G Sbjct: 259 RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318 Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641 ++G+CGINMLA T+C+LFT C GETCCC R G+C W+C Sbjct: 319 NSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKC 378 Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737 CE SAVCCKD HCCPHDYPVCDTKRN+CLK Sbjct: 379 CELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 569 bits (1466), Expect = e-159 Identities = 265/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%) Frame = +1 Query: 547 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726 +P S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN NSS+TLSLN Sbjct: 17 LPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76 Query: 727 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906 AFADLT+ EFKA +LG S ++ D R N A++ P ++ D+P+S+DWR KGAVT Sbjct: 77 AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGTLR--DVPASIDWRKKGAVTE 131 Query: 907 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086 VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F Sbjct: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191 Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266 +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P NEK+LL+AV QPVSVG Sbjct: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251 Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446 ICGS+ +FQLYS GIFTGPCSTSLDHAVLIVGYDS++G DYWIIKNSWGR WGMNGYMHM Sbjct: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311 Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626 RNTG + G+CGINMLA TRC+L TYC++GETCCC LGIC Sbjct: 312 QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371 Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1788 L W+CC SAVCC DH +CCP +YP+CD+ R+ CL R GN T + I + S Sbjct: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRGSS 426 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 568 bits (1464), Expect = e-159 Identities = 264/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%) Frame = +1 Query: 547 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726 +P S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN NSS+TLSLN Sbjct: 17 LPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76 Query: 727 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906 AFADLT+ EFKA +LG S ++ D R N A++ P ++ D+P+S+DWR KGAVT Sbjct: 77 AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGNLR--DVPASIDWRKKGAVTE 131 Query: 907 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086 VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F Sbjct: 132 VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191 Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266 +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P NEK+LL+AV QPVSVG Sbjct: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251 Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446 ICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GYDS++G DYWIIKNSWGR WGMNGYMHM Sbjct: 252 ICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311 Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626 RNTG + G+CGINMLA TRC+L TYC+ GETCCC LGIC Sbjct: 312 QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGIC 371 Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1788 L W+CC SAVCC DH +CCP +YP+CD+ R+ CL R+ GN T + I + S Sbjct: 372 LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSS 426 >ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] gi|482576142|gb|EOA40329.1| hypothetical protein CARUB_v10009056mg [Capsella rubella] Length = 467 Score = 567 bits (1461), Expect = e-159 Identities = 266/437 (60%), Positives = 318/437 (72%), Gaps = 28/437 (6%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 S IS+LFD WC+ +GKTY SE E++HR ++F N+++V QHN+ N++Y+LSLNAFADL Sbjct: 27 SDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATYSLSLNAFADL 86 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 + EFK LGLS SA +++ A G + +P SLDWR KGAVT VKDQG Sbjct: 87 NHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKKGAVTNVKDQG 139 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYNDGC GGLMDYA+EF+IKNK Sbjct: 140 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFEFVIKNK 199 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTE+DYPY+ RDGTC KDKLK+ VV+IDSYA + +EK LLEAVA QPVSVGICGS+ Sbjct: 200 GIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQPVSVGICGSE 259 Query: 1282 SSFQLYSG----------------------------GIFTGPCSTSLDHAVLIVGYDSKD 1377 +FQLYS GIF+GPCSTSLDHAVLIVGY S++ Sbjct: 260 RAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHAVLIVGYGSQN 319 Query: 1378 GKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTR 1557 G DYWI+KNSWG+ WGM+G+MHM RNTG ++G+CGINMLA T+ Sbjct: 320 GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNPPPPSPPGPTK 379 Query: 1558 CNLFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737 CNLFTYCS+ ETCCCAR+ G+CL W+CCE SAVCCKD HCCPHDYPVCDT R++CLK Sbjct: 380 CNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 439 Query: 1738 RIGNSTFVKPIAKKSFS 1788 + GN T +KP KK+ S Sbjct: 440 KTGNFTAIKPFWKKNSS 456 >gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 566 bits (1459), Expect = e-158 Identities = 262/405 (64%), Positives = 316/405 (78%) Frame = +1 Query: 565 SSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLT 744 S IS LF++WC ++GK Y+SE+E+ +RLKVFE+NY +V QHN NSSY+L+LNAFADLT Sbjct: 24 SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83 Query: 745 NHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGS 924 +HEFKA LGLS +A + + P V+ DIP+S+DWR KGAVT VKDQGS Sbjct: 84 HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135 Query: 925 CGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKG 1104 CGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD+SYN GC GGLMDYAY+F+I N G Sbjct: 136 CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195 Query: 1105 IDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDS 1284 ID EEDYPY GR+ TCNK+K KR VVTID YA +PA NE LL+AVA QPVSVGICGS+ Sbjct: 196 IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255 Query: 1285 SFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGT 1464 +FQLYS GIFTGPCS+SLDHAVLIVGY S++G DYWI+KNSWG WGMNGY+HMLRN+G Sbjct: 256 AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315 Query: 1465 AEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCC 1644 ++G+CGINMLA T+C+LFTYCS+GETCCC GIC W+CC Sbjct: 316 SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375 Query: 1645 EAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKK 1779 E SAVCCKD+ HCCP+DYPVCDTK++ CLKR+GN+T ++ K+ Sbjct: 376 ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 561 bits (1445), Expect = e-157 Identities = 259/413 (62%), Positives = 314/413 (76%) Frame = +1 Query: 550 PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 729 P +S++S+LF+ WC E+GK+Y+S +E+ +RL VF NYE+V HN NSSYTLSLN+ Sbjct: 18 PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNS 77 Query: 730 FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 909 +ADLT+HEFK LG SP+ + L P+ D+P SLDWR KGAVT V Sbjct: 78 YADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLDWRKKGAVTAV 129 Query: 910 KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1089 KDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN GCGGGLMDYAY+F+ Sbjct: 130 KDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFV 189 Query: 1090 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 1269 I N GIDTE DYPY+ RDG+C KDKL+R+VVTID YADIP+ +E KLL+AVA QPVSVGI Sbjct: 190 ISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGI 249 Query: 1270 CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1449 CGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+GYMHM Sbjct: 250 CGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQ 309 Query: 1450 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629 RN+G +EGVCGIN LA T+C++ T C++GETCCCA+ FLG+CL Sbjct: 310 RNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCL 369 Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 W+CC SAVCCKD HCCP DYP+CDT RN+CLK+ N T + + +S S Sbjct: 370 SWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS 422 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 560 bits (1444), Expect = e-157 Identities = 259/399 (64%), Positives = 310/399 (77%), Gaps = 7/399 (1%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 S IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+ N++Y+LSLNAFADL Sbjct: 23 SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 82 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LGLS SA +++ A G + +P S+DWR KGAVT VKDQG Sbjct: 83 THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 135 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101 SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN Sbjct: 136 SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 195 Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281 GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+ Sbjct: 196 GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255 Query: 1282 SSFQLYSG-------GIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYM 1440 +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+M Sbjct: 256 RAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315 Query: 1441 HMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1620 HM RNT ++GVCGINMLA T+CNLFTYCSSGETCCCAR G Sbjct: 316 HMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFG 375 Query: 1621 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737 +C W+CCE SAVCCKD HCCPHDYPVCDT R++CLK Sbjct: 376 LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera] gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera] Length = 436 Score = 560 bits (1443), Expect = e-157 Identities = 255/425 (60%), Positives = 324/425 (76%) Frame = +1 Query: 559 KSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFAD 738 ++SS +DLF++WC++YGKTY+SE+E+ RLKVFE+N+ +V QHN AN+SYTL+LNAFAD Sbjct: 21 EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80 Query: 739 LTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQ 918 LT+HEFKA LG SP + + ++ P VQE +P ++DWR GAVTGVKDQ Sbjct: 81 LTHHEFKASRLGFSPGRAQSI-------RSVGTP--VQELHVPPAVDWRKSGAVTGVKDQ 131 Query: 919 GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKN 1098 G+CG CWSFS TGA+EGIN+I TGSLVSLSEQEL+DCD+SYN GC GGLMDYAY+F+IKN Sbjct: 132 GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191 Query: 1099 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 1278 +GID+E DYPY G D CNK+KLK+H+VTID Y DIP +EK+LL+ VA QPVSVGICGS Sbjct: 192 QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251 Query: 1279 DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1458 + +FQLYS G++TGPCS++LDHAVLIVGY ++DG D+WI+KNSWG +WGM GY+HMLRN Sbjct: 252 EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311 Query: 1459 GTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWR 1638 GTAEG+CGINMLA T+C+ F+ CS GETCCC+ F+G+CL W Sbjct: 312 GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371 Query: 1639 CCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFSI*LSKQCGYT 1818 CC A SAVCC ++ +CCP +P+CDTKRN CLK GN T V+ + ++ S+ K G++ Sbjct: 372 CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSSV---KFGGWS 428 Query: 1819 SYNSA 1833 S N A Sbjct: 429 SINDA 433 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 558 bits (1438), Expect = e-156 Identities = 264/396 (66%), Positives = 298/396 (75%) Frame = +1 Query: 574 SDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLTNHE 753 S LF WCK++GKTY SEQE+ +R VFE NY +V QHN NSSYTLSLNAFADLT+HE Sbjct: 27 SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86 Query: 754 FKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGSCGA 933 FKA LGL PS S L + N +F+Q +PS +DWR GAV+ VKDQGSCGA Sbjct: 87 FKATRLGLPPS-SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSCGA 142 Query: 934 CWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKGIDT 1113 CWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GGLMDYAY+FII N GIDT Sbjct: 143 CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDT 202 Query: 1114 EEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDSSFQ 1293 EEDYPY+ R C KDKLKR VVTID Y D+P +EKKLL+AVA QPVSVGICGS +FQ Sbjct: 203 EEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQ 262 Query: 1294 LYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEG 1473 LYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY+HMLRNT ++ G Sbjct: 263 LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAG 322 Query: 1474 VCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCCEAV 1653 +CGINMLA +CNLFTYCS GETCCCA+ FLGIC W+CC Sbjct: 323 LCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVT 382 Query: 1654 SAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFV 1761 SAVCCKD HCCP DYPVCD CLKRI N T + Sbjct: 383 SAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTIL 418 >ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp. vesca] Length = 441 Score = 547 bits (1409), Expect = e-153 Identities = 256/402 (63%), Positives = 307/402 (76%), Gaps = 4/402 (0%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741 SSS S+LF++WCK+YGK+Y+S++E+ +RL +FE+N ++ QHN NSSYTLSLN+F+DL Sbjct: 25 SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84 Query: 742 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921 T+HEFKA LG SP+ L + + P+ + +PSS+DWR GAVT VKDQG Sbjct: 85 THHEFKASRLGFSPTFLRLYRKSDPKPSVV--------RHVPSSIDWRKNGAVTNVKDQG 136 Query: 922 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY-NDGCGGGLMDYAYEFIIKN 1098 SCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+ Y N GC GGLMD A++FII N Sbjct: 137 SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDN 196 Query: 1099 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 1278 GIDTEEDYPY+G DGTCNK KLKRHVVTID Y D+PA NE++LL+AVATQPVSVGI GS Sbjct: 197 NGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGS 256 Query: 1279 DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1458 FQ YS GIF GPCST+LDHAVLIVGY S++G DYWI+KNSWG+ WGMNGY+H+LR+ Sbjct: 257 GREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDH 316 Query: 1459 GTAEGVCGINMLA---XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629 ++G+CGINMLA T+C+LF+ C GETCCCAR LGICL Sbjct: 317 SNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICL 376 Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNST 1755 WRCCE SAVCCKD HCCPHDYP+CDT+RN CL+ GN T Sbjct: 377 SWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLT 418 >ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max] Length = 439 Score = 543 bits (1400), Expect = e-152 Identities = 259/417 (62%), Positives = 307/417 (73%), Gaps = 8/417 (1%) Frame = +1 Query: 562 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRAN-----SSYTLSLN 726 +S S+LF+ WCKE+ KTY+SE+E+ +RLKVFE NY +V QHN AN SSYTLSLN Sbjct: 26 ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85 Query: 727 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD---IPSSLDWRNKGA 897 AFADLT+HEFK LGL + L+R P Q D IPS +DWR GA Sbjct: 86 AFADLTHHEFKTTRLGLPLT----LLRFKR-------PQNQQSRDLLHIPSQIDWRQSGA 134 Query: 898 VTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYA 1077 VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD SYN GCGGGLMD+A Sbjct: 135 VTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFA 194 Query: 1078 YEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPV 1257 Y+F+I NKGIDTE+DYPY+ R +C+KDKLKR VTI+ Y D+P +E+++L+AVA+QPV Sbjct: 195 YQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVP-PSEEEILKAVASQPV 253 Query: 1258 SVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGY 1437 SVGICGS+ FQLYS GIFTGPCST LDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY Sbjct: 254 SVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGY 313 Query: 1438 MHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFL 1617 +HM+RN+G ++G+CGIN LA RCNLFT+CS GETCCCA+ FL Sbjct: 314 IHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFL 373 Query: 1618 GICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788 GIC W+CC SAVCCKD HCCP DYP+CDT+R CLKR N T + FS Sbjct: 374 GICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFS 430