BLASTX nr result
ID: Stemona21_contig00014615
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00014615
         (1302 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   527   e-147
ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...   519   e-144
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   517   e-144
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   516   e-143
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   513   e-143
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   509   e-142
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   509   e-141
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   509   e-141
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          508   e-141
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...   508   e-141
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   508   e-141
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  507   e-141
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   504   e-140
ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria...   504   e-140
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...   503   e-140
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   503   e-140
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   496   e-137
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   494   e-137
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   493   e-137
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   493   e-137

>ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
 gi|548841210|gb|ERN01273.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
          Length = 475

 Score = 527 bits (1357), Expect = e-147
 Identities = 240/387 (62%), Positives = 284/387 (73%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ++FESWCR HG+ Y + EEK  RF VF D                 +GLNAFADL HHEF
Sbjct: 71   DIFESWCRRHGRTYGTVEEKEQRFRVFSDNLVFIREHNQRANSNYTVGLNAFADLTHHEF 130

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                        ++  S             P S+DWR KGAVT VKDQGSCGACW+FSAT
Sbjct: 131  KIKRLGLCPS--ILRFSSSNFRSDQKKIDVPSSLDWRDKGAVTNVKDQGSCGACWAFSAT 188

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSL+SLSEQE++DCD TYNSGC GGLMDYA+KWV +NHGIDTE+DYPY+
Sbjct: 189  GAIEGINKIVTGSLISLSEQEIIDCDTTYNSGCGGGLMDYAFKWVTKNHGIDTEKDYPYR 248

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              + +C+K+K    VVTID +TD+P N+E+L+LQAVA QPVSVGICGSER+FQLYS GIF
Sbjct: 249  EVQGSCIKDKAERHVVTIDGHTDIPSNSEDLILQAVAKQPVSVGICGSERSFQLYSSGIF 308

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDGYMHMLR SG+ QGVCGINM+
Sbjct: 309  SGPCSTSLDHAVLIVGYGSKNGVDYWIVKNSWGTSWGMDGYMHMLRNSGDSQGVCGINMM 368

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
                                KCSLLTYCP G+TCCC+W  LG+C SWSCC+L++AVCCKD
Sbjct: 369  PSYPTKSGANPPPSPPPGPVKCSLLTYCPSGNTCCCTWRFLGICLSWSCCDLDNAVCCKD 428

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGNYS 3
             +YCCP DYP+C++ +  C KGSGN++
Sbjct: 429  GQYCCPQDYPVCNTATGYCLKGSGNWT 455

>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score = 519 bits (1336), Expect = e-144
 Identities = 252/399 (63%), Positives = 
282/399 (70%), Gaps = 14/399 (3%) Frame = -2 Query: 1157 FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA-------------LGL 1017 F++WC HGK YA+ EE+ AR AVF D A L L Sbjct: 36 FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95 Query: 1016 NAFADLAHHEFXXXXXXXXXXXAVVEP-SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQ 840 NAFADL H EF A + + PD++DWRK GAVT+VKDQ Sbjct: 96 NAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQ 155 Query: 839 GSCGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQN 660 GSCGACWSFSATGAMEGINKI TGSLVSLSEQEL+DCD++YNSGC GGLMDYAYK+VI+N Sbjct: 156 GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215 Query: 659 HGIDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGS 480 GIDTEEDYPY+ A+ TC KNKL RVVTID YTDVP N E+LLLQAVA QPVSVGICGS Sbjct: 216 GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275 Query: 479 ERTFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKS 300 R FQLY +GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG SWGM GYMHM R + Sbjct: 276 ARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 335 Query: 299 GNPQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWS 120 G+ +GVCGINM+A TKCSLLTYCPEGSTCCCSW +LG C SWS Sbjct: 336 GDSKGVCGINMMASFPTKTSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGFCLSWS 395 Query: 119 CCELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3 CCEL++AVCCKD+RYCCPHDYP+CD+G QC K SGN+S Sbjct: 396 CCELDNAVCCKDNRYCCPHDYPVCDTGRGQCLKASGNFS 434 >gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 517 bits (1332), Expect = e-144 Identities = 246/384 (64%), Positives = 277/384 (72%) Frame = -2 Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFX 981 LFE+WC HGK Y+S+EEK R VFE+ +L LNAFADL HHEF Sbjct: 29 LFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLTHHEFK 88 Query: 980 XXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSATG 801 +E S P S+DWR KGAVT+VKDQGSCGACWSFSATG Sbjct: 89 ASRLGLSAA--AIEGSRPNLQLPGLVRDIPASMDWRTKGAVTKVKDQGSCGACWSFSATG 146 Query: 800 
AMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQA 621 A+EGINKIVTG+LVSLSEQELVDCD++YNSGCEGGLMDYAY++VI NHGID EEDYPY Sbjct: 147 AIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEEDYPYLG 206 Query: 620 AEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIFS 441 EKTC K K RVVTID Y VP NNE+LLLQAVA QPVSVGICGSER FQLYSKGIF+ Sbjct: 207 REKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYSKGIFT 266 Query: 440 GPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINMLA 261 GPCS+SLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HMLR SG+ +G+CGINMLA Sbjct: 267 GPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCGINMLA 326 Query: 260 XXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKDH 81 TKC L TYC G TCCC+ I G+CFSW CCEL+SAVCCKD+ Sbjct: 327 SYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAVCCKDN 386 Query: 80 RYCCPHDYPICDSGSKQCFKGSGN 9 R+CCP+DYP+CD+ QC K GN Sbjct: 387 RHCCPYDYPVCDTKKSQCLKRVGN 410 >ref|XP_002510459.1| cysteine protease, putative [Ricinus communis] gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis] Length = 422 Score = 516 bits (1328), Expect = e-143 Identities = 247/383 (64%), Positives = 278/383 (72%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 +LFESW + HGK Y S E+KL RF +FE+ L LNAFADL HHEF Sbjct: 30 KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 + S P S+DWRKKGAV++VKDQG+CGACWSFSAT Sbjct: 90 KASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GA+EGINKIVTGSLVSLSEQELVDCD++YN+GCEGGLMDYAY++VI+N+GIDTEEDYPYQ Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 A EKTC K KL VVTID YTDVP NNE+ LL+AVA QPVSVGICGSER FQLYSKGIF Sbjct: 210 AREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIF 269 Query: 443 
SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WG++GYM+MLR SGN QG+CGINML Sbjct: 270 TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINML 329 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A TKC L T C EG TCCC+ I GLCFSW CCEL+SAVCCKD Sbjct: 330 ASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKD 389 Query: 83 HRYCCPHDYPICDSGSKQCFKGS 15 +CCPHDYP+CD+ C K S Sbjct: 390 GLHCCPHDYPVCDTKRNMCLKVS 412 >ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] gi|557095297|gb|ESQ35879.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum] Length = 444 Score = 513 bits (1321), Expect = e-143 Identities = 240/388 (61%), Positives = 277/388 (71%), Gaps = 1/388 (0%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELF+ WC HGK Y S+EE+ R +F D +L LNAFADL HHEF Sbjct: 35 ELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHEF 94 Query: 983 XXXXXXXXXXXA-VVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSA 807 ++ PDSVDWRKKGAVT VKDQGSCGACWSFSA Sbjct: 95 KASRLGLSAPSPSLMAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 154 Query: 806 TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627 TGAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPY Sbjct: 155 TGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPY 214 Query: 626 QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447 Q + TC K+KL RVVTIDSY V NNE+ L++AVA QPVSVGICGSER FQLYS GI Sbjct: 215 QEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSSGI 274 Query: 446 FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267 FSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R +GN +GVCGINM Sbjct: 275 FSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGINM 334 Query: 266 LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87 LA TKC+L TYC G TCCC+ ++ GLCFSW CCELESAVCCK Sbjct: 335 
LASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVCCK 394 Query: 86 DHRYCCPHDYPICDSGSKQCFKGSGNYS 3 D R+CCP DYP+CD+ C K +GN++ Sbjct: 395 DGRHCCPRDYPVCDTTKSLCLKKTGNFT 422 >ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa] gi|550339725|gb|EEE94684.2| cysteine protease family protein [Populus trichocarpa] Length = 436 Score = 509 bits (1311), Expect = e-142 Identities = 240/385 (62%), Positives = 273/385 (70%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 +LFE+WC+ HGK+Y S EE+ R VFED +L LNAFADL HHEF Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 + + P S+DWR KG VT VKDQGSCGACWSFSAT Sbjct: 87 KTSRLGLSAAP--LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GA+EGINKIVTGSLVSLSEQEL++CDK+YN GC GGLMDYA+++VI NHGIDTEEDYPY+ Sbjct: 145 GAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYR 204 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 A + TC K+++ RVVTID Y DVP NNE+ LLQAVA QPVSVGICGSER FQ+YSKGIF Sbjct: 205 ARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIF 264 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM GYMHM R SGN QGVCGINML Sbjct: 265 TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINML 324 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A TKC+LLTYC G TCCC+ G+C SW CC L+SAVCCKD Sbjct: 325 ASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCCKD 384 Query: 83 HRYCCPHDYPICDSGSKQCFKGSGN 9 +CCPHDYP+CD+ CFK +GN Sbjct: 385 RLHCCPHDYPVCDTDKNMCFKRAGN 409 >gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group] Length = 449 Score = 509 bits (1310), Expect = e-141 Identities = 242/387 (62%), Positives = 277/387 (71%), Gaps = 2/387 (0%) Frame = -2 Query: 1157 
FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFXX 978 FE+WC HG++YA+ E+ AR A F D L LNAFADL H EF Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYA-LALNAFADLTHDEFRA 96 Query: 977 XXXXXXXXXAVV-EPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSATG 801 + PD+VDWR+ GAVT+VKDQGSCGACWSFSATG Sbjct: 97 ARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156 Query: 800 AMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQA 621 AMEGINKI TGSL+SLSEQEL+DCD++YNSGC GGLMDYAYK+V++N GIDTE DYPY+ Sbjct: 157 AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216 Query: 620 AEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIFS 441 + TC KNKL RVVTID Y DVP NNE++LLQAVA QPVSVGICGS R FQLYSKGIF Sbjct: 217 TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276 Query: 440 GPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINMLA 261 GPC TSLDHA+LIVGYGSE G DYWI+KNSWG SWGM GYM+M R +GN GVCGIN + Sbjct: 277 GPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMP 336 Query: 260 XXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKDH 81 TKCSLLTYCPEGSTCCCSW +LGLC SWSCCEL++AVCCKD+ Sbjct: 337 SFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDN 396 Query: 80 RYCCPHDYPICDSGSKQCFK-GSGNYS 3 RYCCPHDYP+CD+ S++CFK +GN+S Sbjct: 397 RYCCPHDYPVCDTASQRCFKANNGNFS 423 >gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] Length = 437 Score = 509 bits (1310), Expect = e-141 Identities = 238/387 (61%), Positives = 278/387 (71%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELF+ WC+ HGK Y S+EE+ R +F+D +L LNAFADL HHEF Sbjct: 30 ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 V + PDSVDWRKKGAVT VKDQGSCGACWSFSAT Sbjct: 90 KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC 
GGLMDYA+++VI+NHGIDTE+DYPYQ Sbjct: 149 GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 + TC K+KL +VVTIDSY V N+E+ L++AVA QPVSVGICGSER FQLYS+GIF Sbjct: 209 ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRGIF 268 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N GVCGINML Sbjct: 269 SGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINML 328 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A TKC+L TYC G TCCC+ + GLCFSW CCE+ESAVCCKD Sbjct: 329 ASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCCKD 388 Query: 83 HRYCCPHDYPICDSGSKQCFKGSGNYS 3 R+CCPHDYP+CD+ C K +GN++ Sbjct: 389 GRHCCPHDYPVCDTTRSLCLKKTGNFT 415 >dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas] Length = 441 Score = 508 bits (1309), Expect = e-141 Identities = 242/385 (62%), Positives = 275/385 (71%), Gaps = 1/385 (0%) Frame = -2 Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFX 981 LFE+WC+ HGK YAS EEKL R VF+D L LNAFADL HHEF Sbjct: 29 LFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFK 88 Query: 980 XXXXXXXXXXAV-VEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 + + P SVDWRK GAVT+VKDQG+CGACWSFSAT Sbjct: 89 ASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSAT 148 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GA+EGINKIVTGSLVSLSEQELVDCDK+YN+GCEGG+MDYA+++VI NHGIDTEEDYPYQ Sbjct: 149 GAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQ 208 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 +++C K KL VVTID Y DVP NNE+ LL+AVA+QPVSVGICGSER FQLYSKGIF Sbjct: 209 GRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIF 268 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG+ WGMDGYMHM R SG+ +G+CGINML Sbjct: 269 
TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINML 328 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A T+C L T+C EG TCCC I G+C SW CCEL+SAVCCKD Sbjct: 329 ASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCCKD 388 Query: 83 HRYCCPHDYPICDSGSKQCFKGSGN 9 R+CCP DYP+CD+ C K GN Sbjct: 389 GRHCCPRDYPVCDTTRNICLKHYGN 413 >ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group] gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group] Length = 450 Score = 508 bits (1309), Expect = e-141 Identities = 242/388 (62%), Positives = 277/388 (71%), Gaps = 3/388 (0%) Frame = -2 Query: 1157 FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFXX 978 FE+WC HG++YA+ E+ AR A F D L LNAFADL H EF Sbjct: 38 FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYA-LALNAFADLTHDEFRA 96 Query: 977 XXXXXXXXXAVV--EPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 + PD+VDWR+ GAVT+VKDQGSCGACWSFSAT Sbjct: 97 ARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSAT 156 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGINKI TGSL+SLSEQEL+DCD++YNSGC GGLMDYAYK+V++N GIDTE DYPY+ Sbjct: 157 GAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYR 216 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 + TC KNKL RVVTID Y DVP NNE++LLQAVA QPVSVGICGS R FQLYSKGIF Sbjct: 217 ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIF 276 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 GPC TSLDHA+LIVGYGSE G DYWI+KNSWG SWGM GYM+M R +GN GVCGIN + Sbjct: 277 DGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 TKCSLLTYCPEGSTCCCSW +LGLC SWSCCEL++AVCCKD Sbjct: 337 PSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKD 396 Query: 83 HRYCCPHDYPICDSGSKQCFK-GSGNYS 3 +RYCCPHDYP+CD+ S++CFK +GN+S Sbjct: 397 
NRYCCPHDYPVCDTASQRCFKANNGNFS 424 >ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana] gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana] gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana] gi|332190386|gb|AEE28507.1| papain-like cysteine peptidase [Arabidopsis thaliana] Length = 437 Score = 508 bits (1308), Expect = e-141 Identities = 238/387 (61%), Positives = 277/387 (71%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELF+ WC+ HGK Y S+EE+ R +F+D +L LNAFADL HHEF Sbjct: 30 ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 V + PDSVDWRKKGAVT VKDQGSCGACWSFSAT Sbjct: 90 KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ Sbjct: 149 GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 + TC K+KL +VVTIDSY V N+E+ L++AVA QPVSVGICGSER FQLYS GIF Sbjct: 209 ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSGIF 268 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N GVCGINML Sbjct: 269 SGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINML 328 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A TKC+L TYC G TCCC+ + GLCFSW CCE+ESAVCCKD Sbjct: 329 ASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCCKD 388 Query: 83 HRYCCPHDYPICDSGSKQCFKGSGNYS 3 R+CCPHDYP+CD+ C K +GN++ Sbjct: 389 GRHCCPHDYPVCDTTRSLCLKKTGNFT 415 >gb|EXC25025.1| Oryzain alpha chain [Morus notabilis] Length = 517 Score = 507 bits (1305), Expect = e-141 Identities = 242/381 (63%), Positives = 268/381 (70%) Frame = -2 Query: 1163 
ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 +LFE+WC HG++Y+S+EE+L R VFED L LNAFADL HHEF Sbjct: 28 QLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHHEF 87 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 P P S+DWRKKGAVT VKDQGSCGACW+FSAT Sbjct: 88 KSSRLGFSSALLSSLPKLGSKLLDLRDV--PASLDWRKKGAVTNVKDQGSCGACWAFSAT 145 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GA+EGINKIVTGSLVSLSEQEL+DCD +YN+GC+GGLMDYAY++VI NHGIDTEEDYPYQ Sbjct: 146 GAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQ 205 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 A +K+C K KL RVVTID YTDV PNN LLQAV QPVSVGICGSER FQLYSKGIF Sbjct: 206 ARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSKGIF 265 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 +GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG WGMDGY+HM R +GN QGVCGINML Sbjct: 266 TGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGINML 325 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A T+CS C EG TCCCSW LGLCFSW CC L SAVCCKD Sbjct: 326 ASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVCCKD 385 Query: 83 HRYCCPHDYPICDSGSKQCFK 21 +CCP DYP+CD+ C K Sbjct: 386 KIHCCPQDYPLCDTQRNVCLK 406 >ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus] Length = 431 Score = 504 bits (1298), Expect = e-140 Identities = 244/385 (63%), Positives = 269/385 (69%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELFE WC HGK+Y+S EEKL R VF D L LN++ADL HHEF Sbjct: 27 ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEF 86 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 P PDS+DWRKKGAVT VKDQGSCGACWSFSAT Sbjct: 87 KVSRLGFSPALRNFRP--VLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCGACWSFSAT 144 Query: 803 
GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGIN+I+TGSL+SLSEQEL+DCD++YNSGC GGLMDYAY++VI NHGIDTE DYPYQ Sbjct: 145 GAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYPYQ 204 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444 A + +C K+KL VVTID Y D+P N+E LLQAVA QPVSVGICGSER FQLYSKGIF Sbjct: 205 ARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKGIF 264 Query: 443 SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264 SGPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG SWGMDGYMHM R SGN +GVCGIN L Sbjct: 265 SGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGINKL 324 Query: 263 AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84 A TKCS+LT C G TCCC+ LGLC SW CC L SAVCCKD Sbjct: 325 ASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCCKD 384 Query: 83 HRYCCPHDYPICDSGSKQCFKGSGN 9 R+CCP DYPICD+ C K + N Sbjct: 385 GRHCCPFDYPICDTDRNLCLKQTMN 409 >ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica] Length = 454 Score = 504 bits (1297), Expect = e-140 Identities = 242/397 (60%), Positives = 277/397 (69%), Gaps = 11/397 (2%) Frame = -2 Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA------LGLNAFADL 999 LF++WC HGK YA+ EE+ AR AVF D L LNAFADL Sbjct: 32 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARANAVGGSPPSYTLALNAFADL 91 Query: 998 AHHEFXXXXXXXXXXXAVVEP-----SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGS 834 H EF V + PD+VDWRKKGAVT+VK+QGS Sbjct: 92 THEEFRAARLGRLAVGRVGATLRSAGAPVFGGLDGGVAAVPDAVDWRKKGAVTKVKNQGS 151 Query: 833 CGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHG 654 CGACWSFSATGA+EGINKI TGSLVSLSEQEL+DCD++YN+GC GGLMDYA+K+VI+N G Sbjct: 152 CGACWSFSATGAIEGINKIKTGSLVSLSEQELIDCDRSYNNGCGGGLMDYAFKFVIKNGG 211 Query: 653 IDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSER 474 IDTE+DYPY+ A+ TC KNKL RVVTID Y+DVP N E LLLQAVA QPVSVGICGS R Sbjct: 212 IDTEDDYPYRQADGTCNKNKLKRRVVTIDGYSDVPSNKENLLLQAVAQQPVSVGICGSAR 271 Query: 473 TFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGN 
294 FQLYS+GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG WGM GYMHM R +G Sbjct: 272 AFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGA 331 Query: 293 PQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCC 114 G+CGINM+ TKC+LLTYCPEGSTCCCSW +LGLC SWSCC Sbjct: 332 SSGICGINMMPSFPTKTSPNPPPSPGPGPTKCNLLTYCPEGSTCCCSWRVLGLCLSWSCC 391 Query: 113 ELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3 L++A+CCKD+RYCCPHDYPICD+ QC + +GN+S Sbjct: 392 GLDNAICCKDNRYCCPHDYPICDTVRAQCLRANGNFS 428 >ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor] gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor] Length = 463 Score = 503 bits (1295), Expect = e-140 Identities = 247/398 (62%), Positives = 278/398 (69%), Gaps = 12/398 (3%) Frame = -2 Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA--------LGLNAFA 1005 LF++WC HGK YA+ EE+ AR AVF D L LNAFA Sbjct: 40 LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99 Query: 1004 DLAHHEFXXXXXXXXXXXAVVEPSXXXXXXXXXXXXA---PDSVDWRKKGAVTRVKDQGS 834 DL H EF A S PD++DWR+ GAVT+VKDQGS Sbjct: 100 DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159 Query: 833 CGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHG 654 CGACWSFSATGAMEGINKI TGSLVSLSEQEL+DCD++YNSGC GGLMDYAYK+V++N G Sbjct: 160 CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219 Query: 653 IDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSER 474 IDTEEDYPY+ A+ TC KNKL R+VTID Y+DVP N E+LLLQAVA QPVSVGICGS R Sbjct: 220 IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279 Query: 473 TFQLYS-KGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSG 297 FQLYS +GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG SWGM GYMHM R +G Sbjct: 280 AFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTG 339 Query: 296 NPQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSC 117 + +GVCGINM+A TKCSLLTYCPEGSTCCCSW ILG C SWSC Sbjct: 340 DSKGVCGINMMASFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRILGFCLSWSC 399 Query: 116 
CELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3 CEL++AVCCKD++ CCPHDYP+CD+ C K SGN S Sbjct: 400 CELDNAVCCKDNKSCCPHDYPVCDTDRGLCLKASGNSS 437 >ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata] Length = 439 Score = 503 bits (1294), Expect = e-140 Identities = 236/389 (60%), Positives = 280/389 (71%), Gaps = 2/389 (0%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELF+ WC+ HGK Y S+EE+ R +F+D +L LNAFADL HHEF Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 + + + PDSVDWRKKGAVT VKDQGSCGACWSFSAT Sbjct: 90 KASRLGLSVSASSLIMASKGQSLGGNAKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ Sbjct: 149 GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSK--G 450 + TC K+KL +VVTIDSY V N+E+ L +AVA QPVSVGICGSER FQLYS+ G Sbjct: 209 ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSG 268 Query: 449 IFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGIN 270 IFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R +GN +G+CGIN Sbjct: 269 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGIN 328 Query: 269 MLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCC 90 MLA TKC+L TYC G TCCC+ ++ GLCFSW CCE+ESAVCC Sbjct: 329 MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAVCC 388 Query: 89 KDHRYCCPHDYPICDSGSKQCFKGSGNYS 3 D R+CCPHDYP+CD+ C K +GN++ Sbjct: 389 SDGRHCCPHDYPVCDTTRSLCLKKTGNFT 417 >gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135) [Arabidopsis thaliana] Length = 416 Score = 496 bits (1276), Expect = e-137 Identities = 236/388 (60%), Positives = 273/388 (70%), 
Gaps = 7/388 (1%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELF+ WC+ HGK Y S+EE+ R +F+D +L LNAFADL HHEF Sbjct: 28 ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 87 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804 V + PDSVDWRKKGAVT VKDQGSCGACWSFSAT Sbjct: 88 KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 146 Query: 803 GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624 GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ Sbjct: 147 GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 206 Query: 623 AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYS---- 456 + TC K+KL +VVTIDSY V N+E+ L++AVA QPVSVGICGSER FQLYS Sbjct: 207 ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSKFY 266 Query: 455 ---KGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQG 285 +GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N G Sbjct: 267 LLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 326 Query: 284 VCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELE 105 VCGINMLA TKC+L TYC G TCCC+ + GLCFSW CCE+E Sbjct: 327 VCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIE 386 Query: 104 SAVCCKDHRYCCPHDYPICDSGSKQCFK 21 SAVCCKD R+CCPHDYP+CD+ C K Sbjct: 387 SAVCCKDGRHCCPHDYPVCDTTRSLCLK 414 >ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum] Length = 436 Score = 494 bits (1273), Expect = e-137 Identities = 236/387 (60%), Positives = 270/387 (69%), Gaps = 2/387 (0%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 +LF+ WC+ HGK Y S++EK RF VFED L LNAFADL HHEF Sbjct: 28 KLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHEF 87 Query: 983 XXXXXXXXXXXAVVEP--SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFS 810 + P +DWRK GAV+ VKDQGSCGACWSFS Sbjct: 88 KATRLGLPPSSLLRFKFNRFQDQQRSDDFLQVPSEIDWRKNGAVSIVKDQGSCGACWSFS 147 Query: 809 
ATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYP 630 ATGA+EGINKIVTGSLVSLSEQELVDCD TYNSGC+GGLMDYAY+++I N+GIDTEEDYP Sbjct: 148 ATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEEDYP 207 Query: 629 YQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKG 450 YQA + C K+KL RVVTID YTDVPPN+E+ LL+AVA QPVSVGICGS R FQLYSKG Sbjct: 208 YQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLYSKG 267 Query: 449 IFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGIN 270 IF+GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG WGM+GY+HMLR + + G+CGIN Sbjct: 268 IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLCGIN 327 Query: 269 MLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCC 90 MLA KC+L TYC G TCCC+ LG+CFSW CC + SAVCC Sbjct: 328 MLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSAVCC 387 Query: 89 KDHRYCCPHDYPICDSGSKQCFKGSGN 9 KD R+CCP DYP+CD+ + QC K N Sbjct: 388 KDKRHCCPLDYPVCDASNGQCLKRIAN 414 >ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis] Length = 441 Score = 493 bits (1268), Expect = e-137 Identities = 235/380 (61%), Positives = 264/380 (69%), Gaps = 1/380 (0%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELFE+WC+ HGK Y+S++EK R +FED L LNAFADL H EF Sbjct: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXA-PDSVDWRKKGAVTRVKDQGSCGACWSFSA 807 + P S+DWRKKGAVT VKDQ SCGACW+FSA Sbjct: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146 Query: 806 TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627 TGA+EGINKIVTGSLVSLSEQEL+DCD++YNSGC GGLMDYAY++VI+NHGIDTE+DYPY Sbjct: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206 Query: 626 QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447 + C K KLN +VTID Y DVP NNE+ LLQAV QPVSVGICGSER FQLYS GI Sbjct: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266 Query: 446 
FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267 F+GPCSTSLDHAVLI+GY SENGVDYWI+KNSWG SWGM+GYMHM R +GN G+CGINM Sbjct: 267 FTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326 Query: 266 LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87 LA T+CSLLTYC G TCCC SILG+C SW CC SAVCC Sbjct: 327 LASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFSSAVCCS 386 Query: 86 DHRYCCPHDYPICDSGSKQC 27 DHRYCCP +YPICDS QC Sbjct: 387 DHRYCCPSNYPICDSVRHQC 406 >ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] gi|557537201|gb|ESR48319.1| hypothetical protein CICLE_v10001178mg [Citrus clementina] Length = 441 Score = 493 bits (1268), Expect = e-137 Identities = 236/380 (62%), Positives = 264/380 (69%), Gaps = 1/380 (0%) Frame = -2 Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984 ELFE+WC+ HGK Y+S++EK R +FED L LNAFADL H EF Sbjct: 27 ELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86 Query: 983 XXXXXXXXXXXAVVEPSXXXXXXXXXXXXA-PDSVDWRKKGAVTRVKDQGSCGACWSFSA 807 + P S+DWRKKGAVT VKDQ SCGACW+FSA Sbjct: 87 KASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146 Query: 806 TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627 TGA+EGINKIVTGSLVSLSEQEL+DCD++YNSGC GGLMDYAY++VI+NHGIDTE+DYPY Sbjct: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206 Query: 626 QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447 + C K KLN +VTID Y DVP NNE+ LLQAV QPVSVGICGSER FQLYS GI Sbjct: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266 Query: 446 FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267 F+GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG SWGM+GYMHM R +GN G+CGINM Sbjct: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326 Query: 266 LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87 LA T+CSLLTYC G TCCC SILG+C SW CC SAVCC Sbjct: 327 LASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCS 386 Query: 86 
DHRYCCPHDYPICDSGSKQC 27 DHRYCCP +YPICDS QC Sbjct: 387 DHRYCCPSNYPICDSVRHQC 406