BLASTX nr result
ID: Jatropha_contig00025753
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Jatropha_contig00025753 (759 letters) Database: NCBI-nr (updated 2014/02/11) 35,149,712 sequences; 12,374,887,350 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002512963.1| cysteine protease, putative [Ricinus communi... 355 2e-98 gb|EEE85962.2| cysteine proteinase A494 precursor [Populus trich... 353 1e-97 emb|CAE54306.1| putative papain-like cysteine proteinase [Gossyp... 352 7e-97 ref|XP_002305451.1| predicted protein [Populus trichocarpa] 353 7e-97 gb|EMJ02509.1| hypothetical protein PRUPE_ppa007454mg [Prunus pe... 351 9e-97 ref|XP_004290288.1| PREDICTED: cysteine proteinase RD19a-like [F... 349 1e-96 gb|EOX97908.1| Papain family cysteine protease [Theobroma cacao] 350 2e-96 ref|XP_002313770.1| predicted protein [Populus trichocarpa] gi|2... 349 6e-96 gb|ABK92536.1| unknown [Populus trichocarpa] 349 6e-96 gb|AAX19661.1| cysteine proteinase [Populus tomentosa] 347 2e-95 dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas] 347 2e-94 ref|NP_001267989.1| cysteine protease precursor [Vitis vinifera]... 337 3e-93 gb|EOY14897.1| Papain family cysteine protease [Theobroma cacao] 338 6e-93 ref|NP_001241450.1| uncharacterized protein LOC100778716 precurs... 338 4e-92 ref|XP_004507484.1| PREDICTED: cysteine proteinase RD19a-like [C... 333 2e-91 ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [C... 332 5e-91 gb|ESR48330.1| hypothetical protein CICLE_v10001559mg [Citrus cl... 330 2e-90 ref|XP_002510469.1| cysteine protease, putative [Ricinus communi... 334 3e-90 ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula] gi... 329 3e-90 ref|XP_004230928.1| PREDICTED: cysteine proteinase RD19a-like [S... 330 8e-90 >ref|XP_002512963.1| cysteine protease, putative [Ricinus communis] gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis] Length = 373 Score = 355 bits (910), Expect(2) = 2e-98 Identities = 174/234 (74%), Positives = 198/234 (84%), Gaps = 3/234 (1%) Frame = +1 Query: 16 MAIRXXXXXXXXXXXXXAVIAETLTSSETVEDPLIRQVVD---DGRGEAHVLSAEHHFSL 186 MA+R AV AETLT+ EDPLIRQV D + ++L AEHHFSL Sbjct: 3 MAVRFSFFVISSILFVSAVTAETLTTDG--EDPLIRQVTDGQDESSANPNLLGAEHHFSL 60 Query: 187 FKKKFGKSYASQEEHNYRFKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLG 366 FKKKF K+YASQEEH+YRFK+F++NLRRA RHQKLDP+A+HGVTQFSDLT +EF++QFLG Sbjct: 61 FKKKFKKTYASQEEHDYRFKIFKSNLRRAERHQKLDPTATHGVTQFSDLTHSEFRRQFLG 120 Query: 367 LRSLRLPKDAHQAPILPTNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFL 546 LR LRLPKDA++AP+LPTNDLP DFDWREKGAVT VKNQGSCGSCWSFSTTGALEGA++L Sbjct: 121 LRRLRLPKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYL 180 Query: 547 STGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 +TG+LVSLSEQQLVDCDHECDP E G+CDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 181 ATGKLVSLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLKAGGLMREE 234 Score = 31.6 bits (70), Expect(2) = 2e-98 Identities = 12/17 (70%), Positives = 14/17 (82%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD GAC+FDKT Sbjct: 235 DYPYTGTDRGACQFDKT 251 >gb|EEE85962.2| cysteine proteinase A494 precursor [Populus trichocarpa] Length = 368 Score = 353 bits (905), Expect(2) = 1e-97 Identities = 175/216 (81%), Positives = 192/216 (88%), Gaps = 2/216 (0%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVD-DGRGEAHVLSAE-HHFSLFKKKFGKSYASQEEHNYR 240 AV AETL +DPLIR+VVD +++LSAE HHFSLFK KF KSY SQEEH+YR Sbjct: 18 AVHAETLNG----DDPLIREVVDGQDASSSNLLSAEQHHFSLFKSKFKKSYGSQEEHDYR 73 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPT 420 F VF+ANLRRAARHQ+LDP+ASHGVTQFSDLTPAEF+KQ LGLR LRLPKDA++APILPT Sbjct: 74 FSVFKANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGLRRLRLPKDANEAPILPT 133 Query: 421 NDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDH 600 +DLPEDFDWR+KGAV +KNQGSCGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH Sbjct: 134 SDLPEDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDH 193 Query: 601 ECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 ECDPEE GSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 194 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 229 Score = 31.2 bits (69), Expect(2) = 1e-97 Identities = 12/16 (75%), Positives = 13/16 (81%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y GTD GACKFDK Sbjct: 230 DYPYTGTDRGACKFDK 245 >emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum] Length = 373 Score = 352 bits (902), Expect(2) = 7e-97 Identities = 168/214 (78%), Positives = 189/214 (88%), Gaps = 1/214 (0%) Frame = +1 Query: 70 VIAETLTSSETVEDPLIRQVVDDGRG-EAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFK 246 + ET ++ DPLI QV D G E +L+AEHH+SLFKK+F KSY SQ+EH+YRFK Sbjct: 21 ICTETFSAEGFEVDPLIEQVTDGHEGAEPQLLTAEHHYSLFKKRFKKSYGSQKEHDYRFK 80 Query: 247 VFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTND 426 +FQ NLRRAARHQ LDPSA+HGVTQFSDLTP EF+K +LGLR LRLPKDA +APILPT++ Sbjct: 81 IFQVNLRRAARHQNLDPSATHGVTQFSDLTPGEFRKAYLGLRRLRLPKDATEAPILPTDN 140 Query: 427 LPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHEC 606 LP+DFDWREKGAVT VKNQGSCGSCWSFSTTGALEGA+FL+TG+LVSLSEQQLVDCDHEC Sbjct: 141 LPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 200 Query: 607 DPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 DPEEAGSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 201 DPEEAGSCDSGCNGGLMNSAFEYTLKAGGLMREE 234 Score = 29.6 bits (65), Expect(2) = 7e-97 Identities = 11/17 (64%), Positives = 12/17 (70%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G CKFD T Sbjct: 235 DYPYTGTDRGTCKFDNT 251 >ref|XP_002305451.1| predicted protein [Populus trichocarpa] Length = 368 Score = 353 bits (905), Expect(2) = 7e-97 Identities = 175/216 (81%), Positives = 192/216 (88%), Gaps = 2/216 (0%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVD-DGRGEAHVLSAE-HHFSLFKKKFGKSYASQEEHNYR 240 AV AETL +DPLIR+VVD +++LSAE HHFSLFK KF KSY SQEEH+YR Sbjct: 18 AVHAETLNG----DDPLIREVVDGQDASSSNLLSAEQHHFSLFKSKFKKSYGSQEEHDYR 73 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPT 420 F VF+ANLRRAARHQ+LDP+ASHGVTQFSDLTPAEF+KQ LGLR LRLPKDA++APILPT Sbjct: 74 FSVFKANLRRAARHQELDPTASHGVTQFSDLTPAEFRKQVLGLRRLRLPKDANEAPILPT 133 Query: 421 NDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDH 600 +DLPEDFDWR+KGAV +KNQGSCGSCWSFS TGALEGAHFL+TGELVSLSEQQLVDCDH Sbjct: 134 SDLPEDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEGAHFLATGELVSLSEQQLVDCDH 193 Query: 601 ECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 ECDPEE GSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 194 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 229 Score = 28.5 bits (62), Expect(2) = 7e-97 Identities = 11/16 (68%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y GTD ACKFDK Sbjct: 230 DYPYTGTDRDACKFDK 245 >gb|EMJ02509.1| hypothetical protein PRUPE_ppa007454mg [Prunus persica] Length = 367 Score = 351 bits (900), Expect(2) = 9e-97 Identities = 169/202 (83%), Positives = 183/202 (90%), Gaps = 2/202 (0%) Frame = +1 Query: 109 DPLIRQVVDDGRGEAHV--LSAEHHFSLFKKKFGKSYASQEEHNYRFKVFQANLRRAARH 282 DPLIRQVVD G H L AEHHFSLFK +FGKSYASQEEH+YRF+VF+ANLRRAARH Sbjct: 25 DPLIRQVVDGGDDHQHDHRLGAEHHFSLFKHQFGKSYASQEEHDYRFEVFKANLRRAARH 84 Query: 283 QKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDLPEDFDWREKGA 462 Q LDPSA HGVT+FSD+TPAEF+K LGLR LRLP DA +APILPT +LPEDFDWR++GA Sbjct: 85 QMLDPSAQHGVTRFSDMTPAEFRKSQLGLRGLRLPSDATKAPILPTENLPEDFDWRDRGA 144 Query: 463 VTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGC 642 VT VKNQGSCGSCWSFSTTGALEGAHFL+TGELVSLSEQQLVDCDHECDPEEAGSCDSGC Sbjct: 145 VTAVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPEEAGSCDSGC 204 Query: 643 NGGLMNSAFEYTLQAGGLMREE 708 NGGLMNSAFEYTL+AGGLM+EE Sbjct: 205 NGGLMNSAFEYTLKAGGLMKEE 226 Score = 30.0 bits (66), Expect(2) = 9e-97 Identities = 11/16 (68%), Positives = 13/16 (81%) Frame = +3 Query: 711 YSYPGTDLGACKFDKT 758 Y Y GTD G+CKFDK+ Sbjct: 228 YPYTGTDRGSCKFDKS 243 >ref|XP_004290288.1| PREDICTED: cysteine proteinase RD19a-like [Fragaria vesca subsp. vesca] Length = 372 Score = 349 bits (895), Expect(2) = 1e-96 Identities = 167/213 (78%), Positives = 184/213 (86%), Gaps = 11/213 (5%) Frame = +1 Query: 103 VEDPLIRQVVDDGRGEAH-----------VLSAEHHFSLFKKKFGKSYASQEEHNYRFKV 249 V DPLIRQVVD G H +L AEHHFSLFK+KFGKSYASQEEH+YRF V Sbjct: 24 VSDPLIRQVVDGGDESPHHHHHHDHDGEAMLGAEHHFSLFKRKFGKSYASQEEHDYRFSV 83 Query: 250 FQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDL 429 F+ N+RRA RHQ+LDP+A HGVT+FSDLTPAEF+K LGLR L+LP DA+ APILPT +L Sbjct: 84 FKTNMRRAKRHQRLDPTAQHGVTRFSDLTPAEFRKSHLGLRGLKLPADANTAPILPTENL 143 Query: 430 PEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECD 609 PEDFDWRE+GAVT VKNQGSCGSCWSFSTTGALEGAHFL+TGELVSLSEQQLVDCDHECD Sbjct: 144 PEDFDWRERGAVTEVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECD 203 Query: 610 PEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 PEEAGSCDSGCNGGLMNSAFEYTL+AGGLM+E+ Sbjct: 204 PEEAGSCDSGCNGGLMNSAFEYTLKAGGLMKEK 236 Score = 31.6 bits (70), Expect(2) = 1e-96 Identities = 12/17 (70%), Positives = 13/17 (76%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G CKFDKT Sbjct: 237 DYPYTGTDRGTCKFDKT 253 >gb|EOX97908.1| Papain family cysteine protease [Theobroma cacao] Length = 395 Score = 350 bits (897), Expect(2) = 2e-96 Identities = 170/214 (79%), Positives = 188/214 (87%), Gaps = 1/214 (0%) Frame = +1 Query: 70 VIAETLTSSETVEDPLIRQVVDDGRG-EAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFK 246 + ET ++ + DPLIRQV D G E +L+AEHHFSLFK +F KSY SQEEH+YRFK Sbjct: 43 ISTETFSAEGSEVDPLIRQVTDGQDGAEPQLLTAEHHFSLFKSRFKKSYGSQEEHDYRFK 102 Query: 247 VFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTND 426 VFQ NLRRAARHQKLDPSASHGVTQFSDLTP EF++ +LGLR LRLPKDA +APILPT++ Sbjct: 103 VFQDNLRRAARHQKLDPSASHGVTQFSDLTPREFRRTYLGLRRLRLPKDATEAPILPTDN 162 Query: 427 LPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHEC 606 LPEDFDW EKGAVT VKNQGSCGSCWSFSTTGALEGA+FL+TG+LVSLSEQQLVDCDHEC Sbjct: 163 LPEDFDWSEKGAVTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHEC 222 Query: 607 DPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 DPEE SCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 223 DPEEPDSCDSGCNGGLMNSAFEYTLKAGGLMREE 256 Score = 30.0 bits (66), Expect(2) = 2e-96 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G CKFDK+ Sbjct: 257 DYPYTGTDRGTCKFDKS 273 >ref|XP_002313770.1| predicted protein [Populus trichocarpa] gi|222850178|gb|EEE87725.1| cysteine proteinase A494 precursor [Populus trichocarpa] Length = 368 Score = 349 bits (896), Expect(2) = 6e-96 Identities = 174/216 (80%), Positives = 190/216 (87%), Gaps = 2/216 (0%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVD-DGRGEAHVLSAE-HHFSLFKKKFGKSYASQEEHNYR 240 A+ AET +D LIRQVV+ +++L+AE HHFSLFK+KF KSY SQEEH+YR Sbjct: 18 AISAETFNG----DDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYR 73 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPT 420 F VF++NLRRAARHQKLDP+ASHGVTQFSDLT AEF+KQ LGLR LRLPKDA+ APILPT Sbjct: 74 FSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLRKLRLPKDANTAPILPT 133 Query: 421 NDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDH 600 NDLPEDFDWREKGAV VKNQGSCGSCWSFSTTGALEGAHFL+TGELVSLSEQQLVDCDH Sbjct: 134 NDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDH 193 Query: 601 ECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 ECDPEE GSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 194 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 229 Score = 28.9 bits (63), Expect(2) = 6e-96 Identities = 11/16 (68%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y G D GACKFDK Sbjct: 230 DYPYTGMDRGACKFDK 245 >gb|ABK92536.1| unknown [Populus trichocarpa] Length = 368 Score = 349 bits (896), Expect(2) = 6e-96 Identities = 174/216 (80%), Positives = 190/216 (87%), Gaps = 2/216 (0%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVD-DGRGEAHVLSAE-HHFSLFKKKFGKSYASQEEHNYR 240 A+ AET +D LIRQVV+ +++L+AE HHFSLFK+KF KSY SQEEH+YR Sbjct: 18 AISAETFNG----DDSLIRQVVEGQDESSSNLLTAEQHHFSLFKRKFKKSYLSQEEHDYR 73 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPT 420 F VF++NLRRAARHQKLDP+ASHGVTQFSDLT AEF+KQ LGLR LRLPKDA+ APILPT Sbjct: 74 FSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLRKLRLPKDANTAPILPT 133 Query: 421 NDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDH 600 NDLPEDFDWREKGAV VKNQGSCGSCWSFSTTGALEGAHFL+TGELVSLSEQQLVDCDH Sbjct: 134 NDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDH 193 Query: 601 ECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 ECDPEE GSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 194 ECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 229 Score = 28.9 bits (63), Expect(2) = 6e-96 Identities = 11/16 (68%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y G D GACKFDK Sbjct: 230 DYPYTGMDRGACKFDK 245 >gb|AAX19661.1| cysteine proteinase [Populus tomentosa] Length = 374 Score = 347 bits (891), Expect(2) = 2e-95 Identities = 175/217 (80%), Positives = 193/217 (88%), Gaps = 3/217 (1%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVDDGRGEA--HVLSAE-HHFSLFKKKFGKSYASQEEHNY 237 A+ AET +D LIRQVV+ G+ E+ ++L+AE HH SLFK+KF KSY SQEEH+Y Sbjct: 24 AISAETFNG----DDSLIRQVVE-GQDESSPNLLTAEQHHLSLFKRKFKKSYLSQEEHDY 78 Query: 238 RFKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILP 417 RF VF++NLRRAARHQKLDP+ASHGVTQFSDLT AEF+KQ LGLR LRLPKDA++APILP Sbjct: 79 RFSVFKSNLRRAARHQKLDPTASHGVTQFSDLTSAEFRKQVLGLRKLRLPKDANKAPILP 138 Query: 418 TNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCD 597 TNDLPEDFDWREKGAV VKNQGSCGSCWSFSTTGALEGAHFL+TGELVSLSEQQLVDCD Sbjct: 139 TNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCD 198 Query: 598 HECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 HECDPEE GSCDSGCNGGLMNSAFEYTL+AGGLMREE Sbjct: 199 HECDPEEPGSCDSGCNGGLMNSAFEYTLKAGGLMREE 235 Score = 28.9 bits (63), Expect(2) = 2e-95 Identities = 11/16 (68%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y G D GACKFDK Sbjct: 236 DYPYTGMDRGACKFDK 251 >dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas] Length = 368 Score = 347 bits (889), Expect(2) = 2e-94 Identities = 164/212 (77%), Positives = 188/212 (88%) Frame = +1 Query: 73 IAETLTSSETVEDPLIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFKVF 252 IA T TS + ++DPLIRQVV DG + H+L+AEHHF+ FK KFGK+YA+QEEH+YRFK+F Sbjct: 19 IAST-TSPDELDDPLIRQVVPDG-DQDHLLNAEHHFTTFKAKFGKTYATQEEHDYRFKLF 76 Query: 253 QANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDLP 432 +ANLRRA +HQ +DP+A HGVT FSDLTP EF++Q+LGLR LRLP DAH+APILPTNDLP Sbjct: 77 KANLRRARKHQMMDPTAVHGVTMFSDLTPREFRRQYLGLRRLRLPADAHEAPILPTNDLP 136 Query: 433 EDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDP 612 DFDWR+ GAVTNVKNQGSCGSCWSFS GALEGAHFL+TGELVSLSEQQLVDCDHECDP Sbjct: 137 TDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDP 196 Query: 613 EEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 EE G+CDSGCNGGLM +AFEYTL+AGGL REE Sbjct: 197 EEYGACDSGCNGGLMTTAFEYTLKAGGLEREE 228 Score = 26.2 bits (56), Expect(2) = 2e-94 Identities = 9/16 (56%), Positives = 11/16 (68%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y G D G CKFD+ Sbjct: 229 DYPYTGNDRGPCKFDR 244 >ref|NP_001267989.1| cysteine protease precursor [Vitis vinifera] gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera] Length = 377 Score = 337 bits (865), Expect(2) = 3e-93 Identities = 164/220 (74%), Positives = 192/220 (87%), Gaps = 6/220 (2%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVDD-----GRGEAHVLSAEHH-FSLFKKKFGKSYASQEE 228 A+ + L S + +D +IRQVV + G E ++L+A+HH FS+FK++FGKSYASQEE Sbjct: 19 ALTSSELHSGGSDDDIIIRQVVPELGDVEGGEEENLLTADHHHFSIFKRRFGKSYASQEE 78 Query: 229 HNYRFKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAP 408 H+YRFKVF+ANLRRA RHQ+LDPSA+HGVTQFSDLTPAEF+ +LGLR L+LP DA +AP Sbjct: 79 HDYRFKVFKANLRRARRHQQLDPSATHGVTQFSDLTPAEFRGTYLGLRPLKLPHDAQKAP 138 Query: 409 ILPTNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLV 588 ILPTNDLPEDFDWR+ GAVT VKNQGSCGSCWSFSTTGALEGA+FL+TG LVSLSEQQLV Sbjct: 139 ILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALEGANFLATGNLVSLSEQQLV 198 Query: 589 DCDHECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 +CDHECDPEE GSCDSGCNGGLMN+AFEYTL+AGGLM+EE Sbjct: 199 ECDHECDPEEMGSCDSGCNGGLMNTAFEYTLKAGGLMKEE 238 Score = 32.0 bits (71), Expect(2) = 3e-93 Identities = 12/17 (70%), Positives = 14/17 (82%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G+CKFDKT Sbjct: 239 DYPYTGTDRGSCKFDKT 255 >gb|EOY14897.1| Papain family cysteine protease [Theobroma cacao] Length = 401 Score = 338 bits (868), Expect(2) = 6e-93 Identities = 162/217 (74%), Positives = 188/217 (86%), Gaps = 4/217 (1%) Frame = +1 Query: 70 VIAETLTSSETVED-PLIRQVVDDGRGEA---HVLSAEHHFSLFKKKFGKSYASQEEHNY 237 V+A + S ED PLIRQVV +G GE H+L+AEHHF+LFK K+GK+YA+QEEH+Y Sbjct: 46 VVASAVVSDVVSEDDPLIRQVVSNGAGEDSDDHLLNAEHHFTLFKSKYGKTYATQEEHDY 105 Query: 238 RFKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILP 417 R VF+ANLRRA RHQ LDP+A HGVT+FSDLTP+EF++QFLGLR L+LP DA +APILP Sbjct: 106 RLGVFKANLRRAKRHQLLDPTAVHGVTKFSDLTPSEFRRQFLGLRPLKLPADAQKAPILP 165 Query: 418 TNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCD 597 TNDLP DFDWR+ GAVT VK+QGSCGSCWSFSTTGALEGAH+LSTGELVSLSEQQLVDCD Sbjct: 166 TNDLPTDFDWRDHGAVTGVKDQGSCGSCWSFSTTGALEGAHYLSTGELVSLSEQQLVDCD 225 Query: 598 HECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 HECDP+E G+CDSGCNGGLM +AFEYTL+AGGL RE+ Sbjct: 226 HECDPQEYGACDSGCNGGLMTTAFEYTLKAGGLERED 262 Score = 29.6 bits (65), Expect(2) = 6e-93 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y G D GACKFDK+ Sbjct: 263 DYPYTGNDRGACKFDKS 279 >ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max] gi|255639509|gb|ACU20049.1| unknown [Glycine max] Length = 366 Score = 338 bits (867), Expect(2) = 4e-92 Identities = 161/216 (74%), Positives = 187/216 (86%), Gaps = 3/216 (1%) Frame = +1 Query: 70 VIAETLTSSETVEDP---LIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYR 240 +++ T+ ++E ++D LIRQVV D + H+L+AEHHFS FK KFGK+YA+QEEH++R Sbjct: 13 LLSATVAAAERIDDEDDLLIRQVVPDAE-DHHLLNAEHHFSAFKTKFGKTYATQEEHDHR 71 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPT 420 F++F+ NL RA HQKLDPSA HGVT+FSDLTPAEF++QFLGL+ LRLP DA +APILPT Sbjct: 72 FRIFKNNLLRAKSHQKLDPSAVHGVTRFSDLTPAEFRRQFLGLKPLRLPSDAQKAPILPT 131 Query: 421 NDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDH 600 NDLP DFDWRE GAVT VKNQGSCGSCWSFS GALEGAHFLSTGELVSLSEQQLVDCDH Sbjct: 132 NDLPTDFDWREHGAVTGVKNQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDH 191 Query: 601 ECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 ECDPEE G+CDSGCNGGLM +AFEYTLQAGGLMRE+ Sbjct: 192 ECDPEERGACDSGCNGGLMTTAFEYTLQAGGLMREK 227 Score = 27.3 bits (59), Expect(2) = 4e-92 Identities = 10/17 (58%), Positives = 12/17 (70%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y G D G CKFDK+ Sbjct: 228 DYPYTGRDRGPCKFDKS 244 >ref|XP_004507484.1| PREDICTED: cysteine proteinase RD19a-like [Cicer arietinum] Length = 363 Score = 333 bits (854), Expect(2) = 2e-91 Identities = 161/210 (76%), Positives = 183/210 (87%), Gaps = 1/210 (0%) Frame = +1 Query: 82 TLTSSETVE-DPLIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFKVFQA 258 T+ SSETVE DPLIRQVVDDG L AEHHF FK +FGK Y++++EH+YRF VF+A Sbjct: 18 TVFSSETVEEDPLIRQVVDDGGVR---LGAEHHFIEFKHRFGKVYSTKDEHDYRFNVFKA 74 Query: 259 NLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDLPED 438 N+RRA RHQ +DPSA HGVT+FSDLTP EF+ LGLR LRLP DA+ APILPT++LP D Sbjct: 75 NMRRAKRHQLMDPSAVHGVTRFSDLTPREFRNSVLGLRGLRLPSDANTAPILPTDNLPAD 134 Query: 439 FDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEE 618 FDWR++GAVT+VKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDP+E Sbjct: 135 FDWRDRGAVTSVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPDE 194 Query: 619 AGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 GSCDSGCNGGLMNSAFEY L++GG+MREE Sbjct: 195 PGSCDSGCNGGLMNSAFEYILKSGGVMREE 224 Score = 29.6 bits (65), Expect(2) = 2e-91 Identities = 11/16 (68%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y GTD G CKFDK Sbjct: 225 DYPYSGTDRGTCKFDK 240 >ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus] gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus] Length = 377 Score = 332 bits (850), Expect(2) = 5e-91 Identities = 160/222 (72%), Positives = 191/222 (86%), Gaps = 8/222 (3%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVVDDG------RGEAHVLSAEHHFSLFKKKFGKSYASQEE 228 + I + S E+ D +IRQVVDDG G+ +L A+HHFS+FK+KFGKSYAS+EE Sbjct: 17 SAIGSEVISGESDGDFIIRQVVDDGGVNEGSNGDDLLLGADHHFSVFKQKFGKSYASKEE 76 Query: 229 HNYRFKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRL--PKDAHQ 402 H++RF+VF+ANL+RA RHQ LDPSA+HGVTQFSDLTP+EF++ FLGLRS RL P DA++ Sbjct: 77 HDHRFRVFKANLKRAQRHQALDPSATHGVTQFSDLTPSEFRRSFLGLRSRRLGLPADANK 136 Query: 403 APILPTNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQ 582 APILPT+ LP DFDWR+KGAV+ VKNQGSCGSCWSFS TGALEGA+FL+TG+LVSLSEQQ Sbjct: 137 APILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQ 196 Query: 583 LVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 LVDCDHECDPEE GSCDSGCNGGLMNSAFEYTL++GGLM+E+ Sbjct: 197 LVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKSGGLMKEQ 238 Score = 30.0 bits (66), Expect(2) = 5e-91 Identities = 11/17 (64%), Positives = 13/17 (76%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G CKFDK+ Sbjct: 239 DYPYTGTDRGTCKFDKS 255 >gb|ESR48330.1| hypothetical protein CICLE_v10001559mg [Citrus clementina] Length = 369 Score = 330 bits (846), Expect(2) = 2e-90 Identities = 162/217 (74%), Positives = 187/217 (86%), Gaps = 3/217 (1%) Frame = +1 Query: 67 AVIAETLTSSETVEDPLIRQVV--DDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYR 240 AV+A + S +D +IRQVV D + E H+L+AEHHFSLFK KF K+YA+QEEH+YR Sbjct: 16 AVLASAVAVSG--DDAMIRQVVPSDGEQSEDHLLNAEHHFSLFKSKFSKTYATQEEHDYR 73 Query: 241 FKVFQANLRRAARHQKLDPSASHGVTQFSDLTPAEFKKQFLGL-RSLRLPKDAHQAPILP 417 F+VF+ANLRRA R Q LDP+A HGVT+FSDLTP+EF++QFLGL R LRLP DA +APILP Sbjct: 74 FRVFKANLRRAKRRQLLDPTAVHGVTKFSDLTPSEFRRQFLGLNRRLRLPADAQKAPILP 133 Query: 418 TNDLPEDFDWREKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCD 597 TNDLP DFDWR+ GAVT VK+QG+CGSCWSFS TGALEGAHFLSTGELVSLSEQQLVDCD Sbjct: 134 TNDLPTDFDWRDHGAVTGVKDQGACGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCD 193 Query: 598 HECDPEEAGSCDSGCNGGLMNSAFEYTLQAGGLMREE 708 HECDPEE+GSCDSGCNGGLMNSAFEY L+AGG+ RE+ Sbjct: 194 HECDPEESGSCDSGCNGGLMNSAFEYILKAGGVEREK 230 Score = 29.6 bits (65), Expect(2) = 2e-90 Identities = 11/17 (64%), Positives = 14/17 (82%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD G+CKFDK+ Sbjct: 231 DYPYTGTDGGSCKFDKS 247 >ref|XP_002510469.1| cysteine protease, putative [Ricinus communis] gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis] Length = 366 Score = 334 bits (857), Expect(2) = 3e-90 Identities = 156/206 (75%), Positives = 183/206 (88%) Frame = +1 Query: 91 SSETVEDPLIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFKVFQANLRR 270 +S+ ++DPLIRQVV D E ++LSA+HHF+ FK KFGK+YA+QEEH+YRFKVF+ANLRR Sbjct: 24 TSDELDDPLIRQVVPDV--EDYLLSAQHHFTAFKAKFGKNYATQEEHDYRFKVFKANLRR 81 Query: 271 AARHQKLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDLPEDFDWR 450 A +HQ +DPSA HGVT+FSDLTP EF++Q+LGL+ LRLP DAH+APILPT+ +PEDFDWR Sbjct: 82 AQKHQLMDPSAVHGVTKFSDLTPREFRRQYLGLKKLRLPADAHEAPILPTDGIPEDFDWR 141 Query: 451 EKGAVTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSC 630 + GAVTNVKNQGSCGSCWSFS GALEGAHFL+TGELVSLSEQQLVDCDHECDP E G+C Sbjct: 142 DHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGAC 201 Query: 631 DSGCNGGLMNSAFEYTLQAGGLMREE 708 DSGCNGGLM +AFEY L+AGGL REE Sbjct: 202 DSGCNGGLMTNAFEYILKAGGLEREE 227 Score = 25.0 bits (53), Expect(2) = 3e-90 Identities = 8/16 (50%), Positives = 12/16 (75%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y G+D G CKF++ Sbjct: 228 DYPYTGSDRGPCKFER 243 >ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula] gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula] Length = 362 Score = 329 bits (844), Expect(2) = 3e-90 Identities = 155/201 (77%), Positives = 179/201 (89%) Frame = +1 Query: 106 EDPLIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFKVFQANLRRAARHQ 285 EDP+IRQVVD+ E L AEHHF+LFK KFGK Y+S++EH+YRFK+F++NL RA RHQ Sbjct: 27 EDPIIRQVVDE---EGVRLGAEHHFNLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQ 83 Query: 286 KLDPSASHGVTQFSDLTPAEFKKQFLGLRSLRLPKDAHQAPILPTNDLPEDFDWREKGAV 465 +DPSA HGVT+FSDLTP EF+K LGLR + LPKDA+ APILPT++LP+DFDWREKGAV Sbjct: 84 LMDPSAVHGVTRFSDLTPREFRKSVLGLRGVGLPKDANAAPILPTDNLPKDFDWREKGAV 143 Query: 466 TNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGCN 645 T VKNQGSCGSCWSFSTTGALEGAHFLSTG+LVSLSEQQLVDCDHECDPE+ GSCD+GCN Sbjct: 144 TAVKNQGSCGSCWSFSTTGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCN 203 Query: 646 GGLMNSAFEYTLQAGGLMREE 708 GGLMNSAFEY L++GG+MREE Sbjct: 204 GGLMNSAFEYILKSGGVMREE 224 Score = 30.0 bits (66), Expect(2) = 3e-90 Identities = 11/16 (68%), Positives = 13/16 (81%) Frame = +3 Query: 708 NYSYPGTDLGACKFDK 755 +Y Y GTD G+CKFDK Sbjct: 225 DYPYSGTDRGSCKFDK 240 >ref|XP_004230928.1| PREDICTED: cysteine proteinase RD19a-like [Solanum lycopersicum] Length = 369 Score = 330 bits (847), Expect(2) = 8e-90 Identities = 160/202 (79%), Positives = 180/202 (89%), Gaps = 1/202 (0%) Frame = +1 Query: 106 EDPLIRQVVDDGRGEAHVLSAEHHFSLFKKKFGKSYASQEEHNYRFKVFQANLRRAARHQ 285 +D LIRQVV D + H+L+AEHHF+LFKK+FGK+YAS EEH+YRF VF+ANLRRA RHQ Sbjct: 32 DDILIRQVVGDE--DHHMLNAEHHFTLFKKRFGKTYASDEEHHYRFSVFKANLRRAMRHQ 89 Query: 286 KLDPSASHGVTQFSDLTPAEFKKQFLGL-RSLRLPKDAHQAPILPTNDLPEDFDWREKGA 462 KLDPSA HGVTQFSD+TP EF ++FLG+ R LR P DA++APILPT DLP DFDWRE GA Sbjct: 90 KLDPSAVHGVTQFSDMTPDEFSQKFLGVNRRLRFPSDANKAPILPTEDLPSDFDWREHGA 149 Query: 463 VTNVKNQGSCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGC 642 VT VKNQGSCGSCWSFSTTGALEGA+FL+TG+LVSLSEQQLVDCDHECDPEE SCDSGC Sbjct: 150 VTPVKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGC 209 Query: 643 NGGLMNSAFEYTLQAGGLMREE 708 +GGLMNSAFEYTL+AGGLMREE Sbjct: 210 SGGLMNSAFEYTLKAGGLMREE 231 Score = 27.3 bits (59), Expect(2) = 8e-90 Identities = 10/17 (58%), Positives = 11/17 (64%) Frame = +3 Query: 708 NYSYPGTDLGACKFDKT 758 +Y Y GTD CKFD T Sbjct: 232 DYPYTGTDKATCKFDNT 248