BLASTX nr result
ID: Glycyrrhiza29_contig00014747
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00014747
         (1750 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

XP_004500967.1 PREDICTED: zingipain-2 [Cicer arietinum]              684   0.0
XP_013462179.1 papain family cysteine protease [Medicago truncat...  659   0.0
KHN45886.1 Oryzain alpha chain [Glycine soja]                        652   0.0
XP_003523725.1 PREDICTED: low-temperature-induced cysteine prote...  649   0.0
XP_019438032.1 PREDICTED: low-temperature-induced cysteine prote...  643   0.0
KYP64636.1 Oryzain alpha chain [Cajanus cajan]                       637   0.0
XP_007136041.1 hypothetical protein PHAVU_009G013000g [Phaseolus...  635   0.0
XP_014501447.1 PREDICTED: zingipain-2 [Vigna radiata var. radiata]   632   0.0
XP_017437409.1 PREDICTED: zingipain-2 [Vigna angularis] BAT77731...  631   0.0
AGV54418.1 oryzain alpha chain-like protein [Phaseolus vulgaris]     629   0.0
XP_012071947.1 PREDICTED: low-temperature-induced cysteine prote...  612   0.0
EOY14881.1 JHL18I08.3 protein [Theobroma cacao]                      611   0.0
XP_007017656.2 PREDICTED: zingipain-2 [Theobroma cacao]              610   0.0
XP_002510459.2 PREDICTED: low-temperature-induced cysteine prote...  607   0.0
XP_016687622.1 PREDICTED: low-temperature-induced cysteine prote...  607   0.0
XP_018821065.1 PREDICTED: low-temperature-induced cysteine prote...  605   0.0
XP_016167594.1 PREDICTED: cysteine proteinase COT44 isoform X2 [...  604   0.0
XP_015933819.1 PREDICTED: cysteine proteinase COT44 isoform X2 [...  602   0.0
XP_017612038.1 PREDICTED: zingipain-2 [Gossypium arboreum]           601   0.0
XP_012444786.1 PREDICTED: zingipain-2 [Gossypium raimondii] KJB5...  600   0.0

>XP_004500967.1 PREDICTED: zingipain-2 [Cicer arietinum]
          Length = 436

 Score = 684 bits (1766), Expect = 0.0
 Identities = 330/426 (77%), Positives = 352/426 (82%), Gaps = 2/426 (0%)
 Frame = -3

Query: 1535 STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFA 1356
            +  DTS+LF+ WCK+HGK Y SE+EKRYR  VFEDNY FVA+HNQ G NSSYTLSLNAFA
Sbjct: 22   TAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIG-NSSYTLSLNAFA 80

Query: 1355 DLTHHEFKASRLGLPPHSSFLRF--NRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGS 1182
            DLTHHEFKA+RLGLPP SS LRF  NRFQDQQ +D  L VPSEIDWR  GAV+ VKDQGS
Sbjct: 81   DLTHHEFKATRLGLPP-SSLLRFKFNRFQDQQRSDDFLQVPSEIDWRKNGAVSIVKDQGS 139

Query: 1181 CGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNG 1002
            CGACWSF+ TGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGC+GGLMDYAYQF+IDNNG
Sbjct: 140  CGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNG 199

Query: 1001 IDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSER 822
            IDTEEDYPYQ RQ LC KDKL+RRVVTIDGY DVP NDEK+LLKAVA QPVSVGICGS R
Sbjct: 200  IDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSAR 259

Query: 821  AFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGD 642
            AFQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HMLRNT
Sbjct: 260  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDS 319

Query: 641  SEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCC 462
            S GLCGINMLASY                 KCNLFTYCSGGETCCCA+  LGICF WKCC
Sbjct: 320  SAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCC 379

Query: 461  GLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGSQ*AFHQT 282
            G+TSAVCCKDKRHCCP DYPVCD   GQCLKRI+NGT   T +KED          FHQT
Sbjct: 380  GVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKED---------PFHQT 430

Query: 281  KGWGSH 264
            + W S+
Sbjct: 431  RDWRSN 436

>XP_013462179.1 papain family cysteine protease [Medicago truncatula]
 KEH36214.1 papain family cysteine protease [Medicago truncatula]
          Length = 443

 Score = 659 bits (1701), Expect = 0.0
 Identities = 317/424 (74%), Positives = 346/424 (81%), Gaps = 3/424 (0%)
 Frame = -3

Query: 1526 
DTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLT 1347 DTS+LF+ W K+HGK Y SEEEKRYR KVF+DNY FV++HN+ G NSSYTLSLNAFADLT Sbjct: 31 DTSKLFQEWSKQHGKTYPSEEEKRYRFKVFQDNYAFVSQHNEMG-NSSYTLSLNAFADLT 89 Query: 1346 HHEFKASRLGLPPHSSFLRF--NRFQDQQPNDRH-LDVPSEIDWRNTGAVTPVKDQGSCG 1176 HHEFK +RLG P SS LRF N F+DQQ +D L VPSEIDWR + AVTPVKDQGSCG Sbjct: 90 HHEFKTTRLGFSP-SSLLRFKFNHFEDQQFDDNGILQVPSEIDWRKSDAVTPVKDQGSCG 148 Query: 1175 ACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGID 996 ACWSF+ TGAIEGINKIVTGSLVSLSEQELVDCD TYNSGC+GGLMDYAYQF+IDN GID Sbjct: 149 ACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRTYNSGCDGGLMDYAYQFIIDNKGID 208 Query: 995 TEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAF 816 TEEDYPYQ RQ LC KDKL+RRVVTIDGY DVP NDEK+LLKAVA QPVSVGICGS RAF Sbjct: 209 TEEDYPYQSRQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAF 268 Query: 815 QLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSE 636 QLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK+WGMNG++HMLRNT +S Sbjct: 269 QLYSKGIFTGPCSTYLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMNGYIHMLRNTDNSA 328 Query: 635 GLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCGL 456 GLCGINMLASY +CNLFTYCS GETCCCA+ LGICF WKCCG Sbjct: 329 GLCGINMLASYPTKTSPNPPVPPPPGPIRCNLFTYCSRGETCCCAKKFLGICFSWKCCGK 388 Query: 455 TSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGSQ*AFHQTKG 276 TSAVCCKD+RHCCP DYP+CD R QCLKRI+NGT T +K+D+ FHQT+ Sbjct: 389 TSAVCCKDERHCCPLDYPICDIGRSQCLKRIANGTTTMPSDKQDT---------FHQTRD 439 Query: 275 WGSH 264 W SH Sbjct: 440 WSSH 443 >KHN45886.1 Oryzain alpha chain [Glycine soja] Length = 439 Score = 652 bits (1681), Expect = 0.0 Identities = 310/419 (73%), Positives = 346/419 (82%), Gaps = 7/419 (1%) Frame = -3 Query: 1535 STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNS----SYTLSL 1368 S DTSELFE WCKEH K YSSEEEK YR KVFEDNY FVA+HNQ N+ SYTLSL Sbjct: 25 SASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNPSYTLSL 84 Query: 1367 NAFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQ 1188 NAFADLTHHEFK +RLGLPP + LRF R Q+QQ D L 
+PS+IDWR +GAVTPVKDQ Sbjct: 85 NAFADLTHHEFKTTRLGLPP--TLLRFKRPQNQQSRDL-LHIPSQIDWRQSGAVTPVKDQ 141 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 SCGACW+F+ TGAIEGINKIVTGSL+SLSEQEL+DCDT+YNSGC GGLMD+AYQFVIDN Sbjct: 142 ASCGACWAFSATGAIEGINKIVTGSLLSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDN 201 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTEEDYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS Sbjct: 202 KGIDTEEDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 260 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 ER FQLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+ Sbjct: 261 EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 G+S+G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 321 GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGSQ 300 CCGLTSAVCCKDKRHCCPQDYP+CDTRRGQCLKR +NGT T T E +D +RGW SQ Sbjct: 381 CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSHKSRGWKSQ 439 >XP_003523725.1 PREDICTED: low-temperature-induced cysteine proteinase [Glycine max] KRH61098.1 hypothetical protein GLYMA_04G028300 [Glycine max] Length = 439 Score = 649 bits (1675), Expect = 0.0 Identities = 310/419 (73%), Positives = 345/419 (82%), Gaps = 7/419 (1%) Frame = -3 Query: 1535 STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRN----SSYTLSL 1368 S DTSELFE WCKEH K YSSEEEK YR KVFEDNY FVA+HNQ N SSYTLSL Sbjct: 25 SASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSL 84 Query: 1367 NAFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQ 1188 NAFADLTHHEFK +RLGLP + LRF R Q+QQ D L +PS+IDWR +GAVTPVKDQ Sbjct: 85 NAFADLTHHEFKTTRLGLP--LTLLRFKRPQNQQSRDL-LHIPSQIDWRQSGAVTPVKDQ 141 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 SCGACW+F+ TGAIEGINKIVTGSLVSLSEQEL+DCDT+YNSGC GGLMD+AYQFVIDN Sbjct: 142 
ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDN 201 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTE+DYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS Sbjct: 202 KGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 260 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 ER FQLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+ Sbjct: 261 EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 G+S+G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 321 GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGSQ 300 CCGLTSAVCCKDKRHCCPQDYP+CDTRRGQCLKR +NGT T T E +D +RGW SQ Sbjct: 381 CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSHKSRGWKSQ 439 >XP_019438032.1 PREDICTED: low-temperature-induced cysteine proteinase [Lupinus angustifolius] XP_019438033.1 PREDICTED: low-temperature-induced cysteine proteinase [Lupinus angustifolius] OIW14825.1 hypothetical protein TanjilG_17050 [Lupinus angustifolius] Length = 439 Score = 643 bits (1659), Expect = 0.0 Identities = 304/413 (73%), Positives = 334/413 (80%), Gaps = 4/413 (0%) Frame = -3 Query: 1526 DTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLT 1347 +T LFE WCK+H K YSSE+EK Y+ KVFEDNY FV +H + NSSYTLSLNAFADLT Sbjct: 29 NTYNLFETWCKQHNKTYSSEQEKLYKFKVFEDNYAFVTQHKNKVGNSSYTLSLNAFADLT 88 Query: 1346 HHEFKASRL-GLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGAC 1170 HHEFK SR+ GL P RFN Q+QQ +R L VPSE DWR GAVTPVKDQGSCGAC Sbjct: 89 HHEFKTSRIRGLSPR--LFRFNHSQNQQSGNRVLHVPSEFDWRKNGAVTPVKDQGSCGAC 146 Query: 1169 WSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTE 990 WSF+ TGAIEGINKIVTGSLVSLSEQELVDCD YNSGCEGGLMDYAYQFVIDN+GIDTE Sbjct: 147 WSFSATGAIEGINKIVTGSLVSLSEQELVDCDRNYNSGCEGGLMDYAYQFVIDNHGIDTE 206 Query: 989 EDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQL 810 
+DYPYQ R C+KDKL+RRVVTIDGY DVP+ DEK+LL+AV QPVSVGICGS+RAFQL Sbjct: 207 KDYPYQAHDRTCSKDKLKRRVVTIDGYTDVPQGDEKKLLEAVVSQPVSVGICGSDRAFQL 266 Query: 809 YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGL 630 YSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++HM+RN+G+SEGL Sbjct: 267 YSKGIFTGPCSTYLDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYIHMVRNSGNSEGL 326 Query: 629 CGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTS 450 CGIN LASY T+C+LFTYCS GETCCCA+S LGIC WKCCG+ S Sbjct: 327 CGINTLASYPIKTKPNPPTPPPPGPTRCSLFTYCSEGETCCCAKSFLGICLSWKCCGVNS 386 Query: 449 AVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TRGWGSQ 300 AVCCKDKRHCCP DYPVCDT RGQCLKR++N T T FE E S RGW SQ Sbjct: 387 AVCCKDKRHCCPHDYPVCDTARGQCLKRVANATITKAFENEGSFGTPRGWNSQ 439 >KYP64636.1 Oryzain alpha chain [Cajanus cajan] Length = 466 Score = 637 bits (1643), Expect = 0.0 Identities = 297/384 (77%), Positives = 327/384 (85%) Frame = -3 Query: 1520 SELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLTHH 1341 S++FE WCKEHGK YSSEE+KRYR KVFEDNY FVA HN+ S+YTLSLNAFADLTHH Sbjct: 72 SQVFERWCKEHGKTYSSEEQKRYRLKVFEDNYAFVAEHNRNANTSTYTLSLNAFADLTHH 131 Query: 1340 EFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGACWSF 1161 EFK SRLGL S LRF R ++QQP L VPSEIDWR +GAVTPVKDQGSCGACW+F Sbjct: 132 EFKTSRLGLALPPSLLRFKRPRNQQP-PHLLQVPSEIDWRKSGAVTPVKDQGSCGACWAF 190 Query: 1160 ATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTEEDY 981 + TGAIEGINKIVTGSL SLSEQEL+DCD +YNSGCEGGLMDYAYQF+IDN GIDTE+DY Sbjct: 191 SATGAIEGINKIVTGSLESLSEQELIDCDRSYNSGCEGGLMDYAYQFIIDNGGIDTEDDY 250 Query: 980 PYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQLYSK 801 PYQ R+R CNKDKLRRRVVTID Y+DVP N+E+ LLKAVA QPVSVGICGSERAFQLYS+ Sbjct: 251 PYQVRERTCNKDKLRRRVVTIDDYVDVPLNEEE-LLKAVATQPVSVGICGSERAFQLYSE 309 Query: 800 GIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGLCGI 621 GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+GDS+G+CGI Sbjct: 310 GIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGDSKGICGI 369 Query: 620 
NMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTSAVC 441 N LASY +CNLFT+CS GETCCCARS LGICF WKCCGLTSAVC Sbjct: 370 NTLASYPIKTKPNPPIPPPPGPVRCNLFTHCSQGETCCCARSFLGICFSWKCCGLTSAVC 429 Query: 440 CKDKRHCCPQDYPVCDTRRGQCLK 369 CKDKRHCCPQDYP+CDT +GQCLK Sbjct: 430 CKDKRHCCPQDYPICDTGKGQCLK 453 >XP_007136041.1 hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] ESW08035.1 hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris] Length = 428 Score = 635 bits (1639), Expect = 0.0 Identities = 309/418 (73%), Positives = 342/418 (81%), Gaps = 6/418 (1%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 1365 AS DTS+LFE WCKEH K YSSEEEKRYR VFEDNY FV++HN+ NS+YTLSLN Sbjct: 18 ASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLN 77 Query: 1364 AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 1188 AFADLTHHEFK SRLG P S LRF R Q+QQP RHL PS+IDWR +GAVTPVKDQ Sbjct: 78 AFADLTHHEFKTSRLGFSP--SLLRFKRVQNQQP--RHLLHNPSQIDWRQSGAVTPVKDQ 133 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 SCGACW+F+ TGAIEGINKIVTGSL SLSEQELVDCDT+YNSGCEGGLMDYAYQFVIDN Sbjct: 134 ASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDN 193 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTE+DYPYQ RQR CNKDKL+R +VTID Y+D+P N+E+ LLKAVA QPVSVGICGS Sbjct: 194 KGIDTEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEE-LLKAVASQPVSVGICGS 252 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 ERAFQLYS+GIF+GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WGM G++HM+RNT Sbjct: 253 ERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNT 312 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 GD +G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 313 GDPKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 370 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 303 CCGLTSAVCCKDKRHCCP+DYP+CDT + QCLK I+NGT T T +D RGW S Sbjct: 371 
CCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLK-ITNGTTTITSGNKDISNKPRGWKS 427 >XP_014501447.1 PREDICTED: zingipain-2 [Vigna radiata var. radiata] Length = 428 Score = 632 bits (1631), Expect = 0.0 Identities = 305/418 (72%), Positives = 344/418 (82%), Gaps = 6/418 (1%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 1365 A DTS+LFE WCKEH K YSSEEEKRYR +VFEDNY FV++HNQ NS+YTLSLN Sbjct: 18 APASDTSDLFERWCKEHAKTYSSEEEKRYRFRVFEDNYAFVSQHNQNANANNSTYTLSLN 77 Query: 1364 AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 1188 AFADLTHHEFK SRLG P S RF R Q+QQP RHL +PSEIDWR +GAVTPVKDQ Sbjct: 78 AFADLTHHEFKTSRLGFSP--SLHRFKRVQNQQP--RHLLHLPSEIDWRQSGAVTPVKDQ 133 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 +CGACWSF+ TGAIEGINKIVTGSL S+SEQELVDCDT+YNSGCEGGLMDYAYQFVIDN Sbjct: 134 STCGACWSFSATGAIEGINKIVTGSLESISEQELVDCDTSYNSGCEGGLMDYAYQFVIDN 193 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTE+DYPYQ RQR CNKDKL+RR+VTID Y D+P N+E+ LLKAVA QPVSVGICGS Sbjct: 194 KGIDTEDDYPYQARQRSCNKDKLKRRIVTIDDYADLPPNEEE-LLKAVASQPVSVGICGS 252 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 ERAFQLYS+GIF+GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG+ WGM+G++HM+RN+ Sbjct: 253 ERAFQLYSQGIFSGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGRYWGMDGYIHMIRNS 312 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 GDS+G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 313 GDSKGICGINTLASY--PIKTTPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 370 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 303 CCGLT+AVCCKD+RHCCP DYP+CDT++ QCLK I+NGT T T +D RGW S Sbjct: 371 CCGLTTAVCCKDRRHCCPLDYPICDTKKSQCLK-ITNGTTTITTGNQDFSNKPRGWKS 427 >XP_017437409.1 PREDICTED: zingipain-2 [Vigna angularis] BAT77731.1 hypothetical protein VIGAN_02032600 [Vigna angularis var. 
angularis] Length = 428 Score = 631 bits (1628), Expect = 0.0 Identities = 303/418 (72%), Positives = 344/418 (82%), Gaps = 6/418 (1%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 1365 AS +TS+LFE WCKEH K YSSEEEKRYR +VFEDNY FV++HNQ NS+YTLSLN Sbjct: 18 ASASNTSDLFERWCKEHAKTYSSEEEKRYRFRVFEDNYAFVSQHNQNANVNNSTYTLSLN 77 Query: 1364 AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 1188 AFADLTHHEFK SRLG P S RF R Q+QQP RHL +PSEIDWR +GAVTPVKDQ Sbjct: 78 AFADLTHHEFKTSRLGFSP--SLFRFKRVQNQQP--RHLLHLPSEIDWRQSGAVTPVKDQ 133 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 SCGACW+F+ TGAIEGINKIVTGSL S+SEQELVDCDT+YNSGCEGGLMDYAYQF+IDN Sbjct: 134 ASCGACWAFSATGAIEGINKIVTGSLESISEQELVDCDTSYNSGCEGGLMDYAYQFIIDN 193 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTE+DYPYQ RQR CNKDKL+RR+VTID Y D+P N+E+ LLKAVA QPVSVGICGS Sbjct: 194 KGIDTEDDYPYQARQRSCNKDKLKRRIVTIDDYADLPPNEEE-LLKAVASQPVSVGICGS 252 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 +RAFQLYS+GIF+GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG+ WGMNG++HM+RN+ Sbjct: 253 DRAFQLYSQGIFSGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGRYWGMNGYIHMIRNS 312 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 GDS+G+CGIN LASY +CNLFT+CS GETCCCA+S LG+CF WK Sbjct: 313 GDSKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGLCFSWK 370 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 303 CCGLTSAVCCKD+RHCCP DYP+CDT++ QCLK I+N T T T +D RGW S Sbjct: 371 CCGLTSAVCCKDRRHCCPLDYPICDTKKSQCLK-ITNETTTITTGNQDFSNKPRGWKS 427 >AGV54418.1 oryzain alpha chain-like protein [Phaseolus vulgaris] Length = 467 Score = 629 bits (1622), Expect = 0.0 Identities = 305/418 (72%), Positives = 343/418 (82%), Gaps = 6/418 (1%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 1365 AS DTS+LFE WCKEH K YSSEEEKRYR VFEDNY FV++HN+ NS+YTLSLN Sbjct: 57 
ASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLN 116 Query: 1364 AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 1188 AFADLTHHEFK SRLG P S LRF R Q+QQP RHL PS+IDWR +GAVTPVKDQ Sbjct: 117 AFADLTHHEFKTSRLGFSP--SLLRFKRVQNQQP--RHLLHNPSQIDWRQSGAVTPVKDQ 172 Query: 1187 GSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 1008 SCGACW+F+ TGAIEGINKIVTGSL SLSEQELVDCDT+YNSGCEGGLMD+AYQFVIDN Sbjct: 173 ASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDN 232 Query: 1007 NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 828 GIDTE+DYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS Sbjct: 233 KGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 291 Query: 827 ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 648 ERAFQLYS+GIF+GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WG++G++HM+RNT Sbjct: 292 ERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNT 351 Query: 647 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWK 468 GD +G+CGIN LASY +CNLFT+CS GETCCCA+S LGICF WK Sbjct: 352 GDPKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 409 Query: 467 CCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 303 CCGLTSAVCCKDKRHCCP+DYP+CDT + QCLK I+NGT T T +D RGW S Sbjct: 410 CCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLK-ITNGTTTITSGNKDISNKPRGWKS 466 >XP_012071947.1 PREDICTED: low-temperature-induced cysteine proteinase [Jatropha curcas] BAJ53169.1 JHL18I08.3 [Jatropha curcas] KDP38570.1 hypothetical protein JCGZ_04495 [Jatropha curcas] Length = 441 Score = 612 bits (1577), Expect = 0.0 Identities = 291/416 (69%), Positives = 329/416 (79%), Gaps = 4/416 (0%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 +S+ + + LFE WC++HGK Y+S+EEK +R KVF+DNYDFV HN +G NSSYTLSLNAF Sbjct: 21 SSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQG-NSSYTLSLNAF 79 Query: 1358 ADLTHHEFKASRLGLPPHSSF-LRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGS 1182 ADLTHHEFKASRLGL +S L +R Q P D DVP+ +DWR GAVT VKDQG+ Sbjct: 80 
ADLTHHEFKASRLGLSSAASASLNVDRSNRQIP-DFVADVPASVDWRKNGAVTQVKDQGN 138 Query: 1181 CGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNG 1002 CGACWSF+ TGAIEGINKIVTGSLVSLSEQELVDCD +YN+GCEGG+MDYA+QFVIDN+G Sbjct: 139 CGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHG 198 Query: 1001 IDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSER 822 IDTEEDYPYQGR R CNK+KL+R VVTIDGY+DVP+N+EK LLKAVA+QPVSVGICGSER Sbjct: 199 IDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSER 258 Query: 821 AFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGD 642 AFQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG WGM+G+MHM RN+G Sbjct: 259 AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGS 318 Query: 641 SEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCC 462 S GLCGINMLASY T+C+LFT+C GETCCC + GIC WKCC Sbjct: 319 SRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCC 378 Query: 461 GLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDST---RGWGS 303 L SAVCCKD RHCCP+DYPVCDT R CLK N T+ F K S+ R W S Sbjct: 379 ELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSS 434 >EOY14881.1 JHL18I08.3 protein [Theobroma cacao] Length = 438 Score = 611 bits (1576), Expect = 0.0 Identities = 293/412 (71%), Positives = 327/412 (79%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS S LFE WC +HGK YSSEEEK YR KVFE+NY FV +HN G NSSY+L+LNAF Sbjct: 21 ASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVG-NSSYSLALNAF 79 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 ADLTHHEFKASRLGL ++ + +R Q P D+P+ +DWR GAVT VKDQGSC Sbjct: 80 ADLTHHEFKASRLGLS--AAAIEGSRPNLQLPGLVR-DIPASMDWRTKGAVTKVKDQGSC 136 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEGINKIVTG+LVSLSEQELVDCD +YNSGCEGGLMDYAYQFVIDN+GI Sbjct: 137 GACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGI 196 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 D EEDYPY GR++ CNK+K +RRVVTIDGY VP N+E 
LL+AVA QPVSVGICGSERA Sbjct: 197 DNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERA 256 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLYSKGIF GPCS+SLDHAVLIVGYGSENGVDYWIVKNSWG WGMNG++HMLRN+GDS Sbjct: 257 FQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDS 316 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 +GLCGINMLASY TKC+LFTYCS GETCCC + GICF WKCC Sbjct: 317 KGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCE 376 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 303 L SAVCCKD RHCCP DYPVCDT++ QCLKR+ N T+ FEK STR + S Sbjct: 377 LDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHSTRKFSS 428 >XP_007017656.2 PREDICTED: zingipain-2 [Theobroma cacao] Length = 438 Score = 610 bits (1572), Expect = 0.0 Identities = 292/412 (70%), Positives = 327/412 (79%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS S LFE WC +HGK YSSEEEK YR KVFE+NY FV +HN G NSSY+L+LNAF Sbjct: 21 ASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVG-NSSYSLALNAF 79 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 ADLTHHEFKASRLGL ++ + +R Q P D+P+ +DWR GAVT VKDQGSC Sbjct: 80 ADLTHHEFKASRLGLS--AAAIDGSRPNLQLPGLVR-DIPASMDWRTKGAVTKVKDQGSC 136 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEGINKIVTG+LVSLSEQELVDCD +YNSGCEGGLMDYAYQFVIDN+GI Sbjct: 137 GACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGI 196 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 D EEDYPY GR++ CNK+K +RRVVTIDGY VP N+E LL+AVA QPVSVGICGSERA Sbjct: 197 DNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERA 256 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLYSKGIF GPCS+SLDHAVLIVGYGSENGVDYWIVKNSWG WGMNG++H+LRN+GDS Sbjct: 257 FQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHLLRNSGDS 316 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 
+GLCGINMLASY TKC+LFTYCS GETCCC + GICF WKCC Sbjct: 317 KGLCGINMLASYPMKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCE 376 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 303 L SAVCCKD RHCCP DYPVCDT++ QCLKR+ N T+ FEK STR + S Sbjct: 377 LDSAVCCKDNRHCCPNDYPVCDTKKSQCLKRVGNATRMEAFEKRHSTRKFSS 428 >XP_002510459.2 PREDICTED: low-temperature-induced cysteine proteinase [Ricinus communis] Length = 466 Score = 607 bits (1564), Expect = 0.0 Identities = 288/412 (69%), Positives = 334/412 (81%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 +S+ D S+LFE+W KEHGK Y+S+E+K YR K+FE+NY+FV +HN +G NSSYTLSLNAF Sbjct: 23 SSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQG-NSSYTLSLNAF 81 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 ADLTHHEFKASRLGL S+ + +R ++ +D DVP IDWR GAV+ VKDQG+C Sbjct: 82 ADLTHHEFKASRLGLSAFSTSGKLSR-RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNC 140 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEGINKIVTGSLVSLSEQELVDCD +YN+GCEGGLMDYAYQFVI+NNGI Sbjct: 141 GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGI 200 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 DTEEDYPYQ R++ CNK+KL+R VVTIDGY DVP+N+EK LLKAVA QPVSVGICGSERA Sbjct: 201 DTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERA 260 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG +WG+NG+M+MLRN+G+S Sbjct: 261 FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNS 320 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 +GLCGINMLAS+ TKC+LFT C GETCCC R + G+CF WKCC Sbjct: 321 QGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCE 380 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 303 L SAVCCKD HCCP DYPVCDT+R CLK N T+ T K+ S+ +GS Sbjct: 381 LDSAVCCKDGLHCCPHDYPVCDTKRNMCLKFPGNATRMETVAKKSSSGMFGS 432 >XP_016687622.1 PREDICTED: low-temperature-induced 
cysteine proteinase [Gossypium hirsutum] Length = 489 Score = 607 bits (1564), Expect = 0.0 Identities = 283/396 (71%), Positives = 321/396 (81%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS S++FE WC +HGK+YSSEEEK YR KVFEDNY FV +HN NSSY+L+LNAF Sbjct: 81 ASPSHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAM-TNSSYSLALNAF 139 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 ADLTHHEFKASRLGL + ++F R ++P D+P+ +DWR GAVT VKDQGSC Sbjct: 140 ADLTHHEFKASRLGLS--GAAIQFRRSTLREPRLVR-DIPASLDWREKGAVTQVKDQGSC 196 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI Sbjct: 197 GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 256 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 DTEEDYPYQGR+ CNK+KL+R VVTID Y DVP N+EK+LL+AVA QPVSVGICGSERA Sbjct: 257 DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSERA 316 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG WGMNG++HM+RNTG S Sbjct: 317 FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGKS 376 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 EG+CGINMLASY TKC+ FTYCS GETCCC + GICF WKCCG Sbjct: 377 EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCCG 436 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGT 351 L SAVCCKD RHCCP +YP+CDT+ QCLKR+ N T Sbjct: 437 LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 472 >XP_018821065.1 PREDICTED: low-temperature-induced cysteine proteinase [Juglans regia] Length = 442 Score = 605 bits (1559), Expect = 0.0 Identities = 286/406 (70%), Positives = 326/406 (80%) Frame = -3 Query: 1520 SELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLTHH 1341 S++FE WCK+HG+ YSSE EK YR +VF+DN+DFV ++N G NSSYTLSLNAFADLTHH Sbjct: 29 SKVFEAWCKQHGRTYSSEAEKLYRFRVFQDNFDFVTQYNDMG-NSSYTLSLNAFADLTHH 87 Query: 1340 
EFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGACWSF 1161 EFKASRLG P L R + P ++PSE+DWR GAVT VKDQGSCGACWSF Sbjct: 88 EFKASRLGFSPAGMSLNRQR-PFRGPGSVVREIPSEMDWRKKGAVTHVKDQGSCGACWSF 146 Query: 1160 ATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTEEDY 981 + TGAIEGINKIVTGSLVSLSEQELVDCD +++SGCEGGLMDYAYQF+IDN+GIDTE+DY Sbjct: 147 SATGAIEGINKIVTGSLVSLSEQELVDCDRSFDSGCEGGLMDYAYQFIIDNHGIDTEDDY 206 Query: 980 PYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQLYSK 801 PYQGR+R C K+KL+R VVTIDGY DV N+EK+LL+AVA QPVSVGICGSERAFQLYSK Sbjct: 207 PYQGRERSCIKEKLKRHVVTIDGYTDVQTNNEKQLLQAVATQPVSVGICGSERAFQLYSK 266 Query: 800 GIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGLCGI 621 GIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG WGM+G++HMLRN+G+S+GLCGI Sbjct: 267 GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMDGYVHMLRNSGNSQGLCGI 326 Query: 620 NMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTSAVC 441 NMLASY T+C++FTYC GETCCCAR LLGIC WKCC L SAVC Sbjct: 327 NMLASYPTKTRPNPPPPPPPGPTRCDIFTYCGEGETCCCARHLLGICISWKCCELNSAVC 386 Query: 440 CKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 303 CKD HCCP DYP+CDT+R QCLKR NGT+ E+ S+ GS Sbjct: 387 CKDHLHCCPLDYPICDTKRIQCLKRAGNGTRMEALERRSSSGKSGS 432 >XP_016167594.1 PREDICTED: cysteine proteinase COT44 isoform X2 [Arachis ipaensis] Length = 454 Score = 604 bits (1557), Expect = 0.0 Identities = 292/424 (68%), Positives = 331/424 (78%), Gaps = 12/424 (2%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS+ DTS+LF +WC HGK YSS+EE+ YR KVF+DNYD+V RHNQ NS YTLSLNAF Sbjct: 33 ASSSDTSQLFRSWCDHHGKIYSSDEERSYRLKVFQDNYDYVQRHNQMA-NSPYTLSLNAF 91 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQP-------NDRHL--DVPSEIDWRNTGAV 1206 ADLTH EFKAS LG SSFLRF QD Q ND ++ VPS IDWRN GAV Sbjct: 92 ADLTHQEFKASHLGALS-SSFLRFKNHQDHQSRYHNDNDNDNNILRQVPSSIDWRNEGAV 150 Query: 1205 TPVKDQGSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAY 1026 TPVK+QGSCGACW+F+ TGAIEGINKIVTG+L SLSEQELVDCD YNSGCEGGLMDYAY Sbjct: 151 
TPVKNQGSCGACWAFSATGAIEGINKIVTGTLESLSEQELVDCDKKYNSGCEGGLMDYAY 210 Query: 1025 QFVIDNNGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVS 846 QFVIDN+GIDTE DYP+ CNK+K++RRVVTIDGY DV ++EK+LL+AVA QPVS Sbjct: 211 QFVIDNHGIDTESDYPFLAHDAACNKNKMKRRVVTIDGYTDVLPSNEKKLLEAVATQPVS 270 Query: 845 VGICGSERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHM 666 VGICGS RAFQLYS+GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++ Sbjct: 271 VGICGSARAFQLYSQGIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYI 330 Query: 665 HMLRNTGDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLG 486 HM+RN G S+G+CGINMLASY TKCNLFTYC ETCCC+ +LG Sbjct: 331 HMVRNNG-SQGICGINMLASYPTKTTPNPPPPPPPGPTKCNLFTYCPAAETCCCSWRVLG 389 Query: 485 ICFKWKCCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TR 315 +C +KCCGL SAVCCKD HCCPQDYP+CD R QCLKR+SNGT T +E +DS +R Sbjct: 390 LCLSYKCCGLDSAVCCKDNSHCCPQDYPICDIRNAQCLKRVSNGTTTMAYENKDSIRRSR 449 Query: 314 GWGS 303 GW S Sbjct: 450 GWSS 453 >XP_015933819.1 PREDICTED: cysteine proteinase COT44 isoform X2 [Arachis duranensis] Length = 452 Score = 602 bits (1553), Expect = 0.0 Identities = 292/422 (69%), Positives = 330/422 (78%), Gaps = 10/422 (2%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS+ DTS+LF +WC HGK YSS+EE+ YR KVF DNYD+V RHNQ NS YTLSLNAF Sbjct: 33 ASSPDTSQLFRSWCDHHGKTYSSDEERSYRLKVFLDNYDYVQRHNQMA-NSPYTLSLNAF 91 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQP-----NDRHL--DVPSEIDWRNTGAVTP 1200 ADLTH E KAS LG SSFLRF QD QP ND ++ VPS IDWRN GAVTP Sbjct: 92 ADLTHQELKASHLGALS-SSFLRFKNRQDHQPRYHNDNDNNILRQVPSSIDWRNEGAVTP 150 Query: 1199 VKDQGSCGACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQF 1020 VK+QGSCGACW+F+ TGAIEGINKIVTG+L SLSEQELVDCD YNSGCEGGLMDYAYQF Sbjct: 151 VKNQGSCGACWAFSATGAIEGINKIVTGTLESLSEQELVDCDKKYNSGCEGGLMDYAYQF 210 Query: 1019 VIDNNGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVG 840 VIDN+GIDTE DYP+ CNK+K++RRVVTIDGY DV ++EK+LL+AVA QPVSVG Sbjct: 211 VIDNHGIDTESDYPFLAHDAACNKNKMKRRVVTIDGYTDVLPSNEKKLLEAVATQPVSVG 270 Query: 839 
ICGSERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHM 660 ICGS RAFQLYS+GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++HM Sbjct: 271 ICGSARAFQLYSQGIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYIHM 330 Query: 659 LRNTGDSEGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGIC 480 +RN G S+G+CGINMLASY TKCNLFTYC ETCCC+ +LG+C Sbjct: 331 VRNNG-SQGICGINMLASYPTKTTPNPPPPPPPGPTKCNLFTYCPAAETCCCSWRVLGLC 389 Query: 479 FKWKCCGLTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TRGW 309 +KCCGL SAVCCKD HCCPQDYP+CD R QCLKR+SNGT T +E +DS +RGW Sbjct: 390 LSYKCCGLDSAVCCKDNSHCCPQDYPICDIRNAQCLKRVSNGTTTMAYENKDSIRRSRGW 449 Query: 308 GS 303 S Sbjct: 450 SS 451 >XP_017612038.1 PREDICTED: zingipain-2 [Gossypium arboreum] Length = 431 Score = 601 bits (1549), Expect = 0.0 Identities = 280/396 (70%), Positives = 319/396 (80%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS S+ FE WC++HGK+Y SEEEK YR KVFEDNY FV +HN NSSY+L+LNAF Sbjct: 23 ASPSHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMV-NSSYSLALNAF 81 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 AD THHEFKASRLGL + ++F R ++P D+P +DWR GAVT VKDQGSC Sbjct: 82 ADFTHHEFKASRLGLS--GAAIQFRRPNLREPRLVR-DIPDSLDWREKGAVTQVKDQGSC 138 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI Sbjct: 139 GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 198 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 DTEEDYPYQGR+ CNK+KL+R VVTID Y DVP N+EK+LL+AVA QPVSVGICGSERA Sbjct: 199 DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSERA 258 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG+ WGMNG++HM+RN+G S Sbjct: 259 FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGKS 318 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 EG+CGINMLASY TKC+ FTYCS GETCCC + GICF WKCCG Sbjct: 319 
EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKCCG 378 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGT 351 L SAVCCKD RHCCP +YP+CDT+ QCLKR+ N T Sbjct: 379 LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 414 >XP_012444786.1 PREDICTED: zingipain-2 [Gossypium raimondii] KJB58175.1 hypothetical protein B456_009G197900 [Gossypium raimondii] Length = 431 Score = 600 bits (1547), Expect = 0.0 Identities = 279/396 (70%), Positives = 317/396 (80%) Frame = -3 Query: 1538 ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 1359 AS S++FE WC +HGK+YSSEEEK YR KVFEDNY FV +HN NSSY+L+LNAF Sbjct: 23 ASPSHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAM-TNSSYSLALNAF 81 Query: 1358 ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 1179 ADLTHHEFKASRLGL + R + ++ + D+P+ +DWR GAVT VKDQGSC Sbjct: 82 ADLTHHEFKASRLGLSGAAIQFRCSNLREPR---LVRDIPASLDWREKGAVTQVKDQGSC 138 Query: 1178 GACWSFATTGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 999 GACWSF+ TGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI Sbjct: 139 GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 198 Query: 998 DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 819 DTEEDYPYQGR+ CNK+KL+R VVTID Y DVP +EK+LL+AVA QPVSVGICGSERA Sbjct: 199 DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSERA 258 Query: 818 FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 639 FQLY KGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG WGMNG++HM+RNTG S Sbjct: 259 FQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGKS 318 Query: 638 EGLCGINMLASYXXXXXXXXXXXXXXXXTKCNLFTYCSGGETCCCARSLLGICFKWKCCG 459 EG+CGINMLASY TKC+ FTYCS GETCCC + GICF WKCCG Sbjct: 319 EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCCG 378 Query: 458 LTSAVCCKDKRHCCPQDYPVCDTRRGQCLKRISNGT 351 L SAVCCKD RHCCP +YP+CDT+ QCLKR+ N T Sbjct: 379 LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 414