BLASTX nr result

ID: Glycyrrhiza34_contig00016126 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00016126
         (1695 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_004500967.1 PREDICTED: zingipain-2 [Cicer arietinum]               682   0.0  
XP_013462179.1 papain family cysteine protease [Medicago truncat...   657   0.0  
KHN45886.1 Oryzain alpha chain [Glycine soja]                         651   0.0  
XP_003523725.1 PREDICTED: low-temperature-induced cysteine prote...   647   0.0  
XP_019438032.1 PREDICTED: low-temperature-induced cysteine prote...   640   0.0  
KYP64636.1 Oryzain alpha chain [Cajanus cajan]                        635   0.0  
XP_007136041.1 hypothetical protein PHAVU_009G013000g [Phaseolus...   634   0.0  
XP_014501447.1 PREDICTED: zingipain-2 [Vigna radiata var. radiata]    630   0.0  
XP_017437409.1 PREDICTED: zingipain-2 [Vigna angularis] BAT77731...   629   0.0  
AGV54418.1 oryzain alpha chain-like protein [Phaseolus vulgaris]      627   0.0  
XP_012071947.1 PREDICTED: low-temperature-induced cysteine prote...   609   0.0  
EOY14881.1 JHL18I08.3 protein [Theobroma cacao]                       608   0.0  
XP_007017656.2 PREDICTED: zingipain-2 [Theobroma cacao]               607   0.0  
XP_002510459.2 PREDICTED: low-temperature-induced cysteine prote...   604   0.0  
XP_016687622.1 PREDICTED: low-temperature-induced cysteine prote...   605   0.0  
XP_018821065.1 PREDICTED: low-temperature-induced cysteine prote...   602   0.0  
XP_016167594.1 PREDICTED: cysteine proteinase COT44 isoform X2 [...   602   0.0  
XP_015933819.1 PREDICTED: cysteine proteinase COT44 isoform X2 [...   600   0.0  
XP_017612038.1 PREDICTED: zingipain-2 [Gossypium arboreum]            599   0.0  
XP_012444786.1 PREDICTED: zingipain-2 [Gossypium raimondii] KJB5...   598   0.0  

>XP_004500967.1 PREDICTED: zingipain-2 [Cicer arietinum]
          Length = 436

 Score =  682 bits (1759), Expect = 0.0
 Identities = 329/426 (77%), Positives = 352/426 (82%), Gaps = 2/426 (0%)
 Frame = +1

Query: 202  STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFA 381
            +  DTS+LF+ WCK+HGK Y SE+EKRYR  VFEDNY FVA+HNQ G NSSYTLSLNAFA
Sbjct: 22   TAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIG-NSSYTLSLNAFA 80

Query: 382  DLTHHEFKASRLGLPPHSSFLRF--NRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGS 555
            DLTHHEFKA+RLGLPP SS LRF  NRFQDQQ +D  L VPSEIDWR  GAV+ VKDQGS
Sbjct: 81   DLTHHEFKATRLGLPP-SSLLRFKFNRFQDQQRSDDFLQVPSEIDWRKNGAVSIVKDQGS 139

Query: 556  CGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNG 735
            CGACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCDTTYNSGC+GGLMDYAYQF+IDNNG
Sbjct: 140  CGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNG 199

Query: 736  IDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSER 915
            IDTEEDYPYQ RQ LC KDKL+RRVVTIDGY DVP NDEK+LLKAVA QPVSVGICGS R
Sbjct: 200  IDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSAR 259

Query: 916  AFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGD 1095
            AFQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HMLRNT  
Sbjct: 260  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDS 319

Query: 1096 SEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCC 1275
            S GLCGINMLASY                 KCNLFTYCSGGETCCCA+  LGICF WKCC
Sbjct: 320  SAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCC 379

Query: 1276 GLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGSQ*AFHQT 1455
            G+TSAVCCKDKRHCC  DYPVCD   GQCLKRI+NGT   T +KED          FHQT
Sbjct: 380  GVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTILMTSDKED---------PFHQT 430

Query: 1456 KGWGSH 1473
            + W S+
Sbjct: 431  RDWRSN 436


>XP_013462179.1 papain family cysteine protease [Medicago truncatula] KEH36214.1
            papain family cysteine protease [Medicago truncatula]
          Length = 443

 Score =  657 bits (1694), Expect = 0.0
 Identities = 316/424 (74%), Positives = 346/424 (81%), Gaps = 3/424 (0%)
 Frame = +1

Query: 211  DTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLT 390
            DTS+LF+ W K+HGK Y SEEEKRYR KVF+DNY FV++HN+ G NSSYTLSLNAFADLT
Sbjct: 31   DTSKLFQEWSKQHGKTYPSEEEKRYRFKVFQDNYAFVSQHNEMG-NSSYTLSLNAFADLT 89

Query: 391  HHEFKASRLGLPPHSSFLRF--NRFQDQQPNDRH-LDVPSEIDWRNTGAVTPVKDQGSCG 561
            HHEFK +RLG  P SS LRF  N F+DQQ +D   L VPSEIDWR + AVTPVKDQGSCG
Sbjct: 90   HHEFKTTRLGFSP-SSLLRFKFNHFEDQQFDDNGILQVPSEIDWRKSDAVTPVKDQGSCG 148

Query: 562  ACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGID 741
            ACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCD TYNSGC+GGLMDYAYQF+IDN GID
Sbjct: 149  ACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRTYNSGCDGGLMDYAYQFIIDNKGID 208

Query: 742  TEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAF 921
            TEEDYPYQ RQ LC KDKL+RRVVTIDGY DVP NDEK+LLKAVA QPVSVGICGS RAF
Sbjct: 209  TEEDYPYQSRQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAF 268

Query: 922  QLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSE 1101
            QLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK+WGMNG++HMLRNT +S 
Sbjct: 269  QLYSKGIFTGPCSTYLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMNGYIHMLRNTDNSA 328

Query: 1102 GLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCGL 1281
            GLCGINMLASY                 +CNLFTYCS GETCCCA+  LGICF WKCCG 
Sbjct: 329  GLCGINMLASYPTKTSPNPPVPPPPGPIRCNLFTYCSRGETCCCAKKFLGICFSWKCCGK 388

Query: 1282 TSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGSQ*AFHQTKG 1461
            TSAVCCKD+RHCC  DYP+CD  R QCLKRI+NGT T   +K+D+         FHQT+ 
Sbjct: 389  TSAVCCKDERHCCPLDYPICDIGRSQCLKRIANGTTTMPSDKQDT---------FHQTRD 439

Query: 1462 WGSH 1473
            W SH
Sbjct: 440  WSSH 443


>KHN45886.1 Oryzain alpha chain [Glycine soja]
          Length = 439

 Score =  651 bits (1680), Expect = 0.0
 Identities = 311/419 (74%), Positives = 346/419 (82%), Gaps = 7/419 (1%)
 Frame = +1

Query: 202  STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNS----SYTLSL 369
            S  DTSELFE WCKEH K YSSEEEK YR KVFEDNY FVA+HNQ   N+    SYTLSL
Sbjct: 25   SASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNPSYTLSL 84

Query: 370  NAFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQ 549
            NAFADLTHHEFK +RLGLPP  + LRF R Q+QQ  D  L +PS+IDWR +GAVTPVKDQ
Sbjct: 85   NAFADLTHHEFKTTRLGLPP--TLLRFKRPQNQQSRDL-LHIPSQIDWRQSGAVTPVKDQ 141

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             SCGACW+F+ATGAIEGINKIVTGSLLSLSEQEL+DCDT+YNSGC GGLMD+AYQFVIDN
Sbjct: 142  ASCGACWAFSATGAIEGINKIVTGSLLSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDN 201

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTEEDYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS
Sbjct: 202  KGIDTEEDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 260

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            ER FQLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+
Sbjct: 261  EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            G+S+G+CGIN LASY                 +CNLFT+CS GETCCCA+S LGICF WK
Sbjct: 321  GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGSQ 1437
            CCGLTSAVCCKDKRHCC QDYP+CDTRRGQCLKR +NGT T T E +D    +RGW SQ
Sbjct: 381  CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSHKSRGWKSQ 439


>XP_003523725.1 PREDICTED: low-temperature-induced cysteine proteinase [Glycine max]
            KRH61098.1 hypothetical protein GLYMA_04G028300 [Glycine
            max]
          Length = 439

 Score =  647 bits (1668), Expect = 0.0
 Identities = 309/419 (73%), Positives = 345/419 (82%), Gaps = 7/419 (1%)
 Frame = +1

Query: 202  STFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRN----SSYTLSL 369
            S  DTSELFE WCKEH K YSSEEEK YR KVFEDNY FVA+HNQ   N    SSYTLSL
Sbjct: 25   SASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSL 84

Query: 370  NAFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQ 549
            NAFADLTHHEFK +RLGLP   + LRF R Q+QQ  D  L +PS+IDWR +GAVTPVKDQ
Sbjct: 85   NAFADLTHHEFKTTRLGLP--LTLLRFKRPQNQQSRDL-LHIPSQIDWRQSGAVTPVKDQ 141

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             SCGACW+F+ATGAIEGINKIVTGSL+SLSEQEL+DCDT+YNSGC GGLMD+AYQFVIDN
Sbjct: 142  ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDN 201

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTE+DYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS
Sbjct: 202  KGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 260

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            ER FQLYSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+
Sbjct: 261  EREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNS 320

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            G+S+G+CGIN LASY                 +CNLFT+CS GETCCCA+S LGICF WK
Sbjct: 321  GNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFLGICFSWK 380

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGSQ 1437
            CCGLTSAVCCKDKRHCC QDYP+CDTRRGQCLKR +NGT T T E +D    +RGW SQ
Sbjct: 381  CCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFSHKSRGWKSQ 439


>XP_019438032.1 PREDICTED: low-temperature-induced cysteine proteinase [Lupinus
            angustifolius] XP_019438033.1 PREDICTED:
            low-temperature-induced cysteine proteinase [Lupinus
            angustifolius] OIW14825.1 hypothetical protein
            TanjilG_17050 [Lupinus angustifolius]
          Length = 439

 Score =  640 bits (1652), Expect = 0.0
 Identities = 302/413 (73%), Positives = 333/413 (80%), Gaps = 4/413 (0%)
 Frame = +1

Query: 211  DTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLT 390
            +T  LFE WCK+H K YSSE+EK Y+ KVFEDNY FV +H  +  NSSYTLSLNAFADLT
Sbjct: 29   NTYNLFETWCKQHNKTYSSEQEKLYKFKVFEDNYAFVTQHKNKVGNSSYTLSLNAFADLT 88

Query: 391  HHEFKASRL-GLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGAC 567
            HHEFK SR+ GL P     RFN  Q+QQ  +R L VPSE DWR  GAVTPVKDQGSCGAC
Sbjct: 89   HHEFKTSRIRGLSPR--LFRFNHSQNQQSGNRVLHVPSEFDWRKNGAVTPVKDQGSCGAC 146

Query: 568  WSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTE 747
            WSF+ATGAIEGINKIVTGSL+SLSEQELVDCD  YNSGCEGGLMDYAYQFVIDN+GIDTE
Sbjct: 147  WSFSATGAIEGINKIVTGSLVSLSEQELVDCDRNYNSGCEGGLMDYAYQFVIDNHGIDTE 206

Query: 748  EDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQL 927
            +DYPYQ   R C+KDKL+RRVVTIDGY DVP+ DEK+LL+AV  QPVSVGICGS+RAFQL
Sbjct: 207  KDYPYQAHDRTCSKDKLKRRVVTIDGYTDVPQGDEKKLLEAVVSQPVSVGICGSDRAFQL 266

Query: 928  YSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGL 1107
            YSKGIF GPCST LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++HM+RN+G+SEGL
Sbjct: 267  YSKGIFTGPCSTYLDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYIHMVRNSGNSEGL 326

Query: 1108 CGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTS 1287
            CGIN LASY                 +C+LFTYCS GETCCCA+S LGIC  WKCCG+ S
Sbjct: 327  CGINTLASYPIKTKPNPPTPPPPGPTRCSLFTYCSEGETCCCAKSFLGICLSWKCCGVNS 386

Query: 1288 AVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TRGWGSQ 1437
            AVCCKDKRHCC  DYPVCDT RGQCLKR++N T T  FE E S    RGW SQ
Sbjct: 387  AVCCKDKRHCCPHDYPVCDTARGQCLKRVANATITKAFENEGSFGTPRGWNSQ 439


>KYP64636.1 Oryzain alpha chain [Cajanus cajan]
          Length = 466

 Score =  635 bits (1638), Expect = 0.0
 Identities = 297/384 (77%), Positives = 327/384 (85%)
 Frame = +1

Query: 217  SELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLTHH 396
            S++FE WCKEHGK YSSEE+KRYR KVFEDNY FVA HN+    S+YTLSLNAFADLTHH
Sbjct: 72   SQVFERWCKEHGKTYSSEEQKRYRLKVFEDNYAFVAEHNRNANTSTYTLSLNAFADLTHH 131

Query: 397  EFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGACWSF 576
            EFK SRLGL    S LRF R ++QQP    L VPSEIDWR +GAVTPVKDQGSCGACW+F
Sbjct: 132  EFKTSRLGLALPPSLLRFKRPRNQQP-PHLLQVPSEIDWRKSGAVTPVKDQGSCGACWAF 190

Query: 577  AATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTEEDY 756
            +ATGAIEGINKIVTGSL SLSEQEL+DCD +YNSGCEGGLMDYAYQF+IDN GIDTE+DY
Sbjct: 191  SATGAIEGINKIVTGSLESLSEQELIDCDRSYNSGCEGGLMDYAYQFIIDNGGIDTEDDY 250

Query: 757  PYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQLYSK 936
            PYQ R+R CNKDKLRRRVVTID Y+DVP N+E+ LLKAVA QPVSVGICGSERAFQLYS+
Sbjct: 251  PYQVRERTCNKDKLRRRVVTIDDYVDVPLNEEE-LLKAVATQPVSVGICGSERAFQLYSE 309

Query: 937  GIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGLCGI 1116
            GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWGK WGMNG++HM+RN+GDS+G+CGI
Sbjct: 310  GIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGDSKGICGI 369

Query: 1117 NMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTSAVC 1296
            N LASY                 +CNLFT+CS GETCCCARS LGICF WKCCGLTSAVC
Sbjct: 370  NTLASYPIKTKPNPPIPPPPGPVRCNLFTHCSQGETCCCARSFLGICFSWKCCGLTSAVC 429

Query: 1297 CKDKRHCCSQDYPVCDTRRGQCLK 1368
            CKDKRHCC QDYP+CDT +GQCLK
Sbjct: 430  CKDKRHCCPQDYPICDTGKGQCLK 453


>XP_007136041.1 hypothetical protein PHAVU_009G013000g [Phaseolus vulgaris]
            ESW08035.1 hypothetical protein PHAVU_009G013000g
            [Phaseolus vulgaris]
          Length = 428

 Score =  634 bits (1634), Expect = 0.0
 Identities = 309/418 (73%), Positives = 342/418 (81%), Gaps = 6/418 (1%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 372
            AS  DTS+LFE WCKEH K YSSEEEKRYR  VFEDNY FV++HN+     NS+YTLSLN
Sbjct: 18   ASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLN 77

Query: 373  AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 549
            AFADLTHHEFK SRLG  P  S LRF R Q+QQP  RHL   PS+IDWR +GAVTPVKDQ
Sbjct: 78   AFADLTHHEFKTSRLGFSP--SLLRFKRVQNQQP--RHLLHNPSQIDWRQSGAVTPVKDQ 133

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             SCGACW+F+ATGAIEGINKIVTGSL SLSEQELVDCDT+YNSGCEGGLMDYAYQFVIDN
Sbjct: 134  ASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDYAYQFVIDN 193

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTE+DYPYQ RQR CNKDKL+R +VTID Y+D+P N+E+ LLKAVA QPVSVGICGS
Sbjct: 194  KGIDTEDDYPYQARQRPCNKDKLKRHIVTIDDYVDLPPNEEE-LLKAVASQPVSVGICGS 252

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            ERAFQLYS+GIF+GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WGM G++HM+RNT
Sbjct: 253  ERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMEGYIHMIRNT 312

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            GD +G+CGIN LASY                 +CNLFT+CS GETCCCA+S LGICF WK
Sbjct: 313  GDPKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 370

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 1434
            CCGLTSAVCCKDKRHCC +DYP+CDT + QCLK I+NGT T T   +D     RGW S
Sbjct: 371  CCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLK-ITNGTTTITSGNKDISNKPRGWKS 427


>XP_014501447.1 PREDICTED: zingipain-2 [Vigna radiata var. radiata]
          Length = 428

 Score =  630 bits (1626), Expect = 0.0
 Identities = 305/418 (72%), Positives = 344/418 (82%), Gaps = 6/418 (1%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 372
            A   DTS+LFE WCKEH K YSSEEEKRYR +VFEDNY FV++HNQ     NS+YTLSLN
Sbjct: 18   APASDTSDLFERWCKEHAKTYSSEEEKRYRFRVFEDNYAFVSQHNQNANANNSTYTLSLN 77

Query: 373  AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 549
            AFADLTHHEFK SRLG  P  S  RF R Q+QQP  RHL  +PSEIDWR +GAVTPVKDQ
Sbjct: 78   AFADLTHHEFKTSRLGFSP--SLHRFKRVQNQQP--RHLLHLPSEIDWRQSGAVTPVKDQ 133

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             +CGACWSF+ATGAIEGINKIVTGSL S+SEQELVDCDT+YNSGCEGGLMDYAYQFVIDN
Sbjct: 134  STCGACWSFSATGAIEGINKIVTGSLESISEQELVDCDTSYNSGCEGGLMDYAYQFVIDN 193

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTE+DYPYQ RQR CNKDKL+RR+VTID Y D+P N+E+ LLKAVA QPVSVGICGS
Sbjct: 194  KGIDTEDDYPYQARQRSCNKDKLKRRIVTIDDYADLPPNEEE-LLKAVASQPVSVGICGS 252

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            ERAFQLYS+GIF+GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG+ WGM+G++HM+RN+
Sbjct: 253  ERAFQLYSQGIFSGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGRYWGMDGYIHMIRNS 312

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            GDS+G+CGIN LASY                 +CNLFT+CS GETCCCA+S LGICF WK
Sbjct: 313  GDSKGICGINTLASY--PIKTTPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 370

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 1434
            CCGLT+AVCCKD+RHCC  DYP+CDT++ QCLK I+NGT T T   +D     RGW S
Sbjct: 371  CCGLTTAVCCKDRRHCCPLDYPICDTKKSQCLK-ITNGTTTITTGNQDFSNKPRGWKS 427


>XP_017437409.1 PREDICTED: zingipain-2 [Vigna angularis] BAT77731.1 hypothetical
            protein VIGAN_02032600 [Vigna angularis var. angularis]
          Length = 428

 Score =  629 bits (1623), Expect = 0.0
 Identities = 303/418 (72%), Positives = 344/418 (82%), Gaps = 6/418 (1%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 372
            AS  +TS+LFE WCKEH K YSSEEEKRYR +VFEDNY FV++HNQ     NS+YTLSLN
Sbjct: 18   ASASNTSDLFERWCKEHAKTYSSEEEKRYRFRVFEDNYAFVSQHNQNANVNNSTYTLSLN 77

Query: 373  AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 549
            AFADLTHHEFK SRLG  P  S  RF R Q+QQP  RHL  +PSEIDWR +GAVTPVKDQ
Sbjct: 78   AFADLTHHEFKTSRLGFSP--SLFRFKRVQNQQP--RHLLHLPSEIDWRQSGAVTPVKDQ 133

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             SCGACW+F+ATGAIEGINKIVTGSL S+SEQELVDCDT+YNSGCEGGLMDYAYQF+IDN
Sbjct: 134  ASCGACWAFSATGAIEGINKIVTGSLESISEQELVDCDTSYNSGCEGGLMDYAYQFIIDN 193

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTE+DYPYQ RQR CNKDKL+RR+VTID Y D+P N+E+ LLKAVA QPVSVGICGS
Sbjct: 194  KGIDTEDDYPYQARQRSCNKDKLKRRIVTIDDYADLPPNEEE-LLKAVASQPVSVGICGS 252

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            +RAFQLYS+GIF+GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG+ WGMNG++HM+RN+
Sbjct: 253  DRAFQLYSQGIFSGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGRYWGMNGYIHMIRNS 312

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            GDS+G+CGIN LASY                 +CNLFT+CS GETCCCA+S LG+CF WK
Sbjct: 313  GDSKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGLCFSWK 370

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 1434
            CCGLTSAVCCKD+RHCC  DYP+CDT++ QCLK I+N T T T   +D     RGW S
Sbjct: 371  CCGLTSAVCCKDRRHCCPLDYPICDTKKSQCLK-ITNETTTITTGNQDFSNKPRGWKS 427


>AGV54418.1 oryzain alpha chain-like protein [Phaseolus vulgaris]
          Length = 467

 Score =  627 bits (1617), Expect = 0.0
 Identities = 305/418 (72%), Positives = 343/418 (82%), Gaps = 6/418 (1%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGR--NSSYTLSLN 372
            AS  DTS+LFE WCKEH K YSSEEEKRYR  VFEDNY FV++HN+     NS+YTLSLN
Sbjct: 57   ASASDTSDLFERWCKEHAKTYSSEEEKRYRFHVFEDNYAFVSQHNRNANDNNSTYTLSLN 116

Query: 373  AFADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHL-DVPSEIDWRNTGAVTPVKDQ 549
            AFADLTHHEFK SRLG  P  S LRF R Q+QQP  RHL   PS+IDWR +GAVTPVKDQ
Sbjct: 117  AFADLTHHEFKTSRLGFSP--SLLRFKRVQNQQP--RHLLHNPSQIDWRQSGAVTPVKDQ 172

Query: 550  GSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDN 729
             SCGACW+F+ATGAIEGINKIVTGSL SLSEQELVDCDT+YNSGCEGGLMD+AYQFVIDN
Sbjct: 173  ASCGACWAFSATGAIEGINKIVTGSLESLSEQELVDCDTSYNSGCEGGLMDFAYQFVIDN 232

Query: 730  NGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGS 909
             GIDTE+DYPYQ RQR C+KDKL+RR VTI+ Y+DVP ++E+ +LKAVA QPVSVGICGS
Sbjct: 233  KGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGS 291

Query: 910  ERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNT 1089
            ERAFQLYS+GIF+GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGK WG++G++HM+RNT
Sbjct: 292  ERAFQLYSQGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGIDGYIHMIRNT 351

Query: 1090 GDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWK 1269
            GD +G+CGIN LASY                 +CNLFT+CS GETCCCA+S LGICF WK
Sbjct: 352  GDPKGICGINTLASY--PIKTKPNPPPPPAPVRCNLFTHCSEGETCCCAKSFLGICFSWK 409

Query: 1270 CCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKED---STRGWGS 1434
            CCGLTSAVCCKDKRHCC +DYP+CDT + QCLK I+NGT T T   +D     RGW S
Sbjct: 410  CCGLTSAVCCKDKRHCCPRDYPICDTEKSQCLK-ITNGTTTITSGNKDISNKPRGWKS 466


>XP_012071947.1 PREDICTED: low-temperature-induced cysteine proteinase [Jatropha
            curcas] BAJ53169.1 JHL18I08.3 [Jatropha curcas]
            KDP38570.1 hypothetical protein JCGZ_04495 [Jatropha
            curcas]
          Length = 441

 Score =  609 bits (1570), Expect = 0.0
 Identities = 289/416 (69%), Positives = 328/416 (78%), Gaps = 4/416 (0%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            +S+ + + LFE WC++HGK Y+S+EEK +R KVF+DNYDFV  HN +G NSSYTLSLNAF
Sbjct: 21   SSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQG-NSSYTLSLNAF 79

Query: 379  ADLTHHEFKASRLGLPPHSSF-LRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGS 555
            ADLTHHEFKASRLGL   +S  L  +R   Q P D   DVP+ +DWR  GAVT VKDQG+
Sbjct: 80   ADLTHHEFKASRLGLSSAASASLNVDRSNRQIP-DFVADVPASVDWRKNGAVTQVKDQGN 138

Query: 556  CGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNG 735
            CGACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCD +YN+GCEGG+MDYA+QFVIDN+G
Sbjct: 139  CGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHG 198

Query: 736  IDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSER 915
            IDTEEDYPYQGR R CNK+KL+R VVTIDGY+DVP+N+EK LLKAVA+QPVSVGICGSER
Sbjct: 199  IDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSER 258

Query: 916  AFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGD 1095
            AFQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG  WGM+G+MHM RN+G 
Sbjct: 259  AFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGS 318

Query: 1096 SEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCC 1275
            S GLCGINMLASY                 +C+LFT+C  GETCCC   + GIC  WKCC
Sbjct: 319  SRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCC 378

Query: 1276 GLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDST---RGWGS 1434
             L SAVCCKD RHCC +DYPVCDT R  CLK   N T+   F K  S+   R W S
Sbjct: 379  ELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSSGKFRSWSS 434


>EOY14881.1 JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  608 bits (1569), Expect = 0.0
 Identities = 291/412 (70%), Positives = 326/412 (79%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS    S LFE WC +HGK YSSEEEK YR KVFE+NY FV +HN  G NSSY+L+LNAF
Sbjct: 21   ASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVG-NSSYSLALNAF 79

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            ADLTHHEFKASRLGL   ++ +  +R   Q P     D+P+ +DWR  GAVT VKDQGSC
Sbjct: 80   ADLTHHEFKASRLGLS--AAAIEGSRPNLQLPGLVR-DIPASMDWRTKGAVTKVKDQGSC 136

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEGINKIVTG+L+SLSEQELVDCD +YNSGCEGGLMDYAYQFVIDN+GI
Sbjct: 137  GACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGI 196

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            D EEDYPY GR++ CNK+K +RRVVTIDGY  VP N+E  LL+AVA QPVSVGICGSERA
Sbjct: 197  DNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERA 256

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLYSKGIF GPCS+SLDHAVLIVGYGSENGVDYWIVKNSWG  WGMNG++HMLRN+GDS
Sbjct: 257  FQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDS 316

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            +GLCGINMLASY                 KC+LFTYCS GETCCC   + GICF WKCC 
Sbjct: 317  KGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCE 376

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 1434
            L SAVCCKD RHCC  DYPVCDT++ QCLKR+ N T+   FEK  STR + S
Sbjct: 377  LDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKRHSTRKFSS 428


>XP_007017656.2 PREDICTED: zingipain-2 [Theobroma cacao]
          Length = 438

 Score =  607 bits (1565), Expect = 0.0
 Identities = 290/412 (70%), Positives = 326/412 (79%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS    S LFE WC +HGK YSSEEEK YR KVFE+NY FV +HN  G NSSY+L+LNAF
Sbjct: 21   ASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVG-NSSYSLALNAF 79

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            ADLTHHEFKASRLGL   ++ +  +R   Q P     D+P+ +DWR  GAVT VKDQGSC
Sbjct: 80   ADLTHHEFKASRLGLS--AAAIDGSRPNLQLPGLVR-DIPASMDWRTKGAVTKVKDQGSC 136

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEGINKIVTG+L+SLSEQELVDCD +YNSGCEGGLMDYAYQFVIDN+GI
Sbjct: 137  GACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGI 196

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            D EEDYPY GR++ CNK+K +RRVVTIDGY  VP N+E  LL+AVA QPVSVGICGSERA
Sbjct: 197  DNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERA 256

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLYSKGIF GPCS+SLDHAVLIVGYGSENGVDYWIVKNSWG  WGMNG++H+LRN+GDS
Sbjct: 257  FQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHLLRNSGDS 316

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            +GLCGINMLASY                 KC+LFTYCS GETCCC   + GICF WKCC 
Sbjct: 317  KGLCGINMLASYPMKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCE 376

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 1434
            L SAVCCKD RHCC  DYPVCDT++ QCLKR+ N T+   FEK  STR + S
Sbjct: 377  LDSAVCCKDNRHCCPNDYPVCDTKKSQCLKRVGNATRMEAFEKRHSTRKFSS 428


>XP_002510459.2 PREDICTED: low-temperature-induced cysteine proteinase [Ricinus
            communis]
          Length = 466

 Score =  604 bits (1557), Expect = 0.0
 Identities = 286/412 (69%), Positives = 333/412 (80%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            +S+ D S+LFE+W KEHGK Y+S+E+K YR K+FE+NY+FV +HN +G NSSYTLSLNAF
Sbjct: 23   SSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQG-NSSYTLSLNAF 81

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            ADLTHHEFKASRLGL   S+  + +R ++   +D   DVP  IDWR  GAV+ VKDQG+C
Sbjct: 82   ADLTHHEFKASRLGLSAFSTSGKLSR-RNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNC 140

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEGINKIVTGSL+SLSEQELVDCD +YN+GCEGGLMDYAYQFVI+NNGI
Sbjct: 141  GACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGI 200

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            DTEEDYPYQ R++ CNK+KL+R VVTIDGY DVP+N+EK LLKAVA QPVSVGICGSERA
Sbjct: 201  DTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERA 260

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG +WG+NG+M+MLRN+G+S
Sbjct: 261  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNS 320

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            +GLCGINMLAS+                 KC+LFT C  GETCCC R + G+CF WKCC 
Sbjct: 321  QGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCE 380

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 1434
            L SAVCCKD  HCC  DYPVCDT+R  CLK   N T+  T  K+ S+  +GS
Sbjct: 381  LDSAVCCKDGLHCCPHDYPVCDTKRNMCLKFPGNATRMETVAKKSSSGMFGS 432


>XP_016687622.1 PREDICTED: low-temperature-induced cysteine proteinase [Gossypium
            hirsutum]
          Length = 489

 Score =  605 bits (1559), Expect = 0.0
 Identities = 282/396 (71%), Positives = 320/396 (80%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS    S++FE WC +HGK+YSSEEEK YR KVFEDNY FV +HN    NSSY+L+LNAF
Sbjct: 81   ASPSHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAM-TNSSYSLALNAF 139

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            ADLTHHEFKASRLGL    + ++F R   ++P     D+P+ +DWR  GAVT VKDQGSC
Sbjct: 140  ADLTHHEFKASRLGLS--GAAIQFRRSTLREPRLVR-DIPASLDWREKGAVTQVKDQGSC 196

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI
Sbjct: 197  GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 256

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            DTEEDYPYQGR+  CNK+KL+R VVTID Y DVP N+EK+LL+AVA QPVSVGICGSERA
Sbjct: 257  DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSERA 316

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG  WGMNG++HM+RNTG S
Sbjct: 317  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGKS 376

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            EG+CGINMLASY                 KC+ FTYCS GETCCC   + GICF WKCCG
Sbjct: 377  EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCCG 436

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGT 1386
            L SAVCCKD RHCC  +YP+CDT+  QCLKR+ N T
Sbjct: 437  LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 472


>XP_018821065.1 PREDICTED: low-temperature-induced cysteine proteinase [Juglans
            regia]
          Length = 442

 Score =  602 bits (1552), Expect = 0.0
 Identities = 284/406 (69%), Positives = 325/406 (80%)
 Frame = +1

Query: 217  SELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAFADLTHH 396
            S++FE WCK+HG+ YSSE EK YR +VF+DN+DFV ++N  G NSSYTLSLNAFADLTHH
Sbjct: 29   SKVFEAWCKQHGRTYSSEAEKLYRFRVFQDNFDFVTQYNDMG-NSSYTLSLNAFADLTHH 87

Query: 397  EFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSCGACWSF 576
            EFKASRLG  P    L   R   + P     ++PSE+DWR  GAVT VKDQGSCGACWSF
Sbjct: 88   EFKASRLGFSPAGMSLNRQR-PFRGPGSVVREIPSEMDWRKKGAVTHVKDQGSCGACWSF 146

Query: 577  AATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGIDTEEDY 756
            +ATGAIEGINKIVTGSL+SLSEQELVDCD +++SGCEGGLMDYAYQF+IDN+GIDTE+DY
Sbjct: 147  SATGAIEGINKIVTGSLVSLSEQELVDCDRSFDSGCEGGLMDYAYQFIIDNHGIDTEDDY 206

Query: 757  PYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERAFQLYSK 936
            PYQGR+R C K+KL+R VVTIDGY DV  N+EK+LL+AVA QPVSVGICGSERAFQLYSK
Sbjct: 207  PYQGRERSCIKEKLKRHVVTIDGYTDVQTNNEKQLLQAVATQPVSVGICGSERAFQLYSK 266

Query: 937  GIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDSEGLCGI 1116
            GIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG  WGM+G++HMLRN+G+S+GLCGI
Sbjct: 267  GIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMDGYVHMLRNSGNSQGLCGI 326

Query: 1117 NMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCGLTSAVC 1296
            NMLASY                 +C++FTYC  GETCCCAR LLGIC  WKCC L SAVC
Sbjct: 327  NMLASYPTKTRPNPPPPPPPGPTRCDIFTYCGEGETCCCARHLLGICISWKCCELNSAVC 386

Query: 1297 CKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDSTRGWGS 1434
            CKD  HCC  DYP+CDT+R QCLKR  NGT+    E+  S+   GS
Sbjct: 387  CKDHLHCCPLDYPICDTKRIQCLKRAGNGTRMEALERRSSSGKSGS 432


>XP_016167594.1 PREDICTED: cysteine proteinase COT44 isoform X2 [Arachis ipaensis]
          Length = 454

 Score =  602 bits (1552), Expect = 0.0
 Identities = 291/424 (68%), Positives = 330/424 (77%), Gaps = 12/424 (2%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS+ DTS+LF +WC  HGK YSS+EE+ YR KVF+DNYD+V RHNQ   NS YTLSLNAF
Sbjct: 33   ASSSDTSQLFRSWCDHHGKIYSSDEERSYRLKVFQDNYDYVQRHNQMA-NSPYTLSLNAF 91

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQP-------NDRHL--DVPSEIDWRNTGAV 531
            ADLTH EFKAS LG    SSFLRF   QD Q        ND ++   VPS IDWRN GAV
Sbjct: 92   ADLTHQEFKASHLGALS-SSFLRFKNHQDHQSRYHNDNDNDNNILRQVPSSIDWRNEGAV 150

Query: 532  TPVKDQGSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAY 711
            TPVK+QGSCGACW+F+ATGAIEGINKIVTG+L SLSEQELVDCD  YNSGCEGGLMDYAY
Sbjct: 151  TPVKNQGSCGACWAFSATGAIEGINKIVTGTLESLSEQELVDCDKKYNSGCEGGLMDYAY 210

Query: 712  QFVIDNNGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVS 891
            QFVIDN+GIDTE DYP+      CNK+K++RRVVTIDGY DV  ++EK+LL+AVA QPVS
Sbjct: 211  QFVIDNHGIDTESDYPFLAHDAACNKNKMKRRVVTIDGYTDVLPSNEKKLLEAVATQPVS 270

Query: 892  VGICGSERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHM 1071
            VGICGS RAFQLYS+GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++
Sbjct: 271  VGICGSARAFQLYSQGIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYI 330

Query: 1072 HMLRNTGDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLG 1251
            HM+RN G S+G+CGINMLASY                 KCNLFTYC   ETCCC+  +LG
Sbjct: 331  HMVRNNG-SQGICGINMLASYPTKTTPNPPPPPPPGPTKCNLFTYCPAAETCCCSWRVLG 389

Query: 1252 ICFKWKCCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TR 1422
            +C  +KCCGL SAVCCKD  HCC QDYP+CD R  QCLKR+SNGT T  +E +DS   +R
Sbjct: 390  LCLSYKCCGLDSAVCCKDNSHCCPQDYPICDIRNAQCLKRVSNGTTTMAYENKDSIRRSR 449

Query: 1423 GWGS 1434
            GW S
Sbjct: 450  GWSS 453


>XP_015933819.1 PREDICTED: cysteine proteinase COT44 isoform X2 [Arachis duranensis]
          Length = 452

 Score =  600 bits (1548), Expect = 0.0
 Identities = 291/422 (68%), Positives = 329/422 (77%), Gaps = 10/422 (2%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS+ DTS+LF +WC  HGK YSS+EE+ YR KVF DNYD+V RHNQ   NS YTLSLNAF
Sbjct: 33   ASSPDTSQLFRSWCDHHGKTYSSDEERSYRLKVFLDNYDYVQRHNQMA-NSPYTLSLNAF 91

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQP-----NDRHL--DVPSEIDWRNTGAVTP 537
            ADLTH E KAS LG    SSFLRF   QD QP     ND ++   VPS IDWRN GAVTP
Sbjct: 92   ADLTHQELKASHLGALS-SSFLRFKNRQDHQPRYHNDNDNNILRQVPSSIDWRNEGAVTP 150

Query: 538  VKDQGSCGACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQF 717
            VK+QGSCGACW+F+ATGAIEGINKIVTG+L SLSEQELVDCD  YNSGCEGGLMDYAYQF
Sbjct: 151  VKNQGSCGACWAFSATGAIEGINKIVTGTLESLSEQELVDCDKKYNSGCEGGLMDYAYQF 210

Query: 718  VIDNNGIDTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVG 897
            VIDN+GIDTE DYP+      CNK+K++RRVVTIDGY DV  ++EK+LL+AVA QPVSVG
Sbjct: 211  VIDNHGIDTESDYPFLAHDAACNKNKMKRRVVTIDGYTDVLPSNEKKLLEAVATQPVSVG 270

Query: 898  ICGSERAFQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHM 1077
            ICGS RAFQLYS+GIF GPCST+LDHAVLIVGYGSENGVDYWIVKNSWG +WGMNG++HM
Sbjct: 271  ICGSARAFQLYSQGIFTGPCSTALDHAVLIVGYGSENGVDYWIVKNSWGTSWGMNGYIHM 330

Query: 1078 LRNTGDSEGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGIC 1257
            +RN G S+G+CGINMLASY                 KCNLFTYC   ETCCC+  +LG+C
Sbjct: 331  VRNNG-SQGICGINMLASYPTKTTPNPPPPPPPGPTKCNLFTYCPAAETCCCSWRVLGLC 389

Query: 1258 FKWKCCGLTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGTKTNTFEKEDS---TRGW 1428
              +KCCGL SAVCCKD  HCC QDYP+CD R  QCLKR+SNGT T  +E +DS   +RGW
Sbjct: 390  LSYKCCGLDSAVCCKDNSHCCPQDYPICDIRNAQCLKRVSNGTTTMAYENKDSIRRSRGW 449

Query: 1429 GS 1434
             S
Sbjct: 450  SS 451


>XP_017612038.1 PREDICTED: zingipain-2 [Gossypium arboreum]
          Length = 431

 Score =  599 bits (1544), Expect = 0.0
 Identities = 279/396 (70%), Positives = 318/396 (80%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS    S+ FE WC++HGK+Y SEEEK YR KVFEDNY FV +HN    NSSY+L+LNAF
Sbjct: 23   ASPSHISKKFETWCQQHGKSYLSEEEKSYRLKVFEDNYAFVTQHNAMV-NSSYSLALNAF 81

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            AD THHEFKASRLGL    + ++F R   ++P     D+P  +DWR  GAVT VKDQGSC
Sbjct: 82   ADFTHHEFKASRLGLS--GAAIQFRRPNLREPRLVR-DIPDSLDWREKGAVTQVKDQGSC 138

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI
Sbjct: 139  GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 198

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            DTEEDYPYQGR+  CNK+KL+R VVTID Y DVP N+EK+LL+AVA QPVSVGICGSERA
Sbjct: 199  DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMNNEKKLLQAVATQPVSVGICGSERA 258

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLYSKGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG+ WGMNG++HM+RN+G S
Sbjct: 259  FQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGRRWGMNGYIHMIRNSGKS 318

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            EG+CGINMLASY                 KC+ FTYCS GETCCC   + GICF WKCCG
Sbjct: 319  EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFSWKCCG 378

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGT 1386
            L SAVCCKD RHCC  +YP+CDT+  QCLKR+ N T
Sbjct: 379  LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 414


>XP_012444786.1 PREDICTED: zingipain-2 [Gossypium raimondii] KJB58175.1 hypothetical
            protein B456_009G197900 [Gossypium raimondii]
          Length = 431

 Score =  598 bits (1542), Expect = 0.0
 Identities = 278/396 (70%), Positives = 316/396 (79%)
 Frame = +1

Query: 199  ASTFDTSELFENWCKEHGKAYSSEEEKRYRSKVFEDNYDFVARHNQRGRNSSYTLSLNAF 378
            AS    S++FE WC +HGK+YSSEEEK YR KVFEDNY FV +HN    NSSY+L+LNAF
Sbjct: 23   ASPSHISKIFETWCHQHGKSYSSEEEKSYRLKVFEDNYAFVTQHNAM-TNSSYSLALNAF 81

Query: 379  ADLTHHEFKASRLGLPPHSSFLRFNRFQDQQPNDRHLDVPSEIDWRNTGAVTPVKDQGSC 558
            ADLTHHEFKASRLGL   +   R +  ++ +      D+P+ +DWR  GAVT VKDQGSC
Sbjct: 82   ADLTHHEFKASRLGLSGAAIQFRCSNLREPR---LVRDIPASLDWREKGAVTQVKDQGSC 138

Query: 559  GACWSFAATGAIEGINKIVTGSLLSLSEQELVDCDTTYNSGCEGGLMDYAYQFVIDNNGI 738
            GACWSF+ATGAIEG+NKIVTGSL+SLSEQELVDCD TYN+GCEGGLMDYA+QFVI+N+GI
Sbjct: 139  GACWSFSATGAIEGVNKIVTGSLISLSEQELVDCDKTYNTGCEGGLMDYAFQFVINNHGI 198

Query: 739  DTEEDYPYQGRQRLCNKDKLRRRVVTIDGYIDVPRNDEKRLLKAVADQPVSVGICGSERA 918
            DTEEDYPYQGR+  CNK+KL+R VVTID Y DVP  +EK+LL+AVA QPVSVGICGSERA
Sbjct: 199  DTEEDYPYQGREHTCNKEKLKRHVVTIDDYTDVPMTNEKKLLQAVATQPVSVGICGSERA 258

Query: 919  FQLYSKGIFNGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGHMHMLRNTGDS 1098
            FQLY KGIF GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG  WGMNG++HM+RNTG S
Sbjct: 259  FQLYCKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMIRNTGKS 318

Query: 1099 EGLCGINMLASYXXXXXXXXXXXXXXXXXKCNLFTYCSGGETCCCARSLLGICFKWKCCG 1278
            EG+CGINMLASY                 KC+ FTYCS GETCCC   + GICF WKCCG
Sbjct: 319  EGICGINMLASYPIKTSPNPPPSPPPGPTKCDFFTYCSAGETCCCTHRIFGICFLWKCCG 378

Query: 1279 LTSAVCCKDKRHCCSQDYPVCDTRRGQCLKRISNGT 1386
            L SAVCCKD RHCC  +YP+CDT+  QCLKR+ N T
Sbjct: 379  LDSAVCCKDNRHCCPHNYPICDTKNNQCLKRVGNAT 414

