BLASTX nr result

ID: Rehmannia26_contig00016332 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00016332
         (1745 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307688.2| cysteine protease family protein [Populus tr...   598   e-168
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   597   e-168
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          591   e-166
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   585   e-164
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   580   e-163
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   580   e-163
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   578   e-162
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   577   e-162
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   577   e-162
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   571   e-160
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   569   e-159
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   568   e-159
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   568   e-159
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   565   e-158
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   561   e-157
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   560   e-157
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   558   e-156
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   557   e-156
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   545   e-152
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   543   e-152

>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  598 bits (1543), Expect = e-168
 Identities = 277/411 (67%), Positives = 326/411 (79%)
 Frame = -2

Query: 1600 PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 1421
            P+  SS IS LF++WCKE+GK+YTS++ER HRLKVFE NY++V +HN + NSSY+L+LNA
Sbjct: 18   PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77

Query: 1420 FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 1241
            FADLT+HEFK   LGLS +  +L  R N     + G       DIP+S+DWRNKG VT V
Sbjct: 78   FADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASIDWRNKGVVTNV 129

Query: 1240 KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1061
            KDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CDKSYNDGCGGGLMDYA++F+
Sbjct: 130  KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189

Query: 1060 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGI 881
            I N GIDTEEDYPYR RDGTCNKD++KR VVTID Y D+P  NEK+LLQAVA QPVSVGI
Sbjct: 190  INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249

Query: 880  CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 701
            CGS+ +FQ+YS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG  WGM GYMHM 
Sbjct: 250  CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309

Query: 700  RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICL 521
            RN+G ++GVCGINMLA                 PT+CNL TYC++GETCCCAR F GIC+
Sbjct: 310  RNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICI 369

Query: 520  KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKS 368
             W+CC   SAVCCKD  HCCPHDYPVCDT +N+C KR GN+T ++ I  K+
Sbjct: 370  SWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKT 420


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  597 bits (1540), Expect = e-168
 Identities = 278/410 (67%), Positives = 326/410 (79%), Gaps = 1/410 (0%)
 Frame = -2

Query: 1606 QVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSL 1427
            QV    SSSISDLFDSWC+E+GKTY SE+EREHRL VF +NY+++  HN RAN SYTLSL
Sbjct: 17   QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSL 76

Query: 1426 NAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVT 1247
            NAFADLT  EF  +YLG SPS  DLLIR N G  +    N+   S +PSS+DWR KGAVT
Sbjct: 77   NAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVT 133

Query: 1246 GVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYE 1067
            G+KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD SYN GC GGLMDYAYE
Sbjct: 134  GIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYE 193

Query: 1066 FIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSV 887
            FI+KNKGIDTEEDY Y+GRD +C+++KL + VVTIDSY DIP KNE+ LL+AVA+QPVSV
Sbjct: 194  FILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSV 253

Query: 886  GICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMH 707
            GI G D+ FQ YS GIFTGPCSTSLDHAVLIVGYDSK+GKDYWI+KNSWG+ WGM+GYM+
Sbjct: 254  GISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMY 313

Query: 706  MLRNTGTAEGVCGINMLA-XXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLG 530
            + RNTG   G+C INM+A                  PT+C+LF+YCS GETCCCAR FLG
Sbjct: 314  VQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLG 373

Query: 529  ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 380
            +C++++CC A SAVCC+D+ HCCP DYP+CDT +++C K  GNST   P+
Sbjct: 374  LCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPV 423


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  591 bits (1524), Expect = e-166
 Identities = 274/409 (66%), Positives = 327/409 (79%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            SS I+ LF++WC+++GKTY S++E+  RLKVF+ NY++V +HN + NSSYTLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS +AS     LN   +    P+FV  +D+P+S+DWR  GAVT VKDQG
Sbjct: 83   THHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVDWRKNGAVTQVKDQG 137

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDKSYN+GC GG+MDYA++F+I N 
Sbjct: 138  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTEEDYPY+GRD +CNK+KLKRHVVTID Y D+P  NEK+LL+AVANQPVSVGICGS+
Sbjct: 198  GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257

Query: 868  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 689
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG YWGM+GYMHM RN+G
Sbjct: 258  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317

Query: 688  TAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRC 509
            ++ G+CGINMLA                 PTRC+LFT+C  GETCCC  H  GICL W+C
Sbjct: 318  SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377

Query: 508  CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
            CE  SAVCCKD  HCCP DYPVCDT RNICLK  GN+T ++  AK S S
Sbjct: 378  CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSS 426


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  585 bits (1507), Expect = e-164
 Identities = 273/430 (63%), Positives = 327/430 (76%)
 Frame = -2

Query: 1651 MCWXXXXXXXXXXXFQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 1472
            M W           FQ P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60

Query: 1471 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 1292
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDV 119

Query: 1291 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1112
            D PSSLDWR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1111 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 932
            YN+GCGGGLMDYA+EF+IKN GIDTE+DYP+R R+GTCNK+KL+RHVVTID Y DIP  +
Sbjct: 180  YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239

Query: 931  EKKLLQAVANQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 752
            E KLL+AVA QPVSVGICGS  +FQ YS GIFTGPCST+LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299

Query: 751  KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYC 572
            KNSWG  WG+NGY+HM RN+G  EG+CGIN LA                 P++C++FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359

Query: 571  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 392
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419

Query: 391  VKPIAKKSFS 362
            V+   K++F+
Sbjct: 420  VQQPQKEAFT 429


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  580 bits (1496), Expect = e-163
 Identities = 267/409 (65%), Positives = 322/409 (78%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L++AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 868  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 689
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 688  TAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRC 509
             ++GVCGINMLA                 PT+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 508  CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  580 bits (1494), Expect = e-163
 Identities = 267/409 (65%), Positives = 322/409 (78%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L++AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 868  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 689
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 688  TAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRC 509
             ++GVCGINMLA                 PT+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 508  CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  578 bits (1491), Expect = e-162
 Identities = 268/411 (65%), Positives = 322/411 (78%), Gaps = 2/411 (0%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            S  IS+LFD WC+ +GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS SAS L++       A  G +    + +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQG 137

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L +AVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257

Query: 868  SSFQLYS--GGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRN 695
             +FQLYS   GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RN
Sbjct: 258  RAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 317

Query: 694  TGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKW 515
            TG +EG+CGINMLA                 PT+CNLFTYCS+GETCCCAR+  G+C  W
Sbjct: 318  TGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSW 377

Query: 514  RCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
            +CCE  SAVCC D  HCCPHDYPVCDT R++CLK+ GN T +KP  KK  S
Sbjct: 378  KCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSS 428


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  577 bits (1488), Expect = e-162
 Identities = 266/414 (64%), Positives = 322/414 (77%)
 Frame = -2

Query: 1603 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 1424
            + +  S  I++LFD WC  +GKTY SE+ER+HR+++F  N+++V QHN  +NS+Y+LSLN
Sbjct: 25   ISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLN 84

Query: 1423 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 1244
            AFADLT+HEFKA  LGLS  +  L+ +  +      G +      +P S+DWR KGAVT 
Sbjct: 85   AFADLTHHEFKASRLGLSAPSPSLMAKEQSL-----GVSERVRVKVPDSVDWRKKGAVTN 139

Query: 1243 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1064
            VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF
Sbjct: 140  VKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEF 199

Query: 1063 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVG 884
            +IKN GIDTE+DYPY+ +DGTC KDKLK+ VVTIDSYA + + NEK L++AVA+QPVSVG
Sbjct: 200  VIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVG 259

Query: 883  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 704
            ICGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM
Sbjct: 260  ICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHM 319

Query: 703  LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGIC 524
             RNTG +EGVCGINMLA                 PT+CNLFTYCSSGETCCCAR   G+C
Sbjct: 320  QRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLC 379

Query: 523  LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
              W+CCE  SAVCCKD  HCCP DYPVCDT +++CLK+ GN T +KP  KK+ S
Sbjct: 380  FSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS 433


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  577 bits (1486), Expect = e-162
 Identities = 269/430 (62%), Positives = 322/430 (74%)
 Frame = -2

Query: 1651 MCWXXXXXXXXXXXFQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 1472
            M W           FQ P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60

Query: 1471 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 1292
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDV 119

Query: 1291 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1112
            D PSSLDWR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1111 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 932
            YN GCGGGLMDYA+EF+IKN GIDTE+DYP+R ++GTCNK+KL+R VVTID Y DIP  +
Sbjct: 180  YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239

Query: 931  EKKLLQAVANQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 752
            E KLL+AVA QPVSVGICGS  +FQ YS GIFTGPC T LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299

Query: 751  KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYC 572
            KNSWG  WG+NGY+HM RN+G  EG+CG+N LA                 P++C+ FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359

Query: 571  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 392
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419

Query: 391  VKPIAKKSFS 362
            V+   K+ F+
Sbjct: 420  VQQPQKEPFT 429


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  571 bits (1472), Expect = e-160
 Identities = 263/392 (67%), Positives = 315/392 (80%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            SS IS LF+SW KE+GKTYTS++++ +R K+FE+NYE+V +HN + NSSYTLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS  ++   +   N P      +FV   D+P S+DWR KGAV+ VKDQG
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLH----DFV--GDVPISIDWRKKGAVSQVKDQG 138

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD+SYN+GC GGLMDYAY+F+I+N 
Sbjct: 139  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENN 198

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTEEDYPY+ R+ TCNK+KLKRHVVTID Y D+P  NEK+LL+AVA QPVSVGICGS+
Sbjct: 199  GIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSE 258

Query: 868  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 689
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG +WG+NGYM+MLRN+G
Sbjct: 259  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318

Query: 688  TAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRC 509
             ++G+CGINMLA                 PT+C+LFT C  GETCCC R   G+C  W+C
Sbjct: 319  NSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKC 378

Query: 508  CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 413
            CE  SAVCCKD  HCCPHDYPVCDTKRN+CLK
Sbjct: 379  CELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  569 bits (1467), Expect = e-159
 Identities = 267/415 (64%), Positives = 319/415 (76%), Gaps = 1/415 (0%)
 Frame = -2

Query: 1603 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 1424
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 1423 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 1244
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGTLR--DVPASIDWRKKGAVTE 131

Query: 1243 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1064
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 1063 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVG 884
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LLQAV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 883  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 704
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLIVGYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 703  LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGIC 524
             RNTG + G+CGINMLA                 PTRC+L TYC++GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371

Query: 523  LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 362
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R  GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRGSS 426


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  568 bits (1465), Expect = e-159
 Identities = 266/415 (64%), Positives = 319/415 (76%), Gaps = 1/415 (0%)
 Frame = -2

Query: 1603 VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 1424
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 1423 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 1244
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGNLR--DVPASIDWRKKGAVTE 131

Query: 1243 VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1064
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 1063 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVG 884
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LLQAV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 883  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 704
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 703  LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGIC 524
             RNTG + G+CGINMLA                 PTRC+L TYC+ GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGIC 371

Query: 523  LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 362
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R+ GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSS 426


>gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  568 bits (1463), Expect = e-159
 Identities = 264/405 (65%), Positives = 317/405 (78%)
 Frame = -2

Query: 1585 SSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLT 1406
            S IS LF++WC ++GK Y+SE+E+ +RLKVFE+NY +V QHN   NSSY+L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 1405 NHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGS 1226
            +HEFKA  LGLS +A      +      +  P  V+  DIP+S+DWR KGAVT VKDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 1225 CGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKG 1046
            CGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD+SYN GC GGLMDYAY+F+I N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 1045 IDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSDS 866
            ID EEDYPY GR+ TCNK+K KR VVTID YA +PA NE  LLQAVA QPVSVGICGS+ 
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 865  SFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGT 686
            +FQLYS GIFTGPCS+SLDHAVLIVGY S++G DYWI+KNSWG  WGMNGY+HMLRN+G 
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 685  AEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRCC 506
            ++G+CGINMLA                 PT+C+LFTYCS+GETCCC     GIC  W+CC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 505  EAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKK 371
            E  SAVCCKD+ HCCP+DYPVCDTK++ CLKR+GN+T ++   K+
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  565 bits (1456), Expect = e-158
 Identities = 266/437 (60%), Positives = 319/437 (72%), Gaps = 28/437 (6%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            S  IS+LFD WC+ +GKTY SE E++HR ++F  N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 27   SDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATYSLSLNAFADL 86

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
             + EFK   LGLS SA  +++       A  G +      +P SLDWR KGAVT VKDQG
Sbjct: 87   NHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKKGAVTNVKDQG 139

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYNDGC GGLMDYA+EF+IKNK
Sbjct: 140  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFEFVIKNK 199

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTE+DYPY+ RDGTC KDKLK+ VV+IDSYA +   +EK LL+AVA QPVSVGICGS+
Sbjct: 200  GIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQPVSVGICGSE 259

Query: 868  SSFQLYSG----------------------------GIFTGPCSTSLDHAVLIVGYDSKD 773
             +FQLYS                             GIF+GPCSTSLDHAVLIVGY S++
Sbjct: 260  RAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHAVLIVGYGSQN 319

Query: 772  GKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTR 593
            G DYWI+KNSWG+ WGM+G+MHM RNTG ++G+CGINMLA                 PT+
Sbjct: 320  GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNPPPPSPPGPTK 379

Query: 592  CNLFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 413
            CNLFTYCS+ ETCCCAR+  G+CL W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 380  CNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 439

Query: 412  RIGNSTFVKPIAKKSFS 362
            + GN T +KP  KK+ S
Sbjct: 440  KTGNFTAIKPFWKKNSS 456


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  561 bits (1446), Expect = e-157
 Identities = 261/413 (63%), Positives = 315/413 (76%)
 Frame = -2

Query: 1600 PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 1421
            P   +S++S+LF+ WC E+GK+Y+S +E+ +RL VF  NYE+V  HN   NSSYTLSLN+
Sbjct: 18   PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNS 77

Query: 1420 FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 1241
            +ADLT+HEFK   LG SP+  +    L   P+           D+P SLDWR KGAVT V
Sbjct: 78   YADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLDWRKKGAVTAV 129

Query: 1240 KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1061
            KDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN GCGGGLMDYAY+F+
Sbjct: 130  KDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFV 189

Query: 1060 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGI 881
            I N GIDTE DYPY+ RDG+C KDKL+R+VVTID YADIP+ +E KLLQAVA QPVSVGI
Sbjct: 190  ISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGI 249

Query: 880  CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 701
            CGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+GYMHM 
Sbjct: 250  CGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQ 309

Query: 700  RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICL 521
            RN+G +EGVCGIN LA                 PT+C++ T C++GETCCCA+ FLG+CL
Sbjct: 310  RNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCL 369

Query: 520  KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
             W+CC   SAVCCKD  HCCP DYP+CDT RN+CLK+  N T  + +  +S S
Sbjct: 370  SWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS 422


>ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
            gi|302142569|emb|CBI19772.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  560 bits (1442), Expect = e-157
 Identities = 256/425 (60%), Positives = 325/425 (76%)
 Frame = -2

Query: 1591 KSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFAD 1412
            ++SS +DLF++WC++YGKTY+SE+E+  RLKVFE+N+ +V QHN  AN+SYTL+LNAFAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 1411 LTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQ 1232
            LT+HEFKA  LG SP  +  +        ++  P  VQE  +P ++DWR  GAVTGVKDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSI-------RSVGTP--VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 1231 GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKN 1052
            G+CG CWSFS TGA+EGIN+I TGSLVSLSEQEL+DCD+SYN GC GGLMDYAY+F+IKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 1051 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGS 872
            +GID+E DYPY G D  CNK+KLK+H+VTID Y DIP  +EK+LLQ VA QPVSVGICGS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 871  DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 692
            + +FQLYS G++TGPCS++LDHAVLIVGY ++DG D+WI+KNSWG +WGM GY+HMLRN 
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 691  GTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWR 512
            GTAEG+CGINMLA                 PT+C+ F+ CS GETCCC+  F+G+CL W 
Sbjct: 312  GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371

Query: 511  CCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFSI*LSKQCGYT 332
            CC A SAVCC ++ +CCP  +P+CDTKRN CLK  GN T V+ + ++  S+   K  G++
Sbjct: 372  CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSSV---KFGGWS 428

Query: 331  SYSSA 317
            S + A
Sbjct: 429  SINDA 433


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  558 bits (1439), Expect = e-156
 Identities = 259/399 (64%), Positives = 311/399 (77%), Gaps = 7/399 (1%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 23   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 82

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 83   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 135

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1049
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 136  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 195

Query: 1048 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSD 869
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L++AVA QPVSVGICGS+
Sbjct: 196  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255

Query: 868  SSFQLYSG-------GIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYM 710
             +FQLYS        GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+M
Sbjct: 256  RAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315

Query: 709  HMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLG 530
            HM RNT  ++GVCGINMLA                 PT+CNLFTYCSSGETCCCAR   G
Sbjct: 316  HMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFG 375

Query: 529  ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 413
            +C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 376  LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  557 bits (1435), Expect = e-156
 Identities = 265/396 (66%), Positives = 299/396 (75%)
 Frame = -2

Query: 1576 SDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLTNHE 1397
            S LF  WCK++GKTY SEQE+ +R  VFE NY +V QHN   NSSYTLSLNAFADLT+HE
Sbjct: 27   SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86

Query: 1396 FKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGSCGA 1217
            FKA  LGL PS S L  + N         +F+Q   +PS +DWR  GAV+ VKDQGSCGA
Sbjct: 87   FKATRLGLPPS-SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSCGA 142

Query: 1216 CWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKGIDT 1037
            CWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GGLMDYAY+FII N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDT 202

Query: 1036 EEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGSDSSFQ 857
            EEDYPY+ R   C KDKLKR VVTID Y D+P  +EKKLL+AVA QPVSVGICGS  +FQ
Sbjct: 203  EEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQ 262

Query: 856  LYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEG 677
            LYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY+HMLRNT ++ G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAG 322

Query: 676  VCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRCCEAV 497
            +CGINMLA                 P +CNLFTYCS GETCCCA+ FLGIC  W+CC   
Sbjct: 323  LCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVT 382

Query: 496  SAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFV 389
            SAVCCKD  HCCP DYPVCD     CLKRI N T +
Sbjct: 383  SAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTIL 418


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  545 bits (1404), Expect = e-152
 Identities = 256/402 (63%), Positives = 307/402 (76%), Gaps = 4/402 (0%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 1409
            SSS S+LF++WCK+YGK+Y+S++E+ +RL +FE+N  ++ QHN   NSSYTLSLN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 1408 TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 1229
            T+HEFKA  LG SP+   L  + +  P+ +          +PSS+DWR  GAVT VKDQG
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKPSVV--------RHVPSSIDWRKNGAVTNVKDQG 136

Query: 1228 SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY-NDGCGGGLMDYAYEFIIKN 1052
            SCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+ Y N GC GGLMD A++FII N
Sbjct: 137  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDN 196

Query: 1051 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPVSVGICGS 872
             GIDTEEDYPY+G DGTCNK KLKRHVVTID Y D+PA NE++LL+AVA QPVSVGI GS
Sbjct: 197  NGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGS 256

Query: 871  DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 692
               FQ YS GIF GPCST+LDHAVLIVGY S++G DYWI+KNSWG+ WGMNGY+H+LR+ 
Sbjct: 257  GREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDH 316

Query: 691  GTAEGVCGINMLA---XXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICL 521
              ++G+CGINMLA                    PT+C+LF+ C  GETCCCAR  LGICL
Sbjct: 317  SNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICL 376

Query: 520  KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNST 395
             WRCCE  SAVCCKD  HCCPHDYP+CDT+RN CL+  GN T
Sbjct: 377  SWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLT 418


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  543 bits (1400), Expect = e-152
 Identities = 260/417 (62%), Positives = 308/417 (73%), Gaps = 8/417 (1%)
 Frame = -2

Query: 1588 SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRAN-----SSYTLSLN 1424
            +S  S+LF+ WCKE+ KTY+SE+E+ +RLKVFE NY +V QHN  AN     SSYTLSLN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 1423 AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD---IPSSLDWRNKGA 1253
            AFADLT+HEFK   LGL  +    L+R          P   Q  D   IPS +DWR  GA
Sbjct: 86   AFADLTHHEFKTTRLGLPLT----LLRFKR-------PQNQQSRDLLHIPSQIDWRQSGA 134

Query: 1252 VTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYA 1073
            VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD SYN GCGGGLMD+A
Sbjct: 135  VTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFA 194

Query: 1072 YEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLQAVANQPV 893
            Y+F+I NKGIDTE+DYPY+ R  +C+KDKLKR  VTI+ Y D+P  +E+++L+AVA+QPV
Sbjct: 195  YQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVP-PSEEEILKAVASQPV 253

Query: 892  SVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGY 713
            SVGICGS+  FQLYS GIFTGPCST LDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY
Sbjct: 254  SVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGY 313

Query: 712  MHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFL 533
            +HM+RN+G ++G+CGIN LA                 P RCNLFT+CS GETCCCA+ FL
Sbjct: 314  IHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFL 373

Query: 532  GICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 362
            GIC  W+CC   SAVCCKD  HCCP DYP+CDT+R  CLKR  N T       + FS
Sbjct: 374  GICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFS 430

