BLASTX nr result

ID: Rehmannia22_contig00018645 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00018645
         (2170 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   598   e-168
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   598   e-168
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          589   e-165
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   587   e-165
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   582   e-163
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   582   e-163
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   580   e-163
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   578   e-162
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   578   e-162
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   572   e-160
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   569   e-159
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   568   e-159
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   567   e-159
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   566   e-158
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   561   e-157
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   560   e-157
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   560   e-157
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   558   e-156
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   547   e-153
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   543   e-152

>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  598 bits (1543), Expect = e-168
 Identities = 278/410 (67%), Positives = 325/410 (79%), Gaps = 1/410 (0%)
 Frame = +1

Query: 544  QVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSL 723
            QV    SSSISDLFDSWC+E+GKTY SE+EREHRL VF +NY+++  HN RAN SYTLSL
Sbjct: 17   QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSL 76

Query: 724  NAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVT 903
            NAFADLT  EF  +YLG SPS  DLLIR N G  +    N+   S +PSS+DWR KGAVT
Sbjct: 77   NAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVT 133

Query: 904  GVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYE 1083
            G+KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD SYN GC GGLMDYAYE
Sbjct: 134  GIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYE 193

Query: 1084 FIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSV 1263
            FI+KNKGIDTEEDY Y+GRD +C+++KL + VVTIDSY DIP KNE+ LLEAVA+QPVSV
Sbjct: 194  FILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSV 253

Query: 1264 GICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMH 1443
            GI G D+ FQ YS GIFTGPCSTSLDHAVLIVGYDSK+GKDYWI+KNSWG+ WGM+GYM+
Sbjct: 254  GISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMY 313

Query: 1444 MLRNTGTAEGVCGINMLA-XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1620
            + RNTG   G+C INM+A                   T+C+LF+YCS GETCCCAR FLG
Sbjct: 314  VQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLG 373

Query: 1621 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 1770
            +C++++CC A SAVCC+D+ HCCP DYP+CDT +++C K  GNST   P+
Sbjct: 374  LCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPV 423


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  598 bits (1542), Expect = e-168
 Identities = 275/411 (66%), Positives = 325/411 (79%)
 Frame = +1

Query: 550  PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 729
            P+  SS IS LF++WCKE+GK+YTS++ER HRLKVFE NY++V +HN + NSSY+L+LNA
Sbjct: 18   PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77

Query: 730  FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 909
            FADLT+HEFK   LGLS +  +L  R N     + G       DIP+S+DWRNKG VT V
Sbjct: 78   FADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASIDWRNKGVVTNV 129

Query: 910  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1089
            KDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CDKSYNDGCGGGLMDYA++F+
Sbjct: 130  KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189

Query: 1090 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 1269
            I N GIDTEEDYPYR RDGTCNKD++KR VVTID Y D+P  NEK+LL+AVA QPVSVGI
Sbjct: 190  INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249

Query: 1270 CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1449
            CGS+ +FQ+YS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG  WGM GYMHM 
Sbjct: 250  CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309

Query: 1450 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629
            RN+G ++GVCGINMLA                  T+CNL TYC++GETCCCAR F GIC+
Sbjct: 310  RNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICI 369

Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKS 1782
             W+CC   SAVCCKD  HCCPHDYPVCDT +N+C KR GN+T ++ I  K+
Sbjct: 370  SWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKT 420


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  589 bits (1518), Expect = e-165
 Identities = 272/409 (66%), Positives = 325/409 (79%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            SS I+ LF++WC+++GKTY S++E+  RLKVF+ NY++V +HN + NSSYTLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS +AS     LN   +    P+FV  +D+P+S+DWR  GAVT VKDQG
Sbjct: 83   THHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVDWRKNGAVTQVKDQG 137

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDKSYN+GC GG+MDYA++F+I N 
Sbjct: 138  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTEEDYPY+GRD +CNK+KLKRHVVTID Y D+P  NEK+LL+AVA QPVSVGICGS+
Sbjct: 198  GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257

Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG YWGM+GYMHM RN+G
Sbjct: 258  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317

Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641
            ++ G+CGINMLA                  TRC+LFT+C  GETCCC  H  GICL W+C
Sbjct: 318  SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377

Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
            CE  SAVCCKD  HCCP DYPVCDT RNICLK  GN+T ++  AK S S
Sbjct: 378  CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSS 426


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  587 bits (1512), Expect = e-165
 Identities = 272/430 (63%), Positives = 326/430 (75%)
 Frame = +1

Query: 499  MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 678
            M W            Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60

Query: 679  NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 858
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDV 119

Query: 859  DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1038
            D PSSLDWR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1039 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 1218
            YN+GCGGGLMDYA+EF+IKN GIDTE+DYP+R R+GTCNK+KL+RHVVTID Y DIP  +
Sbjct: 180  YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239

Query: 1219 EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1398
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPCST+LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299

Query: 1399 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1578
            KNSWG  WG+NGY+HM RN+G  EG+CGIN LA                  ++C++FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359

Query: 1579 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1758
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419

Query: 1759 VKPIAKKSFS 1788
            V+   K++F+
Sbjct: 420  VQQPQKEAFT 429


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  582 bits (1501), Expect = e-163
 Identities = 267/409 (65%), Positives = 321/409 (78%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641
             ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  582 bits (1499), Expect = e-163
 Identities = 267/409 (65%), Positives = 321/409 (78%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641
             ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  580 bits (1496), Expect = e-163
 Identities = 268/411 (65%), Positives = 321/411 (78%), Gaps = 2/411 (0%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            S  IS+LFD WC+ +GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS SAS L++       A  G +    + +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQG 137

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257

Query: 1282 SSFQLYS--GGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRN 1455
             +FQLYS   GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RN
Sbjct: 258  RAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 317

Query: 1456 TGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKW 1635
            TG +EG+CGINMLA                  T+CNLFTYCS+GETCCCAR+  G+C  W
Sbjct: 318  TGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSW 377

Query: 1636 RCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
            +CCE  SAVCC D  HCCPHDYPVCDT R++CLK+ GN T +KP  KK  S
Sbjct: 378  KCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSS 428


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  578 bits (1491), Expect = e-162
 Identities = 266/414 (64%), Positives = 321/414 (77%)
 Frame = +1

Query: 547  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726
            + +  S  I++LFD WC  +GKTY SE+ER+HR+++F  N+++V QHN  +NS+Y+LSLN
Sbjct: 25   ISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLN 84

Query: 727  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906
            AFADLT+HEFKA  LGLS  +  L+ +  +      G +      +P S+DWR KGAVT 
Sbjct: 85   AFADLTHHEFKASRLGLSAPSPSLMAKEQSL-----GVSERVRVKVPDSVDWRKKGAVTN 139

Query: 907  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086
            VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF
Sbjct: 140  VKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEF 199

Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266
            +IKN GIDTE+DYPY+ +DGTC KDKLK+ VVTIDSYA + + NEK L+EAVA+QPVSVG
Sbjct: 200  VIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVG 259

Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446
            ICGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM
Sbjct: 260  ICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHM 319

Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626
             RNTG +EGVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C
Sbjct: 320  QRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLC 379

Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
              W+CCE  SAVCCKD  HCCP DYPVCDT +++CLK+ GN T +KP  KK+ S
Sbjct: 380  FSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS 433


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  578 bits (1491), Expect = e-162
 Identities = 268/430 (62%), Positives = 321/430 (74%)
 Frame = +1

Query: 499  MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 678
            M W            Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60

Query: 679  NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 858
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDV 119

Query: 859  DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1038
            D PSSLDWR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1039 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 1218
            YN GCGGGLMDYA+EF+IKN GIDTE+DYP+R ++GTCNK+KL+R VVTID Y DIP  +
Sbjct: 180  YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239

Query: 1219 EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1398
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPC T LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299

Query: 1399 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1578
            KNSWG  WG+NGY+HM RN+G  EG+CG+N LA                  ++C+ FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359

Query: 1579 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1758
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419

Query: 1759 VKPIAKKSFS 1788
            V+   K+ F+
Sbjct: 420  VQQPQKEPFT 429


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  572 bits (1474), Expect = e-160
 Identities = 262/392 (66%), Positives = 314/392 (80%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            SS IS LF+SW KE+GKTYTS++++ +R K+FE+NYE+V +HN + NSSYTLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS  ++   +   N P      +FV   D+P S+DWR KGAV+ VKDQG
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLH----DFV--GDVPISIDWRKKGAVSQVKDQG 138

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD+SYN+GC GGLMDYAY+F+I+N 
Sbjct: 139  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENN 198

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTEEDYPY+ R+ TCNK+KLKRHVVTID Y D+P  NEK+LL+AVA QPVSVGICGS+
Sbjct: 199  GIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSE 258

Query: 1282 SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1461
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG +WG+NGYM+MLRN+G
Sbjct: 259  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318

Query: 1462 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1641
             ++G+CGINMLA                  T+C+LFT C  GETCCC R   G+C  W+C
Sbjct: 319  NSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKC 378

Query: 1642 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737
            CE  SAVCCKD  HCCPHDYPVCDTKRN+CLK
Sbjct: 379  CELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  569 bits (1466), Expect = e-159
 Identities = 265/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%)
 Frame = +1

Query: 547  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 727  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGTLR--DVPASIDWRKKGAVTE 131

Query: 907  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LL+AV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLIVGYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626
             RNTG + G+CGINMLA                  TRC+L TYC++GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371

Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1788
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R  GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRGSS 426


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  568 bits (1464), Expect = e-159
 Identities = 264/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%)
 Frame = +1

Query: 547  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 726
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 727  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 906
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGNLR--DVPASIDWRKKGAVTE 131

Query: 907  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 1086
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 1087 IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 1266
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LL+AV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 1267 ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1446
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 1447 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1626
             RNTG + G+CGINMLA                  TRC+L TYC+ GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGIC 371

Query: 1627 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1788
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R+ GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSS 426


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  567 bits (1461), Expect = e-159
 Identities = 266/437 (60%), Positives = 318/437 (72%), Gaps = 28/437 (6%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            S  IS+LFD WC+ +GKTY SE E++HR ++F  N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 27   SDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATYSLSLNAFADL 86

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
             + EFK   LGLS SA  +++       A  G +      +P SLDWR KGAVT VKDQG
Sbjct: 87   NHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKKGAVTNVKDQG 139

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYNDGC GGLMDYA+EF+IKNK
Sbjct: 140  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFEFVIKNK 199

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTE+DYPY+ RDGTC KDKLK+ VV+IDSYA +   +EK LLEAVA QPVSVGICGS+
Sbjct: 200  GIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQPVSVGICGSE 259

Query: 1282 SSFQLYSG----------------------------GIFTGPCSTSLDHAVLIVGYDSKD 1377
             +FQLYS                             GIF+GPCSTSLDHAVLIVGY S++
Sbjct: 260  RAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHAVLIVGYGSQN 319

Query: 1378 GKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTR 1557
            G DYWI+KNSWG+ WGM+G+MHM RNTG ++G+CGINMLA                  T+
Sbjct: 320  GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNPPPPSPPGPTK 379

Query: 1558 CNLFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737
            CNLFTYCS+ ETCCCAR+  G+CL W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 380  CNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 439

Query: 1738 RIGNSTFVKPIAKKSFS 1788
            + GN T +KP  KK+ S
Sbjct: 440  KTGNFTAIKPFWKKNSS 456


>gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  566 bits (1459), Expect = e-158
 Identities = 262/405 (64%), Positives = 316/405 (78%)
 Frame = +1

Query: 565  SSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLT 744
            S IS LF++WC ++GK Y+SE+E+ +RLKVFE+NY +V QHN   NSSY+L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 745  NHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGS 924
            +HEFKA  LGLS +A      +      +  P  V+  DIP+S+DWR KGAVT VKDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 925  CGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKG 1104
            CGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD+SYN GC GGLMDYAY+F+I N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 1105 IDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDS 1284
            ID EEDYPY GR+ TCNK+K KR VVTID YA +PA NE  LL+AVA QPVSVGICGS+ 
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 1285 SFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGT 1464
            +FQLYS GIFTGPCS+SLDHAVLIVGY S++G DYWI+KNSWG  WGMNGY+HMLRN+G 
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 1465 AEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCC 1644
            ++G+CGINMLA                  T+C+LFTYCS+GETCCC     GIC  W+CC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 1645 EAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKK 1779
            E  SAVCCKD+ HCCP+DYPVCDTK++ CLKR+GN+T ++   K+
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  561 bits (1445), Expect = e-157
 Identities = 259/413 (62%), Positives = 314/413 (76%)
 Frame = +1

Query: 550  PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 729
            P   +S++S+LF+ WC E+GK+Y+S +E+ +RL VF  NYE+V  HN   NSSYTLSLN+
Sbjct: 18   PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNS 77

Query: 730  FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 909
            +ADLT+HEFK   LG SP+  +    L   P+           D+P SLDWR KGAVT V
Sbjct: 78   YADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLDWRKKGAVTAV 129

Query: 910  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 1089
            KDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN GCGGGLMDYAY+F+
Sbjct: 130  KDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFV 189

Query: 1090 IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 1269
            I N GIDTE DYPY+ RDG+C KDKL+R+VVTID YADIP+ +E KLL+AVA QPVSVGI
Sbjct: 190  ISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGI 249

Query: 1270 CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1449
            CGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+GYMHM 
Sbjct: 250  CGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQ 309

Query: 1450 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629
            RN+G +EGVCGIN LA                  T+C++ T C++GETCCCA+ FLG+CL
Sbjct: 310  RNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCL 369

Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
             W+CC   SAVCCKD  HCCP DYP+CDT RN+CLK+  N T  + +  +S S
Sbjct: 370  SWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS 422


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  560 bits (1444), Expect = e-157
 Identities = 259/399 (64%), Positives = 310/399 (77%), Gaps = 7/399 (1%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 23   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 82

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 83   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 135

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 1101
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 136  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 195

Query: 1102 GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 1281
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 196  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255

Query: 1282 SSFQLYSG-------GIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYM 1440
             +FQLYS        GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+M
Sbjct: 256  RAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315

Query: 1441 HMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1620
            HM RNT  ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G
Sbjct: 316  HMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFG 375

Query: 1621 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1737
            +C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 376  LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
            gi|302142569|emb|CBI19772.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  560 bits (1443), Expect = e-157
 Identities = 255/425 (60%), Positives = 324/425 (76%)
 Frame = +1

Query: 559  KSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFAD 738
            ++SS +DLF++WC++YGKTY+SE+E+  RLKVFE+N+ +V QHN  AN+SYTL+LNAFAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 739  LTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQ 918
            LT+HEFKA  LG SP  +  +        ++  P  VQE  +P ++DWR  GAVTGVKDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSI-------RSVGTP--VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 919  GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKN 1098
            G+CG CWSFS TGA+EGIN+I TGSLVSLSEQEL+DCD+SYN GC GGLMDYAY+F+IKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 1099 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 1278
            +GID+E DYPY G D  CNK+KLK+H+VTID Y DIP  +EK+LL+ VA QPVSVGICGS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 1279 DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1458
            + +FQLYS G++TGPCS++LDHAVLIVGY ++DG D+WI+KNSWG +WGM GY+HMLRN 
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 1459 GTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWR 1638
            GTAEG+CGINMLA                  T+C+ F+ CS GETCCC+  F+G+CL W 
Sbjct: 312  GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371

Query: 1639 CCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFSI*LSKQCGYT 1818
            CC A SAVCC ++ +CCP  +P+CDTKRN CLK  GN T V+ + ++  S+   K  G++
Sbjct: 372  CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSSV---KFGGWS 428

Query: 1819 SYNSA 1833
            S N A
Sbjct: 429  SINDA 433


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  558 bits (1438), Expect = e-156
 Identities = 264/396 (66%), Positives = 298/396 (75%)
 Frame = +1

Query: 574  SDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLTNHE 753
            S LF  WCK++GKTY SEQE+ +R  VFE NY +V QHN   NSSYTLSLNAFADLT+HE
Sbjct: 27   SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86

Query: 754  FKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGSCGA 933
            FKA  LGL PS S L  + N         +F+Q   +PS +DWR  GAV+ VKDQGSCGA
Sbjct: 87   FKATRLGLPPS-SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSCGA 142

Query: 934  CWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKGIDT 1113
            CWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GGLMDYAY+FII N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDT 202

Query: 1114 EEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDSSFQ 1293
            EEDYPY+ R   C KDKLKR VVTID Y D+P  +EKKLL+AVA QPVSVGICGS  +FQ
Sbjct: 203  EEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQ 262

Query: 1294 LYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEG 1473
            LYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY+HMLRNT ++ G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAG 322

Query: 1474 VCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCCEAV 1653
            +CGINMLA                   +CNLFTYCS GETCCCA+ FLGIC  W+CC   
Sbjct: 323  LCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVT 382

Query: 1654 SAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFV 1761
            SAVCCKD  HCCP DYPVCD     CLKRI N T +
Sbjct: 383  SAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTIL 418


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  547 bits (1409), Expect = e-153
 Identities = 256/402 (63%), Positives = 307/402 (76%), Gaps = 4/402 (0%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 741
            SSS S+LF++WCK+YGK+Y+S++E+ +RL +FE+N  ++ QHN   NSSYTLSLN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 742  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 921
            T+HEFKA  LG SP+   L  + +  P+ +          +PSS+DWR  GAVT VKDQG
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKPSVV--------RHVPSSIDWRKNGAVTNVKDQG 136

Query: 922  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY-NDGCGGGLMDYAYEFIIKN 1098
            SCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+ Y N GC GGLMD A++FII N
Sbjct: 137  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDN 196

Query: 1099 KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 1278
             GIDTEEDYPY+G DGTCNK KLKRHVVTID Y D+PA NE++LL+AVATQPVSVGI GS
Sbjct: 197  NGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGS 256

Query: 1279 DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1458
               FQ YS GIF GPCST+LDHAVLIVGY S++G DYWI+KNSWG+ WGMNGY+H+LR+ 
Sbjct: 257  GREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDH 316

Query: 1459 GTAEGVCGINMLA---XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1629
              ++G+CGINMLA                     T+C+LF+ C  GETCCCAR  LGICL
Sbjct: 317  SNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICL 376

Query: 1630 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNST 1755
             WRCCE  SAVCCKD  HCCPHDYP+CDT+RN CL+  GN T
Sbjct: 377  SWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLT 418


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  543 bits (1400), Expect = e-152
 Identities = 259/417 (62%), Positives = 307/417 (73%), Gaps = 8/417 (1%)
 Frame = +1

Query: 562  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRAN-----SSYTLSLN 726
            +S  S+LF+ WCKE+ KTY+SE+E+ +RLKVFE NY +V QHN  AN     SSYTLSLN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 727  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD---IPSSLDWRNKGA 897
            AFADLT+HEFK   LGL  +    L+R          P   Q  D   IPS +DWR  GA
Sbjct: 86   AFADLTHHEFKTTRLGLPLT----LLRFKR-------PQNQQSRDLLHIPSQIDWRQSGA 134

Query: 898  VTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYA 1077
            VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD SYN GCGGGLMD+A
Sbjct: 135  VTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFA 194

Query: 1078 YEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPV 1257
            Y+F+I NKGIDTE+DYPY+ R  +C+KDKLKR  VTI+ Y D+P  +E+++L+AVA+QPV
Sbjct: 195  YQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVP-PSEEEILKAVASQPV 253

Query: 1258 SVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGY 1437
            SVGICGS+  FQLYS GIFTGPCST LDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY
Sbjct: 254  SVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGY 313

Query: 1438 MHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFL 1617
            +HM+RN+G ++G+CGIN LA                   RCNLFT+CS GETCCCA+ FL
Sbjct: 314  IHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFL 373

Query: 1618 GICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1788
            GIC  W+CC   SAVCCKD  HCCP DYP+CDT+R  CLKR  N T       + FS
Sbjct: 374  GICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFS 430


Top