BLASTX nr result

ID: Rehmannia23_contig00006339 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00006339
         (1779 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   598   e-168
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   598   e-168
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          589   e-165
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   587   e-165
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   582   e-163
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   582   e-163
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   580   e-163
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   578   e-162
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   578   e-162
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   572   e-160
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   569   e-159
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   568   e-159
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   567   e-159
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   566   e-158
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   561   e-157
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   560   e-157
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   560   e-157
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   558   e-156
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   547   e-153
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   543   e-152

>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  598 bits (1543), Expect = e-168
 Identities = 278/410 (67%), Positives = 325/410 (79%), Gaps = 1/410 (0%)
 Frame = +3

Query: 159  QVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSL 338
            QV    SSSISDLFDSWC+E+GKTY SE+EREHRL VF +NY+++  HN RAN SYTLSL
Sbjct: 17   QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIASHNARANYSYTLSL 76

Query: 339  NAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVT 518
            NAFADLT  EF  +YLG SPS  DLLIR N G  +    N+   S +PSS+DWR KGAVT
Sbjct: 77   NAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SAVPSSIDWRKKGAVT 133

Query: 519  GVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYE 698
            G+KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD SYN GC GGLMDYAYE
Sbjct: 134  GIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSYNQGCNGGLMDYAYE 193

Query: 699  FIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSV 878
            FI+KNKGIDTEEDY Y+GRD +C+++KL + VVTIDSY DIP KNE+ LLEAVA+QPVSV
Sbjct: 194  FILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNEQMLLEAVASQPVSV 253

Query: 879  GICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMH 1058
            GI G D+ FQ YS GIFTGPCSTSLDHAVLIVGYDSK+GKDYWI+KNSWG+ WGM+GYM+
Sbjct: 254  GISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVKNSWGKSWGMDGYMY 313

Query: 1059 MLRNTGTAEGVCGINMLA-XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1235
            + RNTG   G+C INM+A                   T+C+LF+YCS GETCCCAR FLG
Sbjct: 314  VQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYCSQGETCCCARRFLG 373

Query: 1236 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 1385
            +C++++CC A SAVCC+D+ HCCP DYP+CDT +++C K  GNST   P+
Sbjct: 374  LCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTMAIPV 423


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  598 bits (1542), Expect = e-168
 Identities = 275/411 (66%), Positives = 325/411 (79%)
 Frame = +3

Query: 165  PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 344
            P+  SS IS LF++WCKE+GK+YTS++ER HRLKVFE NY++V +HN + NSSY+L+LNA
Sbjct: 18   PSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNA 77

Query: 345  FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 524
            FADLT+HEFK   LGLS +  +L  R N     + G       DIP+S+DWRNKG VT V
Sbjct: 78   FADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG-------DIPASIDWRNKGVVTNV 129

Query: 525  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 704
            KDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CDKSYNDGCGGGLMDYA++F+
Sbjct: 130  KDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFV 189

Query: 705  IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 884
            I N GIDTEEDYPYR RDGTCNKD++KR VVTID Y D+P  NEK+LL+AVA QPVSVGI
Sbjct: 190  INNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGI 249

Query: 885  CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1064
            CGS+ +FQ+YS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG  WGM GYMHM 
Sbjct: 250  CGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQ 309

Query: 1065 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1244
            RN+G ++GVCGINMLA                  T+CNL TYC++GETCCCAR F GIC+
Sbjct: 310  RNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICI 369

Query: 1245 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKS 1397
             W+CC   SAVCCKD  HCCPHDYPVCDT +N+C KR GN+T ++ I  K+
Sbjct: 370  SWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATRMEAIEGKT 420


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  589 bits (1518), Expect = e-165
 Identities = 272/409 (66%), Positives = 325/409 (79%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            SS I+ LF++WC+++GKTY S++E+  RLKVF+ NY++V +HN + NSSYTLSLNAFADL
Sbjct: 23   SSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADL 82

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS +AS     LN   +    P+FV  +D+P+S+DWR  GAVT VKDQG
Sbjct: 83   THHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASVDWRKNGAVTQVKDQG 137

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDKSYN+GC GG+MDYA++F+I N 
Sbjct: 138  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNH 197

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTEEDYPY+GRD +CNK+KLKRHVVTID Y D+P  NEK+LL+AVA QPVSVGICGS+
Sbjct: 198  GIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSE 257

Query: 897  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1076
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG YWGM+GYMHM RN+G
Sbjct: 258  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSG 317

Query: 1077 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1256
            ++ G+CGINMLA                  TRC+LFT+C  GETCCC  H  GICL W+C
Sbjct: 318  SSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKC 377

Query: 1257 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
            CE  SAVCCKD  HCCP DYPVCDT RNICLK  GN+T ++  AK S S
Sbjct: 378  CELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAKNSSS 426


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  587 bits (1512), Expect = e-165
 Identities = 272/430 (63%), Positives = 326/430 (75%)
 Frame = +3

Query: 114  MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 293
            M W            Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60

Query: 294  NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 473
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDV 119

Query: 474  DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 653
            D PSSLDWR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179

Query: 654  YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 833
            YN+GCGGGLMDYA+EF+IKN GIDTE+DYP+R R+GTCNK+KL+RHVVTID Y DIP  +
Sbjct: 180  YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239

Query: 834  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1013
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPCST+LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299

Query: 1014 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1193
            KNSWG  WG+NGY+HM RN+G  EG+CGIN LA                  ++C++FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359

Query: 1194 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1373
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419

Query: 1374 VKPIAKKSFS 1403
            V+   K++F+
Sbjct: 420  VQQPQKEAFT 429


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  582 bits (1501), Expect = e-163
 Identities = 267/409 (65%), Positives = 321/409 (78%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 897  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1076
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1077 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1256
             ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1257 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  582 bits (1499), Expect = e-163
 Identities = 267/409 (65%), Positives = 321/409 (78%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 137

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 257

Query: 897  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1076
             +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RNT 
Sbjct: 258  RAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTE 317

Query: 1077 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1256
             ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C  W+C
Sbjct: 318  NSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKC 377

Query: 1257 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
            CE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 378  CEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 426


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  580 bits (1496), Expect = e-163
 Identities = 268/411 (65%), Positives = 321/411 (78%), Gaps = 2/411 (0%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            S  IS+LFD WC+ +GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 25   SDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 84

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS SAS L++       A  G +    + +P S+DWR KGAVT VKDQG
Sbjct: 85   THHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQG 137

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 138  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 197

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L EAVA QPVSVGICGS+
Sbjct: 198  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSE 257

Query: 897  SSFQLYS--GGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRN 1070
             +FQLYS   GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM RN
Sbjct: 258  RAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRN 317

Query: 1071 TGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKW 1250
            TG +EG+CGINMLA                  T+CNLFTYCS+GETCCCAR+  G+C  W
Sbjct: 318  TGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSW 377

Query: 1251 RCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
            +CCE  SAVCC D  HCCPHDYPVCDT R++CLK+ GN T +KP  KK  S
Sbjct: 378  KCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKDSS 428


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  578 bits (1491), Expect = e-162
 Identities = 266/414 (64%), Positives = 321/414 (77%)
 Frame = +3

Query: 162  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 341
            + +  S  I++LFD WC  +GKTY SE+ER+HR+++F  N+++V QHN  +NS+Y+LSLN
Sbjct: 25   ISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLN 84

Query: 342  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 521
            AFADLT+HEFKA  LGLS  +  L+ +  +      G +      +P S+DWR KGAVT 
Sbjct: 85   AFADLTHHEFKASRLGLSAPSPSLMAKEQSL-----GVSERVRVKVPDSVDWRKKGAVTN 139

Query: 522  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 701
            VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF
Sbjct: 140  VKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEF 199

Query: 702  IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 881
            +IKN GIDTE+DYPY+ +DGTC KDKLK+ VVTIDSYA + + NEK L+EAVA+QPVSVG
Sbjct: 200  VIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVG 259

Query: 882  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1061
            ICGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+MHM
Sbjct: 260  ICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHM 319

Query: 1062 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1241
             RNTG +EGVCGINMLA                  T+CNLFTYCSSGETCCCAR   G+C
Sbjct: 320  QRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLC 379

Query: 1242 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
              W+CCE  SAVCCKD  HCCP DYPVCDT +++CLK+ GN T +KP  KK+ S
Sbjct: 380  FSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPFWKKNSS 433


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  578 bits (1491), Expect = e-162
 Identities = 268/430 (62%), Positives = 321/430 (74%)
 Frame = +3

Query: 114  MCWXXXXXXXXXXXXQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 293
            M W            Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60

Query: 294  NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 473
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDV 119

Query: 474  DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 653
            D PSSLDWR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179

Query: 654  YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 833
            YN GCGGGLMDYA+EF+IKN GIDTE+DYP+R ++GTCNK+KL+R VVTID Y DIP  +
Sbjct: 180  YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239

Query: 834  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWII 1013
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPC T LDHAVLIVGY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299

Query: 1014 KNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYC 1193
            KNSWG  WG+NGY+HM RN+G  EG+CG+N LA                  ++C+ FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359

Query: 1194 SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 1373
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419

Query: 1374 VKPIAKKSFS 1403
            V+   K+ F+
Sbjct: 420  VQQPQKEPFT 429


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  572 bits (1474), Expect = e-160
 Identities = 262/392 (66%), Positives = 314/392 (80%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            SS IS LF+SW KE+GKTYTS++++ +R K+FE+NYE+V +HN + NSSYTLSLNAFADL
Sbjct: 25   SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADL 84

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS  ++   +   N P      +FV   D+P S+DWR KGAV+ VKDQG
Sbjct: 85   THHEFKASRLGLSAFSTSGKLSRRNFPLH----DFV--GDVPISIDWRKKGAVSQVKDQG 138

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            +CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD+SYN+GC GGLMDYAY+F+I+N 
Sbjct: 139  NCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENN 198

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTEEDYPY+ R+ TCNK+KLKRHVVTID Y D+P  NEK+LL+AVA QPVSVGICGS+
Sbjct: 199  GIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSE 258

Query: 897  SSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTG 1076
             +FQLYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG +WG+NGYM+MLRN+G
Sbjct: 259  RAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSG 318

Query: 1077 TAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRC 1256
             ++G+CGINMLA                  T+C+LFT C  GETCCC R   G+C  W+C
Sbjct: 319  NSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKC 378

Query: 1257 CEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1352
            CE  SAVCCKD  HCCPHDYPVCDTKRN+CLK
Sbjct: 379  CELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  569 bits (1466), Expect = e-159
 Identities = 265/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%)
 Frame = +3

Query: 162  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 341
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 342  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 521
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGTLR--DVPASIDWRKKGAVTE 131

Query: 522  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 701
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 702  IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 881
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LL+AV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 882  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1061
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLIVGYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 1062 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1241
             RNTG + G+CGINMLA                  TRC+L TYC++GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGIC 371

Query: 1242 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1403
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R  GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAEAIEMRGSS 426


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  568 bits (1464), Expect = e-159
 Identities = 264/415 (63%), Positives = 318/415 (76%), Gaps = 1/415 (0%)
 Frame = +3

Query: 162  VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLN 341
            +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QHN   NSS+TLSLN
Sbjct: 17   LPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLN 76

Query: 342  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTG 521
            AFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P+S+DWR KGAVT 
Sbjct: 77   AFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGNLR--DVPASIDWRKKGAVTE 131

Query: 522  VKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEF 701
            VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN GCGGGLMDYAY+F
Sbjct: 132  VKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQF 191

Query: 702  IIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVG 881
            +IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+LL+AV  QPVSVG
Sbjct: 192  VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVG 251

Query: 882  ICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHM 1061
            ICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GYDS++G DYWIIKNSWGR WGMNGYMHM
Sbjct: 252  ICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHM 311

Query: 1062 LRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGIC 1241
             RNTG + G+CGINMLA                  TRC+L TYC+ GETCCC    LGIC
Sbjct: 312  QRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGIC 371

Query: 1242 LKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVKPIAKKSFS 1403
            L W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R+ GN T  + I  +  S
Sbjct: 372  LSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAEAIEMRGSS 426


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  567 bits (1461), Expect = e-159
 Identities = 266/437 (60%), Positives = 318/437 (72%), Gaps = 28/437 (6%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            S  IS+LFD WC+ +GKTY SE E++HR ++F  N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 27   SDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQHNLITNATYSLSLNAFADL 86

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
             + EFK   LGLS SA  +++       A  G +      +P SLDWR KGAVT VKDQG
Sbjct: 87   NHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVPDSLDWRKKGAVTNVKDQG 139

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYNDGC GGLMDYA+EF+IKNK
Sbjct: 140  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNDGCNGGLMDYAFEFVIKNK 199

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTE+DYPY+ RDGTC KDKLK+ VV+IDSYA +   +EK LLEAVA QPVSVGICGS+
Sbjct: 200  GIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKALLEAVAAQPVSVGICGSE 259

Query: 897  SSFQLYSG----------------------------GIFTGPCSTSLDHAVLIVGYDSKD 992
             +FQLYS                             GIF+GPCSTSLDHAVLIVGY S++
Sbjct: 260  RAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGPCSTSLDHAVLIVGYGSQN 319

Query: 993  GKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTR 1172
            G DYWI+KNSWG+ WGM+G+MHM RNTG ++G+CGINMLA                  T+
Sbjct: 320  GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASYPIKTHPNPPPPSPPGPTK 379

Query: 1173 CNLFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1352
            CNLFTYCS+ ETCCCAR+  G+CL W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 380  CNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 439

Query: 1353 RIGNSTFVKPIAKKSFS 1403
            + GN T +KP  KK+ S
Sbjct: 440  KTGNFTAIKPFWKKNSS 456


>gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  566 bits (1459), Expect = e-158
 Identities = 262/405 (64%), Positives = 316/405 (78%)
 Frame = +3

Query: 180  SSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLT 359
            S IS LF++WC ++GK Y+SE+E+ +RLKVFE+NY +V QHN   NSSY+L+LNAFADLT
Sbjct: 24   SHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLT 83

Query: 360  NHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGS 539
            +HEFKA  LGLS +A      +      +  P  V+  DIP+S+DWR KGAVT VKDQGS
Sbjct: 84   HHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASMDWRTKGAVTKVKDQGS 135

Query: 540  CGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKG 719
            CGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD+SYN GC GGLMDYAY+F+I N G
Sbjct: 136  CGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHG 195

Query: 720  IDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDS 899
            ID EEDYPY GR+ TCNK+K KR VVTID YA +PA NE  LL+AVA QPVSVGICGS+ 
Sbjct: 196  IDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSER 255

Query: 900  SFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGT 1079
            +FQLYS GIFTGPCS+SLDHAVLIVGY S++G DYWI+KNSWG  WGMNGY+HMLRN+G 
Sbjct: 256  AFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGD 315

Query: 1080 AEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCC 1259
            ++G+CGINMLA                  T+C+LFTYCS+GETCCC     GIC  W+CC
Sbjct: 316  SKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCC 375

Query: 1260 EAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKK 1394
            E  SAVCCKD+ HCCP+DYPVCDTK++ CLKR+GN+T ++   K+
Sbjct: 376  ELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEKR 420


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  561 bits (1445), Expect = e-157
 Identities = 259/413 (62%), Positives = 314/413 (76%)
 Frame = +3

Query: 165  PTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNA 344
            P   +S++S+LF+ WC E+GK+Y+S +E+ +RL VF  NYE+V  HN   NSSYTLSLN+
Sbjct: 18   PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNS 77

Query: 345  FADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGV 524
            +ADLT+HEFK   LG SP+  +    L   P+           D+P SLDWR KGAVT V
Sbjct: 78   YADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDSLDWRKKGAVTAV 129

Query: 525  KDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFI 704
            KDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN GCGGGLMDYAY+F+
Sbjct: 130  KDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFV 189

Query: 705  IKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGI 884
            I N GIDTE DYPY+ RDG+C KDKL+R+VVTID YADIP+ +E KLL+AVA QPVSVGI
Sbjct: 190  ISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGI 249

Query: 885  CGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHML 1064
            CGS+ +FQLYS GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+GYMHM 
Sbjct: 250  CGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQ 309

Query: 1065 RNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1244
            RN+G +EGVCGIN LA                  T+C++ T C++GETCCCA+ FLG+CL
Sbjct: 310  RNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCL 369

Query: 1245 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
             W+CC   SAVCCKD  HCCP DYP+CDT RN+CLK+  N T  + +  +S S
Sbjct: 370  SWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILENRSSS 422


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  560 bits (1444), Expect = e-157
 Identities = 259/399 (64%), Positives = 310/399 (77%), Gaps = 7/399 (1%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            S  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN+  N++Y+LSLNAFADL
Sbjct: 23   SDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADL 82

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LGLS SA  +++       A  G +      +P S+DWR KGAVT VKDQG
Sbjct: 83   THHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQG 135

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNK 716
            SCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN GC GGLMDYA+EF+IKN 
Sbjct: 136  SCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNH 195

Query: 717  GIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSD 896
            GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L+EAVA QPVSVGICGS+
Sbjct: 196  GIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSE 255

Query: 897  SSFQLYSG-------GIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYM 1055
             +FQLYS        GIF+GPCSTSLDHAVLIVGY S++G DYWI+KNSWG+ WGM+G+M
Sbjct: 256  RAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFM 315

Query: 1056 HMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLG 1235
            HM RNT  ++GVCGINMLA                  T+CNLFTYCSSGETCCCAR   G
Sbjct: 316  HMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFG 375

Query: 1236 ICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 1352
            +C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 376  LCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
            gi|302142569|emb|CBI19772.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  560 bits (1443), Expect = e-157
 Identities = 255/425 (60%), Positives = 324/425 (76%)
 Frame = +3

Query: 174  KSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFAD 353
            ++SS +DLF++WC++YGKTY+SE+E+  RLKVFE+N+ +V QHN  AN+SYTL+LNAFAD
Sbjct: 21   EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80

Query: 354  LTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQ 533
            LT+HEFKA  LG SP  +  +        ++  P  VQE  +P ++DWR  GAVTGVKDQ
Sbjct: 81   LTHHEFKASRLGFSPGRAQSI-------RSVGTP--VQELHVPPAVDWRKSGAVTGVKDQ 131

Query: 534  GSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKN 713
            G+CG CWSFS TGA+EGIN+I TGSLVSLSEQEL+DCD+SYN GC GGLMDYAY+F+IKN
Sbjct: 132  GNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKN 191

Query: 714  KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 893
            +GID+E DYPY G D  CNK+KLK+H+VTID Y DIP  +EK+LL+ VA QPVSVGICGS
Sbjct: 192  QGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGS 251

Query: 894  DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1073
            + +FQLYS G++TGPCS++LDHAVLIVGY ++DG D+WI+KNSWG +WGM GY+HMLRN 
Sbjct: 252  EKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNN 311

Query: 1074 GTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWR 1253
            GTAEG+CGINMLA                  T+C+ F+ CS GETCCC+  F+G+CL W 
Sbjct: 312  GTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGETCCCSWRFIGVCLSWN 371

Query: 1254 CCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFSI*LSKQCGYT 1433
            CC A SAVCC ++ +CCP  +P+CDTKRN CLK  GN T V+ + ++  S+   K  G++
Sbjct: 372  CCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVLKRRGSSV---KFGGWS 428

Query: 1434 SYNSA 1448
            S N A
Sbjct: 429  SINDA 433


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  558 bits (1438), Expect = e-156
 Identities = 264/396 (66%), Positives = 298/396 (75%)
 Frame = +3

Query: 189  SDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADLTNHE 368
            S LF  WCK++GKTY SEQE+ +R  VFE NY +V QHN   NSSYTLSLNAFADLT+HE
Sbjct: 27   SKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHE 86

Query: 369  FKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQGSCGA 548
            FKA  LGL PS S L  + N         +F+Q   +PS +DWR  GAV+ VKDQGSCGA
Sbjct: 87   FKATRLGLPPS-SLLRFKFNRFQDQQRSDDFLQ---VPSEIDWRKNGAVSIVKDQGSCGA 142

Query: 549  CWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYAYEFIIKNKGIDT 728
            CWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +YN GC GGLMDYAY+FII N GIDT
Sbjct: 143  CWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDT 202

Query: 729  EEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGSDSSFQ 908
            EEDYPY+ R   C KDKLKR VVTID Y D+P  +EKKLL+AVA QPVSVGICGS  +FQ
Sbjct: 203  EEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQ 262

Query: 909  LYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEG 1088
            LYS GIFTGPCSTSLDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY+HMLRNT ++ G
Sbjct: 263  LYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAG 322

Query: 1089 VCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICLKWRCCEAV 1268
            +CGINMLA                   +CNLFTYCS GETCCCA+ FLGIC  W+CC   
Sbjct: 323  LCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVT 382

Query: 1269 SAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFV 1376
            SAVCCKD  HCCP DYPVCD     CLKRI N T +
Sbjct: 383  SAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTIL 418


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  547 bits (1409), Expect = e-153
 Identities = 256/402 (63%), Positives = 307/402 (76%), Gaps = 4/402 (0%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRANSSYTLSLNAFADL 356
            SSS S+LF++WCK+YGK+Y+S++E+ +RL +FE+N  ++ QHN   NSSYTLSLN+F+DL
Sbjct: 25   SSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFITQHNDLGNSSYTLSLNSFSDL 84

Query: 357  TNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSLDWRNKGAVTGVKDQG 536
            T+HEFKA  LG SP+   L  + +  P+ +          +PSS+DWR  GAVT VKDQG
Sbjct: 85   THHEFKASRLGFSPTFLRLYRKSDPKPSVV--------RHVPSSIDWRKNGAVTNVKDQG 136

Query: 537  SCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY-NDGCGGGLMDYAYEFIIKN 713
            SCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+ Y N GC GGLMD A++FII N
Sbjct: 137  SCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVYPNSGCNGGLMDDAFQFIIDN 196

Query: 714  KGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPVSVGICGS 893
             GIDTEEDYPY+G DGTCNK KLKRHVVTID Y D+PA NE++LL+AVATQPVSVGI GS
Sbjct: 197  NGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANNEEQLLKAVATQPVSVGIAGS 256

Query: 894  DSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNT 1073
               FQ YS GIF GPCST+LDHAVLIVGY S++G DYWI+KNSWG+ WGMNGY+H+LR+ 
Sbjct: 257  GREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIVKNSWGKNWGMNGYIHILRDH 316

Query: 1074 GTAEGVCGINMLA---XXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFLGICL 1244
              ++G+CGINMLA                     T+C+LF+ C  GETCCCAR  LGICL
Sbjct: 317  SNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLFSKCGVGETCCCARKILGICL 376

Query: 1245 KWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNST 1370
             WRCCE  SAVCCKD  HCCPHDYP+CDT+RN CL+  GN T
Sbjct: 377  SWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGNLT 418


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  543 bits (1400), Expect = e-152
 Identities = 259/417 (62%), Positives = 307/417 (73%), Gaps = 8/417 (1%)
 Frame = +3

Query: 177  SSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIRAN-----SSYTLSLN 341
            +S  S+LF+ WCKE+ KTY+SE+E+ +RLKVFE NY +V QHN  AN     SSYTLSLN
Sbjct: 26   ASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLN 85

Query: 342  AFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD---IPSSLDWRNKGA 512
            AFADLT+HEFK   LGL  +    L+R          P   Q  D   IPS +DWR  GA
Sbjct: 86   AFADLTHHEFKTTRLGLPLT----LLRFKR-------PQNQQSRDLLHIPSQIDWRQSGA 134

Query: 513  VTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCGGGLMDYA 692
            VT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD SYN GCGGGLMD+A
Sbjct: 135  VTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFA 194

Query: 693  YEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLEAVATQPV 872
            Y+F+I NKGIDTE+DYPY+ R  +C+KDKLKR  VTI+ Y D+P  +E+++L+AVA+QPV
Sbjct: 195  YQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVDVP-PSEEEILKAVASQPV 253

Query: 873  SVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIVGYDSKDGKDYWIIKNSWGRYWGMNGY 1052
            SVGICGS+  FQLYS GIFTGPCST LDHAVLIVGY S++G DYWI+KNSWG+YWGMNGY
Sbjct: 254  SVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGY 313

Query: 1053 MHMLRNTGTAEGVCGINMLAXXXXXXXXXXXXXXXXXXTRCNLFTYCSSGETCCCARHFL 1232
            +HM+RN+G ++G+CGIN LA                   RCNLFT+CS GETCCCA+ FL
Sbjct: 314  IHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCNLFTHCSEGETCCCAKSFL 373

Query: 1233 GICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 1403
            GIC  W+CC   SAVCCKD  HCCP DYP+CDT+R  CLKR  N T       + FS
Sbjct: 374  GICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRTANGTTTITSENQDFS 430


Top