BLASTX nr result

ID: Rehmannia24_contig00008865 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia24_contig00008865
         (1726 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307688.2| cysteine protease family protein [Populus tr...   600   e-169
gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlise...   600   e-169
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   595   e-167
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          591   e-166
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   588   e-165
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   584   e-164
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   583   e-164
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   582   e-163
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   580   e-163
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   578   e-162
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   578   e-162
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   573   e-161
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   571   e-160
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   568   e-159
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   563   e-158
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   562   e-157
ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis ...   562   e-157
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   562   e-157
ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [F...   550   e-154
ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine...   549   e-153

>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  600 bits (1548), Expect = e-169
 Identities = 278/428 (64%), Positives = 333/428 (77%)
 Frame = -2

Query: 1617 MCWLLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 1438
            M +L  F   +L+    P+  SS IS LF++WCKE+GK+YTS++ER HRLKVFE NY++V
Sbjct: 1    MNFLYIFALTLLISVLSPSTSSSDISQLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFV 60

Query: 1437 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 1258
             +HN + NSSY+L+LNAFADLT+HEFK   LGLS +  +L  R N     + G       
Sbjct: 61   TKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHR-NLEITGVVG------- 112

Query: 1257 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1078
            DIP+S+DWRNKG VT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELI+CDKS
Sbjct: 113  DIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIECDKS 172

Query: 1077 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 898
            YNDGCGGGLMDYA++F+I N GIDTEEDYPYR RDGTCNKD++KR VVTID Y D+P  N
Sbjct: 173  YNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKYVDVPENN 232

Query: 897  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWII 718
            EK+LL+AVA QPVSVGICGS+ +FQ+YS GIFTGPCSTSLDHAVLI+GY S++G DYWI+
Sbjct: 233  EKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIV 292

Query: 717  KNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYC 538
            KNSWG  WGM GYMHM RN+G ++GVCG+NMLA                 PT+CNL TYC
Sbjct: 293  KNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTKCNLLTYC 352

Query: 537  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 358
            ++GETCCCAR F GIC+ W+CC   SAVCCKD  HCCPHDYPVCDT +N+C KR GN+T 
Sbjct: 353  AAGETCCCARKFFGICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCFKRAGNATR 412

Query: 357  VKPIAKKS 334
            ++ I  K+
Sbjct: 413  MEAIEGKT 420


>gb|EPS60205.1| hypothetical protein M569_14597, partial [Genlisea aurea]
          Length = 424

 Score =  600 bits (1548), Expect = e-169
 Identities = 283/424 (66%), Positives = 333/424 (78%), Gaps = 2/424 (0%)
 Frame = -2

Query: 1611 WL-LFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVN 1435
            WL L  + L LL  QV    SSSISDLFDSWC+E+GKTY SE+EREHRL VF +NY+++ 
Sbjct: 5    WLSLCLIQLFLL--QVHPIVSSSISDLFDSWCQEHGKTYVSEEEREHRLGVFSENYDFIA 62

Query: 1434 QHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD 1255
             HN RAN SYTLSLNAFADLT  EF  +YLG SPS  DLLIR N G  +    N+   S 
Sbjct: 63   SHNARANYSYTLSLNAFADLTRSEFGGRYLGFSPSGHDLLIRKNRGSGSYRSRNY---SA 119

Query: 1254 IPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY 1075
            +PSS+DWR KGAVTG+KDQGSCGACWSFSATGA+EGINQI TGSLVSLSEQELIDCD SY
Sbjct: 120  VPSSIDWRKKGAVTGIKDQGSCGACWSFSATGAIEGINQIVTGSLVSLSEQELIDCDHSY 179

Query: 1074 NDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNE 895
            N GC GGLMDYAYEFI+KNKGIDTEEDY Y+GRD +C+++KL + VVTIDSY DIP KNE
Sbjct: 180  NQGCNGGLMDYAYEFILKNKGIDTEEDYSYKGRDASCSQNKLNKRVVTIDSYVDIPEKNE 239

Query: 894  KKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIK 715
            + LLEAVA+QPVSVGI G D+ FQ YS GIFTGPCSTSLDHAVLI+GYDSK+GKDYWI+K
Sbjct: 240  QMLLEAVASQPVSVGISGGDAPFQFYSQGIFTGPCSTSLDHAVLIVGYDSKNGKDYWIVK 299

Query: 714  NSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLA-XXXXXXXXXXXXXXXXXPTRCNLFTYC 538
            NSWG+ WGM+GYM++ RNTG   G+C +NM+A                  PT+C+LF+YC
Sbjct: 300  NSWGKSWGMDGYMYVQRNTGNQNGICEINMMASYPVKTNPNPSPSPSPPGPTKCSLFSYC 359

Query: 537  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 358
            S GETCCCAR FLG+C++++CC A SAVCC+D+ HCCP DYP+CDT +++C K  GNST 
Sbjct: 360  SQGETCCCARRFLGLCMRYKCCGAESAVCCEDNVHCCPQDYPICDTAQSVCRKMSGNSTM 419

Query: 357  VKPI 346
              P+
Sbjct: 420  AIPV 423


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  595 bits (1535), Expect = e-167
 Identities = 276/430 (64%), Positives = 334/430 (77%)
 Frame = -2

Query: 1617 MCWLLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 1438
            M WLL  + L+LL  Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MNWLLPSLVLVLLIFQQPFCTCSSISDLFETWCQQNGKKYSSEQERVYRFKVFEENYAYI 60

Query: 1437 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 1258
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSETGVLSDV 119

Query: 1257 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1078
            D PSSLDWR KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1077 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 898
            YN+GCGGGLMDYA+EF+IKN GIDTE+DYP+R R+GTCNK+KL+RHVVTID Y DIP  +
Sbjct: 180  YNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTIDGYTDIPQND 239

Query: 897  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWII 718
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPCST+LDHAVLI+GY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGSENGVDYWII 299

Query: 717  KNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYC 538
            KNSWG  WG+NGY+HM RN+G  EG+CG+N LA                 P++C++FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGPSKCSMFTSC 359

Query: 537  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 358
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGSKFLGICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNLCLKRMNNATI 419

Query: 357  VKPIAKKSFS 328
            V+   K++F+
Sbjct: 420  VQQPQKEAFT 429


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  591 bits (1523), Expect = e-166
 Identities = 274/424 (64%), Positives = 333/424 (78%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIR 1420
            FV  +L +  + +  SS I+ LF++WC+++GKTY S++E+  RLKVF+ NY++V +HN +
Sbjct: 8    FVAFLLSYLFLFSSSSSEIAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQ 67

Query: 1419 ANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSL 1240
             NSSYTLSLNAFADLT+HEFKA  LGLS +AS     LN   +    P+FV  +D+P+S+
Sbjct: 68   GNSSYTLSLNAFADLTHHEFKASRLGLSSAAS---ASLNVDRSNRQIPDFV--ADVPASV 122

Query: 1239 DWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCG 1060
            DWR  GAVT VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCDKSYN+GC 
Sbjct: 123  DWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCE 182

Query: 1059 GGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLE 880
            GG+MDYA++F+I N GIDTEEDYPY+GRD +CNK+KLKRHVVTID Y D+P  NEK+LL+
Sbjct: 183  GGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLK 242

Query: 879  AVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSWGR 700
            AVA QPVSVGICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GY S++G DYWI+KNSWG 
Sbjct: 243  AVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGS 302

Query: 699  YWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETC 520
            YWGM+GYMHM RN+G++ G+CG+NMLA                 PTRC+LFT+C  GETC
Sbjct: 303  YWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETC 362

Query: 519  CCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAK 340
            CC  H  GICL W+CCE  SAVCCKD  HCCP DYPVCDT RNICLK  GN+T ++  AK
Sbjct: 363  CCVHHIFGICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNICLKHYGNATRIEKFAK 422

Query: 339  KSFS 328
             S S
Sbjct: 423  NSSS 426


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  588 bits (1516), Expect = e-165
 Identities = 274/430 (63%), Positives = 329/430 (76%)
 Frame = -2

Query: 1617 MCWLLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYV 1438
            M WLL  + L+LL  Q P C  SSISDLF++WC++ GK Y+SEQER +R KVFE+NY Y+
Sbjct: 1    MKWLLPSLVLVLLIFQQPLCTCSSISDLFETWCQQNGKKYSSEQERMYRFKVFEENYAYI 60

Query: 1437 NQHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQES 1258
             +HN + NSSYTL LNA++DLT+HEF+  +LGLS SA+D  IRL    +       + + 
Sbjct: 61   TEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDF-IRLKGRGSGSSAAGVLSDV 119

Query: 1257 DIPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKS 1078
            D PSSLDWR+KGAVT VK+QGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+S
Sbjct: 120  DAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSEQELIDCDRS 179

Query: 1077 YNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 898
            YN GCGGGLMDYA+EF+IKN GIDTE+DYP+R ++GTCNK+KL+R VVTID Y DIP  +
Sbjct: 180  YNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTIDGYTDIPQND 239

Query: 897  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWII 718
            E KLL+AVATQPVSVGICGS  +FQ YS GIFTGPC T LDHAVLI+GY S++G DYWII
Sbjct: 240  EDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGSENGFDYWII 299

Query: 717  KNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYC 538
            KNSWG  WG+NGY+HM RN+G  EG+CGVN LA                 P++C+ FT C
Sbjct: 300  KNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGPSKCSTFTSC 359

Query: 537  SSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTF 358
              GETCCC   FLGICL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLKR+ N+T 
Sbjct: 360  GQGETCCCGLKFLGICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNLCLKRMSNATI 419

Query: 357  VKPIAKKSFS 328
            V+   K+ F+
Sbjct: 420  VQQPQKEPFT 429


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  584 bits (1506), Expect = e-164
 Identities = 271/426 (63%), Positives = 330/426 (77%), Gaps = 2/426 (0%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSS--ISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            F+ L   F  + +  SSS  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN
Sbjct: 8    FISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN 67

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
            +  N++Y+LSLNAFADLT+HEFKA  LGLS SA  +++       A  G +      +P 
Sbjct: 68   LITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPD 120

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYA+EF+IKN GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 885  LEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSW 706
            +EAVA QPVSVGICGS+ +FQLYS GIF+GPCSTSLDHAVLI+GY S++G DYWI+KNSW
Sbjct: 241  MEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300

Query: 705  GRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGE 526
            G+ WGM+G+MHM RNT  ++GVCG+NMLA                 PT+CNLFTYCSSGE
Sbjct: 301  GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360

Query: 525  TCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 346
            TCCCAR   G+C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP 
Sbjct: 361  TCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 420

Query: 345  AKKSFS 328
             KK+ S
Sbjct: 421  WKKNSS 426


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  583 bits (1504), Expect = e-164
 Identities = 271/426 (63%), Positives = 330/426 (77%), Gaps = 2/426 (0%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSS--ISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            F+ L   F  + +  SSS  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN
Sbjct: 8    FISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN 67

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
            +  N++Y+LSLNAFADLT+HEFKA  LGLS SA  +++       A  G +      +P 
Sbjct: 68   LITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPD 120

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYA+EF+IKN GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 885  LEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSW 706
            +EAVA QPVSVGICGS+ +FQLYS GIF+GPCSTSLDHAVLI+GY S++G DYWI+KNSW
Sbjct: 241  MEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 300

Query: 705  GRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGE 526
            G+ WGM+G+MHM RNT  ++GVCG+NMLA                 PT+CNLFTYCSSGE
Sbjct: 301  GKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 360

Query: 525  TCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 346
            TCCCAR   G+C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK+ GN T +KP 
Sbjct: 361  TCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 420

Query: 345  AKKSFS 328
             KK+ S
Sbjct: 421  WKKNSS 426


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  582 bits (1501), Expect = e-163
 Identities = 273/428 (63%), Positives = 330/428 (77%), Gaps = 4/428 (0%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSS--ISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            FV L   F  + +  SSS  IS+LFD WC+ +GKTY SE+ER+ R+++F+ N+++V QHN
Sbjct: 8    FVSLTFFFLLLVSSPSSSDDISELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHN 67

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
            +  N++Y+LSLNAFADLT+HEFKA  LGLS SAS L++       A  G +    + +P 
Sbjct: 68   LITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIM-------ASKGQSLGGNAKVPD 120

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN G
Sbjct: 121  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 180

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYA+EF+IKN GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L
Sbjct: 181  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 240

Query: 885  LEAVATQPVSVGICGSDSSFQLYS--GGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKN 712
             EAVA QPVSVGICGS+ +FQLYS   GIF+GPCSTSLDHAVLI+GY S++G DYWI+KN
Sbjct: 241  REAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKN 300

Query: 711  SWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSS 532
            SWG+ WGM+G+MHM RNTG +EG+CG+NMLA                 PT+CNLFTYCS+
Sbjct: 301  SWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSA 360

Query: 531  GETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVK 352
            GETCCCAR+  G+C  W+CCE  SAVCC D  HCCPHDYPVCDT R++CLK+ GN T +K
Sbjct: 361  GETCCCARNLFGLCFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIK 420

Query: 351  PIAKKSFS 328
            P  KK  S
Sbjct: 421  PFWKKDSS 428


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  580 bits (1494), Expect = e-163
 Identities = 269/426 (63%), Positives = 328/426 (76%), Gaps = 1/426 (0%)
 Frame = -2

Query: 1602 FFVHLILLFS-QVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            FF+ L+   S  + +  S  I++LFD WC  +GKTY SE+ER+HR+++F  N+++V QHN
Sbjct: 13   FFLLLVSSLSFSISSSSSDDIAELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHN 72

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
              +NS+Y+LSLNAFADLT+HEFKA  LGLS  +  L+ +  +      G +      +P 
Sbjct: 73   HISNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQSL-----GVSERVRVKVPD 127

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN G
Sbjct: 128  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 187

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYA+EF+IKN GIDTE+DYPY+ +DGTC KDKLK+ VVTIDSYA + + NEK L
Sbjct: 188  CNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSYAGVASNNEKAL 247

Query: 885  LEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSW 706
            +EAVA+QPVSVGICGS+ +FQLYS GIF+GPCSTSLDHAVLI+GY S++G DYWI+KNSW
Sbjct: 248  MEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSW 307

Query: 705  GRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGE 526
            G+ WGM+G+MHM RNTG +EGVCG+NMLA                 PT+CNLFTYCSSGE
Sbjct: 308  GKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGE 367

Query: 525  TCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 346
            TCCCAR   G+C  W+CCE  SAVCCKD  HCCP DYPVCDT +++CLK+ GN T +KP 
Sbjct: 368  TCCCARTLFGLCFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCLKKTGNFTEIKPF 427

Query: 345  AKKSFS 328
             KK+ S
Sbjct: 428  WKKNSS 433


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  578 bits (1491), Expect = e-162
 Identities = 272/428 (63%), Positives = 327/428 (76%), Gaps = 1/428 (0%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQH 1429
            L FF+  ILL S +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QH
Sbjct: 4    LAFFLLSILLLSSLPPNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63

Query: 1428 NIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIP 1249
            N   NSS+TLSLNAFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P
Sbjct: 64   NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGNLR--DVP 118

Query: 1248 SSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1069
            +S+DWR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN 
Sbjct: 119  ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178

Query: 1068 GCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKK 889
            GCGGGLMDYAY+F+IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+
Sbjct: 179  GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238

Query: 888  LLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNS 709
            LL+AV  QPVSVGICGS+ +FQLYS GIFTGPCSTSLDHAVLIIGYDS++G DYWIIKNS
Sbjct: 239  LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENGVDYWIIKNS 298

Query: 708  WGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSG 529
            WGR WGMNGYMHM RNTG + G+CG+NMLA                 PTRC+L TYC+ G
Sbjct: 299  WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAPG 358

Query: 528  ETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVK 352
            ETCCC    LGICL W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R+ GN T  +
Sbjct: 359  ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRLTGNVTAAE 418

Query: 351  PIAKKSFS 328
             I  +  S
Sbjct: 419  AIEMRGSS 426


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  578 bits (1491), Expect = e-162
 Identities = 271/428 (63%), Positives = 327/428 (76%), Gaps = 1/428 (0%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQH 1429
            L FF+  ILL S +P    S I++LF++WCK++GK Y+SEQE++ RLK+FE NY +V QH
Sbjct: 4    LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQH 63

Query: 1428 NIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIP 1249
            N   NSS+TLSLNAFADLT+ EFKA +LG S ++ D   R N   A++  P  ++  D+P
Sbjct: 64   NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRN---ASVQSPGTLR--DVP 118

Query: 1248 SSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1069
            +S+DWR KGAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELIDCD+SYN 
Sbjct: 119  ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178

Query: 1068 GCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKK 889
            GCGGGLMDYAY+F+IKN GIDTE+DYPYRG+ G CNK KL RH+VTID Y D+P  NEK+
Sbjct: 179  GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238

Query: 888  LLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNS 709
            LL+AV  QPVSVGICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GYDS++G DYWIIKNS
Sbjct: 239  LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNS 298

Query: 708  WGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSG 529
            WGR WGMNGYMHM RNTG + G+CG+NMLA                 PTRC+L TYC++G
Sbjct: 299  WGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRCSLLTYCAAG 358

Query: 528  ETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI-GNSTFVK 352
            ETCCC    LGICL W+CC   SAVCC DH +CCP +YP+CD+ R+ CL R  GN T  +
Sbjct: 359  ETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTRFTGNVTAAE 418

Query: 351  PIAKKSFS 328
             I  +  S
Sbjct: 419  AIEMRGSS 426


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  573 bits (1478), Expect = e-161
 Identities = 268/410 (65%), Positives = 322/410 (78%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQH 1429
            LLFF   I  FS      SS IS LF+SW KE+GKTYTS++++ +R K+FE+NYE+V +H
Sbjct: 12   LLFFNLSISSFSS-----SSDISKLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKH 66

Query: 1428 NIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIP 1249
            N + NSSYTLSLNAFADLT+HEFKA  LGLS  ++   +   N P      +FV   D+P
Sbjct: 67   NSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRRNFPLH----DFV--GDVP 120

Query: 1248 SSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1069
             S+DWR KGAV+ VKDQG+CGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD+SYN+
Sbjct: 121  ISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDRSYNN 180

Query: 1068 GCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKK 889
            GC GGLMDYAY+F+I+N GIDTEEDYPY+ R+ TCNK+KLKRHVVTID Y D+P  NEK+
Sbjct: 181  GCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQNNEKE 240

Query: 888  LLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNS 709
            LL+AVA QPVSVGICGS+ +FQLYS GIFTGPCSTSLDHAVLI+GY S++G DYWI+KNS
Sbjct: 241  LLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNS 300

Query: 708  WGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSG 529
            WG +WG+NGYM+MLRN+G ++G+CG+NMLA                 PT+C+LFT C  G
Sbjct: 301  WGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKCDLFTRCGEG 360

Query: 528  ETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 379
            ETCCC R   G+C  W+CCE  SAVCCKD  HCCPHDYPVCDTKRN+CLK
Sbjct: 361  ETCCCTRRIFGLCFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  571 bits (1471), Expect = e-160
 Identities = 271/455 (59%), Positives = 327/455 (71%), Gaps = 28/455 (6%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQH 1429
            L FF   +LL S   +  S  IS+LFD WC+ +GKTY SE E++HR ++F  N+++V QH
Sbjct: 11   LTFF--FLLLVSSSSSSSSDDISELFDDWCQRHGKTYASEAEKQHRFQIFRDNHDFVTQH 68

Query: 1428 NIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIP 1249
            N+  N++Y+LSLNAFADL + EFK   LGLS SA  +++       A  G +      +P
Sbjct: 69   NLITNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIM-------ASKGKSLGGSVKVP 121

Query: 1248 SSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYND 1069
             SLDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYND
Sbjct: 122  DSLDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYND 181

Query: 1068 GCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKK 889
            GC GGLMDYA+EF+IKNKGIDTE+DYPY+ RDGTC KDKLK+ VV+IDSYA +   +EK 
Sbjct: 182  GCNGGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSIDSYAGVKPSDEKA 241

Query: 888  LLEAVATQPVSVGICGSDSSFQLYSG----------------------------GIFTGP 793
            LLEAVA QPVSVGICGS+ +FQLYS                             GIF+GP
Sbjct: 242  LLEAVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLKLYLMMQGIFSGP 301

Query: 792  CSTSLDHAVLIIGYDSKDGKDYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXX 613
            CSTSLDHAVLI+GY S++G DYWI+KNSWG+ WGM+G+MHM RNTG ++G+CG+NMLA  
Sbjct: 302  CSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSQGICGINMLASY 361

Query: 612  XXXXXXXXXXXXXXXPTRCNLFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTH 433
                           PT+CNLFTYCS+ ETCCCAR+  G+CL W+CCE  SAVCCKD  H
Sbjct: 362  PIKTHPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGLCLSWKCCEIESAVCCKDGRH 421

Query: 432  CCPHDYPVCDTKRNICLKRIGNSTFVKPIAKKSFS 328
            CCPHDYPVCDT R++CLK+ GN T +KP  KK+ S
Sbjct: 422  CCPHDYPVCDTTRSLCLKKTGNFTAIKPFWKKNSS 456


>gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  568 bits (1465), Expect = e-159
 Identities = 265/421 (62%), Positives = 322/421 (76%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNIR 1420
            F+   LLF        S IS LF++WC ++GK Y+SE+E+ +RLKVFE+NY +V QHN  
Sbjct: 8    FLLSFLLFFDPSFASPSHISHLFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGV 67

Query: 1419 ANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSSL 1240
             NSSY+L+LNAFADLT+HEFKA  LGLS +A      +      +  P  V+  DIP+S+
Sbjct: 68   GNSSYSLALNAFADLTHHEFKASRLGLSAAA------IEGSRPNLQLPGLVR--DIPASM 119

Query: 1239 DWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGCG 1060
            DWR KGAVT VKDQGSCGACWSFSATGA+EGIN+I TG+LVSLSEQEL+DCD+SYN GC 
Sbjct: 120  DWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQELVDCDRSYNSGCE 179

Query: 1059 GGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLLE 880
            GGLMDYAY+F+I N GID EEDYPY GR+ TCNK+K KR VVTID YA +PA NE  LL+
Sbjct: 180  GGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQ 239

Query: 879  AVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSWGR 700
            AVA QPVSVGICGS+ +FQLYS GIFTGPCS+SLDHAVLI+GY S++G DYWI+KNSWG 
Sbjct: 240  AVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGT 299

Query: 699  YWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGETC 520
             WGMNGY+HMLRN+G ++G+CG+NMLA                 PT+C+LFTYCS+GETC
Sbjct: 300  RWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETC 359

Query: 519  CCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIAK 340
            CC     GIC  W+CCE  SAVCCKD+ HCCP+DYPVCDTK++ CLKR+GN+T ++   K
Sbjct: 360  CCTHRIFGICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLKRVGNATRMEAFEK 419

Query: 339  K 337
            +
Sbjct: 420  R 420


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  563 bits (1452), Expect = e-158
 Identities = 268/420 (63%), Positives = 308/420 (73%)
 Frame = -2

Query: 1614 CWLLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVN 1435
            C    F+ L+L  S +    +   S LF  WCK++GKTY SEQE+ +R  VFE NY +V 
Sbjct: 3    CLRFMFLQLLLSLSLLSFVTAIDTSKLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVA 62

Query: 1434 QHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD 1255
            QHN   NSSYTLSLNAFADLT+HEFKA  LGL PS S L  + N         +F+Q   
Sbjct: 63   QHNQIGNSSYTLSLNAFADLTHHEFKATRLGLPPS-SLLRFKFNRFQDQQRSDDFLQ--- 118

Query: 1254 IPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY 1075
            +PS +DWR  GAV+ VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQEL+DCD +Y
Sbjct: 119  VPSEIDWRKNGAVSIVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCDTTY 178

Query: 1074 NDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNE 895
            N GC GGLMDYAY+FII N GIDTEEDYPY+ R   C KDKLKR VVTID Y D+P  +E
Sbjct: 179  NSGCDGGLMDYAYQFIIDNNGIDTEEDYPYQARQLLCKKDKLKRRVVTIDGYTDVPPNDE 238

Query: 894  KKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIK 715
            KKLL+AVA QPVSVGICGS  +FQLYS GIFTGPCSTSLDHAVLI+GY S++G DYWI+K
Sbjct: 239  KKLLKAVAVQPVSVGICGSARAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENGVDYWIVK 298

Query: 714  NSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCS 535
            NSWG+YWGMNGY+HMLRNT ++ G+CG+NMLA                 P +CNLFTYCS
Sbjct: 299  NSWGKYWGMNGYIHMLRNTDSSAGLCGINMLASYPTKTKPNPPVPPPPGPIKCNLFTYCS 358

Query: 534  SGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFV 355
             GETCCCA+ FLGIC  W+CC   SAVCCKD  HCCP DYPVCD     CLKRI N T +
Sbjct: 359  GGETCCCAKKFLGICFSWKCCGVTSAVCCKDKRHCCPLDYPVCDASNGQCLKRIANGTIL 418


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  562 bits (1449), Expect = e-157
 Identities = 263/416 (63%), Positives = 319/416 (76%), Gaps = 9/416 (2%)
 Frame = -2

Query: 1599 FVHLILLFSQVPTCKSSS--ISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            F+ L   F  + +  SSS  IS+LFD WC+++GKTY SE+ER+ R+++F+ N+++V QHN
Sbjct: 6    FISLTFFFLLLVSSSSSSDDISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHN 65

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
            +  N++Y+LSLNAFADLT+HEFKA  LGLS SA  +++       A  G +      +P 
Sbjct: 66   LITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIM-------ASKGQSLGGSVKVPD 118

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            S+DWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TG L+SLSEQELIDCDKSYN G
Sbjct: 119  SVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAG 178

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYA+EF+IKN GIDTE+DYPY+ RDGTC KDKLK+ VVTIDSYA + + +EK L
Sbjct: 179  CNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKAL 238

Query: 885  LEAVATQPVSVGICGSDSSFQLYSG-------GIFTGPCSTSLDHAVLIIGYDSKDGKDY 727
            +EAVA QPVSVGICGS+ +FQLYS        GIF+GPCSTSLDHAVLI+GY S++G DY
Sbjct: 239  MEAVAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDY 298

Query: 726  WIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLF 547
            WI+KNSWG+ WGM+G+MHM RNT  ++GVCG+NMLA                 PT+CNLF
Sbjct: 299  WIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGPTKCNLF 358

Query: 546  TYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLK 379
            TYCSSGETCCCAR   G+C  W+CCE  SAVCCKD  HCCPHDYPVCDT R++CLK
Sbjct: 359  TYCSSGETCCCARELFGLCFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
            gi|302142569|emb|CBI19772.3| unnamed protein product
            [Vitis vinifera]
          Length = 436

 Score =  562 bits (1449), Expect = e-157
 Identities = 257/441 (58%), Positives = 332/441 (75%)
 Frame = -2

Query: 1605 LFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHN 1426
            L+ V +++L       ++SS +DLF++WC++YGKTY+SE+E+  RLKVFE+N+ +V QHN
Sbjct: 5    LWAVSILILAVHSSVSEASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHN 64

Query: 1425 IRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPS 1246
              AN+SYTL+LNAFADLT+HEFKA  LG SP  +  +        ++  P  VQE  +P 
Sbjct: 65   SMANASYTLALNAFADLTHHEFKASRLGFSPGRAQSI-------RSVGTP--VQELHVPP 115

Query: 1245 SLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDG 1066
            ++DWR  GAVTGVKDQG+CG CWSFS TGA+EGIN+I TGSLVSLSEQEL+DCD+SYN G
Sbjct: 116  AVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSG 175

Query: 1065 CGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKL 886
            C GGLMDYAY+F+IKN+GID+E DYPY G D  CNK+KLK+H+VTID Y DIP  +EK+L
Sbjct: 176  CEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQL 235

Query: 885  LEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSW 706
            L+ VA QPVSVGICGS+ +FQLYS G++TGPCS++LDHAVLI+GY ++DG D+WI+KNSW
Sbjct: 236  LQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSW 295

Query: 705  GRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGE 526
            G +WGM GY+HMLRN GTAEG+CG+NMLA                 PT+C+ F+ CS GE
Sbjct: 296  GEHWGMRGYIHMLRNNGTAEGICGINMLASYPAKTSPNPPPPPTPGPTKCDFFSSCSEGE 355

Query: 525  TCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPI 346
            TCCC+  F+G+CL W CC A SAVCC ++ +CCP  +P+CDTKRN CLK  GN T V+ +
Sbjct: 356  TCCCSWRFIGVCLSWNCCTAKSAVCCDNNNYCCPASHPICDTKRNRCLKPAGNGTGVEVL 415

Query: 345  AKKSFSI*LSKQCGYTSYNSA 283
             ++  S+   K  G++S N A
Sbjct: 416  KRRGSSV---KFGGWSSINDA 433


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  562 bits (1448), Expect = e-157
 Identities = 262/425 (61%), Positives = 320/425 (75%)
 Frame = -2

Query: 1602 FFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQHNI 1423
            F    +LLF   P   +S++S+LF+ WC E+GK+Y+S +E+ +RL VF  NYE+V  HN 
Sbjct: 8    FLTLFLLLFR--PLSATSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNN 65

Query: 1422 RANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESDIPSS 1243
              NSSYTLSLN++ADLT+HEFK   LG SP+  +    L   P+           D+P S
Sbjct: 66   LDNSSYTLSLNSYADLTHHEFKVSRLGFSPALRNFRPVLPQEPSL--------PRDVPDS 117

Query: 1242 LDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSYNDGC 1063
            LDWR KGAVT VKDQGSCGACWSFSATGA+EGINQI TGSL+SLSEQELIDCD+SYN GC
Sbjct: 118  LDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGC 177

Query: 1062 GGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKNEKKLL 883
            GGGLMDYAY+F+I N GIDTE DYPY+ RDG+C KDKL+R+VVTID YADIP+ +E KLL
Sbjct: 178  GGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLL 237

Query: 882  EAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWIIKNSWG 703
            +AVA QPVSVGICGS+ +FQLYS GIF+GPCSTSLDHAVLI+GY S++G DYWI+KNSWG
Sbjct: 238  QAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWG 297

Query: 702  RYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCNLFTYCSSGET 523
            + WGM+GYMHM RN+G +EGVCG+N LA                 PT+C++ T C++GET
Sbjct: 298  KSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCSILTSCAAGET 357

Query: 522  CCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGNSTFVKPIA 343
            CCCA+ FLG+CL W+CC   SAVCCKD  HCCP DYP+CDT RN+CLK+  N T  + + 
Sbjct: 358  CCCAKKFLGLCLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQTMNGTRTEILE 417

Query: 342  KKSFS 328
             +S S
Sbjct: 418  NRSSS 422


>ref|XP_004291399.1| PREDICTED: cysteine proteinase RD21a-like [Fragaria vesca subsp.
            vesca]
          Length = 441

 Score =  550 bits (1417), Expect = e-154
 Identities = 261/422 (61%), Positives = 318/422 (75%), Gaps = 6/422 (1%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQ--VPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVN 1435
            LL  + L+LL S   + +  SSS S+LF++WCK+YGK+Y+S++E+ +RL +FE+N  ++ 
Sbjct: 5    LLSLLTLLLLLSHPCLSSSSSSSSSELFEAWCKQYGKSYSSQEEKLYRLSLFEQNLAFIT 64

Query: 1434 QHNIRANSSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQESD 1255
            QHN   NSSYTLSLN+F+DLT+HEFKA  LG SP+   L  + +  P+ +          
Sbjct: 65   QHNDLGNSSYTLSLNSFSDLTHHEFKASRLGFSPTFLRLYRKSDPKPSVV--------RH 116

Query: 1254 IPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELIDCDKSY 1075
            +PSS+DWR  GAVT VKDQGSCGACWSFSATGA+EGIN+I TGSLVSLSEQELIDCD+ Y
Sbjct: 117  VPSSIDWRKNGAVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQELIDCDRVY 176

Query: 1074 -NDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYADIPAKN 898
             N GC GGLMD A++FII N GIDTEEDYPY+G DGTCNK KLKRHVVTID Y D+PA N
Sbjct: 177  PNSGCNGGLMDDAFQFIIDNNGIDTEEDYPYQGADGTCNKQKLKRHVVTIDGYTDVPANN 236

Query: 897  EKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGKDYWII 718
            E++LL+AVATQPVSVGI GS   FQ YS GIF GPCST+LDHAVLI+GY S++G DYWI+
Sbjct: 237  EEQLLKAVATQPVSVGIAGSGREFQFYSKGIFAGPCSTTLDHAVLIVGYGSENGVDYWIV 296

Query: 717  KNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLA---XXXXXXXXXXXXXXXXXPTRCNLF 547
            KNSWG+ WGMNGY+H+LR+   ++G+CG+NMLA                    PT+C+LF
Sbjct: 297  KNSWGKNWGMNGYIHILRDHSNSKGLCGINMLASYPTKTGENPPFPPPSPPPGPTKCDLF 356

Query: 546  TYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRIGN 367
            + C  GETCCCAR  LGICL WRCCE  SAVCCKD  HCCPHDYP+CDT+RN CL+  GN
Sbjct: 357  SKCGVGETCCCARKILGICLSWRCCEFTSAVCCKDRLHCCPHDYPICDTERNYCLQANGN 416

Query: 366  ST 361
             T
Sbjct: 417  LT 418


>ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
          Length = 439

 Score =  549 bits (1414), Expect = e-153
 Identities = 266/435 (61%), Positives = 318/435 (73%), Gaps = 8/435 (1%)
 Frame = -2

Query: 1608 LLFFVHLILLFSQVPTCKSSSISDLFDSWCKEYGKTYTSEQEREHRLKVFEKNYEYVNQH 1429
            LL F+ LILLF+      +S  S+LF+ WCKE+ KTY+SE+E+ +RLKVFE NY +V QH
Sbjct: 9    LLQFLSLILLFTLF-FLSASDTSELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQH 67

Query: 1428 NIRAN-----SSYTLSLNAFADLTNHEFKAKYLGLSPSASDLLIRLNNGPAAIDGPNFVQ 1264
            N  AN     SSYTLSLNAFADLT+HEFK   LGL  +    L+R          P   Q
Sbjct: 68   NQNANNNNNNSSYTLSLNAFADLTHHEFKTTRLGLPLT----LLRFKR-------PQNQQ 116

Query: 1263 ESD---IPSSLDWRNKGAVTGVKDQGSCGACWSFSATGAVEGINQIKTGSLVSLSEQELI 1093
              D   IPS +DWR  GAVT VKDQ SCGACW+FSATGA+EGIN+I TGSLVSLSEQELI
Sbjct: 117  SRDLLHIPSQIDWRQSGAVTPVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 176

Query: 1092 DCDKSYNDGCGGGLMDYAYEFIIKNKGIDTEEDYPYRGRDGTCNKDKLKRHVVTIDSYAD 913
            DCD SYN GCGGGLMD+AY+F+I NKGIDTE+DYPY+ R  +C+KDKLKR  VTI+ Y D
Sbjct: 177  DCDTSYNSGCGGGLMDFAYQFVIDNKGIDTEDDYPYQARQRSCSKDKLKRRAVTIEDYVD 236

Query: 912  IPAKNEKKLLEAVATQPVSVGICGSDSSFQLYSGGIFTGPCSTSLDHAVLIIGYDSKDGK 733
            +P  +E+++L+AVA+QPVSVGICGS+  FQLYS GIFTGPCST LDHAVLI+GY S++G 
Sbjct: 237  VP-PSEEEILKAVASQPVSVGICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYGSENGV 295

Query: 732  DYWIIKNSWGRYWGMNGYMHMLRNTGTAEGVCGVNMLAXXXXXXXXXXXXXXXXXPTRCN 553
            DYWI+KNSWG+YWGMNGY+HM+RN+G ++G+CG+N LA                 P RCN
Sbjct: 296  DYWIVKNSWGKYWGMNGYIHMIRNSGNSKGICGINTLASYPVKTKPNPPIPPPPGPVRCN 355

Query: 552  LFTYCSSGETCCCARHFLGICLKWRCCEAVSAVCCKDHTHCCPHDYPVCDTKRNICLKRI 373
            LFT+CS GETCCCA+ FLGIC  W+CC   SAVCCKD  HCCP DYP+CDT+R  CLKR 
Sbjct: 356  LFTHCSEGETCCCAKSFLGICFSWKCCGLTSAVCCKDKRHCCPQDYPICDTRRGQCLKRT 415

Query: 372  GNSTFVKPIAKKSFS 328
             N T       + FS
Sbjct: 416  ANGTTTITSENQDFS 430

