BLASTX nr result

ID: Cocculus23_contig00003865 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00003865
         (1468 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          539   e-150
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   534   e-149
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   534   e-149
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   533   e-149
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   533   e-148
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   527   e-147
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   527   e-147
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  526   e-147
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   526   e-147
ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|5087...   525   e-146
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   525   e-146
ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   516   e-143
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   516   e-143
ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [S...   514   e-143
ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [S...   511   e-142
ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Caps...   507   e-141
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   507   e-141
ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...   506   e-140
ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria...   503   e-140
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...   501   e-139

>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  539 bits (1389), Expect = e-150
 Identities = 255/393 (64%), Positives = 298/393 (75%), Gaps = 3/393 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASM---VDRSKSLG 1296
            +VF+DN  FV +HNS  NS+Y + LNA+ADLTHHEF+AS+LGLS AAS    VDRS    
Sbjct: 52   KVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSSAASASLNVDRSN--- 108

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
             +    FV DVP S+DWRK GAVT VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSE
Sbjct: 109  -RQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSE 167

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QEL+DCD+SYN+GC GG+MDYAF+FV+ NHGIDTEEDYPYQ  DRSCN+ KLKR VVTID
Sbjct: 168  QELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQGRDRSCNKEKLKRHVVTID 227

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756
            GY D+P +NE+ELLKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGS
Sbjct: 228  GYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGS 287

Query: 755  ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576
            ENGVDYWI+KNSWG  WGMDGYMHMQRN+G  +G+CGINMLA                  
Sbjct: 288  ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINMLASYPKKTSPNPPPPAPPGP 347

Query: 575  TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396
            TRC L ++CG GETCCC   + G +CLSWKCCELDSAVCCKD  +CCP +YP+CDT    
Sbjct: 348  TRCDLFTHCGEGETCCCVHHIFG-ICLSWKCCELDSAVCCKDGRHCCPRDYPVCDTTRNI 406

Query: 395  CLRAAGNYSMVKPFEXXXXXXXXXXXXXLFEAW 297
            CL+  GN + ++ F              L E W
Sbjct: 407  CLKHYGNATRIEKFAKNSSSGKFRSWSSLLEGW 439


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  534 bits (1376), Expect = e-149
 Identities = 243/373 (65%), Positives = 297/373 (79%), Gaps = 1/373 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAA-SMVDRSKSLGTK 1290
            +VFEDN  FV +HNS  NS+Y + LNA+ADLTHHEF+ S+LGLS A  ++  R+  +   
Sbjct: 51   KVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTSRLGLSAAPLNLAHRNLEI--- 107

Query: 1289 ARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQE 1110
              +G VGD+P SIDWR KG VT VKDQ SCGACW+FS TGAIEGIN+IVTGSLVSLSEQE
Sbjct: 108  --TGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGAIEGINKIVTGSLVSLSEQE 165

Query: 1109 LIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGY 930
            LI+CD+SYN GCGGGLMDYAF+FV+ NHGIDTEEDYPY+A D +CN++++KRRVVTID Y
Sbjct: 166  LIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRARDGTCNKDRMKRRVVTIDKY 225

Query: 929  TDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSEN 750
             D+P +NE++LL+AVA+QPVSVG+CGSER FQ+YS GIF+GPC+TSLDHAV+IVGYGSEN
Sbjct: 226  VDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFTGPCSTSLDHAVLIVGYGSEN 285

Query: 749  GVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTR 570
            GVDYWI+KNSWG  WGM GYMHMQRN+G+ QGVCGINMLA                  T+
Sbjct: 286  GVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINMLASYPVKTSPNPPPPPPPGPTK 345

Query: 569  CSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL 390
            C+L++YC AGETCCC  +  G +C+SWKCC LDSAVCCKD ++CCPH+YP+CDT    C 
Sbjct: 346  CNLLTYCAAGETCCCARKFFG-ICISWKCCGLDSAVCCKDRLHCCPHDYPVCDTDKNMCF 404

Query: 389  RAAGNYSMVKPFE 351
            + AGN + ++  E
Sbjct: 405  KRAGNATRMEAIE 417


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  534 bits (1375), Expect = e-149
 Identities = 249/373 (66%), Positives = 295/373 (79%), Gaps = 1/373 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287
            ++FEDN AFV QHN++ NS++ + LNA+ADLTH EF+AS LG S A+   DR ++   ++
Sbjct: 51   KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110

Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107
              G + DVP SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQEL
Sbjct: 111  -PGTLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169

Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927
            IDCDRSYNSGCGGGLMDYA++FV+KNHGIDTE+DYPY+     CN+ KL R +VTIDGY 
Sbjct: 170  IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229

Query: 926  DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747
            D+P +NE++LL+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+IVGY SENG
Sbjct: 230  DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENG 289

Query: 746  VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567
            VDYWI+KNSWG+SWGM+GYMHMQRN G+  G+CGINMLA                  TRC
Sbjct: 290  VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRC 349

Query: 566  SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL- 390
            SL++YC AGETCCCG  +LG +CLSWKCC   SAVCC DH YCCP NYPICD+   QCL 
Sbjct: 350  SLLTYCAAGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408

Query: 389  RAAGNYSMVKPFE 351
            R  GN +  +  E
Sbjct: 409  RFTGNVTAAEAIE 421


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  533 bits (1374), Expect = e-149
 Identities = 246/372 (66%), Positives = 297/372 (79%), Gaps = 1/372 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAA-SMVDRSKSLGTK 1290
            ++F DN  FV QHN +SNSTY + LNA+ADLTHHEF+AS+LGLS  + S++ + +SLG  
Sbjct: 59   QIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHEFKASRLGLSAPSPSLMAKEQSLGVS 118

Query: 1289 ARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQE 1110
             R      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSEQE
Sbjct: 119  ERVRV--KVPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSEQE 176

Query: 1109 LIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGY 930
            LIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK+RVVTID Y
Sbjct: 177  LIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQEQDGTCKKDKLKKRVVTIDSY 236

Query: 929  TDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSEN 750
              + S+NE+ L++AVASQPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+N
Sbjct: 237  AGVASNNEKALMEAVASQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGSQN 296

Query: 749  GVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTR 570
            GVDYWI+KNSWGKSWGMDG+MHMQRN G+ +GVCGINMLA                  T+
Sbjct: 297  GVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGINMLASYPIKTHPNPPPPSPPGPTK 356

Query: 569  CSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL 390
            C+L +YC +GETCCC   L GL C SWKCCEL+SAVCCKD  +CCP +YP+CDT    CL
Sbjct: 357  CNLFTYCSSGETCCCARTLFGL-CFSWKCCELESAVCCKDGRHCCPRDYPVCDTTKSLCL 415

Query: 389  RAAGNYSMVKPF 354
            +  GN++ +KPF
Sbjct: 416  KKTGNFTEIKPF 427


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  533 bits (1372), Expect = e-148
 Identities = 249/391 (63%), Positives = 297/391 (75%), Gaps = 1/391 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287
            ++FEDN AFV QHN++ NS++ + LNA+ADLTH EF+AS LG S A+   DR ++   ++
Sbjct: 51   KIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQS 110

Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107
              G + DVP SIDWRKKGAVT VKDQASCGACWAFS TGAIEGIN+IVTGSLVSLSEQEL
Sbjct: 111  -PGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169

Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927
            IDCDRSYNSGCGGGLMDYA++FV+KNHGIDTE+DYPY+     CN+ KL R +VTIDGY 
Sbjct: 170  IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYK 229

Query: 926  DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747
            D+P +NE++LL+AV +QPVSVG+CGSER FQLYS+GIF+GPC+TSLDHAV+I+GY SENG
Sbjct: 230  DVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIIGYDSENG 289

Query: 746  VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567
            VDYWI+KNSWG+SWGM+GYMHMQRN G+  G+CGINMLA                  TRC
Sbjct: 290  VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPTRC 349

Query: 566  SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCL- 390
            SL++YC  GETCCCG  +LG +CLSWKCC   SAVCC DH YCCP NYPICD+   QCL 
Sbjct: 350  SLLTYCAPGETCCCGSSILG-ICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLT 408

Query: 389  RAAGNYSMVKPFEXXXXXXXXXXXXXLFEAW 297
            R  GN +  +  E               +AW
Sbjct: 409  RLTGNVTAAEAIEMRGSSWKFGSWSSFIDAW 439


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  527 bits (1358), Expect = e-147
 Identities = 241/374 (64%), Positives = 297/374 (79%), Gaps = 3/374 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296
            ++F+DN  FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A    M  + +SLG
Sbjct: 54   QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 113

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
               +      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE
Sbjct: 114  GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID
Sbjct: 168  QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756
             Y  + S++E+ L++AVA+QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS
Sbjct: 228  SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSGIFSGPCSTSLDHAVLIVGYGS 287

Query: 755  ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576
            +NGVDYWI+KNSWGKSWGMDG+MHMQRN  +  GVCGINMLA                  
Sbjct: 288  QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGP 347

Query: 575  TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396
            T+C+L +YC +GETCCC   L GL C SWKCCE++SAVCCKD  +CCPH+YP+CDT    
Sbjct: 348  TKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSL 406

Query: 395  CLRAAGNYSMVKPF 354
            CL+  GN++ +KPF
Sbjct: 407  CLKKTGNFTAIKPF 420


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  527 bits (1357), Expect = e-147
 Identities = 243/376 (64%), Positives = 299/376 (79%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296
            ++F+DN  FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+AS   M  + +SLG
Sbjct: 54   QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSASSLIMASKGQSLG 113

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
              A+      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE
Sbjct: 114  GNAK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID
Sbjct: 168  QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYS--TGIFSGPCATSLDHAVVIVGY 762
             Y  + S++E+ L +AVA+QPVSVG+CGSER FQLYS  +GIFSGPC+TSLDHAV+IVGY
Sbjct: 228  SYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSGIFSGPCSTSLDHAVLIVGY 287

Query: 761  GSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXX 582
            GS+NGVDYWI+KNSWGKSWGMDG+MHMQRN G+ +G+CGINMLA                
Sbjct: 288  GSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGINMLASYPIKTHPNPPPPSPP 347

Query: 581  XXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQT 402
              T+C+L +YC AGETCCC   L GL C SWKCCE++SAVCC D  +CCPH+YP+CDT  
Sbjct: 348  GPTKCNLFTYCSAGETCCCARNLFGL-CFSWKCCEIESAVCCSDGRHCCPHDYPVCDTTR 406

Query: 401  KQCLRAAGNYSMVKPF 354
              CL+  GN++ +KPF
Sbjct: 407  SLCLKKTGNFTAIKPF 422


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  526 bits (1356), Expect = e-147
 Identities = 248/359 (69%), Positives = 287/359 (79%)
 Frame = -3

Query: 1463 VFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKAR 1284
            VFEDNLAFV QHN++ NS+Y + LNA+ADLTHHEF++S+LG S A  ++     LG+K  
Sbjct: 53   VFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHHEFKSSRLGFSSA--LLSSLPKLGSKLL 110

Query: 1283 SGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELI 1104
               + DVP S+DWRKKGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSLVSLSEQELI
Sbjct: 111  D--LRDVPASLDWRKKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 168

Query: 1103 DCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTD 924
            DCD SYN+GC GGLMDYA++FV+ NHGIDTEEDYPYQA D+SC + KLKRRVVTIDGYTD
Sbjct: 169  DCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQARDKSCRKEKLKRRVVTIDGYTD 228

Query: 923  IPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGV 744
            +  +N  +LL+AV +QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGY SENGV
Sbjct: 229  VAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYDSENGV 288

Query: 743  DYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCS 564
            DYWI+KNSWGK WGMDGY+HMQRN G+ QGVCGINMLA                  TRCS
Sbjct: 289  DYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGINMLASYPTKTSPNPPPSPSPGPTRCS 348

Query: 563  LMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387
              + CG GETCCC +R LGL C SWKCC L+SAVCCKD I+CCP +YP+CDTQ   CL+
Sbjct: 349  FFAQCGEGETCCCSWRFLGL-CFSWKCCGLNSAVCCKDKIHCCPQDYPLCDTQRNVCLK 406


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  526 bits (1356), Expect = e-147
 Identities = 241/374 (64%), Positives = 296/374 (79%), Gaps = 3/374 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296
            ++F+DN  FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A    M  + +SLG
Sbjct: 54   QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 113

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
               +      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE
Sbjct: 114  GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 167

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID
Sbjct: 168  QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 227

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756
             Y  + S++E+ L++AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGS
Sbjct: 228  SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRGIFSGPCSTSLDHAVLIVGYGS 287

Query: 755  ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576
            +NGVDYWI+KNSWGKSWGMDG+MHMQRN  +  GVCGINMLA                  
Sbjct: 288  QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPPPPSPPGP 347

Query: 575  TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396
            T+C+L +YC +GETCCC   L GL C SWKCCE++SAVCCKD  +CCPH+YP+CDT    
Sbjct: 348  TKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPVCDTTRSL 406

Query: 395  CLRAAGNYSMVKPF 354
            CL+  GN++ +KPF
Sbjct: 407  CLKKTGNFTAIKPF 420


>ref|XP_007017656.1| JHL18I08.3 protein [Theobroma cacao] gi|508722984|gb|EOY14881.1|
            JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  525 bits (1353), Expect = e-146
 Identities = 241/372 (64%), Positives = 292/372 (78%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287
            +VFE+N AFV QHN + NS+Y + LNA+ADLTHHEF+AS+LGLS AA    R        
Sbjct: 52   KVFEENYAFVTQHNGVGNSSYSLALNAFADLTHHEFKASRLGLSAAAIEGSRPNL----Q 107

Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107
              G V D+P S+DWR KGAVT VKDQ SCGACW+FS TGAIEGIN+IVTG+LVSLSEQEL
Sbjct: 108  LPGLVRDIPASMDWRTKGAVTKVKDQGSCGACWSFSATGAIEGINKIVTGTLVSLSEQEL 167

Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927
            +DCDRSYNSGC GGLMDYA++FV+ NHGID EEDYPY   +++CN+ K KRRVVTIDGY 
Sbjct: 168  VDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEEDYPYLGREKTCNKEKRKRRVVTIDGYA 227

Query: 926  DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747
             +P++NE+ LL+AVA QPVSVG+CGSER FQLYS GIF+GPC++SLDHAV+IVGYGSENG
Sbjct: 228  GVPANNEDLLLQAVAKQPVSVGICGSERAFQLYSKGIFTGPCSSSLDHAVLIVGYGSENG 287

Query: 746  VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567
            VDYWI+KNSWG  WGM+GY+HM RN+GD +G+CGINMLA                  T+C
Sbjct: 288  VDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCGINMLASYPTKTSPNPPSPPPPGPTKC 347

Query: 566  SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387
             L +YC AGETCCC  R+ G +C SWKCCELDSAVCCKD+ +CCP++YP+CDT+  QCL+
Sbjct: 348  DLFTYCSAGETCCCTHRIFG-ICFSWKCCELDSAVCCKDNRHCCPYDYPVCDTKKSQCLK 406

Query: 386  AAGNYSMVKPFE 351
              GN + ++ FE
Sbjct: 407  RVGNATRMEAFE 418


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  525 bits (1351), Expect = e-146
 Identities = 241/362 (66%), Positives = 293/362 (80%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287
            ++FE+N  FV +HNS  NS+Y + LNA+ADLTHHEF+AS+LGLS  ++    S+      
Sbjct: 54   KIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLSAFSTSGKLSRR--NFP 111

Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107
               FVGDVP SIDWRKKGAV+ VKDQ +CGACW+FS TGAIEGIN+IVTGSLVSLSEQEL
Sbjct: 112  LHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQEL 171

Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927
            +DCDRSYN+GC GGLMDYA++FV++N+GIDTEEDYPYQA +++CN+ KLKR VVTIDGYT
Sbjct: 172  VDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYT 231

Query: 926  DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747
            D+P +NE+ELLKAVA+QPVSVG+CGSER FQLYS GIF+GPC+TSLDHAV+IVGYGSENG
Sbjct: 232  DVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGSENG 291

Query: 746  VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567
            VDYWI+KNSWG  WG++GYM+M RN+G+ QG+CGINMLA                  T+C
Sbjct: 292  VDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPVKTSPNPPPPAPPGPTKC 351

Query: 566  SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387
             L + CG GETCCC  R+ GL C SWKCCELDSAVCCKD ++CCPH+YP+CDT+   CL+
Sbjct: 352  DLFTRCGEGETCCCTRRIFGL-CFSWKCCELDSAVCCKDGLHCCPHDYPVCDTKRNMCLK 410

Query: 386  AA 381
             +
Sbjct: 411  VS 412


>ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
            gi|548841210|gb|ERN01273.1| hypothetical protein
            AMTR_s00002p00249780 [Amborella trichopoda]
          Length = 475

 Score =  516 bits (1329), Expect = e-143
 Identities = 236/366 (64%), Positives = 286/366 (78%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKA 1287
            RVF DNL F+ +HN  +NS Y VGLNA+ADLTHHEF+  +LGL  +      S     + 
Sbjct: 95   RVFSDNLVFIREHNQRANSNYTVGLNAFADLTHHEFKIKRLGLCPSILRFSSSNFRSDQK 154

Query: 1286 RSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQEL 1107
            +     DVP S+DWR KGAVT VKDQ SCGACWAFS TGAIEGIN+IVTGSL+SLSEQE+
Sbjct: 155  KI----DVPSSLDWRDKGAVTNVKDQGSCGACWAFSATGAIEGINKIVTGSLISLSEQEI 210

Query: 1106 IDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYT 927
            IDCD +YNSGCGGGLMDYAFK+V KNHGIDTE+DYPY+ +  SC ++K +R VVTIDG+T
Sbjct: 211  IDCDTTYNSGCGGGLMDYAFKWVTKNHGIDTEKDYPYREVQGSCIKDKAERHVVTIDGHT 270

Query: 926  DIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENG 747
            DIPS++E+ +L+AVA QPVSVG+CGSER FQLYS+GIFSGPC+TSLDHAV+IVGYGS+NG
Sbjct: 271  DIPSNSEDLILQAVAKQPVSVGICGSERSFQLYSSGIFSGPCSTSLDHAVLIVGYGSKNG 330

Query: 746  VDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRC 567
            VDYWI+KNSWG SWGMDGYMHM RN+GD QGVCGINM+                    +C
Sbjct: 331  VDYWIVKNSWGTSWGMDGYMHMLRNSGDSQGVCGINMMPSYPTKSGANPPPSPPPGPVKC 390

Query: 566  SLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLR 387
            SL++YC +G TCCC +R LG +CLSW CC+LD+AVCCKD  YCCP +YP+C+T T  CL+
Sbjct: 391  SLLTYCPSGNTCCCTWRFLG-ICLSWSCCDLDNAVCCKDGQYCCPQDYPVCNTATGYCLK 449

Query: 386  AAGNYS 369
             +GN++
Sbjct: 450  GSGNWT 455


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  516 bits (1329), Expect = e-143
 Identities = 244/363 (67%), Positives = 282/363 (77%)
 Frame = -3

Query: 1463 VFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGTKAR 1284
            VF DN  FV  HN+L NS+Y + LN+YADLTHHEF+ S+LG S A     R+        
Sbjct: 52   VFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEFKVSRLGFSPAL----RNFRPVLPQE 107

Query: 1283 SGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSEQELI 1104
                 DVP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQI+TGSL+SLSEQELI
Sbjct: 108  PSLPRDVPDSLDWRKKGAVTAVKDQGSCGACWSFSATGAMEGINQIMTGSLISLSEQELI 167

Query: 1103 DCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTIDGYTD 924
            DCDRSYNSGCGGGLMDYA++FV+ NHGIDTE DYPYQA D SC ++KL+R VVTIDGY D
Sbjct: 168  DCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYPYQARDGSCRKDKLQRNVVTIDGYAD 227

Query: 923  IPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGSENGV 744
            IPS++E +LL+AVA+QPVSVG+CGSER FQLYS GIFSGPC+TSLDHAV+IVGYGSENGV
Sbjct: 228  IPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGV 287

Query: 743  DYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXXTRCS 564
            DYWI+KNSWGKSWGMDGYMHMQRN+G+ +GVCGIN LA                  T+CS
Sbjct: 288  DYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGINKLASYPTKTNPNPPPSPPPGPTKCS 347

Query: 563  LMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQCLRA 384
            +++ C AGETCCC  + LGL CLSWKCC L SAVCCKD  +CCP +YPICDT    CL+ 
Sbjct: 348  ILTSCAAGETCCCAKKFLGL-CLSWKCCGLSSAVCCKDGRHCCPFDYPICDTDRNLCLKQ 406

Query: 383  AGN 375
              N
Sbjct: 407  TMN 409


>ref|XP_006341989.1| PREDICTED: cysteine proteinase RD21a-like [Solanum tuberosum]
          Length = 439

 Score =  514 bits (1324), Expect = e-143
 Identities = 237/372 (63%), Positives = 297/372 (79%), Gaps = 3/372 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGT-K 1290
            +VFE+N A++ +HNS  NS+Y +GLNAY+DLTHHEFR S LGLS +A+   R K  G+  
Sbjct: 51   KVFEENYAYITEHNSKENSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGS 110

Query: 1289 ARSGFVGDV--PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
            + +G + DV  P S+DWR+KGAVT VK+Q SCGACW+FS TGA+EGIN+I TGSLVSLSE
Sbjct: 111  SETGVLSDVDAPSSLDWREKGAVTDVKNQGSCGACWSFSATGAMEGINKITTGSLVSLSE 170

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCDRSYN GCGGGLMDYAF+FV+KN GIDTE+DYP++  + +CN+NKL+R VVTID
Sbjct: 171  QELIDCDRSYNEGCGGGLMDYAFEFVIKNGGIDTEKDYPFREREGTCNKNKLQRHVVTID 230

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756
            GYTDIP ++E++LLKAVA+QPVSVG+CGS R FQ YS GIF+GPC+T+LDHAV+IVGYGS
Sbjct: 231  GYTDIPQNDEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCSTALDHAVLIVGYGS 290

Query: 755  ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576
            ENGVDYWI+KNSWG SWG++GY+HMQRN+G+++G+CGIN LA                  
Sbjct: 291  ENGVDYWIIKNSWGTSWGINGYIHMQRNSGNQEGICGINKLASYPTKTSPNPPTPPAPGP 350

Query: 575  TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396
            ++CS+ + CG GETCCCG + LG +CLSWKCC LDSAVCCKD  +CCP +YPICDT    
Sbjct: 351  SKCSMFTSCGQGETCCCGSKFLG-ICLSWKCCGLDSAVCCKDGRHCCPQDYPICDTSRNL 409

Query: 395  CLRAAGNYSMVK 360
            CL+   N ++V+
Sbjct: 410  CLKRMNNATIVQ 421


>ref|XP_004238304.1| PREDICTED: cysteine proteinase RD21a-like [Solanum lycopersicum]
          Length = 439

 Score =  511 bits (1316), Expect = e-142
 Identities = 237/372 (63%), Positives = 293/372 (78%), Gaps = 3/372 (0%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKSLGT-K 1290
            +VFE+N A++ +HNS  NS+Y +GLNAY+DLTHHEFR S LGLS +A+   R K  G+  
Sbjct: 51   KVFEENYAYITEHNSKGNSSYTLGLNAYSDLTHHEFRNSFLGLSSSANDFIRLKGRGSGS 110

Query: 1289 ARSGFVGDV--PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
            + +G + DV  P S+DWR KGAVT VK+Q SCGACW+FS TGAIEGIN+I TGSLVSLSE
Sbjct: 111  SAAGVLSDVDAPSSLDWRDKGAVTNVKNQGSCGACWSFSATGAIEGINKITTGSLVSLSE 170

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCDRSYN GCGGGLMDYAF+FV+KN GIDTE+DYP++  + +CN+NKL+RRVVTID
Sbjct: 171  QELIDCDRSYNQGCGGGLMDYAFEFVIKNGGIDTEKDYPFREKEGTCNKNKLQRRVVTID 230

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVIVGYGS 756
            GYTDIP ++E++LLKAVA+QPVSVG+CGS R FQ YS GIF+GPC T LDHAV+IVGYGS
Sbjct: 231  GYTDIPQNDEDKLLKAVATQPVSVGICGSARAFQSYSKGIFTGPCPTDLDHAVLIVGYGS 290

Query: 755  ENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXXXXXXX 576
            ENG DYWI+KNSWG SWG++GY+HMQRN+G+++G+CG+N LA                  
Sbjct: 291  ENGFDYWIIKNSWGTSWGINGYIHMQRNSGNQEGICGVNKLASYPTKTSPNPPNPPAPGP 350

Query: 575  TRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICDTQTKQ 396
            ++CS  + CG GETCCCG + LG +CLSWKCC LDSAVCCKD  +CCP +YPICDT    
Sbjct: 351  SKCSTFTSCGQGETCCCGLKFLG-ICLSWKCCGLDSAVCCKDGRHCCPWDYPICDTSRNL 409

Query: 395  CLRAAGNYSMVK 360
            CL+   N ++V+
Sbjct: 410  CLKRMSNATIVQ 421


>ref|XP_006307431.1| hypothetical protein CARUB_v10009056mg [Capsella rubella]
            gi|482576142|gb|EOA40329.1| hypothetical protein
            CARUB_v10009056mg [Capsella rubella]
          Length = 467

 Score =  507 bits (1305), Expect = e-141
 Identities = 241/402 (59%), Positives = 292/402 (72%), Gaps = 31/402 (7%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296
            ++F DN  FV QHN ++N+TY + LNA+ADL H EF+ S+LGLSV+A    M  + KSLG
Sbjct: 56   QIFRDNHDFVTQHNLITNATYSLSLNAFADLNHSEFKTSRLGLSVSAPSVIMASKGKSLG 115

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
               +      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE
Sbjct: 116  GSVK------VPDSLDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 169

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCD+SYN GC GGLMDYAF+FV+KN GIDTE+DYPYQ  D +C ++KLK+RVV+ID
Sbjct: 170  QELIDCDKSYNDGCNGGLMDYAFEFVIKNKGIDTEKDYPYQERDGTCKKDKLKQRVVSID 229

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYST---------------------- 822
             Y  +  S+E+ LL+AVA+QPVSVG+CGSER FQLYS+                      
Sbjct: 230  SYAGVKPSDEKALLEAVAAQPVSVGICGSERAFQLYSSVSFKIRDTSILSSECSTFPCLK 289

Query: 821  ------GIFSGPCATSLDHAVVIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDK 660
                  GIFSGPC+TSLDHAV+IVGYGS+NGVDYWI+KNSWGKSWGMDG+MHMQRN G+ 
Sbjct: 290  LYLMMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNS 349

Query: 659  QGVCGINMLAXXXXXXXXXXXXXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCC 480
            QG+CGINMLA                  T+C+L +YC A ETCCC   L GL CLSWKCC
Sbjct: 350  QGICGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAAETCCCARNLFGL-CLSWKCC 408

Query: 479  ELDSAVCCKDHIYCCPHNYPICDTQTKQCLRAAGNYSMVKPF 354
            E++SAVCCKD  +CCPH+YP+CDT    CL+  GN++ +KPF
Sbjct: 409  EIESAVCCKDGRHCCPHDYPVCDTTRSLCLKKTGNFTAIKPF 450


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  507 bits (1305), Expect = e-141
 Identities = 236/370 (63%), Positives = 289/370 (78%), Gaps = 10/370 (2%)
 Frame = -3

Query: 1466 RVFEDNLAFVDQHNSLSNSTYKVGLNAYADLTHHEFRASKLGLSVAAS---MVDRSKSLG 1296
            ++F+DN  FV QHN ++N+TY + LNA+ADLTHHEF+AS+LGLSV+A    M  + +SLG
Sbjct: 52   QIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKASRLGLSVSAPSVIMASKGQSLG 111

Query: 1295 TKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSLVSLSE 1116
               +      VP S+DWRKKGAVT VKDQ SCGACW+FS TGA+EGINQIVTG L+SLSE
Sbjct: 112  GSVK------VPDSVDWRKKGAVTNVKDQGSCGACWSFSATGAMEGINQIVTGDLISLSE 165

Query: 1115 QELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRRVVTID 936
            QELIDCD+SYN+GC GGLMDYAF+FV+KNHGIDTE+DYPYQ  D +C ++KLK++VVTID
Sbjct: 166  QELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQERDGTCKKDKLKQKVVTID 225

Query: 935  GYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYST-------GIFSGPCATSLDHAV 777
             Y  + S++E+ L++AVA+QPVSVG+CGSER FQLYS+       GIFSGPC+TSLDHAV
Sbjct: 226  SYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSKFYLLMQGIFSGPCSTSLDHAV 285

Query: 776  VIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXX 597
            +IVGYGS+NGVDYWI+KNSWGKSWGMDG+MHMQRN  +  GVCGINMLA           
Sbjct: 286  LIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINMLASYPIKTHPNPP 345

Query: 596  XXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPI 417
                   T+C+L +YC +GETCCC   L GL C SWKCCE++SAVCCKD  +CCPH+YP+
Sbjct: 346  PPSPPGPTKCNLFTYCSSGETCCCARELFGL-CFSWKCCEIESAVCCKDGRHCCPHDYPV 404

Query: 416  CDTQTKQCLR 387
            CDT    CL+
Sbjct: 405  CDTTRSLCLK 414


>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
            gi|194706024|gb|ACF87096.1| unknown [Zea mays]
            gi|413945958|gb|AFW78607.1| hypothetical protein
            ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  506 bits (1303), Expect = e-140
 Identities = 240/381 (62%), Positives = 286/381 (75%), Gaps = 13/381 (3%)
 Frame = -3

Query: 1463 VFEDNLAFVDQHNSLSNS-------------TYKVGLNAYADLTHHEFRASKLGLSVAAS 1323
            VF DN AFV  HN+ + +             +Y + LNA+ADLTH EFRA++LG  +A  
Sbjct: 59   VFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLALNAFADLTHEEFRAARLG-RIAPG 117

Query: 1322 MVDRSKSLGTKARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIV 1143
               RS++       G    VP ++DWRK GAVT VKDQ SCGACW+FS TGA+EGIN+I 
Sbjct: 118  AALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQGSCGACWSFSATGAMEGINKIK 177

Query: 1142 TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNK 963
            TGSLVSLSEQELIDCDRSYNSGCGGGLMDYA+KFV+KN GIDTEEDYPY+  D +CN+NK
Sbjct: 178  TGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKNGGIDTEEDYPYREADGTCNKNK 237

Query: 962  LKRRVVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDH 783
            LK+RVVTIDGYTD+PS+ E+ LL+AVA QPVSVG+CGS R FQLY  GIF GPC TSLDH
Sbjct: 238  LKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGSARAFQLYYQGIFDGPCPTSLDH 297

Query: 782  AVVIVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXX 603
            AV+IVGYGSE G DYWI+KNSWG+SWGM GYMHM RN GD +GVCGINM+A         
Sbjct: 298  AVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKTSPN 357

Query: 602  XXXXXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNY 423
                     T+CSL++YC  G TCCC +R+LG  CLSW CCELD+AVCCKD+ YCCPH+Y
Sbjct: 358  PPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGF-CLSWSCCELDNAVCCKDNRYCCPHDY 416

Query: 422  PICDTQTKQCLRAAGNYSMVK 360
            P+CDT   QCL+A+GN+S ++
Sbjct: 417  PVCDTGRGQCLKASGNFSAIE 437


>ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica]
          Length = 454

 Score =  503 bits (1296), Expect = e-140
 Identities = 238/377 (63%), Positives = 282/377 (74%), Gaps = 9/377 (2%)
 Frame = -3

Query: 1463 VFEDNLAFVDQHNSLSNS------TYKVGLNAYADLTHHEFRASKLGLSVAASMVDRSKS 1302
            VF DN AFV  HN+ +N+      +Y + LNA+ADLTH EFRA++LG      +    +S
Sbjct: 56   VFADNAAFVAAHNARANAVGGSPPSYTLALNAFADLTHEEFRAARLGRLAVGRVGATLRS 115

Query: 1301 LGTKARSGFVGDV---PKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSL 1131
             G     G  G V   P ++DWRKKGAVT VK+Q SCGACW+FS TGAIEGIN+I TGSL
Sbjct: 116  AGAPVFGGLDGGVAAVPDAVDWRKKGAVTKVKNQGSCGACWSFSATGAIEGINKIKTGSL 175

Query: 1130 VSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRR 951
            VSLSEQELIDCDRSYN+GCGGGLMDYAFKFV+KN GIDTE+DYPY+  D +CN+NKLKRR
Sbjct: 176  VSLSEQELIDCDRSYNNGCGGGLMDYAFKFVIKNGGIDTEDDYPYRQADGTCNKNKLKRR 235

Query: 950  VVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYSTGIFSGPCATSLDHAVVI 771
            VVTIDGY+D+PS+ E  LL+AVA QPVSVG+CGS R FQLYS GIF GPC TSLDHAV+I
Sbjct: 236  VVTIDGYSDVPSNKENLLLQAVAQQPVSVGICGSARAFQLYSQGIFDGPCPTSLDHAVLI 295

Query: 770  VGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXXX 591
            VGYGSE G DYWI+KNSWG+ WGM GYMHM RN G   G+CGINM+              
Sbjct: 296  VGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGASSGICGINMMPSFPTKTSPNPPPS 355

Query: 590  XXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPICD 411
                 T+C+L++YC  G TCCC +R+LGL CLSW CC LD+A+CCKD+ YCCPH+YPICD
Sbjct: 356  PGPGPTKCNLLTYCPEGSTCCCSWRVLGL-CLSWSCCGLDNAICCKDNRYCCPHDYPICD 414

Query: 410  TQTKQCLRAAGNYSMVK 360
            T   QCLRA GN+S ++
Sbjct: 415  TVRAQCLRANGNFSGIE 431


>ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
            gi|241945324|gb|EES18469.1| hypothetical protein
            SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  501 bits (1291), Expect = e-139
 Identities = 239/378 (63%), Positives = 286/378 (75%), Gaps = 10/378 (2%)
 Frame = -3

Query: 1463 VFEDNLAFVDQHNSLSNS--------TYKVGLNAYADLTHHEFRASKLGLSVAASMVDRS 1308
            VF DN AFV  HN+  N+        +Y + LNA+ADLTH EFRA++LG   A +   RS
Sbjct: 64   VFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADLTHEEFRAARLGRIAAGAAALRS 123

Query: 1307 KSLGT-KARSGFVGDVPKSIDWRKKGAVTPVKDQASCGACWAFSTTGAIEGINQIVTGSL 1131
             +    +   G +G VP ++DWR+ GAVT VKDQ SCGACW+FS TGA+EGIN+I TGSL
Sbjct: 124  PAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSL 183

Query: 1130 VSLSEQELIDCDRSYNSGCGGGLMDYAFKFVVKNHGIDTEEDYPYQAIDRSCNRNKLKRR 951
            VSLSEQELIDCDRSYNSGCGGGLMDYA+KFVVKN GIDTEEDYPY+  D +CN+NKLK+R
Sbjct: 184  VSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEEDYPYREADGTCNKNKLKKR 243

Query: 950  VVTIDGYTDIPSSNEEELLKAVASQPVSVGLCGSERGFQLYS-TGIFSGPCATSLDHAVV 774
            +VTIDGY+D+PS+ E+ LL+AVA QPVSVG+CGS R FQLYS  GIF GPC TSLDHAV+
Sbjct: 244  IVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSARAFQLYSQQGIFDGPCPTSLDHAVL 303

Query: 773  IVGYGSENGVDYWILKNSWGKSWGMDGYMHMQRNNGDKQGVCGINMLAXXXXXXXXXXXX 594
            IVGYGSE G DYWI+KNSWG+SWGM GYMHM RN GD +GVCGINM+A            
Sbjct: 304  IVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTGDSKGVCGINMMASFPTKSSPNPPP 363

Query: 593  XXXXXXTRCSLMSYCGAGETCCCGFRLLGLVCLSWKCCELDSAVCCKDHIYCCPHNYPIC 414
                  T+CSL++YC  G TCCC +R+LG  CLSW CCELD+AVCCKD+  CCPH+YP+C
Sbjct: 364  SPGPGPTKCSLLTYCPEGSTCCCSWRILGF-CLSWSCCELDNAVCCKDNKSCCPHDYPVC 422

Query: 413  DTQTKQCLRAAGNYSMVK 360
            DT    CL+A+GN S ++
Sbjct: 423  DTDRGLCLKASGNSSAIE 440

