BLASTX nr result

ID: Stemona21_contig00014615 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00014615
         (1302 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [A...   527   e-147
ref|NP_001141813.1| uncharacterized protein LOC100273952 precurs...   519   e-144
gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]                   517   e-144
ref|XP_002510459.1| cysteine protease, putative [Ricinus communi...   516   e-143
ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutr...   513   e-143
ref|XP_002307688.2| cysteine protease family protein [Populus tr...   509   e-142
gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indi...   509   e-141
gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [A...   509   e-141
dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]                          508   e-141
ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] g...   508   e-141
ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis tha...   508   e-141
gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]                  507   e-141
ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [C...   504   e-140
ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria...   504   e-140
ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [S...   503   e-140
ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arab...   503   e-140
gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase...   496   e-137
ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer a...   494   e-137
ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [C...   493   e-137
ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citr...   493   e-137

>ref|XP_006838704.1| hypothetical protein AMTR_s00002p00249780 [Amborella trichopoda]
            gi|548841210|gb|ERN01273.1| hypothetical protein
            AMTR_s00002p00249780 [Amborella trichopoda]
          Length = 475

 Score =  527 bits (1357), Expect = e-147
 Identities = 240/387 (62%), Positives = 284/387 (73%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ++FESWCR HG+ Y + EEK  RF VF D                 +GLNAFADL HHEF
Sbjct: 71   DIFESWCRRHGRTYGTVEEKEQRFRVFSDNLVFIREHNQRANSNYTVGLNAFADLTHHEF 130

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                        ++  S             P S+DWR KGAVT VKDQGSCGACW+FSAT
Sbjct: 131  KIKRLGLCPS--ILRFSSSNFRSDQKKIDVPSSLDWRDKGAVTNVKDQGSCGACWAFSAT 188

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSL+SLSEQE++DCD TYNSGC GGLMDYA+KWV +NHGIDTE+DYPY+
Sbjct: 189  GAIEGINKIVTGSLISLSEQEIIDCDTTYNSGCGGGLMDYAFKWVTKNHGIDTEKDYPYR 248

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              + +C+K+K    VVTID +TD+P N+E+L+LQAVA QPVSVGICGSER+FQLYS GIF
Sbjct: 249  EVQGSCIKDKAERHVVTIDGHTDIPSNSEDLILQAVAKQPVSVGICGSERSFQLYSSGIF 308

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDGYMHMLR SG+ QGVCGINM+
Sbjct: 309  SGPCSTSLDHAVLIVGYGSKNGVDYWIVKNSWGTSWGMDGYMHMLRNSGDSQGVCGINMM 368

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
                                KCSLLTYCP G+TCCC+W  LG+C SWSCC+L++AVCCKD
Sbjct: 369  PSYPTKSGANPPPSPPPGPVKCSLLTYCPSGNTCCCTWRFLGICLSWSCCDLDNAVCCKD 428

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGNYS 3
             +YCCP DYP+C++ +  C KGSGN++
Sbjct: 429  GQYCCPQDYPVCNTATGYCLKGSGNWT 455


>ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
            gi|194706024|gb|ACF87096.1| unknown [Zea mays]
            gi|413945958|gb|AFW78607.1| hypothetical protein
            ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  519 bits (1336), Expect = e-144
 Identities = 252/399 (63%), Positives = 282/399 (70%), Gaps = 14/399 (3%)
 Frame = -2

Query: 1157 FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA-------------LGL 1017
            F++WC  HGK YA+ EE+ AR AVF D                A             L L
Sbjct: 36   FDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTLAL 95

Query: 1016 NAFADLAHHEFXXXXXXXXXXXAVVEP-SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQ 840
            NAFADL H EF           A +   +             PD++DWRK GAVT+VKDQ
Sbjct: 96   NAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKVKDQ 155

Query: 839  GSCGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQN 660
            GSCGACWSFSATGAMEGINKI TGSLVSLSEQEL+DCD++YNSGC GGLMDYAYK+VI+N
Sbjct: 156  GSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVIKN 215

Query: 659  HGIDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGS 480
             GIDTEEDYPY+ A+ TC KNKL  RVVTID YTDVP N E+LLLQAVA QPVSVGICGS
Sbjct: 216  GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSVGICGS 275

Query: 479  ERTFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKS 300
             R FQLY +GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG SWGM GYMHM R +
Sbjct: 276  ARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNT 335

Query: 299  GNPQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWS 120
            G+ +GVCGINM+A                  TKCSLLTYCPEGSTCCCSW +LG C SWS
Sbjct: 336  GDSKGVCGINMMASFPTKTSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGFCLSWS 395

Query: 119  CCELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3
            CCEL++AVCCKD+RYCCPHDYP+CD+G  QC K SGN+S
Sbjct: 396  CCELDNAVCCKDNRYCCPHDYPVCDTGRGQCLKASGNFS 434


>gb|EOY14881.1| JHL18I08.3 protein [Theobroma cacao]
          Length = 438

 Score =  517 bits (1332), Expect = e-144
 Identities = 246/384 (64%), Positives = 277/384 (72%)
 Frame = -2

Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFX 981
            LFE+WC  HGK Y+S+EEK  R  VFE+                +L LNAFADL HHEF 
Sbjct: 29   LFETWCDQHGKRYSSEEEKSYRLKVFEENYAFVTQHNGVGNSSYSLALNAFADLTHHEFK 88

Query: 980  XXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSATG 801
                        +E S             P S+DWR KGAVT+VKDQGSCGACWSFSATG
Sbjct: 89   ASRLGLSAA--AIEGSRPNLQLPGLVRDIPASMDWRTKGAVTKVKDQGSCGACWSFSATG 146

Query: 800  AMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQA 621
            A+EGINKIVTG+LVSLSEQELVDCD++YNSGCEGGLMDYAY++VI NHGID EEDYPY  
Sbjct: 147  AIEGINKIVTGTLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIDNHGIDNEEDYPYLG 206

Query: 620  AEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIFS 441
             EKTC K K   RVVTID Y  VP NNE+LLLQAVA QPVSVGICGSER FQLYSKGIF+
Sbjct: 207  REKTCNKEKRKRRVVTIDGYAGVPANNEDLLLQAVAKQPVSVGICGSERAFQLYSKGIFT 266

Query: 440  GPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINMLA 261
            GPCS+SLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GY+HMLR SG+ +G+CGINMLA
Sbjct: 267  GPCSSSLDHAVLIVGYGSENGVDYWIVKNSWGTRWGMNGYIHMLRNSGDSKGLCGINMLA 326

Query: 260  XXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKDH 81
                              TKC L TYC  G TCCC+  I G+CFSW CCEL+SAVCCKD+
Sbjct: 327  SYPTKTSPNPPSPPPPGPTKCDLFTYCSAGETCCCTHRIFGICFSWKCCELDSAVCCKDN 386

Query: 80   RYCCPHDYPICDSGSKQCFKGSGN 9
            R+CCP+DYP+CD+   QC K  GN
Sbjct: 387  RHCCPYDYPVCDTKKSQCLKRVGN 410


>ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
            gi|223551160|gb|EEF52646.1| cysteine protease, putative
            [Ricinus communis]
          Length = 422

 Score =  516 bits (1328), Expect = e-143
 Identities = 247/383 (64%), Positives = 278/383 (72%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            +LFESW + HGK Y S E+KL RF +FE+                 L LNAFADL HHEF
Sbjct: 30   KLFESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEF 89

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                          + S             P S+DWRKKGAV++VKDQG+CGACWSFSAT
Sbjct: 90   KASRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSLVSLSEQELVDCD++YN+GCEGGLMDYAY++VI+N+GIDTEEDYPYQ
Sbjct: 150  GAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
            A EKTC K KL   VVTID YTDVP NNE+ LL+AVA QPVSVGICGSER FQLYSKGIF
Sbjct: 210  AREKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIF 269

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WG++GYM+MLR SGN QG+CGINML
Sbjct: 270  TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINML 329

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  TKC L T C EG TCCC+  I GLCFSW CCEL+SAVCCKD
Sbjct: 330  ASFPVKTSPNPPPPAPPGPTKCDLFTRCGEGETCCCTRRIFGLCFSWKCCELDSAVCCKD 389

Query: 83   HRYCCPHDYPICDSGSKQCFKGS 15
              +CCPHDYP+CD+    C K S
Sbjct: 390  GLHCCPHDYPVCDTKRNMCLKVS 412


>ref|XP_006417526.1| hypothetical protein EUTSA_v10007640mg [Eutrema salsugineum]
            gi|557095297|gb|ESQ35879.1| hypothetical protein
            EUTSA_v10007640mg [Eutrema salsugineum]
          Length = 444

 Score =  513 bits (1321), Expect = e-143
 Identities = 240/388 (61%), Positives = 277/388 (71%), Gaps = 1/388 (0%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELF+ WC  HGK Y S+EE+  R  +F D                +L LNAFADL HHEF
Sbjct: 35   ELFDDWCHRHGKTYGSEEERQHRIQIFRDNHDFVTQHNHISNSTYSLSLNAFADLTHHEF 94

Query: 983  XXXXXXXXXXXA-VVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSA 807
                         ++                PDSVDWRKKGAVT VKDQGSCGACWSFSA
Sbjct: 95   KASRLGLSAPSPSLMAKEQSLGVSERVRVKVPDSVDWRKKGAVTNVKDQGSCGACWSFSA 154

Query: 806  TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627
            TGAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPY
Sbjct: 155  TGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPY 214

Query: 626  QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447
            Q  + TC K+KL  RVVTIDSY  V  NNE+ L++AVA QPVSVGICGSER FQLYS GI
Sbjct: 215  QEQDGTCKKDKLKKRVVTIDSYAGVASNNEKALMEAVASQPVSVGICGSERAFQLYSSGI 274

Query: 446  FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267
            FSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R +GN +GVCGINM
Sbjct: 275  FSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGVCGINM 334

Query: 266  LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87
            LA                  TKC+L TYC  G TCCC+ ++ GLCFSW CCELESAVCCK
Sbjct: 335  LASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARTLFGLCFSWKCCELESAVCCK 394

Query: 86   DHRYCCPHDYPICDSGSKQCFKGSGNYS 3
            D R+CCP DYP+CD+    C K +GN++
Sbjct: 395  DGRHCCPRDYPVCDTTKSLCLKKTGNFT 422


>ref|XP_002307688.2| cysteine protease family protein [Populus trichocarpa]
            gi|550339725|gb|EEE94684.2| cysteine protease family
            protein [Populus trichocarpa]
          Length = 436

 Score =  509 bits (1311), Expect = e-142
 Identities = 240/385 (62%), Positives = 273/385 (70%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            +LFE+WC+ HGK+Y S EE+  R  VFED                +L LNAFADL HHEF
Sbjct: 27   QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                         +  +             P S+DWR KG VT VKDQGSCGACWSFSAT
Sbjct: 87   KTSRLGLSAAP--LNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSLVSLSEQEL++CDK+YN GC GGLMDYA+++VI NHGIDTEEDYPY+
Sbjct: 145  GAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYR 204

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
            A + TC K+++  RVVTID Y DVP NNE+ LLQAVA QPVSVGICGSER FQ+YSKGIF
Sbjct: 205  ARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIF 264

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM GYMHM R SGN QGVCGINML
Sbjct: 265  TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQGVCGINML 324

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  TKC+LLTYC  G TCCC+    G+C SW CC L+SAVCCKD
Sbjct: 325  ASYPVKTSPNPPPPPPPGPTKCNLLTYCAAGETCCCARKFFGICISWKCCGLDSAVCCKD 384

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGN 9
              +CCPHDYP+CD+    CFK +GN
Sbjct: 385  RLHCCPHDYPVCDTDKNMCFKRAGN 409


>gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
          Length = 449

 Score =  509 bits (1310), Expect = e-141
 Identities = 242/387 (62%), Positives = 277/387 (71%), Gaps = 2/387 (0%)
 Frame = -2

Query: 1157 FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFXX 978
            FE+WC  HG++YA+  E+ AR A F D                 L LNAFADL H EF  
Sbjct: 38   FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYA-LALNAFADLTHDEFRA 96

Query: 977  XXXXXXXXXAVV-EPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSATG 801
                         +               PD+VDWR+ GAVT+VKDQGSCGACWSFSATG
Sbjct: 97   ARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSATG 156

Query: 800  AMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQA 621
            AMEGINKI TGSL+SLSEQEL+DCD++YNSGC GGLMDYAYK+V++N GIDTE DYPY+ 
Sbjct: 157  AMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRE 216

Query: 620  AEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIFS 441
             + TC KNKL  RVVTID Y DVP NNE++LLQAVA QPVSVGICGS R FQLYSKGIF 
Sbjct: 217  TDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIFD 276

Query: 440  GPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINMLA 261
            GPC TSLDHA+LIVGYGSE G DYWI+KNSWG SWGM GYM+M R +GN  GVCGIN + 
Sbjct: 277  GPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQMP 336

Query: 260  XXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKDH 81
                              TKCSLLTYCPEGSTCCCSW +LGLC SWSCCEL++AVCCKD+
Sbjct: 337  SFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKDN 396

Query: 80   RYCCPHDYPICDSGSKQCFK-GSGNYS 3
            RYCCPHDYP+CD+ S++CFK  +GN+S
Sbjct: 397  RYCCPHDYPVCDTASQRCFKANNGNFS 423


>gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
          Length = 437

 Score =  509 bits (1310), Expect = e-141
 Identities = 238/387 (61%), Positives = 278/387 (71%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELF+ WC+ HGK Y S+EE+  R  +F+D                +L LNAFADL HHEF
Sbjct: 30   ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                         V  +             PDSVDWRKKGAVT VKDQGSCGACWSFSAT
Sbjct: 90   KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ
Sbjct: 149  GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              + TC K+KL  +VVTIDSY  V  N+E+ L++AVA QPVSVGICGSER FQLYS+GIF
Sbjct: 209  ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSRGIF 268

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N  GVCGINML
Sbjct: 269  SGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINML 328

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  TKC+L TYC  G TCCC+  + GLCFSW CCE+ESAVCCKD
Sbjct: 329  ASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCCKD 388

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGNYS 3
             R+CCPHDYP+CD+    C K +GN++
Sbjct: 389  GRHCCPHDYPVCDTTRSLCLKKTGNFT 415


>dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  508 bits (1309), Expect = e-141
 Identities = 242/385 (62%), Positives = 275/385 (71%), Gaps = 1/385 (0%)
 Frame = -2

Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFX 981
            LFE+WC+ HGK YAS EEKL R  VF+D                 L LNAFADL HHEF 
Sbjct: 29   LFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFK 88

Query: 980  XXXXXXXXXXAV-VEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                      +  +                P SVDWRK GAVT+VKDQG+CGACWSFSAT
Sbjct: 89   ASRLGLSSAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWSFSAT 148

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSLVSLSEQELVDCDK+YN+GCEGG+MDYA+++VI NHGIDTEEDYPYQ
Sbjct: 149  GAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPYQ 208

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              +++C K KL   VVTID Y DVP NNE+ LL+AVA+QPVSVGICGSER FQLYSKGIF
Sbjct: 209  GRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGIF 268

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            +GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG+ WGMDGYMHM R SG+ +G+CGINML
Sbjct: 269  TGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRGLCGINML 328

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  T+C L T+C EG TCCC   I G+C SW CCEL+SAVCCKD
Sbjct: 329  ASYPKKTSPNPPPPAPPGPTRCDLFTHCGEGETCCCVHHIFGICLSWKCCELDSAVCCKD 388

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGN 9
             R+CCP DYP+CD+    C K  GN
Sbjct: 389  GRHCCPRDYPVCDTTRNICLKHYGN 413


>ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group] gi|48475189|gb|AAT44258.1|
            hypothetical protein [Oryza sativa Japonica Group]
            gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa
            Japonica Group]
          Length = 450

 Score =  508 bits (1309), Expect = e-141
 Identities = 242/388 (62%), Positives = 277/388 (71%), Gaps = 3/388 (0%)
 Frame = -2

Query: 1157 FESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEFXX 978
            FE+WC  HG++YA+  E+ AR A F D                 L LNAFADL H EF  
Sbjct: 38   FEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYA-LALNAFADLTHDEFRA 96

Query: 977  XXXXXXXXXAVV--EPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                          +               PD+VDWR+ GAVT+VKDQGSCGACWSFSAT
Sbjct: 97   ARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSAT 156

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGINKI TGSL+SLSEQEL+DCD++YNSGC GGLMDYAYK+V++N GIDTE DYPY+
Sbjct: 157  GAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPYR 216

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              + TC KNKL  RVVTID Y DVP NNE++LLQAVA QPVSVGICGS R FQLYSKGIF
Sbjct: 217  ETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGIF 276

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
             GPC TSLDHA+LIVGYGSE G DYWI+KNSWG SWGM GYM+M R +GN  GVCGIN +
Sbjct: 277  DGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCGINQM 336

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
                               TKCSLLTYCPEGSTCCCSW +LGLC SWSCCEL++AVCCKD
Sbjct: 337  PSFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRVLGLCLSWSCCELDNAVCCKD 396

Query: 83   HRYCCPHDYPICDSGSKQCFK-GSGNYS 3
            +RYCCPHDYP+CD+ S++CFK  +GN+S
Sbjct: 397  NRYCCPHDYPVCDTASQRCFKANNGNFS 424


>ref|NP_563855.1| papain-like cysteine peptidase [Arabidopsis thaliana]
            gi|110741821|dbj|BAE98853.1| papain-like cysteine
            peptidase XBCP3 [Arabidopsis thaliana]
            gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis
            thaliana] gi|332190386|gb|AEE28507.1| papain-like
            cysteine peptidase [Arabidopsis thaliana]
          Length = 437

 Score =  508 bits (1308), Expect = e-141
 Identities = 238/387 (61%), Positives = 277/387 (71%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELF+ WC+ HGK Y S+EE+  R  +F+D                +L LNAFADL HHEF
Sbjct: 30   ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                         V  +             PDSVDWRKKGAVT VKDQGSCGACWSFSAT
Sbjct: 90   KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ
Sbjct: 149  GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
              + TC K+KL  +VVTIDSY  V  N+E+ L++AVA QPVSVGICGSER FQLYS GIF
Sbjct: 209  ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSGIF 268

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            SGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N  GVCGINML
Sbjct: 269  SGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDGVCGINML 328

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  TKC+L TYC  G TCCC+  + GLCFSW CCE+ESAVCCKD
Sbjct: 329  ASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIESAVCCKD 388

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGNYS 3
             R+CCPHDYP+CD+    C K +GN++
Sbjct: 389  GRHCCPHDYPVCDTTRSLCLKKTGNFT 415


>gb|EXC25025.1| Oryzain alpha chain [Morus notabilis]
          Length = 517

 Score =  507 bits (1305), Expect = e-141
 Identities = 242/381 (63%), Positives = 268/381 (70%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            +LFE+WC  HG++Y+S+EE+L R  VFED                 L LNAFADL HHEF
Sbjct: 28   QLFEAWCEKHGQSYSSEEERLYRLTVFEDNLAFVTQHNNMGNSSYTLSLNAFADLTHHEF 87

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                           P              P S+DWRKKGAVT VKDQGSCGACW+FSAT
Sbjct: 88   KSSRLGFSSALLSSLPKLGSKLLDLRDV--PASLDWRKKGAVTNVKDQGSCGACWAFSAT 145

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GA+EGINKIVTGSLVSLSEQEL+DCD +YN+GC+GGLMDYAY++VI NHGIDTEEDYPYQ
Sbjct: 146  GAIEGINKIVTGSLVSLSEQELIDCDTSYNAGCDGGLMDYAYQFVIDNHGIDTEEDYPYQ 205

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
            A +K+C K KL  RVVTID YTDV PNN   LLQAV  QPVSVGICGSER FQLYSKGIF
Sbjct: 206  ARDKSCRKEKLKRRVVTIDGYTDVAPNNGLQLLQAVVTQPVSVGICGSERAFQLYSKGIF 265

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            +GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG  WGMDGY+HM R +GN QGVCGINML
Sbjct: 266  TGPCSTSLDHAVLIVGYDSENGVDYWIVKNSWGKQWGMDGYIHMQRNTGNSQGVCGINML 325

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  T+CS    C EG TCCCSW  LGLCFSW CC L SAVCCKD
Sbjct: 326  ASYPTKTSPNPPPSPSPGPTRCSFFAQCGEGETCCCSWRFLGLCFSWKCCGLNSAVCCKD 385

Query: 83   HRYCCPHDYPICDSGSKQCFK 21
              +CCP DYP+CD+    C K
Sbjct: 386  KIHCCPQDYPLCDTQRNVCLK 406


>ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
            gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine
            proteinase RD21a-like [Cucumis sativus]
          Length = 431

 Score =  504 bits (1298), Expect = e-140
 Identities = 244/385 (63%), Positives = 269/385 (69%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELFE WC  HGK+Y+S EEKL R  VF D                 L LN++ADL HHEF
Sbjct: 27   ELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTHHEF 86

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                           P              PDS+DWRKKGAVT VKDQGSCGACWSFSAT
Sbjct: 87   KVSRLGFSPALRNFRP--VLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCGACWSFSAT 144

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGIN+I+TGSL+SLSEQEL+DCD++YNSGC GGLMDYAY++VI NHGIDTE DYPYQ
Sbjct: 145  GAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGIDTENDYPYQ 204

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGIF 444
            A + +C K+KL   VVTID Y D+P N+E  LLQAVA QPVSVGICGSER FQLYSKGIF
Sbjct: 205  ARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYSKGIF 264

Query: 443  SGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINML 264
            SGPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG SWGMDGYMHM R SGN +GVCGIN L
Sbjct: 265  SGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSEGVCGINKL 324

Query: 263  AXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCKD 84
            A                  TKCS+LT C  G TCCC+   LGLC SW CC L SAVCCKD
Sbjct: 325  ASYPTKTNPNPPPSPPPGPTKCSILTSCAAGETCCCAKKFLGLCLSWKCCGLSSAVCCKD 384

Query: 83   HRYCCPHDYPICDSGSKQCFKGSGN 9
             R+CCP DYPICD+    C K + N
Sbjct: 385  GRHCCPFDYPICDTDRNLCLKQTMN 409


>ref|XP_004961575.1| PREDICTED: oryzain alpha chain-like [Setaria italica]
          Length = 454

 Score =  504 bits (1297), Expect = e-140
 Identities = 242/397 (60%), Positives = 277/397 (69%), Gaps = 11/397 (2%)
 Frame = -2

Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA------LGLNAFADL 999
            LF++WC  HGK YA+ EE+ AR AVF D                       L LNAFADL
Sbjct: 32   LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARANAVGGSPPSYTLALNAFADL 91

Query: 998  AHHEFXXXXXXXXXXXAVVEP-----SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGS 834
             H EF            V        +             PD+VDWRKKGAVT+VK+QGS
Sbjct: 92   THEEFRAARLGRLAVGRVGATLRSAGAPVFGGLDGGVAAVPDAVDWRKKGAVTKVKNQGS 151

Query: 833  CGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHG 654
            CGACWSFSATGA+EGINKI TGSLVSLSEQEL+DCD++YN+GC GGLMDYA+K+VI+N G
Sbjct: 152  CGACWSFSATGAIEGINKIKTGSLVSLSEQELIDCDRSYNNGCGGGLMDYAFKFVIKNGG 211

Query: 653  IDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSER 474
            IDTE+DYPY+ A+ TC KNKL  RVVTID Y+DVP N E LLLQAVA QPVSVGICGS R
Sbjct: 212  IDTEDDYPYRQADGTCNKNKLKRRVVTIDGYSDVPSNKENLLLQAVAQQPVSVGICGSAR 271

Query: 473  TFQLYSKGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGN 294
             FQLYS+GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG  WGM GYMHM R +G 
Sbjct: 272  AFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTGA 331

Query: 293  PQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCC 114
              G+CGINM+                   TKC+LLTYCPEGSTCCCSW +LGLC SWSCC
Sbjct: 332  SSGICGINMMPSFPTKTSPNPPPSPGPGPTKCNLLTYCPEGSTCCCSWRVLGLCLSWSCC 391

Query: 113  ELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3
             L++A+CCKD+RYCCPHDYPICD+   QC + +GN+S
Sbjct: 392  GLDNAICCKDNRYCCPHDYPICDTVRAQCLRANGNFS 428


>ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
            gi|241945324|gb|EES18469.1| hypothetical protein
            SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  503 bits (1295), Expect = e-140
 Identities = 247/398 (62%), Positives = 278/398 (69%), Gaps = 12/398 (3%)
 Frame = -2

Query: 1160 LFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXA--------LGLNAFA 1005
            LF++WC  HGK YA+ EE+ AR AVF D                         L LNAFA
Sbjct: 40   LFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFA 99

Query: 1004 DLAHHEFXXXXXXXXXXXAVVEPSXXXXXXXXXXXXA---PDSVDWRKKGAVTRVKDQGS 834
            DL H EF           A    S                PD++DWR+ GAVT+VKDQGS
Sbjct: 100  DLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGS 159

Query: 833  CGACWSFSATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHG 654
            CGACWSFSATGAMEGINKI TGSLVSLSEQEL+DCD++YNSGC GGLMDYAYK+V++N G
Sbjct: 160  CGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGG 219

Query: 653  IDTEEDYPYQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSER 474
            IDTEEDYPY+ A+ TC KNKL  R+VTID Y+DVP N E+LLLQAVA QPVSVGICGS R
Sbjct: 220  IDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSAR 279

Query: 473  TFQLYS-KGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSG 297
             FQLYS +GIF GPC TSLDHAVLIVGYGSE G DYWI+KNSWG SWGM GYMHM R +G
Sbjct: 280  AFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGYMHMHRNTG 339

Query: 296  NPQGVCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSC 117
            + +GVCGINM+A                  TKCSLLTYCPEGSTCCCSW ILG C SWSC
Sbjct: 340  DSKGVCGINMMASFPTKSSPNPPPSPGPGPTKCSLLTYCPEGSTCCCSWRILGFCLSWSC 399

Query: 116  CELESAVCCKDHRYCCPHDYPICDSGSKQCFKGSGNYS 3
            CEL++AVCCKD++ CCPHDYP+CD+    C K SGN S
Sbjct: 400  CELDNAVCCKDNKSCCPHDYPVCDTDRGLCLKASGNSS 437


>ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
            lyrata] gi|297335615|gb|EFH66032.1| hypothetical protein
            ARALYDRAFT_471096 [Arabidopsis lyrata subsp. lyrata]
          Length = 439

 Score =  503 bits (1294), Expect = e-140
 Identities = 236/389 (60%), Positives = 280/389 (71%), Gaps = 2/389 (0%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELF+ WC+ HGK Y S+EE+  R  +F+D                +L LNAFADL HHEF
Sbjct: 30   ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                       + +  +             PDSVDWRKKGAVT VKDQGSCGACWSFSAT
Sbjct: 90   KASRLGLSVSASSLIMASKGQSLGGNAKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ
Sbjct: 149  GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSK--G 450
              + TC K+KL  +VVTIDSY  V  N+E+ L +AVA QPVSVGICGSER FQLYS+  G
Sbjct: 209  ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSG 268

Query: 449  IFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGIN 270
            IFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R +GN +G+CGIN
Sbjct: 269  IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEGICGIN 328

Query: 269  MLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCC 90
            MLA                  TKC+L TYC  G TCCC+ ++ GLCFSW CCE+ESAVCC
Sbjct: 329  MLASYPIKTHPNPPPPSPPGPTKCNLFTYCSAGETCCCARNLFGLCFSWKCCEIESAVCC 388

Query: 89   KDHRYCCPHDYPICDSGSKQCFKGSGNYS 3
             D R+CCPHDYP+CD+    C K +GN++
Sbjct: 389  SDGRHCCPHDYPVCDTTRSLCLKKTGNFT 417


>gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
            [Arabidopsis thaliana]
          Length = 416

 Score =  496 bits (1276), Expect = e-137
 Identities = 236/388 (60%), Positives = 273/388 (70%), Gaps = 7/388 (1%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELF+ WC+ HGK Y S+EE+  R  +F+D                +L LNAFADL HHEF
Sbjct: 28   ELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 87

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFSAT 804
                         V  +             PDSVDWRKKGAVT VKDQGSCGACWSFSAT
Sbjct: 88   KASRLGLSVSAPSVIMASKGQSLGGSVKV-PDSVDWRKKGAVTNVKDQGSCGACWSFSAT 146

Query: 803  GAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPYQ 624
            GAMEGIN+IVTG L+SLSEQEL+DCDK+YN+GC GGLMDYA+++VI+NHGIDTE+DYPYQ
Sbjct: 147  GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 206

Query: 623  AAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYS---- 456
              + TC K+KL  +VVTIDSY  V  N+E+ L++AVA QPVSVGICGSER FQLYS    
Sbjct: 207  ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYSSKFY 266

Query: 455  ---KGIFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQG 285
               +GIFSGPCSTSLDHAVLIVGYGS+NGVDYWI+KNSWG SWGMDG+MHM R + N  G
Sbjct: 267  LLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 326

Query: 284  VCGINMLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELE 105
            VCGINMLA                  TKC+L TYC  G TCCC+  + GLCFSW CCE+E
Sbjct: 327  VCGINMLASYPIKTHPNPPPPSPPGPTKCNLFTYCSSGETCCCARELFGLCFSWKCCEIE 386

Query: 104  SAVCCKDHRYCCPHDYPICDSGSKQCFK 21
            SAVCCKD R+CCPHDYP+CD+    C K
Sbjct: 387  SAVCCKDGRHCCPHDYPVCDTTRSLCLK 414


>ref|XP_004500967.1| PREDICTED: oryzain alpha chain-like [Cicer arietinum]
          Length = 436

 Score =  494 bits (1273), Expect = e-137
 Identities = 236/387 (60%), Positives = 270/387 (69%), Gaps = 2/387 (0%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            +LF+ WC+ HGK Y S++EK  RF VFED                 L LNAFADL HHEF
Sbjct: 28   KLFQEWCKQHGKTYPSEQEKRYRFNVFEDNYAFVAQHNQIGNSSYTLSLNAFADLTHHEF 87

Query: 983  XXXXXXXXXXXAVVEP--SXXXXXXXXXXXXAPDSVDWRKKGAVTRVKDQGSCGACWSFS 810
                        +                   P  +DWRK GAV+ VKDQGSCGACWSFS
Sbjct: 88   KATRLGLPPSSLLRFKFNRFQDQQRSDDFLQVPSEIDWRKNGAVSIVKDQGSCGACWSFS 147

Query: 809  ATGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYP 630
            ATGA+EGINKIVTGSLVSLSEQELVDCD TYNSGC+GGLMDYAY+++I N+GIDTEEDYP
Sbjct: 148  ATGAIEGINKIVTGSLVSLSEQELVDCDTTYNSGCDGGLMDYAYQFIIDNNGIDTEEDYP 207

Query: 629  YQAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKG 450
            YQA +  C K+KL  RVVTID YTDVPPN+E+ LL+AVA QPVSVGICGS R FQLYSKG
Sbjct: 208  YQARQLLCKKDKLKRRVVTIDGYTDVPPNDEKKLLKAVAVQPVSVGICGSARAFQLYSKG 267

Query: 449  IFSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGIN 270
            IF+GPCSTSLDHAVLIVGYGSENGVDYWI+KNSWG  WGM+GY+HMLR + +  G+CGIN
Sbjct: 268  IFTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMLRNTDSSAGLCGIN 327

Query: 269  MLAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCC 90
            MLA                   KC+L TYC  G TCCC+   LG+CFSW CC + SAVCC
Sbjct: 328  MLASYPTKTKPNPPVPPPPGPIKCNLFTYCSGGETCCCAKKFLGICFSWKCCGVTSAVCC 387

Query: 89   KDHRYCCPHDYPICDSGSKQCFKGSGN 9
            KD R+CCP DYP+CD+ + QC K   N
Sbjct: 388  KDKRHCCPLDYPVCDASNGQCLKRIAN 414


>ref|XP_006473576.1| PREDICTED: cysteine proteinase RD21a-like [Citrus sinensis]
          Length = 441

 Score =  493 bits (1268), Expect = e-137
 Identities = 235/380 (61%), Positives = 264/380 (69%), Gaps = 1/380 (0%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELFE+WC+ HGK Y+S++EK  R  +FED                 L LNAFADL H EF
Sbjct: 27   ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXA-PDSVDWRKKGAVTRVKDQGSCGACWSFSA 807
                          +                P S+DWRKKGAVT VKDQ SCGACW+FSA
Sbjct: 87   KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146

Query: 806  TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627
            TGA+EGINKIVTGSLVSLSEQEL+DCD++YNSGC GGLMDYAY++VI+NHGIDTE+DYPY
Sbjct: 147  TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206

Query: 626  QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447
            +     C K KLN  +VTID Y DVP NNE+ LLQAV  QPVSVGICGSER FQLYS GI
Sbjct: 207  RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266

Query: 446  FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267
            F+GPCSTSLDHAVLI+GY SENGVDYWI+KNSWG SWGM+GYMHM R +GN  G+CGINM
Sbjct: 267  FTGPCSTSLDHAVLIIGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326

Query: 266  LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87
            LA                  T+CSLLTYC  G TCCC  SILG+C SW CC   SAVCC 
Sbjct: 327  LASYPTKTGQNPPPSPPPGPTRCSLLTYCAPGETCCCGSSILGICLSWKCCGFSSAVCCS 386

Query: 86   DHRYCCPHDYPICDSGSKQC 27
            DHRYCCP +YPICDS   QC
Sbjct: 387  DHRYCCPSNYPICDSVRHQC 406


>ref|XP_006435079.1| hypothetical protein CICLE_v10001178mg [Citrus clementina]
            gi|557537201|gb|ESR48319.1| hypothetical protein
            CICLE_v10001178mg [Citrus clementina]
          Length = 441

 Score =  493 bits (1268), Expect = e-137
 Identities = 236/380 (62%), Positives = 264/380 (69%), Gaps = 1/380 (0%)
 Frame = -2

Query: 1163 ELFESWCRWHGKNYASDEEKLARFAVFEDXXXXXXXXXXXXXXXXALGLNAFADLAHHEF 984
            ELFE+WC+ HGK Y+S++EK  R  +FED                 L LNAFADL H EF
Sbjct: 27   ELFETWCKQHGKVYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86

Query: 983  XXXXXXXXXXXAVVEPSXXXXXXXXXXXXA-PDSVDWRKKGAVTRVKDQGSCGACWSFSA 807
                          +                P S+DWRKKGAVT VKDQ SCGACW+FSA
Sbjct: 87   KASFLGFSAASIDHDRRRNASVQSPGTLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146

Query: 806  TGAMEGINKIVTGSLVSLSEQELVDCDKTYNSGCEGGLMDYAYKWVIQNHGIDTEEDYPY 627
            TGA+EGINKIVTGSLVSLSEQEL+DCD++YNSGC GGLMDYAY++VI+NHGIDTE+DYPY
Sbjct: 147  TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206

Query: 626  QAAEKTCLKNKLNHRVVTIDSYTDVPPNNEELLLQAVADQPVSVGICGSERTFQLYSKGI 447
            +     C K KLN  +VTID Y DVP NNE+ LLQAV  QPVSVGICGSER FQLYS GI
Sbjct: 207  RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266

Query: 446  FSGPCSTSLDHAVLIVGYGSENGVDYWILKNSWGNSWGMDGYMHMLRKSGNPQGVCGINM 267
            F+GPCSTSLDHAVLIVGY SENGVDYWI+KNSWG SWGM+GYMHM R +GN  G+CGINM
Sbjct: 267  FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326

Query: 266  LAXXXXXXXXXXXXXXXXXXTKCSLLTYCPEGSTCCCSWSILGLCFSWSCCELESAVCCK 87
            LA                  T+CSLLTYC  G TCCC  SILG+C SW CC   SAVCC 
Sbjct: 327  LASYPTKTGQNPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCS 386

Query: 86   DHRYCCPHDYPICDSGSKQC 27
            DHRYCCP +YPICDS   QC
Sbjct: 387  DHRYCCPSNYPICDSVRHQC 406

