BLASTX nr result

ID: Sinomenium21_contig00006955 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00006955
         (1541 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]             449   e-123
ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citr...   445   e-122
ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Ci...   444   e-122
ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [A...   426   e-116
ref|XP_006374352.1| aspartyl protease family protein [Populus tr...   425   e-116
emb|CBI15437.3| unnamed protein product [Vitis vinifera]              417   e-114
ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vi...   417   e-114
ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Br...   410   e-112
ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [So...   407   e-111
dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]    404   e-110
gb|ACN34727.1| unknown [Zea mays] gi|413923868|gb|AFW63800.1| hy...   404   e-110
ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [S...   403   e-109
ref|XP_007036501.1| Eukaryotic aspartyl protease family protein,...   402   e-109
ref|XP_007036500.1| Eukaryotic aspartyl protease family protein,...   402   e-109
ref|XP_002511959.1| protein with unknown function [Ricinus commu...   401   e-109
ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea ma...   400   e-109
ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group] g...   392   e-106
gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indi...   392   e-106
ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cu...   391   e-106
gb|EMT28382.1| Aspartic proteinase Asp1 [Aegilops tauschii]           384   e-104

>gb|EXB56943.1| Aspartic proteinase Asp1 [Morus notabilis]
          Length = 569

 Score =  449 bits (1155), Expect = e-123
 Identities = 234/461 (50%), Positives = 300/461 (65%), Gaps = 12/461 (2%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQ++GVVIITLPP DNPS GKTIT+ FTLS+ SP+                         
Sbjct: 7    PQIKGVVIITLPPPDNPSLGKTITA-FTLSNSSPTQTHQESQNQNNLPIQSPQNPQLQFP 65

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRTY---YVPSNTSRQLRGTNENKEFSSFVFTIYPKLL 503
                                 ++      +V      + R +N+++   SF+F +Y KL 
Sbjct: 66   FPRLRLFHGVPRRLFALLGISIFTLVLFSHVFPTVVEEFRRSNDDEGPESFIFPLYSKL- 124

Query: 504  GRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKL-------GESVVFPVRGN 662
            G P K    D+E KLG+ V+ ++ ++     D    +KV KL         S + PVRGN
Sbjct: 125  GVPGK---KDVELKLGRFVDFDKENAGVSFGDRVKTQKVNKLVSSTAKVDSSAILPVRGN 181

Query: 663  IYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPP 842
            +YPDGLYY  + VGNPPRPY+LDMDTGSDLTWIQCDAPCTSC+KG NPLYKPTKG IVP 
Sbjct: 182  VYPDGLYYTQILVGNPPRPYHLDMDTGSDLTWIQCDAPCTSCAKGANPLYKPTKGNIVPS 241

Query: 843  KDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNSDFVF 1022
            KD  C E++ + +PG+C++C+QC+YEI+YAD SSS GVL KD L   + NG++ N + VF
Sbjct: 242  KDSFCTEIRRNQKPGHCKTCQQCDYEIQYADRSSSLGVLAKDGLHLVMENGSLANVNVVF 301

Query: 1023 GCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHGYMFL 1202
            GCAYDQQG L  + AKTDGILGLSRA + LPSQLAS+GII+NVVGHC+ ++  G GYMFL
Sbjct: 302  GCAYDQQGLLLNTLAKTDGILGLSRAKVSLPSQLASKGIIKNVVGHCLTTNAGGGGYMFL 361

Query: 1203 GDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQR--SLGSPDSGGTQVVFDSGSSYTY 1376
            GDDFVP WGM+W+ ML SPS++FY ++ V ++YG    +LG+  S   Q+VFDSGSSYTY
Sbjct: 362  GDDFVPHWGMSWIPMLRSPSMDFYQSEIVSINYGSSALNLGAWSSKARQLVFDSGSSYTY 421

Query: 1377 FTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPIS 1499
            F K AYS L+ SLE+     L  D SDP+LP+CWRAE P++
Sbjct: 422  FNKRAYSALLASLEEVSTTGLVRDRSDPSLPICWRAETPLN 462


>ref|XP_006440945.1| hypothetical protein CICLE_v10019473mg [Citrus clementina]
            gi|557543207|gb|ESR54185.1| hypothetical protein
            CICLE_v10019473mg [Citrus clementina]
          Length = 577

 Score =  445 bits (1144), Expect = e-122
 Identities = 238/468 (50%), Positives = 297/468 (63%), Gaps = 20/468 (4%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLE--HXXXXXX 326
            PQL GVVIITLPP +NPS GKTIT+ +TL+D SP +                 H      
Sbjct: 11   PQLTGVVIITLPPPNNPSLGKTITA-YTLTDNSPQSQQTRHRQQQEHPLPPQLHPPQNSQ 69

Query: 327  XXXXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQL----RGTNENKEFSSFVFTIYP 494
                                A  ++      S  S  L    +  N+++   SFVF +Y 
Sbjct: 70   FNFSLPMLFPGLPRKLFLFLAISIFALILYGSVFSYTLQDRYKSNNDDENKESFVFPLYH 129

Query: 495  KLLGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLGESVV---------- 644
            K   R  ++   D EFKLG+ V+ +  S +  ++DG +     K+ + +V          
Sbjct: 130  KFGIR--EVSQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVSSNAVAVDSS 187

Query: 645  --FPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKP 818
              FP+RGNIYPDGLY+  M VGNPPRPYYLDMDTGSDLTWIQCDAPC+SC+KG NPLYKP
Sbjct: 188  SIFPLRGNIYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247

Query: 819  TKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGT 998
              G I+P KD LC E+Q + +PGYCE+C+QC+YEIEYADHSSS GVL +D+L   I NG+
Sbjct: 248  RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307

Query: 999  MLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDE 1178
            +   + VFGCAYDQQG L  +  KTDGILGLSRA + LPSQLASQGII+NVVGHC+ ++ 
Sbjct: 308  LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367

Query: 1179 DGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQR--SLGSPDSGGTQVVF 1352
             G GYMFLG D VP WGM WV MLDSP +  YHT+ +K++YG    +LG+ +S     +F
Sbjct: 368  GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSQVGWALF 427

Query: 1353 DSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
            D+GSSYTYFTK AYS LI SL++     L LD SDPTLPVCWRA+ PI
Sbjct: 428  DTGSSYTYFTKQAYSELIASLKEVSSDGLVLDASDPTLPVCWRAKFPI 475


>ref|XP_006485774.1| PREDICTED: aspartic proteinase Asp1-like [Citrus sinensis]
          Length = 577

 Score =  444 bits (1142), Expect = e-122
 Identities = 237/468 (50%), Positives = 297/468 (63%), Gaps = 20/468 (4%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLE--HXXXXXX 326
            PQL GVVIITLPP +NPS GKTIT+ +TL+D SP +                 H      
Sbjct: 11   PQLTGVVIITLPPPNNPSLGKTITA-YTLTDNSPQSQQTHHQQQQEHPLPAQLHPPQDSQ 69

Query: 327  XXXXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQL----RGTNENKEFSSFVFTIYP 494
                                A  ++      S  S  L    +  N+++   SFVF +Y 
Sbjct: 70   FNFSLPMLFPVLPRKLFLFLAISIFALILYGSVFSYTLQHRYKSNNDDENKESFVFPLYH 129

Query: 495  KLLGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLGESVV---------- 644
            K   R  ++   D EFKLG+ V+ +  S +  ++DG +     K+ + +V          
Sbjct: 130  KFGIR--EVLQRDAEFKLGRFVDLDGESVVASVNDGIIRPHKSKINKKLVPSNAVAVDSS 187

Query: 645  --FPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKP 818
              FP+RGN+YPDGLY+  M VGNPPRPYYLDMDTGSDLTWIQCDAPC+SC+KG NPLYKP
Sbjct: 188  STFPLRGNVYPDGLYFTYMIVGNPPRPYYLDMDTGSDLTWIQCDAPCSSCAKGANPLYKP 247

Query: 819  TKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGT 998
              G I+P KD LC E+Q + +PGYCE+C+QC+YEIEYADHSSS GVL +D+L   I NG+
Sbjct: 248  RMGNILPYKDSLCMEIQRNHKPGYCETCQQCDYEIEYADHSSSMGVLARDELHLTIENGS 307

Query: 999  MLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDE 1178
            +   + VFGCAYDQQG L  +  KTDGILGLSRA + LPSQLASQGII+NVVGHC+ ++ 
Sbjct: 308  LTKPNVVFGCAYDQQGLLLNTLVKTDGILGLSRAKVSLPSQLASQGIIKNVVGHCLTTNA 367

Query: 1179 DGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQR--SLGSPDSGGTQVVF 1352
             G GYMFLG D VP WGM WV MLDSP +  YHT+ +K++YG    +LG+ +S     +F
Sbjct: 368  GGGGYMFLGHDLVPSWGMAWVPMLDSPFMELYHTEILKINYGSSPLNLGARNSRVGWALF 427

Query: 1353 DSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
            D+GSSYTYFTK AYS LI SL++     L LD SDPTLPVCWRA+ PI
Sbjct: 428  DTGSSYTYFTKQAYSELIASLKEVSSNGLVLDASDPTLPVCWRAKFPI 475


>ref|XP_006826817.1| hypothetical protein AMTR_s00010p00056950 [Amborella trichopoda]
            gi|548831246|gb|ERM94054.1| hypothetical protein
            AMTR_s00010p00056950 [Amborella trichopoda]
          Length = 545

 Score =  426 bits (1096), Expect = e-116
 Identities = 223/454 (49%), Positives = 286/454 (62%), Gaps = 6/454 (1%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            P++QG VII+LPP D+PSKGKTIT+   +SDPS                +          
Sbjct: 4    PEIQGFVIISLPPPDDPSKGKTITAFTMVSDPSHQNENQSQNQQTQQPQIASNSIAGSSR 63

Query: 333  XXXXXXXXXXXXXXXXXXAS-YVWRTYYVPSNTSRQLRGTNENKEFSSFVFTIYPKLLGR 509
                              A  + W+     S  S     T  +K   SF++ +YPK    
Sbjct: 64   GRIGSIVVRVLAMLGAVVAVLFFWQWV---SGFSEMDYETERSKNNPSFLYNLYPKW--- 117

Query: 510  PQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLGESVVFPVRGNIYPDGLYYI 689
             ++    D   +LG  V+R+       + D      +  +  S +FPV+GN+YPDGLYYI
Sbjct: 118  SEEAIEKDAALRLGTFVKRDEVRI--GLRDVKTLEAISSINSSTIFPVKGNVYPDGLYYI 175

Query: 690  SMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQ 869
            S+ VGNP RPYYLDMDTGSDLTWIQC+APCT+C+KGP+PLY P+K  +VP KD  C EVQ
Sbjct: 176  SILVGNPRRPYYLDMDTGSDLTWIQCNAPCTNCAKGPHPLYNPSKQNLVPSKDPFCLEVQ 235

Query: 870  NDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGE 1049
             +D+  +  +  QC+Y+IEYAD SSS GVLV+D LQ  I NGT++ +  VFGCAYDQ+G+
Sbjct: 236  VNDKGKFAGASHQCDYDIEYADQSSSMGVLVRDDLQLMITNGTVIKTGLVFGCAYDQRGK 295

Query: 1050 LSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWG 1229
            L  SPAKTDGILGLS A + LPSQLAS+G+++NVVGHCI +D +G GYMFLGDDF+PQW 
Sbjct: 296  LGHSPAKTDGILGLSSAKVSLPSQLASRGLMKNVVGHCIRNDANGGGYMFLGDDFIPQWR 355

Query: 1230 MTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGGT-----QVVFDSGSSYTYFTKDAY 1394
            MTWV ML SPS N YH +  K+S G R +   D GG      +VVFDSGSSY+Y TK AY
Sbjct: 356  MTWVPMLSSPSTNAYHAEVSKISLGSRPI---DGGGLITKIGRVVFDSGSSYSYLTKQAY 412

Query: 1395 SGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
            + LI SL+D     L LD+SD TLPVCW+A++P+
Sbjct: 413  TSLIKSLKDVAEKGLVLDDSDKTLPVCWKAKSPL 446


>ref|XP_006374352.1| aspartyl protease family protein [Populus trichocarpa]
            gi|550322111|gb|ERP52149.1| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 603

 Score =  425 bits (1093), Expect = e-116
 Identities = 229/470 (48%), Positives = 291/470 (61%), Gaps = 21/470 (4%)
 Frame = +3

Query: 150  APQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXX 329
            +PQL+GVVII+LPP DNPS GKTIT+    ++  P +             +         
Sbjct: 8    SPQLKGVVIISLPPPDNPSLGKTITAFTLTNNDYPQSHQTPQTHQEDQLPISSPPPPPSQ 67

Query: 330  XXXXXXXXXXXXXXXXXXXASYVWRTYYVPS-------NTSRQLRGTN---ENKEFSSFV 479
                                S+V+ + +  +       NT ++L+  N   ++++  S+V
Sbjct: 68   NSQLQFPSSRLFLGTPRKLLSFVFISLFALAIYSSLFTNTFQELKSNNNDDDDQKPKSYV 127

Query: 480  FTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLGES------- 638
            F +Y KL  R   ++  D+E  L + V +E  + +  +D      K+ KL  S       
Sbjct: 128  FPLYHKLGIREIPLN--DLENHLRRFVYKE--NLVASVDHLNGPHKISKLASSNAAAAMD 183

Query: 639  --VVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLY 812
               +FPVRGN+YPDG          PP+PYYLD DTGSDLTWIQCDAPCTSC+KG N  Y
Sbjct: 184  SSAIFPVRGNLYPDG----------PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWY 233

Query: 813  KPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIAN 992
            KP +G IVPPKDLLC EVQ + + GYCE+C QC+YEIEYADHSSS GVL  DKL   +AN
Sbjct: 234  KPRRGNIVPPKDLLCMEVQRNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLMVAN 293

Query: 993  GTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINS 1172
            G++   +F+FGCAYDQQG L  +  KTDGILGLSRA + LPSQLASQGII NV+GHC+ +
Sbjct: 294  GSLTKLNFIFGCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTT 353

Query: 1173 DEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQR--SLGSPDSGGTQV 1346
            D  G GYMFLGDDFVP+WGM WV MLDSPS+ FYHT+ VK++YG    SLG  +S    +
Sbjct: 354  DLGGGGYMFLGDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHI 413

Query: 1347 VFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
            +FDSGSSYTYF K+AYS L+ SL +  G  L    SD TLP+CWRA  PI
Sbjct: 414  LFDSGSSYTYFPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPI 463


>emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score =  417 bits (1071), Expect = e-114
 Identities = 211/367 (57%), Positives = 261/367 (71%), Gaps = 5/367 (1%)
 Frame = +3

Query: 411  YVPSNTSRQLRGTNENKEFSSFVFTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEM 590
            +  S+   +LR  N+++E +SF+  +YPKL  R       D+E KLGK V+   +     
Sbjct: 16   FASSSPLVELRRKNDDREPTSFILPLYPKLGSRSLG----DLELKLGKFVDFHVND---- 67

Query: 591  IDDGGLFR---KVEKLGESVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWI 761
            +  GG+ +    V     S +FPVRG++YP+GLY+  +FVG+PPR Y+LDMDTGSDLTWI
Sbjct: 68   MKPGGINKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWI 127

Query: 762  QCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHS 941
            QCDAPCTSC+KGPNPLYKP KG +VP KD LC EVQ + + GYCE+C+QC+YEIEYADHS
Sbjct: 128  QCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHS 187

Query: 942  SSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQ 1121
            SS GVL  D L   +ANG++     +FGCAYDQQG L  S AKTDGILGLS+A + LPSQ
Sbjct: 188  SSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQ 247

Query: 1122 LASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSY 1301
            LASQ II NV+GHC+ SD  G GYMFLGDDFVP WGM WV ML+S S N YH++ +K+S+
Sbjct: 248  LASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISH 306

Query: 1302 GQR--SLGSPDSGGTQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVC 1475
            G R  SLG  D    +VVFD+GSSYTYF K+AY  L+ SL+D     L  D SDPTLPVC
Sbjct: 307  GSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVC 366

Query: 1476 WRAEAPI 1496
            WRA+ PI
Sbjct: 367  WRAKFPI 373


>ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score =  417 bits (1071), Expect = e-114
 Identities = 211/367 (57%), Positives = 261/367 (71%), Gaps = 5/367 (1%)
 Frame = +3

Query: 411  YVPSNTSRQLRGTNENKEFSSFVFTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEM 590
            +  S+   +LR  N+++E +SF+  +YPKL  R       D+E KLGK V+   +     
Sbjct: 229  FASSSPLVELRRKNDDREPTSFILPLYPKLGSRSLG----DLELKLGKFVDFHVND---- 280

Query: 591  IDDGGLFR---KVEKLGESVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWI 761
            +  GG+ +    V     S +FPVRG++YP+GLY+  +FVG+PPR Y+LDMDTGSDLTWI
Sbjct: 281  MKPGGINKLATSVSAFDSSTIFPVRGDVYPNGLYFTHIFVGSPPRRYFLDMDTGSDLTWI 340

Query: 762  QCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHS 941
            QCDAPCTSC+KGPNPLYKP KG +VP KD LC EVQ + + GYCE+C+QC+YEIEYADHS
Sbjct: 341  QCDAPCTSCAKGPNPLYKPKKGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHS 400

Query: 942  SSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQ 1121
            SS GVL  D L   +ANG++     +FGCAYDQQG L  S AKTDGILGLS+A + LPSQ
Sbjct: 401  SSMGVLASDDLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQ 460

Query: 1122 LASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSY 1301
            LASQ II NV+GHC+ SD  G GYMFLGDDFVP WGM WV ML+S S N YH++ +K+S+
Sbjct: 461  LASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPN-YHSQIMKISH 519

Query: 1302 GQR--SLGSPDSGGTQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVC 1475
            G R  SLG  D    +VVFD+GSSYTYF K+AY  L+ SL+D     L  D SDPTLPVC
Sbjct: 520  GSRQLSLGRQDGRTERVVFDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVC 579

Query: 1476 WRAEAPI 1496
            WRA+ PI
Sbjct: 580  WRAKFPI 586


>ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score =  410 bits (1054), Expect = e-112
 Identities = 227/469 (48%), Positives = 275/469 (58%), Gaps = 19/469 (4%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQL GVVIITLPP D PSKGKTIT+     DP                            
Sbjct: 18   PQLHGVVIITLPPPDQPSKGKTITAYTYTDDPGTPPTPPPPPRRPRSGMDPAAARRPRRV 77

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGTNENK------EFSSFVFTIYP 494
                              A+Y    Y   S+ + Q  G  E +      E  SF+  +YP
Sbjct: 78   VSPRRAAAMVLVLGAFALAAY----YCFYSDVAVQFLGVEEEEVEKERNETRSFLLPLYP 133

Query: 495  KLL-GRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKL----------GESV 641
            K   GR  +  G DI+    KI            DDGG+ + V KL            +V
Sbjct: 134  KTRQGRALREFG-DIKLAAKKI------------DDGGVRKGVNKLEAKRATSAGTNSTV 180

Query: 642  VFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPT 821
            + P++GN++PDG YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYKP 
Sbjct: 181  LLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPA 240

Query: 822  KGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTM 1001
            K KIVPP+DLLC E+Q D    YC +CKQC+YEIEYAD SSS GVL KD +     NG  
Sbjct: 241  KEKIVPPRDLLCQELQGDQN--YCATCKQCDYEIEYADRSSSMGVLAKDDMHMIATNGGR 298

Query: 1002 LNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDED 1181
               DFVFGCAYDQQG+L  SPAKTDGILGLS A I LPSQLASQGII NV GHCI  + +
Sbjct: 299  EKLDFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEPN 358

Query: 1182 GHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGGT--QVVFD 1355
            G GYMFLGDD+VP+WGMTW  +   P  N YHT+  KV+YG + L      G+  QV+FD
Sbjct: 359  GGGYMFLGDDYVPRWGMTWAPIRGGPD-NLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417

Query: 1356 SGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
            SGSSYTY   + Y  L+T+++         D SD TLP+CW+A+  + Y
Sbjct: 418  SGSSYTYLPDEIYKKLVTAIKYDYPS-FVQDTSDTTLPLCWKADFDVRY 465


>ref|XP_006354730.1| PREDICTED: aspartic proteinase Asp1-like [Solanum tuberosum]
          Length = 558

 Score =  407 bits (1045), Expect = e-111
 Identities = 222/456 (48%), Positives = 281/456 (61%), Gaps = 7/456 (1%)
 Frame = +3

Query: 150  APQLQGVVIITLPPSDNPSKGKTITSIFTLSD-PSPSTXXXXXXXXXXXXXLEHXXXXXX 326
            +P +QGVVIITLPP DNPS GKTIT+ FTLSD P+                 +       
Sbjct: 7    SPPIQGVVIITLPPPDNPSYGKTITA-FTLSDSPTHQQQQEEEPPQQSQPHNQDLNTGVL 65

Query: 327  XXXXXXXXXXXXXXXXXXXXASYVWRTYY--VPSNTSRQLRGTNENKEFS--SFVFTIYP 494
                                 S +  +++  +   T  +LR    + + S  SF+  +YP
Sbjct: 66   RASLERSFFFRPKIVFGLLGISLIALSFWSSLTQETLFELRDVEHDHKSSNSSFILPLYP 125

Query: 495  KLLGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLGESVVFPVRGNIYPD 674
            K  G        D+EFKLG+ V+ +    ++            KL  SV FPVRGNI+ +
Sbjct: 126  KRGGAWNSRR--DVEFKLGRFVDFKPDKFMDQEKIAKSLSAATKLDSSVNFPVRGNIHSE 183

Query: 675  GLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLL 854
            GLYY  M VGNPPRPY+LD+DTGSDL WIQCDAPCTSC+KG +PLYKP    ++PPK+  
Sbjct: 184  GLYYTYMLVGNPPRPYFLDIDTGSDLMWIQCDAPCTSCAKGAHPLYKPRNVNMIPPKNPY 243

Query: 855  CAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAY 1034
            C EVQ + +  YC++C QC+YEIEYAD SSS GVL KD+LQ  +ANGT      VFGCAY
Sbjct: 244  CVEVQENLKSKYCDNCHQCDYEIEYADRSSSVGVLAKDELQLVLANGTGTKPSVVFGCAY 303

Query: 1035 DQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDF 1214
            DQQG L  + A TDGILGLSRA I LPSQLAS G+I NV+GHC+ +D +G GY+FLG+DF
Sbjct: 304  DQQGTLLNTLASTDGILGLSRAPISLPSQLASHGLINNVIGHCLRTDTNG-GYLFLGNDF 362

Query: 1215 VPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRS--LGSPDSGGTQVVFDSGSSYTYFTKD 1388
            VPQW M+WV ML++P  N Y  + +K++YG +   LGS   G   VVFDSGS+YTYFT  
Sbjct: 363  VPQWRMSWVPMLNNPFPNLYQAQLMKMNYGGKELRLGSTSYGQGTVVFDSGSTYTYFTDQ 422

Query: 1389 AYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
            AY  LI+ LE+     L  D SD TLP+CWRA+ P+
Sbjct: 423  AYKALISMLEEISSEDLIKDASDTTLPICWRAKFPV 458


>dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  404 bits (1039), Expect = e-110
 Identities = 225/463 (48%), Positives = 280/463 (60%), Gaps = 18/463 (3%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQL GVVIITLPP D PSKGKTIT+ FT +D   +                         
Sbjct: 15   PQLHGVVIITLPPPDQPSKGKTITA-FTYTDEPGAGAPSPPHPHRGPPMAAAGREARRSR 73

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGTNENK------EFSSFVFTIYP 494
                              A   + ++Y  S+ + Q  G  E +      E  SF+F +YP
Sbjct: 74   RAGSPRRAAAMVLALGALALAAYYSFY--SDVAVQFLGMEEEEAQRERNETKSFLFQLYP 131

Query: 495  KL-LGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEK-----------LGES 638
            K   GR  +  G   + KL          + + +DDGG  RKV K              +
Sbjct: 132  KAHQGRGLREFG---DIKL----------AAKRVDDGG--RKVTKKLDVKGAASAGTNST 176

Query: 639  VVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKP 818
            V+ P++GN++PDG YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYKP
Sbjct: 177  VLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKP 236

Query: 819  TKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGT 998
             K KIVPP+D LC E+Q D    YCE+CKQC+YEIEYAD SSS GVL KD +     NG 
Sbjct: 237  AKEKIVPPRDSLCQELQGDQN--YCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGG 294

Query: 999  MLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDE 1178
                DFVFGCAYDQQG+L  SPAKTDGILGLS A I LPSQLAS+GII NV GHCI  + 
Sbjct: 295  REKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354

Query: 1179 DGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGGTQVVFDS 1358
            +G GYMFLGDD+VP+WGMTW  +   P  N YHT+  KV+YG + L + +S   QV+FDS
Sbjct: 355  NGGGYMFLGDDYVPRWGMTWAPIRGGPD-NLYHTEAQKVNYGDQELHAGNS--VQVIFDS 411

Query: 1359 GSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAE 1487
            GSSYTY  ++ Y  LI ++++        D SD TLP+CW+A+
Sbjct: 412  GSSYTYLPEEMYKNLIDAIKED-SPSFVQDSSDTTLPLCWKAD 453


>gb|ACN34727.1| unknown [Zea mays] gi|413923868|gb|AFW63800.1| hypothetical protein
            ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  404 bits (1039), Expect = e-110
 Identities = 218/466 (46%), Positives = 275/466 (59%), Gaps = 16/466 (3%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQL GVVIITLPP+D PSKGKT+T+    +DP P                          
Sbjct: 16   PQLHGVVIITLPPADQPSKGKTVTAFAYTNDPPPPRSPPDPVMGYPAAT------EARRR 69

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGTNENKE----FSSFVFTIYPKL 500
                              A  V   Y   S+ + Q  G  + +E      SF+  +YPK 
Sbjct: 70   PRRALSTRRVATAALVLGALAVAAYYCFYSDVAVQFLGMEQEEEQRNETRSFLLPLYPKA 129

Query: 501  L-GRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRK---------VEKLGESVVFP 650
              GR  +      EF   K+  R        +DDGG   +           +   + + P
Sbjct: 130  RQGRALR------EFGDVKLAARR-------VDDGGRKARNRMEVAKAATARTNSTALLP 176

Query: 651  VRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGK 830
            ++GN++PDG YY S+F+GNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYKP K K
Sbjct: 177  IKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEK 236

Query: 831  IVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNS 1010
            IVPP+DLLC E+Q +    YCE+CKQC+YEIEYAD SSS GVL +D +     NG     
Sbjct: 237  IVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL 294

Query: 1011 DFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHG 1190
            DFVFGCAYDQQG+L  SPAKTDGILGLS A I  PSQLAS GII NV GHCI  ++ G G
Sbjct: 295  DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGG 354

Query: 1191 YMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGGT--QVVFDSGS 1364
            YMFLGDD+VP+WG+TW S+   P  N YHT+   V YG + L  P+  G+  QV+FDSGS
Sbjct: 355  YMFLGDDYVPRWGVTWTSIRSGPD-NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGS 413

Query: 1365 SYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
            SYTY   + Y  L+ +++ +  G    D SD TLP+CW+A+ P+ Y
Sbjct: 414  SYTYLPNEIYENLVAAIKYASPG-FVQDTSDRTLPLCWKADFPVRY 458


>ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
            gi|241932440|gb|EES05585.1| hypothetical protein
            SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score =  403 bits (1035), Expect = e-109
 Identities = 218/456 (47%), Positives = 276/456 (60%), Gaps = 6/456 (1%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQL GVVIITLPPSD PSKGKTIT+ FT +D +P                +         
Sbjct: 14   PQLHGVVIITLPPSDQPSKGKTITA-FTYTDDAPPPPRPPEPVMGYPAATQVRRRPRRVL 72

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRT-YYVPSNTSRQLRGTNENKEFSSFVFTIYPKLL-G 506
                              A Y + +   V      Q     +  E  SF+  ++PK   G
Sbjct: 73   STRRVAAAALVLGALAVAAYYCFYSDVAVQFLGMEQEEAQKDRNETRSFLLPLHPKARQG 132

Query: 507  RPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKLG--ESVVFPVRGNIYPDGL 680
            R  +  G D++    +I +  R +  +M        K    G   + + P++GN++PDG 
Sbjct: 133  RALREFG-DVKLAARRIDDGWRKARNKME-----VAKAAAAGTNSTALLPIKGNVFPDGQ 186

Query: 681  YYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCA 860
            YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYKPTK KIVPP+DLLC 
Sbjct: 187  YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPTKEKIVPPRDLLCQ 246

Query: 861  EVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQ 1040
            E+Q +    YCE+CKQC+YEIEYAD SSS GVL +D +     NG     DFVFGCAYDQ
Sbjct: 247  ELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHLIATNGGREKLDFVFGCAYDQ 304

Query: 1041 QGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDFVP 1220
            QG+L  SPAKTDGILGLS A I LPSQLAS GII N+ GHCI  ++ G GYMFLGDD+VP
Sbjct: 305  QGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQGGGGYMFLGDDYVP 364

Query: 1221 QWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGG--TQVVFDSGSSYTYFTKDAY 1394
            +WG+TW S+   P  N YHT+   V YG + L   +  G   QV+FDSGSSYTY   + Y
Sbjct: 365  RWGITWTSIRSGPD-NLYHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPDEIY 423

Query: 1395 SGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
              L+ +++ +  G    D SD TLP+CW+A+ P+ Y
Sbjct: 424  ENLVAAIKYASPG-FVQDSSDRTLPLCWKADFPVRY 458


>ref|XP_007036501.1| Eukaryotic aspartyl protease family protein, putative isoform 2
            [Theobroma cacao] gi|508773746|gb|EOY21002.1| Eukaryotic
            aspartyl protease family protein, putative isoform 2
            [Theobroma cacao]
          Length = 520

 Score =  402 bits (1033), Expect = e-109
 Identities = 205/375 (54%), Positives = 259/375 (69%), Gaps = 14/375 (3%)
 Frame = +3

Query: 420  SNTSRQLRGTN--ENKEFSSFVFTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEMI 593
            SNT  +LR +N  ++++  SF+F +Y KL        G D+E KLG+ V+ ++ + +  +
Sbjct: 110  SNTFVELRNSNNDDDEKPQSFIFPLYHKL--------GADLELKLGRFVDVDKENLVASV 161

Query: 594  DDGGL-FRKVEKLGES---------VVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTG 743
            + G    +K+ KL  S          + PVRGN+YPDGLY+  M VGNP R Y+LD+DTG
Sbjct: 162  EGGATGTQKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTG 221

Query: 744  SDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEI 923
            SDLTWIQCDAPC+SC+KG NPLYKPT+  IV  KDL+C EVQ + +P  CE+C+QC+YEI
Sbjct: 222  SDLTWIQCDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEI 281

Query: 924  EYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRAT 1103
            EYAD SSS GVL +D+L    ANG+  N D VFGCAYDQQG L  + +KTDGILGLSRA 
Sbjct: 282  EYADRSSSLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAK 341

Query: 1104 IGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTK 1283
            + LPSQLAS+GII NVVGHC+ +D    GYMFLGDDFVP WGM+WV ML SPS  FYHT+
Sbjct: 342  VSLPSQLASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQ 401

Query: 1284 TVKVSYGQR--SLGSPDSGGTQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESD 1457
             VK++YG    SLG   S   +VVFDSGSSYTYF K AY+ L+ SL +        D +D
Sbjct: 402  IVKINYGSSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVAD 461

Query: 1458 PTLPVCWRAEAPISY 1502
             TLP+CW+A  PI +
Sbjct: 462  TTLPMCWQAPFPIRF 476


>ref|XP_007036500.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|508773745|gb|EOY21001.1| Eukaryotic
            aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 576

 Score =  402 bits (1033), Expect = e-109
 Identities = 205/375 (54%), Positives = 259/375 (69%), Gaps = 14/375 (3%)
 Frame = +3

Query: 420  SNTSRQLRGTN--ENKEFSSFVFTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEMI 593
            SNT  +LR +N  ++++  SF+F +Y KL        G D+E KLG+ V+ ++ + +  +
Sbjct: 110  SNTFVELRNSNNDDDEKPQSFIFPLYHKL--------GADLELKLGRFVDVDKENLVASV 161

Query: 594  DDGGL-FRKVEKLGES---------VVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTG 743
            + G    +K+ KL  S          + PVRGN+YPDGLY+  M VGNP R Y+LD+DTG
Sbjct: 162  EGGATGTQKINKLVASNAAVIDSSGTILPVRGNVYPDGLYFTYMLVGNPQRRYFLDIDTG 221

Query: 744  SDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEI 923
            SDLTWIQCDAPC+SC+KG NPLYKPT+  IV  KDL+C EVQ + +P  CE+C+QC+YEI
Sbjct: 222  SDLTWIQCDAPCSSCAKGANPLYKPTRVNIVASKDLMCTEVQKNQKPQNCETCQQCDYEI 281

Query: 924  EYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRAT 1103
            EYAD SSS GVL +D+L    ANG+  N D VFGCAYDQQG L  + +KTDGILGLSRA 
Sbjct: 282  EYADRSSSLGVLARDELHLVTANGSTTNLDVVFGCAYDQQGILLNTLSKTDGILGLSRAK 341

Query: 1104 IGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTK 1283
            + LPSQLAS+GII NVVGHC+ +D    GYMFLGDDFVP WGM+WV ML SPS  FYHT+
Sbjct: 342  VSLPSQLASKGIINNVVGHCLATDVGASGYMFLGDDFVPNWGMSWVPMLGSPSTEFYHTQ 401

Query: 1284 TVKVSYGQR--SLGSPDSGGTQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESD 1457
             VK++YG    SLG   S   +VVFDSGSSYTYF K AY+ L+ SL +        D +D
Sbjct: 402  IVKINYGSSSLSLGRQHSSIGRVVFDSGSSYTYFMKQAYAELVASLSEVSEVGFIQDVAD 461

Query: 1458 PTLPVCWRAEAPISY 1502
             TLP+CW+A  PI +
Sbjct: 462  TTLPMCWQAPFPIRF 476


>ref|XP_002511959.1| protein with unknown function [Ricinus communis]
            gi|223549139|gb|EEF50628.1| protein with unknown function
            [Ricinus communis]
          Length = 583

 Score =  401 bits (1031), Expect = e-109
 Identities = 209/382 (54%), Positives = 266/382 (69%), Gaps = 12/382 (3%)
 Frame = +3

Query: 387  ASYVWRTYYVPSNTSRQLRGTNENKE--FSSFVFTIYPKLLGRPQKIHGLDIEFKLGKIV 560
            A  V+R+ +  SNT  +L+ ++++ +    SF+F +Y K   R  +I   ++E K  + V
Sbjct: 104  AVIVYRSLF--SNTLLELKVSDDDNDEKTKSFIFPLYHKFGIR--EISQSNLEHKSIRSV 159

Query: 561  ERERSSSLEMIDDGGLFRKVEKLGES--------VVFPVRGNIYPDGLYYISMFVGNPPR 716
             +E   +    DD  +  +  KL  S         VFPVRGN+YPDGLY+  + VGNPPR
Sbjct: 160  YKESLVASVNDDDVIVPNRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPR 219

Query: 717  PYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCE 896
            PYYLD+DT SDLTWIQCDAPCTSC+KG N LYKP +  IV PKD LC E+  + + GYCE
Sbjct: 220  PYYLDIDTASDLTWIQCDAPCTSCAKGANALYKPRRDNIVTPKDSLCVELHRNQKAGYCE 279

Query: 897  SCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTD 1076
            +C+QC+YEIEYADHSSS GVL +D+L   +ANG+  N  F FGCAYDQQG L  +  KTD
Sbjct: 280  TCQQCDYEIEYADHSSSMGVLARDELHLTMANGSSTNLKFNFGCAYDQQGLLLNTLVKTD 339

Query: 1077 GILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDS 1256
            GILGLS+A + LPSQLA++GII NVVGHC+ +D  G GYMFLGDDFVP+WGM+WV MLDS
Sbjct: 340  GILGLSKAKVSLPSQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMSWVPMLDS 399

Query: 1257 PSINFYHTKTVKVSYGQ--RSLGSPDSGGTQVVFDSGSSYTYFTKDAYSGLITSLEDSLG 1430
            PSI+ Y T+ +K++YG    SLG  +    ++VFDSGSSYTYFTK+AYS L+ SL+   G
Sbjct: 400  PSIDSYQTQIMKLNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASLKQVSG 459

Query: 1431 GRLTLDESDPTLPVCWRAEAPI 1496
              L  D SDPTLP CWRA+ PI
Sbjct: 460  EALIQDTSDPTLPFCWRAKFPI 481


>ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
            gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score =  400 bits (1028), Expect = e-109
 Identities = 217/466 (46%), Positives = 274/466 (58%), Gaps = 16/466 (3%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSDPSPSTXXXXXXXXXXXXXLEHXXXXXXXX 332
            PQL GVVIITLPP+D PSKGKT+T+    +DP P                          
Sbjct: 16   PQLHGVVIITLPPADQPSKGKTVTAFAYTNDPPPPRSPPDPVMGYPAAT------EARRR 69

Query: 333  XXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGTNENKE----FSSFVFTIYPKL 500
                              A  V   Y   S+ + Q  G  + +E      SF+  +YPK 
Sbjct: 70   PRRALSTRRVATAALVLGALAVAAYYCFYSDVAVQFLGMEQEEEQRNETRSFLLPLYPKA 129

Query: 501  L-GRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRK---------VEKLGESVVFP 650
              GR  +      EF   K+  R        +DDGG   +           +   + + P
Sbjct: 130  RQGRALR------EFGDVKLAARR-------VDDGGRKARNRMEVAKAATARTNSTALLP 176

Query: 651  VRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYKPTKGK 830
            ++GN++PDG YY S+F+GNPPRPY+LD+DTGSDLTWIQCDAPCT+ +KGP+PLYKP K K
Sbjct: 177  IKGNVFPDGQYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEK 236

Query: 831  IVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANGTMLNS 1010
            IVPP+DLLC E+Q +    YCE+CKQC+YEIEYAD SSS GVL +D +     NG     
Sbjct: 237  IVPPRDLLCQELQGNQN--YCETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL 294

Query: 1011 DFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSDEDGHG 1190
            DFVFGCAYDQQG+L  SPAKTDGILGLS A I  PSQLAS GII NV GHCI  ++ G G
Sbjct: 295  DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGG 354

Query: 1191 YMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGGT--QVVFDSGS 1364
            YMFLGDD+VP+WG+TW S+   P  N YHT+   V YG + L  P+  G+  QV+FDSGS
Sbjct: 355  YMFLGDDYVPRWGVTWTSIRSGPD-NLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGS 413

Query: 1365 SYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
            SYTY   + Y  L+ +++ +  G    D SD TLP+CW+A+ P+ Y
Sbjct: 414  SYTYLPNEIYENLVAAIKYASPG-FVQDTSDRTLPLCWKADFPVRY 458


>ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
            gi|46390468|dbj|BAD15929.1| putative nucellin-like
            aspartic protease [Oryza sativa Japonica Group]
            gi|46390864|dbj|BAD16368.1| putative nucellin-like
            aspartic protease [Oryza sativa Japonica Group]
            gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa
            Japonica Group] gi|215697021|dbj|BAG91015.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|222623612|gb|EEE57744.1| hypothetical protein
            OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score =  392 bits (1008), Expect = e-106
 Identities = 225/471 (47%), Positives = 276/471 (58%), Gaps = 21/471 (4%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSD----PSPSTXXXXXXXXXXXXXLEHXXXX 320
            PQL GVVIITLPP D PSKGKTIT+ FT +D    P P T                    
Sbjct: 17   PQLHGVVIITLPPPDQPSKGKTITA-FTYTDDDVTPPPPTPPPTHLPTRALVPAGAGAGA 75

Query: 321  XXXXXXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGT-----NENKEFSSFVFT 485
                                  A  V   Y   S+ + Q  G      NE  E  SF+  
Sbjct: 76   EARRSRRGFSPRRAAAMVLVLGALAVAAYYSFYSDVAVQFLGMQEEAQNERNETKSFLLP 135

Query: 486  IYPKLL-GRPQKIHGLDIEFKLGKI-------VERERSSSLEMIDDGGLFRKVEKLG--E 635
            +YPK   GR  +  G DI+    +        V R+  + LE+       +K    G   
Sbjct: 136  LYPKARQGRALREFG-DIKLAARRFDNDGGGGVGRKSRNKLEV-------KKAAAAGTNS 187

Query: 636  SVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYK 815
            + + P++GN++PDG YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYK
Sbjct: 188  TALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYK 247

Query: 816  PTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANG 995
            P K KIVPPKDLLC E+Q +    YCE+CKQC+YEIEYAD SSS GVL +D +     NG
Sbjct: 248  PAKEKIVPPKDLLCQELQGNQN--YCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNG 305

Query: 996  TMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSD 1175
                 DFVFGCAYDQQG+L  SPAKTDGILGLS A I LPSQLA+QGII NV GHCI  D
Sbjct: 306  GREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRD 365

Query: 1176 EDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGG--TQVV 1349
             +G GYMFLGDD+VP+WGMT   +  +P  N +HT+  KV YG + L    + G   QV+
Sbjct: 366  PNGGGYMFLGDDYVPRWGMTSTPIRSAPD-NLFHTEAQKVYYGDQQLSMRGASGNSVQVI 424

Query: 1350 FDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
            FDSGSSYTY   + Y  LI +++ +       D SD TLP+C   + P+ Y
Sbjct: 425  FDSGSSYTYLPDEIYKNLIAAIKYAY-PNFVQDSSDRTLPLCLATDFPVRY 474


>gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score =  392 bits (1008), Expect = e-106
 Identities = 225/471 (47%), Positives = 276/471 (58%), Gaps = 21/471 (4%)
 Frame = +3

Query: 153  PQLQGVVIITLPPSDNPSKGKTITSIFTLSD----PSPSTXXXXXXXXXXXXXLEHXXXX 320
            PQL GVVIITLPP D PSKGKTIT+ FT +D    P P T                    
Sbjct: 18   PQLHGVVIITLPPPDQPSKGKTITA-FTYTDDDVTPPPPTPPPTHLPTRALVPAGAGAGA 76

Query: 321  XXXXXXXXXXXXXXXXXXXXXXASYVWRTYYVPSNTSRQLRGT-----NENKEFSSFVFT 485
                                  A  V   Y   S+ + Q  G      NE  E  SF+  
Sbjct: 77   EARRSRRGFSPRRAAAMVLVLGALAVAAYYSFYSDVAVQFLGMQEEAQNERNETKSFLLP 136

Query: 486  IYPKLL-GRPQKIHGLDIEFKLGKI-------VERERSSSLEMIDDGGLFRKVEKLG--E 635
            +YPK   GR  +  G DI+    +        V R+  + LE+       +K    G   
Sbjct: 137  LYPKARQGRALREFG-DIKLAARRFDNDGGGGVGRKSRNKLEV-------KKAAAAGTNS 188

Query: 636  SVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPNPLYK 815
            + + P++GN++PDG YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPCT+C+KGP+PLYK
Sbjct: 189  TALLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYK 248

Query: 816  PTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQRIANG 995
            P K KIVPPKDLLC E+Q +    YCE+CKQC+YEIEYAD SSS GVL +D +     NG
Sbjct: 249  PAKEKIVPPKDLLCQELQGNQN--YCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNG 306

Query: 996  TMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHCINSD 1175
                 DFVFGCAYDQQG+L  SPAKTDGILGLS A I LPSQLA+QGII NV GHCI  D
Sbjct: 307  GREKLDFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRD 366

Query: 1176 EDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLGSPDSGG--TQVV 1349
             +G GYMFLGDD+VP+WGMT   +  +P  N +HT+  KV YG + L    + G   QV+
Sbjct: 367  PNGGGYMFLGDDYVPRWGMTSTPIRSAPD-NLFHTEAQKVYYGDQQLSMRGASGNSVQVI 425

Query: 1350 FDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPISY 1502
            FDSGSSYTY   + Y  LI +++ +       D SD TLP+C   + P+ Y
Sbjct: 426  FDSGSSYTYLPDEIYKNLIAAIKYAY-PNFVQDSSDRTLPLCLATDFPVRY 475


>ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
            gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic
            proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score =  391 bits (1005), Expect = e-106
 Identities = 221/473 (46%), Positives = 282/473 (59%), Gaps = 26/473 (5%)
 Frame = +3

Query: 156  QLQGVVIITLPPSDNPSKGKTITSIFTLSD--PSPSTXXXXXXXXXXXXXLEHXXXXXXX 329
            +++GVV+ITLPP DNPS GK++T+ FTL+D  P P                +H       
Sbjct: 5    KIKGVVVITLPPPDNPSLGKSVTA-FTLTDDFPEPPGESVAVDQEVQQPNNDHLTLPPNL 63

Query: 330  XXXXXXXXXXXXXXXXXXXAS----------YVWRTYYVPSN---TSRQLRGTNENKEF- 467
                                +           +   Y   SN   T R+LR +  N +  
Sbjct: 64   PIQAPLSQRSIPLSRELFAGTPRKLVFVLGIALAAVYLYASNFPETIRELRRSERNDDDR 123

Query: 468  -SSFVFTIYPKLLGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKL----- 629
             SSF+F +Y     + +     D + KLG+ V   +       +D     K  KL     
Sbjct: 124  PSSFLFPLY----FQSELGDSSDFQLKLGRTVRVNKDDLGVRFNDVLGVPKPSKLISASL 179

Query: 630  --GESVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPCTSCSKGPN 803
                S VFPVRG+IYPDGLYY  + VG PPRPY+LD+DTGSDLTW+QCDAPC+SC KG +
Sbjct: 180  KSDSSAVFPVRGDIYPDGLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRS 239

Query: 804  PLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVLVKDKLQQR 983
            PLYKP +  +V  KD LC EVQ + +   C +C+QCNYE++YAD SSS GVLVKD+   R
Sbjct: 240  PLYKPRRENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLR 299

Query: 984  IANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGIIRNVVGHC 1163
             +NG++   + +FGCAYDQQG L  + +KTDGILGLSRA + LPSQLAS+GII NVVGHC
Sbjct: 300  FSNGSLTKLNAIFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHC 359

Query: 1164 INSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQ--RSLGSPDSGG 1337
            +  D  G GY+FLGDDFVPQWGM WV+MLDSPSI+FY TK V++ YG    SL +  S  
Sbjct: 360  LTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSR 419

Query: 1338 TQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAEAPI 1496
             QVVFDSGSSYTYFTK+AY  L+ +LE+     L L +S  T  +CW+ E  I
Sbjct: 420  EQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDT--ICWKTEQSI 470


>gb|EMT28382.1| Aspartic proteinase Asp1 [Aegilops tauschii]
          Length = 473

 Score =  384 bits (987), Expect = e-104
 Identities = 194/358 (54%), Positives = 241/358 (67%), Gaps = 13/358 (3%)
 Frame = +3

Query: 453  ENKEFSSFVFTIYPKL-LGRPQKIHGLDIEFKLGKIVERERSSSLEMIDDGGLFRKVEKL 629
            E  E  SF+F +YPK   GR  +  G   + KL          + + +DDGG  +  +KL
Sbjct: 35   ERNETKSFLFQLYPKAHQGRALREFG---DIKL----------AAKRVDDGGGRKVTKKL 81

Query: 630  ----------GESVVFPVRGNIYPDGLYYISMFVGNPPRPYYLDMDTGSDLTWIQCDAPC 779
                        +V+ P++GN++PDG YY S+FVGNPPRPY+LD+DTGSDLTWIQCDAPC
Sbjct: 82   DVKGATSAGTNSTVLLPIKGNVFPDGQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC 141

Query: 780  TSCSKGPNPLYKPTKGKIVPPKDLLCAEVQNDDEPGYCESCKQCNYEIEYADHSSSRGVL 959
            T+C++GP+PLYKP K KIVPP+DLLC E+Q D    YCE+CKQC+YEIEYAD SSS GVL
Sbjct: 142  TNCAQGPHPLYKPAKEKIVPPRDLLCQELQGDQN--YCETCKQCDYEIEYADRSSSMGVL 199

Query: 960  VKDKLQQRIANGTMLNSDFVFGCAYDQQGELSVSPAKTDGILGLSRATIGLPSQLASQGI 1139
             KD +     NG     DFVFGCAYDQQG+L  SPAKTDGILGLS A I LPSQLAS+GI
Sbjct: 200  AKDDMHLIATNGGKEKLDFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGI 259

Query: 1140 IRNVVGHCINSDEDGHGYMFLGDDFVPQWGMTWVSMLDSPSINFYHTKTVKVSYGQRSLG 1319
            I N+ GHCI  + +G GYMFLGDD+VP+WGMTW  +   P  N YHT+  KV+YG + L 
Sbjct: 260  ISNIFGHCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPD-NLYHTEAQKVNYGDQELS 318

Query: 1320 SPDSGG--TQVVFDSGSSYTYFTKDAYSGLITSLEDSLGGRLTLDESDPTLPVCWRAE 1487
                 G   QV+FDSGSSYTY  ++ Y  LI +++D        D SD TLP+CW+A+
Sbjct: 319  MHGHAGNSVQVIFDSGSSYTYLPEEMYKNLIDAIKDD-SPNFVQDSSDTTLPLCWKAD 375

