BLASTX nr result

ID: Ephedra25_contig00000832 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00000832
         (1440 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABK25480.1| unknown [Picea sitchensis]                             426   e-116
ref|XP_002300215.2| aspartyl protease family protein [Populus tr...   392   e-106
ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|5...   385   e-104
ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,...   382   e-103
ref|XP_006828037.1| hypothetical protein AMTR_s00008p00256490 [A...   380   e-103
ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [S...   380   e-102
ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1...   379   e-102
ref|XP_004975767.1| PREDICTED: aspartic proteinase nepenthesin-1...   377   e-102
ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1...   377   e-102
emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group] g...   376   e-101
ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group] g...   376   e-101
ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1...   373   e-101
gb|EXB80380.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    370   1e-99
ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1...   370   1e-99
ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   370   1e-99
gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theo...   369   1e-99
gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]        369   2e-99
ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1...   369   2e-99
ref|NP_565298.2| aspartyl protease family protein [Arabidopsis t...   368   4e-99
gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indi...   365   2e-98

>gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  426 bits (1095), Expect = e-116
 Identities = 221/416 (53%), Positives = 281/416 (67%), Gaps = 16/416 (3%)
 Frame = +1

Query: 10   PNLSLRVELLRRDYK------ENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQT 171
            P + LR++L+R D         N+++TER +R ++RS +RL+K + +    +D     + 
Sbjct: 51   PLIGLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQMS----VDEVKAVEA 106

Query: 172  DVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSG 351
             V  G GEFLMK+ IGTP+ ++ AILDTGSDLTWTQC+PC  CY Q  PIYDP++S+T  
Sbjct: 107  PVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYS 166

Query: 352  TTPCGAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQD 531
              PC + +C ALP ++C  + CEYLY YGD SST G L+ E+FTL+SQ +P + FGCGQ+
Sbjct: 167  KVPCSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQSLPHIAFGCGQE 226

Query: 532  NEGGGFSPSDGLVGFGRGPLSLVSQLGTT---KFSYCLTSV--SAKATSPLFL--XXXXX 690
            NEGGGFS   GLVGFGRGPLSL+SQLG +   KFSYCL S+  S   TSPLF+       
Sbjct: 227  NEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTASLN 286

Query: 691  XXXXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITH 870
                   PL++S   PTFYYLSLEG+S+G   L I  GTFDLQ DGTGG+IIDSGTT+T+
Sbjct: 287  AKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVTY 346

Query: 871  LEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVL 1041
            LEQ+ Y+ +  A+ S++ L  V  S +GLDLCF  P  G     FP +T  F  GA+  L
Sbjct: 347  LEQSGYDVVKKAVISSINLPQVDGSNIGLDLCF-EPQSGSSTSHFPTITFHFE-GADFNL 404

Query: 1042 PAENYLIQDSSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            P ENY+  DSS + CLAMLPSNGMSI GNIQQQN+QI+YD   N LSFA T C +L
Sbjct: 405  PKENYIYTDSSGIACLAMLPSNGMSIFGNIQQQNYQILYDNERNVLSFAPTVCDTL 460


>ref|XP_002300215.2| aspartyl protease family protein [Populus trichocarpa]
            gi|550348628|gb|EEE85020.2| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 439

 Score =  392 bits (1008), Expect = e-106
 Identities = 207/402 (51%), Positives = 255/402 (63%), Gaps = 7/402 (1%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLM 204
            RV L   D  +NLT  ER+R GV+R   RLQ+ +A  +    +    +  V PG GEFLM
Sbjct: 41   RVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVA-SSSSEIEAPVLPGNGEFLM 99

Query: 205  KIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCNA 384
            K+ IGTP  TY AILDTGSDL WTQC+PC  C+ QS PI+DP KS++     C + LC A
Sbjct: 100  KLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEA 159

Query: 385  LPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDG 564
            LP+ +C N+ CEYLY YGDYSST G LA+ET T     +P + FGCG DNEG GFS   G
Sbjct: 160  LPQSSC-NNGCEYLYSYGDYSSTQGILASETLTFGKASVPHVAFGCGADNEGSGFSQGAG 218

Query: 565  LVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL----XXXXXXXXXXXXPLIRSTM 732
            LVG GRGPLSLVSQL   KFSYCLT+V    TS L +                PLI S  
Sbjct: 219  LVGLGRGPLSLVSQLKEPKFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPA 278

Query: 733  HPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALS 912
            HP+FYYLSLEG+S+G  +L I K TF LQ DG+GGLIIDSGTTIT+LE++A+N +A   +
Sbjct: 279  HPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFT 338

Query: 913  SAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVI 1083
            + + L   +S   GLD+CF  P      + P +   F  GA++ LPAENY+I DSS  V 
Sbjct: 339  AKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHF-DGADLELPAENYMIGDSSMGVA 397

Query: 1084 CLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            CLAM  S+GMSI GN+QQQN  +++D     LSF  T C  L
Sbjct: 398  CLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQCDLL 439


>ref|XP_002329464.1| predicted protein [Populus trichocarpa]
            gi|566222317|ref|XP_006370905.1| aspartyl protease family
            protein [Populus trichocarpa] gi|550316486|gb|ERP48702.1|
            aspartyl protease family protein [Populus trichocarpa]
          Length = 439

 Score =  385 bits (990), Expect = e-104
 Identities = 203/402 (50%), Positives = 254/402 (63%), Gaps = 7/402 (1%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLM 204
            R +L   D  +NLT  ER++ GV+R   RLQ+FKA  +    +       V PG GEFLM
Sbjct: 41   RAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVA-SSNSEIDAPVLPGNGEFLM 99

Query: 205  KIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCNA 384
            K+ IGTP  TY AI+DTGSDL WTQC+PC  C++Q  PI+DP KS++     C + LC A
Sbjct: 100  KLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEA 159

Query: 385  LPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDG 564
            LP+ TC +  CEYLY YGDYSST G LA+ET T     +P++ FGCG+DNEG GFS   G
Sbjct: 160  LPQSTCSDG-CEYLYGYGDYSSTQGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSG 218

Query: 565  LVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL----XXXXXXXXXXXXPLIRSTM 732
            LVG GRGPLSLVSQL   KFSYCLTSV     S L +                PLI+++ 
Sbjct: 219  LVGLGRGPLSLVSQLKEPKFSYCLTSVDDTKASTLLMGSLASVKASDSEIKTTPLIQNSA 278

Query: 733  HPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALS 912
             P+FYYLSLEG+S+G   L I K TF LQ DG+GGLIIDSGTTIT+LEQ+A++ +A   +
Sbjct: 279  QPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFT 338

Query: 913  SAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVI 1083
            S + L    S   GL++CF  P      + P +   F  GA++ LPAENY+I D+S  V 
Sbjct: 339  SQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHF-DGADLELPAENYMIADASMGVA 397

Query: 1084 CLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            CLAM  S+GMSI GNIQQQN  +++D     LSF  T C  L
Sbjct: 398  CLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDEL 439


>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 442

 Score =  382 bits (981), Expect = e-103
 Identities = 199/402 (49%), Positives = 257/402 (63%), Gaps = 7/402 (1%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLM 204
            R+ L   D  +NLT  +R++ G++R+  RL++  A  V    +     + V  G GEFLM
Sbjct: 44   RITLKHVDSDKNLTKFQRIQHGIKRANHRLERLNA-MVLAASSNAEINSPVLSGNGEFLM 102

Query: 205  KIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCNA 384
             + IGTP  TY AI+DTGSDL WTQC+PC  C++Q +PI+DP KS++     C + LC A
Sbjct: 103  NLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKA 162

Query: 385  LPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDG 564
            LP+ +C +S CEYLY YGDYSST G +ATETFT     IP + FGCG+DNEG GF+   G
Sbjct: 163  LPQSSCSDS-CEYLYTYGDYSSTQGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSG 221

Query: 565  LVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL----XXXXXXXXXXXXPLIRSTM 732
            LVG GRGPLSLVSQL   KFSYCLTS+    TS L +                PLI++ +
Sbjct: 222  LVGLGRGPLSLVSQLKEAKFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPL 281

Query: 733  HPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALS 912
             P+FYYLSLEG+S+G  +L I + TF LQ DGTGGLIIDSGTTIT+LE++A++ +    +
Sbjct: 282  QPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFT 341

Query: 913  SAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVI 1083
            S + L    S   GL+LC+N P      + P + L F  GA++ LP ENY+I DSS  VI
Sbjct: 342  SQMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVI 400

Query: 1084 CLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            CLAM  S GMSI GN+QQQN  + +D     LSF  T+CG L
Sbjct: 401  CLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNCGQL 442


>ref|XP_006828037.1| hypothetical protein AMTR_s00008p00256490 [Amborella trichopoda]
            gi|548832672|gb|ERM95453.1| hypothetical protein
            AMTR_s00008p00256490 [Amborella trichopoda]
          Length = 436

 Score =  380 bits (977), Expect = e-103
 Identities = 201/411 (48%), Positives = 256/411 (62%), Gaps = 11/411 (2%)
 Frame = +1

Query: 10   PNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGT--FQTDVTP 183
            P   +RV+L+  D   N T  +RL+R V R   RL+K ++     LD  G    +  V  
Sbjct: 27   PESGIRVDLVHVDAGLNFTALQRLQRAVTRGKLRLEKLQSKTTAALDGSGEVDIEAPVHV 86

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLMK+ IGTP  +Y AI+DTGSDL WTQC PC  C++Q  PI+DP KS+T G   C
Sbjct: 87   GNGEFLMKLAIGTPPVSYSAIVDTGSDLVWTQCLPCDKCFKQPTPIFDPAKSSTFGKLSC 146

Query: 364  GAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGG 543
             + LC ALP  TC +  CEY+Y YGDYSST G LATE FT     + ++ FGCG  N+G 
Sbjct: 147  KSDLCQALPSSTC-DPDCEYVYTYGDYSSTQGTLATELFTFGGVSVSEVGFGCGNYNQGR 205

Query: 544  GFSPSDGLVGFGRGPLSLVSQLG---TTKFSYCLTSV--SAKATSPLFL-XXXXXXXXXX 705
            GFS   GLVG GRGPLSL++QLG     KFSYCL S+  S  ATSPL L           
Sbjct: 206  GFSQGAGLVGLGRGPLSLITQLGGSVANKFSYCLKSIDDSDSATSPLLLGAEAKTTGEVI 265

Query: 706  XXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAA 885
              PL+R+    +FYY++LEG+S+G   L I   TF++++DG GG+I+DSGTTIT+LE A 
Sbjct: 266  TTPLVRNPEQFSFYYITLEGISVGGYLLPIKNTTFEMKADGNGGMIVDSGTTITYLEVAG 325

Query: 886  YNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG--FQFPDMTLSFAGGANMVLPAENYL 1059
            Y E+  A  S +K      S  GLDLCF+ P      + P +TL F GG ++ LPAENY 
Sbjct: 326  YREVRKAFLSKMKTPETDGSATGLDLCFSLPSSATEVEVPTLTLHFGGGGSLELPAENYF 385

Query: 1060 IQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            I D S+ ++CLAM+P++GMSILGN+QQQNF + YD G   LSF    C  L
Sbjct: 386  IADESTGLLCLAMMPASGMSILGNVQQQNFLVQYDLGKELLSFTSAQCDKL 436


>ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
            gi|241937749|gb|EES10894.1| hypothetical protein
            SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  380 bits (975), Expect = e-102
 Identities = 196/413 (47%), Positives = 257/413 (62%), Gaps = 17/413 (4%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKF--KAAQVTKLDAGGTFQTDVTPGEGE 195
            LRV L   D   N +  + L+R   RS  R+ +   +A  V  +  GG  Q  V  G GE
Sbjct: 40   LRVRLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGE 99

Query: 196  FLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPL 375
            FLM + IGTPA +Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T  T PC + L
Sbjct: 100  FLMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSAL 159

Query: 376  CNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQ--EIPKLTFGCGQDNEGGG 546
            C+ LP  TC + SKC Y Y YGD SST G LA+ETFTL  +  ++P + FGCG  NEG G
Sbjct: 160  CSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDG 219

Query: 547  FSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSV-SAKATSPLFL-------XXXXXXXXX 702
            F+   GLVG GRGPLSLVSQLG  KFSYCLTS+      SPL L                
Sbjct: 220  FTQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPV 279

Query: 703  XXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQA 882
               PL+++   P+FYY+SL G+++G+ ++ +P   F +Q DGTGG+I+DSGT+IT+LE  
Sbjct: 280  QTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITYLELQ 339

Query: 883  AYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLPAEN 1053
             Y  +  A  + + L  V  S++GLDLCF  P +G    Q P + L F GGA++ LPAEN
Sbjct: 340  GYRALKKAFVAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAEN 399

Query: 1054 YLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            Y++ DS S  +CL + PS G+SI+GN QQQNFQ +YD   + LSFA   C  L
Sbjct: 400  YMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAGDTLSFAPVQCNKL 452


>ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum
            lycopersicum]
          Length = 441

 Score =  379 bits (973), Expect = e-102
 Identities = 201/410 (49%), Positives = 257/410 (62%), Gaps = 9/410 (2%)
 Frame = +1

Query: 7    NPNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFK-AAQVTKLDAGGTFQTDVTP 183
            N +   R+ L   D   N T  ERL+R + R   RLQ+    A ++  D     ++ +  
Sbjct: 33   NNHKGFRLSLKHVDSGGNFTKFERLQRAMARGKSRLQRLSLVATLSSRDETNDVKSTIHA 92

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLM+I IG+P+ +Y AI+DTGSDL WTQC+PCK C++QS PI+DP+KS+T     C
Sbjct: 93   GNGEFLMQISIGSPSESYNAIMDTGSDLIWTQCKPCKECFDQSTPIFDPSKSSTFEKISC 152

Query: 364  GAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGG 543
               LC ALP  +C  S CEY+Y YGDYSS+ G+LA+ETFT     IP + FGCG DNEG 
Sbjct: 153  SNKLCEALPISSCGGSNCEYMYTYGDYSSSEGFLASETFTFGKVSIPNVAFGCGNDNEGS 212

Query: 544  GFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVS--AKATSPLFL---XXXXXXXXXXX 708
            GFS   GLVG GRGPLSLVSQL  ++FSYCLTS++  A +TS   L              
Sbjct: 213  GFSQGAGLVGLGRGPLSLVSQLHMSRFSYCLTSINEDADSTSSTLLMGSMARDDYNNIIT 272

Query: 709  XPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAY 888
             PL+++   P+FYYLSL+G+S+G  +LAI K TF L  DG+GG+IIDSGTTIT+LE++A+
Sbjct: 273  TPLVKNPTQPSFYYLSLKGISVGDTQLAIKKSTFSLNKDGSGGMIIDSGTTITYLEESAF 332

Query: 889  NEIASALSSAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLI 1062
            + +    SS V L    SS  GLDLCF  P      Q P +   F  GA+M LPAENY+I
Sbjct: 333  SLLKKEFSSQVNLAVDDSSSTGLDLCFKLPSNTNNIQVPKLIFHFE-GADMDLPAENYMI 391

Query: 1063 QDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
             DS   + CLAM  S+GMSI GN+QQQN  +I+D     LSF    C  L
Sbjct: 392  ADSRMGIACLAMGSSSGMSIFGNVQQQNMMVIHDLDKETLSFVPKQCDKL 441


>ref|XP_004975767.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Setaria italica]
          Length = 446

 Score =  377 bits (969), Expect = e-102
 Identities = 192/415 (46%), Positives = 256/415 (61%), Gaps = 19/415 (4%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKFKA--------AQVTKLDAGGTFQTDV 177
            LRV L   D   N +  + L+R   RS  R+ +  A        +    + +GG  Q  V
Sbjct: 32   LRVRLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARTTGVPIPSSSKAVASGGDLQVPV 91

Query: 178  TPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTT 357
              G GEFLM + IGTPA +Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T    
Sbjct: 92   HAGNGEFLMDLAIGTPALSYAAIVDTGSDLVWTQCKPCVECFKQSTPVFDPSSSSTYAPV 151

Query: 358  PCGAPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDN 534
            PC + LC  LP  +C + S+C Y Y YGD SST G LATETFTL+  ++P++ FGCG  N
Sbjct: 152  PCSSALCGDLPSSSCTSASRCGYTYTYGDASSTQGVLATETFTLAKSKLPEVAFGCGDTN 211

Query: 535  EGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL------XXXXXXX 696
            EG GFS   GLVG GRGPLSLV+QLG  KFSYCLTS+ A + SPL L             
Sbjct: 212  EGDGFSQGAGLVGLGRGPLSLVTQLGLDKFSYCLTSLDATSKSPLLLGSVAGISESAATA 271

Query: 697  XXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLE 876
                 PL+++   P+FYY++L G+++G+  + +P   F +Q DGTGG+I+DSGT+IT+LE
Sbjct: 272  PVQSTPLVKNPSQPSFYYVTLTGLTVGSTHITLPTSAFAIQDDGTGGVIVDSGTSITYLE 331

Query: 877  QAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPR---GFQFPDMTLSFAGGANMVLPA 1047
               Y  +  A  + + L  V  S++GLDLCF  P +   G Q P +   F GGA++ LPA
Sbjct: 332  LQGYRALKKAFVAQMSLPVVDGSEIGLDLCFRAPAKGVDGVQVPKLVFHFDGGADLDLPA 391

Query: 1048 ENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            ENY++ DS S  +CL +  S G+SI+GN QQQNFQ +YD  A+ LSFA   C  L
Sbjct: 392  ENYMVLDSASGALCLTVAASRGLSIIGNFQQQNFQFVYDVAADTLSFAPVQCDKL 446


>ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  377 bits (968), Expect = e-102
 Identities = 201/399 (50%), Positives = 249/399 (62%), Gaps = 4/399 (1%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLM 204
            RV L   D   N T  ERL+R ++R   RLQ+  A   +      + +  V  G GEFLM
Sbjct: 43   RVSLRHVDSGGNYTKFERLQRAMKRGKLRLQRLSAKTAS---FESSVEAPVHAGNGEFLM 99

Query: 205  KIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCNA 384
            K+ IGTPA TY AI+DTGSDL WTQC+PCK C++Q  PI+DP KS++    PC + LC A
Sbjct: 100  KLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAA 159

Query: 385  LPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDG 564
            LP  +C +  CEYLY YGDYSST G LATETF      + K+ FGCG+DN+G GFS   G
Sbjct: 160  LPISSCSDG-CEYLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAG 218

Query: 565  LVGFGRGPLSLVSQLGTTKFSYCLTSV-SAKATSPLFLXXXXXXXXXXXXPLIRSTMHPT 741
            LVG GRGPLSL+SQLG  KFSYCLTS+  +K  S L +            PLI++   P+
Sbjct: 219  LVGLGRGPLSLISQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPS 278

Query: 742  FYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAV 921
            FYYLSLEG+S+G   L I K TF +Q+DG+GGLIIDSGTTIT+LE +A+  +     S +
Sbjct: 279  FYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQL 338

Query: 922  KLTPVTSSQLGLDLCFNNPPRG--FQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLA 1092
            KL    S   GLDLCF  PP       P +   F  GA++ LPAENY+I DS   VICL 
Sbjct: 339  KLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFE-GADLKLPAENYIIADSGLGVICLT 397

Query: 1093 MLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            M  S+GMSI GN QQQN  +++D     +SFA   C  L
Sbjct: 398  MGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQL 436


>emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
            gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa
            Indica Group] gi|116310186|emb|CAH67198.1|
            OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  376 bits (966), Expect = e-101
 Identities = 193/414 (46%), Positives = 252/414 (60%), Gaps = 18/414 (4%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQV------TKLDAGGTFQTDVTP 183
            LRV L   D   N +  + LRR   RS  R+ +  A         +K   GG  Q  V  
Sbjct: 31   LRVHLTHVDAHGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGDLQVPVHA 90

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T  T PC
Sbjct: 91   GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPC 150

Query: 364  GAPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEG 540
             +  C+ LP   C + SKC Y Y YGD SST G LATETFTL+  ++P + FGCG  NEG
Sbjct: 151  SSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEG 210

Query: 541  GGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXXXXX 699
             GFS   GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L               
Sbjct: 211  DGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 270

Query: 700  XXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQ 879
                PLI++   P+FYY+SL+ +++G+ ++++P   F +Q DGTGG+I+DSGT+IT+LE 
Sbjct: 271  VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 330

Query: 880  AAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLPAE 1050
              Y  +  A ++ + L     S +GLDLCF  P +G    + P +   F GGA++ LPAE
Sbjct: 331  QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAE 390

Query: 1051 NYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            NY++ D  S  +CL ++ S G+SI+GN QQQNFQ +YD G + LSFA   C  L
Sbjct: 391  NYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 444


>ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
            gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa
            Japonica Group] gi|215766465|dbj|BAG98773.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|215767943|dbj|BAH00172.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 454

 Score =  376 bits (966), Expect = e-101
 Identities = 193/414 (46%), Positives = 252/414 (60%), Gaps = 18/414 (4%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQV------TKLDAGGTFQTDVTP 183
            LRV L   D   N +  + LRR   RS  R+ +  A         +K   GG  Q  V  
Sbjct: 41   LRVHLTHVDAHGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGDLQVPVHA 100

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T  T PC
Sbjct: 101  GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPC 160

Query: 364  GAPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEG 540
             +  C+ LP   C + SKC Y Y YGD SST G LATETFTL+  ++P + FGCG  NEG
Sbjct: 161  SSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEG 220

Query: 541  GGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXXXXX 699
             GFS   GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L               
Sbjct: 221  DGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 280

Query: 700  XXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQ 879
                PLI++   P+FYY+SL+ +++G+ ++++P   F +Q DGTGG+I+DSGT+IT+LE 
Sbjct: 281  VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 340

Query: 880  AAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLPAE 1050
              Y  +  A ++ + L     S +GLDLCF  P +G    + P +   F GGA++ LPAE
Sbjct: 341  QGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAE 400

Query: 1051 NYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            NY++ D  S  +CL ++ S G+SI+GN QQQNFQ +YD G + LSFA   C  L
Sbjct: 401  NYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 454


>ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum tuberosum]
          Length = 444

 Score =  373 bits (958), Expect = e-101
 Identities = 198/413 (47%), Positives = 258/413 (62%), Gaps = 12/413 (2%)
 Frame = +1

Query: 7    NPNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFKA----AQVTKLDAGGTFQTD 174
            N +   ++ L   D   N T  ERL+R + R   RLQ+       A ++  D     ++ 
Sbjct: 33   NNHKGFKLNLKHVDSGGNFTKFERLQRAMARGKSRLQRLSLVANFATLSSKDETNDVKST 92

Query: 175  VTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGT 354
            +  G GEFLM+I IG+P+ +Y AI+DTGSDL WTQC+PCK C++QS PI+DP+KS+T   
Sbjct: 93   IHAGNGEFLMQISIGSPSESYNAIMDTGSDLIWTQCKPCKECFDQSTPIFDPSKSSTFEK 152

Query: 355  TPCGAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDN 534
              C   LC ALP  +C ++ CEY+Y YGDYSS+ G+LA+ETFT     IP + FGCG DN
Sbjct: 153  ISCSNKLCEALPTSSCGDNNCEYMYTYGDYSSSEGFLASETFTFGKVSIPNVAFGCGNDN 212

Query: 535  EGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKA---TSPLFL--XXXXXXXX 699
            EG GFS   GLVG GRG LSLVSQL  ++FSYCLTS++  A   +S L +          
Sbjct: 213  EGSGFSQGAGLVGLGRGSLSLVSQLHMSRFSYCLTSINEDAYTKSSTLLMGSMAHDDYNN 272

Query: 700  XXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQ 879
                PL+++   P+FYYLSL+G+S+G  +LAI K TF L  DGTGG+IIDSGTTIT+LE+
Sbjct: 273  IITTPLVKNPTQPSFYYLSLKGISVGDTQLAIKKSTFSLNKDGTGGMIIDSGTTITYLEE 332

Query: 880  AAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAEN 1053
            +A++ +    SS V L    SS  GLDLCF  P      + P +   F  GA+M LPAEN
Sbjct: 333  SAFSLLKKEFSSQVNLPVDDSSSTGLDLCFILPSNTNNIEVPKLIFHFE-GADMDLPAEN 391

Query: 1054 YLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            Y+I DS   + CLAM  S+GMSI GN+QQQN  +I+D     LSF  T C  L
Sbjct: 392  YMIADSRMGIACLAMGSSSGMSIFGNVQQQNMMVIHDLDKETLSFVPTQCDKL 444


>gb|EXB80380.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]
          Length = 457

 Score =  370 bits (949), Expect = 1e-99
 Identities = 201/407 (49%), Positives = 256/407 (62%), Gaps = 15/407 (3%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKA-AQVTKLDAGGTFQTDVTPGEGEFL 201
            RVEL R D+ +NLT  ERL+RG++R   RLQ+  A A  +K D     +T V  G GEFL
Sbjct: 50   RVELKRVDHGKNLTKFERLQRGIKRGKHRLQRLNAMALASKTDDSSNVKTPVKAGNGEFL 109

Query: 202  MKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCN 381
            MK+ IGTP  ++ AI+DTGSDL WTQC PC +C++QS PI+DP KS++    PC + LC 
Sbjct: 110  MKLSIGTPPESFSAIMDTGSDLVWTQCLPCSNCFDQSTPIFDPKKSSSFSKLPCSSSLCE 169

Query: 382  ALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSD 561
            ALP  TC +  CEY Y YGDYSST G LA+ETF+     +  + FGCG DNEG GF+   
Sbjct: 170  ALPSSTCSDG-CEYFYGYGDYSSTEGVLASETFSFGDGSVKGIGFGCGGDNEGDGFAQGA 228

Query: 562  GLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKA-TSPLFL--------XXXXXXXXXXXXP 714
            GLVG GRGPLSLVSQL   KFSYCLTS++  + TS L +                    P
Sbjct: 229  GLVGLGRGPLSLVSQLKEPKFSYCLTSMADDSKTSSLLMGSLATKMGGKNDTSFEGKTTP 288

Query: 715  LIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNE 894
            LI++   P+FYYLSLEG+S+G   L I KGTF ++ DG+GGLIIDSGTTIT+LE   ++ 
Sbjct: 289  LIKNPSQPSFYYLSLEGISVGDRLLDIEKGTFSIKEDGSGGLIIDSGTTITYLEHKGFDV 348

Query: 895  IASALSSAVK--LTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLI 1062
            +     S +K  L+   S    +DLCFN P   +  Q P +   F  GA++ LP ENY++
Sbjct: 349  LKKEFVSQMKGILSVDNSGSQAMDLCFNLPKGTKTVQVPKLVFHFK-GADLELPPENYIL 407

Query: 1063 QDSS-AVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSC 1200
             DS   V+CLAM  S+GMSI GNIQQQN  +++D     LSF  T C
Sbjct: 408  SDSDLGVLCLAMGASSGMSIFGNIQQQNLLVVHDLENERLSFVPTQC 454


>ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  370 bits (949), Expect = 1e-99
 Identities = 198/416 (47%), Positives = 255/416 (61%), Gaps = 16/416 (3%)
 Frame = +1

Query: 10   PNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDA--GGTFQTDVTP 183
            P+   RV L   D+ +NLT  ERLRRGV R   RL +  A  +   +A  G   +  V  
Sbjct: 47   PSHGFRVRLKHVDHVKNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVA 106

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLMK+ IG+P  ++ AI+DTGSDL WTQC+PC+ C++QS PI+DP +S++     C
Sbjct: 107  GNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISC 166

Query: 364  GAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQ-----EIPKLTFGCGQ 528
             + LC ALP  TC +  CEYLY YGD SST G LA ETFT          IP L FGCG 
Sbjct: 167  SSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGN 226

Query: 529  DNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL------XXXXX 690
            DN G GFS   GLVG GRGPLSLVSQL   KF+YCLT++     S L L           
Sbjct: 227  DNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTS 286

Query: 691  XXXXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITH 870
                   PLI++   P+FYYLSL+G+S+G  +L+IPK TF+L  DG+GG+IIDSGTTIT+
Sbjct: 287  KDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITY 346

Query: 871  LEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLP 1044
            +E +A+  + +   + + L    S   GLDLCFN P      + P +T  F  GA++ LP
Sbjct: 347  VENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELP 405

Query: 1045 AENYLIQDSSA-VICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
             ENY+I DS A ++CLA+  S GMSI GN+QQQNF +++D     LSF  T C S+
Sbjct: 406  GENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 461


>ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
            nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  370 bits (949), Expect = 1e-99
 Identities = 198/416 (47%), Positives = 255/416 (61%), Gaps = 16/416 (3%)
 Frame = +1

Query: 10   PNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDA--GGTFQTDVTP 183
            P+   RV L   D+ +NLT  ERLRRGV R   RL +  A  +   +A  G   +  V  
Sbjct: 302  PSHGFRVRLKHVDHVKNLTRFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVA 361

Query: 184  GEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPC 363
            G GEFLMK+ IG+P  ++ AI+DTGSDL WTQC+PC+ C++QS PI+DP +S++     C
Sbjct: 362  GNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISC 421

Query: 364  GAPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQ-----EIPKLTFGCGQ 528
             + LC ALP  TC +  CEYLY YGD SST G LA ETFT          IP L FGCG 
Sbjct: 422  SSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGN 481

Query: 529  DNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL------XXXXX 690
            DN G GFS   GLVG GRGPLSLVSQL   KF+YCLT++     S L L           
Sbjct: 482  DNNGDGFSQGAGLVGLGRGPLSLVSQLKEQKFAYCLTAIDDSKPSSLLLGSLANITPKTS 541

Query: 691  XXXXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITH 870
                   PLI++   P+FYYLSL+G+S+G  +L+IPK TF+L  DG+GG+IIDSGTTIT+
Sbjct: 542  KDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTITY 601

Query: 871  LEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLP 1044
            +E +A+  + +   + + L    S   GLDLCFN P      + P +T  F  GA++ LP
Sbjct: 602  VENSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK-GADLELP 660

Query: 1045 AENYLIQDSSA-VICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
             ENY+I DS A ++CLA+  S GMSI GN+QQQNF +++D     LSF  T C S+
Sbjct: 661  GENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLSFLPTQCDSI 716


>gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theobroma cacao]
          Length = 441

 Score =  369 bits (948), Expect = 1e-99
 Identities = 195/402 (48%), Positives = 251/402 (62%), Gaps = 7/402 (1%)
 Frame = +1

Query: 25   RVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLM 204
            RV L   D  +NLT  ER++RGV+R   RLQ+  A  +   DA    Q  +T G GEFLM
Sbjct: 43   RVTLRHVDSGKNLTKWERIQRGVKRGNHRLQRLNAMVLAATDAS-ELQAPITAGNGEFLM 101

Query: 205  KIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCNA 384
             + IGTP  +Y AILDTGSDL WTQC+PC  C++Q  PI+DP KS++     C + LC+A
Sbjct: 102  DLAIGTPPESYSAILDTGSDLIWTQCKPCSQCFDQPTPIFDPKKSSSFSKLSCSSHLCSA 161

Query: 385  LPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDG 564
            LP+  C +  CEYLY YGDYSST G +A ETFT     +P + FGCG DN+G GF+   G
Sbjct: 162  LPQSACSDG-CEYLYTYGDYSSTQGVMAVETFTFGKVSVPNIGFGCGGDNQGDGFTQGAG 220

Query: 565  LVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL----XXXXXXXXXXXXPLIRSTM 732
            LVG GRGP+SLVSQL   KFSYCLTS+     S L +                PLI +  
Sbjct: 221  LVGLGRGPVSLVSQLKQGKFSYCLTSIDDTKKSTLLMGSIASVNRTLGAIKTTPLIHNPT 280

Query: 733  HPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALS 912
             P+FYYLSL+G+++G  +L I K TF L+ DGTGG+IIDSGTTIT+LE+ A++ +     
Sbjct: 281  QPSFYYLSLKGITVGDTRLPIKKSTFALEDDGTGGVIIDSGTTITYLEERAFDLVKKEFI 340

Query: 913  SAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSSA-VI 1083
            S +KL+  TS   GL+LCF  P      + P     F  GA++ LP ENY+I DSS+ ++
Sbjct: 341  SQMKLSVDTSGSTGLELCFTLPSGSTDVEVPKFIFHFE-GADLDLPGENYMIADSSSGLL 399

Query: 1084 CLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            CLAM  S+GMSI GN+QQQN  +++D     LSF  T C  L
Sbjct: 400  CLAMGSSSGMSIFGNVQQQNMLVLHDLEKATLSFQHTQCDKL 441


>gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  369 bits (947), Expect = 2e-99
 Identities = 198/437 (45%), Positives = 255/437 (58%), Gaps = 36/437 (8%)
 Frame = +1

Query: 7    NPNL-SLRVELLRRDYKENLTTTERLRRGVERSIERLQKF-------------KAAQVTK 144
            NP L  LRV L   D   N +  + L+R   RS  R+ +              KAA    
Sbjct: 39   NPKLRGLRVRLTHVDAHGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAGD 98

Query: 145  LDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIY 324
               G   Q  V  G GEFLM + +GTPA  Y AI+DTGSDL WTQC+PC  C+ Q+ P++
Sbjct: 99   GSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVF 158

Query: 325  DPTKSATSGTTPCGAPLCNALPEFTCPNSK--------CEYLYQYGDYSSTSGYLATETF 480
            DP  S+T    PC + LC  LP  TC +S         C Y Y YGD SST G LATETF
Sbjct: 159  DPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETF 218

Query: 481  TLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSV-SAKA 657
            TL+ Q++P + FGCG  NEG GF+   GLVG GRGPLSLVSQLG  +FSYCLTS+  A  
Sbjct: 219  TLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLTSLDDAAG 278

Query: 658  TSPLFL------XXXXXXXXXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQ 819
             SPL L                  PL+++   P+FYY+SL G+++G+ +LA+P   F +Q
Sbjct: 279  RSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQ 338

Query: 820  SDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---- 987
             DGTGG+I+DSGT+IT+LE  AY  +  A  + + L  V +S++GLDLCF  P       
Sbjct: 339  DDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGAVDQD 398

Query: 988  --FQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIY 1158
               Q P + L F GGA++ LPAENY++ DS S  +CL ++ S G+SI+GN QQQNFQ +Y
Sbjct: 399  VQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQFVY 458

Query: 1159 DTGANALSFARTSCGSL 1209
            D   + LSFA   C  L
Sbjct: 459  DVAGDTLSFAPAECNKL 475


>ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
            distachyon]
          Length = 468

 Score =  369 bits (947), Expect = 2e-99
 Identities = 192/412 (46%), Positives = 250/412 (60%), Gaps = 16/412 (3%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVT---KLDAGGTFQTDVTPGEG 192
            LRV L   D   N T  + LRR   RS  R+ +  A   T   K  A    Q  V  G G
Sbjct: 57   LRVPLTHVDAHGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAAAAPDLQVPVHAGNG 116

Query: 193  EFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAP 372
            EFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C+ QS P++DP+ S+T  T PC + 
Sbjct: 117  EFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSS 176

Query: 373  LCNALPEFTCPNSK--CEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGG 546
            LC+ LP  TC ++   C Y Y YGD SST G LA ETFTL+  ++P + FGCG  NEG G
Sbjct: 177  LCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDG 236

Query: 547  FSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXXXXXXX 705
            F+   GLVG GRGPLSLVSQLG  KFSYCLTS+   + SPL L                 
Sbjct: 237  FTQGAGLVGLGRGPLSLVSQLGLGKFSYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQ 296

Query: 706  XXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAA 885
              PLI++   P+FYY++L+ +++G+ ++ +P   F +Q DGTGG+I+DSGT+IT+LE   
Sbjct: 297  TTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQG 356

Query: 886  YNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLPAENY 1056
            Y  +  A ++ +KL     S +GLDLCF  P  G    + P + L F GGA++ LPAENY
Sbjct: 357  YRPLKKAFAAQMKLPVADGSAVGLDLCFKAPASGVDDVEVPKLVLHFDGGADLDLPAENY 416

Query: 1057 LIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
            ++ DS S  +CL ++ S G+SI+GN QQQN Q +YD   + LSFA   C  L
Sbjct: 417  MVLDSASGALCLTVMGSRGLSIIGNFQQQNIQFVYDVDKDTLSFAPVQCAKL 468


>ref|NP_565298.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis
            thaliana] gi|110736021|dbj|BAE99983.1| putative
            chloroplast nucleoid DNA binding protein [Arabidopsis
            thaliana] gi|330250580|gb|AEC05674.1| aspartyl protease
            family protein [Arabidopsis thaliana]
          Length = 461

 Score =  368 bits (944), Expect = 4e-99
 Identities = 197/422 (46%), Positives = 255/422 (60%), Gaps = 22/422 (5%)
 Frame = +1

Query: 10   PNLSLRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQV----TKLDAGGTFQTDV 177
            P    R+ L   D  +NLT  ++++RG+ R   RL +  A  V    +K D     +   
Sbjct: 41   PRSGFRLSLRHVDSGKNLTKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIKAPT 100

Query: 178  TPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTT 357
              G GEFLM++ IG PA  Y AI+DTGSDL WTQC+PC  C++Q  PI+DP KS++    
Sbjct: 101  HGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKV 160

Query: 358  PCGAPLCNALPEFTCPNSK--CEYLYQYGDYSSTSGYLATETFTLSSQ-EIPKLTFGCGQ 528
             C + LCNALP   C   K  CEYLY YGDYSST G LATETFT   +  I  + FGCG 
Sbjct: 161  GCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGV 220

Query: 529  DNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSV-SAKATSPLFL---------- 675
            +NEG GFS   GLVG GRGPLSL+SQL  TKFSYCLTS+  ++A+S LF+          
Sbjct: 221  ENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNK 280

Query: 676  -XXXXXXXXXXXXPLIRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDS 852
                          L+R+   P+FYYL L+G+++GA +L++ K TF+L  DGTGG+IIDS
Sbjct: 281  TGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDS 340

Query: 853  GTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP--PRGFQFPDMTLSFAGG 1026
            GTTIT+LE+ A+  +    +S + L    S   GLDLCF  P   +    P M   F  G
Sbjct: 341  GTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLDLCFKLPDAAKNIAVPKMIFHFK-G 399

Query: 1027 ANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCG 1203
            A++ LP ENY++ DSS  V+CLAM  SNGMSI GN+QQQNF +++D     +SF  T CG
Sbjct: 400  ADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECG 459

Query: 1204 SL 1209
             L
Sbjct: 460  KL 461


>gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  365 bits (938), Expect = 2e-98
 Identities = 188/408 (46%), Positives = 246/408 (60%), Gaps = 12/408 (2%)
 Frame = +1

Query: 22   LRVELLRRDYKENLTTTERLRRGVERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFL 201
            LRV L   D   N +  + LRR   RS  R+ +                  V  G GEFL
Sbjct: 31   LRVHLTHVDAHGNYSRHQLLRRAARRSHHRMSRL---------------VPVHAGNGEFL 75

Query: 202  MKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGTTPCGAPLCN 381
            M + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T  T PC +  C+
Sbjct: 76   MDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCS 135

Query: 382  ALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPS 558
             LP   C + SKC Y Y YGD SST G LATETFTL+  ++P + FGCG  NEG GFS  
Sbjct: 136  DLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDTNEGDGFSQG 195

Query: 559  DGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXXXXXXXXXPL 717
             GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L                   PL
Sbjct: 196  AGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPL 255

Query: 718  IRSTMHPTFYYLSLEGVSIGALKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEI 897
            I++   P+FYY+SL+ +++G+ ++++P   F +Q DGTGG+I+DSGT+IT+LE   Y  +
Sbjct: 256  IKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRAL 315

Query: 898  ASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLPAENYLIQD 1068
              A ++ + L     S +GLDLCF  P +G    + P +   F GGA++ LPAENY++ D
Sbjct: 316  KKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYMVLD 375

Query: 1069 -SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGSL 1209
              S  +CL ++ S G+SI+GN QQQNFQ +YD G + LSFA   C  L
Sbjct: 376  GGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423


Top