BLASTX nr result

ID: Ephedra28_contig00001252 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00001252
         (1363 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|ABK25480.1| unknown [Picea sitchensis]                             409   e-111
ref|XP_002300215.2| aspartyl protease family protein [Populus tr...   373   e-100
ref|XP_002329464.1| predicted protein [Populus trichocarpa] gi|5...   369   2e-99
ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor,...   368   4e-99
ref|XP_004975767.1| PREDICTED: aspartic proteinase nepenthesin-1...   367   5e-99
ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1...   367   8e-99
ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [S...   365   2e-98
ref|XP_006828037.1| hypothetical protein AMTR_s00008p00256490 [A...   363   7e-98
ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1...   363   1e-97
ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1...   362   2e-97
emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group] g...   362   3e-97
ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group] g...   362   3e-97
gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]        358   3e-96
gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indi...   355   3e-95
ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1...   352   2e-94
dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgar...   352   3e-94
ref|XP_006466172.1| PREDICTED: aspartic proteinase nepenthesin-1...   351   4e-94
gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theo...   351   5e-94
gb|EMT14245.1| Aspartic proteinase nepenthesin-1 [Aegilops tausc...   350   1e-93
ref|NP_565298.2| aspartyl protease family protein [Arabidopsis t...   350   1e-93

>gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  409 bits (1050), Expect = e-111
 Identities = 211/382 (55%), Positives = 263/382 (68%), Gaps = 10/382 (2%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +RS +RL+K + +    +D     +  V  G GEFLMK+ IGTP+ ++ AILDTGSDLTW
Sbjct: 85   KRSQDRLEKLQMS----VDEVKAVEAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTW 140

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PC  CY Q  PIYDP++S+T  + PC + +C ALP ++C  + CEYLY YGD SST
Sbjct: 141  TQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMCQALPMYSCSGANCEYLYSYGDQSST 200

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTT---KF 831
             G L+ E+FTL+SQ +P + FGCGQ+NEGGGFS   GLVGFGRGPLSL+SQLG +   KF
Sbjct: 201  QGILSYESFTLTSQSLPHIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKF 260

Query: 830  SYCLTSV--SAKATSPLFL--XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLA 663
            SYCL S+  S   TSPLF+             TPL++S   PTFYYLSL+G+S+GG  L 
Sbjct: 261  SYCLVSITDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLD 320

Query: 662  IPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFN 483
            I  GTFDLQ DGTGG+IIDSGTT+T+LEQ+ Y+ +  A+ S++ L  V  S +GLDLCF 
Sbjct: 321  IADGTFDLQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDLCF- 379

Query: 482  NPPRG---FQFPDMTLSFAGGANMVLPAENYLIQDSSAVICLAMLPSNGMSILGNIQQQN 312
             P  G     FP +T  F  GA+  LP ENY+  DSS + CLAMLPSNGMSI GNIQQQN
Sbjct: 380  EPQSGSSTSHFPTITFHFE-GADFNLPKENYIYTDSSGIACLAMLPSNGMSIFGNIQQQN 438

Query: 311  FQIIYDTGANALSFARTSCGGL 246
            +QI+YD   N LSFA T C  L
Sbjct: 439  YQILYDNERNVLSFAPTVCDTL 460


>ref|XP_002300215.2| aspartyl protease family protein [Populus trichocarpa]
            gi|550348628|gb|EEE85020.2| aspartyl protease family
            protein [Populus trichocarpa]
          Length = 439

 Score =  373 bits (957), Expect = e-100
 Identities = 194/376 (51%), Positives = 242/376 (64%), Gaps = 7/376 (1%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +R   RLQ+ +A  +    +    +  V PG GEFLMK+ IGTP  TY AILDTGSDL W
Sbjct: 64   KRGRNRLQRLQAMALVA-SSSSEIEAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIW 122

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PC  C+ QS PI+DP KS++  +  C + LC ALP+ +C N+ CEYLY YGDYSST
Sbjct: 123  TQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLCEALPQSSC-NNGCEYLYSYGDYSST 181

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYC 822
             G LA+ET T     +P + FGCG DNEG GFS   GLVG GRGPLSLVSQL   KFSYC
Sbjct: 182  QGILASETLTFGKASVPHVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLKEPKFSYC 241

Query: 821  LTSVSAKATSPLFL----XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPK 654
            LT+V    TS L +               TPLI S  HP+FYYLSL+G+S+G  +L I K
Sbjct: 242  LTTVDDTKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKK 301

Query: 653  GTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP- 477
             TF LQ DG+GGLIIDSGTTIT+LE++A+N +A   ++ + L   +S   GLD+CF  P 
Sbjct: 302  STFSLQDDGSGGLIIDSGTTITYLEESAFNLVAKEFTAKINLPVDSSGSTGLDVCFTLPS 361

Query: 476  -PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQNFQI 303
                 + P +   F  GA++ LPAENY+I DSS  V CLAM  S+GMSI GN+QQQN  +
Sbjct: 362  GSTNIEVPKLVFHF-DGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLV 420

Query: 302  IYDTGANALSFARTSC 255
            ++D     LSF  T C
Sbjct: 421  LHDLEKETLSFLPTQC 436


>ref|XP_002329464.1| predicted protein [Populus trichocarpa]
            gi|566222317|ref|XP_006370905.1| aspartyl protease family
            protein [Populus trichocarpa] gi|550316486|gb|ERP48702.1|
            aspartyl protease family protein [Populus trichocarpa]
          Length = 439

 Score =  369 bits (946), Expect = 2e-99
 Identities = 193/379 (50%), Positives = 242/379 (63%), Gaps = 7/379 (1%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +R   RLQ+FKA  +    +       V PG GEFLMK+ IGTP  TY AI+DTGSDL W
Sbjct: 64   KRGRHRLQRFKAMALVA-SSNSEIDAPVLPGNGEFLMKLAIGTPPETYSAIMDTGSDLIW 122

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PC  C++Q  PI+DP KS++  +  C + LC ALP+ TC +  CEYLY YGDYSST
Sbjct: 123  TQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLCEALPQSTCSDG-CEYLYGYGDYSST 181

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYC 822
             G LA+ET T     +P++ FGCG+DNEG GFS   GLVG GRGPLSLVSQL   KFSYC
Sbjct: 182  QGMLASETLTFGKVSVPEVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEPKFSYC 241

Query: 821  LTSVSAKATSPLFL----XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPK 654
            LTSV     S L +               TPLI+++  P+FYYLSL+G+S+G   L I K
Sbjct: 242  LTSVDDTKASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKK 301

Query: 653  GTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP- 477
             TF LQ DG+GGLIIDSGTTIT+LEQ+A++ +A   +S + L    S   GL++CF  P 
Sbjct: 302  STFSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINLPVDNSGSTGLEVCFTLPS 361

Query: 476  -PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQNFQI 303
                 + P +   F  GA++ LPAENY+I D+S  V CLAM  S+GMSI GNIQQQN  +
Sbjct: 362  GSTDIEVPKLVFHF-DGADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLV 420

Query: 302  IYDTGANALSFARTSCGGL 246
            ++D     LSF  T C  L
Sbjct: 421  LHDLEKETLSFLPTQCDEL 439


>ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223537841|gb|EEF39457.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 442

 Score =  368 bits (944), Expect = 4e-99
 Identities = 192/379 (50%), Positives = 246/379 (64%), Gaps = 7/379 (1%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +R+  RL++  A  V    +     + V  G GEFLM + IGTP  TY AI+DTGSDL W
Sbjct: 67   KRANHRLERLNA-MVLAASSNAEINSPVLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIW 125

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PC  C++Q +PI+DP KS++  +  C + LC ALP+ +C +S CEYLY YGDYSST
Sbjct: 126  TQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLCKALPQSSCSDS-CEYLYTYGDYSST 184

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYC 822
             G +ATETFT     IP + FGCG+DNEG GF+   GLVG GRGPLSLVSQL   KFSYC
Sbjct: 185  QGTMATETFTFGKVSIPNVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLKEAKFSYC 244

Query: 821  LTSVSAKATSPLFL----XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPK 654
            LTS+    TS L +               TPLI++ + P+FYYLSL+G+S+GG +L I +
Sbjct: 245  LTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKE 304

Query: 653  GTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP- 477
             TF LQ DGTGGLIIDSGTTIT+LE++A++ +    +S + L    S   GL+LC+N P 
Sbjct: 305  STFQLQDDGTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGLPVDNSGATGLELCYNLPS 364

Query: 476  -PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQNFQI 303
                 + P + L F  GA++ LP ENY+I DSS  VICLAM  S GMSI GN+QQQN  +
Sbjct: 365  DTSELEVPKLVLHFT-GADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFV 423

Query: 302  IYDTGANALSFARTSCGGL 246
             +D     LSF  T+CG L
Sbjct: 424  SHDLEKETLSFLPTNCGQL 442


>ref|XP_004975767.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Setaria italica]
          Length = 446

 Score =  367 bits (943), Expect = 5e-99
 Identities = 181/364 (49%), Positives = 237/364 (65%), Gaps = 11/364 (3%)
 Frame = -3

Query: 1304 AGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDP 1125
            +GG  Q  V  G GEFLM + IGTPA +Y AI+DTGSDL WTQC+PC  C++QS P++DP
Sbjct: 83   SGGDLQVPVHAGNGEFLMDLAIGTPALSYAAIVDTGSDLVWTQCKPCVECFKQSTPVFDP 142

Query: 1124 TKSATSGRTPCGTPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPK 948
            + S+T    PC + LC  LP  +C + S+C Y Y YGD SST G LATETFTL+  ++P+
Sbjct: 143  SSSSTYAPVPCSSALCGDLPSSSCTSASRCGYTYTYGDASSTQGVLATETFTLAKSKLPE 202

Query: 947  LTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL---- 780
            + FGCG  NEG GFS   GLVG GRGPLSLV+QLG  KFSYCLTS+ A + SPL L    
Sbjct: 203  VAFGCGDTNEGDGFSQGAGLVGLGRGPLSLVTQLGLDKFSYCLTSLDATSKSPLLLGSVA 262

Query: 779  --XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTGGLIID 606
                         TPL+++   P+FYY++L G+++G   + +P   F +Q DGTGG+I+D
Sbjct: 263  GISESAATAPVQSTPLVKNPSQPSFYYVTLTGLTVGSTHITLPTSAFAIQDDGTGGVIVD 322

Query: 605  SGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPR---GFQFPDMTLSFA 435
            SGT+IT+LE   Y  +  A  + + L  V  S++GLDLCF  P +   G Q P +   F 
Sbjct: 323  SGTSITYLELQGYRALKKAFVAQMSLPVVDGSEIGLDLCFRAPAKGVDGVQVPKLVFHFD 382

Query: 434  GGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTS 258
            GGA++ LPAENY++ DS S  +CL +  S G+SI+GN QQQNFQ +YD  A+ LSFA   
Sbjct: 383  GGADLDLPAENYMVLDSASGALCLTVAASRGLSIIGNFQQQNFQFVYDVAADTLSFAPVQ 442

Query: 257  CGGL 246
            C  L
Sbjct: 443  CDKL 446


>ref|XP_004239638.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum
            lycopersicum]
          Length = 441

 Score =  367 bits (941), Expect = 8e-99
 Identities = 192/380 (50%), Positives = 245/380 (64%), Gaps = 9/380 (2%)
 Frame = -3

Query: 1358 RSIERLQKFK-AAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            R   RLQ+    A ++  D     ++ +  G GEFLM+I IG+P+ +Y AI+DTGSDL W
Sbjct: 63   RGKSRLQRLSLVATLSSRDETNDVKSTIHAGNGEFLMQISIGSPSESYNAIMDTGSDLIW 122

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PCK C++QS PI+DP+KS+T  +  C   LC ALP  +C  S CEY+Y YGDYSS+
Sbjct: 123  TQCKPCKECFDQSTPIFDPSKSSTFEKISCSNKLCEALPISSCGGSNCEYMYTYGDYSSS 182

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYC 822
             G+LA+ETFT     IP + FGCG DNEG GFS   GLVG GRGPLSLVSQL  ++FSYC
Sbjct: 183  EGFLASETFTFGKVSIPNVAFGCGNDNEGSGFSQGAGLVGLGRGPLSLVSQLHMSRFSYC 242

Query: 821  LTSVS--AKATSPLFL---XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIP 657
            LTS++  A +TS   L              TPL+++   P+FYYLSL+G+S+G  +LAI 
Sbjct: 243  LTSINEDADSTSSTLLMGSMARDDYNNIITTPLVKNPTQPSFYYLSLKGISVGDTQLAIK 302

Query: 656  KGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP 477
            K TF L  DG+GG+IIDSGTTIT+LE++A++ +    SS V L    SS  GLDLCF  P
Sbjct: 303  KSTFSLNKDGSGGMIIDSGTTITYLEESAFSLLKKEFSSQVNLAVDDSSSTGLDLCFKLP 362

Query: 476  --PRGFQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQ 306
                  Q P +   F  GA+M LPAENY+I DS   + CLAM  S+GMSI GN+QQQN  
Sbjct: 363  SNTNNIQVPKLIFHFE-GADMDLPAENYMIADSRMGIACLAMGSSSGMSIFGNVQQQNMM 421

Query: 305  IIYDTGANALSFARTSCGGL 246
            +I+D     LSF    C  L
Sbjct: 422  VIHDLDKETLSFVPKQCDKL 441


>ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
            gi|241937749|gb|EES10894.1| hypothetical protein
            SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  365 bits (938), Expect = 2e-98
 Identities = 188/388 (48%), Positives = 245/388 (63%), Gaps = 17/388 (4%)
 Frame = -3

Query: 1358 RSIERLQKF--KAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLT 1185
            RS  R+ +   +A  V  +  GG  Q  V  G GEFLM + IGTPA +Y AI+DTGSDL 
Sbjct: 65   RSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLV 124

Query: 1184 WTQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPN-SKCEYLYQYGDYS 1008
            WTQC+PC  C++QS P++DP+ S+T    PC + LC+ LP  TC + SKC Y Y YGD S
Sbjct: 125  WTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALCSDLPTSTCTSASKCGYTYTYGDAS 184

Query: 1007 STSGYLATETFTLSSQ--EIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTK 834
            ST G LA+ETFTL  +  ++P + FGCG  NEG GF+   GLVG GRGPLSLVSQLG  K
Sbjct: 185  STQGVLASETFTLGKEKKKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDK 244

Query: 833  FSYCLTSV-SAKATSPLFL-------XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIG 678
            FSYCLTS+      SPL L                  TPL+++   P+FYY+SL G+++G
Sbjct: 245  FSYCLTSLDDGDGKSPLLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVG 304

Query: 677  GLKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGL 498
              ++ +P   F +Q DGTGG+I+DSGT+IT+LE   Y  +  A  + + L  V  S++GL
Sbjct: 305  STRITLPASAFAIQDDGTGGVIVDSGTSITYLELQGYRALKKAFVAQMALPTVDGSEIGL 364

Query: 497  DLCFNNPPRG---FQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILG 330
            DLCF  P +G    Q P + L F GGA++ LPAENY++ DS S  +CL + PS G+SI+G
Sbjct: 365  DLCFQGPAKGVDEVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIG 424

Query: 329  NIQQQNFQIIYDTGANALSFARTSCGGL 246
            N QQQNFQ +YD   + LSFA   C  L
Sbjct: 425  NFQQQNFQFVYDVAGDTLSFAPVQCNKL 452


>ref|XP_006828037.1| hypothetical protein AMTR_s00008p00256490 [Amborella trichopoda]
            gi|548832672|gb|ERM95453.1| hypothetical protein
            AMTR_s00008p00256490 [Amborella trichopoda]
          Length = 436

 Score =  363 bits (933), Expect = 7e-98
 Identities = 190/378 (50%), Positives = 242/378 (64%), Gaps = 11/378 (2%)
 Frame = -3

Query: 1346 RLQKFKAAQVTKLDAGGT--FQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQC 1173
            RL+K ++     LD  G    +  V  G GEFLMK+ IGTP  +Y AI+DTGSDL WTQC
Sbjct: 60   RLEKLQSKTTAALDGSGEVDIEAPVHVGNGEFLMKLAIGTPPVSYSAIVDTGSDLVWTQC 119

Query: 1172 QPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSSTSGY 993
             PC  C++Q  PI+DP KS+T G+  C + LC ALP  TC +  CEY+Y YGDYSST G 
Sbjct: 120  LPCDKCFKQPTPIFDPAKSSTFGKLSCKSDLCQALPSSTC-DPDCEYVYTYGDYSSTQGT 178

Query: 992  LATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLG---TTKFSYC 822
            LATE FT     + ++ FGCG  N+G GFS   GLVG GRGPLSL++QLG     KFSYC
Sbjct: 179  LATELFTFGGVSVSEVGFGCGNYNQGRGFSQGAGLVGLGRGPLSLITQLGGSVANKFSYC 238

Query: 821  LTSV--SAKATSPLFL-XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKG 651
            L S+  S  ATSPL L            TPL+R+    +FYY++L+G+S+GG  L I   
Sbjct: 239  LKSIDDSDSATSPLLLGAEAKTTGEVITTPLVRNPEQFSFYYITLEGISVGGYLLPIKNT 298

Query: 650  TFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPR 471
            TF++++DG GG+I+DSGTTIT+LE A Y E+  A  S +K      S  GLDLCF+ P  
Sbjct: 299  TFEMKADGNGGMIVDSGTTITYLEVAGYREVRKAFLSKMKTPETDGSATGLDLCFSLPSS 358

Query: 470  G--FQFPDMTLSFAGGANMVLPAENYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQII 300
                + P +TL F GG ++ LPAENY I D S+ ++CLAM+P++GMSILGN+QQQNF + 
Sbjct: 359  ATEVEVPTLTLHFGGGGSLELPAENYFIADESTGLLCLAMMPASGMSILGNVQQQNFLVQ 418

Query: 299  YDTGANALSFARTSCGGL 246
            YD G   LSF    C  L
Sbjct: 419  YDLGKELLSFTSAQCDKL 436


>ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  363 bits (931), Expect = 1e-97
 Identities = 190/371 (51%), Positives = 237/371 (63%), Gaps = 4/371 (1%)
 Frame = -3

Query: 1346 RLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQP 1167
            RLQ+  A   +      + +  V  G GEFLMK+ IGTPA TY AI+DTGSDL WTQC+P
Sbjct: 71   RLQRLSAKTAS---FESSVEAPVHAGNGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKP 127

Query: 1166 CKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLA 987
            CK C++Q  PI+DP KS++  + PC + LC ALP  +C +  CEYLY YGDYSST G LA
Sbjct: 128  CKDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSDG-CEYLYSYGDYSSTQGVLA 186

Query: 986  TETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSV- 810
            TETF      + K+ FGCG+DN+G GFS   GLVG GRGPLSL+SQLG  KFSYCLTS+ 
Sbjct: 187  TETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLGEPKFSYCLTSMD 246

Query: 809  SAKATSPLFLXXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSD 630
             +K  S L +           TPLI++   P+FYYLSL+G+S+G   L I K TF +Q+D
Sbjct: 247  DSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQND 306

Query: 629  GTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG--FQFP 456
            G+GGLIIDSGTTIT+LE +A+  +     S +KL    S   GLDLCF  PP       P
Sbjct: 307  GSGGLIIDSGTTITYLEDSAFAALKKEFISQLKLDVDESGSTGLDLCFTLPPDASTVDVP 366

Query: 455  DMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQNFQIIYDTGANA 279
             +   F  GA++ LPAENY+I DS   VICL M  S+GMSI GN QQQN  +++D     
Sbjct: 367  QLVFHFE-GADLKLPAENYIIADSGLGVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKET 425

Query: 278  LSFARTSCGGL 246
            +SFA   C  L
Sbjct: 426  ISFAPAQCNQL 436


>ref|XP_006345762.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Solanum tuberosum]
          Length = 444

 Score =  362 bits (929), Expect = 2e-97
 Identities = 190/383 (49%), Positives = 246/383 (64%), Gaps = 12/383 (3%)
 Frame = -3

Query: 1358 RSIERLQKFKA----AQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSD 1191
            R   RLQ+       A ++  D     ++ +  G GEFLM+I IG+P+ +Y AI+DTGSD
Sbjct: 63   RGKSRLQRLSLVANFATLSSKDETNDVKSTIHAGNGEFLMQISIGSPSESYNAIMDTGSD 122

Query: 1190 LTWTQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDY 1011
            L WTQC+PCK C++QS PI+DP+KS+T  +  C   LC ALP  +C ++ CEY+Y YGDY
Sbjct: 123  LIWTQCKPCKECFDQSTPIFDPSKSSTFEKISCSNKLCEALPTSSCGDNNCEYMYTYGDY 182

Query: 1010 SSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKF 831
            SS+ G+LA+ETFT     IP + FGCG DNEG GFS   GLVG GRG LSLVSQL  ++F
Sbjct: 183  SSSEGFLASETFTFGKVSIPNVAFGCGNDNEGSGFSQGAGLVGLGRGSLSLVSQLHMSRF 242

Query: 830  SYCLTSVSAKA---TSPLFL--XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKL 666
            SYCLTS++  A   +S L +             TPL+++   P+FYYLSL+G+S+G  +L
Sbjct: 243  SYCLTSINEDAYTKSSTLLMGSMAHDDYNNIITTPLVKNPTQPSFYYLSLKGISVGDTQL 302

Query: 665  AIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCF 486
            AI K TF L  DGTGG+IIDSGTTIT+LE++A++ +    SS V L    SS  GLDLCF
Sbjct: 303  AIKKSTFSLNKDGTGGMIIDSGTTITYLEESAFSLLKKEFSSQVNLPVDDSSSTGLDLCF 362

Query: 485  NNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQQ 315
              P      + P +   F  GA+M LPAENY+I DS   + CLAM  S+GMSI GN+QQQ
Sbjct: 363  ILPSNTNNIEVPKLIFHFE-GADMDLPAENYMIADSRMGIACLAMGSSSGMSIFGNVQQQ 421

Query: 314  NFQIIYDTGANALSFARTSCGGL 246
            N  +I+D     LSF  T C  L
Sbjct: 422  NMMVIHDLDKETLSFVPTQCDKL 444


>emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
            gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa
            Indica Group] gi|116310186|emb|CAH67198.1|
            OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  362 bits (928), Expect = 3e-97
 Identities = 180/369 (48%), Positives = 234/369 (63%), Gaps = 12/369 (3%)
 Frame = -3

Query: 1316 TKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAP 1137
            +K   GG  Q  V  G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P
Sbjct: 76   SKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP 135

Query: 1136 IYDPTKSATSGRTPCGTPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQ 960
            ++DP+ S+T    PC +  C+ LP   C + SKC Y Y YGD SST G LATETFTL+  
Sbjct: 136  VFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS 195

Query: 959  EIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL 780
            ++P + FGCG  NEG GFS   GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L
Sbjct: 196  KLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLL 255

Query: 779  -------XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTG 621
                              TPLI++   P+FYY+SL+ +++G  ++++P   F +Q DGTG
Sbjct: 256  GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 315

Query: 620  GLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDM 450
            G+I+DSGT+IT+LE   Y  +  A ++ + L     S +GLDLCF  P +G    + P +
Sbjct: 316  GVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 375

Query: 449  TLSFAGGANMVLPAENYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALS 273
               F GGA++ LPAENY++ D  S  +CL ++ S G+SI+GN QQQNFQ +YD G + LS
Sbjct: 376  VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLS 435

Query: 272  FARTSCGGL 246
            FA   C  L
Sbjct: 436  FAPVQCNKL 444


>ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
            gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa
            Japonica Group] gi|215766465|dbj|BAG98773.1| unnamed
            protein product [Oryza sativa Japonica Group]
            gi|215767943|dbj|BAH00172.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 454

 Score =  362 bits (928), Expect = 3e-97
 Identities = 180/369 (48%), Positives = 234/369 (63%), Gaps = 12/369 (3%)
 Frame = -3

Query: 1316 TKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAP 1137
            +K   GG  Q  V  G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P
Sbjct: 86   SKAAGGGDLQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP 145

Query: 1136 IYDPTKSATSGRTPCGTPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQ 960
            ++DP+ S+T    PC +  C+ LP   C + SKC Y Y YGD SST G LATETFTL+  
Sbjct: 146  VFDPSSSSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKS 205

Query: 959  EIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL 780
            ++P + FGCG  NEG GFS   GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L
Sbjct: 206  KLPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLL 265

Query: 779  -------XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTG 621
                              TPLI++   P+FYY+SL+ +++G  ++++P   F +Q DGTG
Sbjct: 266  GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 325

Query: 620  GLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDM 450
            G+I+DSGT+IT+LE   Y  +  A ++ + L     S +GLDLCF  P +G    + P +
Sbjct: 326  GVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 385

Query: 449  TLSFAGGANMVLPAENYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALS 273
               F GGA++ LPAENY++ D  S  +CL ++ S G+SI+GN QQQNFQ +YD G + LS
Sbjct: 386  VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLS 445

Query: 272  FARTSCGGL 246
            FA   C  L
Sbjct: 446  FAPVQCNKL 454


>gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  358 bits (919), Expect = 3e-96
 Identities = 185/384 (48%), Positives = 236/384 (61%), Gaps = 22/384 (5%)
 Frame = -3

Query: 1331 KAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCY 1152
            KAA       G   Q  V  G GEFLM + +GTPA  Y AI+DTGSDL WTQC+PC  C+
Sbjct: 92   KAAAAGDGSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECF 151

Query: 1151 EQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSK--------CEYLYQYGDYSSTSG 996
             Q+ P++DP  S+T    PC + LC  LP  TC +S         C Y Y YGD SST G
Sbjct: 152  NQTTPVFDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQG 211

Query: 995  YLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLT 816
             LATETFTL+ Q++P + FGCG  NEG GF+   GLVG GRGPLSLVSQLG  +FSYCLT
Sbjct: 212  VLATETFTLARQKVPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDRFSYCLT 271

Query: 815  SV-SAKATSPLFL------XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIP 657
            S+  A   SPL L                 TPL+++   P+FYY+SL G+++G  +LA+P
Sbjct: 272  SLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALP 331

Query: 656  KGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP 477
               F +Q DGTGG+I+DSGT+IT+LE  AY  +  A  + + L  V +S++GLDLCF  P
Sbjct: 332  SSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGP 391

Query: 476  PRG------FQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQ 318
                      Q P + L F GGA++ LPAENY++ DS S  +CL ++ S G+SI+GN QQ
Sbjct: 392  AGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQ 451

Query: 317  QNFQIIYDTGANALSFARTSCGGL 246
            QNFQ +YD   + LSFA   C  L
Sbjct: 452  QNFQFVYDVAGDTLSFAPAECNKL 475


>gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  355 bits (910), Expect = 3e-95
 Identities = 176/357 (49%), Positives = 229/357 (64%), Gaps = 12/357 (3%)
 Frame = -3

Query: 1280 VTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGR 1101
            V  G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C++QS P++DP+ S+T   
Sbjct: 67   VHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYAT 126

Query: 1100 TPCGTPLCNALPEFTCPN-SKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQD 924
             PC +  C+ LP   C + SKC Y Y YGD SST G LATETFTL+  ++P + FGCG  
Sbjct: 127  VPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSKLPGVVFGCGDT 186

Query: 923  NEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXX 765
            NEG GFS   GLVG GRGPLSLVSQLG  KFSYCLTS+     SPL L            
Sbjct: 187  NEGDGFSQGAGLVGLGRGPLSLVSQLGLDKFSYCLTSLDDTNNSPLLLGSLAGISEASAA 246

Query: 764  XXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTGGLIIDSGTTITH 585
                  TPLI++   P+FYY+SL+ +++G  ++++P   F +Q DGTGG+I+DSGT+IT+
Sbjct: 247  ASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITY 306

Query: 584  LEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVL 414
            LE   Y  +  A ++ + L     S +GLDLCF  P +G    + P +   F GGA++ L
Sbjct: 307  LEVQGYRALKKAFAAQMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDL 366

Query: 413  PAENYLIQD-SSAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGGL 246
            PAENY++ D  S  +CL ++ S G+SI+GN QQQNFQ +YD G + LSFA   C  L
Sbjct: 367  PAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNKL 423


>ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
            distachyon]
          Length = 468

 Score =  352 bits (903), Expect = 2e-94
 Identities = 182/387 (47%), Positives = 238/387 (61%), Gaps = 16/387 (4%)
 Frame = -3

Query: 1358 RSIERLQKFKAAQVT---KLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDL 1188
            RS  R+ +  A   T   K  A    Q  V  G GEFLM + IGTPA  Y AI+DTGSDL
Sbjct: 82   RSHHRMSRLVARTATGSVKAAAAPDLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDL 141

Query: 1187 TWTQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSK--CEYLYQYGD 1014
             WTQC+PC  C+ QS P++DP+ S+T    PC + LC+ LP  TC ++   C Y Y YGD
Sbjct: 142  VWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGD 201

Query: 1013 YSSTSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTK 834
             SST G LA ETFTL+  ++P + FGCG  NEG GF+   GLVG GRGPLSLVSQLG  K
Sbjct: 202  ASSTQGVLAAETFTLAKTKLPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGK 261

Query: 833  FSYCLTSVSAKATSPLFL-------XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGG 675
            FSYCLTS+   + SPL L                  TPLI++   P+FYY++L+ +++G 
Sbjct: 262  FSYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGS 321

Query: 674  LKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLD 495
             ++ +P   F +Q DGTGG+I+DSGT+IT+LE   Y  +  A ++ +KL     S +GLD
Sbjct: 322  TRIPLPGSAFAVQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSAVGLD 381

Query: 494  LCFNNPPRG---FQFPDMTLSFAGGANMVLPAENYLIQDS-SAVICLAMLPSNGMSILGN 327
            LCF  P  G    + P + L F GGA++ LPAENY++ DS S  +CL ++ S G+SI+GN
Sbjct: 382  LCFKAPASGVDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLSIIGN 441

Query: 326  IQQQNFQIIYDTGANALSFARTSCGGL 246
             QQQN Q +YD   + LSFA   C  L
Sbjct: 442  FQQQNIQFVYDVDKDTLSFAPVQCAKL 468


>dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
            gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum
            vulgare subsp. vulgare]
          Length = 449

 Score =  352 bits (902), Expect = 3e-94
 Identities = 175/359 (48%), Positives = 226/359 (62%), Gaps = 11/359 (3%)
 Frame = -3

Query: 1289 QTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSAT 1110
            Q  V  G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C+ QS P++DP+ S+T
Sbjct: 92   QVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSST 151

Query: 1109 SGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCG 930
                PC + LC+ LP   C ++KC Y Y YGD SST G LA ETFTL+  ++P + FGCG
Sbjct: 152  YAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCG 211

Query: 929  QDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXX 771
              NEG GF+   GLVG GRGPLSLVSQLG  KFSYCLTS+   + SPL L          
Sbjct: 212  DTNEGDGFTQGAGLVGLGRGPLSLVSQLGLNKFSYCLTSLDDTSKSPLLLGSLATISESA 271

Query: 770  XXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTGGLIIDSGTTI 591
                    TPLIR+   P+FYY++L+G+++G   + +P   F +Q DGTGG+I+DSGT+I
Sbjct: 272  AAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSI 331

Query: 590  THLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANM 420
            T+LE   Y  +  A ++ +KL     S +GLD CF  P  G    + P +      GA++
Sbjct: 332  TYLELQGYRALKKAFAAQMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLVFHL-DGADL 390

Query: 419  VLPAENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGGL 246
             LPAENY++ DS S  +CL ++ S G+SI+GN QQQN Q +YD G N LSFA   C  L
Sbjct: 391  DLPAENYMVLDSGSGALCLTVMGSRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQCAKL 449


>ref|XP_006466172.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis]
          Length = 438

 Score =  351 bits (901), Expect = 4e-94
 Identities = 184/382 (48%), Positives = 240/382 (62%), Gaps = 10/382 (2%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +R   RLQ+F A  +   D     ++ V  G GE+LM + IG+PA ++ AILDTGSDL W
Sbjct: 58   KRGQHRLQRFNAMSLAASDTASDLKSSVHAGTGEYLMDLSIGSPAVSFSAILDTGSDLIW 117

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTC-PNSKCEYLYQYGDYSS 1005
            TQC+PC+ C++Q+ PI+DP +S++  + PC + LC ALP+  C  N+ CEY+Y YGD SS
Sbjct: 118  TQCKPCQVCFDQATPIFDPKESSSYSKIPCSSALCKALPQQECNANNACEYIYSYGDTSS 177

Query: 1004 TSGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSY 825
            + G LATETFT     +P + FGCG DNEG GFS   GLVG GRGPLSLVSQL   KFSY
Sbjct: 178  SQGVLATETFTFGDVSVPNIGFGCGSDNEGDGFSQGAGLVGLGRGPLSLVSQLKEPKFSY 237

Query: 824  CLTSVSAKATSPLFL-----XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAI 660
            CLTS+ A  TS L +                TPLI+S +  +FYYL L+G+S+GG +L I
Sbjct: 238  CLTSIDAAKTSTLLMGSLASANSSSSDQILTTPLIKSPLQASFYYLPLEGISVGGTRLPI 297

Query: 659  PKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPV-TSSQLGLDLCFN 483
                F LQ DG+GGLIIDSGTT+T+L  +A++ +     S  KL+    + Q GLD+CF 
Sbjct: 298  DASNFALQEDGSGGLIIDSGTTLTYLIDSAFDLVKKEFISQTKLSVTDAADQTGLDVCFK 357

Query: 482  NP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNGMSILGNIQQQN 312
             P      + P +   F  GA++ LP ENY+I DSS  + CLAM  S+GMSI GN+QQQN
Sbjct: 358  LPSGSTDVEVPKLVFHFK-GADVDLPPENYMIADSSMGLACLAMGSSSGMSIFGNVQQQN 416

Query: 311  FQIIYDTGANALSFARTSCGGL 246
              ++YD     LSF  T C  L
Sbjct: 417  MLVLYDLAKETLSFIPTQCDKL 438


>gb|EOX92742.1| Eukaryotic aspartyl protease family protein [Theobroma cacao]
          Length = 441

 Score =  351 bits (900), Expect = 5e-94
 Identities = 184/379 (48%), Positives = 238/379 (62%), Gaps = 7/379 (1%)
 Frame = -3

Query: 1361 ERSIERLQKFKAAQVTKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSDLTW 1182
            +R   RLQ+  A  +   DA    Q  +T G GEFLM + IGTP  +Y AILDTGSDL W
Sbjct: 66   KRGNHRLQRLNAMVLAATDAS-ELQAPITAGNGEFLMDLAIGTPPESYSAILDTGSDLIW 124

Query: 1181 TQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSKCEYLYQYGDYSST 1002
            TQC+PC  C++Q  PI+DP KS++  +  C + LC+ALP+  C +  CEYLY YGDYSST
Sbjct: 125  TQCKPCSQCFDQPTPIFDPKKSSSFSKLSCSSHLCSALPQSACSDG-CEYLYTYGDYSST 183

Query: 1001 SGYLATETFTLSSQEIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYC 822
             G +A ETFT     +P + FGCG DN+G GF+   GLVG GRGP+SLVSQL   KFSYC
Sbjct: 184  QGVMAVETFTFGKVSVPNIGFGCGGDNQGDGFTQGAGLVGLGRGPVSLVSQLKQGKFSYC 243

Query: 821  LTSVSAKATSPLFL----XXXXXXXXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPK 654
            LTS+     S L +               TPLI +   P+FYYLSL+G+++G  +L I K
Sbjct: 244  LTSIDDTKKSTLLMGSIASVNRTLGAIKTTPLIHNPTQPSFYYLSLKGITVGDTRLPIKK 303

Query: 653  GTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNP- 477
             TF L+ DGTGG+IIDSGTTIT+LE+ A++ +     S +KL+  TS   GL+LCF  P 
Sbjct: 304  STFALEDDGTGGVIIDSGTTITYLEERAFDLVKKEFISQMKLSVDTSGSTGLELCFTLPS 363

Query: 476  -PRGFQFPDMTLSFAGGANMVLPAENYLIQDSSA-VICLAMLPSNGMSILGNIQQQNFQI 303
                 + P     F  GA++ LP ENY+I DSS+ ++CLAM  S+GMSI GN+QQQN  +
Sbjct: 364  GSTDVEVPKFIFHFE-GADLDLPGENYMIADSSSGLLCLAMGSSSGMSIFGNVQQQNMLV 422

Query: 302  IYDTGANALSFARTSCGGL 246
            ++D     LSF  T C  L
Sbjct: 423  LHDLEKATLSFQHTQCDKL 441


>gb|EMT14245.1| Aspartic proteinase nepenthesin-1 [Aegilops tauschii]
          Length = 499

 Score =  350 bits (897), Expect = 1e-93
 Identities = 172/356 (48%), Positives = 226/356 (63%), Gaps = 11/356 (3%)
 Frame = -3

Query: 1280 VTPGEGEFLMKIGIGTPASTYEAILDTGSDLTWTQCQPCKSCYEQSAPIYDPTKSATSGR 1101
            V  G GEFLM + IGTPA  Y AI+DTGSDL WTQC+PC  C+ QS P++DP+ S+T   
Sbjct: 145  VHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAA 204

Query: 1100 TPCGTPLCNALPEFTCPNSKCEYLYQYGDYSSTSGYLATETFTLSSQEIPKLTFGCGQDN 921
             PC +  C+ LP   C ++KC Y Y YGD SST G LA ETFTL+  ++P + FGCG  N
Sbjct: 205  LPCSSSFCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTKLPDVAFGCGDTN 264

Query: 920  EGGGFSPSDGLVGFGRGPLSLVSQLGTTKFSYCLTSVSAKATSPLFL-------XXXXXX 762
            EG GF+   GLVG GRGPLSLVSQLG  KFSYCLTS+   + SPL L             
Sbjct: 265  EGDGFTQGAGLVGLGRGPLSLVSQLGLKKFSYCLTSLDDTSKSPLLLGSLASISESAAAA 324

Query: 761  XXXXXTPLIRSTMHPTFYYLSLQGVSIGGLKLAIPKGTFDLQSDGTGGLIIDSGTTITHL 582
                 TPLI++   P+FYY++L+G+++G   + +P   F +Q DGTGG+I+DSGT+IT+L
Sbjct: 325  SSVQTTPLIKNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVDSGTSITYL 384

Query: 581  EQAAYNEIASALSSAVKLTPVTSSQLGLDLCFNNPPRG---FQFPDMTLSFAGGANMVLP 411
            E   Y  +  A ++ +KL     S +GLD+CF  P  G    + P +   F  GA++ LP
Sbjct: 385  ELQGYRALKKAFAAQMKLPAADGSGIGLDMCFEAPASGVDQVEVPKLVFHF-NGADLDLP 443

Query: 410  AENYLIQDS-SAVICLAMLPSNGMSILGNIQQQNFQIIYDTGANALSFARTSCGGL 246
            AENY++ DS S  +C+ ++ S G+SI+GN QQQN Q +YD G N LSFA   C  L
Sbjct: 444  AENYMVLDSGSGALCVTVMGSRGLSIIGNFQQQNIQFVYDVGENTLSFAPVQCAKL 499


>ref|NP_565298.2| aspartyl protease family protein [Arabidopsis thaliana]
            gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis
            thaliana] gi|110736021|dbj|BAE99983.1| putative
            chloroplast nucleoid DNA binding protein [Arabidopsis
            thaliana] gi|330250580|gb|AEC05674.1| aspartyl protease
            family protein [Arabidopsis thaliana]
          Length = 461

 Score =  350 bits (897), Expect = 1e-93
 Identities = 188/393 (47%), Positives = 239/393 (60%), Gaps = 22/393 (5%)
 Frame = -3

Query: 1358 RSIERLQKFKAAQV----TKLDAGGTFQTDVTPGEGEFLMKIGIGTPASTYEAILDTGSD 1191
            R   RL +  A  V    +K D     +     G GEFLM++ IG PA  Y AI+DTGSD
Sbjct: 70   RGFHRLNRLGAVAVLAVASKPDDTNNIKAPTHGGSGEFLMELSIGNPAVKYSAIVDTGSD 129

Query: 1190 LTWTQCQPCKSCYEQSAPIYDPTKSATSGRTPCGTPLCNALPEFTCPNSK--CEYLYQYG 1017
            L WTQC+PC  C++Q  PI+DP KS++  +  C + LCNALP   C   K  CEYLY YG
Sbjct: 130  LIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYG 189

Query: 1016 DYSSTSGYLATETFTLSSQ-EIPKLTFGCGQDNEGGGFSPSDGLVGFGRGPLSLVSQLGT 840
            DYSST G LATETFT   +  I  + FGCG +NEG GFS   GLVG GRGPLSL+SQL  
Sbjct: 190  DYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKE 249

Query: 839  TKFSYCLTSV-SAKATSPLFL-----------XXXXXXXXXXXTPLIRSTMHPTFYYLSL 696
            TKFSYCLTS+  ++A+S LF+                        L+R+   P+FYYL L
Sbjct: 250  TKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLEL 309

Query: 695  QGVSIGGLKLAIPKGTFDLQSDGTGGLIIDSGTTITHLEQAAYNEIASALSSAVKLTPVT 516
            QG+++G  +L++ K TF+L  DGTGG+IIDSGTTIT+LE+ A+  +    +S + L    
Sbjct: 310  QGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDD 369

Query: 515  SSQLGLDLCFNNP--PRGFQFPDMTLSFAGGANMVLPAENYLIQDSS-AVICLAMLPSNG 345
            S   GLDLCF  P   +    P M   F  GA++ LP ENY++ DSS  V+CLAM  SNG
Sbjct: 370  SGSTGLDLCFKLPDAAKNIAVPKMIFHFK-GADLELPGENYMVADSSTGVLCLAMGSSNG 428

Query: 344  MSILGNIQQQNFQIIYDTGANALSFARTSCGGL 246
            MSI GN+QQQNF +++D     +SF  T CG L
Sbjct: 429  MSIFGNVQQQNFNVLHDLEKETVSFVPTECGKL 461

