BLASTX nr result

ID: Forsythia21_contig00023039 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00023039
         (1446 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011070840.1| PREDICTED: aspartic proteinase nepenthesin-1...   642   0.0  
ref|XP_012855375.1| PREDICTED: aspartic proteinase nepenthesin-2...   617   e-174
gb|EYU22624.1| hypothetical protein MIMGU_mgv1a025299mg [Erythra...   615   e-173
gb|EPS60725.1| hypothetical protein M569_14077, partial [Genlise...   549   e-153
emb|CDP00568.1| unnamed protein product [Coffea canephora]            547   e-153
ref|XP_006362527.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...   541   e-151
ref|XP_009767089.1| PREDICTED: aspartic proteinase nepenthesin-1...   538   e-150
ref|XP_009601064.1| PREDICTED: aspartic proteinase CDR1 [Nicotia...   538   e-150
ref|XP_004238970.1| PREDICTED: aspartic proteinase CDR1 [Solanum...   536   e-149
ref|XP_011021582.1| PREDICTED: aspartic proteinase nepenthesin-1...   531   e-148
ref|XP_009356142.1| PREDICTED: aspartic proteinase nepenthesin-1...   526   e-146
ref|XP_002311432.2| hypothetical protein POPTR_0008s11480g [Popu...   526   e-146
ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1...   526   e-146
ref|XP_007227595.1| hypothetical protein PRUPE_ppa017015mg [Prun...   525   e-146
ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit,...   523   e-145
ref|XP_008339708.1| PREDICTED: LOW QUALITY PROTEIN: aspartic pro...   522   e-145
ref|XP_012082020.1| PREDICTED: aspartic proteinase nepenthesin-1...   522   e-145
ref|XP_010265280.1| PREDICTED: aspartic proteinase nepenthesin-1...   518   e-144
ref|XP_010262754.1| PREDICTED: aspartic proteinase nepenthesin-1...   515   e-143
ref|NP_189198.1| aspartyl protease family protein [Arabidopsis t...   514   e-143

>ref|XP_011070840.1| PREDICTED: aspartic proteinase nepenthesin-1 [Sesamum indicum]
          Length = 456

 Score =  642 bits (1656), Expect = 0.0
 Identities = 318/407 (78%), Positives = 344/407 (84%), Gaps = 4/407 (0%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQ-RHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGS 1070
            SEALSAD+ R+S L      H H KLP+ SAASSGSGQYLVSLHLGTPPQRLLLVADTGS
Sbjct: 50   SEALSADNHRLSTLFSVLRKHPHPKLPVNSAASSGSGQYLVSLHLGTPPQRLLLVADTGS 109

Query: 1069 DLTWVSCSACRRHCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCR 890
            DLTWVSCSACR HC  RATFFPR S TFSPYHCFDS+C LVPHP+K   CN TRLH+ CR
Sbjct: 110  DLTWVSCSACRSHCSPRATFFPRRSATFSPYHCFDSACTLVPHPKKAPHCNRTRLHTPCR 169

Query: 889  YKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVM 710
            Y+YSYSDGS+T+GFFS ETT FNSS+ KL++FQ LSFGCGF NSGPSVSGPSFNGANGV+
Sbjct: 170  YEYSYSDGSITNGFFSRETTTFNSSAGKLLKFQRLSFGCGFWNSGPSVSGPSFNGANGVL 229

Query: 709  GLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGA---HKLSYTPLL 539
            GLGRGPISFSSQLGR FGH FSYCLMDY+LSPPPTSYLLIGG    GA    KLSYTPLL
Sbjct: 230  GLGRGPISFSSQLGRVFGHKFSYCLMDYSLSPPPTSYLLIGGGLGNGAVRKAKLSYTPLL 289

Query: 538  INPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSIL 359
            IN  SPTFYYIGIE+VFI + KLRI+PSVWAIDELGNGGTV+DSGTTLTFL  PAYR IL
Sbjct: 290  INSLSPTFYYIGIESVFIEDKKLRISPSVWAIDELGNGGTVLDSGTTLTFLAEPAYRRIL 349

Query: 358  AKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADG 179
            A   RLVKLPKS DP LGFDLC NVS   + SLP+LSF+L GG+LF+PPP NYFID A+G
Sbjct: 350  AVFQRLVKLPKSSDPNLGFDLCLNVSGLSRTSLPQLSFRLSGGALFSPPPQNYFIDTAEG 409

Query: 178  VKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            VKCLALQPV   SGFSVIGNLMQQGYTFEF KDR RLGF+R GC+VP
Sbjct: 410  VKCLALQPVVSESGFSVIGNLMQQGYTFEFDKDRSRLGFTRHGCAVP 456


>ref|XP_012855375.1| PREDICTED: aspartic proteinase nepenthesin-2 [Erythranthe guttatus]
          Length = 461

 Score =  617 bits (1590), Expect = e-174
 Identities = 319/460 (69%), Positives = 360/460 (78%), Gaps = 15/460 (3%)
 Frame = -3

Query: 1372 LMAF----ISMSSCLLIFSLLIALPDLSSAXXXXXXXXXXXXXXXPS---EALSADSRRV 1214
            LMAF    +S+    L F L + +P  S+A               P    E+LSAD+RR+
Sbjct: 2    LMAFNSSHLSVFFTFLFFLLAVVIPHSSAAADHYLKLPLLHKNHYPPTSPESLSADNRRL 61

Query: 1213 SALHQRHG--HFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSAC 1040
            S L    G    H +LP+ SAAS GSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSAC
Sbjct: 62   STLLSAIGGKRSHAQLPLHSAASFGSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSAC 121

Query: 1039 RRHCPSRA--TFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCRYKYSYSDG 866
            R +C  RA  +FFPR S TFSP+HC+  +C L+PHP+K   CNHTRLHSTCRY+YSYSDG
Sbjct: 122  RSNCTPRAAVSFFPRQSATFSPHHCYSPACTLIPHPKKAPHCNHTRLHSTCRYEYSYSDG 181

Query: 865  SVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVMGLGRGPIS 686
            SVTSGFFSHETTAFN+S+ KL++F+ LSFGCGF+NSGPSVSGPSFNGANGVMGLGRGPIS
Sbjct: 182  SVTSGFFSHETTAFNTSAGKLLKFRPLSFGCGFSNSGPSVSGPSFNGANGVMGLGRGPIS 241

Query: 685  FSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAH-KLSYTPLLINPFSPTFYY 509
            FSSQLGR+FGH FSYCLMDYTLSPPPTSYLLIGG     A  KLSYTPLL NP SPTFYY
Sbjct: 242  FSSQLGRQFGHKFSYCLMDYTLSPPPTSYLLIGGGGSAAAKPKLSYTPLLQNPLSPTFYY 301

Query: 508  IGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILAKVDRLVKLP 329
            IGIENV +N+ KL I+PSVWAIDE GNGGTVVDSGTTLTFL  PAY+ ILA  +RLVKLP
Sbjct: 302  IGIENVIVNDTKLPISPSVWAIDESGNGGTVVDSGTTLTFLAEPAYKKILAVFERLVKLP 361

Query: 328  KSVDPTLGFDLCFNVSS---DLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADGVKCLALQ 158
               +P  GFDLC NVS+       SLP+LSF+L GGS+F+PPP NYFID A+ VKCLALQ
Sbjct: 362  TLSEPIPGFDLCLNVSAGGGSPGTSLPQLSFQLAGGSVFSPPPRNYFIDAAEDVKCLALQ 421

Query: 157  PVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            PVA ++GFSVIGNLMQQGYTFEF KDR RLGF+R GC+VP
Sbjct: 422  PVASAAGFSVIGNLMQQGYTFEFDKDRARLGFTRRGCAVP 461


>gb|EYU22624.1| hypothetical protein MIMGU_mgv1a025299mg [Erythranthe guttata]
          Length = 457

 Score =  615 bits (1586), Expect = e-173
 Identities = 307/410 (74%), Positives = 343/410 (83%), Gaps = 8/410 (1%)
 Frame = -3

Query: 1243 EALSADSRRVSALHQRHG--HFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGS 1070
            E+LSAD+RR+S L    G    H +LP+ SAAS GSGQYLVSLHLGTPPQRLLLVADTGS
Sbjct: 48   ESLSADNRRLSTLLSAIGGKRSHAQLPLHSAASFGSGQYLVSLHLGTPPQRLLLVADTGS 107

Query: 1069 DLTWVSCSACRRHCPSRA--TFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHST 896
            DLTWVSCSACR +C  RA  +FFPR S TFSP+HC+  +C L+PHP+K   CNHTRLHST
Sbjct: 108  DLTWVSCSACRSNCTPRAAVSFFPRQSATFSPHHCYSPACTLIPHPKKAPHCNHTRLHST 167

Query: 895  CRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANG 716
            CRY+YSYSDGSVTSGFFSHETTAFN+S+ KL++F+ LSFGCGF+NSGPSVSGPSFNGANG
Sbjct: 168  CRYEYSYSDGSVTSGFFSHETTAFNTSAGKLLKFRPLSFGCGFSNSGPSVSGPSFNGANG 227

Query: 715  VMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAH-KLSYTPLL 539
            VMGLGRGPISFSSQLGR+FGH FSYCLMDYTLSPPPTSYLLIGG     A  KLSYTPLL
Sbjct: 228  VMGLGRGPISFSSQLGRQFGHKFSYCLMDYTLSPPPTSYLLIGGGGSAAAKPKLSYTPLL 287

Query: 538  INPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSIL 359
             NP SPTFYYIGIENV +N+ KL I+PSVWAIDE GNGGTVVDSGTTLTFL  PAY+ IL
Sbjct: 288  QNPLSPTFYYIGIENVIVNDTKLPISPSVWAIDESGNGGTVVDSGTTLTFLAEPAYKKIL 347

Query: 358  AKVDRLVKLPKSVDPTLGFDLCFNVSS---DLKPSLPRLSFKLRGGSLFAPPPLNYFIDV 188
            A  +RLVKLP   +P  GFDLC NVS+       SLP+LSF+L GGS+F+PPP NYFID 
Sbjct: 348  AVFERLVKLPTLSEPIPGFDLCLNVSAGGGSPGTSLPQLSFQLAGGSVFSPPPRNYFIDA 407

Query: 187  ADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            A+ VKCLALQPVA ++GFSVIGNLMQQGYTFEF KDR RLGF+R GC+VP
Sbjct: 408  AEDVKCLALQPVASAAGFSVIGNLMQQGYTFEFDKDRARLGFTRRGCAVP 457


>gb|EPS60725.1| hypothetical protein M569_14077, partial [Genlisea aurea]
          Length = 432

 Score =  549 bits (1415), Expect = e-153
 Identities = 269/406 (66%), Positives = 318/406 (78%), Gaps = 3/406 (0%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSD 1067
            SEAL+AD+RR+S L +R    H +LP+ SAASSGSGQYLV+LHLG+PPQRL LVADTGSD
Sbjct: 34   SEALAADNRRLSDLSKRS---HPRLPVISAASSGSGQYLVTLHLGSPPQRLFLVADTGSD 90

Query: 1066 LTWVSCSACRRHCPSRAT--FFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTC 893
            LTWVSCSAC R C  RA   FFPR S++FSPYHCFDS C +VP P++   CNHTRLHS C
Sbjct: 91   LTWVSCSACSRQCSGRAAAGFFPRRSSSFSPYHCFDSECSVVPRPKQAARCNHTRLHSAC 150

Query: 892  RYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGV 713
            RY+YSYSDGSVT GFFSHET  FN+S+ KL +F HLSFGCGF+N    + GP+ NG NGV
Sbjct: 151  RYEYSYSDGSVTRGFFSHETMEFNTSAGKLERFSHLSFGCGFSN----IPGPNLNGPNGV 206

Query: 712  MGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGG-SQYGGAHKLSYTPLLI 536
            +GLGRGPISF +Q+G+ FGH FSYCL DYTLSPPPTSYLLIGG S      +LSYT LL 
Sbjct: 207  LGLGRGPISFFTQMGQVFGHKFSYCLKDYTLSPPPTSYLLIGGGSSVVTEQRLSYTKLLT 266

Query: 535  NPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILA 356
            NP SPTFYY+ I+ V +N VKL I+PSVW+IDELGNGGTV+DSGTTLT+L  PAYR ILA
Sbjct: 267  NPLSPTFYYVKIDGVIVNGVKLPISPSVWSIDELGNGGTVLDSGTTLTYLAPPAYREILA 326

Query: 355  KVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADGV 176
               RLV+ P S   + GFD C N +S    +LPRLSF+L GGS ++PPP NYFID  +GV
Sbjct: 327  AFQRLVEPPGSARRSSGFDFCLNTTSGSGATLPRLSFELDGGSDYSPPPRNYFIDTPEGV 386

Query: 175  KCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             CLA++PV  ++GFSVIGNLMQQG+TFEF +D  R+G++RSGC  P
Sbjct: 387  TCLAVRPVTSAAGFSVIGNLMQQGFTFEFDRDLGRVGYTRSGCGAP 432


>emb|CDP00568.1| unnamed protein product [Coffea canephora]
          Length = 476

 Score =  547 bits (1409), Expect = e-153
 Identities = 276/418 (66%), Positives = 322/418 (77%), Gaps = 16/418 (3%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHTK-------LPITSAASSGSGQYLVSLHLGTPPQRLLL 1088
            SE L +D+ R+++LH  H H H K       LP+TS AS G+GQY VSL LGTPPQ  LL
Sbjct: 62   SEVLLSDTHRLNSLHH-HRHLHRKNSTSTAHLPLTSGASFGAGQYFVSLSLGTPPQPFLL 120

Query: 1087 VADTGSDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCN 917
            VADTGSDL WV+CSACR +C SR   + F  RHSTTFSP HC+DS C+LVPHP +  PCN
Sbjct: 121  VADTGSDLIWVTCSACR-NCSSRPPNSAFLARHSTTFSPSHCYDSVCQLVPHPHR-VPCN 178

Query: 916  HTRLHSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGP 737
            HTR HSTCRY+YSYSDGS++SG FS ETT FN+SS K+V+F+ L+FGCGF  SGPSV+GP
Sbjct: 179  HTRRHSTCRYEYSYSDGSLSSGIFSRETTTFNTSSGKVVKFRDLAFGCGFRASGPSVTGP 238

Query: 736  SFNGANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQ------Y 575
            SFNGA GV+GLG GPISF SQLGR+FG+ FSYCLMDYTLSP PTSYLLIGG         
Sbjct: 239  SFNGAQGVLGLGLGPISFPSQLGRKFGNKFSYCLMDYTLSPTPTSYLLIGGGGGPEDGVV 298

Query: 574  GGAHKLSYTPLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTL 395
            GGA K+SYTPL+ N  SPTFYYIGIE  ++  ++LRI+PSVWAID+LGNGGTV+DSGTTL
Sbjct: 299  GGA-KMSYTPLINNSLSPTFYYIGIEAAYVGGIELRISPSVWAIDDLGNGGTVMDSGTTL 357

Query: 394  TFLTLPAYRSILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAP 215
            TFL  PAY  +L +  R VKLPKS      FD C NVS   +PSLPRL FKL GGS+F+P
Sbjct: 358  TFLVKPAYDKVLQEFMRRVKLPKSDRRNPNFDFCVNVSGVSRPSLPRLRFKLAGGSMFSP 417

Query: 214  PPLNYFIDVADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSV 41
            PP NYFID A+ VKCLALQPV   SGFS+IGN+MQQG+ FEF +DR RLGF+R GC+V
Sbjct: 418  PPQNYFIDTAENVKCLALQPVVQPSGFSLIGNVMQQGFMFEFDRDRWRLGFTRRGCAV 475


>ref|XP_006362527.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Solanum
            tuberosum]
          Length = 454

 Score =  541 bits (1393), Expect = e-151
 Identities = 268/411 (65%), Positives = 319/411 (77%), Gaps = 8/411 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHT----KLPITSAASSGSGQYLVSLHLGTPPQRLLLVAD 1079
            S++LS+D RR++ L+   GH  T    KLP+TS A++GSGQY V L LGTPPQRLLLVAD
Sbjct: 46   SQSLSSDIRRLNTLYSSLGHRSTTRSAKLPVTSGATTGSGQYFVDLRLGTPPQRLLLVAD 105

Query: 1078 TGSDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTR 908
            TGSDL WVSCSACR +C SR   + F  RHS+T+ PYHC+D  C+LVP+P     CNHTR
Sbjct: 106  TGSDLVWVSCSACR-NCSSRPPNSAFLARHSSTYFPYHCYDKKCRLVPNPTG-VACNHTR 163

Query: 907  LHSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFN 728
            LHS CRY+YSYSDGS T GFFS ETT  N+SS + V+F++L+FGC F  +GPS++GPSFN
Sbjct: 164  LHSPCRYEYSYSDGSETKGFFSTETTTLNASSGRPVKFRNLAFGCSFEATGPSIAGPSFN 223

Query: 727  GANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQ-YGGAHKLSY 551
            GA GVMGLGRG IS SSQLGR FG+ FSYCLMDYTLSP PTSYLLIG S       K++Y
Sbjct: 224  GAQGVMGLGRGSISLSSQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSTAVNDPKKMNY 283

Query: 550  TPLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAY 371
            TP++ NPFS TFYYIGIE+V I +VKL I PSVWAIDELGNGGTV+DSGTTLTFL  PAY
Sbjct: 284  TPMISNPFSSTFYYIGIESVHIEDVKLPIRPSVWAIDELGNGGTVMDSGTTLTFLAEPAY 343

Query: 370  RSILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFID 191
            R I+    RLV LP++ +PT+GFDLC NVS + +PS P++SFKL G S+ +PP  NYFID
Sbjct: 344  RRIVQAFKRLVTLPEADEPTVGFDLCVNVSGESRPSFPKMSFKLSGNSILSPPSGNYFID 403

Query: 190  VADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             A+ VKCLALQP+   SGFSVIGNLMQQG+ FEF +D+ R+GFSR GC  P
Sbjct: 404  TAENVKCLALQPLTTPSGFSVIGNLMQQGFMFEFDRDQSRIGFSRHGCGKP 454


>ref|XP_009767089.1| PREDICTED: aspartic proteinase nepenthesin-1 [Nicotiana sylvestris]
          Length = 448

 Score =  538 bits (1387), Expect = e-150
 Identities = 267/409 (65%), Positives = 317/409 (77%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFH---TKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADT 1076
            S++LS+D RR++ L+    H      KLP+TS ASSGSGQY V L LGTPPQRLLLVADT
Sbjct: 42   SQSLSSDLRRLNTLYSSLNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADT 101

Query: 1075 GSDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRL 905
            GSDL WV+CSACR +C SR   + F  RHS+T+ P+HC+D  C+LVP+PR    CN TR 
Sbjct: 102  GSDLVWVTCSACR-NCSSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRG-VACNLTRQ 159

Query: 904  HSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNG 725
            HS CRY YSYSD S T GFFS ETT  N+SS   V+F+  +FGC F  +GPS++GPSFNG
Sbjct: 160  HSPCRYVYSYSDESETRGFFSTETTTLNASSGSAVKFKKFAFGCSFEATGPSITGPSFNG 219

Query: 724  ANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAHKLSYTP 545
            A GVMGLGRG IS +SQLGR FG+ FSYCLMDYTLSP PTSYLLIG S      K+SYTP
Sbjct: 220  AQGVMGLGRGSISLASQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSAQVNDSKMSYTP 279

Query: 544  LLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRS 365
            ++ NPF+ TFYYIGIE+V+I +VKL+INPSVWAIDELGNGGTV+DSGTTLTFL  PAYR 
Sbjct: 280  MINNPFTSTFYYIGIESVYIEHVKLQINPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRR 339

Query: 364  ILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVA 185
            I+ +  RLV+LP+  DPTLGFDLC NVS   +PS P++SFKL G S+ +PPP NYFID A
Sbjct: 340  IVREFKRLVRLPEVNDPTLGFDLCVNVSGVSRPSFPKMSFKLSGDSVLSPPPGNYFIDTA 399

Query: 184  DGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            + VKCLALQP+A  SGFSVIGNLMQQG+ FEF +DR R+GF+R GC +P
Sbjct: 400  EDVKCLALQPLAAPSGFSVIGNLMQQGFVFEFDRDRSRIGFTRHGCGLP 448


>ref|XP_009601064.1| PREDICTED: aspartic proteinase CDR1 [Nicotiana tomentosiformis]
          Length = 453

 Score =  538 bits (1386), Expect = e-150
 Identities = 268/409 (65%), Positives = 315/409 (77%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFH---TKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADT 1076
            S++LS+D RR++ L+    H      KLP+TS ASSGSGQY V L LGTPPQRLLLVADT
Sbjct: 47   SQSLSSDLRRINTLYSSVNHRSIRSAKLPLTSGASSGSGQYFVDLKLGTPPQRLLLVADT 106

Query: 1075 GSDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRL 905
            GSDL WV+CSACR +C SR   + F  RHS+T+ P+HC+D  C+LVP+PR    CNHTR 
Sbjct: 107  GSDLVWVTCSACR-NCSSRRRGSAFLARHSSTYFPFHCYDKKCRLVPNPRG-VACNHTRQ 164

Query: 904  HSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNG 725
            HS CRY YSYSD S T GFFS ETT  N+SS   V+F+   FGC F  SGPS++GPSFNG
Sbjct: 165  HSPCRYVYSYSDESETRGFFSTETTTLNASSGSAVKFKKFVFGCSFEASGPSITGPSFNG 224

Query: 724  ANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAHKLSYTP 545
            A GVMGLGRG IS +SQLGR FG+ FSYCLMDYTLSP PTSYLLIG S      K+SYTP
Sbjct: 225  AQGVMGLGRGSISLASQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSAEVNDSKMSYTP 284

Query: 544  LLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRS 365
            ++ NPF+ TFYYIGIE+V+I  VKL+I+PSVWAIDELGNGGTV+DSGTTLTFL  PAYR 
Sbjct: 285  MINNPFTSTFYYIGIESVYIEGVKLQISPSVWAIDELGNGGTVMDSGTTLTFLAEPAYRR 344

Query: 364  ILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVA 185
            I+ +  RLV+LP+  DPTL FD C NVSS  KPS P++SFKLRG S+ +P P NYFID A
Sbjct: 345  IVKEFKRLVRLPEVDDPTLEFDFCVNVSSVSKPSFPKMSFKLRGDSVLSPTPGNYFIDTA 404

Query: 184  DGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            + VKCLALQP+A  SGFSVIGNLMQQG+ FEF +DR R+GF+R GC +P
Sbjct: 405  EDVKCLALQPLAAPSGFSVIGNLMQQGFVFEFDRDRSRIGFTRHGCGLP 453


>ref|XP_004238970.1| PREDICTED: aspartic proteinase CDR1 [Solanum lycopersicum]
          Length = 453

 Score =  536 bits (1381), Expect = e-149
 Identities = 265/411 (64%), Positives = 317/411 (77%), Gaps = 8/411 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFH----TKLPITSAASSGSGQYLVSLHLGTPPQRLLLVAD 1079
            S++LS+D  R++ L+   GH       KLP+TS A++GSGQY V L LGTPPQRLLLVAD
Sbjct: 45   SQSLSSDIHRLNTLYSSLGHRSITRSAKLPLTSGATTGSGQYFVDLRLGTPPQRLLLVAD 104

Query: 1078 TGSDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTR 908
            TGSDL WVSCSACR +C SR   + F  RHS+T+ PYHC+D  C+LVP+P     CNHTR
Sbjct: 105  TGSDLVWVSCSACR-NCSSRPRNSAFLARHSSTYLPYHCYDKKCRLVPNPTG-VACNHTR 162

Query: 907  LHSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFN 728
            LHS CRY+YSYSDGS T GFFS ETT  N+SS + V+F++L+FGC F  SGPS++GPSFN
Sbjct: 163  LHSPCRYEYSYSDGSETKGFFSTETTTLNASSGRPVKFRNLAFGCSFEASGPSIAGPSFN 222

Query: 727  GANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQ-YGGAHKLSY 551
            GA GVMGLGRG IS +SQLGR FG+ FSYCLMDYTLSP PTSYLLIG S       K++Y
Sbjct: 223  GAQGVMGLGRGSISLASQLGRRFGNKFSYCLMDYTLSPTPTSYLLIGRSTAVNDPKKMNY 282

Query: 550  TPLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAY 371
            TP++ NPF+ TFYYIGIE+V+I +VKL I PSVW IDELGNGGTV+DSGTTLTFL  PAY
Sbjct: 283  TPMISNPFTSTFYYIGIESVYIEDVKLPIRPSVWEIDELGNGGTVMDSGTTLTFLAEPAY 342

Query: 370  RSILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFID 191
            R I+    RLV LP++ +PT+GFDLC NVS + +PS P++SFKL G S+ +PP  NYFID
Sbjct: 343  RRIVQAFKRLVTLPEADEPTVGFDLCVNVSGESRPSFPKMSFKLSGNSILSPPSGNYFID 402

Query: 190  VADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             A+ VKCLALQP+   SGFSVIGNLMQQG+ FEF +DR R+GFSR GC  P
Sbjct: 403  TAEDVKCLALQPLTAPSGFSVIGNLMQQGFMFEFDRDRSRIGFSRHGCGKP 453


>ref|XP_011021582.1| PREDICTED: aspartic proteinase nepenthesin-1 [Populus euphratica]
          Length = 486

 Score =  531 bits (1367), Expect = e-148
 Identities = 274/457 (59%), Positives = 334/457 (73%), Gaps = 19/457 (4%)
 Frame = -3

Query: 1354 MSSCLLIFSLLIALPDLSSAXXXXXXXXXXXXXXXPS--EALSADSRRVSALH------Q 1199
            +S  LL   LL+A  DLS++               P+  ++LS+D +R+S LH      Q
Sbjct: 30   VSLSLLFHLLLLAFVDLSTSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQ 89

Query: 1198 RHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSACRRHC--- 1028
             H    +K P+ S ASSGSGQY VS+ LG+PPQ LLLVADTGSDLTW+ CSAC+ +C   
Sbjct: 90   NHRQASSKSPLISGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWLRCSACKTNCSIH 149

Query: 1027 PSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCRYKYSYSDGSVTSGF 848
            P  +TF  RHSTTFSP HCF S C+LVPHP  + PCNHTRLHSTCRY+Y YSDGS TSGF
Sbjct: 150  PPGSTFLARHSTTFSPAHCFSSLCQLVPHPNPN-PCNHTRLHSTCRYEYVYSDGSKTSGF 208

Query: 847  FSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVMGLGRGPISFSSQLG 668
            FS ETT  N+SS + ++ ++++FGCGF+ SGPS+   SFNGA+GVMGLGRGPISF+SQLG
Sbjct: 209  FSKETTTLNTSSGREMKLKNIAFGCGFHVSGPSLIRSSFNGASGVMGLGRGPISFASQLG 268

Query: 667  REFGHTFSYCLMDYTLSPPPTSYLLIG---GSQYGGAHKLSYTPLLINPFSPTFYYIGIE 497
            R FG +FSYCLMDYTLSPPPTSYL+IG    S+      +SYTPLL+NP +PTFYYI I+
Sbjct: 269  RRFGRSFSYCLMDYTLSPPPTSYLMIGDVVSSKKDNKSVMSYTPLLVNPEAPTFYYIAIK 328

Query: 496  NVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILAKVDRLVKLPK--- 326
             VF++ VKLRI+PSVW+IDELGNGGTV+DSGTTLTFL  PAYR IL+   R VKLP    
Sbjct: 329  GVFVDGVKLRIDPSVWSIDELGNGGTVIDSGTTLTFLIEPAYREILSAFKREVKLPSPTP 388

Query: 325  -SVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADGVKCLALQPV- 152
                   GFDLC NV+   +P  PRLS +L G SL++PPP NYFID+++G+KCLA+QPV 
Sbjct: 389  GGASTQSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVE 448

Query: 151  APSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSV 41
            A S GFSVIGNLMQQG+  EF + + RLGFSR GC+V
Sbjct: 449  AESGGFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCAV 485


>ref|XP_009356142.1| PREDICTED: aspartic proteinase nepenthesin-1 [Pyrus x bretschneideri]
          Length = 454

 Score =  526 bits (1355), Expect = e-146
 Identities = 261/409 (63%), Positives = 307/409 (75%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSD 1067
            S+ LS D+ R+S LH R       LP+ S ASSGSGQY V L +GTPPQRLLLVADTGSD
Sbjct: 50   SQTLSHDTHRLSLLHSRRRDI--TLPVVSGASSGSGQYFVDLRIGTPPQRLLLVADTGSD 107

Query: 1066 LTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHST 896
            L W++CSAC   C +R   + F  RHS+TFSPYHC++S+CKLVP P  + PCNHTRLHS 
Sbjct: 108  LVWLTCSACT-DCSNRGPGSAFLARHSSTFSPYHCYNSACKLVPPPDPN-PCNHTRLHSP 165

Query: 895  CRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANG 716
            CRY+YSYSDGS+T+GFFS ETT  N+SS    +  HLSFGC F   GPS++GPSFNGA G
Sbjct: 166  CRYEYSYSDGSLTAGFFSKETTTLNTSSGTHTELPHLSFGCAFRVEGPSITGPSFNGAQG 225

Query: 715  VMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGG---SQYGGAHKLSYTP 545
            VMGLGRGPISFSSQLGR FG+ FSYCLMDY L P PTSYL IGG   S+     K S+TP
Sbjct: 226  VMGLGRGPISFSSQLGRRFGNKFSYCLMDYPLPPSPTSYLRIGGGSPSRVVSNKKFSFTP 285

Query: 544  LLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRS 365
            L +N F+PTFYYIGI++V ++  KL I PSVWA+D  GNGGTV+DSGTTL+FL  PAYR 
Sbjct: 286  LQVNNFAPTFYYIGIKSVSVHGAKLPIRPSVWALDSSGNGGTVIDSGTTLSFLPEPAYRL 345

Query: 364  ILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVA 185
            ILA   R ++L    +PT GFDLC NVS   +P LPR+SFKL G S+FAPPP +YFID A
Sbjct: 346  ILAAFKRNIRLASPANPTPGFDLCVNVSGASRPRLPRMSFKLAGNSVFAPPPSSYFIDTA 405

Query: 184  DGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            D +KCLA+QPV   SGF VIGNLMQQG+ FEF +DR  LGFSR GC++P
Sbjct: 406  DRIKCLAIQPVESGSGFGVIGNLMQQGFLFEFDRDRSLLGFSRHGCALP 454


>ref|XP_002311432.2| hypothetical protein POPTR_0008s11480g [Populus trichocarpa]
            gi|550332858|gb|EEE88799.2| hypothetical protein
            POPTR_0008s11480g [Populus trichocarpa]
          Length = 486

 Score =  526 bits (1355), Expect = e-146
 Identities = 271/457 (59%), Positives = 332/457 (72%), Gaps = 19/457 (4%)
 Frame = -3

Query: 1354 MSSCLLIFSLLIALPDLSSAXXXXXXXXXXXXXXXPS--EALSADSRRVSALH------Q 1199
            +S  LL   LL+A  DLS++               P+  ++LS+D +R+S LH      Q
Sbjct: 30   VSLSLLFHLLLLAFVDLSTSTTEYLKLPLLHKTPFPTPLQSLSSDLQRLSLLHHSHHRHQ 89

Query: 1198 RHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSACRRHC--- 1028
             H    +K P+ S ASSGSGQY VS+ LG+PPQ LLLVADTGSDLTWV CSAC+ +C   
Sbjct: 90   NHRRTSSKSPLMSGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIH 149

Query: 1027 PSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCRYKYSYSDGSVTSGF 848
            P  +TF  RHSTTFSP HCF S C+LVP P  + PCNHTRLHSTCRY+Y YSDGS TSGF
Sbjct: 150  PPGSTFLARHSTTFSPTHCFSSLCQLVPQPNPN-PCNHTRLHSTCRYEYVYSDGSKTSGF 208

Query: 847  FSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVMGLGRGPISFSSQLG 668
            FS ETT  N+SS + ++ + ++FGCGF+ SGPS+ G SFNGA+GVMGLGRGPISF+SQLG
Sbjct: 209  FSKETTTLNTSSGREMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLG 268

Query: 667  REFGHTFSYCLMDYTLSPPPTSYLLIG---GSQYGGAHKLSYTPLLINPFSPTFYYIGIE 497
            R FG +FSYCL+DYTLSPPPTSYL+IG    ++      +S+TPLLINP +PTFYYI I+
Sbjct: 269  RRFGRSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIK 328

Query: 496  NVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILAKVDRLVKLPK--- 326
             VF++ VKL I+PSVW++DELGNGGTV+DSGTTLTFLT PAYR IL+   R VKLP    
Sbjct: 329  GVFVDGVKLHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTP 388

Query: 325  -SVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADGVKCLALQPVA 149
                   GFDLC NV+   +P  PRLS +L G SL++PPP NYFID+++G+KCLA+QPV 
Sbjct: 389  GGASTQSGFDLCVNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVE 448

Query: 148  PSSG-FSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSV 41
              SG FSVIGNLMQQG+  EF + + RLGFSR GC+V
Sbjct: 449  AESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCAV 485


>ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 458

 Score =  526 bits (1354), Expect = e-146
 Identities = 266/410 (64%), Positives = 306/410 (74%), Gaps = 7/410 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQR-HGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGS 1070
            S+ALS DS R+S      H     K P+ S AS+GSGQY V L LGTPPQ+LLLVADTGS
Sbjct: 51   SQALSFDSHRLSFFFSALHTPQSLKSPVVSGASTGSGQYFVDLRLGTPPQKLLLVADTGS 110

Query: 1069 DLTWVSCSACR---RHCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHS 899
            DL WV CSACR   RH P  A F  RHSTTFSP HC+DS+C+LVP P KH  CNH RLHS
Sbjct: 111  DLVWVKCSACRNCTRHTPGSA-FLARHSTTFSPNHCYDSACQLVPLP-KHHRCNHARLHS 168

Query: 898  TCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGAN 719
             CRY+YSY DGS TSGFFS ETT  N+SS +  + + ++FGC F  SGPSVSG SFNGA+
Sbjct: 169  PCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGASFNGAH 228

Query: 718  GVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYG---GAHKLSYT 548
            GVMGLGRGPIS SSQLG  FG+ FSYCLMD+ +SP PTSYLLIG +Q     G  ++ +T
Sbjct: 229  GVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFT 288

Query: 547  PLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYR 368
            PL INP SPTFYYIGIE+V ++ +KL INPSVWA+DELGNGGT+VDSGTTLTFL  PAY 
Sbjct: 289  PLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYL 348

Query: 367  SILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDV 188
             IL  + R V+LP   +PT GFDLC NVS    P LP+LSFKL G S+F+PPP NYF+D 
Sbjct: 349  QILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDT 408

Query: 187  ADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             + VKCLALQ V   SGFSVIGNLMQQG+  EF KDR RLGFSR GC++P
Sbjct: 409  DEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCALP 458


>ref|XP_007227595.1| hypothetical protein PRUPE_ppa017015mg [Prunus persica]
            gi|462424531|gb|EMJ28794.1| hypothetical protein
            PRUPE_ppa017015mg [Prunus persica]
          Length = 447

 Score =  525 bits (1351), Expect = e-146
 Identities = 262/408 (64%), Positives = 311/408 (76%), Gaps = 5/408 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSD 1067
            S+ALS D+ R+S LH R      K P+ S AS+GSGQY V L LGTPPQ LLLVADTGSD
Sbjct: 44   SQALSHDTHRLSLLHARRHDI--KSPVVSGASTGSGQYFVDLRLGTPPQSLLLVADTGSD 101

Query: 1066 LTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHST 896
            L W++CSAC  +C +R   + F  RHS+TFSPYHC+DS+C L+P P    PCN TRLHS 
Sbjct: 102  LVWLTCSACT-NCSNRDPGSAFLARHSSTFSPYHCYDSACTLIPQPDPS-PCNRTRLHSP 159

Query: 895  CRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANG 716
            CRY+Y+YSDGS+T+GFFS ETT   +SS +  Q  +LSFGCGF  SGPSV+GPSFNGA+G
Sbjct: 160  CRYEYTYSDGSLTAGFFSRETTTLKTSSGRETQLPNLSFGCGFRVSGPSVTGPSFNGAHG 219

Query: 715  VMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGS-QYGGAHKLSYTPLL 539
            VMGLGRGPISF+SQLGR FG+ FSYCLMDYTLSPPPTSYL IGG   +    K+ +TP+L
Sbjct: 220  VMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLRIGGGFPHDVVSKIRFTPML 279

Query: 538  INPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSIL 359
            +NP SPTFYYIGI++  +N  KL I+PSVW++D  GNGGTV+DSGTTLTFL   AYR IL
Sbjct: 280  VNPLSPTFYYIGIKSASVNGRKLPIHPSVWSLDRAGNGGTVIDSGTTLTFLPETAYRVIL 339

Query: 358  AKVDRLVK-LPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVAD 182
            A   R ++ L K   PT GFDLC NVS   +PSLPRLSF+L G +LFAPPP +YFID A+
Sbjct: 340  AAFKRSLRLLAKPAKPTPGFDLCINVSGVARPSLPRLSFRLVGNALFAPPPSSYFIDTAE 399

Query: 181  GVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             VKCLA+QPV   SGF VIGNLMQQG+ FEF +D+ RLGFSR GC+ P
Sbjct: 400  QVKCLAIQPVDSGSGFGVIGNLMQQGFLFEFDRDKSRLGFSRHGCARP 447


>ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
            communis] gi|223536362|gb|EEF38012.1| basic 7S globulin 2
            precursor small subunit, putative [Ricinus communis]
          Length = 455

 Score =  523 bits (1348), Expect = e-145
 Identities = 267/456 (58%), Positives = 319/456 (69%), Gaps = 17/456 (3%)
 Frame = -3

Query: 1354 MSSCLLIFSLLIALPDLSSAXXXXXXXXXXXXXXXP------SEALSAD-SRRVSALH-- 1202
            M S L  F LLI L   SSA                      SEAL+ D +RR+S LH  
Sbjct: 1    MVSLLFFFFLLITLCPSSSAAANTTTEYLKLPLLHKTPFTSPSEALAFDINRRLSLLHHH 60

Query: 1201 ---QRHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSDLTWVSCSACRR- 1034
               Q+H     + P+ S ASSGSGQY VSL +GTPPQ LLLVADTGSDL WV CS CR  
Sbjct: 61   RHQQQHKQNSFRSPVISGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNC 120

Query: 1033 -HCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCRYKYSYSDGSVT 857
             H    + FF RHSTT+S  HC+   C+LVPHP  + PCN TRLHS CRY+Y+Y+D S T
Sbjct: 121  SHRSPGSAFFARHSTTYSAIHCYSPQCQLVPHPHPN-PCNRTRLHSPCRYQYTYADSSTT 179

Query: 856  SGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVMGLGRGPISFSS 677
            +GFFS E    N+S+ K+ +   LSFGCGF  SGPS++G SF GA GVMGLGR PISFSS
Sbjct: 180  TGFFSKEALTLNTSTGKVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSS 239

Query: 676  QLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAHK---LSYTPLLINPFSPTFYYI 506
            QLGR FG  FSYCLMDYTLSPPPTS+L IGG+Q     K   +S+TPLLINP SPTFYYI
Sbjct: 240  QLGRRFGSKFSYCLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYI 299

Query: 505  GIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILAKVDRLVKLPK 326
             I+ V++N VKL INPSVW+ID+LGNGGT++DSGTTLTF+T PAY  IL    + VKLP 
Sbjct: 300  AIKGVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPS 359

Query: 325  SVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVADGVKCLALQPVAP 146
              +PT GFDLC NVS   +P+LPR+SF L GGS+F+PPP NYFI+  D +KCLA+QPV+ 
Sbjct: 360  PAEPTPGFDLCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQ 419

Query: 145  SSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
              GFSV+GNLMQQG+  EF +D+ RLGF+R GC++P
Sbjct: 420  DGGFSVLGNLMQQGFLLEFDRDKSRLGFTRRGCALP 455


>ref|XP_008339708.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase nepenthesin-1
            [Malus domestica]
          Length = 458

 Score =  522 bits (1345), Expect = e-145
 Identities = 261/411 (63%), Positives = 308/411 (74%), Gaps = 8/411 (1%)
 Frame = -3

Query: 1246 SEALSADSRR-VSALHQ-RHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTG 1073
            S+ LS D+   +S LH  R       LP+ S ASSGSGQY V L +GTPPQRLLLV DTG
Sbjct: 50   SQTLSHDTHXXLSLLHSXRRRRRDITLPVVSGASSGSGQYFVDLRIGTPPQRLLLVTDTG 109

Query: 1072 SDLTWVSCSACRRHCPSR---ATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLH 902
            SDL W++CSAC   C +R   + F  RHS+TFSPYHC+DS+CKL+P P  + PCNHTRLH
Sbjct: 110  SDLVWLTCSACT-DCSNREPGSAFLARHSSTFSPYHCYDSACKLIPPPDPN-PCNHTRLH 167

Query: 901  STCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGA 722
            S CRY+YSYSDGS+T+GFFS ETT  N+SS    Q  +LSFGC F   GPS++GPSFNGA
Sbjct: 168  SPCRYEYSYSDGSLTAGFFSKETTTLNTSSGTRTQLPNLSFGCAFRVEGPSITGPSFNGA 227

Query: 721  NGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGG---SQYGGAHKLSY 551
             GVMGLGRGPISFSSQLGR F + FSYCLMDYTLSP PTSYL IGG   S+     K S+
Sbjct: 228  QGVMGLGRGPISFSSQLGRRFXNKFSYCLMDYTLSPSPTSYLRIGGGSPSRVVSNTKFSF 287

Query: 550  TPLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAY 371
            TPL +N F+PTFYYIGI++V ++  KL I PSVWA+DE GNGG V+DSGTTL+FL  PAY
Sbjct: 288  TPLQVNDFAPTFYYIGIKSVSVHGAKLPIRPSVWALDESGNGGIVIDSGTTLSFLPEPAY 347

Query: 370  RSILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFID 191
            R ILA   R ++L +  +PT GFDLC NVS   +P LPR+SFKL G S+FAPPP +YFID
Sbjct: 348  RVILAAFKRNIRLARPANPTXGFDLCVNVSGASRPRLPRMSFKLAGNSVFAPPPSSYFID 407

Query: 190  VADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
             AD +KCLA+QPV   SGF VIGNLMQQG+ FEF +DR RLGFSR GC++P
Sbjct: 408  TADRIKCLAIQPVESGSGFGVIGNLMQQGFLFEFDRDRSRLGFSRHGCALP 458


>ref|XP_012082020.1| PREDICTED: aspartic proteinase nepenthesin-1 [Jatropha curcas]
            gi|643718002|gb|KDP29358.1| hypothetical protein
            JCGZ_18279 [Jatropha curcas]
          Length = 455

 Score =  522 bits (1344), Expect = e-145
 Identities = 259/409 (63%), Positives = 315/409 (77%), Gaps = 6/409 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQRHGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSD 1067
            ++AL  D RR+S LH++      K P+ S AS+GSGQY VSL LG+P Q LLLVADTGSD
Sbjct: 51   AQALPFDIRRLSLLHRQRTSL--KSPVISGASTGSGQYFVSLRLGSPAQTLLLVADTGSD 108

Query: 1066 LTWVSCSACRR---HCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHST 896
            L WV CSAC+    + P  A F  RHS+TFS  HCF+S C+LVPHPR + PCN TRLHS 
Sbjct: 109  LVWVKCSACKNCSNYSPGSA-FLARHSSTFSLIHCFNSQCRLVPHPRPN-PCNRTRLHSP 166

Query: 895  CRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANG 716
            CRY+YSY+DGS TSGFFS ETT  N+S+ +  + ++L+FGCGF  SGPS++G SF GA+G
Sbjct: 167  CRYEYSYADGSSTSGFFSKETTTLNTSAGREKKLKNLAFGCGFRISGPSLTGASFAGAHG 226

Query: 715  VMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAHK---LSYTP 545
            V+GLGR PISFSSQLGR FG+ FSYCLMDYTLSPPPTSYL+IGG Q     +   L++TP
Sbjct: 227  VIGLGRAPISFSSQLGRRFGNKFSYCLMDYTLSPPPTSYLMIGGHQNSAVSRKRILNFTP 286

Query: 544  LLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRS 365
            LL+N  SPTFYYIGI++V ++ VKL INPSVW+ID+LGNGGT++DSGTTLTFL  PAYR 
Sbjct: 287  LLVNSLSPTFYYIGIKSVSVDGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFLVEPAYRE 346

Query: 364  ILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVA 185
            IL+ + R VKLP   + T GFDLC NVS   +P  PR+S +L G S+F+PPP NYFID +
Sbjct: 347  ILSAIKRRVKLPGPGELTPGFDLCVNVSGVRRPVFPRMSLELAGNSVFSPPPRNYFIDTS 406

Query: 184  DGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            +GVKCLA+QPV   SGFSVIGNLMQQGY  EF +DR RLGF+RSGC++P
Sbjct: 407  EGVKCLAIQPVNSGSGFSVIGNLMQQGYLLEFDRDRSRLGFARSGCALP 455


>ref|XP_010265280.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Nelumbo nucifera]
          Length = 523

 Score =  518 bits (1335), Expect = e-144
 Identities = 267/482 (55%), Positives = 329/482 (68%), Gaps = 14/482 (2%)
 Frame = -3

Query: 1441 PLVSPISVQYLIVQLQVFLLPTTLMAFISMSSCLLIFSLLI--------ALPDLSSAXXX 1286
            P  SP S+  L+  L  F L  T++  + +    L+  +L+        A  +  S    
Sbjct: 42   PFQSP-SLSLLLRLLSFFFLSQTMVVRVPLPRVFLLPLVLLFSFINAQAAFNNSGSVEYL 100

Query: 1285 XXXXXXXXXXXXPSEALSADSRRVSALHQRHGHFHT-KLPITSAASSGSGQYLVSLHLGT 1109
                        P++ LS DS R+S L     +    K PI S AS+GSGQY V   +GT
Sbjct: 101  KLRLLHRNPFVSPAQVLSLDSERLSVLFSALRNRRAFKAPIVSGASTGSGQYFVDFRIGT 160

Query: 1108 PPQRLLLVADTGSDLTWVSCSAC---RRHCPSRATFFPRHSTTFSPYHCFDSSCKLVPHP 938
            PPQ LLLVADTGSDL WV CSAC    +H P  A F  RHSTTF+P HC+DS+C+ VPHP
Sbjct: 161  PPQSLLLVADTGSDLVWVKCSACWNCSKHPPGSA-FLARHSTTFAPVHCYDSACQHVPHP 219

Query: 937  RKHFPCNHTRLHSTCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNS 758
             KH PCNHTRLHSTCRY YSY+DGS TSG F+ ETT  N+S     + + L+FGCGFN S
Sbjct: 220  LKHQPCNHTRLHSTCRYDYSYADGSRTSGLFATETTTLNTSYGGAARLKDLAFGCGFNVS 279

Query: 757  GPSVSGPSFNGANGVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQ 578
            GPSVS  SFNGA+GVMGLGRGP+SFSSQ+G+ FG+ FSYCL DYT+SPPPTSYLLIG + 
Sbjct: 280  GPSVSDASFNGAHGVMGLGRGPVSFSSQVGKLFGNKFSYCLKDYTISPPPTSYLLIGETH 339

Query: 577  --YGGAHKLSYTPLLINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSG 404
                   ++S+TPL  NP SP+FYY+GI++VF++ V L I+PS+WA+D  GNGGTV+DSG
Sbjct: 340  GPITKKQRMSFTPLHTNPLSPSFYYVGIKSVFVDGVGLPIDPSIWALDNQGNGGTVIDSG 399

Query: 403  TTLTFLTLPAYRSILAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSL 224
            TTLTFL  PAYR +L    R VKLP++ DP    DLC NVS    P LP+LSF+L GGS+
Sbjct: 400  TTLTFLAEPAYRQVLTAFRRRVKLPRTTDPASSLDLCVNVSGVANPRLPKLSFRLDGGSV 459

Query: 223  FAPPPLNYFIDVADGVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCS 44
            F+PP  NYFID A+G+KCLA+QPV   SGFSVIGNLMQQG+ FEF ++R  LGFSR GC+
Sbjct: 460  FSPPARNYFIDAAEGIKCLAMQPVTSPSGFSVIGNLMQQGFLFEFDRERSWLGFSRHGCA 519

Query: 43   VP 38
            +P
Sbjct: 520  LP 521


>ref|XP_010262754.1| PREDICTED: aspartic proteinase nepenthesin-1 [Nelumbo nucifera]
          Length = 460

 Score =  515 bits (1327), Expect = e-143
 Identities = 247/408 (60%), Positives = 308/408 (75%), Gaps = 5/408 (1%)
 Frame = -3

Query: 1246 SEALSADSRRVSALHQR-HGHFHTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGS 1070
            ++ALS DS R+S L+         K P+ S AS+G GQY V   +G+PPQ+LLLVADTGS
Sbjct: 54   AQALSLDSHRLSVLYSALQSRKSLKSPVVSGASTGFGQYFVDFRIGSPPQKLLLVADTGS 113

Query: 1069 DLTWVSCSACR---RHCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHS 899
            DL WV CSACR   +H P  A F  RHSTTF+P HC+D +C+LVPHP KH PCNHT LHS
Sbjct: 114  DLVWVKCSACRNCSKHAPGLA-FLARHSTTFAPIHCYDPACQLVPHPVKHQPCNHTLLHS 172

Query: 898  TCRYKYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGAN 719
            TCRY+Y Y+D S TSGFFS ET   N+S  ++ + + L+FGCGF+ SGPSVSG SFNGA+
Sbjct: 173  TCRYEYLYADESRTSGFFSRETVTLNTSFGRVARLKKLAFGCGFHISGPSVSGASFNGAH 232

Query: 718  GVMGLGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQ-YGGAHKLSYTPL 542
            GVMGLGRGP SFSSQ+G+ FG+ FSYCLMDYT+SPPPTSYLLIG +Q       +S+TPL
Sbjct: 233  GVMGLGRGPTSFSSQVGKRFGYKFSYCLMDYTISPPPTSYLLIGETQPITRKQMMSFTPL 292

Query: 541  LINPFSPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSI 362
              +  SP+FYYIGI++VFI+ V L I+PS+WA+D  GNGGTV+DSGTTLTF+  PAYR +
Sbjct: 293  HTSALSPSFYYIGIKSVFIDGVGLPIDPSIWALDNQGNGGTVIDSGTTLTFIAEPAYRQV 352

Query: 361  LAKVDRLVKLPKSVDPTLGFDLCFNVSSDLKPSLPRLSFKLRGGSLFAPPPLNYFIDVAD 182
            L    + ++LP++ DP+   D C NVS    PSLP+LSF+L G S+F+PP  NYFID A+
Sbjct: 353  LTAFKKRIRLPRTTDPSSSLDFCVNVSGVANPSLPKLSFRLEGDSVFSPPARNYFIDAAE 412

Query: 181  GVKCLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            GVKCLA++PV   SGFS+IGNLMQQG+ FEF ++R RLGFSR GC++P
Sbjct: 413  GVKCLAMRPVTTPSGFSIIGNLMQQGFLFEFDRERSRLGFSRHGCALP 460


>ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA
            binding protein-like; nucellin-like protein [Arabidopsis
            thaliana] gi|189339286|gb|ACD89063.1| At3g25700
            [Arabidopsis thaliana] gi|332643533|gb|AEE77054.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  514 bits (1325), Expect = e-143
 Identities = 266/465 (57%), Positives = 324/465 (69%), Gaps = 6/465 (1%)
 Frame = -3

Query: 1414 YLIVQLQVFLLPTTLMAFISMSSCLLIFSLLIALPDLSSAXXXXXXXXXXXXXXXPSEAL 1235
            +L   L +FLLP + +A +S  +  L   LL   P  S                  ++AL
Sbjct: 7    FLCSFLSLFLLPPSNIAAVSNHNKYLKLPLLRKSPFPSP-----------------TQAL 49

Query: 1234 SADSRRVSALHQRHGHF-HTKLPITSAASSGSGQYLVSLHLGTPPQRLLLVADTGSDLTW 1058
            + D+RR+  L  R       K P+ S A+SGSGQY V L +G PPQ LLL+ADTGSDL W
Sbjct: 50   ALDTRRLHFLSLRRKPIPFVKSPVVSGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVW 109

Query: 1057 VSCSACRR---HCPSRATFFPRHSTTFSPYHCFDSSCKLVPHPRKHFPCNHTRLHSTCRY 887
            V CSACR    H P+   FFPRHS+TFSP HC+D  C+LVP P +   CNHTR+HSTC Y
Sbjct: 110  VKCSACRNCSHHSPA-TVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHY 168

Query: 886  KYSYSDGSVTSGFFSHETTAFNSSSDKLVQFQHLSFGCGFNNSGPSVSGPSFNGANGVMG 707
            +Y Y+DGS+TSG F+ ETT+  +SS K  + + ++FGCGF  SG SVSG SFNGANGVMG
Sbjct: 169  EYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSFNGANGVMG 228

Query: 706  LGRGPISFSSQLGREFGHTFSYCLMDYTLSPPPTSYLLIGGSQYGGAHKLSYTPLLINPF 527
            LGRGPISF+SQLGR FG+ FSYCLMDYTLSPPPTSYL+IG     G  KL +TPLL NP 
Sbjct: 229  LGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGG-DGISKLFFTPLLTNPL 287

Query: 526  SPTFYYIGIENVFINNVKLRINPSVWAIDELGNGGTVVDSGTTLTFLTLPAYRSILAKVD 347
            SPTFYY+ +++VF+N  KLRI+PS+W ID+ GNGGTVVDSGTTL FL  PAYRS++A V 
Sbjct: 288  SPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 347

Query: 346  RLVKLPKSVDPTLGFDLCFNVSSDLKPS--LPRLSFKLRGGSLFAPPPLNYFIDVADGVK 173
            R VKLP +   T GFDLC NVS   KP   LPRL F+  GG++F PPP NYFI+  + ++
Sbjct: 348  RRVKLPIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQ 407

Query: 172  CLALQPVAPSSGFSVIGNLMQQGYTFEFAKDRMRLGFSRSGCSVP 38
            CLA+Q V P  GFSVIGNLMQQG+ FEF +DR RLGFSR GC++P
Sbjct: 408  CLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCALP 452


Top