BLASTX nr result

ID: Akebia24_contig00001004 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00001004
         (2039 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007046609.1| Eukaryotic aspartyl protease family protein,...   583   e-163
ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   566   e-158
ref|XP_007046607.1| Eukaryotic aspartyl protease family protein ...   565   e-158
ref|XP_007046606.1| Eukaryotic aspartyl protease family protein ...   561   e-157
ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein ...   558   e-156
ref|XP_006467009.1| PREDICTED: aspartic proteinase-like protein ...   557   e-156
ref|XP_006425380.1| hypothetical protein CICLE_v10025374mg [Citr...   557   e-156
ref|XP_007204828.1| hypothetical protein PRUPE_ppa004096mg [Prun...   544   e-152
ref|NP_849967.1| aspartyl protease family protein [Arabidopsis t...   543   e-152
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   540   e-151
ref|XP_006574660.1| PREDICTED: aspartic proteinase-like protein ...   539   e-150
ref|XP_006299902.1| hypothetical protein CARUB_v10016111mg [Caps...   539   e-150
ref|XP_004232517.1| PREDICTED: aspartic proteinase-like protein ...   539   e-150
ref|XP_007203645.1| hypothetical protein PRUPE_ppa004265mg [Prun...   538   e-150
ref|XP_006340776.1| PREDICTED: aspartic proteinase-like protein ...   537   e-150
gb|EXC35303.1| Aspartic proteinase-like protein 1 [Morus notabilis]   536   e-149
ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor,...   536   e-149
ref|XP_006409248.1| hypothetical protein EUTSA_v10022648mg [Eutr...   535   e-149
ref|XP_006828808.1| hypothetical protein AMTR_s00001p00126200 [A...   531   e-148
ref|XP_006599302.1| PREDICTED: aspartic proteinase-like protein ...   528   e-147

>ref|XP_007046609.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|508698870|gb|EOX90766.1| Eukaryotic
            aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 519

 Score =  583 bits (1502), Expect = e-163
 Identities = 288/525 (54%), Positives = 378/525 (72%), Gaps = 5/525 (0%)
 Frame = +1

Query: 154  MDSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAM 333
            + S + VLL++++   +   C+GF TFGFDIHHRYSD VK  L VDELP  GSLEYY+AM
Sbjct: 4    LSSYSCVLLLVVLGLSAGSCCYGFGTFGFDIHHRYSDPVKDFLTVDELPAKGSLEYYSAM 63

Query: 334  AHRDRIIRGRGLAGDKAQF-LTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGS 510
             HRD+II+GR LA    Q  +TF +GN+T+ ++ LGFL+YAN+S+G+P+L FLVALDTGS
Sbjct: 64   VHRDKIIKGRRLATANDQTPVTFLDGNETYRLSGLGFLYYANVSVGSPALSFLVALDTGS 123

Query: 511  DLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKC 690
            DLFW+PCDC SC++ LST +G  +D NIYSP       KVPC+S +C+ Q RCS + + C
Sbjct: 124  DLFWLPCDCSSCVQGLSTADGQTIDFNIYSPNTSSTSSKVPCSSDMCEQQKRCSSSQSNC 183

Query: 691  PYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNG 870
            PYQ++YLSNGTSS+G+LVEDVLHLTTD  ++ +  V A ITFGCG+ QTGSFL+GAAPNG
Sbjct: 184  PYQILYLSNGTSSTGVLVEDVLHLTTD--EDKTKAVQAKITFGCGKVQTGSFLNGAAPNG 241

Query: 871  LFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPT 1050
            LFGLGMD  SVPS L++  +T++SFSMCFG +G+GRI+FGD+GS  Q ETPFNL + HPT
Sbjct: 242  LFGLGMDNISVPSTLANENITSNSFSMCFGRDGIGRITFGDRGSSYQGETPFNLRKSHPT 301

Query: 1051 YNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTR-I 1227
            YN+S+ Q+++G N  D+DFSA+FDSGTSFTYLNDPAY+++SESFN+   +KRH  D+  +
Sbjct: 302  YNVSITQINVGGNAGDLDFSAVFDSGTSFTYLNDPAYTFISESFNNMAIEKRHTSDSSDL 361

Query: 1228 PFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELL---VYCLGVVKSPD 1398
            PF+YCYDL ++  +   P V L MKGG  F V DP+V +S++ ++    +YCLGVVKS D
Sbjct: 362  PFDYCYDLSANQTNFTYPVVNLTMKGGDSFFVDDPIVVVSLKVKVHSGDLYCLGVVKSDD 421

Query: 1399 VNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYT 1578
            VNIIGQNFMTG R+VFDREK VL W  S+CYDIE  +                       
Sbjct: 422  VNIIGQNFMTGYRIVFDREKMVLGWNPSDCYDIEAKTLPVRPPTAVPPAVA-------VN 474

Query: 1579 PEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 1713
            PEAT   GN S  SGA P    +S ++K++ +  ++  +   A++
Sbjct: 475  PEATAGNGNTSHISGASPPMANQSPKMKTLSYALIVALIPFFALI 519


>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  567 bits (1460), Expect = e-158
 Identities = 284/523 (54%), Positives = 369/523 (70%), Gaps = 6/523 (1%)
 Frame = +1

Query: 163  SAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHR 342
            S+F+LL++L+ + SS S +GF TFGFD+HHRYSD VKG+L+VD+LP+ GSL YYA+MAHR
Sbjct: 19   SSFLLLLVLMLSSSSFS-YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHR 77

Query: 343  DRIIRGRGLAGDKAQF-LTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 519
            D +I GR L  D     LTF +GN+T+  + LGFLHYAN+S+GTPSL +LVALDTGSDLF
Sbjct: 78   DILIHGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLF 137

Query: 520  WIPCDCKS--CIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCP 693
            W+PCDC +  C++ L   +G ++D NIY P      + +PCN++LC  Q RC    + CP
Sbjct: 138  WLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCP 197

Query: 694  YQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGL 873
            YQV YLSNGTSS+G+LVED+LHLTTD  D  S  +DA I FGCG+ QTGSFLDGAAPNGL
Sbjct: 198  YQVQYLSNGTSSTGVLVEDLLHLTTD--DAQSRALDAKIIFGCGRVQTGSFLDGAAPNGL 255

Query: 874  FGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTY 1053
            FGLGM   SVPS L+  G T++SFSMCFG +G+GRISFGD GS  Q ETPFNL QLHPTY
Sbjct: 256  FGLGMTNISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTY 315

Query: 1054 NISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPF 1233
            N+S+ ++++G    D++FSAIFDSGTSFTYLNDPAY+ +SESFN   K+KR+   + IPF
Sbjct: 316  NVSITKINVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPF 375

Query: 1234 EYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIG 1413
            EYCY++ S+  +   P V L+M+GGSQF V DP+V + ++    +YCL +VKS DVNIIG
Sbjct: 376  EYCYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIG 435

Query: 1414 QNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATK 1593
            QNFMTG R+VF+RE++VL WK S+CYD   ++                       P+AT 
Sbjct: 436  QNFMTGYRIVFNRERNVLGWKASDCYDDMDTTTFPVDPISPGIPPATA-----VNPQATA 490

Query: 1594 ETGNGSRTSGAPP---SSDGKSSQLKSICFTFLMFYLFILAMV 1713
             +GN +  SG PP   ++  K  +L S+ F  +M  +    +V
Sbjct: 491  GSGNTTEVSGTPPPVGNNAPKLPKLNSLTFAIIMVLIPFFTIV 533


>ref|XP_007046607.1| Eukaryotic aspartyl protease family protein isoform 2 [Theobroma
            cacao] gi|508698868|gb|EOX90764.1| Eukaryotic aspartyl
            protease family protein isoform 2 [Theobroma cacao]
          Length = 519

 Score =  565 bits (1456), Expect = e-158
 Identities = 288/496 (58%), Positives = 353/496 (71%), Gaps = 6/496 (1%)
 Frame = +1

Query: 226  RTFGFDIHHRYSDSVKGILN----VDELPKMGSLEYYAAMAHRDRIIRGRGLAGDKAQFL 393
            R F F +HHR+S+ VK   N    +   P  GS EYYA +AHRDR++RGR L+G  A  +
Sbjct: 26   RIFTFKMHHRFSEPVKNWSNSTGKLSHWPVKGSFEYYAVLAHRDRLLRGRQLSGINAP-I 84

Query: 394  TFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCDCKSCIKALSTNNG 573
            +FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDLFW+PCDC  C     T   
Sbjct: 85   SFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCNKCAPTEGTTYA 144

Query: 574  SKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDV 753
            S  +L+IY P      KKV CNSSLC  + +C GT + CPY V Y+S  TS+SG+LVEDV
Sbjct: 145  SDFELSIYDPKGSSTSKKVTCNSSLCALRNQCLGTFSNCPYMVSYMSAQTSTSGVLVEDV 204

Query: 754  LHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLT 933
            LHLTT+  D H  +V A +TFGCGQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  GLT
Sbjct: 205  LHLTTE--DGHPELVKAYVTFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSILSQEGLT 262

Query: 934  ADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSA 1113
            ADSFSMCFG +G+GRISFGDKGSPDQEETPFNLN   PTYNI++ Q+ +GT L D DF+A
Sbjct: 263  ADSFSMCFGHDGIGRISFGDKGSPDQEETPFNLNPSRPTYNITITQIRVGTTLIDDDFTA 322

Query: 1114 IFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTL 1293
            +FDSGTSFTYL DP YS LSE+F+SQ +D+R PPD+RIPFEYCYD+  DAN++L P+++L
Sbjct: 323  LFDSGTSFTYLVDPTYSNLSENFHSQAQDRRRPPDSRIPFEYCYDMSPDANASLIPSMSL 382

Query: 1294 IMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRW 1473
             MKG SQFPV+DP++ IS ++  LVYCL VVKS ++NIIGQNFMTG RVVFDRE+ VL W
Sbjct: 383  TMKGESQFPVYDPIIVISTQQSKLVYCLAVVKSTELNIIGQNFMTGYRVVFDRERFVLGW 442

Query: 1474 KESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNY-TPEATKETG-NGSRTSGAPPSSDGK 1647
            K+ +CYDI+ +S                    NY TPEATK+ G N S TS A  S   +
Sbjct: 443  KKFDCYDIDETSASVVESHAASAPPAFAVGIRNYSTPEATKDIGKNNSHTSFALRSCHFQ 502

Query: 1648 SSQLKSICFTFLMFYL 1695
             S L  + F  ++  L
Sbjct: 503  VSPLSCLGFVSILSLL 518


>ref|XP_007046606.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao] gi|508698867|gb|EOX90763.1| Eukaryotic aspartyl
            protease family protein isoform 1 [Theobroma cacao]
          Length = 518

 Score =  561 bits (1446), Expect = e-157
 Identities = 288/496 (58%), Positives = 353/496 (71%), Gaps = 6/496 (1%)
 Frame = +1

Query: 226  RTFGFDIHHRYSDSVKGILN----VDELPKMGSLEYYAAMAHRDRIIRGRGLAGDKAQFL 393
            R F F +HHR+S+ VK   N    +   P  GS EYYA +AHRDR++RGR L+G  A  +
Sbjct: 26   RIFTFKMHHRFSEPVKNWSNSTGKLSHWPVKGSFEYYAVLAHRDRLLRGRQLSGINAP-I 84

Query: 394  TFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCDCKSCIKALSTNNG 573
            +FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDLFW+PCDC  C     T   
Sbjct: 85   SFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCNKCAPTEGTTYA 144

Query: 574  SKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDV 753
            S  +L+IY P      KKV CNSSLC  + +C GT + CPY V Y+S  TS+SG+LVEDV
Sbjct: 145  SDFELSIYDPKGSSTSKKVTCNSSLCALRNQCLGTFSNCPYMVSYMSAQTSTSGVLVEDV 204

Query: 754  LHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLT 933
            LHLTT+  D H  +V A +TFGCGQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  GLT
Sbjct: 205  LHLTTE--DGHPELVKAYVTFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSILSQEGLT 262

Query: 934  ADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSA 1113
            ADSFSMCFG +G+GRISFGDKGSPDQEETPFNLN   PTYNI++ Q+ +GT L D DF+A
Sbjct: 263  ADSFSMCFGHDGIGRISFGDKGSPDQEETPFNLNPSRPTYNITITQIRVGTTLIDDDFTA 322

Query: 1114 IFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTL 1293
            +FDSGTSFTYL DP YS LSE+F+SQ +D+R PPD+RIPFEYCYD+  DAN++L P+++L
Sbjct: 323  LFDSGTSFTYLVDPTYSNLSENFHSQAQDRRRPPDSRIPFEYCYDMSPDANASLIPSMSL 382

Query: 1294 IMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRW 1473
             MKG SQFPV+DP++ IS + + LVYCL VVKS ++NIIGQNFMTG RVVFDRE+ VL W
Sbjct: 383  TMKGESQFPVYDPIIVISTQSK-LVYCLAVVKSTELNIIGQNFMTGYRVVFDRERFVLGW 441

Query: 1474 KESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNY-TPEATKETG-NGSRTSGAPPSSDGK 1647
            K+ +CYDI+ +S                    NY TPEATK+ G N S TS A  S   +
Sbjct: 442  KKFDCYDIDETSASVVESHAASAPPAFAVGIRNYSTPEATKDIGKNNSHTSFALRSCHFQ 501

Query: 1648 SSQLKSICFTFLMFYL 1695
             S L  + F  ++  L
Sbjct: 502  VSPLSCLGFVSILSLL 517


>ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
            gi|297739017|emb|CBI28369.3| unnamed protein product
            [Vitis vinifera]
          Length = 518

 Score =  558 bits (1437), Expect = e-156
 Identities = 293/523 (56%), Positives = 362/523 (69%), Gaps = 9/523 (1%)
 Frame = +1

Query: 163  SAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK----GILN---VDELPKMGSLEY 321
            S F++++L I  + S  CH  R F F +HHR+S+ VK    G  N       P  GS EY
Sbjct: 6    SVFIVILLSILGFRS--CHA-RIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEY 62

Query: 322  YAAMAHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALD 501
            YA +AHRDR +RGR L+ D    LTFS+GN TF I+ LGFLHY  +SLGTP   FLVALD
Sbjct: 63   YAELAHRDRALRGRRLS-DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALD 121

Query: 502  TGSDLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTP 681
            TGSDLFW+PCDC  C     T   S  +L+IY+P      +KV C++SLC H+ RC GT 
Sbjct: 122  TGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTF 181

Query: 682  TKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAA 861
            + CPY V Y+S  TS+SGILVEDVLHLTT+  DN    V+A +TFGCGQ QTGSFLD AA
Sbjct: 182  SNCPYMVSYVSAETSTSGILVEDVLHLTTE--DNRQEFVEAYVTFGCGQVQTGSFLDIAA 239

Query: 862  PNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQL 1041
            PNGLFGLG++K SVPSILS  G TADSFSMCFGP+G+GRISFGDKGSPDQEETPFNLN L
Sbjct: 240  PNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNAL 299

Query: 1042 HPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDT 1221
            HPTYNI+V Q+ +GT L D+DF+A+FDSGTSFTYL DP Y+ + +SF+SQ +D R PPD+
Sbjct: 300  HPTYNITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDS 359

Query: 1222 RIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDV 1401
            RIPFE+CYD+    N++L P+++L MKGGSQFPV+DP++ IS + E L+YC+ VV+S ++
Sbjct: 360  RIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIIISSQSE-LIYCMAVVRSAEL 418

Query: 1402 NIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTP 1581
            NIIGQNFMTG R++FDREK VL WKE  C DIE SS                    N T 
Sbjct: 419  NIIGQNFMTGYRIIFDREKLVLGWKEFECDDIENSS-VPIRPRATSVPPAVAVGVGNDTT 477

Query: 1582 EATKETGN--GSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 1704
            ++T++T N   SR S A P     + +L   CF  L   L +L
Sbjct: 478  KSTRDTRNFSQSRNSVASPLFHRITPEL--TCFILLFILLLLL 518


>ref|XP_006467009.1| PREDICTED: aspartic proteinase-like protein 1-like [Citrus sinensis]
          Length = 517

 Score =  557 bits (1436), Expect = e-156
 Identities = 277/451 (61%), Positives = 339/451 (75%), Gaps = 5/451 (1%)
 Frame = +1

Query: 172  VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRI 351
            V ++L++ +  +  C GF TFGFD HHRYSD VKGIL VD+LPK GS  YY+A+AHRDR 
Sbjct: 10   VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69

Query: 352  --IRGRGLA--GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 519
              +RGRGLA  G+    LTFS GN T+ +N LGFLHY N+S+G P+L F+VALDTGSDLF
Sbjct: 70   FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129

Query: 520  WIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQ 699
            W+PCDC SC+  L++++G  +D NIYSP       KVPCNS+LC+ Q +C    + CPYQ
Sbjct: 130  WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189

Query: 700  VIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFG 879
            V YLS+GT S+G LVEDVLHL TD  +  S  VD+ I+FGCG+ QTGSFLDGAAPNGLFG
Sbjct: 190  VRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247

Query: 880  LGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNI 1059
            LGMDK+SVPSIL++ GL  +SFSMCFG +G GRISFGDKGSP Q ETPF+L Q HPTYNI
Sbjct: 248  LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307

Query: 1060 SVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEY 1239
            ++ Q+S+G N  + +FSAIFDSGTSFTYLNDPAY+ +SE+FNS  K+KR    + +PFEY
Sbjct: 308  TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367

Query: 1240 CYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIE-KELLVYCLGVVKSPDVNIIGQ 1416
            CY L  +  +   P V L MKGG  F V DP+V +S E K L +YCLGVVKS +VNIIGQ
Sbjct: 368  CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGQ 427

Query: 1417 NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 1509
            NFMTG  +VFDREK+VL WK S+CY +  SS
Sbjct: 428  NFMTGYNIVFDREKNVLGWKASDCYGVNNSS 458


>ref|XP_006425380.1| hypothetical protein CICLE_v10025374mg [Citrus clementina]
            gi|557527370|gb|ESR38620.1| hypothetical protein
            CICLE_v10025374mg [Citrus clementina]
          Length = 517

 Score =  557 bits (1435), Expect = e-156
 Identities = 277/451 (61%), Positives = 340/451 (75%), Gaps = 5/451 (1%)
 Frame = +1

Query: 172  VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRI 351
            V ++L++ +  +  C GF TFGFD HHRYSD VKGIL VD+LPK GS  YY+A+AHRDR 
Sbjct: 10   VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69

Query: 352  --IRGRGLA--GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 519
              +RGRGLA  G+    LTFS GN T+ +N LGFLHYAN+S+G P+L F+VALDTGSDLF
Sbjct: 70   FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYANVSVGQPALSFIVALDTGSDLF 129

Query: 520  WIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQ 699
            W+PCDC SC+  L++++G  +D NIYSP       KVPCNS+LC+ Q +C    + CPYQ
Sbjct: 130  WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189

Query: 700  VIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFG 879
            V YLS+GT S+G LVEDVLHL TD  +  S  VD+ I+FGCG+ QTGSFLDGAAPNGLFG
Sbjct: 190  VRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247

Query: 880  LGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNI 1059
            LGMDK+SVPSIL++ GL  +SFSMCFG +G GRISFGDKGSP Q ETPF+L Q HPTYNI
Sbjct: 248  LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307

Query: 1060 SVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEY 1239
            ++ Q+S+G N  + +FSAIFDSGTSFTYLN+PAY+ +SE+FNS  K+KR    + +PFEY
Sbjct: 308  TITQVSVGGNAANFEFSAIFDSGTSFTYLNNPAYTQISETFNSLAKEKRETSTSDLPFEY 367

Query: 1240 CYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIE-KELLVYCLGVVKSPDVNIIGQ 1416
            CY L  +  +   P V L MKGG  F V DP+V +S E K L +YCLGVVKS +VNIIGQ
Sbjct: 368  CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGQ 427

Query: 1417 NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 1509
            NFMTG  +VFDREK+VL WK S+CY +  SS
Sbjct: 428  NFMTGYNIVFDREKNVLGWKASDCYGVNNSS 458


>ref|XP_007204828.1| hypothetical protein PRUPE_ppa004096mg [Prunus persica]
            gi|462400359|gb|EMJ06027.1| hypothetical protein
            PRUPE_ppa004096mg [Prunus persica]
          Length = 530

 Score =  544 bits (1401), Expect = e-152
 Identities = 288/509 (56%), Positives = 352/509 (69%), Gaps = 19/509 (3%)
 Frame = +1

Query: 157  DSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK------GILN-VDELPKMGSL 315
            D   FV+L+  +      SCHG R F F +HHR+SD VK      G L+  D LP  GS 
Sbjct: 4    DLCKFVVLLFFLSILGLQSCHG-RIFSFKMHHRFSDPVKEWSAVSGKLSPADNLPAKGSF 62

Query: 316  EYYAAMAHRDRIIRGRGLA----GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLW 483
            EYY+ +A RDR +RGR LA     D    L FS+GN TF I+ LGFLHY  + LGTP + 
Sbjct: 63   EYYSELARRDRFLRGRKLAQSDQSDTTTPLAFSDGNSTFRISSLGFLHYTTVQLGTPGMK 122

Query: 484  FLVALDTGSDLFWIPCD--CKSCIKALSTNN-----GSKLDLNIYSPXXXXXXKKVPCNS 642
            F+VALDTGSDLFW+PC+    + +K    N          +++ Y P      K+V CN+
Sbjct: 123  FMVALDTGSDLFWVPCEGTAYAPVKLAERNQILSYADYDFEVSKYDPEGSSTSKRVSCNN 182

Query: 643  SLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGC 822
            SLC H+ RC G+   CPY V Y+S  TS+SGILVEDVLHL T+  D+H  +V+A +TFGC
Sbjct: 183  SLCAHRNRCMGSFNNCPYMVSYVSAETSTSGILVEDVLHLKTE--DSHRELVEAYVTFGC 240

Query: 823  GQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGS 1002
            GQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  G TADSFSMCFG +GVGRI+FGDKGS
Sbjct: 241  GQVQSGSFLDVAAPNGLFGLGMEKISVPSILSREGFTADSFSMCFGHDGVGRINFGDKGS 300

Query: 1003 PDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESF 1182
            PDQEETPFN+N  HPTYNISV Q+ +GT+L DIDF+A+FDSGTSFTYL DP Y+ LSESF
Sbjct: 301  PDQEETPFNVNPSHPTYNISVTQIRVGTDLMDIDFTALFDSGTSFTYLGDPTYTRLSESF 360

Query: 1183 NSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKEL 1362
            NS  +DKR PPD RIPFEYCYD+ SDAN++  P+++L MKGGSQF V+DP++ IS + E 
Sbjct: 361  NSLARDKRRPPDPRIPFEYCYDMSSDANASFIPSLSLTMKGGSQFAVYDPIIVISTQSE- 419

Query: 1363 LVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEG-SSXXXXXXXXXX 1539
            LVYCL VVKS  +NIIGQN+MTG  VVFDREK VL WK+ +CYD+E  +S          
Sbjct: 420  LVYCLAVVKSTQLNIIGQNYMTGYNVVFDREKFVLGWKKFDCYDVENHTSLPFKPNSTNV 479

Query: 1540 XXXXXXXXXXNYTPEATKETGNGSRTSGA 1626
                      + TPE+TK+T N S+TS A
Sbjct: 480  PPAVAVGLGHHSTPESTKKTRN-SQTSAA 507


>ref|NP_849967.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 513

 Score =  543 bits (1400), Expect = e-152
 Identities = 275/514 (53%), Positives = 348/514 (67%), Gaps = 1/514 (0%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            LL+LL  +W    C GF  FGF+ HHR+SD V G+L  D LP   S +YY  MAHRDR+I
Sbjct: 14   LLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 355  RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 534
            RGR LA +    +TFS+GN+T  ++ LGFLHYAN+++GTPS WF+VALDTGSDLFW+PCD
Sbjct: 74   RGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCD 133

Query: 535  CKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 714
            C +C++ L    GS LDLNIYSP       KVPCNS+LC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLS 193

Query: 715  NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 894
            NGTSS+G+LVEDVLHL +  +D  S  + A +TFGCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 895  SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1074
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1075 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 1251
            S+G N  D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 1252 RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 1431
              + +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  D++IIGQNFMTG
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAIMKIEDISIIGQNFMTG 430

Query: 1432 KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNGS 1611
             RVVFDREK +L WKES+CY  E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCYTGETSA---RTLPSNRSSSSARPPASSFDPEATNIPSQRP 487

Query: 1612 RTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 1713
             TS         +S   S+  +  +F+  ILA++
Sbjct: 488  NTS--------TTSAAYSLSISLSLFFFSILAIL 513


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  540 bits (1392), Expect = e-151
 Identities = 274/514 (53%), Positives = 346/514 (67%), Gaps = 1/514 (0%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            L++LL  +W    C GF  FGF+ HHR+SD V G+L  D LP   S +YY  MAHRDR+I
Sbjct: 14   LIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 355  RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 534
            RGR LA +    +TFS+GN+T  ++ LGFLHYAN+++GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCD 133

Query: 535  CKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 714
            C +C++ L    GS LDLNIYSP       KVPCNS+LC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLS 193

Query: 715  NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 894
            NGTSS+G+LVEDVLHL +  +D  S  + A +T GCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 895  SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1074
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1075 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 1251
            S+  N  D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 1252 RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 1431
              + +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  D++IIGQNFMTG
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAILKIEDISIIGQNFMTG 430

Query: 1432 KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNGS 1611
             RVVFDREK +L WKES+CY  E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCYTGETSA---RTLPSNRSSSSARPPASSFDPEATNIPSQRP 487

Query: 1612 RTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 1713
             TS         SS   S+  +  +F+  ILA++
Sbjct: 488  NTS--------TSSAAYSLSISLSLFFFSILAIL 513


>ref|XP_006574660.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 524

 Score =  539 bits (1389), Expect = e-150
 Identities = 282/530 (53%), Positives = 365/530 (68%), Gaps = 13/530 (2%)
 Frame = +1

Query: 154  MDSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK-----GILNVDELPKMGSLE 318
            M +S F+++ LL   W    CHG   + F +HHR+S+ V+         +   P+ G++E
Sbjct: 1    MLASVFIIVSLLSL-WECCQCHG-HVYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVE 58

Query: 319  YYAAMAHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVAL 498
            YYA +A RDR++RGR L+   A  L FS+GN TF I+ LGFLHY  + +GTP + F+VAL
Sbjct: 59   YYAELADRDRLLRGRKLSQIDAG-LAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVAL 117

Query: 499  DTGSDLFWIPCDCKSCIKALSTNNGSKL-----DLNIYSPXXXXXXKKVPCNSSLCKHQG 663
            DTGSDLFW+PCDC  C  + ST   S L     DLN+Y+P      KKV CN+SLC H+ 
Sbjct: 118  DTGSDLFWVPCDCTRCAASDSTAFASALATQDFDLNVYNPNGSSTSKKVTCNNSLCTHRS 177

Query: 664  RCSGTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGS 843
            +C GT + CPY V Y+S  TS+SGILVEDVLHLT +  DNH  +V+A + FGCGQ Q+GS
Sbjct: 178  QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQE--DNHHDLVEANVIFGCGQIQSGS 235

Query: 844  FLDGAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETP 1023
            FLD AAPNGLFGLGM+K SVPS+LS  G TADSFSMCFG +G+GRISFGDKGS DQ+ETP
Sbjct: 236  FLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETP 295

Query: 1024 FNLNQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDK 1203
            FNLN  HPTYNI+V Q+ +GT + D++F+A+FDSGTSFTYL DP Y+ L+ESF+SQ +D+
Sbjct: 296  FNLNPSHPTYNITVTQVRVGTTVIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDR 355

Query: 1204 RHPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGV 1383
            RH  D+RIPFEYCYD+  DAN++L P+V+L M GGS F V+DP++ IS + E LVYCL V
Sbjct: 356  RHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE-LVYCLAV 414

Query: 1384 VKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIE--GSSXXXXXXXXXXXXXXXX 1557
            VKS ++NIIGQNFMTG RVVFDREK VL WK+ +CYDIE    +                
Sbjct: 415  VKSAELNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPRSHADVPPAVA 474

Query: 1558 XXXXNY-TPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 1704
                NY   ++T+++   S+ S A PSS    S L +     ++ +++IL
Sbjct: 475  AGLGNYPATDSTRKSKYNSQRSIASPSSHCSHSSLPTFLGFLVLCFVYIL 524


>ref|XP_006299902.1| hypothetical protein CARUB_v10016111mg [Capsella rubella]
            gi|482568611|gb|EOA32800.1| hypothetical protein
            CARUB_v10016111mg [Capsella rubella]
          Length = 513

 Score =  539 bits (1389), Expect = e-150
 Identities = 277/512 (54%), Positives = 345/512 (67%), Gaps = 1/512 (0%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            L++LL  +W    C GF  FGF+ HHR+SD V   L  D LP   S +YY  MAHRDR+I
Sbjct: 14   LILLLSSSWVLDRCEGFGEFGFEFHHRFSDQVVRALPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 355  RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 534
            RGR LA      +TFS+GN+T  ++ LGFLHYAN+S+GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLASVDQSLVTFSDGNETVRVDALGFLHYANVSIGTPSDWFLVALDTGSDLFWLPCD 133

Query: 535  CKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 714
            C +C++ L    GS L+LNIYSP       KVPCNSSLC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLELNIYSPNVSSTSSKVPCNSSLCTRGDRCASPQSNCPYQIRYLS 193

Query: 715  NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 894
            NGTSS+G+LVEDVLHL +  +D  S  + A +T GCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKTIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 895  SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1074
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGTDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1075 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 1251
            S+G N+ D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVGGNVGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSDLPFEYCYAL 371

Query: 1252 RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 1431
             ++ +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  +++IIGQNFMTG
Sbjct: 372  SANKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAIMKIEEISIIGQNFMTG 430

Query: 1432 KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNGS 1611
             RVVFDREK VL WKES+CY  E S+                    +Y PEAT      S
Sbjct: 431  YRVVFDREKLVLGWKESDCYTGETSA---RTLPSNRSSASARPPTSSYEPEATNIPSQRS 487

Query: 1612 RTSGAPPSSDGKSSQLKSICFTFLMFYLFILA 1707
             TS         +S   SI  +  +F+  ILA
Sbjct: 488  NTS---------TSSAYSISISLSLFFFSILA 510


>ref|XP_004232517.1| PREDICTED: aspartic proteinase-like protein 1-like [Solanum
            lycopersicum]
          Length = 539

 Score =  539 bits (1389), Expect = e-150
 Identities = 273/528 (51%), Positives = 350/528 (66%), Gaps = 17/528 (3%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            ++ L I  +   +  GF TFGFDIHHRYSD VKGIL++  LP+ G++EYY+A   RDR +
Sbjct: 15   IIFLAILGYQLKTTDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGTVEYYSAWTQRDRFV 74

Query: 355  RGRGLAG--DKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIP 528
            +GR LA   +    L FS GN+T  ++ LGFLHYAN+++G+P L FLVALDTGSDLFW+P
Sbjct: 75   KGRRLADTTNPTVPLAFSGGNETLRLSSLGFLHYANVTVGSPGLSFLVALDTGSDLFWLP 134

Query: 529  CDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIY 708
            CDC +C++AL T +G +++LNIYSP      + VPCN +LC    RC  +   C Y V Y
Sbjct: 135  CDCSNCVRALQTRSGGRINLNIYSPNTSSTSEIVPCNGTLCGQNRRCLASQNACAYGVAY 194

Query: 709  LSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGM 888
            LSN TSSSG+LVED+LHL T+     S  ++API  GCG  QTG+FL GAAPNGLFGLG+
Sbjct: 195  LSNNTSSSGVLVEDILHLETNNAQQKS--IEAPIALGCGIRQTGAFLTGAAPNGLFGLGI 252

Query: 889  DKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVI 1068
            +  SVPS+L+S GL A+SFSMCFGP+G+GRI FGDKGSP Q ETP NL+Q HPTYNIS+ 
Sbjct: 253  ENISVPSMLASKGLAANSFSMCFGPDGIGRIVFGDKGSPGQGETPLNLDQPHPTYNISLT 312

Query: 1069 QLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYD 1248
             +++G+ +TD+DF+AIFDSGTSFTYLNDP Y  ++E+F+S+ K  R  PD  IPFEYCY 
Sbjct: 313  GITVGSKITDLDFTAIFDSGTSFTYLNDPVYKVITENFDSEAKQPRIQPDGTIPFEYCYG 372

Query: 1249 LRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMT 1428
            + ++  +   P+V L MKGG+QF +FDP++ +S+      YCL VVKS DVNIIGQNFMT
Sbjct: 373  ISANQTTFEVPDVNLTMKGGNQFYLFDPIIMLSLPDGSGAYCLAVVKSGDVNIIGQNFMT 432

Query: 1429 GKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNG 1608
            G  V+FDREK VL WK S+CYD   S+                    +  PEATK  GN 
Sbjct: 433  GYHVIFDREKMVLGWKASDCYDSGESNDRSTTLPVNKRNSTEAPSPASVVPEATK--GNA 490

Query: 1609 SRTSGAPPSSDGKSS---------------QLKSICFTFLMFYLFILA 1707
            S    A       SS               QL    F+F  +YL I++
Sbjct: 491  SANEPATSFPSVPSSRPAGNHAPHLNSFYYQLMMAIFSFFNYYLIIIS 538


>ref|XP_007203645.1| hypothetical protein PRUPE_ppa004265mg [Prunus persica]
            gi|462399176|gb|EMJ04844.1| hypothetical protein
            PRUPE_ppa004265mg [Prunus persica]
          Length = 519

 Score =  538 bits (1386), Expect = e-150
 Identities = 281/527 (53%), Positives = 358/527 (67%), Gaps = 8/527 (1%)
 Frame = +1

Query: 148  QSMDSSAF----VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSL 315
            ++  SS+F    +L+ + I  W+S +C GF ++GFDIHHR+SD VK IL  DELP+ GS 
Sbjct: 7    RASSSSSFTATRLLVSVFILGWASRTCSGFGSYGFDIHHRFSDPVKAILGSDELPEKGSA 66

Query: 316  EYYAAMAHRDRIIRGRGLA-GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLV 492
            EYYAAMAHRDR+IRGR L+  D+   LTF  GN+T+ I   G LHYAN+S+GTPS  +LV
Sbjct: 67   EYYAAMAHRDRLIRGRHLSTADETTPLTFVYGNETYQIGAFGHLHYANVSVGTPSTSYLV 126

Query: 493  ALDTGSDLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCS 672
            ALDTGSDL W+PCDC SC++ L  +NG      IYSP      KKV CNS+ C+    C+
Sbjct: 127  ALDTGSDLLWLPCDCSSCVRGLKFSNGVVKKFEIYSPNTSSTSKKVSCNSTYCEQPQHCA 186

Query: 673  GTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLD 852
               + C Y++ YLSN TSS+G+LVEDVLHLTTD  D     V+A I FGCG++QTG FLD
Sbjct: 187  SAASDCHYKIEYLSNDTSSTGVLVEDVLHLTTD--DAKQKDVNAQIGFGCGKEQTGIFLD 244

Query: 853  GAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNL 1032
            GAAPNGL GLGMD  S+PSIL+S GL ++SFSMCFG +G GRISFGD GS DQ ETPFNL
Sbjct: 245  GAAPNGLLGLGMDDVSIPSILASQGLASNSFSMCFGLDGSGRISFGDNGSLDQAETPFNL 304

Query: 1033 --NQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKR 1206
               + +PTYNI++ QL++G ++TD++F AIFDSGTSFTYLNDPAY+ ++E+FNS  K+K+
Sbjct: 305  KNGRAYPTYNITITQLAIGESVTDLEFYAIFDSGTSFTYLNDPAYTQITENFNSALKNKQ 364

Query: 1207 HPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPL-VFISIEKELLVYCLGV 1383
               D+ IPFEYCYD+    N T    V L +KGG Q+P+ DPL VF + +   ++YCLG+
Sbjct: 365  RSKDSSIPFEYCYDI--SPNQT----VNLTLKGGKQYPLLDPLVVFANEDGTPMLYCLGI 418

Query: 1384 VKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXX 1563
            VKS DVNIIGQNFMTG RV+FDRE+ VL WKESNCY++E +                   
Sbjct: 419  VKSGDVNIIGQNFMTGYRVIFDRERMVLGWKESNCYNVEDT----VTLPVTKSKSPAASP 474

Query: 1564 XXNYTPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 1704
                 PEAT  + N   TS  PPS+        +   T ++F  F +
Sbjct: 475  SSTINPEATAGSTN---TSHIPPSNHSPKLNSFACALTMVLFACFAI 518


>ref|XP_006340776.1| PREDICTED: aspartic proteinase-like protein 1-like [Solanum
            tuberosum]
          Length = 539

 Score =  537 bits (1384), Expect = e-150
 Identities = 270/526 (51%), Positives = 345/526 (65%), Gaps = 15/526 (2%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            +L L I  +      GF TFGFDIHHRYSD VKGIL++  LP+ GS+EYY+A   RDR +
Sbjct: 15   ILFLAILGYQLQRTDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGSVEYYSAWTQRDRFV 74

Query: 355  RGRGLAG--DKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIP 528
            +GR LA   +    LTFS GN+T  ++ LGFLHYAN+++G+P L FLVALDTGSDLFW+P
Sbjct: 75   KGRRLADTTNPTVPLTFSGGNETLQLSSLGFLHYANVTVGSPGLSFLVALDTGSDLFWLP 134

Query: 529  CDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIY 708
            CDC +C++AL T NG + +LNIYSP      + VPCN +LC    RC  +   C Y V Y
Sbjct: 135  CDCSNCVRALQTRNGGRRNLNIYSPNTSSTSEVVPCNGTLCGQNRRCLASQNACAYGVAY 194

Query: 709  LSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGM 888
            LSN TSSSG+LVED+LHL T+     S  ++API  GCG  QTG+FL GAAPNGLFGLG+
Sbjct: 195  LSNNTSSSGVLVEDILHLETNNAQQKS--IEAPIALGCGIRQTGAFLTGAAPNGLFGLGI 252

Query: 889  DKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVI 1068
            +  SVPS+L+S GL A+SFS CFGP+G+GRI FGDKGSP Q ETP NL+Q HPTYNIS+ 
Sbjct: 253  ENISVPSMLASKGLAANSFSTCFGPDGIGRIVFGDKGSPGQGETPLNLDQPHPTYNISLT 312

Query: 1069 QLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYD 1248
             +++G+ +TD+DF+AIFDSGTSFTYLNDP Y  ++E+F+S+ K  R  PD  IPFEYCY 
Sbjct: 313  GITVGSKITDLDFTAIFDSGTSFTYLNDPVYKVITENFDSEAKQPRIQPDGEIPFEYCYG 372

Query: 1249 LRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMT 1428
            + ++      P V L MKGG+Q  +FDP++ +S+      YCL VVKS DVNIIGQNFMT
Sbjct: 373  ISANQTRFEVPEVNLTMKGGNQLYLFDPIIMLSLPDGSGAYCLAVVKSGDVNIIGQNFMT 432

Query: 1429 GKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNG 1608
            G RV+FDREK VL WK S+CYD   S+                    +  PEATK   + 
Sbjct: 433  GYRVIFDREKMVLGWKASDCYDSGESNDKSTTLPVNKHNSTEAPSPASVVPEATKGNASA 492

Query: 1609 SRTSGAPPSSDGKS-------------SQLKSICFTFLMFYLFILA 1707
            +  + + PS                   Q     F+F  ++L I++
Sbjct: 493  NEPATSLPSVPSSRPAGNHAPHLNSFYCQFMMAIFSFFSYFLIIIS 538


>gb|EXC35303.1| Aspartic proteinase-like protein 1 [Morus notabilis]
          Length = 497

 Score =  536 bits (1382), Expect = e-149
 Identities = 267/446 (59%), Positives = 327/446 (73%), Gaps = 9/446 (2%)
 Frame = +1

Query: 190  IFAWSSLSCHG--------FRTFGFDIHHRYSDSVKGILNVD-ELPKMGSLEYYAAMAHR 342
            +F +S L C G        FR F F +HHR+SD VK   +   ++P+ GS EYYA +A R
Sbjct: 18   LFFFSVLICGGGCPGRIFSFRIFSFQMHHRFSDPVKRWSSAAADVPEKGSFEYYAHLADR 77

Query: 343  DRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFW 522
            DR++RGR L+      L FS+GN T  I  LGFLHY  + LGTP   F+VALDTGSDLFW
Sbjct: 78   DRLLRGRKLSELGNSPLAFSDGNSTVRITSLGFLHYTTVKLGTPGTTFMVALDTGSDLFW 137

Query: 523  IPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQV 702
            +PCDC  C    +T+      L++Y P      KKV CN SLC H+ RC GT + CPY V
Sbjct: 138  VPCDCSRCAPTDATSYAPDFQLSMYDPKGSSTSKKVTCNDSLCVHRSRCLGTFSSCPYMV 197

Query: 703  IYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGL 882
             Y+S  TS+SGIL+EDVLHL  +  D H  +V+A +TFGCGQ Q+GSFLD AAPNGLFGL
Sbjct: 198  SYVSAETSTSGILIEDVLHLKKE--DKHEELVEAYVTFGCGQVQSGSFLDVAAPNGLFGL 255

Query: 883  GMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNIS 1062
            GM+K SVPSILS  G  A+SFSMCFG +G+GRISFGDKGSPDQ+ETPFNLN  HPTYNI+
Sbjct: 256  GMEKISVPSILSKEGFIANSFSMCFGQDGIGRISFGDKGSPDQDETPFNLNPSHPTYNIT 315

Query: 1063 VIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYC 1242
            V Q+ +GT+L D DFSA+FDSGTSFTYL +P Y+ LSESF+SQ +D R P D+RIPFEYC
Sbjct: 316  VTQIRVGTSLFDADFSALFDSGTSFTYLVEPIYTRLSESFHSQVQDSRRPTDSRIPFEYC 375

Query: 1243 YDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNF 1422
            YD+  +ANS+L P+++L MKGG QF V+DP++ IS + E +VYCL VVKS ++NIIGQNF
Sbjct: 376  YDMSPEANSSLIPSLSLTMKGGCQFAVYDPIIVISTQNE-IVYCLAVVKSAELNIIGQNF 434

Query: 1423 MTGKRVVFDREKSVLRWKESNCYDIE 1500
            MTG RVVFDREK VL WK+S+CYDIE
Sbjct: 435  MTGYRVVFDREKLVLGWKKSDCYDIE 460


>ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223525947|gb|EEF28344.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 518

 Score =  536 bits (1380), Expect = e-149
 Identities = 267/451 (59%), Positives = 333/451 (73%), Gaps = 4/451 (0%)
 Frame = +1

Query: 169  FVLLVLLIFAWS-SLSCHGFRTFGFDIHHRYSDSVKGILNVD---ELPKMGSLEYYAAMA 336
            F++  LL+  W    +C G R F F +HHR+SD +K + +       P  GS EYYA +A
Sbjct: 7    FLVFSLLLSVWVFPQNCKG-RIFTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAELA 65

Query: 337  HRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDL 516
            HRD+++RGR L   +A  L FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDL
Sbjct: 66   HRDQMLRGRKLYNVEAP-LAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDL 124

Query: 517  FWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPY 696
            FW+PCDC  C         S  +L+IY P      KKV CN++LC H+ RC GT + CPY
Sbjct: 125  FWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPY 184

Query: 697  QVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLF 876
             V Y+S  TS+SGILVEDVLHLT++  D++   + A +TFGCGQ Q+GSFL+ AAPNGLF
Sbjct: 185  MVSYVSAQTSTSGILVEDVLHLTSE--DSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLF 242

Query: 877  GLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYN 1056
            GLGMD+ SVPSILS  GLTADSFSMCFG +GVGRISFGDKGSPDQEETPFN N  HP+YN
Sbjct: 243  GLGMDQISVPSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNSNPSHPSYN 302

Query: 1057 ISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFE 1236
            ISV Q+ +GT L D+DF+A+FDSGTSFTYL +P Y+ +SE+F++Q +DKR PPD RIPFE
Sbjct: 303  ISVTQVRVGTTLVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFE 362

Query: 1237 YCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQ 1416
            YCYD+   ANS+L P+++L MKG   F VFDP++ I+ + E LVYCL +VKS ++NIIGQ
Sbjct: 363  YCYDMSPGANSSLIPSMSLTMKGRGHFTVFDPIIVITTQNE-LVYCLAIVKSTELNIIGQ 421

Query: 1417 NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 1509
            NFMTG RVVFDREK VL WKE++CYD E +S
Sbjct: 422  NFMTGYRVVFDREKLVLGWKETDCYDQEYNS 452


>ref|XP_006409248.1| hypothetical protein EUTSA_v10022648mg [Eutrema salsugineum]
            gi|312282765|dbj|BAJ34248.1| unnamed protein product
            [Thellungiella halophila] gi|557110410|gb|ESQ50701.1|
            hypothetical protein EUTSA_v10022648mg [Eutrema
            salsugineum]
          Length = 515

 Score =  535 bits (1377), Expect = e-149
 Identities = 270/515 (52%), Positives = 346/515 (67%), Gaps = 2/515 (0%)
 Frame = +1

Query: 175  LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMAHRDRII 354
            L+++L+ +W    C G   FGF+ HHR+SD V G+L  D LP   S +YY  MAHRDR+I
Sbjct: 14   LILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 355  RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 534
            RGR LA +    +TF++GN+T  +N LGFLHYAN+++GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCD 133

Query: 535  CKS-CIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYL 711
            C + C++ L    GS LDLNIYSP       KVPCNS+LC    RC+   + CPYQ+ YL
Sbjct: 134  CSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYL 193

Query: 712  SNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMD 891
            SNGTSS+G+LVEDVLHL +   + +S  + A IT GCG  QTG F DGAAPNGLFGLG++
Sbjct: 194  SNGTSSTGVLVEDVLHLVS--MEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 892  KSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQ 1071
              SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYN++V Q
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 1072 LSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDL 1251
            +S+G N  D++F A+FD+GTSFTYL D  Y+ +SESFNS   DKR+  D+ +PFEYCY +
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 1252 RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 1431
              +  S   P+V L MKGGS +PV+ PL+ + IE + +VYCL ++KS D++IIGQNFMTG
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIE-DTVVYCLAIMKSEDISIIGQNFMTG 430

Query: 1432 KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNGS 1611
             RVVFDREK +L WKES+C   E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCSTGETSA---RTQPSNRSSSSARPPASSFDPEAT------- 480

Query: 1612 RTSGAPPSSDGKSSQLKSICFTFLMFYLF-ILAMV 1713
                  PSS   SS   S+  +    Y F ILA++
Sbjct: 481  NIPSQRPSSSSSSSYSYSLSLSLPFLYFFSILAIL 515


>ref|XP_006828808.1| hypothetical protein AMTR_s00001p00126200 [Amborella trichopoda]
            gi|548833787|gb|ERM96224.1| hypothetical protein
            AMTR_s00001p00126200 [Amborella trichopoda]
          Length = 522

 Score =  531 bits (1369), Expect = e-148
 Identities = 270/510 (52%), Positives = 354/510 (69%), Gaps = 6/510 (1%)
 Frame = +1

Query: 193  FAWSSLSCHGFRTFGFDIHHRYSDSVKGILNV------DELPKMGSLEYYAAMAHRDRII 354
            F +   SCH  +TFGFD+HH++S+ VK  +++      +E P+ GS +YY ++ H D  +
Sbjct: 16   FGFLFWSCHCRQTFGFDLHHKFSEPVKEWMSLRHGIGYEEWPESGSEDYYLSLVHHDHNL 75

Query: 355  RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 534
            RGRG++   A  LTF++GN TF ++ LGFLHY+ ++LGTP++ FLVALDTGSDLFW+PCD
Sbjct: 76   RGRGISEIGAP-LTFADGNTTFKLSSLGFLHYSFVTLGTPNVTFLVALDTGSDLFWVPCD 134

Query: 535  CKSCIKALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 714
            C  C   LS + G   +LNIY+       K V C++SLC+ Q  CS +   CPYQV Y+S
Sbjct: 135  CSRCAPTLSMSYGFDFELNIYNSNASSTSKHVSCSNSLCQWQSECSRSTGHCPYQVSYVS 194

Query: 715  NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 894
            + TSSSG+L+EDVL+LTTD     S VV APITFGCGQ Q+GSFLD AAPNGLFGLG++K
Sbjct: 195  DDTSSSGVLIEDVLYLTTDD----SQVVKAPITFGCGQVQSGSFLDAAAPNGLFGLGVEK 250

Query: 895  SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1074
             SVPSILS  GL  DSFSMCFG +G+GRI FGD GS DQEETPFNL+Q +PTYNIS+  +
Sbjct: 251  LSVPSILSGLGLIHDSFSMCFGQDGIGRIRFGDNGSSDQEETPFNLDQSYPTYNISITDI 310

Query: 1075 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLR 1254
             +G++     FSA+FDSGTSFTYL DP Y+ L++SF+ Q  DKRH PD+R+PFEYCY+  
Sbjct: 311  QVGSSSIKTGFSALFDSGTSFTYLADPIYTRLAKSFDIQVPDKRHQPDSRLPFEYCYNAS 370

Query: 1255 SDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGK 1434
            S+ NS + P+V+L+M+GGS+FP++DP++  S +   +VYCL VVK   +NIIGQNFMTG 
Sbjct: 371  SNVNSNI-PDVSLLMQGGSRFPIYDPIISFSTQGH-IVYCLAVVKGEGMNIIGQNFMTGL 428

Query: 1435 RVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXXNYTPEATKETGNGSR 1614
            R+VFDREK VL WK+ NCYD+E +S                    NY PE TK  GN ++
Sbjct: 429  RIVFDREKLVLGWKKFNCYDVENTS-TLDIKPPYTVPPSSSVAPDNYAPEDTKTMGNTTQ 487

Query: 1615 TSGAPPSSDGKSSQLKSICFTFLMFYLFIL 1704
             S  PP     +++L    FT  +  LF+L
Sbjct: 488  VSIPPPPPLSDAARLFVFGFTRALSPLFLL 517


>ref|XP_006599302.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  528 bits (1361), Expect = e-147
 Identities = 277/525 (52%), Positives = 361/525 (68%), Gaps = 12/525 (2%)
 Frame = +1

Query: 166  AFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKG-----ILNVDELPKMGSLEYYAA 330
            +FV ++  +F   SL CHG   + F +HHR+S+ V+         +   P+ G++EYYA 
Sbjct: 3    SFVFIIASLFL--SL-CHG-HVYTFTMHHRHSEPVRKWSHSTASGIPAPPEKGTVEYYAE 58

Query: 331  MAHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGS 510
            +A RDR++RGR L+      L FS+GN TF I+ LGFLHY  + +GTP + F+VALDTGS
Sbjct: 59   LADRDRLLRGRKLS-QIDDGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGS 117

Query: 511  DLFWIPCDCKSCI----KALSTNNGSKLDLNIYSPXXXXXXKKVPCNSSLCKHQGRCSGT 678
            DLFW+PCDC  C      A ++   S  DLN+Y+P      KKV CN+SLC H+ +C GT
Sbjct: 118  DLFWVPCDCTRCAATDSSAFASAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGT 177

Query: 679  PTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGA 858
             + CPY V Y+S  TS+SGILVEDVLHLT +  DNH  +V+A + FGCGQ Q+GSFLD A
Sbjct: 178  LSNCPYMVSYVSAETSTSGILVEDVLHLTQE--DNHHDLVEANVIFGCGQIQSGSFLDVA 235

Query: 859  APNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQ 1038
            APNGLFGLGM+K SVPS+LS  G TADSFSMCFG +G+GRISFGDKGS DQ+ETPFNLN 
Sbjct: 236  APNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNP 295

Query: 1039 LHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPD 1218
             HPTYNI+V Q+ +GT L D++F+A+FDSGTSFTYL DP Y+ L+ESF+SQ +D+RH  D
Sbjct: 296  SHPTYNITVTQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSD 355

Query: 1219 TRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPD 1398
            +RIPFEYCYD+  DAN++L P+V+L M GGS F V+DP++ IS + E LVYCL VVK+ +
Sbjct: 356  SRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE-LVYCLAVVKTAE 414

Query: 1399 VNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIE--GSSXXXXXXXXXXXXXXXXXXXXN 1572
            +NIIGQNFMTG RVVFDREK VL WK+ +CYDIE    +                    N
Sbjct: 415  LNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPHSHADVPPAVAAGLGN 474

Query: 1573 Y-TPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 1704
            Y   + T+++   S+ S A PSS    + L +     ++ +++IL
Sbjct: 475  YPATDPTRKSKYNSQRSIASPSSHYSHTSLPTFLGFLVLCFVYIL 519


Top