BLASTX nr result

ID: Akebia25_contig00000734 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00000734
         (2122 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007046609.1| Eukaryotic aspartyl protease family protein,...   583   e-163
ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor,...   565   e-158
ref|XP_007046607.1| Eukaryotic aspartyl protease family protein ...   563   e-158
ref|XP_007046606.1| Eukaryotic aspartyl protease family protein ...   560   e-156
ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein ...   556   e-155
ref|XP_006467009.1| PREDICTED: aspartic proteinase-like protein ...   556   e-155
ref|XP_006425380.1| hypothetical protein CICLE_v10025374mg [Citr...   555   e-155
ref|XP_007204828.1| hypothetical protein PRUPE_ppa004096mg [Prun...   542   e-151
ref|NP_849967.1| aspartyl protease family protein [Arabidopsis t...   542   e-151
ref|XP_004232517.1| PREDICTED: aspartic proteinase-like protein ...   541   e-151
ref|XP_006340776.1| PREDICTED: aspartic proteinase-like protein ...   539   e-150
ref|XP_002884082.1| aspartyl protease family protein [Arabidopsi...   539   e-150
ref|XP_006574660.1| PREDICTED: aspartic proteinase-like protein ...   538   e-150
ref|XP_006299902.1| hypothetical protein CARUB_v10016111mg [Caps...   538   e-150
ref|XP_007203645.1| hypothetical protein PRUPE_ppa004265mg [Prun...   536   e-149
gb|EXC35303.1| Aspartic proteinase-like protein 1 [Morus notabilis]   535   e-149
ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor,...   534   e-149
ref|XP_006409248.1| hypothetical protein EUTSA_v10022648mg [Eutr...   533   e-148
ref|XP_006828808.1| hypothetical protein AMTR_s00001p00126200 [A...   531   e-148
ref|XP_006599302.1| PREDICTED: aspartic proteinase-like protein ...   527   e-147

>ref|XP_007046609.1| Eukaryotic aspartyl protease family protein, putative isoform 1
            [Theobroma cacao] gi|508698870|gb|EOX90766.1| Eukaryotic
            aspartyl protease family protein, putative isoform 1
            [Theobroma cacao]
          Length = 519

 Score =  583 bits (1502), Expect = e-163
 Identities = 289/525 (55%), Positives = 379/525 (72%), Gaps = 5/525 (0%)
 Frame = -3

Query: 2030 MDSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAM 1851
            + S + VLL++++   +   C+GF TFGFDIHHRYSD VK  L VDELP  GSLEYY+AM
Sbjct: 4    LSSYSCVLLLVVLGLSAGSCCYGFGTFGFDIHHRYSDPVKDFLTVDELPAKGSLEYYSAM 63

Query: 1850 THRDRIIRGRGLAGDKAQF-LTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGS 1674
             HRD+II+GR LA    Q  +TF +GN+T+ ++ LGFL+YAN+S+G+P+L FLVALDTGS
Sbjct: 64   VHRDKIIKGRRLATANDQTPVTFLDGNETYRLSGLGFLYYANVSVGSPALSFLVALDTGS 123

Query: 1673 DLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKC 1494
            DLFW+PCDC SC++ LST +G  +D NIYSP     S KVPC+S +C+ Q RCS + + C
Sbjct: 124  DLFWLPCDCSSCVQGLSTADGQTIDFNIYSPNTSSTSSKVPCSSDMCEQQKRCSSSQSNC 183

Query: 1493 PYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNG 1314
            PYQ++YLSNGTSS+G+LVEDVLHLTTD  ++ +  V A ITFGCG+ QTGSFL+GAAPNG
Sbjct: 184  PYQILYLSNGTSSTGVLVEDVLHLTTD--EDKTKAVQAKITFGCGKVQTGSFLNGAAPNG 241

Query: 1313 LFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPT 1134
            LFGLGMD  SVPS L++  +T++SFSMCFG +G+GRI+FGD+GS  Q ETPFNL + HPT
Sbjct: 242  LFGLGMDNISVPSTLANENITSNSFSMCFGRDGIGRITFGDRGSSYQGETPFNLRKSHPT 301

Query: 1133 YNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTR-I 957
            YN+S+ Q+++G N  D+DFSA+FDSGTSFTYLNDPAY+++SESFN+   +KRH  D+  +
Sbjct: 302  YNVSITQINVGGNAGDLDFSAVFDSGTSFTYLNDPAYTFISESFNNMAIEKRHTSDSSDL 361

Query: 956  PFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELL---VYCLGVVKSPD 786
            PF+YCYDL ++  +   P V L MKGG  F V DP+V +S++ ++    +YCLGVVKS D
Sbjct: 362  PFDYCYDLSANQTNFTYPVVNLTMKGGDSFFVDDPIVVVSLKVKVHSGDLYCLGVVKSDD 421

Query: 785  VNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYT 606
            VNIIGQNFMTG R+VFDREK VL W  S+CYDIE  +                       
Sbjct: 422  VNIIGQNFMTGYRIVFDREKMVLGWNPSDCYDIEAKTLPVRPPTAVPPAVA-------VN 474

Query: 605  PEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 471
            PEAT   GN S  SGA P    +S ++K++ +  ++  +   A++
Sbjct: 475  PEATAGNGNTSHISGASPPMANQSPKMKTLSYALIVALIPFFALI 519


>ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
            communis] gi|223525945|gb|EEF28342.1| Aspartic proteinase
            nepenthesin-1 precursor, putative [Ricinus communis]
          Length = 533

 Score =  565 bits (1456), Expect = e-158
 Identities = 284/523 (54%), Positives = 369/523 (70%), Gaps = 6/523 (1%)
 Frame = -3

Query: 2021 SAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHR 1842
            S+F+LL++L+ + SS S +GF TFGFD+HHRYSD VKG+L+VD+LP+ GSL YYA+M HR
Sbjct: 19   SSFLLLLVLMLSSSSFS-YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHR 77

Query: 1841 DRIIRGRGLAGDKAQF-LTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 1665
            D +I GR L  D     LTF +GN+T+  + LGFLHYAN+S+GTPSL +LVALDTGSDLF
Sbjct: 78   DILIHGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLF 137

Query: 1664 WIPCDCKS--CIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCP 1491
            W+PCDC +  C++ L   +G ++D NIY P     S+ +PCN++LC  Q RC    + CP
Sbjct: 138  WLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCP 197

Query: 1490 YQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGL 1311
            YQV YLSNGTSS+G+LVED+LHLTTD  D  S  +DA I FGCG+ QTGSFLDGAAPNGL
Sbjct: 198  YQVQYLSNGTSSTGVLVEDLLHLTTD--DAQSRALDAKIIFGCGRVQTGSFLDGAAPNGL 255

Query: 1310 FGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTY 1131
            FGLGM   SVPS L+  G T++SFSMCFG +G+GRISFGD GS  Q ETPFNL QLHPTY
Sbjct: 256  FGLGMTNISVPSTLAREGYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTY 315

Query: 1130 NISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPF 951
            N+S+ ++++G    D++FSAIFDSGTSFTYLNDPAY+ +SESFN   K+KR+   + IPF
Sbjct: 316  NVSITKINVGGRDADLEFSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPF 375

Query: 950  EYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIG 771
            EYCY++ S+  +   P V L+M+GGSQF V DP+V + ++    +YCL +VKS DVNIIG
Sbjct: 376  EYCYEMSSNQTNLEIPTVNLVMQGGSQFNVTDPIVIVILQGGASIYCLAIVKSGDVNIIG 435

Query: 770  QNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATK 591
            QNFMTG R+VF+RE++VL WK S+CYD   ++                       P+AT 
Sbjct: 436  QNFMTGYRIVFNRERNVLGWKASDCYDDMDTTTFPVDPISPGIPPATA-----VNPQATA 490

Query: 590  ETGNGSRTSGAPP---SSDGKSSQLKSICFTFLMFYLFILAMV 471
             +GN +  SG PP   ++  K  +L S+ F  +M  +    +V
Sbjct: 491  GSGNTTEVSGTPPPVGNNAPKLPKLNSLTFAIIMVLIPFFTIV 533


>ref|XP_007046607.1| Eukaryotic aspartyl protease family protein isoform 2 [Theobroma
            cacao] gi|508698868|gb|EOX90764.1| Eukaryotic aspartyl
            protease family protein isoform 2 [Theobroma cacao]
          Length = 519

 Score =  563 bits (1452), Expect = e-158
 Identities = 288/496 (58%), Positives = 353/496 (71%), Gaps = 6/496 (1%)
 Frame = -3

Query: 1958 RTFGFDIHHRYSDSVKGILN----VDELPKMGSLEYYAAMTHRDRIIRGRGLAGDKAQFL 1791
            R F F +HHR+S+ VK   N    +   P  GS EYYA + HRDR++RGR L+G  A  +
Sbjct: 26   RIFTFKMHHRFSEPVKNWSNSTGKLSHWPVKGSFEYYAVLAHRDRLLRGRQLSGINAP-I 84

Query: 1790 TFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCDCKSCIKALSTNNG 1611
            +FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDLFW+PCDC  C     T   
Sbjct: 85   SFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCNKCAPTEGTTYA 144

Query: 1610 SKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDV 1431
            S  +L+IY P     SKKV CNSSLC  + +C GT + CPY V Y+S  TS+SG+LVEDV
Sbjct: 145  SDFELSIYDPKGSSTSKKVTCNSSLCALRNQCLGTFSNCPYMVSYMSAQTSTSGVLVEDV 204

Query: 1430 LHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLT 1251
            LHLTT+  D H  +V A +TFGCGQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  GLT
Sbjct: 205  LHLTTE--DGHPELVKAYVTFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSILSQEGLT 262

Query: 1250 ADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSA 1071
            ADSFSMCFG +G+GRISFGDKGSPDQEETPFNLN   PTYNI++ Q+ +GT L D DF+A
Sbjct: 263  ADSFSMCFGHDGIGRISFGDKGSPDQEETPFNLNPSRPTYNITITQIRVGTTLIDDDFTA 322

Query: 1070 IFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTL 891
            +FDSGTSFTYL DP YS LSE+F+SQ +D+R PPD+RIPFEYCYD+  DAN++L P+++L
Sbjct: 323  LFDSGTSFTYLVDPTYSNLSENFHSQAQDRRRPPDSRIPFEYCYDMSPDANASLIPSMSL 382

Query: 890  IMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRW 711
             MKG SQFPV+DP++ IS ++  LVYCL VVKS ++NIIGQNFMTG RVVFDRE+ VL W
Sbjct: 383  TMKGESQFPVYDPIIVISTQQSKLVYCLAVVKSTELNIIGQNFMTGYRVVFDRERFVLGW 442

Query: 710  KESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNY-TPEATKETG-NGSRTSGAPPSSDGK 537
            K+ +CYDI+ +S                    NY TPEATK+ G N S TS A  S   +
Sbjct: 443  KKFDCYDIDETSASVVESHAASAPPAFAVGIRNYSTPEATKDIGKNNSHTSFALRSCHFQ 502

Query: 536  SSQLKSICFTFLMFYL 489
             S L  + F  ++  L
Sbjct: 503  VSPLSCLGFVSILSLL 518


>ref|XP_007046606.1| Eukaryotic aspartyl protease family protein isoform 1 [Theobroma
            cacao] gi|508698867|gb|EOX90763.1| Eukaryotic aspartyl
            protease family protein isoform 1 [Theobroma cacao]
          Length = 518

 Score =  560 bits (1442), Expect = e-156
 Identities = 288/496 (58%), Positives = 353/496 (71%), Gaps = 6/496 (1%)
 Frame = -3

Query: 1958 RTFGFDIHHRYSDSVKGILN----VDELPKMGSLEYYAAMTHRDRIIRGRGLAGDKAQFL 1791
            R F F +HHR+S+ VK   N    +   P  GS EYYA + HRDR++RGR L+G  A  +
Sbjct: 26   RIFTFKMHHRFSEPVKNWSNSTGKLSHWPVKGSFEYYAVLAHRDRLLRGRQLSGINAP-I 84

Query: 1790 TFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCDCKSCIKALSTNNG 1611
            +FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDLFW+PCDC  C     T   
Sbjct: 85   SFSDGNSTFRISSLGFLHYTTVQLGTPGVKFMVALDTGSDLFWVPCDCNKCAPTEGTTYA 144

Query: 1610 SKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDV 1431
            S  +L+IY P     SKKV CNSSLC  + +C GT + CPY V Y+S  TS+SG+LVEDV
Sbjct: 145  SDFELSIYDPKGSSTSKKVTCNSSLCALRNQCLGTFSNCPYMVSYMSAQTSTSGVLVEDV 204

Query: 1430 LHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLT 1251
            LHLTT+  D H  +V A +TFGCGQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  GLT
Sbjct: 205  LHLTTE--DGHPELVKAYVTFGCGQVQSGSFLDVAAPNGLFGLGMEKISVPSILSQEGLT 262

Query: 1250 ADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSA 1071
            ADSFSMCFG +G+GRISFGDKGSPDQEETPFNLN   PTYNI++ Q+ +GT L D DF+A
Sbjct: 263  ADSFSMCFGHDGIGRISFGDKGSPDQEETPFNLNPSRPTYNITITQIRVGTTLIDDDFTA 322

Query: 1070 IFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTL 891
            +FDSGTSFTYL DP YS LSE+F+SQ +D+R PPD+RIPFEYCYD+  DAN++L P+++L
Sbjct: 323  LFDSGTSFTYLVDPTYSNLSENFHSQAQDRRRPPDSRIPFEYCYDMSPDANASLIPSMSL 382

Query: 890  IMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRW 711
             MKG SQFPV+DP++ IS + + LVYCL VVKS ++NIIGQNFMTG RVVFDRE+ VL W
Sbjct: 383  TMKGESQFPVYDPIIVISTQSK-LVYCLAVVKSTELNIIGQNFMTGYRVVFDRERFVLGW 441

Query: 710  KESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNY-TPEATKETG-NGSRTSGAPPSSDGK 537
            K+ +CYDI+ +S                    NY TPEATK+ G N S TS A  S   +
Sbjct: 442  KKFDCYDIDETSASVVESHAASAPPAFAVGIRNYSTPEATKDIGKNNSHTSFALRSCHFQ 501

Query: 536  SSQLKSICFTFLMFYL 489
             S L  + F  ++  L
Sbjct: 502  VSPLSCLGFVSILSLL 517


>ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
            gi|297739017|emb|CBI28369.3| unnamed protein product
            [Vitis vinifera]
          Length = 518

 Score =  556 bits (1433), Expect = e-155
 Identities = 293/523 (56%), Positives = 362/523 (69%), Gaps = 9/523 (1%)
 Frame = -3

Query: 2021 SAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK----GILN---VDELPKMGSLEY 1863
            S F++++L I  + S  CH  R F F +HHR+S+ VK    G  N       P  GS EY
Sbjct: 6    SVFIVILLSILGFRS--CHA-RIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEY 62

Query: 1862 YAAMTHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALD 1683
            YA + HRDR +RGR L+ D    LTFS+GN TF I+ LGFLHY  +SLGTP   FLVALD
Sbjct: 63   YAELAHRDRALRGRRLS-DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALD 121

Query: 1682 TGSDLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTP 1503
            TGSDLFW+PCDC  C     T   S  +L+IY+P     S+KV C++SLC H+ RC GT 
Sbjct: 122  TGSDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTF 181

Query: 1502 TKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAA 1323
            + CPY V Y+S  TS+SGILVEDVLHLTT+  DN    V+A +TFGCGQ QTGSFLD AA
Sbjct: 182  SNCPYMVSYVSAETSTSGILVEDVLHLTTE--DNRQEFVEAYVTFGCGQVQTGSFLDIAA 239

Query: 1322 PNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQL 1143
            PNGLFGLG++K SVPSILS  G TADSFSMCFGP+G+GRISFGDKGSPDQEETPFNLN L
Sbjct: 240  PNGLFGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNAL 299

Query: 1142 HPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDT 963
            HPTYNI+V Q+ +GT L D+DF+A+FDSGTSFTYL DP Y+ + +SF+SQ +D R PPD+
Sbjct: 300  HPTYNITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDS 359

Query: 962  RIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDV 783
            RIPFE+CYD+    N++L P+++L MKGGSQFPV+DP++ IS + E L+YC+ VV+S ++
Sbjct: 360  RIPFEFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIIISSQSE-LIYCMAVVRSAEL 418

Query: 782  NIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTP 603
            NIIGQNFMTG R++FDREK VL WKE  C DIE SS                    N T 
Sbjct: 419  NIIGQNFMTGYRIIFDREKLVLGWKEFECDDIENSS-VPIRPRATSVPPAVAVGVGNDTT 477

Query: 602  EATKETGN--GSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 480
            ++T++T N   SR S A P     + +L   CF  L   L +L
Sbjct: 478  KSTRDTRNFSQSRNSVASPLFHRITPEL--TCFILLFILLLLL 518


>ref|XP_006467009.1| PREDICTED: aspartic proteinase-like protein 1-like [Citrus sinensis]
          Length = 517

 Score =  556 bits (1432), Expect = e-155
 Identities = 277/451 (61%), Positives = 339/451 (75%), Gaps = 5/451 (1%)
 Frame = -3

Query: 2012 VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRI 1833
            V ++L++ +  +  C GF TFGFD HHRYSD VKGIL VD+LPK GS  YY+A+ HRDR 
Sbjct: 10   VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69

Query: 1832 --IRGRGLA--GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 1665
              +RGRGLA  G+    LTFS GN T+ +N LGFLHY N+S+G P+L F+VALDTGSDLF
Sbjct: 70   FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129

Query: 1664 WIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQ 1485
            W+PCDC SC+  L++++G  +D NIYSP     S KVPCNS+LC+ Q +C    + CPYQ
Sbjct: 130  WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189

Query: 1484 VIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFG 1305
            V YLS+GT S+G LVEDVLHL TD  +  S  VD+ I+FGCG+ QTGSFLDGAAPNGLFG
Sbjct: 190  VRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247

Query: 1304 LGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNI 1125
            LGMDK+SVPSIL++ GL  +SFSMCFG +G GRISFGDKGSP Q ETPF+L Q HPTYNI
Sbjct: 248  LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307

Query: 1124 SVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEY 945
            ++ Q+S+G N  + +FSAIFDSGTSFTYLNDPAY+ +SE+FNS  K+KR    + +PFEY
Sbjct: 308  TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367

Query: 944  CYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIE-KELLVYCLGVVKSPDVNIIGQ 768
            CY L  +  +   P V L MKGG  F V DP+V +S E K L +YCLGVVKS +VNIIGQ
Sbjct: 368  CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGQ 427

Query: 767  NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 675
            NFMTG  +VFDREK+VL WK S+CY +  SS
Sbjct: 428  NFMTGYNIVFDREKNVLGWKASDCYGVNNSS 458


>ref|XP_006425380.1| hypothetical protein CICLE_v10025374mg [Citrus clementina]
            gi|557527370|gb|ESR38620.1| hypothetical protein
            CICLE_v10025374mg [Citrus clementina]
          Length = 517

 Score =  555 bits (1431), Expect = e-155
 Identities = 277/451 (61%), Positives = 340/451 (75%), Gaps = 5/451 (1%)
 Frame = -3

Query: 2012 VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRI 1833
            V ++L++ +  +  C GF TFGFD HHRYSD VKGIL VD+LPK GS  YY+A+ HRDR 
Sbjct: 10   VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69

Query: 1832 --IRGRGLA--GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLF 1665
              +RGRGLA  G+    LTFS GN T+ +N LGFLHYAN+S+G P+L F+VALDTGSDLF
Sbjct: 70   FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYANVSVGQPALSFIVALDTGSDLF 129

Query: 1664 WIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQ 1485
            W+PCDC SC+  L++++G  +D NIYSP     S KVPCNS+LC+ Q +C    + CPYQ
Sbjct: 130  WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189

Query: 1484 VIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFG 1305
            V YLS+GT S+G LVEDVLHL TD  +  S  VD+ I+FGCG+ QTGSFLDGAAPNGLFG
Sbjct: 190  VRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247

Query: 1304 LGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNI 1125
            LGMDK+SVPSIL++ GL  +SFSMCFG +G GRISFGDKGSP Q ETPF+L Q HPTYNI
Sbjct: 248  LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307

Query: 1124 SVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEY 945
            ++ Q+S+G N  + +FSAIFDSGTSFTYLN+PAY+ +SE+FNS  K+KR    + +PFEY
Sbjct: 308  TITQVSVGGNAANFEFSAIFDSGTSFTYLNNPAYTQISETFNSLAKEKRETSTSDLPFEY 367

Query: 944  CYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIE-KELLVYCLGVVKSPDVNIIGQ 768
            CY L  +  +   P V L MKGG  F V DP+V +S E K L +YCLGVVKS +VNIIGQ
Sbjct: 368  CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGQ 427

Query: 767  NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 675
            NFMTG  +VFDREK+VL WK S+CY +  SS
Sbjct: 428  NFMTGYNIVFDREKNVLGWKASDCYGVNNSS 458


>ref|XP_007204828.1| hypothetical protein PRUPE_ppa004096mg [Prunus persica]
            gi|462400359|gb|EMJ06027.1| hypothetical protein
            PRUPE_ppa004096mg [Prunus persica]
          Length = 530

 Score =  542 bits (1397), Expect = e-151
 Identities = 288/509 (56%), Positives = 352/509 (69%), Gaps = 19/509 (3%)
 Frame = -3

Query: 2027 DSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK------GILN-VDELPKMGSL 1869
            D   FV+L+  +      SCHG R F F +HHR+SD VK      G L+  D LP  GS 
Sbjct: 4    DLCKFVVLLFFLSILGLQSCHG-RIFSFKMHHRFSDPVKEWSAVSGKLSPADNLPAKGSF 62

Query: 1868 EYYAAMTHRDRIIRGRGLA----GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLW 1701
            EYY+ +  RDR +RGR LA     D    L FS+GN TF I+ LGFLHY  + LGTP + 
Sbjct: 63   EYYSELARRDRFLRGRKLAQSDQSDTTTPLAFSDGNSTFRISSLGFLHYTTVQLGTPGMK 122

Query: 1700 FLVALDTGSDLFWIPCD--CKSCIKALSTNN-----GSKLDLNIYSPXXXXXSKKVPCNS 1542
            F+VALDTGSDLFW+PC+    + +K    N          +++ Y P     SK+V CN+
Sbjct: 123  FMVALDTGSDLFWVPCEGTAYAPVKLAERNQILSYADYDFEVSKYDPEGSSTSKRVSCNN 182

Query: 1541 SLCKHQGRCSGTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGC 1362
            SLC H+ RC G+   CPY V Y+S  TS+SGILVEDVLHL T+  D+H  +V+A +TFGC
Sbjct: 183  SLCAHRNRCMGSFNNCPYMVSYVSAETSTSGILVEDVLHLKTE--DSHRELVEAYVTFGC 240

Query: 1361 GQDQTGSFLDGAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGS 1182
            GQ Q+GSFLD AAPNGLFGLGM+K SVPSILS  G TADSFSMCFG +GVGRI+FGDKGS
Sbjct: 241  GQVQSGSFLDVAAPNGLFGLGMEKISVPSILSREGFTADSFSMCFGHDGVGRINFGDKGS 300

Query: 1181 PDQEETPFNLNQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESF 1002
            PDQEETPFN+N  HPTYNISV Q+ +GT+L DIDF+A+FDSGTSFTYL DP Y+ LSESF
Sbjct: 301  PDQEETPFNVNPSHPTYNISVTQIRVGTDLMDIDFTALFDSGTSFTYLGDPTYTRLSESF 360

Query: 1001 NSQTKDKRHPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKEL 822
            NS  +DKR PPD RIPFEYCYD+ SDAN++  P+++L MKGGSQF V+DP++ IS + E 
Sbjct: 361  NSLARDKRRPPDPRIPFEYCYDMSSDANASFIPSLSLTMKGGSQFAVYDPIIVISTQSE- 419

Query: 821  LVYCLGVVKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEG-SSXXXXXXXXXX 645
            LVYCL VVKS  +NIIGQN+MTG  VVFDREK VL WK+ +CYD+E  +S          
Sbjct: 420  LVYCLAVVKSTQLNIIGQNYMTGYNVVFDREKFVLGWKKFDCYDVENHTSLPFKPNSTNV 479

Query: 644  XXXXXXXXXPNYTPEATKETGNGSRTSGA 558
                      + TPE+TK+T N S+TS A
Sbjct: 480  PPAVAVGLGHHSTPESTKKTRN-SQTSAA 507


>ref|NP_849967.1| aspartyl protease family protein [Arabidopsis thaliana]
            gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid
            DNA-binding protein [Arabidopsis thaliana]
            gi|22655368|gb|AAM98276.1| At2g17760/At2g17760
            [Arabidopsis thaliana] gi|330251585|gb|AEC06679.1|
            aspartyl protease family protein [Arabidopsis thaliana]
          Length = 513

 Score =  542 bits (1396), Expect = e-151
 Identities = 275/514 (53%), Positives = 348/514 (67%), Gaps = 1/514 (0%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            LL+LL  +W    C GF  FGF+ HHR+SD V G+L  D LP   S +YY  M HRDR+I
Sbjct: 14   LLILLASSWVLDRCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 1829 RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 1650
            RGR LA +    +TFS+GN+T  ++ LGFLHYAN+++GTPS WF+VALDTGSDLFW+PCD
Sbjct: 74   RGRRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCD 133

Query: 1649 CKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 1470
            C +C++ L    GS LDLNIYSP     S KVPCNS+LC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLS 193

Query: 1469 NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 1290
            NGTSS+G+LVEDVLHL +  +D  S  + A +TFGCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 1289 SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1110
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1109 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 933
            S+G N  D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 932  RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 753
              + +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  D++IIGQNFMTG
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAIMKIEDISIIGQNFMTG 430

Query: 752  KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNGS 573
             RVVFDREK +L WKES+CY  E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCYTGETSA---RTLPSNRSSSSARPPASSFDPEATNIPSQRP 487

Query: 572  RTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 471
             TS         +S   S+  +  +F+  ILA++
Sbjct: 488  NTS--------TTSAAYSLSISLSLFFFSILAIL 513


>ref|XP_004232517.1| PREDICTED: aspartic proteinase-like protein 1-like [Solanum
            lycopersicum]
          Length = 539

 Score =  541 bits (1394), Expect = e-151
 Identities = 275/528 (52%), Positives = 352/528 (66%), Gaps = 17/528 (3%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            ++ L I  +   +  GF TFGFDIHHRYSD VKGIL++  LP+ G++EYY+A T RDR +
Sbjct: 15   IIFLAILGYQLKTTDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGTVEYYSAWTQRDRFV 74

Query: 1829 RGRGLAG--DKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIP 1656
            +GR LA   +    L FS GN+T  ++ LGFLHYAN+++G+P L FLVALDTGSDLFW+P
Sbjct: 75   KGRRLADTTNPTVPLAFSGGNETLRLSSLGFLHYANVTVGSPGLSFLVALDTGSDLFWLP 134

Query: 1655 CDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIY 1476
            CDC +C++AL T +G +++LNIYSP     S+ VPCN +LC    RC  +   C Y V Y
Sbjct: 135  CDCSNCVRALQTRSGGRINLNIYSPNTSSTSEIVPCNGTLCGQNRRCLASQNACAYGVAY 194

Query: 1475 LSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGM 1296
            LSN TSSSG+LVED+LHL T+     S  ++API  GCG  QTG+FL GAAPNGLFGLG+
Sbjct: 195  LSNNTSSSGVLVEDILHLETNNAQQKS--IEAPIALGCGIRQTGAFLTGAAPNGLFGLGI 252

Query: 1295 DKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVI 1116
            +  SVPS+L+S GL A+SFSMCFGP+G+GRI FGDKGSP Q ETP NL+Q HPTYNIS+ 
Sbjct: 253  ENISVPSMLASKGLAANSFSMCFGPDGIGRIVFGDKGSPGQGETPLNLDQPHPTYNISLT 312

Query: 1115 QLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYD 936
             +++G+ +TD+DF+AIFDSGTSFTYLNDP Y  ++E+F+S+ K  R  PD  IPFEYCY 
Sbjct: 313  GITVGSKITDLDFTAIFDSGTSFTYLNDPVYKVITENFDSEAKQPRIQPDGTIPFEYCYG 372

Query: 935  LRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMT 756
            + ++  +   P+V L MKGG+QF +FDP++ +S+      YCL VVKS DVNIIGQNFMT
Sbjct: 373  ISANQTTFEVPDVNLTMKGGNQFYLFDPIIMLSLPDGSGAYCLAVVKSGDVNIIGQNFMT 432

Query: 755  GKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNG 576
            G  V+FDREK VL WK S+CYD   S+                    +  PEATK  GN 
Sbjct: 433  GYHVIFDREKMVLGWKASDCYDSGESNDRSTTLPVNKRNSTEAPSPASVVPEATK--GNA 490

Query: 575  SRTSGAPPSSDGKSS---------------QLKSICFTFLMFYLFILA 477
            S    A       SS               QL    F+F  +YL I++
Sbjct: 491  SANEPATSFPSVPSSRPAGNHAPHLNSFYYQLMMAIFSFFNYYLIIIS 538


>ref|XP_006340776.1| PREDICTED: aspartic proteinase-like protein 1-like [Solanum
            tuberosum]
          Length = 539

 Score =  539 bits (1389), Expect = e-150
 Identities = 272/526 (51%), Positives = 347/526 (65%), Gaps = 15/526 (2%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            +L L I  +      GF TFGFDIHHRYSD VKGIL++  LP+ GS+EYY+A T RDR +
Sbjct: 15   ILFLAILGYQLQRTDGFGTFGFDIHHRYSDPVKGILDLHGLPEKGSVEYYSAWTQRDRFV 74

Query: 1829 RGRGLAG--DKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIP 1656
            +GR LA   +    LTFS GN+T  ++ LGFLHYAN+++G+P L FLVALDTGSDLFW+P
Sbjct: 75   KGRRLADTTNPTVPLTFSGGNETLQLSSLGFLHYANVTVGSPGLSFLVALDTGSDLFWLP 134

Query: 1655 CDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIY 1476
            CDC +C++AL T NG + +LNIYSP     S+ VPCN +LC    RC  +   C Y V Y
Sbjct: 135  CDCSNCVRALQTRNGGRRNLNIYSPNTSSTSEVVPCNGTLCGQNRRCLASQNACAYGVAY 194

Query: 1475 LSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGM 1296
            LSN TSSSG+LVED+LHL T+     S  ++API  GCG  QTG+FL GAAPNGLFGLG+
Sbjct: 195  LSNNTSSSGVLVEDILHLETNNAQQKS--IEAPIALGCGIRQTGAFLTGAAPNGLFGLGI 252

Query: 1295 DKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVI 1116
            +  SVPS+L+S GL A+SFS CFGP+G+GRI FGDKGSP Q ETP NL+Q HPTYNIS+ 
Sbjct: 253  ENISVPSMLASKGLAANSFSTCFGPDGIGRIVFGDKGSPGQGETPLNLDQPHPTYNISLT 312

Query: 1115 QLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYD 936
             +++G+ +TD+DF+AIFDSGTSFTYLNDP Y  ++E+F+S+ K  R  PD  IPFEYCY 
Sbjct: 313  GITVGSKITDLDFTAIFDSGTSFTYLNDPVYKVITENFDSEAKQPRIQPDGEIPFEYCYG 372

Query: 935  LRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMT 756
            + ++      P V L MKGG+Q  +FDP++ +S+      YCL VVKS DVNIIGQNFMT
Sbjct: 373  ISANQTRFEVPEVNLTMKGGNQLYLFDPIIMLSLPDGSGAYCLAVVKSGDVNIIGQNFMT 432

Query: 755  GKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNG 576
            G RV+FDREK VL WK S+CYD   S+                    +  PEATK   + 
Sbjct: 433  GYRVIFDREKMVLGWKASDCYDSGESNDKSTTLPVNKHNSTEAPSPASVVPEATKGNASA 492

Query: 575  SRTSGAPPSSDGKS-------------SQLKSICFTFLMFYLFILA 477
            +  + + PS                   Q     F+F  ++L I++
Sbjct: 493  NEPATSLPSVPSSRPAGNHAPHLNSFYCQFMMAIFSFFSYFLIIIS 538


>ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
            gi|297329922|gb|EFH60341.1| aspartyl protease family
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score =  539 bits (1388), Expect = e-150
 Identities = 274/514 (53%), Positives = 346/514 (67%), Gaps = 1/514 (0%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            L++LL  +W    C GF  FGF+ HHR+SD V G+L  D LP   S +YY  M HRDR+I
Sbjct: 14   LIILLASSWVLERCEGFGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 1829 RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 1650
            RGR LA +    +TFS+GN+T  ++ LGFLHYAN+++GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLANEDQSLVTFSDGNETIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCD 133

Query: 1649 CKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 1470
            C +C++ L    GS LDLNIYSP     S KVPCNS+LC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLS 193

Query: 1469 NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 1290
            NGTSS+G+LVEDVLHL +  +D  S  + A +T GCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 1289 SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1110
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1109 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 933
            S+  N  D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371

Query: 932  RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 753
              + +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  D++IIGQNFMTG
Sbjct: 372  SPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAILKIEDISIIGQNFMTG 430

Query: 752  KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNGS 573
             RVVFDREK +L WKES+CY  E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCYTGETSA---RTLPSNRSSSSARPPASSFDPEATNIPSQRP 487

Query: 572  RTSGAPPSSDGKSSQLKSICFTFLMFYLFILAMV 471
             TS         SS   S+  +  +F+  ILA++
Sbjct: 488  NTS--------TSSAAYSLSISLSLFFFSILAIL 513


>ref|XP_006574660.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 524

 Score =  538 bits (1385), Expect = e-150
 Identities = 282/530 (53%), Positives = 365/530 (68%), Gaps = 13/530 (2%)
 Frame = -3

Query: 2030 MDSSAFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVK-----GILNVDELPKMGSLE 1866
            M +S F+++ LL   W    CHG   + F +HHR+S+ V+         +   P+ G++E
Sbjct: 1    MLASVFIIVSLLSL-WECCQCHG-HVYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVE 58

Query: 1865 YYAAMTHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVAL 1686
            YYA +  RDR++RGR L+   A  L FS+GN TF I+ LGFLHY  + +GTP + F+VAL
Sbjct: 59   YYAELADRDRLLRGRKLSQIDAG-LAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVAL 117

Query: 1685 DTGSDLFWIPCDCKSCIKALSTNNGSKL-----DLNIYSPXXXXXSKKVPCNSSLCKHQG 1521
            DTGSDLFW+PCDC  C  + ST   S L     DLN+Y+P     SKKV CN+SLC H+ 
Sbjct: 118  DTGSDLFWVPCDCTRCAASDSTAFASALATQDFDLNVYNPNGSSTSKKVTCNNSLCTHRS 177

Query: 1520 RCSGTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGS 1341
            +C GT + CPY V Y+S  TS+SGILVEDVLHLT +  DNH  +V+A + FGCGQ Q+GS
Sbjct: 178  QCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQE--DNHHDLVEANVIFGCGQIQSGS 235

Query: 1340 FLDGAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETP 1161
            FLD AAPNGLFGLGM+K SVPS+LS  G TADSFSMCFG +G+GRISFGDKGS DQ+ETP
Sbjct: 236  FLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETP 295

Query: 1160 FNLNQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDK 981
            FNLN  HPTYNI+V Q+ +GT + D++F+A+FDSGTSFTYL DP Y+ L+ESF+SQ +D+
Sbjct: 296  FNLNPSHPTYNITVTQVRVGTTVIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDR 355

Query: 980  RHPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGV 801
            RH  D+RIPFEYCYD+  DAN++L P+V+L M GGS F V+DP++ IS + E LVYCL V
Sbjct: 356  RHRSDSRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE-LVYCLAV 414

Query: 800  VKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIE--GSSXXXXXXXXXXXXXXXX 627
            VKS ++NIIGQNFMTG RVVFDREK VL WK+ +CYDIE    +                
Sbjct: 415  VKSAELNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPRSHADVPPAVA 474

Query: 626  XXXPNY-TPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 480
                NY   ++T+++   S+ S A PSS    S L +     ++ +++IL
Sbjct: 475  AGLGNYPATDSTRKSKYNSQRSIASPSSHCSHSSLPTFLGFLVLCFVYIL 524


>ref|XP_006299902.1| hypothetical protein CARUB_v10016111mg [Capsella rubella]
            gi|482568611|gb|EOA32800.1| hypothetical protein
            CARUB_v10016111mg [Capsella rubella]
          Length = 513

 Score =  538 bits (1385), Expect = e-150
 Identities = 277/512 (54%), Positives = 345/512 (67%), Gaps = 1/512 (0%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            L++LL  +W    C GF  FGF+ HHR+SD V   L  D LP   S +YY  M HRDR+I
Sbjct: 14   LILLLSSSWVLDRCEGFGEFGFEFHHRFSDQVVRALPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 1829 RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 1650
            RGR LA      +TFS+GN+T  ++ LGFLHYAN+S+GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLASVDQSLVTFSDGNETVRVDALGFLHYANVSIGTPSDWFLVALDTGSDLFWLPCD 133

Query: 1649 CKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 1470
            C +C++ L    GS L+LNIYSP     S KVPCNSSLC    RC+   + CPYQ+ YLS
Sbjct: 134  CTNCVRELKAPGGSSLELNIYSPNVSSTSSKVPCNSSLCTRGDRCASPQSNCPYQIRYLS 193

Query: 1469 NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 1290
            NGTSS+G+LVEDVLHL +  +D  S  + A +T GCGQ QTG F DGAAPNGLFGLG++ 
Sbjct: 194  NGTSSTGVLVEDVLHLVS--NDKSSKTIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLED 251

Query: 1289 SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1110
             SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYNI+V ++
Sbjct: 252  ISVPSVLAKEGIAANSFSMCFGTDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKI 311

Query: 1109 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRH-PPDTRIPFEYCYDL 933
            S+G N+ D++F A+FDSGTSFTYL D AY+ +SESFNS   DKR+   D+ +PFEYCY L
Sbjct: 312  SVGGNVGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSDLPFEYCYAL 371

Query: 932  RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 753
             ++ +S   P V L MKGGS +PV+ PLV I + K+  VYCL ++K  +++IIGQNFMTG
Sbjct: 372  SANKDSFQYPAVNLTMKGGSSYPVYHPLVVIPM-KDTDVYCLAIMKIEEISIIGQNFMTG 430

Query: 752  KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNGS 573
             RVVFDREK VL WKES+CY  E S+                    +Y PEAT      S
Sbjct: 431  YRVVFDREKLVLGWKESDCYTGETSA---RTLPSNRSSASARPPTSSYEPEATNIPSQRS 487

Query: 572  RTSGAPPSSDGKSSQLKSICFTFLMFYLFILA 477
             TS         +S   SI  +  +F+  ILA
Sbjct: 488  NTS---------TSSAYSISISLSLFFFSILA 510


>ref|XP_007203645.1| hypothetical protein PRUPE_ppa004265mg [Prunus persica]
            gi|462399176|gb|EMJ04844.1| hypothetical protein
            PRUPE_ppa004265mg [Prunus persica]
          Length = 519

 Score =  536 bits (1382), Expect = e-149
 Identities = 281/527 (53%), Positives = 358/527 (67%), Gaps = 8/527 (1%)
 Frame = -3

Query: 2036 QSMDSSAF----VLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSL 1869
            ++  SS+F    +L+ + I  W+S +C GF ++GFDIHHR+SD VK IL  DELP+ GS 
Sbjct: 7    RASSSSSFTATRLLVSVFILGWASRTCSGFGSYGFDIHHRFSDPVKAILGSDELPEKGSA 66

Query: 1868 EYYAAMTHRDRIIRGRGLA-GDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLV 1692
            EYYAAM HRDR+IRGR L+  D+   LTF  GN+T+ I   G LHYAN+S+GTPS  +LV
Sbjct: 67   EYYAAMAHRDRLIRGRHLSTADETTPLTFVYGNETYQIGAFGHLHYANVSVGTPSTSYLV 126

Query: 1691 ALDTGSDLFWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCS 1512
            ALDTGSDL W+PCDC SC++ L  +NG      IYSP     SKKV CNS+ C+    C+
Sbjct: 127  ALDTGSDLLWLPCDCSSCVRGLKFSNGVVKKFEIYSPNTSSTSKKVSCNSTYCEQPQHCA 186

Query: 1511 GTPTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLD 1332
               + C Y++ YLSN TSS+G+LVEDVLHLTTD  D     V+A I FGCG++QTG FLD
Sbjct: 187  SAASDCHYKIEYLSNDTSSTGVLVEDVLHLTTD--DAKQKDVNAQIGFGCGKEQTGIFLD 244

Query: 1331 GAAPNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNL 1152
            GAAPNGL GLGMD  S+PSIL+S GL ++SFSMCFG +G GRISFGD GS DQ ETPFNL
Sbjct: 245  GAAPNGLLGLGMDDVSIPSILASQGLASNSFSMCFGLDGSGRISFGDNGSLDQAETPFNL 304

Query: 1151 --NQLHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKR 978
               + +PTYNI++ QL++G ++TD++F AIFDSGTSFTYLNDPAY+ ++E+FNS  K+K+
Sbjct: 305  KNGRAYPTYNITITQLAIGESVTDLEFYAIFDSGTSFTYLNDPAYTQITENFNSALKNKQ 364

Query: 977  HPPDTRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPL-VFISIEKELLVYCLGV 801
               D+ IPFEYCYD+    N T    V L +KGG Q+P+ DPL VF + +   ++YCLG+
Sbjct: 365  RSKDSSIPFEYCYDI--SPNQT----VNLTLKGGKQYPLLDPLVVFANEDGTPMLYCLGI 418

Query: 800  VKSPDVNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXX 621
            VKS DVNIIGQNFMTG RV+FDRE+ VL WKESNCY++E +                   
Sbjct: 419  VKSGDVNIIGQNFMTGYRVIFDRERMVLGWKESNCYNVEDT----VTLPVTKSKSPAASP 474

Query: 620  XPNYTPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 480
                 PEAT  + N   TS  PPS+        +   T ++F  F +
Sbjct: 475  SSTINPEATAGSTN---TSHIPPSNHSPKLNSFACALTMVLFACFAI 518


>gb|EXC35303.1| Aspartic proteinase-like protein 1 [Morus notabilis]
          Length = 497

 Score =  535 bits (1378), Expect = e-149
 Identities = 267/446 (59%), Positives = 327/446 (73%), Gaps = 9/446 (2%)
 Frame = -3

Query: 1994 IFAWSSLSCHG--------FRTFGFDIHHRYSDSVKGILNVD-ELPKMGSLEYYAAMTHR 1842
            +F +S L C G        FR F F +HHR+SD VK   +   ++P+ GS EYYA +  R
Sbjct: 18   LFFFSVLICGGGCPGRIFSFRIFSFQMHHRFSDPVKRWSSAAADVPEKGSFEYYAHLADR 77

Query: 1841 DRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFW 1662
            DR++RGR L+      L FS+GN T  I  LGFLHY  + LGTP   F+VALDTGSDLFW
Sbjct: 78   DRLLRGRKLSELGNSPLAFSDGNSTVRITSLGFLHYTTVKLGTPGTTFMVALDTGSDLFW 137

Query: 1661 IPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQV 1482
            +PCDC  C    +T+      L++Y P     SKKV CN SLC H+ RC GT + CPY V
Sbjct: 138  VPCDCSRCAPTDATSYAPDFQLSMYDPKGSSTSKKVTCNDSLCVHRSRCLGTFSSCPYMV 197

Query: 1481 IYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGL 1302
             Y+S  TS+SGIL+EDVLHL  +  D H  +V+A +TFGCGQ Q+GSFLD AAPNGLFGL
Sbjct: 198  SYVSAETSTSGILIEDVLHLKKE--DKHEELVEAYVTFGCGQVQSGSFLDVAAPNGLFGL 255

Query: 1301 GMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNIS 1122
            GM+K SVPSILS  G  A+SFSMCFG +G+GRISFGDKGSPDQ+ETPFNLN  HPTYNI+
Sbjct: 256  GMEKISVPSILSKEGFIANSFSMCFGQDGIGRISFGDKGSPDQDETPFNLNPSHPTYNIT 315

Query: 1121 VIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYC 942
            V Q+ +GT+L D DFSA+FDSGTSFTYL +P Y+ LSESF+SQ +D R P D+RIPFEYC
Sbjct: 316  VTQIRVGTSLFDADFSALFDSGTSFTYLVEPIYTRLSESFHSQVQDSRRPTDSRIPFEYC 375

Query: 941  YDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNF 762
            YD+  +ANS+L P+++L MKGG QF V+DP++ IS + E +VYCL VVKS ++NIIGQNF
Sbjct: 376  YDMSPEANSSLIPSLSLTMKGGCQFAVYDPIIVISTQNE-IVYCLAVVKSAELNIIGQNF 434

Query: 761  MTGKRVVFDREKSVLRWKESNCYDIE 684
            MTG RVVFDREK VL WK+S+CYDIE
Sbjct: 435  MTGYRVVFDREKLVLGWKKSDCYDIE 460


>ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
            communis] gi|223525947|gb|EEF28344.1| Aspartic proteinase
            nepenthesin-2 precursor, putative [Ricinus communis]
          Length = 518

 Score =  534 bits (1376), Expect = e-149
 Identities = 267/451 (59%), Positives = 333/451 (73%), Gaps = 4/451 (0%)
 Frame = -3

Query: 2015 FVLLVLLIFAWS-SLSCHGFRTFGFDIHHRYSDSVKGILNVD---ELPKMGSLEYYAAMT 1848
            F++  LL+  W    +C G R F F +HHR+SD +K + +       P  GS EYYA + 
Sbjct: 7    FLVFSLLLSVWVFPQNCKG-RIFTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAELA 65

Query: 1847 HRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDL 1668
            HRD+++RGR L   +A  L FS+GN TF I+ LGFLHY  + LGTP + F+VALDTGSDL
Sbjct: 66   HRDQMLRGRKLYNVEAP-LAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDL 124

Query: 1667 FWIPCDCKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPY 1488
            FW+PCDC  C         S  +L+IY P     SKKV CN++LC H+ RC GT + CPY
Sbjct: 125  FWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPY 184

Query: 1487 QVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLF 1308
             V Y+S  TS+SGILVEDVLHLT++  D++   + A +TFGCGQ Q+GSFL+ AAPNGLF
Sbjct: 185  MVSYVSAQTSTSGILVEDVLHLTSE--DSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLF 242

Query: 1307 GLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYN 1128
            GLGMD+ SVPSILS  GLTADSFSMCFG +GVGRISFGDKGSPDQEETPFN N  HP+YN
Sbjct: 243  GLGMDQISVPSILSREGLTADSFSMCFGHDGVGRISFGDKGSPDQEETPFNSNPSHPSYN 302

Query: 1127 ISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFE 948
            ISV Q+ +GT L D+DF+A+FDSGTSFTYL +P Y+ +SE+F++Q +DKR PPD RIPFE
Sbjct: 303  ISVTQVRVGTTLVDVDFTALFDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFE 362

Query: 947  YCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQ 768
            YCYD+   ANS+L P+++L MKG   F VFDP++ I+ + E LVYCL +VKS ++NIIGQ
Sbjct: 363  YCYDMSPGANSSLIPSMSLTMKGRGHFTVFDPIIVITTQNE-LVYCLAIVKSTELNIIGQ 421

Query: 767  NFMTGKRVVFDREKSVLRWKESNCYDIEGSS 675
            NFMTG RVVFDREK VL WKE++CYD E +S
Sbjct: 422  NFMTGYRVVFDREKLVLGWKETDCYDQEYNS 452


>ref|XP_006409248.1| hypothetical protein EUTSA_v10022648mg [Eutrema salsugineum]
            gi|312282765|dbj|BAJ34248.1| unnamed protein product
            [Thellungiella halophila] gi|557110410|gb|ESQ50701.1|
            hypothetical protein EUTSA_v10022648mg [Eutrema
            salsugineum]
          Length = 515

 Score =  533 bits (1373), Expect = e-148
 Identities = 270/515 (52%), Positives = 346/515 (67%), Gaps = 2/515 (0%)
 Frame = -3

Query: 2009 LLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKGILNVDELPKMGSLEYYAAMTHRDRII 1830
            L+++L+ +W    C G   FGF+ HHR+SD V G+L  D LP   S +YY  M HRDR+I
Sbjct: 14   LILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73

Query: 1829 RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 1650
            RGR LA +    +TF++GN+T  +N LGFLHYAN+++GTPS WFLVALDTGSDLFW+PCD
Sbjct: 74   RGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCD 133

Query: 1649 CKS-CIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYL 1473
            C + C++ L    GS LDLNIYSP     S KVPCNS+LC    RC+   + CPYQ+ YL
Sbjct: 134  CSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQIRYL 193

Query: 1472 SNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMD 1293
            SNGTSS+G+LVEDVLHL +   + +S  + A IT GCG  QTG F DGAAPNGLFGLG++
Sbjct: 194  SNGTSSTGVLVEDVLHLVS--MEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGLE 251

Query: 1292 KSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQ 1113
              SVPS+L+  G+ A+SFSMCFG +G GRISFGDKGS DQ ETP N+ Q HPTYN++V Q
Sbjct: 252  DISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQ 311

Query: 1112 LSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDL 933
            +S+G N  D++F A+FD+GTSFTYL D  Y+ +SESFNS   DKR+  D+ +PFEYCY +
Sbjct: 312  ISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYAV 371

Query: 932  RSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTG 753
              +  S   P+V L MKGGS +PV+ PL+ + IE + +VYCL ++KS D++IIGQNFMTG
Sbjct: 372  SPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIE-DTVVYCLAIMKSEDISIIGQNFMTG 430

Query: 752  KRVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNGS 573
             RVVFDREK +L WKES+C   E S+                    ++ PEAT       
Sbjct: 431  YRVVFDREKLILGWKESDCSTGETSA---RTQPSNRSSSSARPPASSFDPEAT------- 480

Query: 572  RTSGAPPSSDGKSSQLKSICFTFLMFYLF-ILAMV 471
                  PSS   SS   S+  +    Y F ILA++
Sbjct: 481  NIPSQRPSSSSSSSYSYSLSLSLPFLYFFSILAIL 515


>ref|XP_006828808.1| hypothetical protein AMTR_s00001p00126200 [Amborella trichopoda]
            gi|548833787|gb|ERM96224.1| hypothetical protein
            AMTR_s00001p00126200 [Amborella trichopoda]
          Length = 522

 Score =  531 bits (1369), Expect = e-148
 Identities = 271/510 (53%), Positives = 355/510 (69%), Gaps = 6/510 (1%)
 Frame = -3

Query: 1991 FAWSSLSCHGFRTFGFDIHHRYSDSVKGILNV------DELPKMGSLEYYAAMTHRDRII 1830
            F +   SCH  +TFGFD+HH++S+ VK  +++      +E P+ GS +YY ++ H D  +
Sbjct: 16   FGFLFWSCHCRQTFGFDLHHKFSEPVKEWMSLRHGIGYEEWPESGSEDYYLSLVHHDHNL 75

Query: 1829 RGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGSDLFWIPCD 1650
            RGRG++   A  LTF++GN TF ++ LGFLHY+ ++LGTP++ FLVALDTGSDLFW+PCD
Sbjct: 76   RGRGISEIGAP-LTFADGNTTFKLSSLGFLHYSFVTLGTPNVTFLVALDTGSDLFWVPCD 134

Query: 1649 CKSCIKALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGTPTKCPYQVIYLS 1470
            C  C   LS + G   +LNIY+      SK V C++SLC+ Q  CS +   CPYQV Y+S
Sbjct: 135  CSRCAPTLSMSYGFDFELNIYNSNASSTSKHVSCSNSLCQWQSECSRSTGHCPYQVSYVS 194

Query: 1469 NGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGAAPNGLFGLGMDK 1290
            + TSSSG+L+EDVL+LTTD     S VV APITFGCGQ Q+GSFLD AAPNGLFGLG++K
Sbjct: 195  DDTSSSGVLIEDVLYLTTDD----SQVVKAPITFGCGQVQSGSFLDAAAPNGLFGLGVEK 250

Query: 1289 SSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQLHPTYNISVIQL 1110
             SVPSILS  GL  DSFSMCFG +G+GRI FGD GS DQEETPFNL+Q +PTYNIS+  +
Sbjct: 251  LSVPSILSGLGLIHDSFSMCFGQDGIGRIRFGDNGSSDQEETPFNLDQSYPTYNISITDI 310

Query: 1109 SLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPDTRIPFEYCYDLR 930
             +G++     FSA+FDSGTSFTYL DP Y+ L++SF+ Q  DKRH PD+R+PFEYCY+  
Sbjct: 311  QVGSSSIKTGFSALFDSGTSFTYLADPIYTRLAKSFDIQVPDKRHQPDSRLPFEYCYNAS 370

Query: 929  SDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPDVNIIGQNFMTGK 750
            S+ NS + P+V+L+M+GGS+FP++DP++  S +   +VYCL VVK   +NIIGQNFMTG 
Sbjct: 371  SNVNSNI-PDVSLLMQGGSRFPIYDPIISFSTQGH-IVYCLAVVKGEGMNIIGQNFMTGL 428

Query: 749  RVVFDREKSVLRWKESNCYDIEGSSXXXXXXXXXXXXXXXXXXXPNYTPEATKETGNGSR 570
            R+VFDREK VL WK+ NCYD+E +S                    NY PE TK  GN ++
Sbjct: 429  RIVFDREKLVLGWKKFNCYDVENTS-TLDIKPPYTVPPSSSVAPDNYAPEDTKTMGNTTQ 487

Query: 569  TSGAPPSSDGKSSQLKSICFTFLMFYLFIL 480
             S  PP     +++L    FT  +  LF+L
Sbjct: 488  VSIPPPPPLSDAARLFVFGFTRALSPLFLL 517


>ref|XP_006599302.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 519

 Score =  527 bits (1357), Expect = e-147
 Identities = 277/525 (52%), Positives = 361/525 (68%), Gaps = 12/525 (2%)
 Frame = -3

Query: 2018 AFVLLVLLIFAWSSLSCHGFRTFGFDIHHRYSDSVKG-----ILNVDELPKMGSLEYYAA 1854
            +FV ++  +F   SL CHG   + F +HHR+S+ V+         +   P+ G++EYYA 
Sbjct: 3    SFVFIIASLFL--SL-CHG-HVYTFTMHHRHSEPVRKWSHSTASGIPAPPEKGTVEYYAE 58

Query: 1853 MTHRDRIIRGRGLAGDKAQFLTFSNGNKTFLINPLGFLHYANISLGTPSLWFLVALDTGS 1674
            +  RDR++RGR L+      L FS+GN TF I+ LGFLHY  + +GTP + F+VALDTGS
Sbjct: 59   LADRDRLLRGRKLS-QIDDGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTGS 117

Query: 1673 DLFWIPCDCKSCI----KALSTNNGSKLDLNIYSPXXXXXSKKVPCNSSLCKHQGRCSGT 1506
            DLFW+PCDC  C      A ++   S  DLN+Y+P     SKKV CN+SLC H+ +C GT
Sbjct: 118  DLFWVPCDCTRCAATDSSAFASAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGT 177

Query: 1505 PTKCPYQVIYLSNGTSSSGILVEDVLHLTTDGHDNHSAVVDAPITFGCGQDQTGSFLDGA 1326
             + CPY V Y+S  TS+SGILVEDVLHLT +  DNH  +V+A + FGCGQ Q+GSFLD A
Sbjct: 178  LSNCPYMVSYVSAETSTSGILVEDVLHLTQE--DNHHDLVEANVIFGCGQIQSGSFLDVA 235

Query: 1325 APNGLFGLGMDKSSVPSILSSAGLTADSFSMCFGPNGVGRISFGDKGSPDQEETPFNLNQ 1146
            APNGLFGLGM+K SVPS+LS  G TADSFSMCFG +G+GRISFGDKGS DQ+ETPFNLN 
Sbjct: 236  APNGLFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNP 295

Query: 1145 LHPTYNISVIQLSLGTNLTDIDFSAIFDSGTSFTYLNDPAYSYLSESFNSQTKDKRHPPD 966
             HPTYNI+V Q+ +GT L D++F+A+FDSGTSFTYL DP Y+ L+ESF+SQ +D+RH  D
Sbjct: 296  SHPTYNITVTQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSD 355

Query: 965  TRIPFEYCYDLRSDANSTLAPNVTLIMKGGSQFPVFDPLVFISIEKELLVYCLGVVKSPD 786
            +RIPFEYCYD+  DAN++L P+V+L M GGS F V+DP++ IS + E LVYCL VVK+ +
Sbjct: 356  SRIPFEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE-LVYCLAVVKTAE 414

Query: 785  VNIIGQNFMTGKRVVFDREKSVLRWKESNCYDIE--GSSXXXXXXXXXXXXXXXXXXXPN 612
            +NIIGQNFMTG RVVFDREK VL WK+ +CYDIE    +                    N
Sbjct: 415  LNIIGQNFMTGYRVVFDREKLVLGWKKFDCYDIEDHNDAIPTRPHSHADVPPAVAAGLGN 474

Query: 611  Y-TPEATKETGNGSRTSGAPPSSDGKSSQLKSICFTFLMFYLFIL 480
            Y   + T+++   S+ S A PSS    + L +     ++ +++IL
Sbjct: 475  YPATDPTRKSKYNSQRSIASPSSHYSHTSLPTFLGFLVLCFVYIL 519

