BLASTX nr result

ID: Rehmannia26_contig00006979 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00006979
         (1564 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS67763.1| hypothetical protein M569_07010, partial [Genlise...   891   0.0  
ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence ...   877   0.0  
ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Popu...   876   0.0  
ref|XP_002330286.1| predicted protein [Populus trichocarpa]           876   0.0  
gb|EOX98217.1| Presequence protease 2 isoform 3 [Theobroma cacao]     874   0.0  
gb|EOX98216.1| Presequence protease 2 isoform 2 [Theobroma cacao]     874   0.0  
gb|EOX98215.1| Presequence protease 2 isoform 1 [Theobroma cacao]     874   0.0  
emb|CBI32433.3| unnamed protein product [Vitis vinifera]              874   0.0  
ref|XP_002282024.1| PREDICTED: presequence protease 2, chloropla...   874   0.0  
ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citr...   873   0.0  
ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Popu...   868   0.0  
ref|XP_004296078.1| PREDICTED: presequence protease 1, chloropla...   867   0.0  
gb|EMJ02012.1| hypothetical protein PRUPE_ppa025698mg, partial [...   867   0.0  
ref|XP_006346464.1| PREDICTED: presequence protease 1, chloropla...   866   0.0  
ref|XP_004511282.1| PREDICTED: presequence protease 1, chloropla...   865   0.0  
ref|XP_004230817.1| PREDICTED: presequence protease 1, chloropla...   865   0.0  
ref|XP_003517606.1| PREDICTED: presequence protease 2, chloropla...   861   0.0  
ref|XP_002518787.1| zinc metalloprotease, putative [Ricinus comm...   860   0.0  
ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [A...   857   0.0  
ref|XP_003636021.1| Presequence protease [Medicago truncatula] g...   852   0.0  

>gb|EPS67763.1| hypothetical protein M569_07010, partial [Genlisea aurea]
          Length = 947

 Score =  891 bits (2302), Expect = 0.0
 Identities = 440/522 (84%), Positives = 477/522 (91%), Gaps = 1/522 (0%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+S+AVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPF+PLKYQEPL+ALKA
Sbjct: 343  FSSEAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFEPLKYQEPLRALKA 402

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIA EGSK VFAPLIE FIL N H V +EMQPDPEK+S DE AEK  LEKVK+SMT+EDL
Sbjct: 403  RIAGEGSKAVFAPLIENFILKNRHLVVVEMQPDPEKSSSDEVAEKNILEKVKSSMTQEDL 462

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT +LKLKQETPDPPE LKCVP+LSL+DIP KP+ +P+EVG+ING  VLQHDLFT
Sbjct: 463  AELARATQDLKLKQETPDPPEVLKCVPSLSLQDIPTKPMAVPSEVGNINGVNVLQHDLFT 522

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAEVVFNMSSLK ELLPLVPLFCQSLLEMGTKDMDFV+LNQLIGRKTGGISVYPFT
Sbjct: 523  NDVLYAEVVFNMSSLKPELLPLVPLFCQSLLEMGTKDMDFVRLNQLIGRKTGGISVYPFT 582

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRG+EDPCS IIVRGKAMSER EDLFNLVN V+Q+VQLTDQKRFKQFVSQSKARMENR
Sbjct: 583  SSVRGREDPCSHIIVRGKAMSERVEDLFNLVNVVLQDVQLTDQKRFKQFVSQSKARMENR 642

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            +RGSGH IAAARMDAKLN AGW+SE+MGG+SYLEFL+ALE ++DDDW          RRT
Sbjct: 643  IRGSGHSIAAARMDAKLNAAGWISEQMGGISYLEFLRALETQIDDDWSAVSSSLEEIRRT 702

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGW-SARLPRTNEALVIPTQ 1257
            L+SKN CL+NLTADGKNL+N+EK+V KFLD LP+ SP GS  W S+RLP TNEA+V+PTQ
Sbjct: 703  LVSKNGCLVNLTADGKNLKNSEKHVGKFLDFLPSTSPNGSNDWTSSRLPLTNEAIVVPTQ 762

Query: 1258 VNYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 1437
            VNYVGKAANLFETGYQLKGSAYVISKYLNN+WLWDRVRVSGGAYGGFCDFDTHSGVFSFL
Sbjct: 763  VNYVGKAANLFETGYQLKGSAYVISKYLNNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 822

Query: 1438 SYRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            SYRDPNLLKTLDVYDGT NFL+ELEMD DALTKAIIGTIGDV
Sbjct: 823  SYRDPNLLKTLDVYDGTGNFLKELEMDGDALTKAIIGTIGDV 864


>ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence protease 2,
            chloroplastic/mitochondrial-like [Citrus sinensis]
          Length = 1082

 Score =  877 bits (2265), Expect = 0.0
 Identities = 427/521 (81%), Positives = 476/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+SDAVEASMNTIEFSLRENNTGSFPRGL+LMLRS+GKWIYDM+PF+PLKY++PL ALKA
Sbjct: 479  FDSDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKA 538

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            R+AEEGSK VF+PLIEK+IL+NPH VT+EMQPDPEKASRDEAAEKE L KVK+SMT+EDL
Sbjct: 539  RLAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDL 598

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPEAL+ VP+LSL DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 599  AELARATEELRLKQETPDPPEALRSVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFT 658

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY EVVF+MSSLKQELLPL+PLFCQSL EMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 659  NDVLYTEVVFDMSSLKQELLPLIPLFCQSLKEMGTKDLSFVQLNQLIGRKTGGISVYPFT 718

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS+RGKEDPC  ++VRGKAM+ +AEDLFNL NCV+QEVQLTDQ+RFKQFVSQSKARMENR
Sbjct: 719  SSIRGKEDPCCCMVVRGKAMAGQAEDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENR 778

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW+SE+MGGVSYLEFLQALE+KVD DW          RR+
Sbjct: 779  LRGSGHGIAAARMDAKLNTAGWISEQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRS 838

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
             +S+  CLIN+TADGKNL+N+E++V KFLDMLPTNSPV    W A LP  NEA+VIPTQV
Sbjct: 839  FLSREGCLINMTADGKNLKNSERFVGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQV 898

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+FETGY+L GSAYVISK+++N WLWDRVRVSGGAYGGFCDFD+HSGVFSFLS
Sbjct: 899  NYVGKAANIFETGYKLNGSAYVISKHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLS 958

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLD+YDGT +FLRELEMDDD LTKAIIGTIGDV
Sbjct: 959  YRDPNLLKTLDIYDGTVDFLRELEMDDDTLTKAIIGTIGDV 999


>ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Populus trichocarpa]
            gi|550341043|gb|ERP62222.1| hypothetical protein
            POPTR_0004s14960g [Populus trichocarpa]
          Length = 1091

 Score =  876 bits (2263), Expect = 0.0
 Identities = 430/521 (82%), Positives = 473/521 (90%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F ++AVEASMNTIEFSLRENNTGSFPRGL+LMLRSI KWIYDM+PF+PLKY++PL  LKA
Sbjct: 488  FETEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSISKWIYDMNPFEPLKYEKPLMDLKA 547

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEG K VF+PLIEKFIL+NPHRVT+EMQPDPEKAS DEAAE+E LEKVKASMTEEDL
Sbjct: 548  RIAEEGYKAVFSPLIEKFILNNPHRVTVEMQPDPEKASHDEAAEREILEKVKASMTEEDL 607

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT ELKLKQETPDPPEAL+ VP+L L DIPK+PIH+PTEVGDING KVL+HDLFT
Sbjct: 608  AELARATQELKLKQETPDPPEALRSVPSLFLCDIPKEPIHVPTEVGDINGVKVLKHDLFT 667

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAE+VFNM SLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGIS+YPFT
Sbjct: 668  NDVLYAEIVFNMRSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISLYPFT 727

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRG+EDPCS I+ RGKAM+ R EDLFNLVNCV+QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 728  SSVRGREDPCSHIVARGKAMAGRVEDLFNLVNCVLQEVQFTDQQRFKQFVSQSKARMENR 787

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNVAGW+SE+MGGVSYLEFL+ALEK+VD DW          R +
Sbjct: 788  LRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLKALEKRVDQDWAGVSSSLEEIRMS 847

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L SKN CLIN+TADGKNL N+EKYVSKFLD+LP+ S V +  W+ARL   NEA+VIPTQV
Sbjct: 848  LFSKNGCLINMTADGKNLTNSEKYVSKFLDLLPSKSSVEAAAWNARLSPGNEAIVIPTQV 907

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+++TGYQL GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 908  NYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 967

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDG+  FLRELEMDDD L KAIIGTIGDV
Sbjct: 968  YRDPNLLKTLDVYDGSGAFLRELEMDDDTLAKAIIGTIGDV 1008


>ref|XP_002330286.1| predicted protein [Populus trichocarpa]
          Length = 1007

 Score =  876 bits (2263), Expect = 0.0
 Identities = 430/521 (82%), Positives = 473/521 (90%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F ++AVEASMNTIEFSLRENNTGSFPRGL+LMLRSI KWIYDM+PF+PLKY++PL  LKA
Sbjct: 404  FETEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSISKWIYDMNPFEPLKYEKPLMDLKA 463

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEG K VF+PLIEKFIL+NPHRVT+EMQPDPEKAS DEAAE+E LEKVKASMTEEDL
Sbjct: 464  RIAEEGYKAVFSPLIEKFILNNPHRVTVEMQPDPEKASHDEAAEREILEKVKASMTEEDL 523

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT ELKLKQETPDPPEAL+ VP+L L DIPK+PIH+PTEVGDING KVL+HDLFT
Sbjct: 524  AELARATQELKLKQETPDPPEALRSVPSLFLCDIPKEPIHVPTEVGDINGVKVLKHDLFT 583

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAE+VFNM SLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGIS+YPFT
Sbjct: 584  NDVLYAEIVFNMRSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISLYPFT 643

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRG+EDPCS I+ RGKAM+ R EDLFNLVNCV+QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 644  SSVRGREDPCSHIVARGKAMAGRVEDLFNLVNCVLQEVQFTDQQRFKQFVSQSKARMENR 703

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNVAGW+SE+MGGVSYLEFL+ALEK+VD DW          R +
Sbjct: 704  LRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLKALEKRVDQDWAGVSSSLEEIRMS 763

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L SKN CLIN+TADGKNL N+EKYVSKFLD+LP+ S V +  W+ARL   NEA+VIPTQV
Sbjct: 764  LFSKNGCLINMTADGKNLTNSEKYVSKFLDLLPSKSSVEAAAWNARLSPGNEAIVIPTQV 823

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+++TGYQL GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 824  NYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 883

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDG+  FLRELEMDDD L KAIIGTIGDV
Sbjct: 884  YRDPNLLKTLDVYDGSGAFLRELEMDDDTLAKAIIGTIGDV 924


>gb|EOX98217.1| Presequence protease 2 isoform 3 [Theobroma cacao]
          Length = 1041

 Score =  874 bits (2258), Expect = 0.0
 Identities = 427/521 (81%), Positives = 477/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DAVEASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL  LKA
Sbjct: 482  FDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKA 541

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEKFIL+NPH VTIEMQPDPEKASRDEAAEKE L KVKASMTEEDL
Sbjct: 542  RIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDL 601

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT ELKLKQETPDPPEAL+ VP+LSL DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 602  AELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFT 661

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY +VVF+MSSLK+ELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 662  NDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFT 721

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS++GKEDPCS IIVRGK+M+  A+DLFNL+NCVIQEVQ TDQ+RFKQFVSQSKARME+R
Sbjct: 722  SSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESR 781

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNV+GW+SE+MGGVSYLEFLQ LE++VD+DW          R++
Sbjct: 782  LRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKS 841

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+TADGKNL N EK VSKFLD+LP+NS V    WSARLP  NEA+VIPTQV
Sbjct: 842  LLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQV 901

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL++ GYQL GSAYVISK+++N+WLWDRVRVSGGAYGGFC+FDTHSGVF+FLS
Sbjct: 902  NYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLS 961

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLL+TLD+YDGT +FLRELEMDDD LTKAIIGT+GDV
Sbjct: 962  YRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDV 1002


>gb|EOX98216.1| Presequence protease 2 isoform 2 [Theobroma cacao]
          Length = 1040

 Score =  874 bits (2258), Expect = 0.0
 Identities = 427/521 (81%), Positives = 477/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DAVEASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL  LKA
Sbjct: 482  FDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKA 541

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEKFIL+NPH VTIEMQPDPEKASRDEAAEKE L KVKASMTEEDL
Sbjct: 542  RIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDL 601

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT ELKLKQETPDPPEAL+ VP+LSL DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 602  AELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFT 661

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY +VVF+MSSLK+ELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 662  NDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFT 721

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS++GKEDPCS IIVRGK+M+  A+DLFNL+NCVIQEVQ TDQ+RFKQFVSQSKARME+R
Sbjct: 722  SSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESR 781

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNV+GW+SE+MGGVSYLEFLQ LE++VD+DW          R++
Sbjct: 782  LRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKS 841

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+TADGKNL N EK VSKFLD+LP+NS V    WSARLP  NEA+VIPTQV
Sbjct: 842  LLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQV 901

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL++ GYQL GSAYVISK+++N+WLWDRVRVSGGAYGGFC+FDTHSGVF+FLS
Sbjct: 902  NYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLS 961

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLL+TLD+YDGT +FLRELEMDDD LTKAIIGT+GDV
Sbjct: 962  YRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDV 1002


>gb|EOX98215.1| Presequence protease 2 isoform 1 [Theobroma cacao]
          Length = 1037

 Score =  874 bits (2258), Expect = 0.0
 Identities = 427/521 (81%), Positives = 477/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DAVEASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL  LKA
Sbjct: 482  FDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKA 541

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEKFIL+NPH VTIEMQPDPEKASRDEAAEKE L KVKASMTEEDL
Sbjct: 542  RIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDL 601

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT ELKLKQETPDPPEAL+ VP+LSL DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 602  AELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFT 661

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY +VVF+MSSLK+ELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 662  NDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFT 721

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS++GKEDPCS IIVRGK+M+  A+DLFNL+NCVIQEVQ TDQ+RFKQFVSQSKARME+R
Sbjct: 722  SSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESR 781

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNV+GW+SE+MGGVSYLEFLQ LE++VD+DW          R++
Sbjct: 782  LRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKS 841

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+TADGKNL N EK VSKFLD+LP+NS V    WSARLP  NEA+VIPTQV
Sbjct: 842  LLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQV 901

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL++ GYQL GSAYVISK+++N+WLWDRVRVSGGAYGGFC+FDTHSGVF+FLS
Sbjct: 902  NYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLS 961

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLL+TLD+YDGT +FLRELEMDDD LTKAIIGT+GDV
Sbjct: 962  YRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDV 1002


>emb|CBI32433.3| unnamed protein product [Vitis vinifera]
          Length = 1098

 Score =  874 bits (2257), Expect = 0.0
 Identities = 427/521 (81%), Positives = 475/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            FNS+AVEASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL ALKA
Sbjct: 495  FNSEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMALKA 554

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEK+IL+NPH VT+EMQPDPEKASRDEA E+E LEKVKA MTEEDL
Sbjct: 555  RIAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAVEREILEKVKAGMTEEDL 614

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPEALK VP+LSL DIPK+PIH+P E+G IN  KVL+HDLFT
Sbjct: 615  AELARATQELRLKQETPDPPEALKSVPSLSLLDIPKEPIHVPIEIGVINDVKVLRHDLFT 674

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY E+VF+MSSLKQ+LLPLVPLFCQSL+EMGTKDMDFVQLNQLIGRKTGGISVYPFT
Sbjct: 675  NDVLYTEIVFDMSSLKQDLLPLVPLFCQSLMEMGTKDMDFVQLNQLIGRKTGGISVYPFT 734

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRGKE PCS IIVRGKAM+  AEDLFNLVNC++QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 735  SSVRGKEYPCSHIIVRGKAMAGCAEDLFNLVNCILQEVQFTDQQRFKQFVSQSKARMENR 794

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW++E+MGGVSYLEFLQALE+KVD DW          R++
Sbjct: 795  LRGSGHGIAAARMDAKLNTAGWIAEQMGGVSYLEFLQALEEKVDQDWIGISSSLEEIRKS 854

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+T++GKNL N+EKYVSKFLD+LP +S V  T W+ RL   NEA+VIPTQV
Sbjct: 855  LLSRKGCLINMTSEGKNLMNSEKYVSKFLDLLPGSSSVEKTTWNGRLSSENEAIVIPTQV 914

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKA N+++TGYQLKGSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 915  NYVGKATNIYDTGYQLKGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 974

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGT +FLR+LEMDDD LTKAIIGTIGDV
Sbjct: 975  YRDPNLLKTLDVYDGTGDFLRQLEMDDDTLTKAIIGTIGDV 1015


>ref|XP_002282024.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial-like
            [Vitis vinifera]
          Length = 1080

 Score =  874 bits (2257), Expect = 0.0
 Identities = 427/521 (81%), Positives = 475/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            FNS+AVEASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL ALKA
Sbjct: 477  FNSEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMALKA 536

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEK+IL+NPH VT+EMQPDPEKASRDEA E+E LEKVKA MTEEDL
Sbjct: 537  RIAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAVEREILEKVKAGMTEEDL 596

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPEALK VP+LSL DIPK+PIH+P E+G IN  KVL+HDLFT
Sbjct: 597  AELARATQELRLKQETPDPPEALKSVPSLSLLDIPKEPIHVPIEIGVINDVKVLRHDLFT 656

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY E+VF+MSSLKQ+LLPLVPLFCQSL+EMGTKDMDFVQLNQLIGRKTGGISVYPFT
Sbjct: 657  NDVLYTEIVFDMSSLKQDLLPLVPLFCQSLMEMGTKDMDFVQLNQLIGRKTGGISVYPFT 716

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRGKE PCS IIVRGKAM+  AEDLFNLVNC++QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 717  SSVRGKEYPCSHIIVRGKAMAGCAEDLFNLVNCILQEVQFTDQQRFKQFVSQSKARMENR 776

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW++E+MGGVSYLEFLQALE+KVD DW          R++
Sbjct: 777  LRGSGHGIAAARMDAKLNTAGWIAEQMGGVSYLEFLQALEEKVDQDWIGISSSLEEIRKS 836

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+T++GKNL N+EKYVSKFLD+LP +S V  T W+ RL   NEA+VIPTQV
Sbjct: 837  LLSRKGCLINMTSEGKNLMNSEKYVSKFLDLLPGSSSVEKTTWNGRLSSENEAIVIPTQV 896

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKA N+++TGYQLKGSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 897  NYVGKATNIYDTGYQLKGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 956

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGT +FLR+LEMDDD LTKAIIGTIGDV
Sbjct: 957  YRDPNLLKTLDVYDGTGDFLRQLEMDDDTLTKAIIGTIGDV 997


>ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citrus clementina]
            gi|557524981|gb|ESR36287.1| hypothetical protein
            CICLE_v10027722mg [Citrus clementina]
          Length = 1082

 Score =  873 bits (2255), Expect = 0.0
 Identities = 425/521 (81%), Positives = 475/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+SDAVEASMNTIEFSLRENNTGSFPRGL+LMLRS+GKWIYDM+PF+PLKY++PL ALKA
Sbjct: 479  FDSDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKA 538

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            R+AEEG K VF+PLIEK+IL+NPH VT+EMQPDPEKASRDEAAEKE L KVK+SMT+EDL
Sbjct: 539  RLAEEGPKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDL 598

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPEAL+ VP+LSL DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 599  AELARATEELRLKQETPDPPEALRSVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFT 658

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY EVVF+MSSLKQELLPL+PLFCQSL EMGTKD+ FVQL+QLIGRKTGGISVYPFT
Sbjct: 659  NDVLYTEVVFDMSSLKQELLPLIPLFCQSLKEMGTKDLSFVQLDQLIGRKTGGISVYPFT 718

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS+RGKEDPC  ++VRGKAM+ +AEDLFNL NCV+QEVQLTDQ+RFKQFVSQSKARMENR
Sbjct: 719  SSIRGKEDPCCCMVVRGKAMAGQAEDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENR 778

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW+SE+MGGVSYLEFLQALE+KVD DW          RR+
Sbjct: 779  LRGSGHGIAAARMDAKLNTAGWISEQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRS 838

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
             +S+  CLIN+TADGKNL+N+E++V KFLDMLPTNSPV    W A LP  NEA+VIPTQV
Sbjct: 839  FLSREGCLINITADGKNLKNSERFVGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQV 898

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+FETGY+L GSAYVISK+++N WLWDRVRVSGGAYGGFCDFD+HSGVFSFLS
Sbjct: 899  NYVGKAANIFETGYKLNGSAYVISKHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLS 958

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLD+YDGT +FLRELEMDDD LTKAIIGTIGDV
Sbjct: 959  YRDPNLLKTLDIYDGTVDFLRELEMDDDTLTKAIIGTIGDV 999


>ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Populus trichocarpa]
            gi|222849515|gb|EEE87062.1| hypothetical protein
            POPTR_0009s10650g [Populus trichocarpa]
          Length = 1006

 Score =  868 bits (2244), Expect = 0.0
 Identities = 426/521 (81%), Positives = 472/521 (90%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F +DAVEASMNTIEFSLRENNTGSFPRGL+LML+SI KWIYDMDPF+PLKY++PL ALKA
Sbjct: 403  FETDAVEASMNTIEFSLRENNTGSFPRGLSLMLQSISKWIYDMDPFEPLKYEKPLMALKA 462

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLIEKFIL+N HRVTIEMQPDPEKASRDEAAE+E LEKVKASMTEEDL
Sbjct: 463  RIAEEGSKAVFSPLIEKFILNNLHRVTIEMQPDPEKASRDEAAEREILEKVKASMTEEDL 522

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPEAL+ VP+LSL DIPK+P+H+PTE GDING KVL+HDLFT
Sbjct: 523  AELARATQELRLKQETPDPPEALRSVPSLSLLDIPKEPLHVPTEAGDINGVKVLKHDLFT 582

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAE+VFNM SLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 583  NDVLYAEIVFNMRSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFT 642

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS++G+EDPCS II +GKAM+ R EDLFNLVNCV+QEVQ TDQ+RFKQFVSQSKA MENR
Sbjct: 643  SSIQGREDPCSHIIAQGKAMAGRVEDLFNLVNCVLQEVQFTDQQRFKQFVSQSKAGMENR 702

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGH IAA RMDAKLNV GW+SE+MGGVSYLEFLQALE++VD DW          R +
Sbjct: 703  LRGSGHRIAATRMDAKLNVTGWISEQMGGVSYLEFLQALEERVDQDWAGVSSSLEEIRTS 762

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+SKN CLIN+TADGKNL N+EKYVSKFLD+LP+ S V +  W+ARL   NEA+VIPTQV
Sbjct: 763  LLSKNGCLINMTADGKNLTNSEKYVSKFLDLLPSKSSVEAAAWNARLSPGNEAIVIPTQV 822

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+++TGYQL GSAYVISKY++N+WLWDRVRVSGGAYGGFCD DTHSGVFSFLS
Sbjct: 823  NYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDLDTHSGVFSFLS 882

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGT  FLR+LEMDDD L+KAIIGTIGDV
Sbjct: 883  YRDPNLLKTLDVYDGTGAFLRQLEMDDDTLSKAIIGTIGDV 923


>ref|XP_004296078.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Fragaria vesca subsp. vesca]
          Length = 1073

 Score =  867 bits (2239), Expect = 0.0
 Identities = 422/521 (80%), Positives = 476/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++ AVEASMNTIEFSLRENNTGSFPRGL+LMLRS+GKWIYDMDPFQPLKY++PL ALKA
Sbjct: 470  FDTAAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMDPFQPLKYEKPLLALKA 529

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RI EEGSK VF+PLIEKFIL+NPHRV +EMQPDPEKASRDEAAEKE LEKVKA MTEEDL
Sbjct: 530  RIEEEGSKAVFSPLIEKFILNNPHRVVVEMQPDPEKASRDEAAEKEILEKVKAGMTEEDL 589

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT +LKLKQETPDPPEAL+ VP+LSL+DIPK+PI IPTEVGDING K+LQHDLFT
Sbjct: 590  AELARATQDLKLKQETPDPPEALRSVPSLSLQDIPKEPIAIPTEVGDINGVKILQHDLFT 649

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY EVVF+MS  KQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYP T
Sbjct: 650  NDVLYTEVVFDMSLPKQELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPMT 709

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRGK+D CS IIVRGKAM+ RA+DLF+L+NC++QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 710  SSVRGKKDACSHIIVRGKAMAGRADDLFHLMNCILQEVQFTDQQRFKQFVSQSKARMENR 769

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNVAGW+SE+MGG SYLEFLQ LE+KVD+DW          R++
Sbjct: 770  LRGSGHGIAAARMDAKLNVAGWISEQMGGFSYLEFLQDLEQKVDNDWEKISSSLEEIRKS 829

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLIN+TA+GKNL N+EK+V KFLD+LP+ SP+  T W+ARLP TNEALVIPTQV
Sbjct: 830  LLSREGCLINMTAEGKNLTNSEKFVGKFLDLLPSKSPLTRTTWNARLPSTNEALVIPTQV 889

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+++TGYQL GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFD+HSGVFSFLS
Sbjct: 890  NYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLS 949

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLD+YDGT  FLR+L+MD++ LTK+IIGTIGDV
Sbjct: 950  YRDPNLLKTLDIYDGTGEFLRQLDMDEETLTKSIIGTIGDV 990


>gb|EMJ02012.1| hypothetical protein PRUPE_ppa025698mg, partial [Prunus persica]
          Length = 986

 Score =  867 bits (2239), Expect = 0.0
 Identities = 422/521 (80%), Positives = 477/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DAVEASMNTIEFSLRENNTGSFPRGL+LMLRS+GKWIYDMDPF+PLKY++PL ALKA
Sbjct: 384  FDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMDPFEPLKYEKPLLALKA 443

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RI  EGSK VF+PLIEKFIL+N HRV +EMQPDPEKASRDE AEK+ L+KVKA MTEEDL
Sbjct: 444  RIEAEGSKAVFSPLIEKFILNNRHRVVVEMQPDPEKASRDEEAEKQILDKVKAGMTEEDL 503

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+L+QETPDPPEAL+ VP+LSL+DIPK+P  +PTEVGDING KVLQHDLFT
Sbjct: 504  AELARATQELRLRQETPDPPEALRSVPSLSLQDIPKEPTRVPTEVGDINGVKVLQHDLFT 563

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY EVVFNMSSLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYP T
Sbjct: 564  NDVLYTEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPMT 623

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRGKEDPCS IIVRGKAM+ RA+DLF+L NCV+QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 624  SSVRGKEDPCSHIIVRGKAMAGRADDLFHLFNCVLQEVQFTDQQRFKQFVSQSKARMENR 683

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNVAGW+SE+MGGVSYLEFLQALE+KVD DW          R++
Sbjct: 684  LRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLQALEEKVDQDWDGISSSLEEIRKS 743

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+N C++N+TA+GKNL N+EK+VSKFLD+LP NSPV ++ W+ARLP +NEA+VIPTQV
Sbjct: 744  LLSRNGCIVNMTAEGKNLTNSEKFVSKFLDLLP-NSPVATSTWNARLPSSNEAIVIPTQV 802

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAAN+++TGYQL GSAYVISKY+ N+WLWDRVRVSGGAYGGFCDFD+HSGVFSFLS
Sbjct: 803  NYVGKAANIYDTGYQLNGSAYVISKYICNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLS 862

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNL KTL VYDGT +FLR+L+MDD+ LTK+IIGTIGDV
Sbjct: 863  YRDPNLFKTLGVYDGTGDFLRQLDMDDETLTKSIIGTIGDV 903


>ref|XP_006346464.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum tuberosum]
          Length = 1072

 Score =  866 bits (2238), Expect = 0.0
 Identities = 420/521 (80%), Positives = 479/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+ DAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKW+YDMDPF+PLKYQ+PL+ALKA
Sbjct: 469  FDLDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKA 528

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIA+EGSK VFAPL++++IL NPHRVT+EMQPDPEKASR+E  EKE L+KVKASMT+EDL
Sbjct: 529  RIAKEGSKAVFAPLMDQYILRNPHRVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDL 588

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARATHEL+LKQETPDPPEALK VP+LSL+DIP++P+ +PTE+GDING KVL+HDLFT
Sbjct: 589  AELARATHELRLKQETPDPPEALKSVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFT 648

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAEVVFN+SSLKQELLPLVPLFCQSLLEMGTKD+DFVQLNQLIGRKTGG+SVYPFT
Sbjct: 649  NDVLYAEVVFNLSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFT 708

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSV GK +PCS+IIVRGKAMS+R EDLF L+N V+Q+VQL DQKRFKQFVSQS++RMENR
Sbjct: 709  SSVHGKVEPCSKIIVRGKAMSQRTEDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENR 768

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGH IAAARM AKLNVAGW+SE+MGGVSYLEFL+ LE +V+ DW          R++
Sbjct: 769  LRGSGHSIAAARMGAKLNVAGWISEQMGGVSYLEFLKVLEDQVEKDWPQISSSLEEIRKS 828

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+SKN CLINLTADGKNL NAEK++S+FLD+LP+ S V S  W+A+L R+NEA V+PTQV
Sbjct: 829  LLSKNGCLINLTADGKNLNNAEKHISEFLDLLPSTSLVESAAWNAQLSRSNEAFVVPTQV 888

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL+E GY+LKGSAYVIS Y++N+WLWDRVRVSGGAYGGFC FD+HSGVFSFLS
Sbjct: 889  NYVGKAANLYEAGYELKGSAYVISNYISNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLS 948

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGTS+FL+ELEMDDDALTKAIIGTIGDV
Sbjct: 949  YRDPNLLKTLDVYDGTSSFLKELEMDDDALTKAIIGTIGDV 989


>ref|XP_004511282.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cicer arietinum]
          Length = 1080

 Score =  865 bits (2236), Expect = 0.0
 Identities = 417/522 (79%), Positives = 481/522 (92%), Gaps = 1/522 (0%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DA+EASMNTIEFSLRENNTGSFPRGL+LML+SIGKWIYDM+P +PLKY++PL+ LK+
Sbjct: 476  FDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDLKS 535

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            +IA+EGSK+VF+PLIEKFIL+NPH+VT++MQPDPEKA+RDE  EK+ L+K+KASMT EDL
Sbjct: 536  KIAKEGSKSVFSPLIEKFILNNPHKVTVQMQPDPEKAARDEETEKQVLQKIKASMTTEDL 595

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARATHEL+LKQETPDPPEALK VP+LSL+DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 596  AELARATHELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDLFT 655

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY E+VF+MSSLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 656  NDVLYTEIVFDMSSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFT 715

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSV+GKEDPCS +IVRGKAMS RAEDL++LVN V+Q+VQ TDQ+RFKQFVSQS+ARMENR
Sbjct: 716  SSVQGKEDPCSHMIVRGKAMSGRAEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENR 775

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW+SEKMGG+SYLEFLQ LEK+VD+DW          R+T
Sbjct: 776  LRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLQTLEKRVDEDWADISSSLEEIRKT 835

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTG-WSARLPRTNEALVIPTQ 1257
            + SK  CLIN+TADGKNL N +K+VSKF+DMLPT+SP+ +T  W+ARLP TNEA+VIPTQ
Sbjct: 836  VFSKQGCLINITADGKNLANMDKFVSKFVDMLPTSSPIATTNIWNARLPLTNEAIVIPTQ 895

Query: 1258 VNYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 1437
            VNYVGKA N+++ GY+L GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFL
Sbjct: 896  VNYVGKATNVYDAGYKLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 955

Query: 1438 SYRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            SYRDPNLLKTL+VYDGT +FLRELE+DDD LTKAIIGTIGDV
Sbjct: 956  SYRDPNLLKTLEVYDGTGDFLRELEIDDDTLTKAIIGTIGDV 997


>ref|XP_004230817.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum lycopersicum]
          Length = 1072

 Score =  865 bits (2234), Expect = 0.0
 Identities = 419/521 (80%), Positives = 478/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+SDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKW+YDMDPF+PLKYQ+PL+ALKA
Sbjct: 469  FDSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKA 528

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIA+EGSK VFAPL++++IL NPHRVT+EMQPDPEKASR+E  EKE L+KVKASMT+EDL
Sbjct: 529  RIAKEGSKAVFAPLMDQYILRNPHRVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDL 588

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARATHEL+LKQETPDPPEALK VP+LSL+DIP++P+ +PTE+GDING KVL+HDLFT
Sbjct: 589  AELARATHELRLKQETPDPPEALKSVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFT 648

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAEVVFN+SSLKQELLPLVPLFCQSLLEMGTKD+DFVQLNQLIGRKTGG+SVYPFT
Sbjct: 649  NDVLYAEVVFNLSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFT 708

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSV GK +PCS+IIVRGKAMS+R EDLF L+N V+Q+VQL DQKRFKQFVSQS++RMENR
Sbjct: 709  SSVHGKVEPCSKIIVRGKAMSQRTEDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENR 768

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGH +AAARM AKLNVAGW+SE+MGGVSYLEFL+ LE +V+ DW          R++
Sbjct: 769  LRGSGHSVAAARMGAKLNVAGWISEQMGGVSYLEFLKVLEDQVEKDWSQISSSLEEIRKS 828

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+SKN CLINLTADGKNL NAEK++SKFLD+LP+ S V    W+A+L R+NEA V+PTQV
Sbjct: 829  LLSKNGCLINLTADGKNLNNAEKHISKFLDLLPSTSLVEPAAWNAQLSRSNEAFVVPTQV 888

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL+E GY+LKGSAYVIS Y +N+WLWDRVRVSGGAYGGFC FD+HSGVFSFLS
Sbjct: 889  NYVGKAANLYEAGYELKGSAYVISNYTSNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLS 948

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGTS+FL+ELEMD+DALTKAIIGTIGDV
Sbjct: 949  YRDPNLLKTLDVYDGTSSFLKELEMDNDALTKAIIGTIGDV 989


>ref|XP_003517606.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial
            [Glycine max]
          Length = 1078

 Score =  861 bits (2225), Expect = 0.0
 Identities = 412/521 (79%), Positives = 477/521 (91%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DA+EASMNTIEFSLRENNTGSFPRGL+LML+SIGKWIYDM+PF+PLKY++PL+ LK+
Sbjct: 475  FDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQDLKS 534

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIA+EGSK+VF+PLIEKFIL+NPH+VT+EMQPDPEKA+RDE AEK+ L+KVKASMT EDL
Sbjct: 535  RIAKEGSKSVFSPLIEKFILNNPHQVTVEMQPDPEKAARDEVAEKQILQKVKASMTTEDL 594

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARATHEL+LKQETPDPPEALK VP+LSL+DIPK+PI +PTEVGDING KVLQHDLFT
Sbjct: 595  AELARATHELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDLFT 654

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY E+VFNM SLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 655  NDVLYTEIVFNMKSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFT 714

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRGKEDPCS +++RGKAM+   EDL++LVN V+Q+VQ TDQ+RFKQFVSQS+ARMENR
Sbjct: 715  SSVRGKEDPCSHMVIRGKAMAGHIEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENR 774

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW+SEKMGG+SYLEFL+ LE++VD DW          R++
Sbjct: 775  LRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLRTLEERVDQDWADISSSLEEIRKS 834

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            + SK  CLIN+TAD KNL   EK +SKF+D+LPT+SP+ +T W+ RLP TNEA+VIPTQV
Sbjct: 835  IFSKQGCLINVTADRKNLAKTEKVLSKFVDLLPTSSPIATTTWNVRLPLTNEAIVIPTQV 894

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NY+GKAAN+++TGY+L GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 895  NYIGKAANIYDTGYRLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 954

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGT +FLREL++DDD LTKAIIGTIGDV
Sbjct: 955  YRDPNLLKTLDVYDGTGDFLRELQIDDDTLTKAIIGTIGDV 995


>ref|XP_002518787.1| zinc metalloprotease, putative [Ricinus communis]
            gi|223542168|gb|EEF43712.1| zinc metalloprotease,
            putative [Ricinus communis]
          Length = 774

 Score =  860 bits (2221), Expect = 0.0
 Identities = 424/521 (81%), Positives = 468/521 (89%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F +DAVEASMNTIEFSLRENNTGSFPRGL+LMLRS+GKWIYD DPF+PLKY++PL  LKA
Sbjct: 171  FETDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDRDPFEPLKYEKPLLDLKA 230

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIA+EGSK VF+PLIEKFIL NPH VT+EM+PDPEKASRDE AE+E LEKVK +MTEEDL
Sbjct: 231  RIAKEGSKAVFSPLIEKFILKNPHCVTVEMRPDPEKASRDEVAEREILEKVKGNMTEEDL 290

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPE LK VP+LSL DIPK+PI +PTEVGDING KVL+HDLFT
Sbjct: 291  AELARATQELRLKQETPDPPETLKTVPSLSLNDIPKEPIRVPTEVGDINGVKVLRHDLFT 350

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAEVVFNM  LKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGR+TGGISVYPFT
Sbjct: 351  NDVLYAEVVFNMRPLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRRTGGISVYPFT 410

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSVRG  +PCS IIVRGKAM+ RAEDLF+LVN V+QEVQ TDQ+RFKQFVSQSKARMENR
Sbjct: 411  SSVRGLAEPCSHIIVRGKAMAGRAEDLFDLVNRVLQEVQFTDQQRFKQFVSQSKARMENR 470

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLNVAGW+SE+MGGVSYLEFLQ LE+KVD DW          R +
Sbjct: 471  LRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLQGLEEKVDQDWPLVSSSLEEIRSS 530

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+N CLINLTADGKNL N+EK V KFLD+LP+NS   +  W+ARL   NEA+VIPTQV
Sbjct: 531  LLSRNSCLINLTADGKNLTNSEKLVGKFLDLLPSNSFADNAAWNARLSPGNEAIVIPTQV 590

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKAANL++TGYQL GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 591  NYVGKAANLYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 650

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLDVYDGT +FLR++EMDDD LTKAIIGTIGDV
Sbjct: 651  YRDPNLLKTLDVYDGTGDFLRDIEMDDDTLTKAIIGTIGDV 691


>ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [Amborella trichopoda]
            gi|548835199|gb|ERM97096.1| hypothetical protein
            AMTR_s00126p00013900 [Amborella trichopoda]
          Length = 1075

 Score =  857 bits (2213), Expect = 0.0
 Identities = 414/521 (79%), Positives = 468/521 (89%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F+ +A+EASMNTIEFSLRENNTGSFPRGL+LMLRSIGKWIYDMDPF+PLKY++PL  LKA
Sbjct: 472  FDVEAIEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLNDLKA 531

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            RIAEEGSK VF+PLI+KFIL NPHRVTIEMQPD EKASRDEA EKE+LEKVKASMTEEDL
Sbjct: 532  RIAEEGSKAVFSPLIQKFILDNPHRVTIEMQPDTEKASRDEADEKESLEKVKASMTEEDL 591

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AELARAT EL+LKQETPDPPE LKCVP+LSL DIPK PIH+P E+G+ING KVLQH+LFT
Sbjct: 592  AELARATQELRLKQETPDPPEVLKCVPSLSLHDIPKHPIHVPIEIGEINGVKVLQHELFT 651

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLYAEVVF+M  +KQELLPL+PLFCQSLLEMGTKDMDFVQLNQLIGRKTGGIS+YPFT
Sbjct: 652  NDVLYAEVVFDMCLVKQELLPLIPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISIYPFT 711

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SS+RGK +PCSRIIVR K+M+ R +DLFNLVN V+Q+VQ TDQ+RFKQFV QSKARME+R
Sbjct: 712  SSIRGKVEPCSRIIVRAKSMAARVDDLFNLVNTVLQDVQFTDQQRFKQFVCQSKARMESR 771

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW++E+MGG+SYL+FL+ LEK+VD DW          RR+
Sbjct: 772  LRGSGHGIAAARMDAKLNTAGWIAEQMGGISYLQFLETLEKQVDQDWSAISCSLEDIRRS 831

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTGWSARLPRTNEALVIPTQV 1260
            L+S+  CLINLTADGKNL N+EK+VSKFLD+LP  S + +T W A+L   NEALVIPTQV
Sbjct: 832  LLSRKGCLINLTADGKNLSNSEKHVSKFLDLLPATSSLETTSWKAQLYLGNEALVIPTQV 891

Query: 1261 NYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 1440
            NYVGKA NL++TGYQL GS YVIS Y+ N+WLWDRVRVSGGAYGGFCDFDTHSGVFS+LS
Sbjct: 892  NYVGKAGNLYDTGYQLNGSTYVISMYIGNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLS 951

Query: 1441 YRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            YRDPNLLKTLD+YDGT+NFLRELE+D+D LTKAIIGTIGDV
Sbjct: 952  YRDPNLLKTLDIYDGTANFLRELELDEDTLTKAIIGTIGDV 992


>ref|XP_003636021.1| Presequence protease [Medicago truncatula]
            gi|355501956|gb|AES83159.1| Presequence protease
            [Medicago truncatula]
          Length = 698

 Score =  852 bits (2202), Expect = 0.0
 Identities = 410/522 (78%), Positives = 477/522 (91%), Gaps = 1/522 (0%)
 Frame = +1

Query: 1    FNSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFQPLKYQEPLKALKA 180
            F++DA+EASMNTIEFSLRENNTGSFPRGL+LML+SIGKWIYDM+P +PLKY++PL+ LK+
Sbjct: 94   FDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDLKS 153

Query: 181  RIAEEGSKTVFAPLIEKFILSNPHRVTIEMQPDPEKASRDEAAEKENLEKVKASMTEEDL 360
            +IA+EGSK+VF+PLIEKFIL+N H+VT++MQPDPEKA+R+EA EK+ L++VKASMT EDL
Sbjct: 154  KIAKEGSKSVFSPLIEKFILNNLHKVTVQMQPDPEKAAREEATEKQILQEVKASMTTEDL 213

Query: 361  AELARATHELKLKQETPDPPEALKCVPTLSLEDIPKKPIHIPTEVGDINGTKVLQHDLFT 540
            AEL RAT EL+LKQETPDPPEALK VP+LSL+DIPK+PIH+PTEVGDING KVLQHDLFT
Sbjct: 214  AELTRATQELRLKQETPDPPEALKTVPSLSLQDIPKEPIHVPTEVGDINGVKVLQHDLFT 273

Query: 541  NDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFT 720
            NDVLY ++VF+MSSLKQELLPLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFT
Sbjct: 274  NDVLYTDIVFDMSSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFT 333

Query: 721  SSVRGKEDPCSRIIVRGKAMSERAEDLFNLVNCVIQEVQLTDQKRFKQFVSQSKARMENR 900
            SSV+GKEDPCS +IVRGKAM+ RAEDL++LVN V+Q+VQ TDQ+RFKQFVSQS+ARMENR
Sbjct: 334  SSVQGKEDPCSHMIVRGKAMAGRAEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENR 393

Query: 901  LRGSGHGIAAARMDAKLNVAGWVSEKMGGVSYLEFLQALEKKVDDDWXXXXXXXXXXRRT 1080
            LRGSGHGIAAARMDAKLN AGW+SEKMGG+SYLEFLQ LEK++D DW          R+T
Sbjct: 394  LRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLQTLEKRIDQDWADISSSLEEIRKT 453

Query: 1081 LISKNDCLINLTADGKNLQNAEKYVSKFLDMLPTNSPVGSTG-WSARLPRTNEALVIPTQ 1257
            + SK  CLIN+TADGKNL N +K+VSKF+DMLPT+SP+ +   W+ RLP TNEA+VIPTQ
Sbjct: 454  VFSKQGCLINITADGKNLANTDKFVSKFVDMLPTSSPIATPNIWNVRLPLTNEAIVIPTQ 513

Query: 1258 VNYVGKAANLFETGYQLKGSAYVISKYLNNSWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 1437
            VNYVGKA N+++ GY+L GSAYVISKY++N+WLWDRVRVSGGAYGGFCDFDTHSGVFSFL
Sbjct: 514  VNYVGKATNVYDAGYKLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFL 573

Query: 1438 SYRDPNLLKTLDVYDGTSNFLRELEMDDDALTKAIIGTIGDV 1563
            SYRDPNLLKTL+VYDGT +FLRELE+DDD LTKAIIGTIGDV
Sbjct: 574  SYRDPNLLKTLEVYDGTGDFLRELEIDDDTLTKAIIGTIGDV 615

