BLASTX nr result

ID: Catharanthus22_contig00000767 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00000767
         (3593 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006346464.1| PREDICTED: presequence protease 1, chloropla...  1696   0.0  
ref|XP_004230817.1| PREDICTED: presequence protease 1, chloropla...  1691   0.0  
ref|XP_002282024.1| PREDICTED: presequence protease 2, chloropla...  1682   0.0  
emb|CBI32433.3| unnamed protein product [Vitis vinifera]             1671   0.0  
ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence ...  1667   0.0  
ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citr...  1666   0.0  
ref|XP_004136986.1| PREDICTED: presequence protease 1, chloropla...  1645   0.0  
ref|XP_004159889.1| PREDICTED: LOW QUALITY PROTEIN: presequence ...  1645   0.0  
ref|XP_004296078.1| PREDICTED: presequence protease 1, chloropla...  1642   0.0  
ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Popu...  1634   0.0  
ref|XP_003517606.1| PREDICTED: presequence protease 2, chloropla...  1631   0.0  
gb|EMJ02012.1| hypothetical protein PRUPE_ppa025698mg, partial [...  1630   0.0  
gb|EOX98216.1| Presequence protease 2 isoform 2 [Theobroma cacao]    1627   0.0  
ref|XP_002330286.1| predicted protein [Populus trichocarpa]          1622   0.0  
gb|ESW29233.1| hypothetical protein PHAVU_002G054400g [Phaseolus...  1616   0.0  
ref|XP_004511282.1| PREDICTED: presequence protease 1, chloropla...  1615   0.0  
gb|EOX98215.1| Presequence protease 2 isoform 1 [Theobroma cacao]    1606   0.0  
gb|EOX98217.1| Presequence protease 2 isoform 3 [Theobroma cacao]    1605   0.0  
ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [A...  1592   0.0  
ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Popu...  1592   0.0  

>ref|XP_006346464.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum tuberosum]
          Length = 1072

 Score = 1696 bits (4393), Expect = 0.0
 Identities = 849/1080 (78%), Positives = 954/1080 (88%), Gaps = 4/1080 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAKRCRLLPNLHRR-SLLRSYLP 327
            MERAVLLRSL ST+ ++ +R+F+RS HR A     S  A+R RLL NLHRR SL+RS + 
Sbjct: 1    MERAVLLRSLSSTSSLAFSRIFSRSSHRFA-----SYSARRHRLLQNLHRRRSLVRSNVR 55

Query: 328  VLSNRSPNFSSLRTQFSSQSVRAIATSAPQS--EVFGADDDVAEKLGFEKVSEEFIEECK 501
             +S+      +L+ QF   SVRAIATS+PQS  E  GADD+VAEK GFEKVSE+FI+ECK
Sbjct: 56   GISSSI----NLKRQFYPLSVRAIATSSPQSSQEFLGADDEVAEKFGFEKVSEQFIDECK 111

Query: 502  SRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPLKE 681
            S+A+LYKHKKTGAEVMSVSNDDENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYPLKE
Sbjct: 112  SKAVLYKHKKTGAEVMSVSNDDENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKE 171

Query: 682  PFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTFQQ 861
            PFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTFQQ
Sbjct: 172  PFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQ 231

Query: 862  EGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPEVI 1041
            EGWHYELNDPS+DI +KGVVFNEMKGVYSQPD++LGRTSQQALFP+NTYGVDSGGDP VI
Sbjct: 232  EGWHYELNDPSDDITFKGVVFNEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVI 291

Query: 1042 PKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQPQK 1221
            P L+FEEFKEFHRK+YHPSNARIWFYGDDDPNERLRILSEYL+MF+ SSAP ESRV+PQ+
Sbjct: 292  PSLSFEEFKEFHRKFYHPSNARIWFYGDDDPNERLRILSEYLNMFDASSAPQESRVEPQR 351

Query: 1222 LFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPASPL 1398
            LFSEPVR++EKYP  E  DL+KKHMVCVNWLLS++PLDLETELALGFLDHLL+GTPASPL
Sbjct: 352  LFSEPVRIVEKYPVGEDGDLKKKHMVCVNWLLSDKPLDLETELALGFLDHLLLGTPASPL 411

Query: 1399 RKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGFDS 1578
            RKILLESG GDA+VGGG+EDELLQPQFSIGLKGV +++IQ+VEELIM+TL+ L E+GFD 
Sbjct: 412  RKILLESGFGDAIVGGGIEDELLQPQFSIGLKGVSEENIQKVEELIMSTLEGLVEKGFDL 471

Query: 1579 DAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKARIA 1758
            DAVEASMNTIEF+LRENNTGSFPRGLALMLR++GKW+YDM+PF PLKY+KPL  LKARIA
Sbjct: 472  DAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKARIA 531

Query: 1759 EQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLAEL 1938
            ++GSKAVF+PL+ +YIL NPHRVTVEMQPDPEKASR+E+ EKE L KVKASMT+EDLAEL
Sbjct: 532  KEGSKAVFAPLMDQYILRNPHRVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDLAEL 591

Query: 1939 ARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTNDV 2118
            ARAT ELRLKQETPDPPEALK VPSLSLQDIP++PV VP E+GDINGVKVL+HDLFTNDV
Sbjct: 592  ARATHELRLKQETPDPPEALKSVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFTNDV 651

Query: 2119 LYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTSTV 2298
            LY+EVVF++SSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGG+SVYPFTS+V
Sbjct: 652  LYAEVVFNLSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFTSSV 711

Query: 2299 RGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRVRG 2478
             GK EPCS I+VRGKAMS RTEDLF LIN +LQDVQL DQKRFKQFVSQS++RMENR+RG
Sbjct: 712  HGKVEPCSKIIVRGKAMSQRTEDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENRLRG 771

Query: 2479 SGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSLFS 2658
            SGH IAAARM AKLN AGWISEQMGG+SYLEFL+ LE++V+KDWP+I+SSLEEIR+SL S
Sbjct: 772  SGHSIAAARMGAKLNVAGWISEQMGGVSYLEFLKVLEDQVEKDWPQISSSLEEIRKSLLS 831

Query: 2659 KSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVNYV 2838
            K+GCLINLT+DGKNL NAEKH+S+F              W A+L  +NEA V+PTQVNYV
Sbjct: 832  KNGCLINLTADGKNLNNAEKHISEFLDLLPSTSLVESAAWNAQLSRSNEAFVVPTQVNYV 891

Query: 2839 GKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSYRD 3018
            GKAANLYE GY+LKGSAYVIS YISNTWLWD VRVSGGAYGGFC FD+HSGVFSFLSYRD
Sbjct: 892  GKAANLYEAGYELKGSAYVISNYISNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLSYRD 951

Query: 3019 PNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLGVT 3198
            PNLLKTL++YDGTS FL+EL+MDDDALT+AIIGTIGDVDSYQLPDAKGY+SLLRYLLGVT
Sbjct: 952  PNLLKTLDVYDGTSSFLKELEMDDDALTKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGVT 1011

Query: 3199 XXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVKKA 3378
                      ILST L+DF++F DV+E              DDVEAANKE S+F +VKKA
Sbjct: 1012 DEERQRRREEILSTSLEDFRKFGDVMEAVKDKGVVVAVASPDDVEAANKERSNFLEVKKA 1071


>ref|XP_004230817.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum lycopersicum]
          Length = 1072

 Score = 1691 bits (4378), Expect = 0.0
 Identities = 846/1080 (78%), Positives = 954/1080 (88%), Gaps = 4/1080 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAKRCRLLPNLHRR-SLLRSYLP 327
            MERAVLLRSL ST+ ++ +R+F+RS HR A     S  A+R RLL NL RR SL+RS + 
Sbjct: 1    MERAVLLRSLSSTSTLAFSRIFSRSSHRFA-----SYSARRHRLLQNLQRRRSLVRSNVR 55

Query: 328  VLSNRSPNFSSLRTQFSSQSVRAIATSAPQS--EVFGADDDVAEKLGFEKVSEEFIEECK 501
             +S+      +L+ QF   SVRAIATS+PQS  E  GADD+VAEK GFEKVSE+FI+ECK
Sbjct: 56   GISSSI----NLKRQFYPLSVRAIATSSPQSSQEFLGADDEVAEKFGFEKVSEQFIDECK 111

Query: 502  SRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPLKE 681
            S+A+LYKHKKTGAEVMSVSNDDENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYPLKE
Sbjct: 112  SKAVLYKHKKTGAEVMSVSNDDENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKE 171

Query: 682  PFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTFQQ 861
            PFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTFQQ
Sbjct: 172  PFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQ 231

Query: 862  EGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPEVI 1041
            EGWHYELNDPS++I +KGVVFNEMKGVYSQPD++LGRTSQQALFP+NTYGVDSGGDP VI
Sbjct: 232  EGWHYELNDPSDEITFKGVVFNEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVI 291

Query: 1042 PKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQPQK 1221
            P L+FE+FKEFHRK+YHPSNARIWFYGDDDPNERLRILSEYL+MF+ SSAP ESRV+PQ+
Sbjct: 292  PSLSFEDFKEFHRKFYHPSNARIWFYGDDDPNERLRILSEYLNMFDASSAPHESRVEPQR 351

Query: 1222 LFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPASPL 1398
            LFSEPVR++EKYP  E  DL+KKHMVCVNWLLS++PLDLETELALGFLDHLL+GTPASPL
Sbjct: 352  LFSEPVRIVEKYPVGEDGDLKKKHMVCVNWLLSDKPLDLETELALGFLDHLLLGTPASPL 411

Query: 1399 RKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGFDS 1578
            RKILLESGLGDA+VGGG+EDELLQPQFSIGLKGV +++IQ+VEELIM+TL+ LAE+GFDS
Sbjct: 412  RKILLESGLGDAIVGGGIEDELLQPQFSIGLKGVSEENIQKVEELIMSTLQGLAEKGFDS 471

Query: 1579 DAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKARIA 1758
            DAVEASMNTIEF+LRENNTGSFPRGLALMLR++GKW+YDM+PF PLKY+KPL  LKARIA
Sbjct: 472  DAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKARIA 531

Query: 1759 EQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLAEL 1938
            ++GSKAVF+PL+ +YIL NPHRVTVEMQPDPEKASR+E+ EKE L KVKASMT+EDLAEL
Sbjct: 532  KEGSKAVFAPLMDQYILRNPHRVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDLAEL 591

Query: 1939 ARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTNDV 2118
            ARAT ELRLKQETPDPPEALK VPSLSLQDIP++PV VP E+GDINGVKVL+HDLFTNDV
Sbjct: 592  ARATHELRLKQETPDPPEALKSVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFTNDV 651

Query: 2119 LYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTSTV 2298
            LY+EVVF++SSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGG+SVYPFTS+V
Sbjct: 652  LYAEVVFNLSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFTSSV 711

Query: 2299 RGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRVRG 2478
             GK EPCS I+VRGKAMS RTEDLF LIN +LQDVQL DQKRFKQFVSQS++RMENR+RG
Sbjct: 712  HGKVEPCSKIIVRGKAMSQRTEDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENRLRG 771

Query: 2479 SGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSLFS 2658
            SGH +AAARM AKLN AGWISEQMGG+SYLEFL+ LE++V+KDW +I+SSLEEIR+SL S
Sbjct: 772  SGHSVAAARMGAKLNVAGWISEQMGGVSYLEFLKVLEDQVEKDWSQISSSLEEIRKSLLS 831

Query: 2659 KSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVNYV 2838
            K+GCLINLT+DGKNL NAEKH+SKF              W A+L  +NEA V+PTQVNYV
Sbjct: 832  KNGCLINLTADGKNLNNAEKHISKFLDLLPSTSLVEPAAWNAQLSRSNEAFVVPTQVNYV 891

Query: 2839 GKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSYRD 3018
            GKAANLYE GY+LKGSAYVIS Y SNTWLWD VRVSGGAYGGFC FD+HSGVFSFLSYRD
Sbjct: 892  GKAANLYEAGYELKGSAYVISNYTSNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLSYRD 951

Query: 3019 PNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLGVT 3198
            PNLLKTL++YDGTS FL+EL+MD+DALT+AIIGTIGDVDSYQLPDAKGY+SLLRYLLGVT
Sbjct: 952  PNLLKTLDVYDGTSSFLKELEMDNDALTKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGVT 1011

Query: 3199 XXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVKKA 3378
                      ILST L+DF++F DV+E              DDVEAANKE S+F +VKKA
Sbjct: 1012 DEERQRRREEILSTSLEDFRKFGDVMEAVKDKGVVVAVASPDDVEAANKERSNFLEVKKA 1071


>ref|XP_002282024.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial-like
            [Vitis vinifera]
          Length = 1080

 Score = 1682 bits (4357), Expect = 0.0
 Identities = 844/1082 (78%), Positives = 944/1082 (87%), Gaps = 6/1082 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRAS---SIYAKRCRLLPNLHRRSLLRSY 321
            MERA LLRS+  +T ++  R F RS HRL+ P AS   S+     R    L RRS+LR +
Sbjct: 1    MERAALLRSITCST-LACNRFFLRSSHRLSLPSASFSSSLSRSHHRSFGTLTRRSVLRRH 59

Query: 322  LPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQ--SEVFGADDDVAEKLGFEKVSEEFIEE 495
              +L + S +  S R  FSS S +AIATS  Q  S+  G+ DD+AEK GF+KVSE+FI+E
Sbjct: 60   WRLLPSSS-SIPSTRC-FSSLSPKAIATSPEQASSDAVGSQDDLAEKYGFDKVSEQFIQE 117

Query: 496  CKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPL 675
            CKS+A+LYKHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPL
Sbjct: 118  CKSKAVLYKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPL 177

Query: 676  KEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTF 855
            KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAV FP+CVED+QTF
Sbjct: 178  KEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVLFPKCVEDFQTF 237

Query: 856  QQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPE 1035
            QQEGWHYELN+PSEDI YKGVVFNEMKGVYSQPD+ILGRT+QQALFP+NTYGVDSGGDP+
Sbjct: 238  QQEGWHYELNNPSEDISYKGVVFNEMKGVYSQPDNILGRTAQQALFPDNTYGVDSGGDPK 297

Query: 1036 VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQP 1215
            VIPKLTFE+FKEFHRKYYHP NARIWFYGDDDPNERLRIL+EYLD+F+ S A  ES+V+P
Sbjct: 298  VIPKLTFEDFKEFHRKYYHPGNARIWFYGDDDPNERLRILNEYLDLFDTSPASSESKVEP 357

Query: 1216 QKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPAS 1392
            QKLFS PVR++EKYPA +G DL KKHMVC+NWLLS++PLDLETEL LGFLDHL++GTPAS
Sbjct: 358  QKLFSNPVRIVEKYPAGKGGDLRKKHMVCLNWLLSDKPLDLETELTLGFLDHLMLGTPAS 417

Query: 1393 PLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGF 1572
            PLRKILLESGLGDA+VGGG+EDELLQPQFSIGLKGV +DDI +VEEL+M+TLK LA+EGF
Sbjct: 418  PLRKILLESGLGDAIVGGGMEDELLQPQFSIGLKGVSEDDIHKVEELVMSTLKSLAKEGF 477

Query: 1573 DSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKAR 1752
            +S+AVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM+PF PLKY+KPLM LKAR
Sbjct: 478  NSEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMALKAR 537

Query: 1753 IAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLA 1932
            IAE+GSKAVFSPLI +YILNNPH VTVEMQPDPEKASRDE  E+E L KVKA MTEEDLA
Sbjct: 538  IAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAVEREILEKVKAGMTEEDLA 597

Query: 1933 ELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTN 2112
            ELARAT+ELRLKQETPDPPEALK VPSLSL DIPK+P+HVPIE+G IN VKVL+HDLFTN
Sbjct: 598  ELARATQELRLKQETPDPPEALKSVPSLSLLDIPKEPIHVPIEIGVINDVKVLRHDLFTN 657

Query: 2113 DVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTS 2292
            DVLY+E+VFDMSSLKQ+LLPLVPLFCQSL+EMGTKD+DFVQLNQLIGRKTGGISVYPFTS
Sbjct: 658  DVLYTEIVFDMSSLKQDLLPLVPLFCQSLMEMGTKDMDFVQLNQLIGRKTGGISVYPFTS 717

Query: 2293 TVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRV 2472
            +VRGKE PCSHI+VRGKAM+   EDLFNL+NCILQ+VQ TDQ+RFKQFVSQSKARMENR+
Sbjct: 718  SVRGKEYPCSHIIVRGKAMAGCAEDLFNLVNCILQEVQFTDQQRFKQFVSQSKARMENRL 777

Query: 2473 RGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSL 2652
            RGSGHGIAAARMDAKLN+AGWI+EQMGG+SYLEFL++LEEKVD+DW  I+SSLEEIR+SL
Sbjct: 778  RGSGHGIAAARMDAKLNTAGWIAEQMGGVSYLEFLQALEEKVDQDWIGISSSLEEIRKSL 837

Query: 2653 FSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVN 2832
             S+ GCLIN+TS+GKNL N+EK+VSKF              W  RL   NEAIVIPTQVN
Sbjct: 838  LSRKGCLINMTSEGKNLMNSEKYVSKFLDLLPGSSSVEKTTWNGRLSSENEAIVIPTQVN 897

Query: 2833 YVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSY 3012
            YVGKA N+Y+TGYQLKGSAYVISKYISNTWLWD VRVSGGAYGGFCDFDTHSGVFSFLSY
Sbjct: 898  YVGKATNIYDTGYQLKGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSY 957

Query: 3013 RDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLG 3192
            RDPNLLKTL++YDGT  FLR+L+MDDD LT+AIIGTIGDVD+YQLPDAKGY+SLLRYLLG
Sbjct: 958  RDPNLLKTLDVYDGTGDFLRQLEMDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRYLLG 1017

Query: 3193 VTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVK 3372
            VT          ILST LKDFKEFAD IE              DDV+AANKE  +FFQVK
Sbjct: 1018 VTEEERQKRREEILSTSLKDFKEFADAIEAAKHKGVVVAVASPDDVDAANKEHPNFFQVK 1077

Query: 3373 KA 3378
            KA
Sbjct: 1078 KA 1079


>emb|CBI32433.3| unnamed protein product [Vitis vinifera]
          Length = 1098

 Score = 1671 bits (4328), Expect = 0.0
 Identities = 844/1100 (76%), Positives = 944/1100 (85%), Gaps = 24/1100 (2%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRAS---SIYAKRCRLLPNLHRRSLLRSY 321
            MERA LLRS+  +T ++  R F RS HRL+ P AS   S+     R    L RRS+LR +
Sbjct: 1    MERAALLRSITCST-LACNRFFLRSSHRLSLPSASFSSSLSRSHHRSFGTLTRRSVLRRH 59

Query: 322  LPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQ--SEVFGADDDVAEKLGFEKVSEEFIEE 495
              +L + S +  S R  FSS S +AIATS  Q  S+  G+ DD+AEK GF+KVSE+FI+E
Sbjct: 60   WRLLPSSS-SIPSTRC-FSSLSPKAIATSPEQASSDAVGSQDDLAEKYGFDKVSEQFIQE 117

Query: 496  CKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPL 675
            CKS+A+LYKHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPL
Sbjct: 118  CKSKAVLYKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPL 177

Query: 676  KEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTF 855
            KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAV FP+CVED+QTF
Sbjct: 178  KEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVLFPKCVEDFQTF 237

Query: 856  QQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQA---------------- 987
            QQEGWHYELN+PSEDI YKGVVFNEMKGVYSQPD+ILGRT+QQA                
Sbjct: 238  QQEGWHYELNNPSEDISYKGVVFNEMKGVYSQPDNILGRTAQQASFLDKYGVCGYEEPIG 297

Query: 988  --LFPNNTYGVDSGGDPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSE 1161
              LFP+NTYGVDSGGDP+VIPKLTFE+FKEFHRKYYHP NARIWFYGDDDPNERLRIL+E
Sbjct: 298  SALFPDNTYGVDSGGDPKVIPKLTFEDFKEFHRKYYHPGNARIWFYGDDDPNERLRILNE 357

Query: 1162 YLDMFEGSSAPLESRVQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLE 1338
            YLD+F+ S A  ES+V+PQKLFS PVR++EKYPA +G DL KKHMVC+NWLLS++PLDLE
Sbjct: 358  YLDLFDTSPASSESKVEPQKLFSNPVRIVEKYPAGKGGDLRKKHMVCLNWLLSDKPLDLE 417

Query: 1339 TELALGFLDHLLMGTPASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQ 1518
            TEL LGFLDHL++GTPASPLRKILLESGLGDA+VGGG+EDELLQPQFSIGLKGV +DDI 
Sbjct: 418  TELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGMEDELLQPQFSIGLKGVSEDDIH 477

Query: 1519 RVEELIMNTLKKLAEEGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDM 1698
            +VEEL+M+TLK LA+EGF+S+AVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM
Sbjct: 478  KVEELVMSTLKSLAKEGFNSEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDM 537

Query: 1699 NPFVPLKYKKPLMDLKARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEK 1878
            +PF PLKY+KPLM LKARIAE+GSKAVFSPLI +YILNNPH VTVEMQPDPEKASRDE  
Sbjct: 538  DPFEPLKYEKPLMALKARIAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAV 597

Query: 1879 EKENLSKVKASMTEEDLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPI 2058
            E+E L KVKA MTEEDLAELARAT+ELRLKQETPDPPEALK VPSLSL DIPK+P+HVPI
Sbjct: 598  EREILEKVKAGMTEEDLAELARATQELRLKQETPDPPEALKSVPSLSLLDIPKEPIHVPI 657

Query: 2059 EVGDINGVKVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQL 2238
            E+G IN VKVL+HDLFTNDVLY+E+VFDMSSLKQ+LLPLVPLFCQSL+EMGTKD+DFVQL
Sbjct: 658  EIGVINDVKVLRHDLFTNDVLYTEIVFDMSSLKQDLLPLVPLFCQSLMEMGTKDMDFVQL 717

Query: 2239 NQLIGRKTGGISVYPFTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQ 2418
            NQLIGRKTGGISVYPFTS+VRGKE PCSHI+VRGKAM+   EDLFNL+NCILQ+VQ TDQ
Sbjct: 718  NQLIGRKTGGISVYPFTSSVRGKEYPCSHIIVRGKAMAGCAEDLFNLVNCILQEVQFTDQ 777

Query: 2419 KRFKQFVSQSKARMENRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKV 2598
            +RFKQFVSQSKARMENR+RGSGHGIAAARMDAKLN+AGWI+EQMGG+SYLEFL++LEEKV
Sbjct: 778  QRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNTAGWIAEQMGGVSYLEFLQALEEKV 837

Query: 2599 DKDWPEIASSLEEIRRSLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXW 2778
            D+DW  I+SSLEEIR+SL S+ GCLIN+TS+GKNL N+EK+VSKF              W
Sbjct: 838  DQDWIGISSSLEEIRKSLLSRKGCLINMTSEGKNLMNSEKYVSKFLDLLPGSSSVEKTTW 897

Query: 2779 TARLPFTNEAIVIPTQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAY 2958
              RL   NEAIVIPTQVNYVGKA N+Y+TGYQLKGSAYVISKYISNTWLWD VRVSGGAY
Sbjct: 898  NGRLSSENEAIVIPTQVNYVGKATNIYDTGYQLKGSAYVISKYISNTWLWDRVRVSGGAY 957

Query: 2959 GGFCDFDTHSGVFSFLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDS 3138
            GGFCDFDTHSGVFSFLSYRDPNLLKTL++YDGT  FLR+L+MDDD LT+AIIGTIGDVD+
Sbjct: 958  GGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRQLEMDDDTLTKAIIGTIGDVDA 1017

Query: 3139 YQLPDAKGYTSLLRYLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXX 3318
            YQLPDAKGY+SLLRYLLGVT          ILST LKDFKEFAD IE             
Sbjct: 1018 YQLPDAKGYSSLLRYLLGVTEEERQKRREEILSTSLKDFKEFADAIEAAKHKGVVVAVAS 1077

Query: 3319 RDDVEAANKECSDFFQVKKA 3378
             DDV+AANKE  +FFQVKKA
Sbjct: 1078 PDDVDAANKEHPNFFQVKKA 1097


>ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence protease 2,
            chloroplastic/mitochondrial-like [Citrus sinensis]
          Length = 1082

 Score = 1667 bits (4316), Expect = 0.0
 Identities = 843/1082 (77%), Positives = 939/1082 (86%), Gaps = 6/1082 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAKRC---RLLPNLHRRSLLRSY 321
            MERA LLRSL  T++ S  R + RS    A   +SS+   R    RL+ NL RRSLLR  
Sbjct: 1    MERAALLRSLSCTSLASN-RFYFRSFVPRAKFSSSSVAVARRNHHRLINNLTRRSLLRGD 59

Query: 322  LPVLSNRSPNFSSLRTQFSSQSVRAIAT-SAPQS-EVFGADDDVAEKLGFEKVSEEFIEE 495
              +  + S         FSS S RA+A+ S P S EV    ++VAEKLGFEKVSEEFI E
Sbjct: 60   SRLHLSLSSYSLQFNKHFSSLSPRAVASPSTPSSPEVAEVSNEVAEKLGFEKVSEEFIGE 119

Query: 496  CKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPL 675
            CKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPL
Sbjct: 120  CKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPL 179

Query: 676  KEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTF 855
            KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTF
Sbjct: 180  KEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTF 239

Query: 856  QQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPE 1035
            QQEGWH++L++PSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+N YGVDSGGDP+
Sbjct: 240  QQEGWHFKLDNPSEDITYKGVVFNEMKGVYSQPDNILGRAAQQALFPDNAYGVDSGGDPK 299

Query: 1036 VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQP 1215
            VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYL+MFE SSAP ES V+ 
Sbjct: 300  VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLNMFEASSAPNESIVEK 359

Query: 1216 QKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPAS 1392
            QKLFSEPVR+IEKYPA +  D++KK+MVC+NWLLS++PLDLETELALGFLDHL++GTPAS
Sbjct: 360  QKLFSEPVRIIEKYPAGDAGDIKKKNMVCLNWLLSDKPLDLETELALGFLDHLMLGTPAS 419

Query: 1393 PLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGF 1572
            PLRKILLESGLGDA+VGGG+EDELLQPQFSIGLK V +DDIQ VEELIM+TLKKLA+EGF
Sbjct: 420  PLRKILLESGLGDAIVGGGIEDELLQPQFSIGLKNVSEDDIQTVEELIMDTLKKLADEGF 479

Query: 1573 DSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKAR 1752
            DSDAVEASMNTIEF+LRENNTGSFPRGL+LMLR+MGKWIYDMNPF PLKY+KPLM LKAR
Sbjct: 480  DSDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKAR 539

Query: 1753 IAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLA 1932
            +AE+GSKAVFSPLI +YILNNPH VTVEMQPDPEKASRDE  EKE L+KVK+SMT+EDLA
Sbjct: 540  LAEEGSKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDLA 599

Query: 1933 ELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTN 2112
            ELARATEELRLKQETPDPPEAL+ VPSLSL+DIPK+P+ VP EVGDINGVKVLQHDLFTN
Sbjct: 600  ELARATEELRLKQETPDPPEALRSVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFTN 659

Query: 2113 DVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTS 2292
            DVLY+EVVFDMSSLKQELLPL+PLFCQSL EMGTKDL FVQLNQLIGRKTGGISVYPFTS
Sbjct: 660  DVLYTEVVFDMSSLKQELLPLIPLFCQSLKEMGTKDLSFVQLNQLIGRKTGGISVYPFTS 719

Query: 2293 TVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRV 2472
            ++RGKE+PC  +VVRGKAM+ + EDLFNL NC+LQ+VQLTDQ+RFKQFVSQSKARMENR+
Sbjct: 720  SIRGKEDPCCCMVVRGKAMAGQAEDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENRL 779

Query: 2473 RGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSL 2652
            RGSGHGIAAARMDAKLN+AGWISEQMGG+SYLEFL++LEEKVD+DW  I+SSLEEIRRS 
Sbjct: 780  RGSGHGIAAARMDAKLNTAGWISEQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRSF 839

Query: 2653 FSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVN 2832
             S+ GCLIN+T+DGKNL N+E+ V KF              W A LP  NEAIVIPTQVN
Sbjct: 840  LSREGCLINMTADGKNLKNSERFVGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQVN 899

Query: 2833 YVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSY 3012
            YVGKAAN++ETGY+L GSAYVISK+ISN WLWD VRVSGGAYGGFCDFD+HSGVFSFLSY
Sbjct: 900  YVGKAANIFETGYKLNGSAYVISKHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSY 959

Query: 3013 RDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLG 3192
            RDPNLLKTL+IYDGT  FLREL+MDDD LT+AIIGTIGDVD+YQLPDAKGY+SLLR+LLG
Sbjct: 960  RDPNLLKTLDIYDGTVDFLRELEMDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRHLLG 1019

Query: 3193 VTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVK 3372
            +T          ILST LKDFKEFADV+E              DDV+AANKE ++ F+VK
Sbjct: 1020 ITEEERQRRREEILSTSLKDFKEFADVLEAIKDRGVAVAVASPDDVDAANKERANLFEVK 1079

Query: 3373 KA 3378
            KA
Sbjct: 1080 KA 1081


>ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citrus clementina]
            gi|557524981|gb|ESR36287.1| hypothetical protein
            CICLE_v10027722mg [Citrus clementina]
          Length = 1082

 Score = 1666 bits (4314), Expect = 0.0
 Identities = 842/1082 (77%), Positives = 939/1082 (86%), Gaps = 6/1082 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAKRC---RLLPNLHRRSLLRSY 321
            MERA LLRSL  T++ S  R + RS    A   +SS+   R    RL+ NL RRSLLR  
Sbjct: 1    MERAALLRSLSCTSLASN-RFYFRSFVPRAKFSSSSVAVARRNHHRLINNLTRRSLLRGD 59

Query: 322  LPVLSNRSPNFSSLRTQFSSQSVRAIAT-SAPQS-EVFGADDDVAEKLGFEKVSEEFIEE 495
              +  + S         FSS S RA+A+ S P S EV    ++VAEKLGFEKVSEEFI E
Sbjct: 60   SRLRFSLSSYSLQFNKHFSSLSPRAVASPSTPSSPEVAEVSNEVAEKLGFEKVSEEFIGE 119

Query: 496  CKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPL 675
            CKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPL
Sbjct: 120  CKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPL 179

Query: 676  KEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTF 855
            KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTF
Sbjct: 180  KEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTF 239

Query: 856  QQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPE 1035
            QQEGWH+EL++PSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+N YGVDSGGDP+
Sbjct: 240  QQEGWHFELDNPSEDITYKGVVFNEMKGVYSQPDNILGRAAQQALFPDNAYGVDSGGDPK 299

Query: 1036 VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQP 1215
            VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYL+MFE SSAP ES V+ 
Sbjct: 300  VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLNMFEASSAPNESIVEK 359

Query: 1216 QKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPAS 1392
            QKLFSEPVR+IEKYPA +  D++KK+MVC+NWLLS++PLDLETELALGFLDHL++GTPAS
Sbjct: 360  QKLFSEPVRIIEKYPAGDAGDIKKKNMVCLNWLLSDKPLDLETELALGFLDHLMLGTPAS 419

Query: 1393 PLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGF 1572
            PLRKILLESGLGDA+VGGG+EDELLQPQFSIGLK V +DDIQ+VEELIM+TLKKLA+EGF
Sbjct: 420  PLRKILLESGLGDAIVGGGIEDELLQPQFSIGLKNVSEDDIQKVEELIMDTLKKLADEGF 479

Query: 1573 DSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKAR 1752
            DSDAVEASMNTIEF+LRENNTGSFPRGL+LMLR+MGKWIYDMNPF PLKY+KPLM LKAR
Sbjct: 480  DSDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKAR 539

Query: 1753 IAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLA 1932
            +AE+G KAVFSPLI +YILNNPH VTVEMQPDPEKASRDE  EKE L+KVK+SMT+EDLA
Sbjct: 540  LAEEGPKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDLA 599

Query: 1933 ELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTN 2112
            ELARATEELRLKQETPDPPEAL+ VPSLSL+DIPK+P+ VP EVGDINGVKVLQHDLFTN
Sbjct: 600  ELARATEELRLKQETPDPPEALRSVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFTN 659

Query: 2113 DVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTS 2292
            DVLY+EVVFDMSSLKQELLPL+PLFCQSL EMGTKDL FVQL+QLIGRKTGGISVYPFTS
Sbjct: 660  DVLYTEVVFDMSSLKQELLPLIPLFCQSLKEMGTKDLSFVQLDQLIGRKTGGISVYPFTS 719

Query: 2293 TVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRV 2472
            ++RGKE+PC  +VVRGKAM+ + EDLFNL NC+LQ+VQLTDQ+RFKQFVSQSKARMENR+
Sbjct: 720  SIRGKEDPCCCMVVRGKAMAGQAEDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENRL 779

Query: 2473 RGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSL 2652
            RGSGHGIAAARMDAKLN+AGWISEQMGG+SYLEFL++LEEKVD+DW  I+SSLEEIRRS 
Sbjct: 780  RGSGHGIAAARMDAKLNTAGWISEQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRSF 839

Query: 2653 FSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVN 2832
             S+ GCLIN+T+DGKNL N+E+ V KF              W A LP  NEAIVIPTQVN
Sbjct: 840  LSREGCLINITADGKNLKNSERFVGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQVN 899

Query: 2833 YVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSY 3012
            YVGKAAN++ETGY+L GSAYVISK+ISN WLWD VRVSGGAYGGFCDFD+HSGVFSFLSY
Sbjct: 900  YVGKAANIFETGYKLNGSAYVISKHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSY 959

Query: 3013 RDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLG 3192
            RDPNLLKTL+IYDGT  FLREL+MDDD LT+AIIGTIGDVD+YQLPDAKGY+SLLR+LLG
Sbjct: 960  RDPNLLKTLDIYDGTVDFLRELEMDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRHLLG 1019

Query: 3193 VTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVK 3372
            +T          ILST LKDFKEFADV+E              DDV+AANKE ++ F+VK
Sbjct: 1020 ITEEERQRRREEILSTSLKDFKEFADVLEAIKDRGVAVAVASPDDVDAANKERANLFEVK 1079

Query: 3373 KA 3378
            KA
Sbjct: 1080 KA 1081


>ref|XP_004136986.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cucumis sativus]
          Length = 1084

 Score = 1645 bits (4261), Expect = 0.0
 Identities = 823/1084 (75%), Positives = 926/1084 (85%), Gaps = 8/1084 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAP----PRASSIYAKRCRLLPNLHRRSLLRS 318
            ME++V LRSL  ++++   R+F RS HRL P    PR+S +  K  R  P+  RRSLL  
Sbjct: 1    MEKSVFLRSLTCSSLVCN-RIFFRSAHRLCPSTLPPRSSFVSRKLHRFNPSFSRRSLLPR 59

Query: 319  YLPVLSNRSPNFSS-LRTQFSSQSVRAIATSAPQS--EVFGADDDVAEKLGFEKVSEEFI 489
             L +L   S + SS  R QFSS + RA+A+    S  E     D+VAEKLGFEKVSEEFI
Sbjct: 60   QLKLLPAYSQSRSSHFRKQFSSLAPRAVASPPAHSPPEFAEVSDEVAEKLGFEKVSEEFI 119

Query: 490  EECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 669
             ECKS+A+L++HKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY
Sbjct: 120  GECKSKAVLFRHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 179

Query: 670  PLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQ 849
            P+KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED++
Sbjct: 180  PVKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFK 239

Query: 850  TFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGD 1029
            TFQQEGWHYELNDPSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+NTYGVDSGGD
Sbjct: 240  TFQQEGWHYELNDPSEDISYKGVVFNEMKGVYSQPDNILGRVTQQALFPDNTYGVDSGGD 299

Query: 1030 PEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRV 1209
            P VIPKLTFEEFKEFH K+YHP NARIWFYGDDDP ERLRIL +YLDMF+ S    +S++
Sbjct: 300  PRVIPKLTFEEFKEFHSKFYHPGNARIWFYGDDDPVERLRILKDYLDMFDASPVSDQSKI 359

Query: 1210 QPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTP 1386
              Q+LFSEPVR++EKYP+ +G DL+KKHMVCVNWLLSE+PLDLETELALGFLDHL++GTP
Sbjct: 360  GQQRLFSEPVRIVEKYPSGDGGDLKKKHMVCVNWLLSEKPLDLETELALGFLDHLMLGTP 419

Query: 1387 ASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEE 1566
            ASPLRKILLESGLG+A++GGG+EDELLQPQFSIGLKGV  DDI +VEELI+NT KKLAEE
Sbjct: 420  ASPLRKILLESGLGEAILGGGIEDELLQPQFSIGLKGVLDDDIPKVEELILNTFKKLAEE 479

Query: 1567 GFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLK 1746
            GFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDMNPF PLKY++PL  LK
Sbjct: 480  GFDNDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMNPFEPLKYEEPLKALK 539

Query: 1747 ARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEED 1926
            ARIA +G KAVFSPLI ++ILNNPHRVT+EMQPDPEKASRDE  EKE L KVK SMTEED
Sbjct: 540  ARIAAEGPKAVFSPLIEKFILNNPHRVTIEMQPDPEKASRDEATEKEILQKVKESMTEED 599

Query: 1927 LAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLF 2106
            LAELARAT+ELRLKQETPDPPEALKCVP L L+DIPK+P  VP E+G++NGV VLQHDLF
Sbjct: 600  LAELARATQELRLKQETPDPPEALKCVPCLCLEDIPKEPTRVPTEIGNVNGVTVLQHDLF 659

Query: 2107 TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPF 2286
            TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPF
Sbjct: 660  TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPF 719

Query: 2287 TSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMEN 2466
            TS++RG ++ C+H+VVRGKAMS   EDLFNL+NCILQ+VQ TDQ+RFKQFVSQSK+RMEN
Sbjct: 720  TSSIRGNDKACTHMVVRGKAMSGCAEDLFNLMNCILQEVQFTDQQRFKQFVSQSKSRMEN 779

Query: 2467 RVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRR 2646
            R+RGSGHGIAAARMDAKLNSAGWISEQMGG+SY+EFL++LEEKVD++W EI+SSLEEIR+
Sbjct: 780  RLRGSGHGIAAARMDAKLNSAGWISEQMGGLSYMEFLQTLEEKVDQNWTEISSSLEEIRQ 839

Query: 2647 SLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQ 2826
            SL S+  CL+N+T+DGKNL  +EK + KF              W ARL   NEAIVIPTQ
Sbjct: 840  SLLSRKNCLVNITADGKNLIKSEKFIGKFLDLLPNQPIIKNSTWNARLSSDNEAIVIPTQ 899

Query: 2827 VNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFL 3006
            VNYVGKAAN+YETGYQL GSAYVISK+ISNTWLWD VRVSGGAYGGFCDFD+HSGVFSFL
Sbjct: 900  VNYVGKAANIYETGYQLDGSAYVISKFISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFL 959

Query: 3007 SYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYL 3186
            SYRDPNLLKTL++YDGT  FLREL++DDD L +AIIGTIGDVDSYQLPDAKGY+SLLRYL
Sbjct: 960  SYRDPNLLKTLDVYDGTVDFLRELELDDDTLAKAIIGTIGDVDSYQLPDAKGYSSLLRYL 1019

Query: 3187 LGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQ 3366
            LG+T          ILST LKDFK FAD +E              +DVE A+ E   FFQ
Sbjct: 1020 LGITEEERQRRREEILSTSLKDFKNFADALEAVRNKGVVVSVASPEDVETAHGERPGFFQ 1079

Query: 3367 VKKA 3378
            VKKA
Sbjct: 1080 VKKA 1083


>ref|XP_004159889.1| PREDICTED: LOW QUALITY PROTEIN: presequence protease 1,
            chloroplastic/mitochondrial-like [Cucumis sativus]
          Length = 1084

 Score = 1645 bits (4259), Expect = 0.0
 Identities = 823/1084 (75%), Positives = 925/1084 (85%), Gaps = 8/1084 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAP----PRASSIYAKRCRLLPNLHRRSLLRS 318
            ME++V LRSL  ++++   R+F RS HRL P    PR+S +  K  R  P+  RRSLL  
Sbjct: 1    MEKSVFLRSLTCSSLVCN-RIFFRSAHRLCPSTLPPRSSFVSRKLHRFNPSFSRRSLLPR 59

Query: 319  YLPVLSNRSPNFSS-LRTQFSSQSVRAIATSAPQS--EVFGADDDVAEKLGFEKVSEEFI 489
             L +L   S + SS  R QFSS + RA+A+    S  E     D+VAEKLGFEKVSEEFI
Sbjct: 60   QLKLLPAYSQSRSSHFRKQFSSLAPRAVASPPAHSPPEFAEVSDEVAEKLGFEKVSEEFI 119

Query: 490  EECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 669
             ECKS+A+L++HKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY
Sbjct: 120  GECKSKAVLFRHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 179

Query: 670  PLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQ 849
            P+KEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED++
Sbjct: 180  PVKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFK 239

Query: 850  TFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGD 1029
            TFQQEGWHYELNDPSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+NTYGVDSGGD
Sbjct: 240  TFQQEGWHYELNDPSEDISYKGVVFNEMKGVYSQPDNILGRVTQQALFPDNTYGVDSGGD 299

Query: 1030 PEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRV 1209
            P VIPKLTFEEFKEFH K+YHP NARIWFYGDDDP ERLRIL +YLDMF+ S    +S++
Sbjct: 300  PRVIPKLTFEEFKEFHSKFYHPGNARIWFYGDDDPVERLRILKDYLDMFDASPVSDQSKI 359

Query: 1210 QPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTP 1386
              Q+LFSEPVR++EKYP+ +G DL KKHMVCVNWLLSE+PLDLETELALGFLDHL++GTP
Sbjct: 360  GQQRLFSEPVRIVEKYPSGDGGDLXKKHMVCVNWLLSEKPLDLETELALGFLDHLMLGTP 419

Query: 1387 ASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEE 1566
            ASPLRKILLESGLG+A++GGG+EDELLQPQFSIGLKGV  DDI +VEELI+NT KKLAEE
Sbjct: 420  ASPLRKILLESGLGEAILGGGIEDELLQPQFSIGLKGVLDDDIPKVEELILNTFKKLAEE 479

Query: 1567 GFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLK 1746
            GFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDMNPF PLKY++PL  LK
Sbjct: 480  GFDNDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMNPFEPLKYEEPLKALK 539

Query: 1747 ARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEED 1926
            ARIA +G KAVFSPLI ++ILNNPHRVT+EMQPDPEKASRDE  EKE L KVK SMTEED
Sbjct: 540  ARIAAEGPKAVFSPLIEKFILNNPHRVTIEMQPDPEKASRDEATEKEILQKVKESMTEED 599

Query: 1927 LAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLF 2106
            LAELARAT+ELRLKQETPDPPEALKCVP L L+DIPK+P  VP E+G++NGV VLQHDLF
Sbjct: 600  LAELARATQELRLKQETPDPPEALKCVPCLCLEDIPKEPTRVPTEIGNVNGVTVLQHDLF 659

Query: 2107 TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPF 2286
            TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPF
Sbjct: 660  TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPF 719

Query: 2287 TSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMEN 2466
            TS++RG ++ C+H+VVRGKAMS   EDLFNL+NCILQ+VQ TDQ+RFKQFVSQSK+RMEN
Sbjct: 720  TSSIRGNDKACTHMVVRGKAMSGCAEDLFNLMNCILQEVQFTDQQRFKQFVSQSKSRMEN 779

Query: 2467 RVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRR 2646
            R+RGSGHGIAAARMDAKLNSAGWISEQMGG+SY+EFL++LEEKVD++W EI+SSLEEIR+
Sbjct: 780  RLRGSGHGIAAARMDAKLNSAGWISEQMGGLSYMEFLQTLEEKVDQNWTEISSSLEEIRQ 839

Query: 2647 SLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQ 2826
            SL S+  CL+N+T+DGKNL  +EK + KF              W ARL   NEAIVIPTQ
Sbjct: 840  SLLSRKNCLVNITADGKNLIKSEKFIGKFLDLLPNQPIIKNSTWNARLSSDNEAIVIPTQ 899

Query: 2827 VNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFL 3006
            VNYVGKAAN+YETGYQL GSAYVISK+ISNTWLWD VRVSGGAYGGFCDFD+HSGVFSFL
Sbjct: 900  VNYVGKAANIYETGYQLDGSAYVISKFISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFL 959

Query: 3007 SYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYL 3186
            SYRDPNLLKTL++YDGT  FLREL++DDD L +AIIGTIGDVDSYQLPDAKGY+SLLRYL
Sbjct: 960  SYRDPNLLKTLDVYDGTVDFLRELELDDDTLAKAIIGTIGDVDSYQLPDAKGYSSLLRYL 1019

Query: 3187 LGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQ 3366
            LG+T          ILST LKDFK FAD +E              +DVE A+ E   FFQ
Sbjct: 1020 LGITEEERQRRREEILSTSLKDFKNFADALEAVRNKGVVVSVASPEDVETAHGERPGFFQ 1079

Query: 3367 VKKA 3378
            VKKA
Sbjct: 1080 VKKA 1083


>ref|XP_004296078.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Fragaria vesca subsp. vesca]
          Length = 1073

 Score = 1642 bits (4251), Expect = 0.0
 Identities = 825/1081 (76%), Positives = 931/1081 (86%), Gaps = 5/1081 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAKRCR--LLPNLHRRSLLRSYL 324
            ME A LLRS  S+T  +      R     +   +S++   R R  L P+L RR+ L    
Sbjct: 1    MEGAALLRSSLSSTNRAFFSFRPRFSRSFSSSASSALRTNRHRQILRPSLLRRTFL---- 56

Query: 325  PVLSNRSPNFSSLRTQFSSQSVRAIAT--SAPQSEVFGADDDVAEKLGFEKVSEEFIEEC 498
              L   SP+FS    +FSS S RA+AT  +   SE  G  D+VAEKLGFEKV+EEFI EC
Sbjct: 57   --LPAASPHFSR---RFSSLSPRAVATPLTPSPSESSGVSDEVAEKLGFEKVTEEFIGEC 111

Query: 499  KSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPLK 678
            KS+A+L++HKKTGA+++SVSNDDENKVFGIVFRTPP+DSTGIPHILEHSVLCGSRKYPLK
Sbjct: 112  KSKALLFRHKKTGAQMISVSNDDENKVFGIVFRTPPNDSTGIPHILEHSVLCGSRKYPLK 171

Query: 679  EPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTFQ 858
            EPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVED+QTFQ
Sbjct: 172  EPFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQ 231

Query: 859  QEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPEV 1038
            QEGWHYELNDPSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+NTYGVDSGGDP+V
Sbjct: 232  QEGWHYELNDPSEDISYKGVVFNEMKGVYSQPDNILGRIAQQALFPDNTYGVDSGGDPKV 291

Query: 1039 IPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQPQ 1218
            IPKLT+EEFKEFHRKYYHPSNARIWFYGDDDP ERLRILSEYLDMF+ SSAP ESRVQ Q
Sbjct: 292  IPKLTYEEFKEFHRKYYHPSNARIWFYGDDDPTERLRILSEYLDMFDASSAPNESRVQTQ 351

Query: 1219 KLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPASP 1395
            KLFSEPVR+ E YPA EG DL+KK MVC+NWLLSE+PLDLETELALGFLDHL++GTPASP
Sbjct: 352  KLFSEPVRISETYPAGEGGDLKKKDMVCINWLLSEKPLDLETELALGFLDHLMLGTPASP 411

Query: 1396 LRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGFD 1575
            LRKILLESGLG+A++GGGVEDELLQPQFSIGLKGV +DDI ++EEL+M+TL+ LA+EGFD
Sbjct: 412  LRKILLESGLGEAIIGGGVEDELLQPQFSIGLKGVSQDDIPKIEELVMSTLQNLADEGFD 471

Query: 1576 SDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKARI 1755
            + AVEASMNTIEF+LRENNTGSFPRGL+LMLR+MGKWIYDM+PF PLKY+KPL+ LKARI
Sbjct: 472  TAAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKWIYDMDPFQPLKYEKPLLALKARI 531

Query: 1756 AEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLAE 1935
             E+GSKAVFSPLI ++ILNNPHRV VEMQPDPEKASRDE  EKE L KVKA MTEEDLAE
Sbjct: 532  EEEGSKAVFSPLIEKFILNNPHRVVVEMQPDPEKASRDEAAEKEILEKVKAGMTEEDLAE 591

Query: 1936 LARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTND 2115
            LARAT++L+LKQETPDPPEAL+ VPSLSLQDIPK+P+ +P EVGDINGVK+LQHDLFTND
Sbjct: 592  LARATQDLKLKQETPDPPEALRSVPSLSLQDIPKEPIAIPTEVGDINGVKILQHDLFTND 651

Query: 2116 VLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTST 2295
            VLY+EVVFDMS  KQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYP TS+
Sbjct: 652  VLYTEVVFDMSLPKQELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPMTSS 711

Query: 2296 VRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRVR 2475
            VRGK++ CSHI+VRGKAM+ R +DLF+L+NCILQ+VQ TDQ+RFKQFVSQSKARMENR+R
Sbjct: 712  VRGKKDACSHIIVRGKAMAGRADDLFHLMNCILQEVQFTDQQRFKQFVSQSKARMENRLR 771

Query: 2476 GSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSLF 2655
            GSGHGIAAARMDAKLN AGWISEQMGG SYLEFL+ LE+KVD DW +I+SSLEEIR+SL 
Sbjct: 772  GSGHGIAAARMDAKLNVAGWISEQMGGFSYLEFLQDLEQKVDNDWEKISSSLEEIRKSLL 831

Query: 2656 SKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVNY 2835
            S+ GCLIN+T++GKNLTN+EK V KF              W ARLP TNEA+VIPTQVNY
Sbjct: 832  SREGCLINMTAEGKNLTNSEKFVGKFLDLLPSKSPLTRTTWNARLPSTNEALVIPTQVNY 891

Query: 2836 VGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSYR 3015
            VGKAAN+Y+TGYQL GSAYVISKYISNTWLWD VRVSGGAYGGFCDFD+HSGVFSFLSYR
Sbjct: 892  VGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYR 951

Query: 3016 DPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLGV 3195
            DPNLLKTL+IYDGT +FLR+LDMD++ LT++IIGTIGDVDSYQLPDAKGY+SL+R+LLGV
Sbjct: 952  DPNLLKTLDIYDGTGEFLRQLDMDEETLTKSIIGTIGDVDSYQLPDAKGYSSLMRHLLGV 1011

Query: 3196 TXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVKK 3375
            +          ILST LKDFKEFA+ I+              DDV+AA KE S+ F+VKK
Sbjct: 1012 SDEERQIRREEILSTSLKDFKEFANAIDEVKDKGVSVAVASPDDVDAAQKERSNLFEVKK 1071

Query: 3376 A 3378
            A
Sbjct: 1072 A 1072


>ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Populus trichocarpa]
            gi|550341043|gb|ERP62222.1| hypothetical protein
            POPTR_0004s14960g [Populus trichocarpa]
          Length = 1091

 Score = 1634 bits (4231), Expect = 0.0
 Identities = 829/1099 (75%), Positives = 927/1099 (84%), Gaps = 23/1099 (2%)
 Frame = +1

Query: 151  MERAVLLRS------------------LPSTTVISRTRLFTRSLHRLAPPRASSIYAKRC 276
            ME AVLLRS                  L S++  S +    R+ HR   P  S   A+R 
Sbjct: 1    METAVLLRSSNKLILNHRYYCPHKFFRLLSSSSSSPSSFTPRNSHRSINPLTSRSLARR- 59

Query: 277  RLLPNLHRRSLLRSYLPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQSEVFGADDDVAEK 456
                   RR  L       S+ SP+F   +  FS+ S  AI+T     +V    D+VA K
Sbjct: 60   -------RRRKLLPLSATSSSSSPSFHFNKHHFSTLSPHAISTQY-SPDVSNVSDEVAAK 111

Query: 457  LGFEKVSEEFIEECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHIL 636
             GFEKVSEEFI ECKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHIL
Sbjct: 112  YGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHIL 171

Query: 637  EHSVLCGSRKYPLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDA 816
            EHSVLCGSRKYPLKEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDA
Sbjct: 172  EHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDA 231

Query: 817  VFFPRCVEDYQTFQQEGWHYELNDPSEDIIYKG-VVFNEMKGVYSQPDSILGRTSQQALF 993
            VFFP+CVEDYQTFQQEGWH+ELNDPSE+I YKG VVFNEMKGVYSQPD+ILGRT+QQA  
Sbjct: 232  VFFPKCVEDYQTFQQEGWHFELNDPSEEISYKGCVVFNEMKGVYSQPDNILGRTAQQASS 291

Query: 994  P---NNTYGVDSGGDPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEY 1164
            P    NTYGVDSGGDP+VIP+LTFE+FKEFH KYYHPSNARIWFYGDDDP ERLRILSEY
Sbjct: 292  PISNYNTYGVDSGGDPKVIPQLTFEQFKEFHGKYYHPSNARIWFYGDDDPTERLRILSEY 351

Query: 1165 LDMFEGSSAPLESRVQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLET 1341
            LDMF+ SSAP ESRV+ QKLFS PVR+IEKYPA +G DL+KKHMVC+NWLL+++PLDLET
Sbjct: 352  LDMFDASSAPNESRVEQQKLFSAPVRIIEKYPAGDGGDLKKKHMVCLNWLLADKPLDLET 411

Query: 1342 ELALGFLDHLLMGTPASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQR 1521
            EL LGFLDHL++GTPASPLRKILLESGLGDA+VGGG+EDELLQPQFSIGLKGV ++DIQ+
Sbjct: 412  ELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGIEDELLQPQFSIGLKGVFEEDIQK 471

Query: 1522 VEELIMNTLKKLAEEGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMN 1701
            VEEL+M+TLKKLAEEGF+++AVEASMNTIEF+LRENNTGSFPRGL+LMLR++ KWIYDMN
Sbjct: 472  VEELVMSTLKKLAEEGFETEAVEASMNTIEFSLRENNTGSFPRGLSLMLRSISKWIYDMN 531

Query: 1702 PFVPLKYKKPLMDLKARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKE 1881
            PF PLKY+KPLMDLKARIAE+G KAVFSPLI ++ILNNPHRVTVEMQPDPEKAS DE  E
Sbjct: 532  PFEPLKYEKPLMDLKARIAEEGYKAVFSPLIEKFILNNPHRVTVEMQPDPEKASHDEAAE 591

Query: 1882 KENLSKVKASMTEEDLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIE 2061
            +E L KVKASMTEEDLAELARAT+EL+LKQETPDPPEAL+ VPSL L DIPK+P+HVP E
Sbjct: 592  REILEKVKASMTEEDLAELARATQELKLKQETPDPPEALRSVPSLFLCDIPKEPIHVPTE 651

Query: 2062 VGDINGVKVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLN 2241
            VGDINGVKVL+HDLFTNDVLY+E+VF+M SLKQELLPLVPLFCQSLLEMGTKDL FVQLN
Sbjct: 652  VGDINGVKVLKHDLFTNDVLYAEIVFNMRSLKQELLPLVPLFCQSLLEMGTKDLTFVQLN 711

Query: 2242 QLIGRKTGGISVYPFTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQK 2421
            QLIGRKTGGIS+YPFTS+VRG+E+PCSHIV RGKAM+ R EDLFNL+NC+LQ+VQ TDQ+
Sbjct: 712  QLIGRKTGGISLYPFTSSVRGREDPCSHIVARGKAMAGRVEDLFNLVNCVLQEVQFTDQQ 771

Query: 2422 RFKQFVSQSKARMENRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVD 2601
            RFKQFVSQSKARMENR+RGSGHGIAAARMDAKLN AGWISEQMGG+SYLEFL++LE++VD
Sbjct: 772  RFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLKALEKRVD 831

Query: 2602 KDWPEIASSLEEIRRSLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWT 2781
            +DW  ++SSLEEIR SLFSK+GCLIN+T+DGKNLTN+EK+VSKF              W 
Sbjct: 832  QDWAGVSSSLEEIRMSLFSKNGCLINMTADGKNLTNSEKYVSKFLDLLPSKSSVEAAAWN 891

Query: 2782 ARLPFTNEAIVIPTQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYG 2961
            ARL   NEAIVIPTQVNYVGKAAN+Y+TGYQL GSAYVISKYISNTWLWD VRVSGGAYG
Sbjct: 892  ARLSPGNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYG 951

Query: 2962 GFCDFDTHSGVFSFLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSY 3141
            GFCDFDTHSGVFSFLSYRDPNLLKTL++YDG+  FLREL+MDDD L +AIIGTIGDVDSY
Sbjct: 952  GFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGSGAFLRELEMDDDTLAKAIIGTIGDVDSY 1011

Query: 3142 QLPDAKGYTSLLRYLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXR 3321
            QL DAKGY+SLLRYLLG+T          ILST LKDFKEF +VIE              
Sbjct: 1012 QLADAKGYSSLLRYLLGITEEERQKRREEILSTSLKDFKEFGEVIEAVKDKGVSVVVASP 1071

Query: 3322 DDVEAANKECSDFFQVKKA 3378
            +DV AANKE S++F VKKA
Sbjct: 1072 EDVHAANKERSNYFDVKKA 1090


>ref|XP_003517606.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial
            [Glycine max]
          Length = 1078

 Score = 1631 bits (4224), Expect = 0.0
 Identities = 813/1083 (75%), Positives = 921/1083 (85%), Gaps = 7/1083 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPPRASSIYAK------RCRLLPNLHRRSLL 312
            MERA L+R LP ++V+ R  L + S H   P  + SI         R   LP   R S  
Sbjct: 1    MERAALVRCLPCSSVLCRRYLHSHS-HLCRPSSSISIIPSLSLPTIRPLCLPR-RRSSSS 58

Query: 313  RSYLPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQSEVFGADDDVAEKLGFEKVSEEFIE 492
               LP+    + N    R  FSS + RA+ + +P S     +D+VA KLGFEKVSEEFI 
Sbjct: 59   SRLLPLYFRTTIN----RKHFSSLAPRAVLSPSPSSGFAEVNDEVALKLGFEKVSEEFIP 114

Query: 493  ECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYP 672
            ECKS+A+L++H KTGA+VMSVSNDD+NKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYP
Sbjct: 115  ECKSKAVLFRHIKTGAQVMSVSNDDDNKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYP 174

Query: 673  LKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQT 852
            LKEPFVELLK SL TFLNAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFPRCVED+Q 
Sbjct: 175  LKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPRCVEDFQI 234

Query: 853  FQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDP 1032
            FQQEGWH+ELNDPSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+ TYGVDSGGDP
Sbjct: 235  FQQEGWHFELNDPSEDITYKGVVFNEMKGVYSQPDNILGRAAQQALFPDTTYGVDSGGDP 294

Query: 1033 EVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQ 1212
             VIPKLTFEEFKEFHRKYYHPSN+RIWFYGDDDPNERLRILSEYLD+F+ S A  ESRV+
Sbjct: 295  RVIPKLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLDLFDSSLASHESRVE 354

Query: 1213 PQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPA 1389
            PQ LFS+PVR++E YPA EG DL+KKHMVC+NWLLS++PLDLETEL LGFL+HLL+GTPA
Sbjct: 355  PQTLFSKPVRIVETYPAGEGGDLKKKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGTPA 414

Query: 1390 SPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEG 1569
            SPLRKILLES LGDA+VGGGVEDELLQPQFSIG+KGV +DDI +VEEL+ +TLKKLAEEG
Sbjct: 415  SPLRKILLESRLGDAIVGGGVEDELLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEG 474

Query: 1570 FDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKA 1749
            FD+DA+EASMNTIEF+LRENNTGSFPRGL+LML+++GKWIYDMNPF PLKY+KPL DLK+
Sbjct: 475  FDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQDLKS 534

Query: 1750 RIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDL 1929
            RIA++GSK+VFSPLI ++ILNNPH+VTVEMQPDPEKA+RDE  EK+ L KVKASMT EDL
Sbjct: 535  RIAKEGSKSVFSPLIEKFILNNPHQVTVEMQPDPEKAARDEVAEKQILQKVKASMTTEDL 594

Query: 1930 AELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFT 2109
            AELARAT ELRLKQETPDPPEALK VPSLSLQDIPK+P+ VP EVGDINGVKVLQHDLFT
Sbjct: 595  AELARATHELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDLFT 654

Query: 2110 NDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFT 2289
            NDVLY+E+VF+M SLKQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPFT
Sbjct: 655  NDVLYTEIVFNMKSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFT 714

Query: 2290 STVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENR 2469
            S+VRGKE+PCSH+V+RGKAM+   EDL++L+N +LQDVQ TDQ+RFKQFVSQS+ARMENR
Sbjct: 715  SSVRGKEDPCSHMVIRGKAMAGHIEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENR 774

Query: 2470 VRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRS 2649
            +RGSGHGIAAARMDAKLN+AGW+SE+MGG+SYLEFLR+LEE+VD+DW +I+SSLEEIR+S
Sbjct: 775  LRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLRTLEERVDQDWADISSSLEEIRKS 834

Query: 2650 LFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQV 2829
            +FSK GCLIN+T+D KNL   EK +SKF              W  RLP TNEAIVIPTQV
Sbjct: 835  IFSKQGCLINVTADRKNLAKTEKVLSKFVDLLPTSSPIATTTWNVRLPLTNEAIVIPTQV 894

Query: 2830 NYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLS 3009
            NY+GKAAN+Y+TGY+L GSAYVISKYISNTWLWD VRVSGGAYGGFCDFDTHSGVFSFLS
Sbjct: 895  NYIGKAANIYDTGYRLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLS 954

Query: 3010 YRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLL 3189
            YRDPNLLKTL++YDGT  FLREL +DDD LT+AIIGTIGDVD+YQLPDAKGY+S+LRYLL
Sbjct: 955  YRDPNLLKTLDVYDGTGDFLRELQIDDDTLTKAIIGTIGDVDAYQLPDAKGYSSMLRYLL 1014

Query: 3190 GVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQV 3369
            G+T          ILST LKDFK F D +E              +DV+ ANK+  DFFQV
Sbjct: 1015 GITEEERQRRREEILSTSLKDFKIFMDAMEAVKDKGVVVAVASPEDVDTANKDRPDFFQV 1074

Query: 3370 KKA 3378
            KKA
Sbjct: 1075 KKA 1077


>gb|EMJ02012.1| hypothetical protein PRUPE_ppa025698mg, partial [Prunus persica]
          Length = 986

 Score = 1630 bits (4221), Expect = 0.0
 Identities = 798/984 (81%), Positives = 889/984 (90%), Gaps = 1/984 (0%)
 Frame = +1

Query: 430  GADDDVAEKLGFEKVSEEFIEECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPS 609
            G +D+V EKLGFEKVSEEFI ECKS+A+L++HKKTGA+V+SVSNDDENKVFGIVFRTPP+
Sbjct: 3    GVEDEVVEKLGFEKVSEEFIGECKSKALLFRHKKTGAQVISVSNDDENKVFGIVFRTPPN 62

Query: 610  DSTGIPHILEHSVLCGSRKYPLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFY 789
            DSTGIPHILEHSVLCGSRKYPLKEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFY
Sbjct: 63   DSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKDFY 122

Query: 790  NLVDVYLDAVFFPRCVEDYQTFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILG 969
            NLVDVYLDAVFFP+CVED++TFQQEGWHYELNDPSEDI YKGVVFNEMKGVYSQPD+ILG
Sbjct: 123  NLVDVYLDAVFFPKCVEDFRTFQQEGWHYELNDPSEDISYKGVVFNEMKGVYSQPDNILG 182

Query: 970  RTSQQALFPNNTYGVDSGGDPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLR 1149
            R SQQALFP+NTYGVDSGGDP+VIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDP ERLR
Sbjct: 183  RASQQALFPDNTYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPTERLR 242

Query: 1150 ILSEYLDMFEGSSAPLESRVQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQP 1326
            ILSEYLDMF+ SS+P ESR+Q QKLFSEP+R+ EKYPA EG DL KK+MVC+NWLLS++P
Sbjct: 243  ILSEYLDMFDASSSPNESRIQAQKLFSEPIRISEKYPAGEGGDLRKKNMVCLNWLLSDKP 302

Query: 1327 LDLETELALGFLDHLLMGTPASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKK 1506
            LDLETEL LGFLDHL++GTPASPLRKILLESGLG+A+VGGGVEDELLQPQFSIGLKGV +
Sbjct: 303  LDLETELTLGFLDHLMLGTPASPLRKILLESGLGEAIVGGGVEDELLQPQFSIGLKGVSE 362

Query: 1507 DDIQRVEELIMNTLKKLAEEGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKW 1686
            DDIQ VEE++M+TLKKLAEEGFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR+MGKW
Sbjct: 363  DDIQNVEEVVMSTLKKLAEEGFDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMGKW 422

Query: 1687 IYDMNPFVPLKYKKPLMDLKARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASR 1866
            IYDM+PF PLKY+KPL+ LKARI  +GSKAVFSPLI ++ILNN HRV VEMQPDPEKASR
Sbjct: 423  IYDMDPFEPLKYEKPLLALKARIEAEGSKAVFSPLIEKFILNNRHRVVVEMQPDPEKASR 482

Query: 1867 DEEKEKENLSKVKASMTEEDLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPV 2046
            DEE EK+ L KVKA MTEEDLAELARAT+ELRL+QETPDPPEAL+ VPSLSLQDIPK+P 
Sbjct: 483  DEEAEKQILDKVKAGMTEEDLAELARATQELRLRQETPDPPEALRSVPSLSLQDIPKEPT 542

Query: 2047 HVPIEVGDINGVKVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLD 2226
             VP EVGDINGVKVLQHDLFTNDVLY+EVVF+MSSLKQELLPLVPLFCQSLLEMGTKDL 
Sbjct: 543  RVPTEVGDINGVKVLQHDLFTNDVLYTEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDLS 602

Query: 2227 FVQLNQLIGRKTGGISVYPFTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQ 2406
            FVQLNQLIGRKTGGISVYP TS+VRGKE+PCSHI+VRGKAM+ R +DLF+L NC+LQ+VQ
Sbjct: 603  FVQLNQLIGRKTGGISVYPMTSSVRGKEDPCSHIIVRGKAMAGRADDLFHLFNCVLQEVQ 662

Query: 2407 LTDQKRFKQFVSQSKARMENRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSL 2586
             TDQ+RFKQFVSQSKARMENR+RGSGHGIAAARMDAKLN AGWISEQMGG+SYLEFL++L
Sbjct: 663  FTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLQAL 722

Query: 2587 EEKVDKDWPEIASSLEEIRRSLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXX 2766
            EEKVD+DW  I+SSLEEIR+SL S++GC++N+T++GKNLTN+EK VSKF           
Sbjct: 723  EEKVDQDWDGISSSLEEIRKSLLSRNGCIVNMTAEGKNLTNSEKFVSKF-LDLLPNSPVA 781

Query: 2767 XXXWTARLPFTNEAIVIPTQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVS 2946
               W ARLP +NEAIVIPTQVNYVGKAAN+Y+TGYQL GSAYVISKYI NTWLWD VRVS
Sbjct: 782  TSTWNARLPSSNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKYICNTWLWDRVRVS 841

Query: 2947 GGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIG 3126
            GGAYGGFCDFD+HSGVFSFLSYRDPNL KTL +YDGT  FLR+LDMDD+ LT++IIGTIG
Sbjct: 842  GGAYGGFCDFDSHSGVFSFLSYRDPNLFKTLGVYDGTGDFLRQLDMDDETLTKSIIGTIG 901

Query: 3127 DVDSYQLPDAKGYTSLLRYLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXX 3306
            DVDSYQLPDAKGY+SLLR+LLGVT          ILST +KDFKEFA+ I+         
Sbjct: 902  DVDSYQLPDAKGYSSLLRHLLGVTEEERQRRREEILSTSVKDFKEFAEAIDAVKNKGVVV 961

Query: 3307 XXXXRDDVEAANKECSDFFQVKKA 3378
                 DDVEAA+KE ++FF+VKKA
Sbjct: 962  AVASPDDVEAAHKEQNNFFEVKKA 985


>gb|EOX98216.1| Presequence protease 2 isoform 2 [Theobroma cacao]
          Length = 1040

 Score = 1627 bits (4212), Expect = 0.0
 Identities = 807/1040 (77%), Positives = 916/1040 (88%), Gaps = 8/1040 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLH-RLAPPRASSIYAK---RCRLLPN--LHRRSLL 312
            MER  LLRSL  +++     LF+   H R    ++S++ A      RL+PN  L RR+  
Sbjct: 1    MERTALLRSLSCSSLACNKFLFSAPKHSRSFLSKSSTVSAAGRYHRRLIPNRSLIRRNNW 60

Query: 313  RSYLPVLSNRSPNFSSLRTQFSSQSVRAIAT-SAPQSEVFGADDDVAEKLGFEKVSEEFI 489
            RS     S+ S  F+     FSS S RA+A+ + P  ++ G +D+VAEKLGFEKVSEEFI
Sbjct: 61   RSLSVASSHSSLRFTYSNKNFSSLSPRAVASPTQPSPDIAGVEDEVAEKLGFEKVSEEFI 120

Query: 490  EECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 669
             ECKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKY
Sbjct: 121  GECKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKY 180

Query: 670  PLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQ 849
            PLKEPFVELLK SL TFLNAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED+Q
Sbjct: 181  PLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQ 240

Query: 850  TFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGD 1029
            TFQQEGWHYELND SEDI YKGVVFNEMKGVYSQPD++LGRT+QQALFP+NTYGVDSGGD
Sbjct: 241  TFQQEGWHYELNDTSEDITYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGD 300

Query: 1030 PEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRV 1209
            P+VIPKLT+EEFKEFHRKYYHPSNARIWFYGDDDP ERLRILSEYLDMF+ S+AP ES+V
Sbjct: 301  PQVIPKLTYEEFKEFHRKYYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKV 360

Query: 1210 QPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTP 1386
            +PQKLFSEPVR +EKYP  EG DL+KKHMVC+NWLLS++PLDL+TEL LGFLDHL++GTP
Sbjct: 361  EPQKLFSEPVRFVEKYPVGEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTP 420

Query: 1387 ASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEE 1566
            ASPLRK+LLESGLGDA++GGGVEDELLQPQFSIGLKGV +DDI +VEELIM++LKKLAEE
Sbjct: 421  ASPLRKVLLESGLGDAIIGGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEE 480

Query: 1567 GFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLK 1746
            GFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM+PF PLKY+KPLM LK
Sbjct: 481  GFDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILK 540

Query: 1747 ARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEED 1926
            ARIAE+GSKAVFSPLI ++ILNNPH VT+EMQPDPEKASRDE  EKE L+KVKASMTEED
Sbjct: 541  ARIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEED 600

Query: 1927 LAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLF 2106
            LAELARAT+EL+LKQETPDPPEAL+ VPSLSL DIPK+P+ VP EVGDINGVKVLQHDLF
Sbjct: 601  LAELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLF 660

Query: 2107 TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPF 2286
            TNDVLY++VVFDMSSLK+ELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPF
Sbjct: 661  TNDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPF 720

Query: 2287 TSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMEN 2466
            TS+++GKE+PCSHI+VRGK+M+   +DLFNLINC++Q+VQ TDQ+RFKQFVSQSKARME+
Sbjct: 721  TSSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMES 780

Query: 2467 RVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRR 2646
            R+RGSGHGIAAARMDAKLN +GWISEQMGG+SYLEFL+ LEE+VD DW  I+SSLEEIR+
Sbjct: 781  RLRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRK 840

Query: 2647 SLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQ 2826
            SL S+ GCLIN+T+DGKNL+N EK VSKF              W+ARLP  NEAIVIPTQ
Sbjct: 841  SLLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQ 900

Query: 2827 VNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFL 3006
            VNYVGKAANLY+ GYQL GSAYVISK+ISNTWLWD VRVSGGAYGGFC+FDTHSGVF+FL
Sbjct: 901  VNYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFL 960

Query: 3007 SYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYL 3186
            SYRDPNLL+TL+IYDGT  FLREL+MDDD LT+AIIGT+GDVD+YQLPDAKGY+SL+RYL
Sbjct: 961  SYRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDVDAYQLPDAKGYSSLVRYL 1020

Query: 3187 LGVTXXXXXXXXXXILSTRL 3246
            LG+T          ILSTR+
Sbjct: 1021 LGITEEERQRRREEILSTRV 1040


>ref|XP_002330286.1| predicted protein [Populus trichocarpa]
          Length = 1007

 Score = 1622 bits (4199), Expect = 0.0
 Identities = 804/1007 (79%), Positives = 895/1007 (88%), Gaps = 5/1007 (0%)
 Frame = +1

Query: 373  FSSQSVRAIATSAPQSEVFGADDDVAEKLGFEKVSEEFIEECKSRAILYKHKKTGAEVMS 552
            FS+ S  AI+T     +V    D+VA K GFEKVSEEFI ECKS+A+L+KHKKTGAEVMS
Sbjct: 1    FSTLSPHAISTQY-SPDVSNVSDEVAAKYGFEKVSEEFIGECKSKAVLFKHKKTGAEVMS 59

Query: 553  VSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPLKEPFVELLKVSLQTFLNAF 732
            VSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPLKEPFVELLK SL TFLNAF
Sbjct: 60   VSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAF 119

Query: 733  TYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTFQQEGWHYELNDPSEDIIYK 912
            TYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CVEDYQTFQQEGWH+ELNDPSE+I YK
Sbjct: 120  TYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDYQTFQQEGWHFELNDPSEEISYK 179

Query: 913  G-VVFNEMKGVYSQPDSILGRTSQQALFP---NNTYGVDSGGDPEVIPKLTFEEFKEFHR 1080
            G VVFNEMKGVYSQPD+ILGRT+QQA  P    NTYGVDSGGDP+VIP+LTFE+FKEFH 
Sbjct: 180  GCVVFNEMKGVYSQPDNILGRTAQQASSPISNYNTYGVDSGGDPKVIPQLTFEQFKEFHG 239

Query: 1081 KYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQPQKLFSEPVRVIEKYP 1260
            KYYHPSNARIWFYGDDDP ERLRILSEYLDMF+ SSAP ESRV+ QKLFS PVR+IEKYP
Sbjct: 240  KYYHPSNARIWFYGDDDPTERLRILSEYLDMFDASSAPNESRVEQQKLFSAPVRIIEKYP 299

Query: 1261 A-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPASPLRKILLESGLGDAL 1437
            A +G DL+KKHMVC+NWLL+++PLDLETEL LGFLDHL++GTPASPLRKILLESGLGDA+
Sbjct: 300  AGDGGDLKKKHMVCLNWLLADKPLDLETELTLGFLDHLMLGTPASPLRKILLESGLGDAI 359

Query: 1438 VGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGFDSDAVEASMNTIEFA 1617
            VGGG+EDELLQPQFSIGLKGV ++DIQ+VEEL+M+TLKKLAEEGF+++AVEASMNTIEF+
Sbjct: 360  VGGGIEDELLQPQFSIGLKGVFEEDIQKVEELVMSTLKKLAEEGFETEAVEASMNTIEFS 419

Query: 1618 LRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKARIAEQGSKAVFSPLIV 1797
            LRENNTGSFPRGL+LMLR++ KWIYDMNPF PLKY+KPLMDLKARIAE+G KAVFSPLI 
Sbjct: 420  LRENNTGSFPRGLSLMLRSISKWIYDMNPFEPLKYEKPLMDLKARIAEEGYKAVFSPLIE 479

Query: 1798 EYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLAELARATEELRLKQET 1977
            ++ILNNPHRVTVEMQPDPEKAS DE  E+E L KVKASMTEEDLAELARAT+EL+LKQET
Sbjct: 480  KFILNNPHRVTVEMQPDPEKASHDEAAEREILEKVKASMTEEDLAELARATQELKLKQET 539

Query: 1978 PDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTNDVLYSEVVFDMSSLK 2157
            PDPPEAL+ VPSL L DIPK+P+HVP EVGDINGVKVL+HDLFTNDVLY+E+VF+M SLK
Sbjct: 540  PDPPEALRSVPSLFLCDIPKEPIHVPTEVGDINGVKVLKHDLFTNDVLYAEIVFNMRSLK 599

Query: 2158 QELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTSTVRGKEEPCSHIVVR 2337
            QELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGIS+YPFTS+VRG+E+PCSHIV R
Sbjct: 600  QELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISLYPFTSSVRGREDPCSHIVAR 659

Query: 2338 GKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRVRGSGHGIAAARMDAK 2517
            GKAM+ R EDLFNL+NC+LQ+VQ TDQ+RFKQFVSQSKARMENR+RGSGHGIAAARMDAK
Sbjct: 660  GKAMAGRVEDLFNLVNCVLQEVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAK 719

Query: 2518 LNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSLFSKSGCLINLTSDGK 2697
            LN AGWISEQMGG+SYLEFL++LE++VD+DW  ++SSLEEIR SLFSK+GCLIN+T+DGK
Sbjct: 720  LNVAGWISEQMGGVSYLEFLKALEKRVDQDWAGVSSSLEEIRMSLFSKNGCLINMTADGK 779

Query: 2698 NLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVNYVGKAANLYETGYQL 2877
            NLTN+EK+VSKF              W ARL   NEAIVIPTQVNYVGKAAN+Y+TGYQL
Sbjct: 780  NLTNSEKYVSKFLDLLPSKSSVEAAAWNARLSPGNEAIVIPTQVNYVGKAANIYDTGYQL 839

Query: 2878 KGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLNIYDGT 3057
             GSAYVISKYISNTWLWD VRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTL++YDG+
Sbjct: 840  NGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGS 899

Query: 3058 SQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLGVTXXXXXXXXXXILS 3237
              FLREL+MDDD L +AIIGTIGDVDSYQL DAKGY+SLLRYLLG+T          ILS
Sbjct: 900  GAFLRELEMDDDTLAKAIIGTIGDVDSYQLADAKGYSSLLRYLLGITEEERQKRREEILS 959

Query: 3238 TRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVKKA 3378
            T LKDFKEF +VIE              +DV+AANKE S++F VKKA
Sbjct: 960  TSLKDFKEFGEVIEAVKDKGVSVVVASPEDVDAANKERSNYFDVKKA 1006


>gb|ESW29233.1| hypothetical protein PHAVU_002G054400g [Phaseolus vulgaris]
          Length = 1078

 Score = 1616 bits (4185), Expect = 0.0
 Identities = 808/1091 (74%), Positives = 921/1091 (84%), Gaps = 15/1091 (1%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLHRLAPP------RASSIYAKRC--RLLPNLHRRS 306
            MERA L+R LP ++V+ RT L +    RL+ P      R SS + +R   RLLP      
Sbjct: 1    MERAALVRCLPCSSVLCRTYLHSHGCRRLSIPSFPTSSRPSSSFLRRRSPRLLP------ 54

Query: 307  LLRSYLPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQSEVFG------ADDDVAEKLGFE 468
                     S+  P+F +   +F S S RA+ + +P S           +D+VA + GF+
Sbjct: 55   --------ASSSPPHFRTSSNRFCSFSPRAVLSPSPSSSPSPPPAFPQVEDEVALQFGFQ 106

Query: 469  KVSEEFIEECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSV 648
             VSEEFI ECKS+A+L++H KTGA+VMSVSNDDENKVFGIVFRTPP+DSTGIPHILEHSV
Sbjct: 107  IVSEEFIPECKSKAVLFRHIKTGAQVMSVSNDDENKVFGIVFRTPPNDSTGIPHILEHSV 166

Query: 649  LCGSRKYPLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP 828
            LCGSRKYPLKEPFVELLK SL TFLNAFTYPDRTCYPVASTN+KDFYNLVDVYLDAVFFP
Sbjct: 167  LCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNSKDFYNLVDVYLDAVFFP 226

Query: 829  RCVEDYQTFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTY 1008
            +CVED+Q FQQEGWH+ELNDPSEDI YKGVVFNEMKGVYSQPD+ILGR SQQALFP+ TY
Sbjct: 227  KCVEDFQIFQQEGWHFELNDPSEDITYKGVVFNEMKGVYSQPDNILGRASQQALFPDTTY 286

Query: 1009 GVDSGGDPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSS 1188
            GVDSGGDP VIPKLTFEEFKEFHRKYYHPSN+RIWFYG+DDP ERLRILSEYLD+F+ S 
Sbjct: 287  GVDSGGDPRVIPKLTFEEFKEFHRKYYHPSNSRIWFYGNDDPKERLRILSEYLDLFDSSL 346

Query: 1189 APLESRVQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLD 1365
            A  ESR++PQ LFS+PVR++E YPA EG DL+KKHMVC+NWLLS++PLDLETELA+GFL+
Sbjct: 347  ASEESRIEPQTLFSKPVRIVETYPAGEGGDLKKKHMVCLNWLLSDKPLDLETELAIGFLN 406

Query: 1366 HLLMGTPASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNT 1545
            HLL+GTPASPLRKILLESGLGDA+VGGGVEDELLQPQFSIGLKGV +DDI +VEEL+ +T
Sbjct: 407  HLLLGTPASPLRKILLESGLGDAIVGGGVEDELLQPQFSIGLKGVSEDDIHKVEELVTST 466

Query: 1546 LKKLAEEGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYK 1725
            LKKLAEEGFD+DA+EASMNTIEF+LRENNTGSFPRGL+LML+++GKWIYDMNPF PLKY+
Sbjct: 467  LKKLAEEGFDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYE 526

Query: 1726 KPLMDLKARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVK 1905
            KPL  LK+RIAE+G K+VFSPLI ++ILNNPH+VTVEMQPDPEKA+R+E  EK  L KVK
Sbjct: 527  KPLQGLKSRIAEEGPKSVFSPLIEKFILNNPHKVTVEMQPDPEKAAREEATEKHILQKVK 586

Query: 1906 ASMTEEDLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVK 2085
             SMT EDLAEL RAT ELRLKQETPD PEALK VPSLSLQDIPK+P+ VP EVGDINGVK
Sbjct: 587  TSMTTEDLAELTRATHELRLKQETPDSPEALKTVPSLSLQDIPKEPIRVPTEVGDINGVK 646

Query: 2086 VLQHDLFTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTG 2265
            VLQHDLFTNDVLY+E+VF+M+SLKQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTG
Sbjct: 647  VLQHDLFTNDVLYTEIVFNMNSLKQELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTG 706

Query: 2266 GISVYPFTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQ 2445
            GISVYPFTS+VRGKE+PCSH+VVRGKAM+   EDL++L+N +LQDVQ TDQ+RFKQFVSQ
Sbjct: 707  GISVYPFTSSVRGKEDPCSHMVVRGKAMAGCIEDLYDLVNSVLQDVQFTDQQRFKQFVSQ 766

Query: 2446 SKARMENRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIAS 2625
            S+ARMENR+RGSGHGIAAARMDAKLN+AGW+SE+MGG+SYLEFLR+LEE+VD+DW +I+S
Sbjct: 767  SRARMENRLRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLRTLEERVDQDWVDISS 826

Query: 2626 SLEEIRRSLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNE 2805
            SLEEIR+S+FSK GCL+N+T+D KNL NAEK VSKF                  LP TNE
Sbjct: 827  SLEEIRKSIFSKQGCLVNVTADRKNLANAEKVVSKFVDLLPTRSPIAATNRDFTLPLTNE 886

Query: 2806 AIVIPTQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTH 2985
            AIVIPTQVNYVGKAAN+Y+ GYQL GSAYVISKYISNTWLWD VRVSGGAYGGFCDFDTH
Sbjct: 887  AIVIPTQVNYVGKAANIYDVGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTH 946

Query: 2986 SGVFSFLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGY 3165
            SGVFSFLSYRDPNLLKTL++YDGT  FLREL +DDD LT+AIIGTIGDVD+YQLPDAKGY
Sbjct: 947  SGVFSFLSYRDPNLLKTLDVYDGTGDFLRELQIDDDTLTKAIIGTIGDVDAYQLPDAKGY 1006

Query: 3166 TSLLRYLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANK 3345
            +S+LRYLLG+T          ILST LKDFK F D +E              +DV+AANK
Sbjct: 1007 SSMLRYLLGITEEERQRRREEILSTSLKDFKNFTDAMEAVKNKGVVVAVASPEDVDAANK 1066

Query: 3346 ECSDFFQVKKA 3378
            +  DFFQVKKA
Sbjct: 1067 DRPDFFQVKKA 1077


>ref|XP_004511282.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cicer arietinum]
          Length = 1080

 Score = 1615 bits (4181), Expect = 0.0
 Identities = 810/1086 (74%), Positives = 929/1086 (85%), Gaps = 10/1086 (0%)
 Frame = +1

Query: 151  MERAVLLRSLP-STTVISRTRL---FTRSLHRLAPPRASSIYAKRCRLLPNLHR--RSLL 312
            MERA L+RSL  S+  + R+     F+ ++  ++     S   +   LL   H   R  L
Sbjct: 1    MERAALVRSLSCSSRYLCRSCSSFSFSSTISTISTTTKPSSILRNPLLLRRRHSSIRLPL 60

Query: 313  RSYLPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQSEVFG--ADDDVAEKLGFEKVSEEF 486
             S  P+L  R+ N    R  FS+   RA   S+P     G    D+VA +LGFEKVSEEF
Sbjct: 61   SSSSPLLYFRNRN----RNHFSTS--RASLVSSPDISGGGEVVKDEVARELGFEKVSEEF 114

Query: 487  IEECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRK 666
            I ECKS+A+L++H KTGA+VMSVSN+DENKVFGIVFRTPP+DSTGIPHILEHSVLCGSRK
Sbjct: 115  ITECKSKAVLFRHLKTGAQVMSVSNNDENKVFGIVFRTPPNDSTGIPHILEHSVLCGSRK 174

Query: 667  YPLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDY 846
            YPLKEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+CV+D 
Sbjct: 175  YPLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCVDDL 234

Query: 847  QTFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGG 1026
            QTFQQEGWHYELN PSEDI YKGVVFNEMKGVYSQPD+ILGR +QQALFP+NTYGVDSGG
Sbjct: 235  QTFQQEGWHYELNHPSEDITYKGVVFNEMKGVYSQPDNILGRAAQQALFPDNTYGVDSGG 294

Query: 1027 DPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESR 1206
            DP VIP LTFEEFKEFHRKYYHPSN+RIWFYGDDDPNERLRILSEYL+MF+ SSAP ES+
Sbjct: 295  DPRVIPNLTFEEFKEFHRKYYHPSNSRIWFYGDDDPNERLRILSEYLNMFDASSAPNESK 354

Query: 1207 VQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGT 1383
            V+PQKLFS+P+R++E YPA EG DL KKHMVC+NWLL+++PLDLETELALGFL+HLL+GT
Sbjct: 355  VEPQKLFSKPIRIVETYPAGEGGDL-KKHMVCLNWLLADKPLDLETELALGFLNHLLLGT 413

Query: 1384 PASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAE 1563
            PASPLRK+LLES LGDA+VGGG+EDELLQPQFSIG+KGV +DDI +VEELIM+TLKKLAE
Sbjct: 414  PASPLRKVLLESRLGDAIVGGGLEDELLQPQFSIGMKGVSEDDIHKVEELIMSTLKKLAE 473

Query: 1564 EGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDL 1743
            EGFD+DA+EASMNTIEF+LRENNTGSFPRGL+LML+++GKWIYDMNP  PLKY+KPL DL
Sbjct: 474  EGFDTDAIEASMNTIEFSLRENNTGSFPRGLSLMLQSIGKWIYDMNPLEPLKYEKPLQDL 533

Query: 1744 KARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEE 1923
            K++IA++GSK+VFSPLI ++ILNNPH+VTV+MQPDPEKA+RDEE EK+ L K+KASMT E
Sbjct: 534  KSKIAKEGSKSVFSPLIEKFILNNPHKVTVQMQPDPEKAARDEETEKQVLQKIKASMTTE 593

Query: 1924 DLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDL 2103
            DLAELARAT ELRLKQETPDPPEALK VPSLSLQDIPK+P+ VP EVGDINGVKVLQHDL
Sbjct: 594  DLAELARATHELRLKQETPDPPEALKTVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDL 653

Query: 2104 FTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYP 2283
            FTNDVLY+E+VFDMSSLKQELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYP
Sbjct: 654  FTNDVLYTEIVFDMSSLKQELLPLVPLFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYP 713

Query: 2284 FTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARME 2463
            FTS+V+GKE+PCSH++VRGKAMS R EDL++L+N +LQDVQ TDQ+RFKQFVSQS+ARME
Sbjct: 714  FTSSVQGKEDPCSHMIVRGKAMSGRAEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARME 773

Query: 2464 NRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIR 2643
            NR+RGSGHGIAAARMDAKLN+AGW+SE+MGG+SYLEFL++LE++VD+DW +I+SSLEEIR
Sbjct: 774  NRLRGSGHGIAAARMDAKLNAAGWMSEKMGGLSYLEFLQTLEKRVDEDWADISSSLEEIR 833

Query: 2644 RSLFSKSGCLINLTSDGKNLTNAEKHVSKF-XXXXXXXXXXXXXXWTARLPFTNEAIVIP 2820
            +++FSK GCLIN+T+DGKNL N +K VSKF               W ARLP TNEAIVIP
Sbjct: 834  KTVFSKQGCLINITADGKNLANMDKFVSKFVDMLPTSSPIATTNIWNARLPLTNEAIVIP 893

Query: 2821 TQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFS 3000
            TQVNYVGKA N+Y+ GY+L GSAYVISKYISNTWLWD VRVSGGAYGGFCDFDTHSGVFS
Sbjct: 894  TQVNYVGKATNVYDAGYKLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS 953

Query: 3001 FLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLR 3180
            FLSYRDPNLLKTL +YDGT  FLREL++DDD LT+AIIGTIGDVD+YQLPDAKGY+S+LR
Sbjct: 954  FLSYRDPNLLKTLEVYDGTGDFLRELEIDDDTLTKAIIGTIGDVDAYQLPDAKGYSSMLR 1013

Query: 3181 YLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDF 3360
            YLLG+T          ILST  KDFK+F   +E              +DVEAANKE ++F
Sbjct: 1014 YLLGITEEERQRRREEILSTSSKDFKQFIAAMEAVKDKGVVVAVASPEDVEAANKELANF 1073

Query: 3361 FQVKKA 3378
            FQVKKA
Sbjct: 1074 FQVKKA 1079


>gb|EOX98215.1| Presequence protease 2 isoform 1 [Theobroma cacao]
          Length = 1037

 Score = 1606 bits (4159), Expect = 0.0
 Identities = 795/1017 (78%), Positives = 901/1017 (88%), Gaps = 8/1017 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLH-RLAPPRASSIYAK---RCRLLPN--LHRRSLL 312
            MER  LLRSL  +++     LF+   H R    ++S++ A      RL+PN  L RR+  
Sbjct: 1    MERTALLRSLSCSSLACNKFLFSAPKHSRSFLSKSSTVSAAGRYHRRLIPNRSLIRRNNW 60

Query: 313  RSYLPVLSNRSPNFSSLRTQFSSQSVRAIAT-SAPQSEVFGADDDVAEKLGFEKVSEEFI 489
            RS     S+ S  F+     FSS S RA+A+ + P  ++ G +D+VAEKLGFEKVSEEFI
Sbjct: 61   RSLSVASSHSSLRFTYSNKNFSSLSPRAVASPTQPSPDIAGVEDEVAEKLGFEKVSEEFI 120

Query: 490  EECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 669
             ECKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKY
Sbjct: 121  GECKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKY 180

Query: 670  PLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQ 849
            PLKEPFVELLK SL TFLNAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED+Q
Sbjct: 181  PLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQ 240

Query: 850  TFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGD 1029
            TFQQEGWHYELND SEDI YKGVVFNEMKGVYSQPD++LGRT+QQALFP+NTYGVDSGGD
Sbjct: 241  TFQQEGWHYELNDTSEDITYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGD 300

Query: 1030 PEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRV 1209
            P+VIPKLT+EEFKEFHRKYYHPSNARIWFYGDDDP ERLRILSEYLDMF+ S+AP ES+V
Sbjct: 301  PQVIPKLTYEEFKEFHRKYYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKV 360

Query: 1210 QPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTP 1386
            +PQKLFSEPVR +EKYP  EG DL+KKHMVC+NWLLS++PLDL+TEL LGFLDHL++GTP
Sbjct: 361  EPQKLFSEPVRFVEKYPVGEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTP 420

Query: 1387 ASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEE 1566
            ASPLRK+LLESGLGDA++GGGVEDELLQPQFSIGLKGV +DDI +VEELIM++LKKLAEE
Sbjct: 421  ASPLRKVLLESGLGDAIIGGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEE 480

Query: 1567 GFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLK 1746
            GFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM+PF PLKY+KPLM LK
Sbjct: 481  GFDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILK 540

Query: 1747 ARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEED 1926
            ARIAE+GSKAVFSPLI ++ILNNPH VT+EMQPDPEKASRDE  EKE L+KVKASMTEED
Sbjct: 541  ARIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEED 600

Query: 1927 LAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLF 2106
            LAELARAT+EL+LKQETPDPPEAL+ VPSLSL DIPK+P+ VP EVGDINGVKVLQHDLF
Sbjct: 601  LAELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLF 660

Query: 2107 TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPF 2286
            TNDVLY++VVFDMSSLK+ELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPF
Sbjct: 661  TNDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPF 720

Query: 2287 TSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMEN 2466
            TS+++GKE+PCSHI+VRGK+M+   +DLFNLINC++Q+VQ TDQ+RFKQFVSQSKARME+
Sbjct: 721  TSSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMES 780

Query: 2467 RVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRR 2646
            R+RGSGHGIAAARMDAKLN +GWISEQMGG+SYLEFL+ LEE+VD DW  I+SSLEEIR+
Sbjct: 781  RLRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRK 840

Query: 2647 SLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQ 2826
            SL S+ GCLIN+T+DGKNL+N EK VSKF              W+ARLP  NEAIVIPTQ
Sbjct: 841  SLLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQ 900

Query: 2827 VNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFL 3006
            VNYVGKAANLY+ GYQL GSAYVISK+ISNTWLWD VRVSGGAYGGFC+FDTHSGVF+FL
Sbjct: 901  VNYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFL 960

Query: 3007 SYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLL 3177
            SYRDPNLL+TL+IYDGT  FLREL+MDDD LT+AIIGT+GDVD+YQLPDAKGY+  L
Sbjct: 961  SYRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDVDAYQLPDAKGYSRFL 1017


>gb|EOX98217.1| Presequence protease 2 isoform 3 [Theobroma cacao]
          Length = 1041

 Score = 1605 bits (4156), Expect = 0.0
 Identities = 794/1014 (78%), Positives = 900/1014 (88%), Gaps = 8/1014 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISRTRLFTRSLH-RLAPPRASSIYAK---RCRLLPN--LHRRSLL 312
            MER  LLRSL  +++     LF+   H R    ++S++ A      RL+PN  L RR+  
Sbjct: 1    MERTALLRSLSCSSLACNKFLFSAPKHSRSFLSKSSTVSAAGRYHRRLIPNRSLIRRNNW 60

Query: 313  RSYLPVLSNRSPNFSSLRTQFSSQSVRAIAT-SAPQSEVFGADDDVAEKLGFEKVSEEFI 489
            RS     S+ S  F+     FSS S RA+A+ + P  ++ G +D+VAEKLGFEKVSEEFI
Sbjct: 61   RSLSVASSHSSLRFTYSNKNFSSLSPRAVASPTQPSPDIAGVEDEVAEKLGFEKVSEEFI 120

Query: 490  EECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKY 669
             ECKS+A+L+KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKY
Sbjct: 121  GECKSKAVLFKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKY 180

Query: 670  PLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQ 849
            PLKEPFVELLK SL TFLNAFTYPDRTCYPVASTN KDFYNLVDVYLDAVFFP+C+ED+Q
Sbjct: 181  PLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQ 240

Query: 850  TFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGD 1029
            TFQQEGWHYELND SEDI YKGVVFNEMKGVYSQPD++LGRT+QQALFP+NTYGVDSGGD
Sbjct: 241  TFQQEGWHYELNDTSEDITYKGVVFNEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGD 300

Query: 1030 PEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRV 1209
            P+VIPKLT+EEFKEFHRKYYHPSNARIWFYGDDDP ERLRILSEYLDMF+ S+AP ES+V
Sbjct: 301  PQVIPKLTYEEFKEFHRKYYHPSNARIWFYGDDDPIERLRILSEYLDMFDASTAPDESKV 360

Query: 1210 QPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTP 1386
            +PQKLFSEPVR +EKYP  EG DL+KKHMVC+NWLLS++PLDL+TEL LGFLDHL++GTP
Sbjct: 361  EPQKLFSEPVRFVEKYPVGEGGDLKKKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTP 420

Query: 1387 ASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEE 1566
            ASPLRK+LLESGLGDA++GGGVEDELLQPQFSIGLKGV +DDI +VEELIM++LKKLAEE
Sbjct: 421  ASPLRKVLLESGLGDAIIGGGVEDELLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEE 480

Query: 1567 GFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLK 1746
            GFD+DAVEASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM+PF PLKY+KPLM LK
Sbjct: 481  GFDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILK 540

Query: 1747 ARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEED 1926
            ARIAE+GSKAVFSPLI ++ILNNPH VT+EMQPDPEKASRDE  EKE L+KVKASMTEED
Sbjct: 541  ARIAEEGSKAVFSPLIEKFILNNPHCVTIEMQPDPEKASRDEAAEKEILNKVKASMTEED 600

Query: 1927 LAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLF 2106
            LAELARAT+EL+LKQETPDPPEAL+ VPSLSL DIPK+P+ VP EVGDINGVKVLQHDLF
Sbjct: 601  LAELARATQELKLKQETPDPPEALRSVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLF 660

Query: 2107 TNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPF 2286
            TNDVLY++VVFDMSSLK+ELLPLVPLFCQSLLEMGTKDL FVQLNQLIGRKTGGISVYPF
Sbjct: 661  TNDVLYTDVVFDMSSLKRELLPLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPF 720

Query: 2287 TSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMEN 2466
            TS+++GKE+PCSHI+VRGK+M+   +DLFNLINC++Q+VQ TDQ+RFKQFVSQSKARME+
Sbjct: 721  TSSIQGKEDPCSHIIVRGKSMAGCADDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMES 780

Query: 2467 RVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRR 2646
            R+RGSGHGIAAARMDAKLN +GWISEQMGG+SYLEFL+ LEE+VD DW  I+SSLEEIR+
Sbjct: 781  RLRGSGHGIAAARMDAKLNVSGWISEQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRK 840

Query: 2647 SLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQ 2826
            SL S+ GCLIN+T+DGKNL+N EK VSKF              W+ARLP  NEAIVIPTQ
Sbjct: 841  SLLSREGCLINMTADGKNLSNTEKLVSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQ 900

Query: 2827 VNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFL 3006
            VNYVGKAANLY+ GYQL GSAYVISK+ISNTWLWD VRVSGGAYGGFC+FDTHSGVF+FL
Sbjct: 901  VNYVGKAANLYDGGYQLNGSAYVISKHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFL 960

Query: 3007 SYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYT 3168
            SYRDPNLL+TL+IYDGT  FLREL+MDDD LT+AIIGT+GDVD+YQLPDAKGY+
Sbjct: 961  SYRDPNLLETLDIYDGTGDFLRELEMDDDTLTKAIIGTVGDVDAYQLPDAKGYS 1014


>ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [Amborella trichopoda]
            gi|548835199|gb|ERM97096.1| hypothetical protein
            AMTR_s00126p00013900 [Amborella trichopoda]
          Length = 1075

 Score = 1592 bits (4122), Expect = 0.0
 Identities = 796/1081 (73%), Positives = 913/1081 (84%), Gaps = 6/1081 (0%)
 Frame = +1

Query: 151  MERAVLLRSLPSTTVISR-TRLFTRSLHRLA--PPRASSIYAKRCRLLPNLHRRSLLRSY 321
            MER VLLRSL  +T   R   L  RS  + A  P     + + R R LP L   S +R  
Sbjct: 1    MERVVLLRSLSCSTACMRFLSLKPRSSWKTASTPLTQQLLISPRNRGLP-LACGSRMRWV 59

Query: 322  LPVLSNRSPNFSSLRTQFSSQSVRAIATSAPQ-SEVFGADDDVAEKLGFEKVSEEFIEEC 498
                   +  ++    +  S S +AIAT + Q S       D+A +LGFEKVSE+ IEEC
Sbjct: 60   ------STSRYAFQHKRGFSVSPQAIATPSKQASSGIDGSHDIAHELGFEKVSEQLIEEC 113

Query: 499  KSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPLK 678
            KS+AILYKHKKTGAEV+SV NDDENKVFGIVFRTPP DSTGIPHILEHSVLCGSRKYPLK
Sbjct: 114  KSKAILYKHKKTGAEVISVVNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLK 173

Query: 679  EPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPRCVEDYQTFQ 858
            EPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFP+C+EDYQTFQ
Sbjct: 174  EPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDVYLDAVFFPKCIEDYQTFQ 233

Query: 859  QEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQALFPNNTYGVDSGGDPEV 1038
            QEGWHYELN+P E+I  KGVVFNEMKGVYSQPD+I+GR SQQ +FP+NTYGVDSGGDP+V
Sbjct: 234  QEGWHYELNNPEEEISLKGVVFNEMKGVYSQPDNIMGRISQQVMFPDNTYGVDSGGDPKV 293

Query: 1039 IPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEYLDMFEGSSAPLESRVQPQ 1218
            IPKLTFEEFKEFHRKYYHPSN++IWFYGDDDPNERLR +S YLD F+ SSAP ES+V PQ
Sbjct: 294  IPKLTFEEFKEFHRKYYHPSNSKIWFYGDDDPNERLRTISVYLDQFDASSAPYESKVVPQ 353

Query: 1219 KLFSEPVRVIEKYPAEGD--DLEKKHMVCVNWLLSEQPLDLETELALGFLDHLLMGTPAS 1392
            KLF +PV+V+EKYPA GD  DL+KKHMV +NWLLSE+PLDLETELALGFLDHL++GTPAS
Sbjct: 354  KLFPKPVKVVEKYPA-GDTGDLKKKHMVSLNWLLSEEPLDLETELALGFLDHLMLGTPAS 412

Query: 1393 PLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQRVEELIMNTLKKLAEEGF 1572
            PLRK LLESGLGDAL+GGG+EDELLQPQFS+GLKGV ++D+++VE+LI+ TL++LA +GF
Sbjct: 413  PLRKTLLESGLGDALIGGGIEDELLQPQFSVGLKGVAEEDVRKVEDLIIQTLEELANKGF 472

Query: 1573 DSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMNPFVPLKYKKPLMDLKAR 1752
            D +A+EASMNTIEF+LRENNTGSFPRGL+LMLR++GKWIYDM+PF PLKY+KPL DLKAR
Sbjct: 473  DVEAIEASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLNDLKAR 532

Query: 1753 IAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKEKENLSKVKASMTEEDLA 1932
            IAE+GSKAVFSPLI ++IL+NPHRVT+EMQPD EKASRDE  EKE+L KVKASMTEEDLA
Sbjct: 533  IAEEGSKAVFSPLIQKFILDNPHRVTIEMQPDTEKASRDEADEKESLEKVKASMTEEDLA 592

Query: 1933 ELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIEVGDINGVKVLQHDLFTN 2112
            ELARAT+ELRLKQETPDPPE LKCVPSLSL DIPK P+HVPIE+G+INGVKVLQH+LFTN
Sbjct: 593  ELARATQELRLKQETPDPPEVLKCVPSLSLHDIPKHPIHVPIEIGEINGVKVLQHELFTN 652

Query: 2113 DVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGISVYPFTS 2292
            DVLY+EVVFDM  +KQELLPL+PLFCQSLLEMGTKD+DFVQLNQLIGRKTGGIS+YPFTS
Sbjct: 653  DVLYAEVVFDMCLVKQELLPLIPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISIYPFTS 712

Query: 2293 TVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQKRFKQFVSQSKARMENRV 2472
            ++RGK EPCS I+VR K+M++R +DLFNL+N +LQDVQ TDQ+RFKQFV QSKARME+R+
Sbjct: 713  SIRGKVEPCSRIIVRAKSMAARVDDLFNLVNTVLQDVQFTDQQRFKQFVCQSKARMESRL 772

Query: 2473 RGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVDKDWPEIASSLEEIRRSL 2652
            RGSGHGIAAARMDAKLN+AGWI+EQMGGISYL+FL +LE++VD+DW  I+ SLE+IRRSL
Sbjct: 773  RGSGHGIAAARMDAKLNTAGWIAEQMGGISYLQFLETLEKQVDQDWSAISCSLEDIRRSL 832

Query: 2653 FSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWTARLPFTNEAIVIPTQVN 2832
             S+ GCLINLT+DGKNL+N+EKHVSKF              W A+L   NEA+VIPTQVN
Sbjct: 833  LSRKGCLINLTADGKNLSNSEKHVSKFLDLLPATSSLETTSWKAQLYLGNEALVIPTQVN 892

Query: 2833 YVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYGGFCDFDTHSGVFSFLSY 3012
            YVGKA NLY+TGYQL GS YVIS YI NTWLWD VRVSGGAYGGFCDFDTHSGVFS+LSY
Sbjct: 893  YVGKAGNLYDTGYQLNGSTYVISMYIGNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSY 952

Query: 3013 RDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSYQLPDAKGYTSLLRYLLG 3192
            RDPNLLKTL+IYDGT+ FLREL++D+D LT+AIIGTIGDVD YQLPDAKGY+S+LRYLLG
Sbjct: 953  RDPNLLKTLDIYDGTANFLRELELDEDTLTKAIIGTIGDVDGYQLPDAKGYSSMLRYLLG 1012

Query: 3193 VTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXRDDVEAANKECSDFFQVK 3372
            +T          ILST LKDF +FADV++              DDV AAN+E   FFQVK
Sbjct: 1013 ITEEERQKRHEEILSTSLKDFHDFADVVDVVKHKGVVVAVASEDDVTAANEERPGFFQVK 1072

Query: 3373 K 3375
            K
Sbjct: 1073 K 1073


>ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Populus trichocarpa]
            gi|222849515|gb|EEE87062.1| hypothetical protein
            POPTR_0009s10650g [Populus trichocarpa]
          Length = 1006

 Score = 1592 bits (4122), Expect = 0.0
 Identities = 782/979 (79%), Positives = 873/979 (89%), Gaps = 1/979 (0%)
 Frame = +1

Query: 445  VAEKLGFEKVSEEFIEECKSRAILYKHKKTGAEVMSVSNDDENKVFGIVFRTPPSDSTGI 624
            VA K GFEKVSE+FI ECKSRA+L KHKKTGAEVMSVSNDDENKVFGIVFRTPP DSTGI
Sbjct: 30   VAAKYGFEKVSEDFIGECKSRAVLLKHKKTGAEVMSVSNDDENKVFGIVFRTPPKDSTGI 89

Query: 625  PHILEHSVLCGSRKYPLKEPFVELLKVSLQTFLNAFTYPDRTCYPVASTNTKDFYNLVDV 804
            PHILEHSVLCGSRKYPLKEPFVELLK SL TFLNAFTYPDRTCYPVASTNTKDFYNLVDV
Sbjct: 90   PHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPVASTNTKDFYNLVDV 149

Query: 805  YLDAVFFPRCVEDYQTFQQEGWHYELNDPSEDIIYKGVVFNEMKGVYSQPDSILGRTSQQ 984
            YLDAVFFP+CVED+ TFQQEGWH ELN+PSE+I YKGVVFNEMKGVYSQPD+ILGRT+Q 
Sbjct: 150  YLDAVFFPKCVEDHHTFQQEGWHLELNNPSEEISYKGVVFNEMKGVYSQPDNILGRTAQL 209

Query: 985  ALFPNNTYGVDSGGDPEVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPNERLRILSEY 1164
            A   NNTYGVDSGGDP+VIPKLTFE+FKEFH KYYHPSNARIWFYGDDDP ERLRILSEY
Sbjct: 210  A---NNTYGVDSGGDPKVIPKLTFEQFKEFHGKYYHPSNARIWFYGDDDPTERLRILSEY 266

Query: 1165 LDMFEGSSAPLESRVQPQKLFSEPVRVIEKYPA-EGDDLEKKHMVCVNWLLSEQPLDLET 1341
            LDMF+ SSA  ESR++ QK FSEPVR++EKYPA +G DL+KKHMVC+NWLL+++PLDLET
Sbjct: 267  LDMFDASSASNESRIEQQKFFSEPVRIVEKYPAGDGSDLKKKHMVCLNWLLADKPLDLET 326

Query: 1342 ELALGFLDHLLMGTPASPLRKILLESGLGDALVGGGVEDELLQPQFSIGLKGVKKDDIQR 1521
            EL LGFLDHL++GTPASPLRKILLESGLGDA+VGGGVEDELLQPQFSIGLKGV ++DI++
Sbjct: 327  ELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGVEDELLQPQFSIGLKGVSEEDIEK 386

Query: 1522 VEELIMNTLKKLAEEGFDSDAVEASMNTIEFALRENNTGSFPRGLALMLRAMGKWIYDMN 1701
            VEEL+M+TLKKLAEEGF++DAVEASMNTIEF+LRENNTGSFPRGL+LML+++ KWIYDM+
Sbjct: 387  VEELVMSTLKKLAEEGFETDAVEASMNTIEFSLRENNTGSFPRGLSLMLQSISKWIYDMD 446

Query: 1702 PFVPLKYKKPLMDLKARIAEQGSKAVFSPLIVEYILNNPHRVTVEMQPDPEKASRDEEKE 1881
            PF PLKY+KPLM LKARIAE+GSKAVFSPLI ++ILNN HRVT+EMQPDPEKASRDE  E
Sbjct: 447  PFEPLKYEKPLMALKARIAEEGSKAVFSPLIEKFILNNLHRVTIEMQPDPEKASRDEAAE 506

Query: 1882 KENLSKVKASMTEEDLAELARATEELRLKQETPDPPEALKCVPSLSLQDIPKKPVHVPIE 2061
            +E L KVKASMTEEDLAELARAT+ELRLKQETPDPPEAL+ VPSLSL DIPK+P+HVP E
Sbjct: 507  REILEKVKASMTEEDLAELARATQELRLKQETPDPPEALRSVPSLSLLDIPKEPLHVPTE 566

Query: 2062 VGDINGVKVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLN 2241
             GDINGVKVL+HDLFTNDVLY+E+VF+M SLKQELLPLVPLFCQSLLEMGTKDL FVQLN
Sbjct: 567  AGDINGVKVLKHDLFTNDVLYAEIVFNMRSLKQELLPLVPLFCQSLLEMGTKDLTFVQLN 626

Query: 2242 QLIGRKTGGISVYPFTSTVRGKEEPCSHIVVRGKAMSSRTEDLFNLINCILQDVQLTDQK 2421
            QLIGRKTGGISVYPFTS+++G+E+PCSHI+ +GKAM+ R EDLFNL+NC+LQ+VQ TDQ+
Sbjct: 627  QLIGRKTGGISVYPFTSSIQGREDPCSHIIAQGKAMAGRVEDLFNLVNCVLQEVQFTDQQ 686

Query: 2422 RFKQFVSQSKARMENRVRGSGHGIAAARMDAKLNSAGWISEQMGGISYLEFLRSLEEKVD 2601
            RFKQFVSQSKA MENR+RGSGH IAA RMDAKLN  GWISEQMGG+SYLEFL++LEE+VD
Sbjct: 687  RFKQFVSQSKAGMENRLRGSGHRIAATRMDAKLNVTGWISEQMGGVSYLEFLQALEERVD 746

Query: 2602 KDWPEIASSLEEIRRSLFSKSGCLINLTSDGKNLTNAEKHVSKFXXXXXXXXXXXXXXWT 2781
            +DW  ++SSLEEIR SL SK+GCLIN+T+DGKNLTN+EK+VSKF              W 
Sbjct: 747  QDWAGVSSSLEEIRTSLLSKNGCLINMTADGKNLTNSEKYVSKFLDLLPSKSSVEAAAWN 806

Query: 2782 ARLPFTNEAIVIPTQVNYVGKAANLYETGYQLKGSAYVISKYISNTWLWDHVRVSGGAYG 2961
            ARL   NEAIVIPTQVNYVGKAAN+Y+TGYQL GSAYVISKYISNTWLWD VRVSGGAYG
Sbjct: 807  ARLSPGNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKYISNTWLWDRVRVSGGAYG 866

Query: 2962 GFCDFDTHSGVFSFLSYRDPNLLKTLNIYDGTSQFLRELDMDDDALTRAIIGTIGDVDSY 3141
            GFCD DTHSGVFSFLSYRDPNLLKTL++YDGT  FLR+L+MDDD L++AIIGTIGDVDSY
Sbjct: 867  GFCDLDTHSGVFSFLSYRDPNLLKTLDVYDGTGAFLRQLEMDDDTLSKAIIGTIGDVDSY 926

Query: 3142 QLPDAKGYTSLLRYLLGVTXXXXXXXXXXILSTRLKDFKEFADVIEXXXXXXXXXXXXXR 3321
            QLPDAKGY+SLLRYLLG+T          ILST LKDFKEF +VIE              
Sbjct: 927  QLPDAKGYSSLLRYLLGITEEERQKRREEILSTSLKDFKEFGEVIEAVKDKWVSVAVASP 986

Query: 3322 DDVEAANKECSDFFQVKKA 3378
            DDV+ ANKE S++F VKKA
Sbjct: 987  DDVDDANKERSNYFDVKKA 1005


Top