BLASTX nr result

ID: Cocculus23_contig00001880 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00001880
         (3822 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002282024.1| PREDICTED: presequence protease 2, chloropla...  1674   0.0  
emb|CBI32433.3| unnamed protein product [Vitis vinifera]             1663   0.0  
ref|XP_007200813.1| hypothetical protein PRUPE_ppa025698mg, part...  1654   0.0  
ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citr...  1649   0.0  
ref|XP_004296078.1| PREDICTED: presequence protease 1, chloropla...  1647   0.0  
ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Popu...  1646   0.0  
ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence ...  1644   0.0  
ref|XP_007042385.1| Presequence protease 2 isoform 2 [Theobroma ...  1635   0.0  
ref|XP_004136986.1| PREDICTED: presequence protease 1, chloropla...  1632   0.0  
ref|XP_004159889.1| PREDICTED: LOW QUALITY PROTEIN: presequence ...  1630   0.0  
ref|XP_003517606.1| PREDICTED: presequence protease 2, chloropla...  1623   0.0  
ref|XP_006346464.1| PREDICTED: presequence protease 1, chloropla...  1618   0.0  
ref|XP_004230817.1| PREDICTED: presequence protease 1, chloropla...  1618   0.0  
ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Popu...  1617   0.0  
ref|XP_004511282.1| PREDICTED: presequence protease 1, chloropla...  1610   0.0  
ref|XP_007157239.1| hypothetical protein PHAVU_002G054400g [Phas...  1608   0.0  
ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [A...  1604   0.0  
ref|XP_007042384.1| Presequence protease 2 isoform 1 [Theobroma ...  1599   0.0  
ref|XP_007042386.1| Presequence protease 2 isoform 3 [Theobroma ...  1598   0.0  
ref|XP_006406530.1| hypothetical protein EUTSA_v10019955mg [Eutr...  1579   0.0  

>ref|XP_002282024.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial-like
            [Vitis vinifera]
          Length = 1080

 Score = 1674 bits (4336), Expect = 0.0
 Identities = 823/1000 (82%), Positives = 905/1000 (90%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            KAIAT  +  S D  G+ DD+AEK GF+KVSE+FI ECKSKAVLYKHKKTGAEVMSVSND
Sbjct: 81   KAIATSPEQASSDAVGSQDDLAEKYGFDKVSEQFIQECKSKAVLYKHKKTGAEVMSVSND 140

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 141  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 200

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAV FPKCVEDFQTFQQEGWHYELN+PSEDIS+KGVVF
Sbjct: 201  RTCYPVASTNTKDFYNLVDVYLDAVLFPKCVEDFQTFQQEGWHYELNNPSEDISYKGVVF 260

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR++QQALFPDNTYGVDSGGDPKVIPKLTFE+FK FHRKYYHP NA
Sbjct: 261  NEMKGVYSQPDNILGRTAQQALFPDNTYGVDSGGDPKVIPKLTFEDFKEFHRKYYHPGNA 320

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRIL+EYLD+FD S A  ESK+EPQKLF+ PVRI+EKYPAG+GGDL+
Sbjct: 321  RIWFYGDDDPNERLRILNEYLDLFDTSPASSESKVEPQKLFSNPVRIVEKYPAGKGGDLR 380

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVCLNWLLSDKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGG+EDE
Sbjct: 381  KKHMVCLNWLLSDKPLDLETELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGMEDE 440

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSEDDI KVEEL+++ L+ LA+EGF+ EAVEAS+NTIEFSLRENNTGS
Sbjct: 441  LLQPQFSIGLKGVSEDDIHKVEELVMSTLKSLAKEGFNSEAVEASMNTIEFSLRENNTGS 500

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL+ALKARI+EEG KAVF PLIEK+ILNNPH
Sbjct: 501  FPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMALKARIAEEGSKAVFSPLIEKYILNNPH 560

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VT+EMQPDP KAS+DEA EREILEKVKA MTE DLAELARAT+ELRLKQETPDPPEAL+
Sbjct: 561  CVTVEMQPDPEKASRDEAVEREILEKVKAGMTEEDLAELARATQELRLKQETPDPPEALK 620

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIPK+PI +PIE+G IN V+ L+HDLFTNDVLY E+VFDM+ LK +L+PLVP
Sbjct: 621  SVPSLSLLDIPKEPIHVPIEIGVINDVKVLRHDLFTNDVLYTEIVFDMSSLKQDLLPLVP 680

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSL+EMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKE P SH+IVRGK+M+G A
Sbjct: 681  LFCQSLMEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEYPCSHIIVRGKAMAGCA 740

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNLVNCIL +VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWIA
Sbjct: 741  EDLFNLVNCILQEVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNTAGWIA 800

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL++LEEKV++DW  ISSSLEEIRKSLLS+KGCL+N+T+EGK L ++EK+
Sbjct: 801  EQMGGVSYLEFLQALEEKVDQDWIGISSSLEEIRKSLLSRKGCLINMTSEGKNLMNSEKY 860

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            VSKFLD           ++N RL   NEAIVIPTQVNY+GKA NIY+TGYQL GSAYVIS
Sbjct: 861  VSKFLDLLPGSSSVEKTTWNGRLSSENEAIVIPTQVNYVGKATNIYDTGYQLKGSAYVIS 920

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNLLKTL+VYDGT +FLR+LE
Sbjct: 921  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRQLE 980

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +DDD+LTKAIIGTIGDVDAYQLPDAKGYSS+LR+LLGVTEEERQKRREEILS+SL DFK 
Sbjct: 981  MDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRYLLGVTEEERQKRREEILSTSLKDFKE 1040

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+AIEA K KG             ANKE  NFF+VK+ L
Sbjct: 1041 FADAIEAAKHKGVVVAVASPDDVDAANKEHPNFFQVKKAL 1080


>emb|CBI32433.3| unnamed protein product [Vitis vinifera]
          Length = 1098

 Score = 1663 bits (4307), Expect = 0.0
 Identities = 823/1018 (80%), Positives = 905/1018 (88%), Gaps = 18/1018 (1%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            KAIAT  +  S D  G+ DD+AEK GF+KVSE+FI ECKSKAVLYKHKKTGAEVMSVSND
Sbjct: 81   KAIATSPEQASSDAVGSQDDLAEKYGFDKVSEQFIQECKSKAVLYKHKKTGAEVMSVSND 140

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 141  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 200

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAV FPKCVEDFQTFQQEGWHYELN+PSEDIS+KGVVF
Sbjct: 201  RTCYPVASTNTKDFYNLVDVYLDAVLFPKCVEDFQTFQQEGWHYELNNPSEDISYKGVVF 260

Query: 2920 NEMKGVYSQPDNILGRSSQQA------------------LFPDNTYGVDSGGDPKVIPKL 2795
            NEMKGVYSQPDNILGR++QQA                  LFPDNTYGVDSGGDPKVIPKL
Sbjct: 261  NEMKGVYSQPDNILGRTAQQASFLDKYGVCGYEEPIGSALFPDNTYGVDSGGDPKVIPKL 320

Query: 2794 TFEEFKNFHRKYYHPSNARIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFT 2615
            TFE+FK FHRKYYHP NARIWFYGDDDP+ERLRIL+EYLD+FD S A  ESK+EPQKLF+
Sbjct: 321  TFEDFKEFHRKYYHPGNARIWFYGDDDPNERLRILNEYLDLFDTSPASSESKVEPQKLFS 380

Query: 2614 EPVRIIEKYPAGEGGDLKKKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRI 2435
             PVRI+EKYPAG+GGDL+KKHMVCLNWLLSDKPLD+ETE          LGTPASPLR+I
Sbjct: 381  NPVRIVEKYPAGKGGDLRKKHMVCLNWLLSDKPLDLETELTLGFLDHLMLGTPASPLRKI 440

Query: 2434 LLESGLGEAIVGGGIEDELLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAV 2255
            LLESGLG+AIVGGG+EDELLQPQFS+GLKGVSEDDI KVEEL+++ L+ LA+EGF+ EAV
Sbjct: 441  LLESGLGDAIVGGGMEDELLQPQFSIGLKGVSEDDIHKVEELVMSTLKSLAKEGFNSEAV 500

Query: 2254 EASLNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEG 2075
            EAS+NTIEFSLRENNTGSFPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL+ALKARI+EEG
Sbjct: 501  EASMNTIEFSLRENNTGSFPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMALKARIAEEG 560

Query: 2074 PKAVFCPLIEKFILNNPHRVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARA 1895
             KAVF PLIEK+ILNNPH VT+EMQPDP KAS+DEA EREILEKVKA MTE DLAELARA
Sbjct: 561  SKAVFSPLIEKYILNNPHCVTVEMQPDPEKASRDEAVEREILEKVKAGMTEEDLAELARA 620

Query: 1894 TRELRLKQETPDPPEALRSVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYA 1715
            T+ELRLKQETPDPPEAL+SVPSLSL DIPK+PI +PIE+G IN V+ L+HDLFTNDVLY 
Sbjct: 621  TQELRLKQETPDPPEALKSVPSLSLLDIPKEPIHVPIEIGVINDVKVLRHDLFTNDVLYT 680

Query: 1714 EVVFDMAHLKAELVPLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGK 1535
            E+VFDM+ LK +L+PLVPLFCQSL+EMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGK
Sbjct: 681  EIVFDMSSLKQDLLPLVPLFCQSLMEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGK 740

Query: 1534 EDPTSHVIVRGKSMSGRAEDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGH 1355
            E P SH+IVRGK+M+G AEDLFNLVNCIL +VQFTDQQRFKQFVSQSKARMENRLRGSGH
Sbjct: 741  EYPCSHIIVRGKAMAGCAEDLFNLVNCILQEVQFTDQQRFKQFVSQSKARMENRLRGSGH 800

Query: 1354 GIAAARMDAKLNAAGWIAEQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKG 1175
            GIAAARMDAKLN AGWIAEQMGG+SYLEFL++LEEKV++DW  ISSSLEEIRKSLLS+KG
Sbjct: 801  GIAAARMDAKLNTAGWIAEQMGGVSYLEFLQALEEKVDQDWIGISSSLEEIRKSLLSRKG 860

Query: 1174 CLVNLTAEGKTLTSTEKFVSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKA 995
            CL+N+T+EGK L ++EK+VSKFLD           ++N RL   NEAIVIPTQVNY+GKA
Sbjct: 861  CLINMTSEGKNLMNSEKYVSKFLDLLPGSSSVEKTTWNGRLSSENEAIVIPTQVNYVGKA 920

Query: 994  ANIYETGYQLNGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNL 815
             NIY+TGYQL GSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNL
Sbjct: 921  TNIYDTGYQLKGSAYVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNL 980

Query: 814  LKTLEVYDGTTNFLRELELDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEE 635
            LKTL+VYDGT +FLR+LE+DDD+LTKAIIGTIGDVDAYQLPDAKGYSS+LR+LLGVTEEE
Sbjct: 981  LKTLDVYDGTGDFLRQLEMDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRYLLGVTEEE 1040

Query: 634  RQKRREEILSSSLNDFKAFAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            RQKRREEILS+SL DFK FA+AIEA K KG             ANKE  NFF+VK+ L
Sbjct: 1041 RQKRREEILSTSLKDFKEFADAIEAAKHKGVVVAVASPDDVDAANKEHPNFFQVKKAL 1098


>ref|XP_007200813.1| hypothetical protein PRUPE_ppa025698mg, partial [Prunus persica]
            gi|462396213|gb|EMJ02012.1| hypothetical protein
            PRUPE_ppa025698mg, partial [Prunus persica]
          Length = 986

 Score = 1654 bits (4282), Expect = 0.0
 Identities = 812/987 (82%), Positives = 901/987 (91%)
 Frame = -3

Query: 3421 FSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSNDDENKVFGVVFRTP 3242
            FSG  D+V EKLGFEKVSEEFI ECKSKA+L++HKKTGA+V+SVSNDDENKVFG+VFRTP
Sbjct: 1    FSGVEDEVVEKLGFEKVSEEFIGECKSKALLFRHKKTGAQVISVSNDDENKVFGIVFRTP 60

Query: 3241 PKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNAKD 3062
            P DSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTN KD
Sbjct: 61   PNDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNTKD 120

Query: 3061 FYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVFNEMKGVYSQPDNI 2882
            FYNLVDVYLDAVFFPKCVEDF+TFQQEGWHYELNDPSEDIS+KGVVFNEMKGVYSQPDNI
Sbjct: 121  FYNLVDVYLDAVFFPKCVEDFRTFQQEGWHYELNDPSEDISYKGVVFNEMKGVYSQPDNI 180

Query: 2881 LGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNARIWFYGDDDPSER 2702
            LGR+SQQALFPDNTYGVDSGGDPKVIPKLTFEEFK FHRKYYHPSNARIWFYGDDDP+ER
Sbjct: 181  LGRASQQALFPDNTYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNARIWFYGDDDPTER 240

Query: 2701 LRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLKKKHMVCLNWLLSD 2522
            LRILSEYLDMFD SS+P+ES+I+ QKLF+EP+RI EKYPAGEGGDL+KK+MVCLNWLLSD
Sbjct: 241  LRILSEYLDMFDASSSPNESRIQAQKLFSEPIRISEKYPAGEGGDLRKKNMVCLNWLLSD 300

Query: 2521 KPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDELLQPQFSVGLKGV 2342
            KPLD+ETE          LGTPASPLR+ILLESGLGEAIVGGG+EDELLQPQFS+GLKGV
Sbjct: 301  KPLDLETELTLGFLDHLMLGTPASPLRKILLESGLGEAIVGGGVEDELLQPQFSIGLKGV 360

Query: 2341 SEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGSFPRGLSLMLRSIG 2162
            SEDDIQ VEE++++ L+KLAEEGFD +AVEAS+NTIEFSLRENNTGSFPRGLSLMLRS+G
Sbjct: 361  SEDDIQNVEEVVMSTLKKLAEEGFDTDAVEASMNTIEFSLRENNTGSFPRGLSLMLRSMG 420

Query: 2161 KWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPHRVTIEMQPDPVKA 1982
            KWIYD+DPFEPLKYEKPLLALKARI  EG KAVF PLIEKFILNN HRV +EMQPDP KA
Sbjct: 421  KWIYDMDPFEPLKYEKPLLALKARIEAEGSKAVFSPLIEKFILNNRHRVVVEMQPDPEKA 480

Query: 1981 SQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALRSVPSLSLNDIPKK 1802
            S+DE AE++IL+KVKA MTE DLAELARAT+ELRL+QETPDPPEALRSVPSLSL DIPK+
Sbjct: 481  SRDEEAEKQILDKVKAGMTEEDLAELARATQELRLRQETPDPPEALRSVPSLSLQDIPKE 540

Query: 1801 PIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVPLFCQSLLEMGTKD 1622
            P ++P EVG+INGV+ LQHDLFTNDVLY EVVF+M+ LK EL+PLVPLFCQSLLEMGTKD
Sbjct: 541  PTRVPTEVGDINGVKVLQHDLFTNDVLYTEVVFNMSSLKQELLPLVPLFCQSLLEMGTKD 600

Query: 1621 MDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRAEDLFNLVNCILHD 1442
            + FVQLNQLIGRKTGGISVYP TSSVRGKEDP SH+IVRGK+M+GRA+DLF+L NC+L +
Sbjct: 601  LSFVQLNQLIGRKTGGISVYPMTSSVRGKEDPCSHIIVRGKAMAGRADDLFHLFNCVLQE 660

Query: 1441 VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIAEQMGGISYLEFLR 1262
            VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWI+EQMGG+SYLEFL+
Sbjct: 661  VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNVAGWISEQMGGVSYLEFLQ 720

Query: 1261 SLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKFVSKFLDXXXXXXX 1082
            +LEEKV++DW  ISSSLEEIRKSLLS+ GC+VN+TAEGK LT++EKFVSKFLD       
Sbjct: 721  ALEEKVDQDWDGISSSLEEIRKSLLSRNGCIVNMTAEGKNLTNSEKFVSKFLD-LLPNSP 779

Query: 1081 XXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVISKYISNTWLWDRVR 902
                ++NARL   NEAIVIPTQVNY+GKAANIY+TGYQLNGSAYVISKYI NTWLWDRVR
Sbjct: 780  VATSTWNARLPSSNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKYICNTWLWDRVR 839

Query: 901  VSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELELDDDSLTKAIIGT 722
            VSGGAYGGFCDFD+HSGVFS+LSYRDPNL KTL VYDGT +FLR+L++DD++LTK+IIGT
Sbjct: 840  VSGGAYGGFCDFDSHSGVFSFLSYRDPNLFKTLGVYDGTGDFLRQLDMDDETLTKSIIGT 899

Query: 721  IGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKAFAEAIEAVKEKGX 542
            IGDVD+YQLPDAKGYSS+LR LLGVTEEERQ+RREEILS+S+ DFK FAEAI+AVK KG 
Sbjct: 900  IGDVDSYQLPDAKGYSSLLRHLLGVTEEERQRRREEILSTSVKDFKEFAEAIDAVKNKGV 959

Query: 541  XXXXXXXXXXXXANKERSNFFEVKQVL 461
                        A+KE++NFFEVK+ L
Sbjct: 960  VVAVASPDDVEAAHKEQNNFFEVKKAL 986


>ref|XP_006423047.1| hypothetical protein CICLE_v10027722mg [Citrus clementina]
            gi|557524981|gb|ESR36287.1| hypothetical protein
            CICLE_v10027722mg [Citrus clementina]
          Length = 1082

 Score = 1649 bits (4269), Expect = 0.0
 Identities = 804/1000 (80%), Positives = 907/1000 (90%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+PS   SP+ +   ++VAEKLGFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSND
Sbjct: 83   RAVASPSTPSSPEVAEVSNEVAEKLGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSND 142

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD
Sbjct: 143  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 202

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWH+EL++PSEDI++KGVVF
Sbjct: 203  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHFELDNPSEDITYKGVVF 262

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR++QQALFPDN YGVDSGGDPKVIPKLTFEEFK FHRKYYHPSNA
Sbjct: 263  NEMKGVYSQPDNILGRAAQQALFPDNAYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNA 322

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYL+MF+ SSAP+ES +E QKLF+EPVRIIEKYPAG+ GD+K
Sbjct: 323  RIWFYGDDDPNERLRILSEYLNMFEASSAPNESIVEKQKLFSEPVRIIEKYPAGDAGDIK 382

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KK+MVCLNWLLSDKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGGIEDE
Sbjct: 383  KKNMVCLNWLLSDKPLDLETELALGFLDHLMLGTPASPLRKILLESGLGDAIVGGGIEDE 442

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLK VSEDDIQKVEELI++ L+KLA+EGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 443  LLQPQFSIGLKNVSEDDIQKVEELIMDTLKKLADEGFDSDAVEASMNTIEFSLRENNTGS 502

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRS+GKWIYD++PFEPLKYEKPL+ALKAR++EEGPKAVF PLIEK+ILNNPH
Sbjct: 503  FPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKARLAEEGPKAVFSPLIEKYILNNPH 562

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VT+EMQPDP KAS+DEAAE+EIL KVK+SMT+ DLAELARAT ELRLKQETPDPPEALR
Sbjct: 563  CVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDLAELARATEELRLKQETPDPPEALR 622

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY EVVFDM+ LK EL+PL+P
Sbjct: 623  SVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTEVVFDMSSLKQELLPLIP 682

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSL EMGTKD+ FVQL+QLIGRKTGGISVYPFTSS+RGKEDP   ++VRGK+M+G+A
Sbjct: 683  LFCQSLKEMGTKDLSFVQLDQLIGRKTGGISVYPFTSSIRGKEDPCCCMVVRGKAMAGQA 742

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNL NC+L +VQ TDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWI+
Sbjct: 743  EDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNTAGWIS 802

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL++LEEKV++DW  ISSSLEEIR+S LS++GCL+N+TA+GK L ++E+F
Sbjct: 803  EQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRSFLSREGCLINITADGKNLKNSERF 862

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            V KFLD            + A L   NEAIVIPTQVNY+GKAANI+ETGY+LNGSAYVIS
Sbjct: 863  VGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQVNYVGKAANIFETGYKLNGSAYVIS 922

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISN WLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL++YDGT +FLRELE
Sbjct: 923  KHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTVDFLRELE 982

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +DDD+LTKAIIGTIGDVDAYQLPDAKGYSS+LR LLG+TEEERQ+RREEILS+SL DFK 
Sbjct: 983  MDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSLKDFKE 1042

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+ +EA+K++G             ANKER+N FEVK+ L
Sbjct: 1043 FADVLEAIKDRGVAVAVASPDDVDAANKERANLFEVKKAL 1082


>ref|XP_004296078.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Fragaria vesca subsp. vesca]
          Length = 1073

 Score = 1647 bits (4266), Expect = 0.0
 Identities = 807/1000 (80%), Positives = 900/1000 (90%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+ATP      + SG  D+VAEKLGFEKV+EEFI ECKSKA+L++HKKTGA+++SVSND
Sbjct: 74   RAVATPLTPSPSESSGVSDEVAEKLGFEKVTEEFIGECKSKALLFRHKKTGAQMISVSND 133

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD
Sbjct: 134  DENKVFGIVFRTPPNDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 193

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDIS+KGVVF
Sbjct: 194  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISYKGVVF 253

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR +QQALFPDNTYGVDSGGDPKVIPKLT+EEFK FHRKYYHPSNA
Sbjct: 254  NEMKGVYSQPDNILGRIAQQALFPDNTYGVDSGGDPKVIPKLTYEEFKEFHRKYYHPSNA 313

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYLDMFD SSAP+ES+++ QKLF+EPVRI E YPAGEGGDLK
Sbjct: 314  RIWFYGDDDPTERLRILSEYLDMFDASSAPNESRVQTQKLFSEPVRISETYPAGEGGDLK 373

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KK MVC+NWLLS+KPLD+ETE          LGTPASPLR+ILLESGLGEAI+GGG+EDE
Sbjct: 374  KKDMVCINWLLSEKPLDLETELALGFLDHLMLGTPASPLRKILLESGLGEAIIGGGVEDE 433

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVS+DDI K+EEL+++ LQ LA+EGFD  AVEAS+NTIEFSLRENNTGS
Sbjct: 434  LLQPQFSIGLKGVSQDDIPKIEELVMSTLQNLADEGFDTAAVEASMNTIEFSLRENNTGS 493

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRS+GKWIYD+DPF+PLKYEKPLLALKARI EEG KAVF PLIEKFILNNPH
Sbjct: 494  FPRGLSLMLRSMGKWIYDMDPFQPLKYEKPLLALKARIEEEGSKAVFSPLIEKFILNNPH 553

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RV +EMQPDP KAS+DEAAE+EILEKVKA MTE DLAELARAT++L+LKQETPDPPEALR
Sbjct: 554  RVVVEMQPDPEKASRDEAAEKEILEKVKAGMTEEDLAELARATQDLKLKQETPDPPEALR 613

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIPK+PI IP EVG+INGV+ LQHDLFTNDVLY EVVFDM+  K EL+PLVP
Sbjct: 614  SVPSLSLQDIPKEPIAIPTEVGDINGVKILQHDLFTNDVLYTEVVFDMSLPKQELLPLVP 673

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYP TSSVRGK+D  SH+IVRGK+M+GRA
Sbjct: 674  LFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPMTSSVRGKKDACSHIIVRGKAMAGRA 733

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            +DLF+L+NCIL +VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWI+
Sbjct: 734  DDLFHLMNCILQEVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNVAGWIS 793

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG SYLEFL+ LE+KV+ DW  ISSSLEEIRKSLLS++GCL+N+TAEGK LT++EKF
Sbjct: 794  EQMGGFSYLEFLQDLEQKVDNDWEKISSSLEEIRKSLLSREGCLINMTAEGKNLTNSEKF 853

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            V KFLD           ++NARL   NEA+VIPTQVNY+GKAANIY+TGYQLNGSAYVIS
Sbjct: 854  VGKFLDLLPSKSPLTRTTWNARLPSTNEALVIPTQVNYVGKAANIYDTGYQLNGSAYVIS 913

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            KYISNTWLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL++YDGT  FLR+L+
Sbjct: 914  KYISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTGEFLRQLD 973

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +D+++LTK+IIGTIGDVD+YQLPDAKGYSS++R LLGV++EERQ RREEILS+SL DFK 
Sbjct: 974  MDEETLTKSIIGTIGDVDSYQLPDAKGYSSLMRHLLGVSDEERQIRREEILSTSLKDFKE 1033

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA AI+ VK+KG             A KERSN FEVK+ L
Sbjct: 1034 FANAIDEVKDKGVSVAVASPDDVDAAQKERSNLFEVKKAL 1073


>ref|XP_006384425.1| hypothetical protein POPTR_0004s14960g [Populus trichocarpa]
            gi|550341043|gb|ERP62222.1| hypothetical protein
            POPTR_0004s14960g [Populus trichocarpa]
          Length = 1091

 Score = 1646 bits (4263), Expect = 0.0
 Identities = 813/998 (81%), Positives = 900/998 (90%), Gaps = 4/998 (0%)
 Frame = -3

Query: 3442 SKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSNDDENKVF 3263
            S   SPD S   D+VA K GFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSNDDENKVF
Sbjct: 94   STQYSPDVSNVSDEVAAKYGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSNDDENKVF 153

Query: 3262 GVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPV 3083
            G+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPDRTCYPV
Sbjct: 154  GIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPV 213

Query: 3082 ASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKG-VVFNEMKG 2906
            ASTN KDFYNLVDVYLDAVFFPKCVED+QTFQQEGWH+ELNDPSE+IS+KG VVFNEMKG
Sbjct: 214  ASTNTKDFYNLVDVYLDAVFFPKCVEDYQTFQQEGWHFELNDPSEEISYKGCVVFNEMKG 273

Query: 2905 VYSQPDNILGRSSQQALFPD---NTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNARI 2735
            VYSQPDNILGR++QQA  P    NTYGVDSGGDPKVIP+LTFE+FK FH KYYHPSNARI
Sbjct: 274  VYSQPDNILGRTAQQASSPISNYNTYGVDSGGDPKVIPQLTFEQFKEFHGKYYHPSNARI 333

Query: 2734 WFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLKKK 2555
            WFYGDDDP+ERLRILSEYLDMFD SSAP+ES++E QKLF+ PVRIIEKYPAG+GGDLKKK
Sbjct: 334  WFYGDDDPTERLRILSEYLDMFDASSAPNESRVEQQKLFSAPVRIIEKYPAGDGGDLKKK 393

Query: 2554 HMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDELL 2375
            HMVCLNWLL+DKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGGIEDELL
Sbjct: 394  HMVCLNWLLADKPLDLETELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGIEDELL 453

Query: 2374 QPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGSFP 2195
            QPQFS+GLKGV E+DIQKVEEL+++ L+KLAEEGF+ EAVEAS+NTIEFSLRENNTGSFP
Sbjct: 454  QPQFSIGLKGVFEEDIQKVEELVMSTLKKLAEEGFETEAVEASMNTIEFSLRENNTGSFP 513

Query: 2194 RGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPHRV 2015
            RGLSLMLRSI KWIYD++PFEPLKYEKPL+ LKARI+EEG KAVF PLIEKFILNNPHRV
Sbjct: 514  RGLSLMLRSISKWIYDMNPFEPLKYEKPLMDLKARIAEEGYKAVFSPLIEKFILNNPHRV 573

Query: 2014 TIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALRSV 1835
            T+EMQPDP KAS DEAAEREILEKVKASMTE DLAELARAT+EL+LKQETPDPPEALRSV
Sbjct: 574  TVEMQPDPEKASHDEAAEREILEKVKASMTEEDLAELARATQELKLKQETPDPPEALRSV 633

Query: 1834 PSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVPLF 1655
            PSL L DIPK+PI +P EVG+INGV+ L+HDLFTNDVLYAE+VF+M  LK EL+PLVPLF
Sbjct: 634  PSLFLCDIPKEPIHVPTEVGDINGVKVLKHDLFTNDVLYAEIVFNMRSLKQELLPLVPLF 693

Query: 1654 CQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRAED 1475
            CQSLLEMGTKD+ FVQLNQLIGRKTGGIS+YPFTSSVRG+EDP SH++ RGK+M+GR ED
Sbjct: 694  CQSLLEMGTKDLTFVQLNQLIGRKTGGISLYPFTSSVRGREDPCSHIVARGKAMAGRVED 753

Query: 1474 LFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIAEQ 1295
            LFNLVNC+L +VQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWI+EQ
Sbjct: 754  LFNLVNCVLQEVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNVAGWISEQ 813

Query: 1294 MGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKFVS 1115
            MGG+SYLEFL++LE++V++DW  +SSSLEEIR SL SK GCL+N+TA+GK LT++EK+VS
Sbjct: 814  MGGVSYLEFLKALEKRVDQDWAGVSSSLEEIRMSLFSKNGCLINMTADGKNLTNSEKYVS 873

Query: 1114 KFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVISKY 935
            KFLD           ++NARL P NEAIVIPTQVNY+GKAANIY+TGYQLNGSAYVISKY
Sbjct: 874  KFLDLLPSKSSVEAAAWNARLSPGNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKY 933

Query: 934  ISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELELD 755
            ISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNLLKTL+VYDG+  FLRELE+D
Sbjct: 934  ISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGSGAFLRELEMD 993

Query: 754  DDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKAFA 575
            DD+L KAIIGTIGDVD+YQL DAKGYSS+LR+LLG+TEEERQKRREEILS+SL DFK F 
Sbjct: 994  DDTLAKAIIGTIGDVDSYQLADAKGYSSLLRYLLGITEEERQKRREEILSTSLKDFKEFG 1053

Query: 574  EAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            E IEAVK+KG             ANKERSN+F+VK+ L
Sbjct: 1054 EVIEAVKDKGVSVVVASPEDVHAANKERSNYFDVKKAL 1091


>ref|XP_006487082.1| PREDICTED: LOW QUALITY PROTEIN: presequence protease 2,
            chloroplastic/mitochondrial-like [Citrus sinensis]
          Length = 1082

 Score = 1644 bits (4256), Expect = 0.0
 Identities = 802/1000 (80%), Positives = 905/1000 (90%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+PS   SP+ +   ++VAEKLGFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSND
Sbjct: 83   RAVASPSTPSSPEVAEVSNEVAEKLGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSND 142

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD
Sbjct: 143  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 202

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWH++L++PSEDI++KGVVF
Sbjct: 203  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHFKLDNPSEDITYKGVVF 262

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR++QQALFPDN YGVDSGGDPKVIPKLTFEEFK FHRKYYHPSNA
Sbjct: 263  NEMKGVYSQPDNILGRAAQQALFPDNAYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNA 322

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYL+MF+ SSAP+ES +E QKLF+EPVRIIEKYPAG+ GD+K
Sbjct: 323  RIWFYGDDDPNERLRILSEYLNMFEASSAPNESIVEKQKLFSEPVRIIEKYPAGDAGDIK 382

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KK+MVCLNWLLSDKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGGIEDE
Sbjct: 383  KKNMVCLNWLLSDKPLDLETELALGFLDHLMLGTPASPLRKILLESGLGDAIVGGGIEDE 442

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLK VSEDDIQ VEELI++ L+KLA+EGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 443  LLQPQFSIGLKNVSEDDIQTVEELIMDTLKKLADEGFDSDAVEASMNTIEFSLRENNTGS 502

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRS+GKWIYD++PFEPLKYEKPL+ALKAR++EEG KAVF PLIEK+ILNNPH
Sbjct: 503  FPRGLSLMLRSMGKWIYDMNPFEPLKYEKPLMALKARLAEEGSKAVFSPLIEKYILNNPH 562

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VT+EMQPDP KAS+DEAAE+EIL KVK+SMT+ DLAELARAT ELRLKQETPDPPEALR
Sbjct: 563  CVTVEMQPDPEKASRDEAAEKEILAKVKSSMTKEDLAELARATEELRLKQETPDPPEALR 622

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY EVVFDM+ LK EL+PL+P
Sbjct: 623  SVPSLSLRDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTEVVFDMSSLKQELLPLIP 682

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSL EMGTKD+ FVQLNQLIGRKTGGISVYPFTSS+RGKEDP   ++VRGK+M+G+A
Sbjct: 683  LFCQSLKEMGTKDLSFVQLNQLIGRKTGGISVYPFTSSIRGKEDPCCCMVVRGKAMAGQA 742

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNL NC+L +VQ TDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLN AGWI+
Sbjct: 743  EDLFNLFNCVLQEVQLTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNTAGWIS 802

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL++LEEKV++DW  ISSSLEEIR+S LS++GCL+N+TA+GK L ++E+F
Sbjct: 803  EQMGGVSYLEFLQALEEKVDQDWAGISSSLEEIRRSFLSREGCLINMTADGKNLKNSERF 862

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            V KFLD            + A L   NEAIVIPTQVNY+GKAANI+ETGY+LNGSAYVIS
Sbjct: 863  VGKFLDMLPTNSPVERVKWKAHLPSANEAIVIPTQVNYVGKAANIFETGYKLNGSAYVIS 922

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISN WLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL++YDGT +FLRELE
Sbjct: 923  KHISNVWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTVDFLRELE 982

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +DDD+LTKAIIGTIGDVDAYQLPDAKGYSS+LR LLG+TEEERQ+RREEILS+SL DFK 
Sbjct: 983  MDDDTLTKAIIGTIGDVDAYQLPDAKGYSSLLRHLLGITEEERQRRREEILSTSLKDFKE 1042

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+ +EA+K++G             ANKER+N FEVK+ L
Sbjct: 1043 FADVLEAIKDRGVAVAVASPDDVDAANKERANLFEVKKAL 1082


>ref|XP_007042385.1| Presequence protease 2 isoform 2 [Theobroma cacao]
            gi|508706320|gb|EOX98216.1| Presequence protease 2
            isoform 2 [Theobroma cacao]
          Length = 1040

 Score = 1635 bits (4235), Expect = 0.0
 Identities = 792/953 (83%), Positives = 890/953 (93%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+P++  SPD +G  D+VAEKLGFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSND
Sbjct: 87   RAVASPTQP-SPDIAGVEDEVAEKLGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSND 145

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 146  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 205

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTNAKDFYNLVDVYLDAVFFPKC+EDFQTFQQEGWHYELND SEDI++KGVVF
Sbjct: 206  RTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDITYKGVVF 265

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDN+LGR++QQALFPDNTYGVDSGGDP+VIPKLT+EEFK FHRKYYHPSNA
Sbjct: 266  NEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRKYYHPSNA 325

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP ERLRILSEYLDMFD S+APDESK+EPQKLF+EPVR +EKYP GEGGDLK
Sbjct: 326  RIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPVGEGGDLK 385

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVCLNWLLSDKPLD++TE          LGTPASPLR++LLESGLG+AI+GGG+EDE
Sbjct: 386  KKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAIIGGGVEDE 445

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSEDDI KVEELI+++L+KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 446  LLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSLRENNTGS 505

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL+ LKARI+EEG KAVF PLIEKFILNNPH
Sbjct: 506  FPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEKFILNNPH 565

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VTIEMQPDP KAS+DEAAE+EIL KVKASMTE DLAELARAT+EL+LKQETPDPPEALR
Sbjct: 566  CVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETPDPPEALR 625

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL+DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY +VVFDM+ LK EL+PLVP
Sbjct: 626  SVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTDVVFDMSSLKRELLPLVP 685

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS++GKEDP SH+IVRGKSM+G A
Sbjct: 686  LFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFTSSIQGKEDPCSHIIVRGKSMAGCA 745

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            +DLFNL+NC++ +VQFTDQQRFKQFVSQSKARME+RLRGSGHGIAAARMDAKLN +GWI+
Sbjct: 746  DDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESRLRGSGHGIAAARMDAKLNVSGWIS 805

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL+ LEE+V+ DW  ISSSLEEIRKSLLS++GCL+N+TA+GK L++TEK 
Sbjct: 806  EQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKSLLSREGCLINMTADGKNLSNTEKL 865

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            VSKFLD           S++ARL   NEAIVIPTQVNY+GKAAN+Y+ GYQLNGSAYVIS
Sbjct: 866  VSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQVNYVGKAANLYDGGYQLNGSAYVIS 925

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFC+FDTHSGVF++LSYRDPNLL+TL++YDGT +FLRELE
Sbjct: 926  KHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLSYRDPNLLETLDIYDGTGDFLRELE 985

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSS 602
            +DDD+LTKAIIGT+GDVDAYQLPDAKGYSS++R+LLG+TEEERQ+RREEILS+
Sbjct: 986  MDDDTLTKAIIGTVGDVDAYQLPDAKGYSSLVRYLLGITEEERQRRREEILST 1038


>ref|XP_004136986.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cucumis sativus]
          Length = 1084

 Score = 1632 bits (4227), Expect = 0.0
 Identities = 798/1000 (79%), Positives = 895/1000 (89%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+P  H  P+F+   D+VAEKLGFEKVSEEFI ECKSKAVL++HKKTGAEVMSVSND
Sbjct: 85   RAVASPPAHSPPEFAEVSDEVAEKLGFEKVSEEFIGECKSKAVLFRHKKTGAEVMSVSND 144

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYP+KEPFVELLKGSL+TFLNAFTYPD
Sbjct: 145  DENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPVKEPFVELLKGSLHTFLNAFTYPD 204

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDF+TFQQEGWHYELNDPSEDIS+KGVVF
Sbjct: 205  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFKTFQQEGWHYELNDPSEDISYKGVVF 264

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR +QQALFPDNTYGVDSGGDP+VIPKLTFEEFK FH K+YHP NA
Sbjct: 265  NEMKGVYSQPDNILGRVTQQALFPDNTYGVDSGGDPRVIPKLTFEEFKEFHSKFYHPGNA 324

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP ERLRIL +YLDMFD S   D+SKI  Q+LF+EPVRI+EKYP+G+GGDLK
Sbjct: 325  RIWFYGDDDPVERLRILKDYLDMFDASPVSDQSKIGQQRLFSEPVRIVEKYPSGDGGDLK 384

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVC+NWLLS+KPLD+ETE          LGTPASPLR+ILLESGLGEAI+GGGIEDE
Sbjct: 385  KKHMVCVNWLLSEKPLDLETELALGFLDHLMLGTPASPLRKILLESGLGEAILGGGIEDE 444

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGV +DDI KVEELILN  +KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 445  LLQPQFSIGLKGVLDDDIPKVEELILNTFKKLAEEGFDNDAVEASMNTIEFSLRENNTGS 504

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD++PFEPLKYE+PL ALKARI+ EGPKAVF PLIEKFILNNPH
Sbjct: 505  FPRGLSLMLRSIGKWIYDMNPFEPLKYEEPLKALKARIAAEGPKAVFSPLIEKFILNNPH 564

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVTIEMQPDP KAS+DEA E+EIL+KVK SMTE DLAELARAT+ELRLKQETPDPPEAL+
Sbjct: 565  RVTIEMQPDPEKASRDEATEKEILQKVKESMTEEDLAELARATQELRLKQETPDPPEALK 624

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
             VP L L DIPK+P ++P E+G +NGV  LQHDLFTNDVLY+EVVFDM+ LK EL+PLVP
Sbjct: 625  CVPCLCLEDIPKEPTRVPTEIGNVNGVTVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVP 684

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS+RG +   +H++VRGK+MSG A
Sbjct: 685  LFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFTSSIRGNDKACTHMVVRGKAMSGCA 744

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNL+NCIL +VQFTDQQRFKQFVSQSK+RMENRLRGSGHGIAAARMDAKLN+AGWI+
Sbjct: 745  EDLFNLMNCILQEVQFTDQQRFKQFVSQSKSRMENRLRGSGHGIAAARMDAKLNSAGWIS 804

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SY+EFL++LEEKV+++W  ISSSLEEIR+SLLS+K CLVN+TA+GK L  +EKF
Sbjct: 805  EQMGGLSYMEFLQTLEEKVDQNWTEISSSLEEIRQSLLSRKNCLVNITADGKNLIKSEKF 864

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            + KFLD           ++NARL   NEAIVIPTQVNY+GKAANIYETGYQL+GSAYVIS
Sbjct: 865  IGKFLDLLPNQPIIKNSTWNARLSSDNEAIVIPTQVNYVGKAANIYETGYQLDGSAYVIS 924

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL+VYDGT +FLRELE
Sbjct: 925  KFISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDVYDGTVDFLRELE 984

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            LDDD+L KAIIGTIGDVD+YQLPDAKGYSS+LR+LLG+TEEERQ+RREEILS+SL DFK 
Sbjct: 985  LDDDTLAKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGITEEERQRRREEILSTSLKDFKN 1044

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+A+EAV+ KG             A+ ER  FF+VK+ L
Sbjct: 1045 FADALEAVRNKGVVVSVASPEDVETAHGERPGFFQVKKAL 1084


>ref|XP_004159889.1| PREDICTED: LOW QUALITY PROTEIN: presequence protease 1,
            chloroplastic/mitochondrial-like [Cucumis sativus]
          Length = 1084

 Score = 1630 bits (4221), Expect = 0.0
 Identities = 797/1000 (79%), Positives = 894/1000 (89%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+P  H  P+F+   D+VAEKLGFEKVSEEFI ECKSKAVL++HKKTGAEVMSVSND
Sbjct: 85   RAVASPPAHSPPEFAEVSDEVAEKLGFEKVSEEFIGECKSKAVLFRHKKTGAEVMSVSND 144

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYP+KEPFVELLKGSL+TFLNAFTYPD
Sbjct: 145  DENKVFGIVFRTPPSDSTGIPHILEHSVLCGSRKYPVKEPFVELLKGSLHTFLNAFTYPD 204

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDF+TFQQEGWHYELNDPSEDIS+KGVVF
Sbjct: 205  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFKTFQQEGWHYELNDPSEDISYKGVVF 264

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR +QQALFPDNTYGVDSGGDP+VIPKLTFEEFK FH K+YHP NA
Sbjct: 265  NEMKGVYSQPDNILGRVTQQALFPDNTYGVDSGGDPRVIPKLTFEEFKEFHSKFYHPGNA 324

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP ERLRIL +YLDMFD S   D+SKI  Q+LF+EPVRI+EKYP+G+GGDL 
Sbjct: 325  RIWFYGDDDPVERLRILKDYLDMFDASPVSDQSKIGQQRLFSEPVRIVEKYPSGDGGDLX 384

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVC+NWLLS+KPLD+ETE          LGTPASPLR+ILLESGLGEAI+GGGIEDE
Sbjct: 385  KKHMVCVNWLLSEKPLDLETELALGFLDHLMLGTPASPLRKILLESGLGEAILGGGIEDE 444

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGV +DDI KVEELILN  +KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 445  LLQPQFSIGLKGVLDDDIPKVEELILNTFKKLAEEGFDNDAVEASMNTIEFSLRENNTGS 504

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD++PFEPLKYE+PL ALKARI+ EGPKAVF PLIEKFILNNPH
Sbjct: 505  FPRGLSLMLRSIGKWIYDMNPFEPLKYEEPLKALKARIAAEGPKAVFSPLIEKFILNNPH 564

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVTIEMQPDP KAS+DEA E+EIL+KVK SMTE DLAELARAT+ELRLKQETPDPPEAL+
Sbjct: 565  RVTIEMQPDPEKASRDEATEKEILQKVKESMTEEDLAELARATQELRLKQETPDPPEALK 624

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
             VP L L DIPK+P ++P E+G +NGV  LQHDLFTNDVLY+EVVFDM+ LK EL+PLVP
Sbjct: 625  CVPCLCLEDIPKEPTRVPTEIGNVNGVTVLQHDLFTNDVLYSEVVFDMSSLKQELLPLVP 684

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS+RG +   +H++VRGK+MSG A
Sbjct: 685  LFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFTSSIRGNDKACTHMVVRGKAMSGCA 744

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNL+NCIL +VQFTDQQRFKQFVSQSK+RMENRLRGSGHGIAAARMDAKLN+AGWI+
Sbjct: 745  EDLFNLMNCILQEVQFTDQQRFKQFVSQSKSRMENRLRGSGHGIAAARMDAKLNSAGWIS 804

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SY+EFL++LEEKV+++W  ISSSLEEIR+SLLS+K CLVN+TA+GK L  +EKF
Sbjct: 805  EQMGGLSYMEFLQTLEEKVDQNWTEISSSLEEIRQSLLSRKNCLVNITADGKNLIKSEKF 864

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            + KFLD           ++NARL   NEAIVIPTQVNY+GKAANIYETGYQL+GSAYVIS
Sbjct: 865  IGKFLDLLPNQPIIKNSTWNARLSSDNEAIVIPTQVNYVGKAANIYETGYQLDGSAYVIS 924

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL+VYDGT +FLRELE
Sbjct: 925  KFISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDVYDGTVDFLRELE 984

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            LDDD+L KAIIGTIGDVD+YQLPDAKGYSS+LR+LLG+TEEERQ+RREEILS+SL DFK 
Sbjct: 985  LDDDTLAKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGITEEERQRRREEILSTSLKDFKN 1044

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+A+EAV+ KG             A+ ER  FF+VK+ L
Sbjct: 1045 FADALEAVRNKGVVVSVASPEDVETAHGERPGFFQVKKAL 1084


>ref|XP_003517606.1| PREDICTED: presequence protease 2, chloroplastic/mitochondrial
            [Glycine max]
          Length = 1078

 Score = 1623 bits (4203), Expect = 0.0
 Identities = 792/1000 (79%), Positives = 901/1000 (90%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+ +PS   S  F+  +D+VA KLGFEKVSEEFI ECKSKAVL++H KTGA+VMSVSND
Sbjct: 81   RAVLSPSP--SSGFAEVNDEVALKLGFEKVSEEFIPECKSKAVLFRHIKTGAQVMSVSND 138

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            D+NKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 139  DDNKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 198

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTNAKDFYNLVDVYLDAVFFP+CVEDFQ FQQEGWH+ELNDPSEDI++KGVVF
Sbjct: 199  RTCYPVASTNAKDFYNLVDVYLDAVFFPRCVEDFQIFQQEGWHFELNDPSEDITYKGVVF 258

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR++QQALFPD TYGVDSGGDP+VIPKLTFEEFK FHRKYYHPSN+
Sbjct: 259  NEMKGVYSQPDNILGRAAQQALFPDTTYGVDSGGDPRVIPKLTFEEFKEFHRKYYHPSNS 318

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYLD+FD S A  ES++EPQ LF++PVRI+E YPAGEGGDLK
Sbjct: 319  RIWFYGDDDPNERLRILSEYLDLFDSSLASHESRVEPQTLFSKPVRIVETYPAGEGGDLK 378

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVCLNWLLSDKPLD+ETE          LGTPASPLR+ILLES LG+AIVGGG+EDE
Sbjct: 379  KKHMVCLNWLLSDKPLDLETELTLGFLNHLLLGTPASPLRKILLESRLGDAIVGGGVEDE 438

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+G+KGVSEDDI KVEEL+ + L+KLAEEGFD +A+EAS+NTIEFSLRENNTGS
Sbjct: 439  LLQPQFSIGMKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSLRENNTGS 498

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLML+SIGKWIYD++PFEPLKYEKPL  LK+RI++EG K+VF PLIEKFILNNPH
Sbjct: 499  FPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQDLKSRIAKEGSKSVFSPLIEKFILNNPH 558

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            +VT+EMQPDP KA++DE AE++IL+KVKASMT  DLAELARAT ELRLKQETPDPPEAL+
Sbjct: 559  QVTVEMQPDPEKAARDEVAEKQILQKVKASMTTEDLAELARATHELRLKQETPDPPEALK 618

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            +VPSLSL DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY E+VF+M  LK EL+PLVP
Sbjct: 619  TVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTEIVFNMKSLKQELLPLVP 678

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSSVRGKEDP SH+++RGK+M+G  
Sbjct: 679  LFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPFTSSVRGKEDPCSHMVIRGKAMAGHI 738

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDL++LVN +L DVQFTDQQRFKQFVSQS+ARMENRLRGSGHGIAAARMDAKLNAAGW++
Sbjct: 739  EDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENRLRGSGHGIAAARMDAKLNAAGWMS 798

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            E+MGG+SYLEFLR+LEE+V++DW  ISSSLEEIRKS+ SK+GCL+N+TA+ K L  TEK 
Sbjct: 799  EKMGGLSYLEFLRTLEERVDQDWADISSSLEEIRKSIFSKQGCLINVTADRKNLAKTEKV 858

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            +SKF+D           ++N RL   NEAIVIPTQVNYIGKAANIY+TGY+LNGSAYVIS
Sbjct: 859  LSKFVDLLPTSSPIATTTWNVRLPLTNEAIVIPTQVNYIGKAANIYDTGYRLNGSAYVIS 918

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNLLKTL+VYDGT +FLREL+
Sbjct: 919  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFLRELQ 978

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +DDD+LTKAIIGTIGDVDAYQLPDAKGYSSMLR+LLG+TEEERQ+RREEILS+SL DFK 
Sbjct: 979  IDDDTLTKAIIGTIGDVDAYQLPDAKGYSSMLRYLLGITEEERQRRREEILSTSLKDFKI 1038

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            F +A+EAVK+KG             ANK+R +FF+VK+ L
Sbjct: 1039 FMDAMEAVKDKGVVVAVASPEDVDTANKDRPDFFQVKKAL 1078


>ref|XP_006346464.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum tuberosum]
          Length = 1072

 Score = 1618 bits (4190), Expect = 0.0
 Identities = 788/1000 (78%), Positives = 894/1000 (89%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +AIAT S   S +F GA D+VAEK GFEKVSE+FIDECKSKAVLYKHKKTGAEVMSVSND
Sbjct: 73   RAIATSSPQSSQEFLGADDEVAEKFGFEKVSEQFIDECKSKAVLYKHKKTGAEVMSVSND 132

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD
Sbjct: 133  DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 192

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPS+DI+FKGVVF
Sbjct: 193  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSDDITFKGVVF 252

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDN+LGR+SQQALFPDNTYGVDSGGDP+VIP L+FEEFK FHRK+YHPSNA
Sbjct: 253  NEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVIPSLSFEEFKEFHRKFYHPSNA 312

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYL+MFD SSAP ES++EPQ+LF+EPVRI+EKYP GE GDLK
Sbjct: 313  RIWFYGDDDPNERLRILSEYLNMFDASSAPQESRVEPQRLFSEPVRIVEKYPVGEDGDLK 372

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVC+NWLLSDKPLD+ETE          LGTPASPLR+ILLESG G+AIVGGGIEDE
Sbjct: 373  KKHMVCVNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGFGDAIVGGGIEDE 432

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSE++IQKVEELI++ L+ L E+GFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 433  LLQPQFSIGLKGVSEENIQKVEELIMSTLEGLVEKGFDLDAVEASMNTIEFSLRENNTGS 492

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGL+LMLRSIGKW+YD+DPFEPLKY+KPL ALKARI++EG KAVF PL++++IL NPH
Sbjct: 493  FPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKARIAKEGSKAVFAPLMDQYILRNPH 552

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVT+EMQPDP KAS++E  E+E L+KVKASMT+ DLAELARAT ELRLKQETPDPPEAL+
Sbjct: 553  RVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDLAELARATHELRLKQETPDPPEALK 612

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIP++P+ +P E+G+INGV+ L+HDLFTNDVLYAEVVF+++ LK EL+PLVP
Sbjct: 613  SVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFTNDVLYAEVVFNLSSLKQELLPLVP 672

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+DFVQLNQLIGRKTGG+SVYPFTSSV GK +P S +IVRGK+MS R 
Sbjct: 673  LFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFTSSVHGKVEPCSKIIVRGKAMSQRT 732

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLF L+N +L DVQ  DQ+RFKQFVSQS++RMENRLRGSGH IAAARM AKLN AGWI+
Sbjct: 733  EDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENRLRGSGHSIAAARMGAKLNVAGWIS 792

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL+ LE++VEKDW  ISSSLEEIRKSLLSK GCL+NLTA+GK L + EK 
Sbjct: 793  EQMGGVSYLEFLKVLEDQVEKDWPQISSSLEEIRKSLLSKNGCLINLTADGKNLNNAEKH 852

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            +S+FLD           ++NA+L   NEA V+PTQVNY+GKAAN+YE GY+L GSAYVIS
Sbjct: 853  ISEFLDLLPSTSLVESAAWNAQLSRSNEAFVVPTQVNYVGKAANLYEAGYELKGSAYVIS 912

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
             YISNTWLWDRVRVSGGAYGGFC FD+HSGVFS+LSYRDPNLLKTL+VYDGT++FL+ELE
Sbjct: 913  NYISNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLSYRDPNLLKTLDVYDGTSSFLKELE 972

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +DDD+LTKAIIGTIGDVD+YQLPDAKGYSS+LR+LLGVT+EERQ+RREEILS+SL DF+ 
Sbjct: 973  MDDDALTKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGVTDEERQRRREEILSTSLEDFRK 1032

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            F + +EAVK+KG             ANKERSNF EVK+ L
Sbjct: 1033 FGDVMEAVKDKGVVVAVASPDDVEAANKERSNFLEVKKAL 1072


>ref|XP_004230817.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Solanum lycopersicum]
          Length = 1072

 Score = 1618 bits (4190), Expect = 0.0
 Identities = 787/1000 (78%), Positives = 895/1000 (89%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +AIAT S   S +F GA D+VAEK GFEKVSE+FIDECKSKAVLYKHKKTGAEVMSVSND
Sbjct: 73   RAIATSSPQSSQEFLGADDEVAEKFGFEKVSEQFIDECKSKAVLYKHKKTGAEVMSVSND 132

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD
Sbjct: 133  DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 192

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPS++I+FKGVVF
Sbjct: 193  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSDEITFKGVVF 252

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDN+LGR+SQQALFPDNTYGVDSGGDP+VIP L+FE+FK FHRK+YHPSNA
Sbjct: 253  NEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVIPSLSFEDFKEFHRKFYHPSNA 312

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP+ERLRILSEYL+MFD SSAP ES++EPQ+LF+EPVRI+EKYP GE GDLK
Sbjct: 313  RIWFYGDDDPNERLRILSEYLNMFDASSAPHESRVEPQRLFSEPVRIVEKYPVGEDGDLK 372

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVC+NWLLSDKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGGIEDE
Sbjct: 373  KKHMVCVNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGLGDAIVGGGIEDE 432

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSE++IQKVEELI++ LQ LAE+GFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 433  LLQPQFSIGLKGVSEENIQKVEELIMSTLQGLAEKGFDSDAVEASMNTIEFSLRENNTGS 492

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGL+LMLRSIGKW+YD+DPFEPLKY+KPL ALKARI++EG KAVF PL++++IL NPH
Sbjct: 493  FPRGLALMLRSIGKWVYDMDPFEPLKYQKPLEALKARIAKEGSKAVFAPLMDQYILRNPH 552

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVT+EMQPDP KAS++E  E+E L+KVKASMT+ DLAELARAT ELRLKQETPDPPEAL+
Sbjct: 553  RVTVEMQPDPEKASREEQIEKETLDKVKASMTQEDLAELARATHELRLKQETPDPPEALK 612

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL DIP++P+ +P E+G+INGV+ L+HDLFTNDVLYAEVVF+++ LK EL+PLVP
Sbjct: 613  SVPSLSLQDIPREPVLVPTEIGDINGVKVLKHDLFTNDVLYAEVVFNLSSLKQELLPLVP 672

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+DFVQLNQLIGRKTGG+SVYPFTSSV GK +P S +IVRGK+MS R 
Sbjct: 673  LFCQSLLEMGTKDLDFVQLNQLIGRKTGGLSVYPFTSSVHGKVEPCSKIIVRGKAMSQRT 732

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLF L+N +L DVQ  DQ+RFKQFVSQS++RMENRLRGSGH +AAARM AKLN AGWI+
Sbjct: 733  EDLFYLINRVLQDVQLDDQKRFKQFVSQSRSRMENRLRGSGHSVAAARMGAKLNVAGWIS 792

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL+ LE++VEKDW  ISSSLEEIRKSLLSK GCL+NLTA+GK L + EK 
Sbjct: 793  EQMGGVSYLEFLKVLEDQVEKDWSQISSSLEEIRKSLLSKNGCLINLTADGKNLNNAEKH 852

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            +SKFLD           ++NA+L   NEA V+PTQVNY+GKAAN+YE GY+L GSAYVIS
Sbjct: 853  ISKFLDLLPSTSLVEPAAWNAQLSRSNEAFVVPTQVNYVGKAANLYEAGYELKGSAYVIS 912

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
             Y SNTWLWDRVRVSGGAYGGFC FD+HSGVFS+LSYRDPNLLKTL+VYDGT++FL+ELE
Sbjct: 913  NYTSNTWLWDRVRVSGGAYGGFCSFDSHSGVFSFLSYRDPNLLKTLDVYDGTSSFLKELE 972

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +D+D+LTKAIIGTIGDVD+YQLPDAKGYSS+LR+LLGVT+EERQ+RREEILS+SL DF+ 
Sbjct: 973  MDNDALTKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGVTDEERQRRREEILSTSLEDFRK 1032

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            F + +EAVK+KG             ANKERSNF EVK+ L
Sbjct: 1033 FGDVMEAVKDKGVVVAVASPDDVEAANKERSNFLEVKKAL 1072


>ref|XP_002313107.1| hypothetical protein POPTR_0009s10650g [Populus trichocarpa]
            gi|222849515|gb|EEE87062.1| hypothetical protein
            POPTR_0009s10650g [Populus trichocarpa]
          Length = 1006

 Score = 1617 bits (4186), Expect = 0.0
 Identities = 799/994 (80%), Positives = 890/994 (89%)
 Frame = -3

Query: 3442 SKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSNDDENKVF 3263
            S+H  P  S     VA K GFEKVSE+FI ECKS+AVL KHKKTGAEVMSVSNDDENKVF
Sbjct: 18   SRHRHPMLS--RSPVAAKYGFEKVSEDFIGECKSRAVLLKHKKTGAEVMSVSNDDENKVF 75

Query: 3262 GVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPV 3083
            G+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPDRTCYPV
Sbjct: 76   GIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPV 135

Query: 3082 ASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVFNEMKGV 2903
            ASTN KDFYNLVDVYLDAVFFPKCVED  TFQQEGWH ELN+PSE+IS+KGVVFNEMKGV
Sbjct: 136  ASTNTKDFYNLVDVYLDAVFFPKCVEDHHTFQQEGWHLELNNPSEEISYKGVVFNEMKGV 195

Query: 2902 YSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNARIWFYG 2723
            YSQPDNILGR++Q A   +NTYGVDSGGDPKVIPKLTFE+FK FH KYYHPSNARIWFYG
Sbjct: 196  YSQPDNILGRTAQLA---NNTYGVDSGGDPKVIPKLTFEQFKEFHGKYYHPSNARIWFYG 252

Query: 2722 DDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLKKKHMVC 2543
            DDDP+ERLRILSEYLDMFD SSA +ES+IE QK F+EPVRI+EKYPAG+G DLKKKHMVC
Sbjct: 253  DDDPTERLRILSEYLDMFDASSASNESRIEQQKFFSEPVRIVEKYPAGDGSDLKKKHMVC 312

Query: 2542 LNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDELLQPQF 2363
            LNWLL+DKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGG+EDELLQPQF
Sbjct: 313  LNWLLADKPLDLETELTLGFLDHLMLGTPASPLRKILLESGLGDAIVGGGVEDELLQPQF 372

Query: 2362 SVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGSFPRGLS 2183
            S+GLKGVSE+DI+KVEEL+++ L+KLAEEGF+ +AVEAS+NTIEFSLRENNTGSFPRGLS
Sbjct: 373  SIGLKGVSEEDIEKVEELVMSTLKKLAEEGFETDAVEASMNTIEFSLRENNTGSFPRGLS 432

Query: 2182 LMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPHRVTIEM 2003
            LML+SI KWIYD+DPFEPLKYEKPL+ALKARI+EEG KAVF PLIEKFILNN HRVTIEM
Sbjct: 433  LMLQSISKWIYDMDPFEPLKYEKPLMALKARIAEEGSKAVFSPLIEKFILNNLHRVTIEM 492

Query: 2002 QPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALRSVPSLS 1823
            QPDP KAS+DEAAEREILEKVKASMTE DLAELARAT+ELRLKQETPDPPEALRSVPSLS
Sbjct: 493  QPDPEKASRDEAAEREILEKVKASMTEEDLAELARATQELRLKQETPDPPEALRSVPSLS 552

Query: 1822 LNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVPLFCQSL 1643
            L DIPK+P+ +P E G+INGV+ L+HDLFTNDVLYAE+VF+M  LK EL+PLVPLFCQSL
Sbjct: 553  LLDIPKEPLHVPTEAGDINGVKVLKHDLFTNDVLYAEIVFNMRSLKQELLPLVPLFCQSL 612

Query: 1642 LEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRAEDLFNL 1463
            LEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS++G+EDP SH+I +GK+M+GR EDLFNL
Sbjct: 613  LEMGTKDLTFVQLNQLIGRKTGGISVYPFTSSIQGREDPCSHIIAQGKAMAGRVEDLFNL 672

Query: 1462 VNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIAEQMGGI 1283
            VNC+L +VQFTDQQRFKQFVSQSKA MENRLRGSGH IAA RMDAKLN  GWI+EQMGG+
Sbjct: 673  VNCVLQEVQFTDQQRFKQFVSQSKAGMENRLRGSGHRIAATRMDAKLNVTGWISEQMGGV 732

Query: 1282 SYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKFVSKFLD 1103
            SYLEFL++LEE+V++DW  +SSSLEEIR SLLSK GCL+N+TA+GK LT++EK+VSKFLD
Sbjct: 733  SYLEFLQALEERVDQDWAGVSSSLEEIRTSLLSKNGCLINMTADGKNLTNSEKYVSKFLD 792

Query: 1102 XXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVISKYISNT 923
                       ++NARL P NEAIVIPTQVNY+GKAANIY+TGYQLNGSAYVISKYISNT
Sbjct: 793  LLPSKSSVEAAAWNARLSPGNEAIVIPTQVNYVGKAANIYDTGYQLNGSAYVISKYISNT 852

Query: 922  WLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELELDDDSL 743
            WLWDRVRVSGGAYGGFCD DTHSGVFS+LSYRDPNLLKTL+VYDGT  FLR+LE+DDD+L
Sbjct: 853  WLWDRVRVSGGAYGGFCDLDTHSGVFSFLSYRDPNLLKTLDVYDGTGAFLRQLEMDDDTL 912

Query: 742  TKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKAFAEAIE 563
            +KAIIGTIGDVD+YQLPDAKGYSS+LR+LLG+TEEERQKRREEILS+SL DFK F E IE
Sbjct: 913  SKAIIGTIGDVDSYQLPDAKGYSSLLRYLLGITEEERQKRREEILSTSLKDFKEFGEVIE 972

Query: 562  AVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            AVK+K              ANKERSN+F+VK+ L
Sbjct: 973  AVKDKWVSVAVASPDDVDDANKERSNYFDVKKAL 1006


>ref|XP_004511282.1| PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like
            [Cicer arietinum]
          Length = 1080

 Score = 1610 bits (4170), Expect = 0.0
 Identities = 786/995 (78%), Positives = 896/995 (90%), Gaps = 5/995 (0%)
 Frame = -3

Query: 3430 SPDFSGA----HDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSNDDENKVF 3263
            SPD SG      D+VA +LGFEKVSEEFI ECKSKAVL++H KTGA+VMSVSN+DENKVF
Sbjct: 87   SPDISGGGEVVKDEVARELGFEKVSEEFITECKSKAVLFRHLKTGAQVMSVSNNDENKVF 146

Query: 3262 GVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPV 3083
            G+VFRTPP DSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPDRTCYPV
Sbjct: 147  GIVFRTPPNDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPDRTCYPV 206

Query: 3082 ASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVFNEMKGV 2903
            ASTN KDFYNLVDVYLDAVFFPKCV+D QTFQQEGWHYELN PSEDI++KGVVFNEMKGV
Sbjct: 207  ASTNTKDFYNLVDVYLDAVFFPKCVDDLQTFQQEGWHYELNHPSEDITYKGVVFNEMKGV 266

Query: 2902 YSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNARIWFYG 2723
            YSQPDNILGR++QQALFPDNTYGVDSGGDP+VIP LTFEEFK FHRKYYHPSN+RIWFYG
Sbjct: 267  YSQPDNILGRAAQQALFPDNTYGVDSGGDPRVIPNLTFEEFKEFHRKYYHPSNSRIWFYG 326

Query: 2722 DDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLKKKHMVC 2543
            DDDP+ERLRILSEYL+MFD SSAP+ESK+EPQKLF++P+RI+E YPAGEGGDLKK HMVC
Sbjct: 327  DDDPNERLRILSEYLNMFDASSAPNESKVEPQKLFSKPIRIVETYPAGEGGDLKK-HMVC 385

Query: 2542 LNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDELLQPQF 2363
            LNWLL+DKPLD+ETE          LGTPASPLR++LLES LG+AIVGGG+EDELLQPQF
Sbjct: 386  LNWLLADKPLDLETELALGFLNHLLLGTPASPLRKVLLESRLGDAIVGGGLEDELLQPQF 445

Query: 2362 SVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGSFPRGLS 2183
            S+G+KGVSEDDI KVEELI++ L+KLAEEGFD +A+EAS+NTIEFSLRENNTGSFPRGLS
Sbjct: 446  SIGMKGVSEDDIHKVEELIMSTLKKLAEEGFDTDAIEASMNTIEFSLRENNTGSFPRGLS 505

Query: 2182 LMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPHRVTIEM 2003
            LML+SIGKWIYD++P EPLKYEKPL  LK++I++EG K+VF PLIEKFILNNPH+VT++M
Sbjct: 506  LMLQSIGKWIYDMNPLEPLKYEKPLQDLKSKIAKEGSKSVFSPLIEKFILNNPHKVTVQM 565

Query: 2002 QPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALRSVPSLS 1823
            QPDP KA++DE  E+++L+K+KASMT  DLAELARAT ELRLKQETPDPPEAL++VPSLS
Sbjct: 566  QPDPEKAARDEETEKQVLQKIKASMTTEDLAELARATHELRLKQETPDPPEALKTVPSLS 625

Query: 1822 LNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVPLFCQSL 1643
            L DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY E+VFDM+ LK EL+PLVPLFCQSL
Sbjct: 626  LQDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTEIVFDMSSLKQELLPLVPLFCQSL 685

Query: 1642 LEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRAEDLFNL 1463
            LEMGTKD+ FVQLNQLIGRKTGGISVYPFTSSV+GKEDP SH+IVRGK+MSGRAEDL++L
Sbjct: 686  LEMGTKDLTFVQLNQLIGRKTGGISVYPFTSSVQGKEDPCSHMIVRGKAMSGRAEDLYDL 745

Query: 1462 VNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIAEQMGGI 1283
            VN +L DVQFTDQQRFKQFVSQS+ARMENRLRGSGHGIAAARMDAKLNAAGW++E+MGG+
Sbjct: 746  VNSVLQDVQFTDQQRFKQFVSQSRARMENRLRGSGHGIAAARMDAKLNAAGWMSEKMGGL 805

Query: 1282 SYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKFVSKFLD 1103
            SYLEFL++LE++V++DW  ISSSLEEIRK++ SK+GCL+N+TA+GK L + +KFVSKF+D
Sbjct: 806  SYLEFLQTLEKRVDEDWADISSSLEEIRKTVFSKQGCLINITADGKNLANMDKFVSKFVD 865

Query: 1102 -XXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVISKYISN 926
                         +NARL   NEAIVIPTQVNY+GKA N+Y+ GY+LNGSAYVISKYISN
Sbjct: 866  MLPTSSPIATTNIWNARLPLTNEAIVIPTQVNYVGKATNVYDAGYKLNGSAYVISKYISN 925

Query: 925  TWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELELDDDS 746
            TWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNLLKTLEVYDGT +FLRELE+DDD+
Sbjct: 926  TWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLEVYDGTGDFLRELEIDDDT 985

Query: 745  LTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKAFAEAI 566
            LTKAIIGTIGDVDAYQLPDAKGYSSMLR+LLG+TEEERQ+RREEILS+S  DFK F  A+
Sbjct: 986  LTKAIIGTIGDVDAYQLPDAKGYSSMLRYLLGITEEERQRRREEILSTSSKDFKQFIAAM 1045

Query: 565  EAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            EAVK+KG             ANKE +NFF+VK+ L
Sbjct: 1046 EAVKDKGVVVAVASPEDVEAANKELANFFQVKKAL 1080


>ref|XP_007157239.1| hypothetical protein PHAVU_002G054400g [Phaseolus vulgaris]
            gi|561030654|gb|ESW29233.1| hypothetical protein
            PHAVU_002G054400g [Phaseolus vulgaris]
          Length = 1078

 Score = 1608 bits (4165), Expect = 0.0
 Identities = 789/1004 (78%), Positives = 891/1004 (88%), Gaps = 4/1004 (0%)
 Frame = -3

Query: 3460 KAIATPSKHVSPD----FSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMS 3293
            +A+ +PS   SP     F    D+VA + GF+ VSEEFI ECKSKAVL++H KTGA+VMS
Sbjct: 75   RAVLSPSPSSSPSPPPAFPQVEDEVALQFGFQIVSEEFIPECKSKAVLFRHIKTGAQVMS 134

Query: 3292 VSNDDENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAF 3113
            VSNDDENKVFG+VFRTPP DSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAF
Sbjct: 135  VSNDDENKVFGIVFRTPPNDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAF 194

Query: 3112 TYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFK 2933
            TYPDRTCYPVASTN+KDFYNLVDVYLDAVFFPKCVEDFQ FQQEGWH+ELNDPSEDI++K
Sbjct: 195  TYPDRTCYPVASTNSKDFYNLVDVYLDAVFFPKCVEDFQIFQQEGWHFELNDPSEDITYK 254

Query: 2932 GVVFNEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYH 2753
            GVVFNEMKGVYSQPDNILGR+SQQALFPD TYGVDSGGDP+VIPKLTFEEFK FHRKYYH
Sbjct: 255  GVVFNEMKGVYSQPDNILGRASQQALFPDTTYGVDSGGDPRVIPKLTFEEFKEFHRKYYH 314

Query: 2752 PSNARIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEG 2573
            PSN+RIWFYG+DDP ERLRILSEYLD+FD S A +ES+IEPQ LF++PVRI+E YPAGEG
Sbjct: 315  PSNSRIWFYGNDDPKERLRILSEYLDLFDSSLASEESRIEPQTLFSKPVRIVETYPAGEG 374

Query: 2572 GDLKKKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGG 2393
            GDLKKKHMVCLNWLLSDKPLD+ETE          LGTPASPLR+ILLESGLG+AIVGGG
Sbjct: 375  GDLKKKHMVCLNWLLSDKPLDLETELAIGFLNHLLLGTPASPLRKILLESGLGDAIVGGG 434

Query: 2392 IEDELLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLREN 2213
            +EDELLQPQFS+GLKGVSEDDI KVEEL+ + L+KLAEEGFD +A+EAS+NTIEFSLREN
Sbjct: 435  VEDELLQPQFSIGLKGVSEDDIHKVEELVTSTLKKLAEEGFDTDAIEASMNTIEFSLREN 494

Query: 2212 NTGSFPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFIL 2033
            NTGSFPRGLSLML+SIGKWIYD++PFEPLKYEKPL  LK+RI+EEGPK+VF PLIEKFIL
Sbjct: 495  NTGSFPRGLSLMLQSIGKWIYDMNPFEPLKYEKPLQGLKSRIAEEGPKSVFSPLIEKFIL 554

Query: 2032 NNPHRVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPP 1853
            NNPH+VT+EMQPDP KA+++EA E+ IL+KVK SMT  DLAEL RAT ELRLKQETPD P
Sbjct: 555  NNPHKVTVEMQPDPEKAAREEATEKHILQKVKTSMTTEDLAELTRATHELRLKQETPDSP 614

Query: 1852 EALRSVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELV 1673
            EAL++VPSLSL DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY E+VF+M  LK EL+
Sbjct: 615  EALKTVPSLSLQDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTEIVFNMNSLKQELL 674

Query: 1672 PLVPLFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSM 1493
            PLVPLFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSSVRGKEDP SH++VRGK+M
Sbjct: 675  PLVPLFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFTSSVRGKEDPCSHMVVRGKAM 734

Query: 1492 SGRAEDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAA 1313
            +G  EDL++LVN +L DVQFTDQQRFKQFVSQS+ARMENRLRGSGHGIAAARMDAKLNAA
Sbjct: 735  AGCIEDLYDLVNSVLQDVQFTDQQRFKQFVSQSRARMENRLRGSGHGIAAARMDAKLNAA 794

Query: 1312 GWIAEQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTS 1133
            GW++E+MGG+SYLEFLR+LEE+V++DW  ISSSLEEIRKS+ SK+GCLVN+TA+ K L +
Sbjct: 795  GWMSEKMGGLSYLEFLRTLEERVDQDWVDISSSLEEIRKSIFSKQGCLVNVTADRKNLAN 854

Query: 1132 TEKFVSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSA 953
             EK VSKF+D           + +  L   NEAIVIPTQVNY+GKAANIY+ GYQLNGSA
Sbjct: 855  AEKVVSKFVDLLPTRSPIAATNRDFTLPLTNEAIVIPTQVNYVGKAANIYDVGYQLNGSA 914

Query: 952  YVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFL 773
            YVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFS+LSYRDPNLLKTL+VYDGT +FL
Sbjct: 915  YVISKYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSFLSYRDPNLLKTLDVYDGTGDFL 974

Query: 772  RELELDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLN 593
            REL++DDD+LTKAIIGTIGDVDAYQLPDAKGYSSMLR+LLG+TEEERQ+RREEILS+SL 
Sbjct: 975  RELQIDDDTLTKAIIGTIGDVDAYQLPDAKGYSSMLRYLLGITEEERQRRREEILSTSLK 1034

Query: 592  DFKAFAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            DFK F +A+EAVK KG             ANK+R +FF+VK+ L
Sbjct: 1035 DFKNFTDAMEAVKNKGVVVAVASPEDVDAANKDRPDFFQVKKAL 1078


>ref|XP_006829680.1| hypothetical protein AMTR_s00126p00013900 [Amborella trichopoda]
            gi|548835199|gb|ERM97096.1| hypothetical protein
            AMTR_s00126p00013900 [Amborella trichopoda]
          Length = 1075

 Score = 1604 bits (4153), Expect = 0.0
 Identities = 783/1000 (78%), Positives = 884/1000 (88%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +AIATPSK  S    G+HD +A +LGFEKVSE+ I+ECKSKA+LYKHKKTGAEV+SV ND
Sbjct: 77   QAIATPSKQASSGIDGSHD-IAHELGFEKVSEQLIEECKSKAILYKHKKTGAEVISVVND 135

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 136  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 195

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKC+ED+QTFQQEGWHYELN+P E+IS KGVVF
Sbjct: 196  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCIEDYQTFQQEGWHYELNNPEEEISLKGVVF 255

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNI+GR SQQ +FPDNTYGVDSGGDPKVIPKLTFEEFK FHRKYYHPSN+
Sbjct: 256  NEMKGVYSQPDNIMGRISQQVMFPDNTYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNS 315

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            +IWFYGDDDP+ERLR +S YLD FD SSAP ESK+ PQKLF +PV+++EKYPAG+ GDLK
Sbjct: 316  KIWFYGDDDPNERLRTISVYLDQFDASSAPYESKVVPQKLFPKPVKVVEKYPAGDTGDLK 375

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMV LNWLLS++PLD+ETE          LGTPASPLR+ LLESGLG+A++GGGIEDE
Sbjct: 376  KKHMVSLNWLLSEEPLDLETELALGFLDHLMLGTPASPLRKTLLESGLGDALIGGGIEDE 435

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFSVGLKGV+E+D++KVE+LI+  L++LA +GFD EA+EAS+NTIEFSLRENNTGS
Sbjct: 436  LLQPQFSVGLKGVAEEDVRKVEDLIIQTLEELANKGFDVEAIEASMNTIEFSLRENNTGS 495

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL  LKARI+EEG KAVF PLI+KFIL+NPH
Sbjct: 496  FPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLNDLKARIAEEGSKAVFSPLIQKFILDNPH 555

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVTIEMQPD  KAS+DEA E+E LEKVKASMTE DLAELARAT+ELRLKQETPDPPE L+
Sbjct: 556  RVTIEMQPDTEKASRDEADEKESLEKVKASMTEEDLAELARATQELRLKQETPDPPEVLK 615

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
             VPSLSL+DIPK PI +PIE+GEINGV+ LQH+LFTNDVLYAEVVFDM  +K EL+PL+P
Sbjct: 616  CVPSLSLHDIPKHPIHVPIEIGEINGVKVLQHELFTNDVLYAEVVFDMCLVKQELLPLIP 675

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKDMDFVQLNQLIGRKTGGIS+YPFTSS+RGK +P S +IVR KSM+ R 
Sbjct: 676  LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISIYPFTSSIRGKVEPCSRIIVRAKSMAARV 735

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            +DLFNLVN +L DVQFTDQQRFKQFV QSKARME+RLRGSGHGIAAARMDAKLN AGWIA
Sbjct: 736  DDLFNLVNTVLQDVQFTDQQRFKQFVCQSKARMESRLRGSGHGIAAARMDAKLNTAGWIA 795

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGGISYL+FL +LE++V++DW  IS SLE+IR+SLLS+KGCL+NLTA+GK L+++EK 
Sbjct: 796  EQMGGISYLQFLETLEKQVDQDWSAISCSLEDIRRSLLSRKGCLINLTADGKNLSNSEKH 855

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            VSKFLD           S+ A+L   NEA+VIPTQVNY+GKA N+Y+TGYQLNGS YVIS
Sbjct: 856  VSKFLDLLPATSSLETTSWKAQLYLGNEALVIPTQVNYVGKAGNLYDTGYQLNGSTYVIS 915

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
             YI NTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTL++YDGT NFLRELE
Sbjct: 916  MYIGNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLDIYDGTANFLRELE 975

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            LD+D+LTKAIIGTIGDVD YQLPDAKGYSSMLR+LLG+TEEERQKR EEILS+SL DF  
Sbjct: 976  LDEDTLTKAIIGTIGDVDGYQLPDAKGYSSMLRYLLGITEEERQKRHEEILSTSLKDFHD 1035

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FA+ ++ VK KG             AN+ER  FF+VK+VL
Sbjct: 1036 FADVVDVVKHKGVVVAVASEDDVTAANEERPGFFQVKKVL 1075


>ref|XP_007042384.1| Presequence protease 2 isoform 1 [Theobroma cacao]
            gi|508706319|gb|EOX98215.1| Presequence protease 2
            isoform 1 [Theobroma cacao]
          Length = 1037

 Score = 1599 bits (4141), Expect = 0.0
 Identities = 775/932 (83%), Positives = 867/932 (93%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+P++  SPD +G  D+VAEKLGFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSND
Sbjct: 87   RAVASPTQP-SPDIAGVEDEVAEKLGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSND 145

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 146  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 205

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTNAKDFYNLVDVYLDAVFFPKC+EDFQTFQQEGWHYELND SEDI++KGVVF
Sbjct: 206  RTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDITYKGVVF 265

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDN+LGR++QQALFPDNTYGVDSGGDP+VIPKLT+EEFK FHRKYYHPSNA
Sbjct: 266  NEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRKYYHPSNA 325

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP ERLRILSEYLDMFD S+APDESK+EPQKLF+EPVR +EKYP GEGGDLK
Sbjct: 326  RIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPVGEGGDLK 385

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVCLNWLLSDKPLD++TE          LGTPASPLR++LLESGLG+AI+GGG+EDE
Sbjct: 386  KKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAIIGGGVEDE 445

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSEDDI KVEELI+++L+KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 446  LLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSLRENNTGS 505

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL+ LKARI+EEG KAVF PLIEKFILNNPH
Sbjct: 506  FPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEKFILNNPH 565

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VTIEMQPDP KAS+DEAAE+EIL KVKASMTE DLAELARAT+EL+LKQETPDPPEALR
Sbjct: 566  CVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETPDPPEALR 625

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL+DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY +VVFDM+ LK EL+PLVP
Sbjct: 626  SVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTDVVFDMSSLKRELLPLVP 685

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS++GKEDP SH+IVRGKSM+G A
Sbjct: 686  LFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFTSSIQGKEDPCSHIIVRGKSMAGCA 745

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            +DLFNL+NC++ +VQFTDQQRFKQFVSQSKARME+RLRGSGHGIAAARMDAKLN +GWI+
Sbjct: 746  DDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESRLRGSGHGIAAARMDAKLNVSGWIS 805

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL+ LEE+V+ DW  ISSSLEEIRKSLLS++GCL+N+TA+GK L++TEK 
Sbjct: 806  EQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKSLLSREGCLINMTADGKNLSNTEKL 865

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            VSKFLD           S++ARL   NEAIVIPTQVNY+GKAAN+Y+ GYQLNGSAYVIS
Sbjct: 866  VSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQVNYVGKAANLYDGGYQLNGSAYVIS 925

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFC+FDTHSGVF++LSYRDPNLL+TL++YDGT +FLRELE
Sbjct: 926  KHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLSYRDPNLLETLDIYDGTGDFLRELE 985

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSML 665
            +DDD+LTKAIIGT+GDVDAYQLPDAKGYS  L
Sbjct: 986  MDDDTLTKAIIGTVGDVDAYQLPDAKGYSRFL 1017


>ref|XP_007042386.1| Presequence protease 2 isoform 3 [Theobroma cacao]
            gi|508706321|gb|EOX98217.1| Presequence protease 2
            isoform 3 [Theobroma cacao]
          Length = 1041

 Score = 1598 bits (4138), Expect = 0.0
 Identities = 774/929 (83%), Positives = 866/929 (93%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+A+P++  SPD +G  D+VAEKLGFEKVSEEFI ECKSKAVL+KHKKTGAEVMSVSND
Sbjct: 87   RAVASPTQP-SPDIAGVEDEVAEKLGFEKVSEEFIGECKSKAVLFKHKKTGAEVMSVSND 145

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+VFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 146  DENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 205

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTNAKDFYNLVDVYLDAVFFPKC+EDFQTFQQEGWHYELND SEDI++KGVVF
Sbjct: 206  RTCYPVASTNAKDFYNLVDVYLDAVFFPKCIEDFQTFQQEGWHYELNDTSEDITYKGVVF 265

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDN+LGR++QQALFPDNTYGVDSGGDP+VIPKLT+EEFK FHRKYYHPSNA
Sbjct: 266  NEMKGVYSQPDNLLGRTAQQALFPDNTYGVDSGGDPQVIPKLTYEEFKEFHRKYYHPSNA 325

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP ERLRILSEYLDMFD S+APDESK+EPQKLF+EPVR +EKYP GEGGDLK
Sbjct: 326  RIWFYGDDDPIERLRILSEYLDMFDASTAPDESKVEPQKLFSEPVRFVEKYPVGEGGDLK 385

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KKHMVCLNWLLSDKPLD++TE          LGTPASPLR++LLESGLG+AI+GGG+EDE
Sbjct: 386  KKHMVCLNWLLSDKPLDLQTELTLGFLDHLMLGTPASPLRKVLLESGLGDAIIGGGVEDE 445

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSEDDI KVEELI+++L+KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 446  LLQPQFSIGLKGVSEDDIPKVEELIMSSLKKLAEEGFDTDAVEASMNTIEFSLRENNTGS 505

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLMLRSIGKWIYD+DPFEPLKYEKPL+ LKARI+EEG KAVF PLIEKFILNNPH
Sbjct: 506  FPRGLSLMLRSIGKWIYDMDPFEPLKYEKPLMILKARIAEEGSKAVFSPLIEKFILNNPH 565

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
             VTIEMQPDP KAS+DEAAE+EIL KVKASMTE DLAELARAT+EL+LKQETPDPPEALR
Sbjct: 566  CVTIEMQPDPEKASRDEAAEKEILNKVKASMTEEDLAELARATQELKLKQETPDPPEALR 625

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
            SVPSLSL+DIPK+PI++P EVG+INGV+ LQHDLFTNDVLY +VVFDM+ LK EL+PLVP
Sbjct: 626  SVPSLSLHDIPKEPIRVPTEVGDINGVKVLQHDLFTNDVLYTDVVFDMSSLKRELLPLVP 685

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYPFTSS++GKEDP SH+IVRGKSM+G A
Sbjct: 686  LFCQSLLEMGTKDLSFVQLNQLIGRKTGGISVYPFTSSIQGKEDPCSHIIVRGKSMAGCA 745

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            +DLFNL+NC++ +VQFTDQQRFKQFVSQSKARME+RLRGSGHGIAAARMDAKLN +GWI+
Sbjct: 746  DDLFNLINCVIQEVQFTDQQRFKQFVSQSKARMESRLRGSGHGIAAARMDAKLNVSGWIS 805

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFL+ LEE+V+ DW  ISSSLEEIRKSLLS++GCL+N+TA+GK L++TEK 
Sbjct: 806  EQMGGVSYLEFLQGLEERVDNDWAGISSSLEEIRKSLLSREGCLINMTADGKNLSNTEKL 865

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            VSKFLD           S++ARL   NEAIVIPTQVNY+GKAAN+Y+ GYQLNGSAYVIS
Sbjct: 866  VSKFLDLLPSNSVVERASWSARLPSNNEAIVIPTQVNYVGKAANLYDGGYQLNGSAYVIS 925

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFC+FDTHSGVF++LSYRDPNLL+TL++YDGT +FLRELE
Sbjct: 926  KHISNTWLWDRVRVSGGAYGGFCNFDTHSGVFTFLSYRDPNLLETLDIYDGTGDFLRELE 985

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYS 674
            +DDD+LTKAIIGT+GDVDAYQLPDAKGYS
Sbjct: 986  MDDDTLTKAIIGTVGDVDAYQLPDAKGYS 1014


>ref|XP_006406530.1| hypothetical protein EUTSA_v10019955mg [Eutrema salsugineum]
            gi|557107676|gb|ESQ47983.1| hypothetical protein
            EUTSA_v10019955mg [Eutrema salsugineum]
          Length = 1080

 Score = 1579 bits (4089), Expect = 0.0
 Identities = 776/1000 (77%), Positives = 879/1000 (87%)
 Frame = -3

Query: 3460 KAIATPSKHVSPDFSGAHDDVAEKLGFEKVSEEFIDECKSKAVLYKHKKTGAEVMSVSND 3281
            +A+AT     +P +     D AEKLGFEKVSEEFI ECKSKA+L+KHKKTG EVMSVSN+
Sbjct: 84   RAVATQP---APSYPDVGQDEAEKLGFEKVSEEFISECKSKAILFKHKKTGCEVMSVSNE 140

Query: 3280 DENKVFGVVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPD 3101
            DENKVFG+V RTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSL+TFLNAFTYPD
Sbjct: 141  DENKVFGIVLRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLHTFLNAFTYPD 200

Query: 3100 RTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSEDISFKGVVF 2921
            RTCYPVASTN KDFYNLVDVYLDAVFFPKCV+D  TFQQEGWHYELNDPSEDIS+KGVVF
Sbjct: 201  RTCYPVASTNTKDFYNLVDVYLDAVFFPKCVDDVHTFQQEGWHYELNDPSEDISYKGVVF 260

Query: 2920 NEMKGVYSQPDNILGRSSQQALFPDNTYGVDSGGDPKVIPKLTFEEFKNFHRKYYHPSNA 2741
            NEMKGVYSQPDNILGR +QQAL P+NTYGVDSGGDPKVIPKLTFEEFK FHRKYYHPSNA
Sbjct: 261  NEMKGVYSQPDNILGRIAQQALSPENTYGVDSGGDPKVIPKLTFEEFKEFHRKYYHPSNA 320

Query: 2740 RIWFYGDDDPSERLRILSEYLDMFDESSAPDESKIEPQKLFTEPVRIIEKYPAGEGGDLK 2561
            RIWFYGDDDP  RLR+LSEYLDMF+ES +PD SKIEPQKLF+EP+R++EKYPAG  GDLK
Sbjct: 321  RIWFYGDDDPVHRLRVLSEYLDMFEESPSPDSSKIEPQKLFSEPIRLVEKYPAGRDGDLK 380

Query: 2560 KKHMVCLNWLLSDKPLDVETEXXXXXXXXXXLGTPASPLRRILLESGLGEAIVGGGIEDE 2381
            KK+M+C+NWLLS+KPLD++T+          LGTPASPLR+ILLESGLGEA+V  G+ DE
Sbjct: 381  KKNMLCVNWLLSEKPLDLQTQLALGFLDHLMLGTPASPLRKILLESGLGEALVSSGMSDE 440

Query: 2380 LLQPQFSVGLKGVSEDDIQKVEELILNNLQKLAEEGFDPEAVEASLNTIEFSLRENNTGS 2201
            LLQPQFS+GLKGVSED++QKVEELI+N L+KLAEEGFD +AVEAS+NTIEFSLRENNTGS
Sbjct: 441  LLQPQFSIGLKGVSEDNVQKVEELIMNTLKKLAEEGFDNDAVEASMNTIEFSLRENNTGS 500

Query: 2200 FPRGLSLMLRSIGKWIYDLDPFEPLKYEKPLLALKARISEEGPKAVFCPLIEKFILNNPH 2021
            FPRGLSLML+SI KWIYD+DPFEPLKY +PL ALK RI+EEG KAVF PLIE++ILNN H
Sbjct: 501  FPRGLSLMLQSIAKWIYDMDPFEPLKYTEPLKALKTRIAEEGSKAVFSPLIEQYILNNSH 560

Query: 2020 RVTIEMQPDPVKASQDEAAEREILEKVKASMTEVDLAELARATRELRLKQETPDPPEALR 1841
            RVTIEMQPDP  ASQ+EA E+ ILEKVKA MTE DLAELARAT ELRLKQETPDPPEALR
Sbjct: 561  RVTIEMQPDPEYASQEEAEEKSILEKVKAGMTEEDLAELARATEELRLKQETPDPPEALR 620

Query: 1840 SVPSLSLNDIPKKPIQIPIEVGEINGVRTLQHDLFTNDVLYAEVVFDMAHLKAELVPLVP 1661
             VPSL+L DIPK+P  +P EVG+INGV+ L+HDLFTND++YAEVVFDM  LK EL+PLVP
Sbjct: 621  CVPSLNLGDIPKEPTYVPTEVGDINGVKVLRHDLFTNDIIYAEVVFDMGSLKHELLPLVP 680

Query: 1660 LFCQSLLEMGTKDMDFVQLNQLIGRKTGGISVYPFTSSVRGKEDPTSHVIVRGKSMSGRA 1481
            LFCQSLLEMGTKD+ FVQLNQLIGRKTGGISVYP TSSVRGK +P S +IVRGKSM+GRA
Sbjct: 681  LFCQSLLEMGTKDLTFVQLNQLIGRKTGGISVYPLTSSVRGKSEPCSKIIVRGKSMAGRA 740

Query: 1480 EDLFNLVNCILHDVQFTDQQRFKQFVSQSKARMENRLRGSGHGIAAARMDAKLNAAGWIA 1301
            EDLFNL+NC+L +VQFTDQQRFKQFVSQS ARMENRLRGSGHGIAAARMDA LN AGW++
Sbjct: 741  EDLFNLMNCLLQEVQFTDQQRFKQFVSQSIARMENRLRGSGHGIAAARMDAMLNIAGWMS 800

Query: 1300 EQMGGISYLEFLRSLEEKVEKDWFVISSSLEEIRKSLLSKKGCLVNLTAEGKTLTSTEKF 1121
            EQMGG+SYLEFLR+LE+K+++DW  ISSSLEEIR+SLL++ G +VN+TA+GK+LT+ EK 
Sbjct: 801  EQMGGLSYLEFLRTLEKKLDEDWEGISSSLEEIRRSLLARNGSIVNMTADGKSLTNVEKS 860

Query: 1120 VSKFLDXXXXXXXXXXXSFNARLLPVNEAIVIPTQVNYIGKAANIYETGYQLNGSAYVIS 941
            V KFLD           +++ RL   NEAIVIPTQVNY+GKA NIY TGY+L+GSAYVIS
Sbjct: 861  VEKFLDLLPENPSGGLVTWDGRLPLRNEAIVIPTQVNYVGKAGNIYSTGYELDGSAYVIS 920

Query: 940  KYISNTWLWDRVRVSGGAYGGFCDFDTHSGVFSYLSYRDPNLLKTLEVYDGTTNFLRELE 761
            K+ISNTWLWDRVRVSGGAYGGFCDFD+HSGVFS+LSYRDPNLLKTL++YDGT +FLR L+
Sbjct: 921  KHISNTWLWDRVRVSGGAYGGFCDFDSHSGVFSFLSYRDPNLLKTLDIYDGTGDFLRGLD 980

Query: 760  LDDDSLTKAIIGTIGDVDAYQLPDAKGYSSMLRFLLGVTEEERQKRREEILSSSLNDFKA 581
            +D ++LTKAIIGTIGDVD+YQLPDAKGYSS+LR LLGVT+EERQK+REEIL++SL DFK 
Sbjct: 981  VDQETLTKAIIGTIGDVDSYQLPDAKGYSSLLRHLLGVTDEERQKKREEILTTSLKDFKD 1040

Query: 580  FAEAIEAVKEKGXXXXXXXXXXXXXANKERSNFFEVKQVL 461
            FAEAI+ V++KG             AN  RSNFFEVK+ L
Sbjct: 1041 FAEAIDVVRDKGVAVAVASAEDIDAANNARSNFFEVKKAL 1080