BLASTX nr result

ID: Phellodendron21_contig00004211 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00004211
         (3313 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_006445143.1 hypothetical protein CICLE_v10018855mg [Citrus cl...  1342   0.0  
XP_006445141.1 hypothetical protein CICLE_v10018855mg [Citrus cl...  1340   0.0  
KDO86022.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]   1337   0.0  
KDO86021.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]   1323   0.0  
EOX96069.1 HD domain class transcription factor isoform 1 [Theob...  1305   0.0  
XP_007051912.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1303   0.0  
EOX96070.1 HD domain class transcription factor isoform 2 [Theob...  1303   0.0  
OMP00515.1 hypothetical protein COLO4_12612 [Corchorus olitorius]    1302   0.0  
XP_007051913.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1302   0.0  
OAY49229.1 hypothetical protein MANES_05G039400 [Manihot esculenta]  1286   0.0  
XP_015584500.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1283   0.0  
XP_018813891.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1281   0.0  
GAV85584.1 Homeobox domain-containing protein/START domain-conta...  1279   0.0  
XP_002511801.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1278   0.0  
XP_010661562.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1277   0.0  
KDO86029.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]   1275   0.0  
XP_002272264.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1273   0.0  
XP_002320755.1 homeodomain family protein [Populus trichocarpa] ...  1269   0.0  
XP_011035097.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...  1269   0.0  
XP_002301331.2 homeodomain family protein [Populus trichocarpa] ...  1259   0.0  

>XP_006445143.1 hypothetical protein CICLE_v10018855mg [Citrus clementina]
            XP_006491020.1 PREDICTED: homeobox-leucine zipper protein
            ANTHOCYANINLESS 2 isoform X1 [Citrus sinensis] ESR58383.1
            hypothetical protein CICLE_v10018855mg [Citrus
            clementina] KDO86023.1 hypothetical protein
            CISIN_1g002869mg [Citrus sinensis]
          Length = 836

 Score = 1342 bits (3472), Expect = 0.0
 Identities = 715/846 (84%), Positives = 727/846 (85%), Gaps = 21/846 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLS-TP 608
            MSFGGFLE N S           ARIVADISYTNN       MP     LAHPRLLS TP
Sbjct: 1    MSFGGFLENNISTSSGGGG----ARIVADISYTNNDNNNNNNMPTTTT-LAHPRLLSSTP 55

Query: 609  -PLSKSMFNSPGLSLALQQP-IDNQG----DMQRMGESNFEGIIGRRSRENQ-EHESRSG 767
             PLSKSMFNSPGLSLALQQP IDNQG     +QRMGES FEGIIGRRSRE+  EHESRSG
Sbjct: 56   QPLSKSMFNSPGLSLALQQPNIDNQGGGDLQLQRMGES-FEGIIGRRSREDLLEHESRSG 114

Query: 768  SDNMEGASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLE 947
            SDNM+GASGDDLD  D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLE
Sbjct: 115  SDNMDGASGDDLDAADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLE 174

Query: 948  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIG 1127
            TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IG
Sbjct: 175  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIG 234

Query: 1128 DISLEEQHLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX-ELGVGN-NGFGG 1301
            DISLEEQHLRIENARLKDELDRVCALAGKFLGR               ELGVG  NGFGG
Sbjct: 235  DISLEEQHLRIENARLKDELDRVCALAGKFLGRPVSSMGPPPMPNSSLELGVGTINGFGG 294

Query: 1302 LSTVTTTTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDE 1481
            LS+  TTTLP   ADFGTG  SN    VM  NRSGPGVTGLDRSIERSMFLELAL AMDE
Sbjct: 295  LSSTVTTTLP---ADFGTG-ISNALPVVMPPNRSGPGVTGLDRSIERSMFLELALAAMDE 350

Query: 1482 LVKMAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGL 1661
            LVKMAQTDEPLWIRS EG GR+VLN EEYL TFTPCIGLKPNGFVTEASRETGMVIIN L
Sbjct: 351  LVKMAQTDEPLWIRSFEGSGRQVLNHEEYLRTFTPCIGLKPNGFVTEASRETGMVIINSL 410

Query: 1662 ALVETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV 1841
            ALVETLMDPNRWAEMFPCMIART+TTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV
Sbjct: 411  ALVETLMDPNRWAEMFPCMIARTATTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV 470

Query: 1842 NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEH 2021
            NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEH
Sbjct: 471  NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEH 530

Query: 2022 AEYDESQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRR 2201
            AEYDESQVHQLYKP + SGMGFGAQRWVATLQRQCECLAILMS++V ARDHTAITAGGRR
Sbjct: 531  AEYDESQVHQLYKPLIISGMGFGAQRWVATLQRQCECLAILMSTSVSARDHTAITAGGRR 590

Query: 2202 SMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAAT 2381
            SMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAAT
Sbjct: 591  SMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAAT 650

Query: 2382 SVWLPVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSN 2561
            SVWLPVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN+N
Sbjct: 651  SVWLPVSPQRLFNFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINAN 710

Query: 2562 QSSMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXX 2729
            QSSMLILQETC DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD       
Sbjct: 711  QSSMLILQETCTDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPDSRGP 770

Query: 2730 XXXXXXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKA 2909
                                 LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKA
Sbjct: 771  LANGPTSGNGSNGGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKA 830

Query: 2910 ALQCES 2927
            ALQCES
Sbjct: 831  ALQCES 836


>XP_006445141.1 hypothetical protein CICLE_v10018855mg [Citrus clementina]
            XP_006491021.1 PREDICTED: homeobox-leucine zipper protein
            ANTHOCYANINLESS 2 isoform X2 [Citrus sinensis] ESR58381.1
            hypothetical protein CICLE_v10018855mg [Citrus
            clementina] KDO86024.1 hypothetical protein
            CISIN_1g002869mg [Citrus sinensis]
          Length = 835

 Score = 1340 bits (3469), Expect = 0.0
 Identities = 713/845 (84%), Positives = 725/845 (85%), Gaps = 20/845 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLS-TP 608
            MSFGGFLE N S           ARIVADISYTNN       MP     LAHPRLLS TP
Sbjct: 1    MSFGGFLENNISTSSGGGG----ARIVADISYTNNDNNNNNNMPTTTT-LAHPRLLSSTP 55

Query: 609  -PLSKSMFNSPGLSLALQQPIDNQG----DMQRMGESNFEGIIGRRSRENQ-EHESRSGS 770
             PLSKSMFNSPGLSLALQ  IDNQG     +QRMGES FEGIIGRRSRE+  EHESRSGS
Sbjct: 56   QPLSKSMFNSPGLSLALQPNIDNQGGGDLQLQRMGES-FEGIIGRRSREDLLEHESRSGS 114

Query: 771  DNMEGASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLET 950
            DNM+GASGDDLD  D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLET
Sbjct: 115  DNMDGASGDDLDAADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLET 174

Query: 951  RQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGD 1130
            RQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGD
Sbjct: 175  RQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGD 234

Query: 1131 ISLEEQHLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX-ELGVGN-NGFGGL 1304
            ISLEEQHLRIENARLKDELDRVCALAGKFLGR               ELGVG  NGFGGL
Sbjct: 235  ISLEEQHLRIENARLKDELDRVCALAGKFLGRPVSSMGPPPMPNSSLELGVGTINGFGGL 294

Query: 1305 STVTTTTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDEL 1484
            S+  TTTLP   ADFGTG  SN    VM  NRSGPGVTGLDRSIERSMFLELAL AMDEL
Sbjct: 295  SSTVTTTLP---ADFGTG-ISNALPVVMPPNRSGPGVTGLDRSIERSMFLELALAAMDEL 350

Query: 1485 VKMAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLA 1664
            VKMAQTDEPLWIRS EG GR+VLN EEYL TFTPCIGLKPNGFVTEASRETGMVIIN LA
Sbjct: 351  VKMAQTDEPLWIRSFEGSGRQVLNHEEYLRTFTPCIGLKPNGFVTEASRETGMVIINSLA 410

Query: 1665 LVETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVN 1844
            LVETLMDPNRWAEMFPCMIART+TTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVN
Sbjct: 411  LVETLMDPNRWAEMFPCMIARTATTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVN 470

Query: 1845 FLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHA 2024
            FLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHA
Sbjct: 471  FLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHA 530

Query: 2025 EYDESQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRS 2204
            EYDESQVHQLYKP + SGMGFGAQRWVATLQRQCECLAILMS++V ARDHTAITAGGRRS
Sbjct: 531  EYDESQVHQLYKPLIISGMGFGAQRWVATLQRQCECLAILMSTSVSARDHTAITAGGRRS 590

Query: 2205 MLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATS 2384
            MLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATS
Sbjct: 591  MLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATS 650

Query: 2385 VWLPVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQ 2564
            VWLPVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN+NQ
Sbjct: 651  VWLPVSPQRLFNFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINANQ 710

Query: 2565 SSMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXXX 2732
            SSMLILQETC DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD        
Sbjct: 711  SSMLILQETCTDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPDSRGPL 770

Query: 2733 XXXXXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA 2912
                                LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA
Sbjct: 771  ANGPTSGNGSNGGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA 830

Query: 2913 LQCES 2927
            LQCES
Sbjct: 831  LQCES 835


>KDO86022.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]
          Length = 838

 Score = 1337 bits (3459), Expect = 0.0
 Identities = 715/848 (84%), Positives = 727/848 (85%), Gaps = 23/848 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLS-TP 608
            MSFGGFLE N S           ARIVADISYTNN       MP     LAHPRLLS TP
Sbjct: 1    MSFGGFLENNISTSSGGGG----ARIVADISYTNNDNNNNNNMPTTTT-LAHPRLLSSTP 55

Query: 609  -PLSKSMFNSPGLSLALQQP-IDNQG----DMQRMGESNFEGIIGRRSRENQ-EHESRSG 767
             PLSKSMFNSPGLSLALQQP IDNQG     +QRMGES FEGIIGRRSRE+  EHESRSG
Sbjct: 56   QPLSKSMFNSPGLSLALQQPNIDNQGGGDLQLQRMGES-FEGIIGRRSREDLLEHESRSG 114

Query: 768  SDNMEGASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLE 947
            SDNM+GASGDDLD  D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLE
Sbjct: 115  SDNMDGASGDDLDAADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLE 174

Query: 948  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIG 1127
            TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IG
Sbjct: 175  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIG 234

Query: 1128 DISLEEQHLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX-ELGVGN-NGFGG 1301
            DISLEEQHLRIENARLKDELDRVCALAGKFLGR               ELGVG  NGFGG
Sbjct: 235  DISLEEQHLRIENARLKDELDRVCALAGKFLGRPVSSMGPPPMPNSSLELGVGTINGFGG 294

Query: 1302 LSTVTTTTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDE 1481
            LS+  TTTLP   ADFGTG  SN    VM  NRSGPGVTGLDRSIERSMFLELAL AMDE
Sbjct: 295  LSSTVTTTLP---ADFGTG-ISNALPVVMPPNRSGPGVTGLDRSIERSMFLELALAAMDE 350

Query: 1482 LVKMAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGL 1661
            LVKMAQTDEPLWIRS EG GR+VLN EEYL TFTPCIGLKPNGFVTEASRETGMVIIN L
Sbjct: 351  LVKMAQTDEPLWIRSFEGSGRQVLNHEEYLRTFTPCIGLKPNGFVTEASRETGMVIINSL 410

Query: 1662 ALVETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV 1841
            ALVETLMDPNRWAEMFPCMIART+TTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV
Sbjct: 411  ALVETLMDPNRWAEMFPCMIARTATTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV 470

Query: 1842 NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSK--VTWV 2015
            NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSK  VTWV
Sbjct: 471  NFLRFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKLQVTWV 530

Query: 2016 EHAEYDESQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGG 2195
            EHAEYDESQVHQLYKP + SGMGFGAQRWVATLQRQCECLAILMS++V ARDHTAITAGG
Sbjct: 531  EHAEYDESQVHQLYKPLIISGMGFGAQRWVATLQRQCECLAILMSTSVSARDHTAITAGG 590

Query: 2196 RRSMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSA 2375
            RRSMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSA
Sbjct: 591  RRSMLKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSA 650

Query: 2376 ATSVWLPVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN 2555
            ATSVWLPVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN
Sbjct: 651  ATSVWLPVSPQRLFNFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN 710

Query: 2556 SNQSSMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----X 2723
            +NQSSMLILQETC DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD     
Sbjct: 711  ANQSSMLILQETCTDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPDSR 770

Query: 2724 XXXXXXXXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKI 2903
                                   LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKI
Sbjct: 771  GPLANGPTSGNGSNGGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKI 830

Query: 2904 KAALQCES 2927
            KAALQCES
Sbjct: 831  KAALQCES 838


>KDO86021.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]
          Length = 872

 Score = 1323 bits (3425), Expect = 0.0
 Identities = 715/882 (81%), Positives = 727/882 (82%), Gaps = 57/882 (6%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLS-TP 608
            MSFGGFLE N S           ARIVADISYTNN       MP     LAHPRLLS TP
Sbjct: 1    MSFGGFLENNISTSSGGGG----ARIVADISYTNNDNNNNNNMPTTTT-LAHPRLLSSTP 55

Query: 609  -PLSKSMFNSPGLSLALQQP-IDNQG----DMQRMGESNFEGIIGRRSRENQ-EHESRSG 767
             PLSKSMFNSPGLSLALQQP IDNQG     +QRMGES FEGIIGRRSRE+  EHESRSG
Sbjct: 56   QPLSKSMFNSPGLSLALQQPNIDNQGGGDLQLQRMGES-FEGIIGRRSREDLLEHESRSG 114

Query: 768  SDNMEGASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLE 947
            SDNM+GASGDDLD  D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLE
Sbjct: 115  SDNMDGASGDDLDAADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLE 174

Query: 948  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIG 1127
            TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IG
Sbjct: 175  TRQVKFWFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIG 234

Query: 1128 DISLEEQHLRIENARLKDELDRVCALAGKFLGR-XXXXXXXXXXXXXXELGVGN-NGFGG 1301
            DISLEEQHLRIENARLKDELDRVCALAGKFLGR               ELGVG  NGFGG
Sbjct: 235  DISLEEQHLRIENARLKDELDRVCALAGKFLGRPVSSMGPPPMPNSSLELGVGTINGFGG 294

Query: 1302 LSTVTTTTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDE 1481
            LS+  TTTL   PADFGT G SN    VM  NRSGPGVTGLDRSIERSMFLELAL AMDE
Sbjct: 295  LSSTVTTTL---PADFGT-GISNALPVVMPPNRSGPGVTGLDRSIERSMFLELALAAMDE 350

Query: 1482 LVKMAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGL 1661
            LVKMAQTDEPLWIRS EG GR+VLN EEYL TFTPCIGLKPNGFVTEASRETGMVIIN L
Sbjct: 351  LVKMAQTDEPLWIRSFEGSGRQVLNHEEYLRTFTPCIGLKPNGFVTEASRETGMVIINSL 410

Query: 1662 ALVETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQL----------------- 1790
            ALVETLMDPNRWAEMFPCMIART+TTDVISSGMGGTRNGALQL                 
Sbjct: 411  ALVETLMDPNRWAEMFPCMIARTATTDVISSGMGGTRNGALQLVEFYNSIINEHLINYFL 470

Query: 1791 -------------------MHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVVDVSIDTI 1913
                               MHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVVDVSIDTI
Sbjct: 471  LIILVYKKIKIKLFFSFLEMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVVDVSIDTI 530

Query: 1914 RETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKPFVSSGMGFGA 2093
            RETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKP + SGMGFGA
Sbjct: 531  RETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKPLIISGMGFGA 590

Query: 2094 QRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDNFCAGVCASTVHK 2273
            QRWVATLQRQCECLAILMS++V ARDHTAITAGGRRSMLKLAQRMTDNFCAGVCASTVHK
Sbjct: 591  QRWVATLQRQCECLAILMSTSVSARDHTAITAGGRRSMLKLAQRMTDNFCAGVCASTVHK 650

Query: 2274 WNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXXXXXXXXXXXSEW 2453
            WNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ            SEW
Sbjct: 651  WNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFNFLRDERLRSEW 710

Query: 2454 DILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCIDAAGSLVVYAPV 2633
            DILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN+NQSSMLILQETC DAAGSLVVYAPV
Sbjct: 711  DILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINANQSSMLILQETCTDAAGSLVVYAPV 770

Query: 2634 DIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXXXXXXXXXXXXXXXXXXXXXXXLLT 2801
            DIPAMHVVMNGGDSAYVALLPSGFAIVPD                            LLT
Sbjct: 771  DIPAMHVVMNGGDSAYVALLPSGFAIVPDGPDSRGPLANGPTSGNGSNGGSQRVGGSLLT 830

Query: 2802 VAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
            VAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 831  VAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 872


>EOX96069.1 HD domain class transcription factor isoform 1 [Theobroma cacao]
          Length = 819

 Score = 1305 bits (3376), Expect = 0.0
 Identities = 689/837 (82%), Positives = 715/837 (85%), Gaps = 12/837 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNMP     +A PRL+S P L+K+MFN
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMPTGA--IAQPRLVS-PSLAKNMFN 49

Query: 633  SPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDV 809
            SPGLSLALQQP IDNQGD  RMGE NFEG +GRRSRE +EHESRSGSDNM+G SGDD D 
Sbjct: 50   SPGLSLALQQPNIDNQGDGTRMGE-NFEGSVGRRSRE-EEHESRSGSDNMDGGSGDDQDA 107

Query: 810  TDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ 989
             D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQ
Sbjct: 108  ADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQ 167

Query: 990  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENA 1169
            MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENA
Sbjct: 168  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENA 227

Query: 1170 RLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLG 1337
            RLKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLG
Sbjct: 228  RLKDELDRVCALAGKFLGRPISALATSIAPPMPNSSLELGVGSNGFGGLSTVPTT-LPLG 286

Query: 1338 PADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLW 1517
            P DFG G T  NAL V   NR   GVTGLDRS+ERSMFLELAL AMDELVKMAQTDEPLW
Sbjct: 287  P-DFGGGIT--NALPVAPPNRPTTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLW 343

Query: 1518 IRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRW 1697
            IRSLEG GRE+LN +EYL TFTPCIG+KP GFVTEASRETG+VIIN LALVETLMD  RW
Sbjct: 344  IRSLEG-GREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRW 402

Query: 1698 AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 1877
            AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG
Sbjct: 403  AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 462

Query: 1878 VWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLY 2057
            VWAVVDVSIDTIRETSGAP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY+ESQVHQLY
Sbjct: 463  VWAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLY 522

Query: 2058 KPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDN 2237
            +P +SSGMGFGAQRWVATLQRQCECLAILMSS V  RDHTAITA GRRSMLKLAQRMTDN
Sbjct: 523  RPLLSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDN 582

Query: 2238 FCAGVCASTVHKWNKL-NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXX 2414
            FCAGVCAST+HKWNKL NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ  
Sbjct: 583  FCAGVCASTLHKWNKLNNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 642

Query: 2415 XXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETC 2594
                      SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETC
Sbjct: 643  FDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 702

Query: 2595 IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD------XXXXXXXXXXXX 2756
            IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                  
Sbjct: 703  IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTSNGHVNGNGG 762

Query: 2757 XXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                        LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 763  GGGGRSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 819


>XP_007051912.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Theobroma cacao]
          Length = 819

 Score = 1303 bits (3373), Expect = 0.0
 Identities = 688/837 (82%), Positives = 716/837 (85%), Gaps = 12/837 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNMP     +A PRL+S P L+K+MFN
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMPTGA--IAQPRLVS-PSLAKNMFN 49

Query: 633  SPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDV 809
            SPGLSLALQQP IDNQGD  RMGE NFEG +GRRSRE +EHESRSGSDNM+G SGDD D 
Sbjct: 50   SPGLSLALQQPNIDNQGDGTRMGE-NFEGSVGRRSRE-EEHESRSGSDNMDGGSGDDQDA 107

Query: 810  TDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ 989
             D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQ
Sbjct: 108  ADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQ 167

Query: 990  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENA 1169
            MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENA
Sbjct: 168  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENA 227

Query: 1170 RLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLG 1337
            RLKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLG
Sbjct: 228  RLKDELDRVCALAGKFLGRPISALATSIAPPMPNSSLELGVGSNGFGGLSTVPTT-LPLG 286

Query: 1338 PADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLW 1517
            P DFG G T  NAL V   NR+  GVTGLDRS+ERSMFLELAL AMDELVKMAQTDEPLW
Sbjct: 287  P-DFGGGIT--NALPVAPPNRATTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLW 343

Query: 1518 IRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRW 1697
            IRSLEG GRE+LN +EYL TFTPCIG+KP GFVTEASRETG+VIIN LALVETLMD  RW
Sbjct: 344  IRSLEG-GREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRW 402

Query: 1698 AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 1877
            AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG
Sbjct: 403  AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 462

Query: 1878 VWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLY 2057
            VWAVVDVSIDTIRETSGAP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY+ESQVHQLY
Sbjct: 463  VWAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLY 522

Query: 2058 KPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDN 2237
            +P +SSGMGFGAQRWVATLQRQCECLAILMSS V  RDHTAITA GRRSMLKLAQRMTDN
Sbjct: 523  RPLLSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDN 582

Query: 2238 FCAGVCASTVHKWNKL-NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXX 2414
            FCAGVCAST+HKWNKL NAG+VDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ  
Sbjct: 583  FCAGVCASTLHKWNKLNNAGDVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 642

Query: 2415 XXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETC 2594
                      SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETC
Sbjct: 643  FDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 702

Query: 2595 IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD------XXXXXXXXXXXX 2756
            IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                  
Sbjct: 703  IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTSNGHVNGNGG 762

Query: 2757 XXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                        LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 763  GGGGGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 819


>EOX96070.1 HD domain class transcription factor isoform 2 [Theobroma cacao]
          Length = 818

 Score = 1303 bits (3373), Expect = 0.0
 Identities = 687/836 (82%), Positives = 713/836 (85%), Gaps = 11/836 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNMP     +A PRL+S P L+K+MFN
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMPTGA--IAQPRLVS-PSLAKNMFN 49

Query: 633  SPGLSLALQQPIDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDVT 812
            SPGLSLALQ  IDNQGD  RMGE NFEG +GRRSRE +EHESRSGSDNM+G SGDD D  
Sbjct: 50   SPGLSLALQPNIDNQGDGTRMGE-NFEGSVGRRSRE-EEHESRSGSDNMDGGSGDDQDAA 107

Query: 813  DKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQM 992
            D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQM
Sbjct: 108  DNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQM 167

Query: 993  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENAR 1172
            KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENAR
Sbjct: 168  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENAR 227

Query: 1173 LKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLGP 1340
            LKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLGP
Sbjct: 228  LKDELDRVCALAGKFLGRPISALATSIAPPMPNSSLELGVGSNGFGGLSTVPTT-LPLGP 286

Query: 1341 ADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLWI 1520
             DFG G T  NAL V   NR   GVTGLDRS+ERSMFLELAL AMDELVKMAQTDEPLWI
Sbjct: 287  -DFGGGIT--NALPVAPPNRPTTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI 343

Query: 1521 RSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRWA 1700
            RSLEG GRE+LN +EYL TFTPCIG+KP GFVTEASRETG+VIIN LALVETLMD  RWA
Sbjct: 344  RSLEG-GREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWA 402

Query: 1701 EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 1880
            EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV
Sbjct: 403  EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 462

Query: 1881 WAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYK 2060
            WAVVDVSIDTIRETSGAP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY+ESQVHQLY+
Sbjct: 463  WAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYR 522

Query: 2061 PFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDNF 2240
            P +SSGMGFGAQRWVATLQRQCECLAILMSS V  RDHTAITA GRRSMLKLAQRMTDNF
Sbjct: 523  PLLSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNF 582

Query: 2241 CAGVCASTVHKWNKL-NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXX 2417
            CAGVCAST+HKWNKL NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ   
Sbjct: 583  CAGVCASTLHKWNKLNNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLF 642

Query: 2418 XXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCI 2597
                     SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETCI
Sbjct: 643  DFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETCI 702

Query: 2598 DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD------XXXXXXXXXXXXX 2759
            DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                   
Sbjct: 703  DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTSNGHVNGNGGG 762

Query: 2760 XXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                       LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 763  GGGRSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 818


>OMP00515.1 hypothetical protein COLO4_12612 [Corchorus olitorius]
          Length = 819

 Score = 1302 bits (3370), Expect = 0.0
 Identities = 686/838 (81%), Positives = 717/838 (85%), Gaps = 13/838 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ N+            ARIVADI Y+NNMP     +A PRL+S P L+K+MFN
Sbjct: 1    MSFGGFLDNNSGGGG--------ARIVADIPYSNNMPTGV--IAQPRLVS-PSLAKNMFN 49

Query: 633  SPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDV 809
            SPGLSLALQQP IDNQGD  RMGE NFE  +GRRSRE +EHESRSGSDNM+GASGDD D 
Sbjct: 50   SPGLSLALQQPNIDNQGDGTRMGE-NFEASVGRRSRE-EEHESRSGSDNMDGASGDDQDA 107

Query: 810  TDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ 989
             D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQ
Sbjct: 108  ADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQ 167

Query: 990  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENA 1169
            MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA++GDISLEEQHLRIENA
Sbjct: 168  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIMGDISLEEQHLRIENA 227

Query: 1170 RLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLG 1337
            RLKDELDRVCALAGKFLGR                  ELGVGNNGFGGLSTV TT LPLG
Sbjct: 228  RLKDELDRVCALAGKFLGRPISALASSIAPPMPNSSLELGVGNNGFGGLSTVPTT-LPLG 286

Query: 1338 PADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLW 1517
            P DFG+G    NAL V+   R   GVTGLDRS+ERSMFLELAL AMDELVKMAQTDEPLW
Sbjct: 287  P-DFGSG---INALPVVPATRPTAGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLW 342

Query: 1518 IRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRW 1697
            I+SLEG GRE LN +EYL +FTPCIG+KP+GFVTEASRETG+VIIN LALVETLMD NRW
Sbjct: 343  IKSLEG-GRETLNYDEYLRSFTPCIGMKPSGFVTEASRETGVVIINSLALVETLMDSNRW 401

Query: 1698 AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 1877
            AEMFPCMIARTSTTDVIS GMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG
Sbjct: 402  AEMFPCMIARTSTTDVISGGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 461

Query: 1878 VWAVVDVSIDTIRETSGAP-AFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQL 2054
            VWAVVDVSID+IRETSGAP  ++NCRRLPSGCVVQDMPNGYSKVTWVEHAEY+ESQVHQL
Sbjct: 462  VWAVVDVSIDSIRETSGAPPTYLNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQL 521

Query: 2055 YKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTD 2234
            Y+P +SSGMGFGAQRWVATLQRQCECLAILMSS+V ARDHTAITA GRRSMLKLAQRMTD
Sbjct: 522  YRPLLSSGMGFGAQRWVATLQRQCECLAILMSSSVPARDHTAITASGRRSMLKLAQRMTD 581

Query: 2235 NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXX 2414
            NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ  
Sbjct: 582  NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 641

Query: 2415 XXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETC 2594
                      SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETC
Sbjct: 642  FDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 701

Query: 2595 IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXXXX 2774
            IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                  
Sbjct: 702  IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTSNGHVNGNGA 761

Query: 2775 XXXXXX-------LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                         LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 762  AGGGAGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 819


>XP_007051913.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X2 [Theobroma cacao]
          Length = 818

 Score = 1302 bits (3370), Expect = 0.0
 Identities = 686/836 (82%), Positives = 714/836 (85%), Gaps = 11/836 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNMP     +A PRL+S P L+K+MFN
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMPTGA--IAQPRLVS-PSLAKNMFN 49

Query: 633  SPGLSLALQQPIDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDVT 812
            SPGLSLALQ  IDNQGD  RMGE NFEG +GRRSRE +EHESRSGSDNM+G SGDD D  
Sbjct: 50   SPGLSLALQPNIDNQGDGTRMGE-NFEGSVGRRSRE-EEHESRSGSDNMDGGSGDDQDAA 107

Query: 813  DKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQM 992
            D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQM
Sbjct: 108  DNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQM 167

Query: 993  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENAR 1172
            KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENAR
Sbjct: 168  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENAR 227

Query: 1173 LKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLGP 1340
            LKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLGP
Sbjct: 228  LKDELDRVCALAGKFLGRPISALATSIAPPMPNSSLELGVGSNGFGGLSTVPTT-LPLGP 286

Query: 1341 ADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLWI 1520
             DFG G T  NAL V   NR+  GVTGLDRS+ERSMFLELAL AMDELVKMAQTDEPLWI
Sbjct: 287  -DFGGGIT--NALPVAPPNRATTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI 343

Query: 1521 RSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRWA 1700
            RSLEG GRE+LN +EYL TFTPCIG+KP GFVTEASRETG+VIIN LALVETLMD  RWA
Sbjct: 344  RSLEG-GREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWA 402

Query: 1701 EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 1880
            EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV
Sbjct: 403  EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 462

Query: 1881 WAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYK 2060
            WAVVDVSIDTIRETSGAP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY+ESQVHQLY+
Sbjct: 463  WAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYR 522

Query: 2061 PFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDNF 2240
            P +SSGMGFGAQRWVATLQRQCECLAILMSS V  RDHTAITA GRRSMLKLAQRMTDNF
Sbjct: 523  PLLSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNF 582

Query: 2241 CAGVCASTVHKWNKL-NAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXX 2417
            CAGVCAST+HKWNKL NAG+VDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ   
Sbjct: 583  CAGVCASTLHKWNKLNNAGDVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLF 642

Query: 2418 XXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCI 2597
                     SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETCI
Sbjct: 643  DFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETCI 702

Query: 2598 DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD------XXXXXXXXXXXXX 2759
            DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                   
Sbjct: 703  DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTSNGHVNGNGGG 762

Query: 2760 XXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                       LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 763  GGGGSQRVGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 818


>OAY49229.1 hypothetical protein MANES_05G039400 [Manihot esculenta]
          Length = 822

 Score = 1286 bits (3327), Expect = 0.0
 Identities = 685/839 (81%), Positives = 711/839 (84%), Gaps = 14/839 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTN-NMPXXXXXLAHPRLLSTPPLSKSMF 629
            MSFGGFLE  +            ARIVADI Y++ NMP     +A PRL+S P L+K+MF
Sbjct: 1    MSFGGFLENGSPGGGG-------ARIVADIPYSSSNMPTGA--IAQPRLIS-PSLTKAMF 50

Query: 630  NSPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLD 806
            NSPGLSLALQQP ID QGD+ RM E NFE   GRRSRE +EHESRSGSDNM+GASGDD D
Sbjct: 51   NSPGLSLALQQPNIDGQGDIARMAE-NFESNGGRRSRE-EEHESRSGSDNMDGASGDDQD 108

Query: 807  VTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT 986
              D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRT
Sbjct: 109  AADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRT 168

Query: 987  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIEN 1166
            QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPIC+NCGGPA+IGDISLEEQHLRIEN
Sbjct: 169  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGDISLEEQHLRIEN 228

Query: 1167 ARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPL 1334
            ARLKDELDRVCALAGKFLGR                  ELGVG NGF GLSTV  T LPL
Sbjct: 229  ARLKDELDRVCALAGKFLGRPISSLAGSIGPPMPNSSLELGVGTNGFSGLSTVPAT-LPL 287

Query: 1335 GPADFGTGGTSNNALQVMTQNRSGP-GVTGLDRSIERSMFLELALTAMDELVKMAQTDEP 1511
            GP DF  GG S  AL VMTQ R    GVTGLDRS ERSMFLELAL AMDELVKMAQTDEP
Sbjct: 288  GP-DFA-GGISG-ALPVMTQTRPATAGVTGLDRSFERSMFLELALAAMDELVKMAQTDEP 344

Query: 1512 LWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPN 1691
            LWIRSLEG GRE+LN EEY+ TFTPCIG+KP GFV+EASRETGMVIIN LALVETLMD N
Sbjct: 345  LWIRSLEG-GREILNHEEYMRTFTPCIGMKPGGFVSEASRETGMVIINSLALVETLMDSN 403

Query: 1692 RWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHA 1871
            RWAEMFPCMIARTSTTDVIS+GMGGTRNG+LQLM AELQVLSPLVPVREVNFLRFCKQHA
Sbjct: 404  RWAEMFPCMIARTSTTDVISNGMGGTRNGSLQLMLAELQVLSPLVPVREVNFLRFCKQHA 463

Query: 1872 EGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQ 2051
            EGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE+Q+HQ
Sbjct: 464  EGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDETQIHQ 523

Query: 2052 LYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMT 2231
            LY+P +SSGMGFGAQRWVATLQRQCECLAILMSSAV  RDHTAITA GRRSMLKLAQRMT
Sbjct: 524  LYRPLISSGMGFGAQRWVATLQRQCECLAILMSSAVPTRDHTAITASGRRSMLKLAQRMT 583

Query: 2232 DNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQX 2411
            DNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ 
Sbjct: 584  DNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQR 643

Query: 2412 XXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQET 2591
                       SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQET
Sbjct: 644  LFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQET 703

Query: 2592 CIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXXX 2771
            CIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                 
Sbjct: 704  CIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGSLSTPNGPTGN 763

Query: 2772 XXXXXXX-------LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                          LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 764  NGGGTGGQQRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 822


>XP_015584500.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X2 [Ricinus communis]
          Length = 824

 Score = 1283 bits (3320), Expect = 0.0
 Identities = 685/842 (81%), Positives = 711/842 (84%), Gaps = 17/842 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLSTPP 611
            MSFGGFLE  +            ARIVADI + NN       MP     +A PRLLS P 
Sbjct: 1    MSFGGFLENGSPGGGG-------ARIVADIPFNNNSSSSSTNMPTGA--IAQPRLLS-PS 50

Query: 612  LSKSMFNSPGLSLALQQP-IDNQGD-MQRMGESNFEGIIGRRSRENQEHESRSGSDNMEG 785
             +KSMFNSPGLSLALQQP ID QGD + RM E NFE I GRRSRE +EHESRSGSDNM+G
Sbjct: 51   FTKSMFNSPGLSLALQQPNIDGQGDHVARMAE-NFETIGGRRSRE-EEHESRSGSDNMDG 108

Query: 786  ASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKF 965
            ASGDD D  D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKF
Sbjct: 109  ASGDDQDAADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKF 168

Query: 966  WFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEE 1145
            WFQNRRTQMKTQLERHENSLLRQENDKLRAENM+IRDAMRNPIC+NCGGPA+IGDISLEE
Sbjct: 169  WFQNRRTQMKTQLERHENSLLRQENDKLRAENMTIRDAMRNPICSNCGGPAIIGDISLEE 228

Query: 1146 QHLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTV 1313
            QHLRIENARLKDELDRVCALAGKFLGR                  ELGVGNNGF GLSTV
Sbjct: 229  QHLRIENARLKDELDRVCALAGKFLGRPISSLASSIGPPMPNSSLELGVGNNGFAGLSTV 288

Query: 1314 TTTTLPLGPADFGTGGTSNNALQVMTQNRSG-PGVTGLDRSIERSMFLELALTAMDELVK 1490
             TT LPLGP DFG GG S   L V+TQ R G  GVTGLDRS+ERSMFLELAL AMDELVK
Sbjct: 289  ATT-LPLGP-DFG-GGIST--LNVVTQTRPGNTGVTGLDRSLERSMFLELALAAMDELVK 343

Query: 1491 MAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALV 1670
            MAQTD+PLWIRSLEG GRE+LN EEY+ TFTPCIG+KP+GFV EASRE GMVIIN LALV
Sbjct: 344  MAQTDDPLWIRSLEG-GREMLNHEEYVRTFTPCIGMKPSGFVFEASREAGMVIINSLALV 402

Query: 1671 ETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFL 1850
            ETLMD NRWAEMFPC+IARTSTTDVISSGMGGTRNG+LQLMHAELQVLSPLVPVREVNFL
Sbjct: 403  ETLMDSNRWAEMFPCVIARTSTTDVISSGMGGTRNGSLQLMHAELQVLSPLVPVREVNFL 462

Query: 1851 RFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY 2030
            RFCKQHAEGVWAVVDVSIDTIRETSG PAF NCRRLPSGCVVQDMPNGYSKVTWVEHAEY
Sbjct: 463  RFCKQHAEGVWAVVDVSIDTIRETSGGPAFANCRRLPSGCVVQDMPNGYSKVTWVEHAEY 522

Query: 2031 DESQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSML 2210
            DES +HQLY+P +SSGMGFGAQRWVATLQRQCECLAILMSS V ARDHTAITA GRRSML
Sbjct: 523  DESPIHQLYRPLISSGMGFGAQRWVATLQRQCECLAILMSSTVPARDHTAITASGRRSML 582

Query: 2211 KLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVW 2390
            KLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVW
Sbjct: 583  KLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVW 642

Query: 2391 LPVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSS 2570
            LPVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSS
Sbjct: 643  LPVSPQRLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSS 702

Query: 2571 MLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD---XXXXXXX 2741
            MLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD          
Sbjct: 703  MLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGSPTN 762

Query: 2742 XXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC 2921
                             LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC
Sbjct: 763  QNGGGNNGGGPNRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC 822

Query: 2922 ES 2927
            ES
Sbjct: 823  ES 824


>XP_018813891.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
            [Juglans regia]
          Length = 823

 Score = 1281 bits (3316), Expect = 0.0
 Identities = 682/842 (80%), Positives = 707/842 (83%), Gaps = 17/842 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTN-------NMPXXXXXLAHPRLLSTPP 611
            MSFGGFLE  T            AR+VADISY N       NMP     LAHPRL+ TPP
Sbjct: 1    MSFGGFLENTTGGG---------ARVVADISYNNSNGNNNHNMPAGA--LAHPRLV-TPP 48

Query: 612  LSKSMFNS-PGLSLALQQPIDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGA 788
            L+KSMFNS PGLSL LQ  ID Q D+ RM E NFE  +GRRSR+ +EHESRSGSDNM+GA
Sbjct: 49   LTKSMFNSSPGLSLGLQPNIDGQADVARMAE-NFEVNLGRRSRD-EEHESRSGSDNMDGA 106

Query: 789  SGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFW 968
            SGDDLD  D PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLETRQVKFW
Sbjct: 107  SGDDLDAADNPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLETRQVKFW 166

Query: 969  FQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQ 1148
            FQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPIC+NCGGPA+IG+ISLEEQ
Sbjct: 167  FQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGEISLEEQ 226

Query: 1149 HLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVT 1316
            HLRIENARLKDELDRVCALAGKFLGR                  ELGVG+NGF GLS V 
Sbjct: 227  HLRIENARLKDELDRVCALAGKFLGRPISSLANAIGPPLPSSSLELGVGSNGFAGLSHVA 286

Query: 1317 TTTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMA 1496
            TT LPLGP DFG G   +  L V+   R    +TGLDRSIERSMFLELAL AMDELVKMA
Sbjct: 287  TT-LPLGP-DFGVG--ISGVLPVVPPARPPSSLTGLDRSIERSMFLELALAAMDELVKMA 342

Query: 1497 QTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVET 1676
            QTDE LW+RSLEG  RE+LNLEEYL TFTPCIG+KPNGFVTEASRETGMVIIN LALVET
Sbjct: 343  QTDESLWVRSLEGR-REMLNLEEYLRTFTPCIGMKPNGFVTEASRETGMVIINSLALVET 401

Query: 1677 LMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRF 1856
            LMD NRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRF
Sbjct: 402  LMDSNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRF 461

Query: 1857 CKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE 2036
            CKQHAEGVWAVVDVSID+IRETS AP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE
Sbjct: 462  CKQHAEGVWAVVDVSIDSIRETSAAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE 521

Query: 2037 SQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHT-AITAGGRRSMLK 2213
            SQVHQLY+P +SSGMGFGAQRW+ATLQRQCECLAILMSS V ARDHT AITAGGRRSMLK
Sbjct: 522  SQVHQLYRPLLSSGMGFGAQRWIATLQRQCECLAILMSSTVPARDHTAAITAGGRRSMLK 581

Query: 2214 LAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL 2393
            LAQRMTDNFCAGVCASTVHKWNKL AGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL
Sbjct: 582  LAQRMTDNFCAGVCASTVHKWNKLQAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL 641

Query: 2394 PVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSM 2573
            PVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRA A+N+NQSSM
Sbjct: 642  PVSPQKLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRAGAMNANQSSM 701

Query: 2574 LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXXXXXX 2741
            LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD           
Sbjct: 702  LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPGTAT 761

Query: 2742 XXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC 2921
                             LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC
Sbjct: 762  ANGGGGAGPGPHRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQC 821

Query: 2922 ES 2927
            ES
Sbjct: 822  ES 823


>GAV85584.1 Homeobox domain-containing protein/START domain-containing protein
            [Cephalotus follicularis]
          Length = 827

 Score = 1279 bits (3310), Expect = 0.0
 Identities = 679/845 (80%), Positives = 715/845 (84%), Gaps = 20/845 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISY------TNNMPXXXXXLAHPRLLSTPPL 614
            MSFGGFL+++             ARIVADI Y       NNMP     +A PRL+S   L
Sbjct: 1    MSFGGFLDSSPGRGG--------ARIVADIPYKNHNNSNNNMPTGA--IAQPRLVSHS-L 49

Query: 615  SKSMFNSPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGAS 791
            +K+MFNSPGLSLALQQP IDNQGD+ RM E NFE  IGRRSRE +EHESRSGSDNM+G S
Sbjct: 50   AKNMFNSPGLSLALQQPNIDNQGDVTRMAE-NFEASIGRRSRE-EEHESRSGSDNMDGGS 107

Query: 792  GDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWF 971
            GDD D  D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKFWF
Sbjct: 108  GDDQDAADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWF 167

Query: 972  QNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQH 1151
            QNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA++GDISLEE+H
Sbjct: 168  QNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIMGDISLEEEH 227

Query: 1152 LRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTT 1319
            LRIENARLKDELDRVCALAGKFLGR                  ELGVG+NGFGGL +V +
Sbjct: 228  LRIENARLKDELDRVCALAGKFLGRPIPPLTASIGPPMPNSSLELGVGSNGFGGLGSVPS 287

Query: 1320 TTLPLGPADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQ 1499
            T LPLGP DFG G   +N+L V+  NRSG GVTGLDRSIERSMFLELAL AMDELVKMAQ
Sbjct: 288  T-LPLGP-DFGGG--MSNSLSVVPPNRSGTGVTGLDRSIERSMFLELALAAMDELVKMAQ 343

Query: 1500 TDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETL 1679
            ++EPLWIRSLEG GRE+LN EEYL TFTPCIGLKP+GFVTEASRETGMVIIN LALVETL
Sbjct: 344  SEEPLWIRSLEG-GREILNPEEYLRTFTPCIGLKPHGFVTEASRETGMVIINSLALVETL 402

Query: 1680 MDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFC 1859
            MD NRWAEMFPCMIART+TTDVISSGMGGTRNG+LQLMHAELQVLSPLVPVREVNFLRFC
Sbjct: 403  MDSNRWAEMFPCMIARTTTTDVISSGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFC 462

Query: 1860 KQHAEGVWAVVDVSIDTIRETS-GAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDE 2036
            KQHAEGVWAVVDVSIDTIRE S GAP ++NCRRLPSGCVVQDMPNGYSKVTWVEHAEYD+
Sbjct: 463  KQHAEGVWAVVDVSIDTIREASPGAPTYLNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDD 522

Query: 2037 SQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTA-ITAGGRRSMLK 2213
            SQVHQLY+P +  GMGFGAQRWVATLQRQCECLAILMSSAV +RDHTA I+A GRRSMLK
Sbjct: 523  SQVHQLYRPLLGCGMGFGAQRWVATLQRQCECLAILMSSAVPSRDHTAAISASGRRSMLK 582

Query: 2214 LAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL 2393
            LAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL
Sbjct: 583  LAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL 642

Query: 2394 PVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSM 2573
            PVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSM
Sbjct: 643  PVSPQRLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSM 702

Query: 2574 LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXX 2753
            LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD           
Sbjct: 703  LILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPTTNG 762

Query: 2754 XXXXXXXXXXXXX-------LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA 2912
                                LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA
Sbjct: 763  PTSNSNGGPGGGGSHRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAA 822

Query: 2913 LQCES 2927
            LQCES
Sbjct: 823  LQCES 827


>XP_002511801.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Ricinus communis] EEF50470.1 homeobox protein,
            putative [Ricinus communis]
          Length = 825

 Score = 1278 bits (3308), Expect = 0.0
 Identities = 685/843 (81%), Positives = 711/843 (84%), Gaps = 18/843 (2%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-------MPXXXXXLAHPRLLSTPP 611
            MSFGGFLE  +            ARIVADI + NN       MP     +A PRLLS P 
Sbjct: 1    MSFGGFLENGSPGGGG-------ARIVADIPFNNNSSSSSTNMPTGA--IAQPRLLS-PS 50

Query: 612  LSKSMFNSPGLSLALQQP-IDNQGD-MQRMGESNFEGIIGRRSRENQEHESRSGSDNMEG 785
             +KSMFNSPGLSLALQQP ID QGD + RM E NFE I GRRSRE +EHESRSGSDNM+G
Sbjct: 51   FTKSMFNSPGLSLALQQPNIDGQGDHVARMAE-NFETIGGRRSRE-EEHESRSGSDNMDG 108

Query: 786  ASGDDLDVTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKF 965
            ASGDD D  D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELS+RLCLETRQVKF
Sbjct: 109  ASGDDQDAADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKF 168

Query: 966  WFQNRRTQMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEE 1145
            WFQNRRTQMKTQLERHENSLLRQENDKLRAENM+IRDAMRNPIC+NCGGPA+IGDISLEE
Sbjct: 169  WFQNRRTQMKTQLERHENSLLRQENDKLRAENMTIRDAMRNPICSNCGGPAIIGDISLEE 228

Query: 1146 QHLRIENARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTV 1313
            QHLRIENARLKDELDRVCALAGKFLGR                  ELGVGNNGF GLSTV
Sbjct: 229  QHLRIENARLKDELDRVCALAGKFLGRPISSLASSIGPPMPNSSLELGVGNNGFAGLSTV 288

Query: 1314 TTTTLPLGPADFGTGGTSNNALQVMTQNRSG-PGVTGLDRSIERSMFLELALTAMDELVK 1490
             TT LPLGP DFG GG S   L V+TQ R G  GVTGLDRS+ERSMFLELAL AMDELVK
Sbjct: 289  ATT-LPLGP-DFG-GGIST--LNVVTQTRPGNTGVTGLDRSLERSMFLELALAAMDELVK 343

Query: 1491 MAQTDEPLWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALV 1670
            MAQTD+PLWIRSLEG GRE+LN EEY+ TFTPCIG+KP+GFV EASRE GMVIIN LALV
Sbjct: 344  MAQTDDPLWIRSLEG-GREMLNHEEYVRTFTPCIGMKPSGFVFEASREAGMVIINSLALV 402

Query: 1671 ETLMDPNRWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFL 1850
            ETLMD NRWAEMFPC+IARTSTTDVISSGMGGTRNG+LQLMHAELQVLSPLVPVREVNFL
Sbjct: 403  ETLMDSNRWAEMFPCVIARTSTTDVISSGMGGTRNGSLQLMHAELQVLSPLVPVREVNFL 462

Query: 1851 RFCKQHAEGVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEY 2030
            RFCKQHAEGVWAVVDVSIDTIRETSG PAF NCRRLPSGCVVQDMPNGYSKVTWVEHAEY
Sbjct: 463  RFCKQHAEGVWAVVDVSIDTIRETSGGPAFANCRRLPSGCVVQDMPNGYSKVTWVEHAEY 522

Query: 2031 DESQVHQLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHT-AITAGGRRSM 2207
            DES +HQLY+P +SSGMGFGAQRWVATLQRQCECLAILMSS V ARDHT AITA GRRSM
Sbjct: 523  DESPIHQLYRPLISSGMGFGAQRWVATLQRQCECLAILMSSTVPARDHTAAITASGRRSM 582

Query: 2208 LKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSV 2387
            LKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSV
Sbjct: 583  LKLAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSV 642

Query: 2388 WLPVSPQXXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQS 2567
            WLPVSPQ            SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQS
Sbjct: 643  WLPVSPQRLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQS 702

Query: 2568 SMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD---XXXXXX 2738
            SMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD         
Sbjct: 703  SMLILQETCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGSPT 762

Query: 2739 XXXXXXXXXXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQ 2918
                              LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQ
Sbjct: 763  NQNGGGNNGGGPNRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQ 822

Query: 2919 CES 2927
            CES
Sbjct: 823  CES 825


>XP_010661562.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X2 [Vitis vinifera]
          Length = 810

 Score = 1277 bits (3305), Expect = 0.0
 Identities = 671/829 (80%), Positives = 705/829 (85%), Gaps = 4/829 (0%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNM      +A PRL+S P L+KSMF+
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMATGA--IAQPRLVS-PSLAKSMFS 49

Query: 633  SPGLSLALQQPIDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDVT 812
            SPGLSLALQ  ++ QG++ R+ E NFE   GRRSRE+ EHESRSGSDNM+GASGDD D  
Sbjct: 50   SPGLSLALQTSMEGQGEVTRLAE-NFESGGGRRSRED-EHESRSGSDNMDGASGDDQDAA 107

Query: 813  DKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQM 992
            D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELSRRL LETRQVKFWFQNRRTQM
Sbjct: 108  DNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLSLETRQVKFWFQNRRTQM 167

Query: 993  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENAR 1172
            KTQLERHENS+LRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENAR
Sbjct: 168  KTQLERHENSILRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENAR 227

Query: 1173 LKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLGP 1340
            LKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLG 
Sbjct: 228  LKDELDRVCALAGKFLGRPISSLASSMAPAMPSSSLELGVGSNGFGGLSTVATT-LPLGH 286

Query: 1341 ADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLWI 1520
             DFG G +S   +   T   S  GVTGL+RS+ERSMFLELAL AMDELVKMAQTDEPLW+
Sbjct: 287  -DFGGGISSTLPVAPPT---STTGVTGLERSLERSMFLELALAAMDELVKMAQTDEPLWV 342

Query: 1521 RSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRWA 1700
            RSLEG GRE+LNLEEY+ TFTPCIG+KP+GFVTE++RETGMVIIN LALVETLMD NRWA
Sbjct: 343  RSLEG-GREILNLEEYMRTFTPCIGMKPSGFVTESTRETGMVIINSLALVETLMDSNRWA 401

Query: 1701 EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 1880
            EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV
Sbjct: 402  EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 461

Query: 1881 WAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYK 2060
            WAVVDVSIDTIRETS AP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDES VHQLY+
Sbjct: 462  WAVVDVSIDTIRETSVAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESAVHQLYR 521

Query: 2061 PFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDNF 2240
            P + SGMGFGAQRWVATLQRQCECLAILMSS V  RDHTAITAGGRRSMLKLAQRMTDNF
Sbjct: 522  PLLGSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITAGGRRSMLKLAQRMTDNF 581

Query: 2241 CAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXXX 2420
            CAGVCASTVHKWNKL AGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ    
Sbjct: 582  CAGVCASTVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFD 641

Query: 2421 XXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCID 2600
                    SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETCID
Sbjct: 642  FLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETCID 701

Query: 2601 AAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXXXXXX 2780
            AAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                    
Sbjct: 702  AAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPNSGVHTNSGGPNR 761

Query: 2781 XXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAAL CES
Sbjct: 762  VSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALHCES 810


>KDO86029.1 hypothetical protein CISIN_1g002869mg [Citrus sinensis]
          Length = 778

 Score = 1275 bits (3300), Expect = 0.0
 Identities = 665/769 (86%), Positives = 677/769 (88%), Gaps = 12/769 (1%)
 Frame = +3

Query: 657  QQP-IDNQG----DMQRMGESNFEGIIGRRSRENQ-EHESRSGSDNMEGASGDDLDVTDK 818
            QQP IDNQG     +QRMGES FEGIIGRRSRE+  EHESRSGSDNM+GASGDDLD  D 
Sbjct: 15   QQPNIDNQGGGDLQLQRMGES-FEGIIGRRSREDLLEHESRSGSDNMDGASGDDLDAADN 73

Query: 819  PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMKT 998
            PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELS+RLCLETRQVKFWFQNRRTQMKT
Sbjct: 74   PPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKT 133

Query: 999  QLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENARLK 1178
            QLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENARLK
Sbjct: 134  QLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLK 193

Query: 1179 DELDRVCALAGKFLGRXXXXXXXXXXXXXX-ELGVGN-NGFGGLSTVTTTTLPLGPADFG 1352
            DELDRVCALAGKFLGR               ELGVG  NGFGGLS+  TTTLP   ADFG
Sbjct: 194  DELDRVCALAGKFLGRPVSSMGPPPMPNSSLELGVGTINGFGGLSSTVTTTLP---ADFG 250

Query: 1353 TGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLWIRSLE 1532
            TG  SN    VM  NRSGPGVTGLDRSIERSMFLELAL AMDELVKMAQTDEPLWIRS E
Sbjct: 251  TG-ISNALPVVMPPNRSGPGVTGLDRSIERSMFLELALAAMDELVKMAQTDEPLWIRSFE 309

Query: 1533 GPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRWAEMFP 1712
            G GR+VLN EEYL TFTPCIGLKPNGFVTEASRETGMVIIN LALVETLMDPNRWAEMFP
Sbjct: 310  GSGRQVLNHEEYLRTFTPCIGLKPNGFVTEASRETGMVIINSLALVETLMDPNRWAEMFP 369

Query: 1713 CMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 1892
            CMIART+TTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV
Sbjct: 370  CMIARTATTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 429

Query: 1893 DVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKPFVS 2072
            DVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKP + 
Sbjct: 430  DVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYKPLII 489

Query: 2073 SGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTDNFCAGV 2252
            SGMGFGAQRWVATLQRQCECLAILMS++V ARDHTAITAGGRRSMLKLAQRMTDNFCAGV
Sbjct: 490  SGMGFGAQRWVATLQRQCECLAILMSTSVSARDHTAITAGGRRSMLKLAQRMTDNFCAGV 549

Query: 2253 CASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXXXXXXX 2432
            CASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ        
Sbjct: 550  CASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFNFLRD 609

Query: 2433 XXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCIDAAGS 2612
                SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAIN+NQSSMLILQETC DAAGS
Sbjct: 610  ERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINANQSSMLILQETCTDAAGS 669

Query: 2613 LVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXXXXXXXXXXXXXXXXXXX 2780
            LVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                        
Sbjct: 670  LVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPDSRGPLANGPTSGNGSNGGSQR 729

Query: 2781 XXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 730  VGGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 778


>XP_002272264.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Vitis vinifera] XP_010661561.1 PREDICTED:
            homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Vitis vinifera]
          Length = 811

 Score = 1273 bits (3293), Expect = 0.0
 Identities = 671/830 (80%), Positives = 705/830 (84%), Gaps = 5/830 (0%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFL+ ++            ARIVADI Y+NNM      +A PRL+S P L+KSMF+
Sbjct: 1    MSFGGFLDNSSGGGG--------ARIVADIPYSNNMATGA--IAQPRLVS-PSLAKSMFS 49

Query: 633  SPGLSLALQQPIDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDVT 812
            SPGLSLALQ  ++ QG++ R+ E NFE   GRRSRE+ EHESRSGSDNM+GASGDD D  
Sbjct: 50   SPGLSLALQTSMEGQGEVTRLAE-NFESGGGRRSRED-EHESRSGSDNMDGASGDDQDAA 107

Query: 813  DKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQM 992
            D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELSRRL LETRQVKFWFQNRRTQM
Sbjct: 108  DNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLSLETRQVKFWFQNRRTQM 167

Query: 993  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENAR 1172
            KTQLERHENS+LRQENDKLRAENMSIRDAMRNPICTNCGGPA+IGDISLEEQHLRIENAR
Sbjct: 168  KTQLERHENSILRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENAR 227

Query: 1173 LKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPLGP 1340
            LKDELDRVCALAGKFLGR                  ELGVG+NGFGGLSTV TT LPLG 
Sbjct: 228  LKDELDRVCALAGKFLGRPISSLASSMAPAMPSSSLELGVGSNGFGGLSTVATT-LPLGH 286

Query: 1341 ADFGTGGTSNNALQVMTQNRSGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLWI 1520
             DFG G +S   +   T   S  GVTGL+RS+ERSMFLELAL AMDELVKMAQTDEPLW+
Sbjct: 287  -DFGGGISSTLPVAPPT---STTGVTGLERSLERSMFLELALAAMDELVKMAQTDEPLWV 342

Query: 1521 RSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRWA 1700
            RSLEG GRE+LNLEEY+ TFTPCIG+KP+GFVTE++RETGMVIIN LALVETLMD NRWA
Sbjct: 343  RSLEG-GREILNLEEYMRTFTPCIGMKPSGFVTESTRETGMVIINSLALVETLMDSNRWA 401

Query: 1701 EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 1880
            EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV
Sbjct: 402  EMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGV 461

Query: 1881 WAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQLYK 2060
            WAVVDVSIDTIRETS AP FVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDES VHQLY+
Sbjct: 462  WAVVDVSIDTIRETSVAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESAVHQLYR 521

Query: 2061 PFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHT-AITAGGRRSMLKLAQRMTDN 2237
            P + SGMGFGAQRWVATLQRQCECLAILMSS V  RDHT AITAGGRRSMLKLAQRMTDN
Sbjct: 522  PLLGSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAAITAGGRRSMLKLAQRMTDN 581

Query: 2238 FCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXXX 2417
            FCAGVCASTVHKWNKL AGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ   
Sbjct: 582  FCAGVCASTVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLF 641

Query: 2418 XXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETCI 2597
                     SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETCI
Sbjct: 642  DFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETCI 701

Query: 2598 DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXXXXX 2777
            DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                   
Sbjct: 702  DAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPNSGVHTNSGGPN 761

Query: 2778 XXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                 LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAAL CES
Sbjct: 762  RVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALHCES 811


>XP_002320755.1 homeodomain family protein [Populus trichocarpa] EEE99070.1
            homeodomain family protein [Populus trichocarpa]
          Length = 823

 Score = 1269 bits (3285), Expect = 0.0
 Identities = 677/840 (80%), Positives = 710/840 (84%), Gaps = 15/840 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-MPXXXXXLAHPRLLSTPPLSKSMF 629
            MSFGGFLE NTS           ARIVADI Y NN MP     +  PRL+S P ++KSMF
Sbjct: 1    MSFGGFLE-NTSPGGGG------ARIVADIPYNNNNMPTGA--IVQPRLVS-PSITKSMF 50

Query: 630  NSPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLD 806
            NSPGLSLALQQP ID QGD+ RM E NFE  +GRRSRE +EHESRSGSDNM+GASGDD D
Sbjct: 51   NSPGLSLALQQPNIDGQGDITRMSE-NFETSVGRRSRE-EEHESRSGSDNMDGASGDDQD 108

Query: 807  VTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT 986
              D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT
Sbjct: 109  AADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT 168

Query: 987  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIEN 1166
            QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNP+C+NCGGPA+IGDISLEEQHLRIEN
Sbjct: 169  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPMCSNCGGPAIIGDISLEEQHLRIEN 228

Query: 1167 ARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPL 1334
            ARLKDELDRVCALAGKFLGR                  ELGVG+NGF GLSTV TT LPL
Sbjct: 229  ARLKDELDRVCALAGKFLGRPISSLASSLGPPMPNSSLELGVGSNGFAGLSTVATT-LPL 287

Query: 1335 GPADFGTGGTSNNALQVMTQNRSGP-GVTGLDRSIERSMFLELALTAMDELVKMAQTDEP 1511
            GP DF  GG S  AL V+TQ R    GVTG+ RS+ERSMFLELAL AMDELVKMAQTDEP
Sbjct: 288  GP-DF-VGGISG-ALPVLTQTRPATTGVTGIGRSLERSMFLELALAAMDELVKMAQTDEP 344

Query: 1512 LWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPN 1691
            LWIRS +G GRE+LN EEYL T TPCIG+KP+GFV+EASRETGMVIIN LALVETLMD N
Sbjct: 345  LWIRSFDG-GREILNHEEYLRTITPCIGMKPSGFVSEASRETGMVIINSLALVETLMDSN 403

Query: 1692 RWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHA 1871
            RWAEMFPC+IARTSTTDVI++GMGGTRNG+LQLMHAELQVLSPLVPVREVNFLRFCKQHA
Sbjct: 404  RWAEMFPCVIARTSTTDVIANGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFCKQHA 463

Query: 1872 EGVWAVVDVSIDTIRETSGA-PAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVH 2048
            EGVWAVVDVS+DTIRETSGA P FVNCRRLPSGCVVQDMPNGYSKVTW+EHAEYDESQ H
Sbjct: 464  EGVWAVVDVSVDTIRETSGASPTFVNCRRLPSGCVVQDMPNGYSKVTWIEHAEYDESQTH 523

Query: 2049 QLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRM 2228
            QLY+P +SSGMGFGAQRW+ATLQRQ ECLAILMSS V +RDHTAITA GRRSMLKLAQRM
Sbjct: 524  QLYRPLISSGMGFGAQRWIATLQRQSECLAILMSSNVPSRDHTAITASGRRSMLKLAQRM 583

Query: 2229 TDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ 2408
            T NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ
Sbjct: 584  TANFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ 643

Query: 2409 XXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQE 2588
                        SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQE
Sbjct: 644  RLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQE 703

Query: 2589 TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXX 2768
            TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                
Sbjct: 704  TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPPTTNGGPTA 763

Query: 2769 XXXXXXXX-------LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                           LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 764  NNNSNGGGPERVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 823


>XP_011035097.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 [Populus
            euphratica]
          Length = 823

 Score = 1269 bits (3283), Expect = 0.0
 Identities = 677/840 (80%), Positives = 709/840 (84%), Gaps = 15/840 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNN-MPXXXXXLAHPRLLSTPPLSKSMF 629
            MSFGGFLE NTS           ARIVADI Y NN MP     +  PRL+S P ++KSMF
Sbjct: 1    MSFGGFLE-NTSPGGGG------ARIVADIPYNNNNMPTGA--IVQPRLVS-PSITKSMF 50

Query: 630  NSPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLD 806
            NSPGLSLALQQP ID QGD+ RM E NFE  +GRRSRE +EHESRSGSDNM+GASGDD D
Sbjct: 51   NSPGLSLALQQPNIDGQGDITRMSE-NFETSVGRRSRE-EEHESRSGSDNMDGASGDDQD 108

Query: 807  VTDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT 986
              D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT
Sbjct: 109  AADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRT 168

Query: 987  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIEN 1166
            QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNP+C+NCGGPA+IGDISLEEQHLRIEN
Sbjct: 169  QMKTQLERHENSLLRQENDKLRAENMSIRDAMRNPMCSNCGGPAIIGDISLEEQHLRIEN 228

Query: 1167 ARLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX----ELGVGNNGFGGLSTVTTTTLPL 1334
            ARLKDELDRVCALAGKFLGR                  ELGVG+NGF GLSTV TT LPL
Sbjct: 229  ARLKDELDRVCALAGKFLGRPISSLASSLGPPMPNSSLELGVGSNGFAGLSTVATT-LPL 287

Query: 1335 GPADFGTGGTSNNALQVMTQNRSGP-GVTGLDRSIERSMFLELALTAMDELVKMAQTDEP 1511
            GP DF  GG S  AL V+ Q R    GVTG+ RS+ERSMFLELAL AMDELVKMAQTDEP
Sbjct: 288  GP-DF-VGGISG-ALPVLAQTRPATTGVTGIGRSLERSMFLELALAAMDELVKMAQTDEP 344

Query: 1512 LWIRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPN 1691
            LWIRS +G GRE+LN EEYL T TPCIG+KP+GFV+EASRETGMVIIN LALVETLMD N
Sbjct: 345  LWIRSFDG-GREILNHEEYLRTITPCIGMKPSGFVSEASRETGMVIINSLALVETLMDSN 403

Query: 1692 RWAEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHA 1871
            RWAEMFPC+IARTSTTDVI++GMGGTRNG+LQLMHAELQVLSPLVPVREVNFLRFCKQHA
Sbjct: 404  RWAEMFPCVIARTSTTDVIANGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFCKQHA 463

Query: 1872 EGVWAVVDVSIDTIRETSGAP-AFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVH 2048
            EGVWAVVDVS+DTIRETSGAP  FVNCRRLPSGCVVQDMPNGYSKVTW+EHAEYDESQ H
Sbjct: 464  EGVWAVVDVSVDTIRETSGAPPTFVNCRRLPSGCVVQDMPNGYSKVTWIEHAEYDESQTH 523

Query: 2049 QLYKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRM 2228
            QLY+P +SSGMGFGAQRW+ATLQRQ ECLAILMSS V +RDHTAITA GRRSMLKLAQRM
Sbjct: 524  QLYRPLISSGMGFGAQRWIATLQRQSECLAILMSSNVPSRDHTAITASGRRSMLKLAQRM 583

Query: 2229 TDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ 2408
            T NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ
Sbjct: 584  TANFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ 643

Query: 2409 XXXXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQE 2588
                        SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+NSNQSSMLILQE
Sbjct: 644  RLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNSNQSSMLILQE 703

Query: 2589 TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDXXXXXXXXXXXXXXXX 2768
            TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD                
Sbjct: 704  TCIDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRGPPTTNGGPTA 763

Query: 2769 XXXXXXXX-------LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                           LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 764  NNNSNGCGPDRVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 823


>XP_002301331.2 homeodomain family protein [Populus trichocarpa] EEE80604.2
            homeodomain family protein [Populus trichocarpa]
          Length = 820

 Score = 1259 bits (3258), Expect = 0.0
 Identities = 666/835 (79%), Positives = 702/835 (84%), Gaps = 10/835 (1%)
 Frame = +3

Query: 453  MSFGGFLETNTSXXXXXXXXXXXARIVADISYTNNMPXXXXXLAHPRLLSTPPLSKSMFN 632
            MSFGGFLE NTS           ARIVADI Y NN       +A  RL+S P ++KSMFN
Sbjct: 1    MSFGGFLE-NTSPGGGG------ARIVADILYNNNNNMPTGAIAQTRLVS-PSITKSMFN 52

Query: 633  SPGLSLALQQP-IDNQGDMQRMGESNFEGIIGRRSRENQEHESRSGSDNMEGASGDDLDV 809
            SPGLSLALQQP ID QGD+ RM E NFE  +GRRSRE +EHESRSGSDNM+GASGDD D 
Sbjct: 53   SPGLSLALQQPNIDGQGDITRMAE-NFETSVGRRSRE-EEHESRSGSDNMDGASGDDQDA 110

Query: 810  TDKPPRKKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ 989
             D PPRKKRYHRHTPQQIQELE+LFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ
Sbjct: 111  ADNPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQ 170

Query: 990  MKTQLERHENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAMIGDISLEEQHLRIENA 1169
            MKTQLERHENSLLRQ+NDKLRAENMSIRDAMRNP C+NCGGPA+IGD+SLEEQHLRIENA
Sbjct: 171  MKTQLERHENSLLRQDNDKLRAENMSIRDAMRNPSCSNCGGPAIIGDMSLEEQHLRIENA 230

Query: 1170 RLKDELDRVCALAGKFLGRXXXXXXXXXXXXXX---ELGVGNNGFGGLSTVTTTTLPLGP 1340
            RLKDELDRVCALAGKFLGR                 EL VG+NGF GLST+ TT LPLGP
Sbjct: 231  RLKDELDRVCALAGKFLGRPISSLASSLSPPTNSSLELAVGSNGFAGLSTIATT-LPLGP 289

Query: 1341 ADFGTGGTSNNALQVMTQNR-SGPGVTGLDRSIERSMFLELALTAMDELVKMAQTDEPLW 1517
                 GG S  AL ++TQ R +  GVTG+DRS+ERSMFLELAL AMDELVKM QTDEPLW
Sbjct: 290  --HFEGGISG-ALSMVTQTRLATAGVTGIDRSVERSMFLELALAAMDELVKMVQTDEPLW 346

Query: 1518 IRSLEGPGREVLNLEEYLTTFTPCIGLKPNGFVTEASRETGMVIINGLALVETLMDPNRW 1697
            I S EG GRE+LN E YL TFTPCIG+KP+GFV+EASRETGMVIIN LALVETLMD NRW
Sbjct: 347  IGSFEG-GREILNHEGYLRTFTPCIGMKPSGFVSEASRETGMVIINSLALVETLMDSNRW 405

Query: 1698 AEMFPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEG 1877
            AEMFPCMIARTSTTDVI+SGMGGTRNG+LQLM AEL VLSPLVPVREVNFLRFCKQHAEG
Sbjct: 406  AEMFPCMIARTSTTDVIASGMGGTRNGSLQLMQAELHVLSPLVPVREVNFLRFCKQHAEG 465

Query: 1878 VWAVVDVSIDTIRETSGAP-AFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESQVHQL 2054
            VWAVVDVSIDTIR+TSGAP  FVNCRRLPSGCVVQDMPNGYSKVTWVEHA+YDE Q+HQL
Sbjct: 466  VWAVVDVSIDTIRDTSGAPPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAQYDERQIHQL 525

Query: 2055 YKPFVSSGMGFGAQRWVATLQRQCECLAILMSSAVLARDHTAITAGGRRSMLKLAQRMTD 2234
            Y+P +SSGMGFGAQRW+ATLQRQCECLAIL+SS V +RDHTAIT  GRRSMLKLAQRMTD
Sbjct: 526  YRPVISSGMGFGAQRWIATLQRQCECLAILLSSNVPSRDHTAITTSGRRSMLKLAQRMTD 585

Query: 2235 NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQXX 2414
            NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQ  
Sbjct: 586  NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 645

Query: 2415 XXXXXXXXXXSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAINSNQSSMLILQETC 2594
                      SEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASA+N+NQSSMLILQETC
Sbjct: 646  FDFLRNERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 705

Query: 2595 IDAAGSLVVYAPVDIPAMHVVMNGGDSAYVALLPSGFAIVPD----XXXXXXXXXXXXXX 2762
            IDAAGSLVVYAPVD PAMHVVMNGGDSAYVALLPSGFAIVPD                  
Sbjct: 706  IDAAGSLVVYAPVDTPAMHVVMNGGDSAYVALLPSGFAIVPDGPGSRDPPSTNGGPTANN 765

Query: 2763 XXXXXXXXXXLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 2927
                      LLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES
Sbjct: 766  VGGQERVSGSLLTVAFQILVNSLPTAKLTVESVETVNNLISCTVQKIKAALQCES 820

