BLASTX nr result

ID: Papaver32_contig00006754 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver32_contig00006754
         (2154 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010278578.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   932   0.0  
OMP00515.1 hypothetical protein COLO4_12612 [Corchorus olitorius]     931   0.0  
XP_010255501.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   929   0.0  
XP_012083470.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   929   0.0  
XP_010661562.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   928   0.0  
XP_002320755.1 homeodomain family protein [Populus trichocarpa] ...   928   0.0  
OAY49229.1 hypothetical protein MANES_05G039400 [Manihot esculenta]   928   0.0  
EOX96069.1 HD domain class transcription factor isoform 1 [Theob...   927   0.0  
EOX96070.1 HD domain class transcription factor isoform 2 [Theob...   926   0.0  
XP_010278577.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   926   0.0  
XP_007051912.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...   925   0.0  
XP_016748231.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   925   0.0  
XP_011035097.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   925   0.0  
XP_007051913.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...   924   0.0  
XP_002272264.2 PREDICTED: homeobox-leucine zipper protein ANTHOC...   924   0.0  
XP_017631289.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   923   0.0  
XP_012489878.1 PREDICTED: homeobox-leucine zipper protein ANTHOC...   920   0.0  
GAV85584.1 Homeobox domain-containing protein/START domain-conta...   919   0.0  
OAY62348.1 hypothetical protein MANES_01G261300 [Manihot esculenta]   918   0.0  
XP_007220256.1 hypothetical protein PRUPE_ppa001436mg [Prunus pe...   918   0.0  

>XP_010278578.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
            isoform X2 [Nelumbo nucifera]
          Length = 813

 Score =  932 bits (2408), Expect = 0.0
 Identities = 478/714 (66%), Positives = 559/714 (78%), Gaps = 4/714 (0%)
 Frame = -2

Query: 2132 RVVSDIPLNNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGDR 1953
            RVV+DIP +NMP G I+Q R+++ +         ++F SPGLSLAL+T            
Sbjct: 16   RVVADIPYSNMPAGAIAQPRLLSPS------LAKSMFNSPGLSLALKTG----------- 58

Query: 1952 FXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDTDK-PPRKKRYHRHT 1776
                                    D YESRSGSDNM+GA+ DDQD D  PPRKKRYHRHT
Sbjct: 59   MEGQGEVGRIGENLDTGAVGRNKEDGYESRSGSDNMEGASGDDQDGDNNPPRKKRYHRHT 118

Query: 1775 PQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSILR 1596
            PQQIQELEALFKECPHPDEKQRNELS+RL LE+RQVKFWFQNRRTQMKTQ+ERHENSILR
Sbjct: 119  PQQIQELEALFKECPHPDEKQRNELSKRLCLESRQVKFWFQNRRTQMKTQLERHENSILR 178

Query: 1595 QENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALAG 1416
            QENDKLRAENM++R+AMRNP+C+NCGG A+LGD+SLEEQHLR+ENARLK+ELDRVCALAG
Sbjct: 179  QENDKLRAENMSIRDAMRNPICSNCGGPAMLGDISLEEQHLRIENARLKDELDRVCALAG 238

Query: 1415 KFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRXX 1236
            KFLGRPVSS+   +PP M   + L+LAV                          +     
Sbjct: 239  KFLGRPVSSLATSIPPPMP-SSSLELAVGSNGFGGLNTVAATLPLVSDFGGGVSSALSVV 297

Query: 1235 XXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSFE 1056
                        P +  +  +G++RS+E SM+LDLA+AAM+ELV+MAQT++PLW+P   +
Sbjct: 298  P-----------PARPAAGVTGLERSLERSMFLDLALAAMDELVKMAQTDKPLWLPG-LD 345

Query: 1055 GGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFPC 876
            GGKET N EEY + F PCIG KP+GFVTEATRETG+VIIN +ALVETLM+A R+ +MFPC
Sbjct: 346  GGKETLNHEEYMQTFPPCIGLKPSGFVTEATRETGMVIINSLALVETLMDASRWAEMFPC 405

Query: 875  VIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVVD 696
            +IAR +TT+VISSG+GGTRN ALQLMHAE QVLSPLVPIREV FLRFCKQHAEGVWAVVD
Sbjct: 406  MIARTSTTEVISSGMGGTRNCALQLMHAELQVLSPLVPIREVKFLRFCKQHAEGVWAVVD 465

Query: 695  ISVD-ANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            +S+D   +E S+   ++S RRLPSGCV+QDMPNG  KVTWVEH EYDE+SIH++Y+PL+ 
Sbjct: 466  VSIDHILRETSNEPVFVSCRRLPSGCVVQDMPNGYSKVTWVEHGEYDESSIHQLYRPLLR 525

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCAGV 345
            AG+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCAGV
Sbjct: 526  AGMGFGAQRWVATLQRQCECLAILMSSTLPARDHTAITPSGRRSMLKLAQRMTDNFCAGV 585

Query: 344  CPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLRD 165
            C S+VHKWN+L AGNVD DVRVMTRKS+DDPGEPPGVVLSAATSVWLP+SPQ+LFDFLRD
Sbjct: 586  CASAVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGVVLSAATSVWLPVSPQRLFDFLRD 645

Query: 164  ERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            ERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 646  ERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 699


>OMP00515.1 hypothetical protein COLO4_12612 [Corchorus olitorius]
          Length = 819

 Score =  931 bits (2405), Expect = 0.0
 Identities = 476/716 (66%), Positives = 561/716 (78%), Gaps = 6/716 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGD 1956
            R+V+DIP  NNMPTGVI+Q R+V+ +          +F SPGLSLALQ  N+D       
Sbjct: 17   RIVADIPYSNNMPTGVIAQPRLVSPS------LAKNMFNSPGLSLALQQPNIDNQGDGT- 69

Query: 1955 RFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHRH 1779
                                     +E+ESRSGSDNMDGA+ DDQD  D PPRKKRYHRH
Sbjct: 70   ---------RMGENFEASVGRRSREEEHESRSGSDNMDGASGDDQDAADNPPRKKRYHRH 120

Query: 1778 TPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSIL 1599
            TPQQIQELE+LFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS+L
Sbjct: 121  TPQQIQELESLFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENSLL 180

Query: 1598 RQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALA 1419
            RQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCALA
Sbjct: 181  RQENDKLRAENMSIRDAMRNPICTNCGGPAIMGDISLEEQHLRIENARLKDELDRVCALA 240

Query: 1418 GKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRX 1239
            GKFLGRP+S++   + P M   + L+L V                               
Sbjct: 241  GKFLGRPISALASSIAPPMP-NSSLELGVGNNGFGGLSTVPTTLPLGPDFGSGINA---- 295

Query: 1238 XXXXXXXXXXXXVPHQQQSIS-SGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSS 1062
                        VP  + +   +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  S
Sbjct: 296  ---------LPVVPATRPTAGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-KS 345

Query: 1061 FEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMF 882
             EGG+ET N +EY R F PCIG KP+GFVTEA+RETGVVIIN +ALVETLM++ R+ +MF
Sbjct: 346  LEGGRETLNYDEYLRSFTPCIGMKPSGFVTEASRETGVVIINSLALVETLMDSNRWAEMF 405

Query: 881  PCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAV 702
            PC+IAR +TTDVIS G+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWAV
Sbjct: 406  PCMIARTSTTDVISGGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAV 465

Query: 701  VDISVDANQENSDTSK-YLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPL 528
            VD+S+D+ +E S     YL+ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y+PL
Sbjct: 466  VDVSIDSIRETSGAPPTYLNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYRPL 525

Query: 527  VNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCA 351
            +++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCA
Sbjct: 526  LSSGMGFGAQRWVATLQRQCECLAILMSSSVPARDHTAITASGRRSMLKLAQRMTDNFCA 585

Query: 350  GVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFL 171
            GVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFL
Sbjct: 586  GVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFL 645

Query: 170  RDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            RDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 646  RDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 701


>XP_010255501.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 [Nelumbo
            nucifera]
          Length = 811

 Score =  929 bits (2402), Expect = 0.0
 Identities = 478/713 (67%), Positives = 556/713 (77%), Gaps = 3/713 (0%)
 Frame = -2

Query: 2132 RVVSDIPLNNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGDR 1953
            RVV+DIP +NMP G I+Q R+V  +         ++F SPGLSLALQT            
Sbjct: 16   RVVADIPYSNMPAGAIAQPRLVAPS------LAKSMFSSPGLSLALQTG----------- 58

Query: 1952 FXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDTDKPPRKKRYHRHTP 1773
                                    DEYESRSGSDNM+GA+ DDQD D PPRKKRYHRHTP
Sbjct: 59   MEGQGEAGQIGEKLDSTVVGRNREDEYESRSGSDNMEGASGDDQDGDNPPRKKRYHRHTP 118

Query: 1772 QQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSILRQ 1593
            QQIQELEALFKECPHPDEKQR ELS+RL LE+RQVKFWFQNRRTQMKTQ+ERHEN+ILRQ
Sbjct: 119  QQIQELEALFKECPHPDEKQRMELSKRLCLESRQVKFWFQNRRTQMKTQLERHENTILRQ 178

Query: 1592 ENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALAGK 1413
            ENDKLRAENM++REAMRNP+C+NCGG A+LGD+SLEEQHLR+ENARLK+ELDRVCALAGK
Sbjct: 179  ENDKLRAENMSIREAMRNPICSNCGGPAMLGDISLEEQHLRIENARLKDELDRVCALAGK 238

Query: 1412 FLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRXXX 1233
            FLGRPVSS+  P+P S      L+LAV               +          + F    
Sbjct: 239  FLGRPVSSLATPMPSS-----SLELAVGSNGFGG--------MNPVATTLPLVSDFVGGV 285

Query: 1232 XXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSFEG 1053
                       P    +I   +DRS+E SM+LDLA+AAM+ELV+MAQ+++ LW+P   EG
Sbjct: 286  SNTLPVVPQTRPTPGVTI---LDRSLERSMFLDLALAAMDELVKMAQSDKSLWLPG-LEG 341

Query: 1052 GKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFPCV 873
            GKET N+EEY + F PCIG KP+GFVTEATRETG+VIIN +ALVETLM+A R+ +MFPC+
Sbjct: 342  GKETLNQEEYMQTFPPCIGMKPSGFVTEATRETGMVIINSLALVETLMDANRWAEMFPCM 401

Query: 872  IARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVVDI 693
            IAR +TT+V+SSG+GGTRN ALQLMHAE QVLSPLVPIREV FLRFCKQHAEGVWAVVD+
Sbjct: 402  IARTSTTEVLSSGMGGTRNCALQLMHAELQVLSPLVPIREVKFLRFCKQHAEGVWAVVDV 461

Query: 692  SVD-ANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVNA 519
            S+D   +E S+   + S RRLPSGCV+QDMPNG  KV WVEHAEYDE++IH++Y+PL+ A
Sbjct: 462  SIDHILRETSNEPTFASCRRLPSGCVVQDMPNGYSKVIWVEHAEYDESAIHQLYRPLLRA 521

Query: 518  GLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCAGVC 342
            G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCAGVC
Sbjct: 522  GMGFGAQRWVATLQRQCECLAILMSSTVPARDHTAITPSGRRSMLKLAQRMTDNFCAGVC 581

Query: 341  PSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLRDE 162
             S+VHKWN+L  GNVD DVRVMTRKS+DDPGEPPGVVLSAATSVWLP+SPQ+LFDFLRDE
Sbjct: 582  ASAVHKWNKLCTGNVDEDVRVMTRKSVDDPGEPPGVVLSAATSVWLPVSPQRLFDFLRDE 641

Query: 161  RLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            RLRSEWDILSNGGPMQ+M+HI KGQDPGNCVSLLRA AMNA+QS+MLILQETC
Sbjct: 642  RLRSEWDILSNGGPMQEMAHIAKGQDPGNCVSLLRASAMNANQSNMLILQETC 694


>XP_012083470.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2
            [Jatropha curcas] KDP28682.1 hypothetical protein
            JCGZ_14453 [Jatropha curcas]
          Length = 819

 Score =  929 bits (2401), Expect = 0.0
 Identities = 475/720 (65%), Positives = 563/720 (78%), Gaps = 10/720 (1%)
 Frame = -2

Query: 2132 RVVSDIPLN--NMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXG 1959
            R+V+DIP +  NMPTG I+Q R+V+ +         ++F SPGLSLALQ  N+D     G
Sbjct: 18   RIVADIPYSSSNMPTGAIAQPRLVSPS------LTKSMFSSPGLSLALQQPNIDSPGDMG 71

Query: 1958 DRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHR 1782
                                      +E+ESRSGSDNMDGA+ DDQD  D PPRKKRYHR
Sbjct: 72   ----------RMAENFEPSGGRRSREEEHESRSGSDNMDGASGDDQDAADNPPRKKRYHR 121

Query: 1781 HTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSI 1602
            HTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS+
Sbjct: 122  HTPQQIQELEALFKECPHPDEKQRLELSKRLSLETRQVKFWFQNRRTQMKTQLERHENSL 181

Query: 1601 LRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCAL 1422
            LRQENDKLRAENM++R+AMRNP+C+NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCAL
Sbjct: 182  LRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGDISLEEQHLRIENARLKDELDRVCAL 241

Query: 1421 AGKFLGRPVSS----IGDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXX 1257
            AGKFLGRP+SS    IG P+P  S+++G G +                 G+         
Sbjct: 242  AGKFLGRPISSLAGSIGPPMPNSSLELGVGSN--------------GFGGLSTVATTLPL 287

Query: 1256 XNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPL 1077
               F               P    +  +G+DRS+E SM+L+LA+AAM+ELV+MAQT+EPL
Sbjct: 288  GPDF---GGGISSLPVMNQPRSTTTGVTGLDRSLERSMFLELALAAMDELVKMAQTDEPL 344

Query: 1076 WIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKR 897
            WI  S EGG+E  N EEY R F PCIG KP+GF +EA+RETG VIIN +ALVETLM++ R
Sbjct: 345  WI-RSLEGGREILNHEEYMRTFTPCIGMKPSGFFSEASRETGTVIINSLALVETLMDSNR 403

Query: 896  YVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAE 717
            + +MFPC+IAR  TTDVISSG+GGTRNG+LQLMHAE QVLSPLVP+REVNFLRFCKQHAE
Sbjct: 404  WAEMFPCMIARTTTTDVISSGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFCKQHAE 463

Query: 716  GVWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRI 540
            GVWAVVD+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ IH++
Sbjct: 464  GVWAVVDVSIDTIRETSGAPTFINCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQIHQL 523

Query: 539  YQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQ 363
            Y+PL+++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT 
Sbjct: 524  YRPLISSGMGFGAQRWVATLQRQCECLAILMSSTVPSRDHTAITASGRRSMLKLAQRMTD 583

Query: 362  NFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKL 183
            NFCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+L
Sbjct: 584  NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 643

Query: 182  FDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            FDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 644  FDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 703


>XP_010661562.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X2 [Vitis vinifera]
          Length = 810

 Score =  928 bits (2399), Expect = 0.0
 Identities = 472/714 (66%), Positives = 556/714 (77%), Gaps = 4/714 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGD 1956
            R+V+DIP  NNM TG I+Q R+V+ +         ++F SPGLSLALQT+          
Sbjct: 17   RIVADIPYSNNMATGAIAQPRLVSPS------LAKSMFSSPGLSLALQTS---------- 60

Query: 1955 RFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHRH 1779
                                     DE+ESRSGSDNMDGA+ DDQD  D PPRKKRYHRH
Sbjct: 61   -MEGQGEVTRLAENFESGGGRRSREDEHESRSGSDNMDGASGDDQDAADNPPRKKRYHRH 119

Query: 1778 TPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSIL 1599
            TPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQMKTQ+ERHENSIL
Sbjct: 120  TPQQIQELEALFKECPHPDEKQRLELSRRLSLETRQVKFWFQNRRTQMKTQLERHENSIL 179

Query: 1598 RQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALA 1419
            RQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCALA
Sbjct: 180  RQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCALA 239

Query: 1418 GKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRX 1239
            GKFLGRP+SS+   + P+M   + L+L V                          +    
Sbjct: 240  GKFLGRPISSLASSMAPAMP-SSSLELGVGSNGFGGLSTVATTLPLGHDFGGGISSTL-- 296

Query: 1238 XXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSF 1059
                         P    +  +G++RS+E SM+L+LA+AAM+ELV+MAQT+EPLW+  S 
Sbjct: 297  ----------PVAPPTSTTGVTGLERSLERSMFLELALAAMDELVKMAQTDEPLWV-RSL 345

Query: 1058 EGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFP 879
            EGG+E  N EEY R F PCIG KP+GFVTE+TRETG+VIIN +ALVETLM++ R+ +MFP
Sbjct: 346  EGGREILNLEEYMRTFTPCIGMKPSGFVTESTRETGMVIINSLALVETLMDSNRWAEMFP 405

Query: 878  CVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVV 699
            C+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWAVV
Sbjct: 406  CMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 465

Query: 698  DISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            D+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEYDE+++H++Y+PL+ 
Sbjct: 466  DVSIDTIRETSVAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESAVHQLYRPLLG 525

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCAGV 345
            +G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT  GR+SMLKLAQRMT NFCAGV
Sbjct: 526  SGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITAGGRRSMLKLAQRMTDNFCAGV 585

Query: 344  CPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLRD 165
            C S+VHKWN+L AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFLRD
Sbjct: 586  CASTVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFLRD 645

Query: 164  ERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            ERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 646  ERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 699


>XP_002320755.1 homeodomain family protein [Populus trichocarpa] EEE99070.1
            homeodomain family protein [Populus trichocarpa]
          Length = 823

 Score =  928 bits (2399), Expect = 0.0
 Identities = 476/721 (66%), Positives = 557/721 (77%), Gaps = 11/721 (1%)
 Frame = -2

Query: 2132 RVVSDIPLNN--MPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXG 1959
            R+V+DIP NN  MPTG I Q R+V+ +         ++F SPGLSLALQ  N+D      
Sbjct: 18   RIVADIPYNNNNMPTGAIVQPRLVSPSI------TKSMFNSPGLSLALQQPNIDGQGDIT 71

Query: 1958 DRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHR 1782
                                      +E+ESRSGSDNMDGA+ DDQD  D PPRKKRYHR
Sbjct: 72   ----------RMSENFETSVGRRSREEEHESRSGSDNMDGASGDDQDAADNPPRKKRYHR 121

Query: 1781 HTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSI 1602
            HTPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQMKTQ+ERHENS+
Sbjct: 122  HTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMKTQLERHENSL 181

Query: 1601 LRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCAL 1422
            LRQENDKLRAENM++R+AMRNPMC+NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCAL
Sbjct: 182  LRQENDKLRAENMSIRDAMRNPMCSNCGGPAIIGDISLEEQHLRIENARLKDELDRVCAL 241

Query: 1421 AGKFLGRPVSSI----GDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXX 1257
            AGKFLGRP+SS+    G P+P  S+++G G +                  V         
Sbjct: 242  AGKFLGRPISSLASSLGPPMPNSSLELGVGSNGFAGLSTVATTLPLGPDFVGGISGALPV 301

Query: 1256 XNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPL 1077
              Q R                      +G+ RS+E SM+L+LA+AAM+ELV+MAQT+EPL
Sbjct: 302  LTQTRPATTGV----------------TGIGRSLERSMFLELALAAMDELVKMAQTDEPL 345

Query: 1076 WIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKR 897
            WI  SF+GG+E  N EEY R   PCIG KP+GFV+EA+RETG+VIIN +ALVETLM++ R
Sbjct: 346  WI-RSFDGGREILNHEEYLRTITPCIGMKPSGFVSEASRETGMVIINSLALVETLMDSNR 404

Query: 896  YVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAE 717
            + +MFPCVIAR +TTDVI++G+GGTRNG+LQLMHAE QVLSPLVP+REVNFLRFCKQHAE
Sbjct: 405  WAEMFPCVIARTSTTDVIANGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFCKQHAE 464

Query: 716  GVWAVVDISVDANQENSDTSKYL--SRRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHR 543
            GVWAVVD+SVD  +E S  S      RRLPSGCV+QDMPNG  KVTW+EHAEYDE+  H+
Sbjct: 465  GVWAVVDVSVDTIRETSGASPTFVNCRRLPSGCVVQDMPNGYSKVTWIEHAEYDESQTHQ 524

Query: 542  IYQPLVNAGLGFGAQKWIATLQRQCECIAILMS-NMPIRDQTGITQSGRKSMLKLAQRMT 366
            +Y+PL+++G+GFGAQ+WIATLQRQ EC+AILMS N+P RD T IT SGR+SMLKLAQRMT
Sbjct: 525  LYRPLISSGMGFGAQRWIATLQRQSECLAILMSSNVPSRDHTAITASGRRSMLKLAQRMT 584

Query: 365  QNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQK 186
             NFCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+
Sbjct: 585  ANFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQR 644

Query: 185  LFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQET 6
            LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQET
Sbjct: 645  LFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQET 704

Query: 5    C 3
            C
Sbjct: 705  C 705


>OAY49229.1 hypothetical protein MANES_05G039400 [Manihot esculenta]
          Length = 822

 Score =  928 bits (2398), Expect = 0.0
 Identities = 473/720 (65%), Positives = 558/720 (77%), Gaps = 10/720 (1%)
 Frame = -2

Query: 2132 RVVSDIPLN--NMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXG 1959
            R+V+DIP +  NMPTG I+Q R+++ +         A+F SPGLSLALQ  N+D      
Sbjct: 18   RIVADIPYSSSNMPTGAIAQPRLISPS------LTKAMFNSPGLSLALQQPNIDGQGDIA 71

Query: 1958 DRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHR 1782
                                      +E+ESRSGSDNMDGA+ DDQD  D PPRKKRYHR
Sbjct: 72   ----------RMAENFESNGGRRSREEEHESRSGSDNMDGASGDDQDAADNPPRKKRYHR 121

Query: 1781 HTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSI 1602
            HTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS+
Sbjct: 122  HTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENSL 181

Query: 1601 LRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCAL 1422
            LRQENDKLRAENM++R+AMRNP+C+NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCAL
Sbjct: 182  LRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGDISLEEQHLRIENARLKDELDRVCAL 241

Query: 1421 AGKFLGRPVSS----IGDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXX 1257
            AGKFLGRP+SS    IG P+P  S+++G G +                            
Sbjct: 242  AGKFLGRPISSLAGSIGPPMPNSSLELGVGTNGFSGLSTVPATLPLGPDFAGGISGALPV 301

Query: 1256 XNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPL 1077
              Q R                      +G+DRS E SM+L+LA+AAM+ELV+MAQT+EPL
Sbjct: 302  MTQTRPATAGV----------------TGLDRSFERSMFLELALAAMDELVKMAQTDEPL 345

Query: 1076 WIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKR 897
            WI  S EGG+E  N EEY R F PCIG KP GFV+EA+RETG+VIIN +ALVETLM++ R
Sbjct: 346  WI-RSLEGGREILNHEEYMRTFTPCIGMKPGGFVSEASRETGMVIINSLALVETLMDSNR 404

Query: 896  YVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAE 717
            + +MFPC+IAR +TTDVIS+G+GGTRNG+LQLM AE QVLSPLVP+REVNFLRFCKQHAE
Sbjct: 405  WAEMFPCMIARTSTTDVISNGMGGTRNGSLQLMLAELQVLSPLVPVREVNFLRFCKQHAE 464

Query: 716  GVWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRI 540
            GVWAVVD+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEYDE  IH++
Sbjct: 465  GVWAVVDVSIDTIRETSGAPAFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDETQIHQL 524

Query: 539  YQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQ 363
            Y+PL+++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT 
Sbjct: 525  YRPLISSGMGFGAQRWVATLQRQCECLAILMSSAVPTRDHTAITASGRRSMLKLAQRMTD 584

Query: 362  NFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKL 183
            NFCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+L
Sbjct: 585  NFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRL 644

Query: 182  FDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            FDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 645  FDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 704


>EOX96069.1 HD domain class transcription factor isoform 1 [Theobroma cacao]
          Length = 819

 Score =  927 bits (2396), Expect = 0.0
 Identities = 472/715 (66%), Positives = 558/715 (78%), Gaps = 5/715 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGD 1956
            R+V+DIP  NNMPTG I+Q R+V+ +          +F SPGLSLALQ  N+D       
Sbjct: 17   RIVADIPYSNNMPTGAIAQPRLVSPS------LAKNMFNSPGLSLALQQPNIDNQGDGT- 69

Query: 1955 RFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHRH 1779
                                     +E+ESRSGSDNMDG + DDQD  D PPRKKRYHRH
Sbjct: 70   ---------RMGENFEGSVGRRSREEEHESRSGSDNMDGGSGDDQDAADNPPRKKRYHRH 120

Query: 1778 TPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSIL 1599
            TPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS+L
Sbjct: 121  TPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENSLL 180

Query: 1598 RQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALA 1419
            RQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCALA
Sbjct: 181  RQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCALA 240

Query: 1418 GKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRX 1239
            GKFLGRP+S++   + P M   + L+L V                          N    
Sbjct: 241  GKFLGRPISALATSIAPPMP-NSSLELGVGSNGFGGLSTVPTTLPLGPDFGGGITNAL-- 297

Query: 1238 XXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSF 1059
                         P++  +  +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  S 
Sbjct: 298  ---------PVAPPNRPTTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-RSL 347

Query: 1058 EGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFP 879
            EGG+E  N +EY R F PCIG KP GFVTEA+RETGVVIIN +ALVETLM++ R+ +MFP
Sbjct: 348  EGGREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWAEMFP 407

Query: 878  CVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVV 699
            C+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWAVV
Sbjct: 408  CMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 467

Query: 698  DISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            D+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y+PL++
Sbjct: 468  DVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYRPLLS 527

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCAGV 345
            +G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCAGV
Sbjct: 528  SGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNFCAGV 587

Query: 344  CPSSVHKWNRL-SAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLR 168
            C S++HKWN+L +AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFLR
Sbjct: 588  CASTLHKWNKLNNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFLR 647

Query: 167  DERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            DERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 648  DERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 702


>EOX96070.1 HD domain class transcription factor isoform 2 [Theobroma cacao]
          Length = 818

 Score =  926 bits (2394), Expect = 0.0
 Identities = 474/717 (66%), Positives = 559/717 (77%), Gaps = 7/717 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTN--NMDXXXXX 1962
            R+V+DIP  NNMPTG I+Q R+V+ +          +F SPGLSLALQ N  N       
Sbjct: 17   RIVADIPYSNNMPTGAIAQPRLVSPS------LAKNMFNSPGLSLALQPNIDNQGDGTRM 70

Query: 1961 GDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYH 1785
            G+ F                        E+ESRSGSDNMDG + DDQD  D PPRKKRYH
Sbjct: 71   GENFEGSVGRRSREE-------------EHESRSGSDNMDGGSGDDQDAADNPPRKKRYH 117

Query: 1784 RHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENS 1605
            RHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS
Sbjct: 118  RHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENS 177

Query: 1604 ILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCA 1425
            +LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCA
Sbjct: 178  LLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCA 237

Query: 1424 LAGKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQF 1245
            LAGKFLGRP+S++   + P M   + L+L V                          N  
Sbjct: 238  LAGKFLGRPISALATSIAPPMP-NSSLELGVGSNGFGGLSTVPTTLPLGPDFGGGITNAL 296

Query: 1244 RXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPS 1065
                           P++  +  +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  
Sbjct: 297  -----------PVAPPNRPTTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-R 344

Query: 1064 SFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDM 885
            S EGG+E  N +EY R F PCIG KP GFVTEA+RETGVVIIN +ALVETLM++ R+ +M
Sbjct: 345  SLEGGREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWAEM 404

Query: 884  FPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWA 705
            FPC+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWA
Sbjct: 405  FPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWA 464

Query: 704  VVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPL 528
            VVD+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y+PL
Sbjct: 465  VVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYRPL 524

Query: 527  VNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCA 351
            +++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCA
Sbjct: 525  LSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNFCA 584

Query: 350  GVCPSSVHKWNRL-SAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDF 174
            GVC S++HKWN+L +AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDF
Sbjct: 585  GVCASTLHKWNKLNNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDF 644

Query: 173  LRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            LRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 645  LRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 701


>XP_010278577.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
            isoform X1 [Nelumbo nucifera]
          Length = 818

 Score =  926 bits (2392), Expect = 0.0
 Identities = 478/719 (66%), Positives = 559/719 (77%), Gaps = 9/719 (1%)
 Frame = -2

Query: 2132 RVVSDIPLNNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGDR 1953
            RVV+DIP +NMP G I+Q R+++ +         ++F SPGLSLAL+T            
Sbjct: 16   RVVADIPYSNMPAGAIAQPRLLSPS------LAKSMFNSPGLSLALKTG----------- 58

Query: 1952 FXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDTDK-PPRKKRYHRHT 1776
                                    D YESRSGSDNM+GA+ DDQD D  PPRKKRYHRHT
Sbjct: 59   MEGQGEVGRIGENLDTGAVGRNKEDGYESRSGSDNMEGASGDDQDGDNNPPRKKRYHRHT 118

Query: 1775 PQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSILR 1596
            PQQIQELEALFKECPHPDEKQRNELS+RL LE+RQVKFWFQNRRTQMKTQ+ERHENSILR
Sbjct: 119  PQQIQELEALFKECPHPDEKQRNELSKRLCLESRQVKFWFQNRRTQMKTQLERHENSILR 178

Query: 1595 QENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALAG 1416
            QENDKLRAENM++R+AMRNP+C+NCGG A+LGD+SLEEQHLR+ENARLK+ELDRVCALAG
Sbjct: 179  QENDKLRAENMSIRDAMRNPICSNCGGPAMLGDISLEEQHLRIENARLKDELDRVCALAG 238

Query: 1415 KFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRXX 1236
            KFLGRPVSS+   +PP M   + L+LAV                          +     
Sbjct: 239  KFLGRPVSSLATSIPPPMP-SSSLELAVGSNGFGGLNTVAATLPLVSDFGGGVSSALSVV 297

Query: 1235 XXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSFE 1056
                        P +  +  +G++RS+E SM+LDLA+AAM+ELV+MAQT++PLW+P   +
Sbjct: 298  P-----------PARPAAGVTGLERSLERSMFLDLALAAMDELVKMAQTDKPLWLPG-LD 345

Query: 1055 GGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFPC 876
            GGKET N EEY + F PCIG KP+GFVTEATRETG+VIIN +ALVETLM+A R+ +MFPC
Sbjct: 346  GGKETLNHEEYMQTFPPCIGLKPSGFVTEATRETGMVIINSLALVETLMDASRWAEMFPC 405

Query: 875  VIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVVD 696
            +IAR +TT+VISSG+GGTRN ALQLMHAE QVLSPLVPIREV FLRFCKQHAEGVWAVVD
Sbjct: 406  MIARTSTTEVISSGMGGTRNCALQLMHAELQVLSPLVPIREVKFLRFCKQHAEGVWAVVD 465

Query: 695  ISVD-ANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            +S+D   +E S+   ++S RRLPSGCV+QDMPNG  KVTWVEH EYDE+SIH++Y+PL+ 
Sbjct: 466  VSIDHILRETSNEPVFVSCRRLPSGCVVQDMPNGYSKVTWVEHGEYDESSIHQLYRPLLR 525

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQT-----GITQSGRKSMLKLAQRMTQN 360
            AG+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T      IT SGR+SMLKLAQRMT N
Sbjct: 526  AGMGFGAQRWVATLQRQCECLAILMSSTLPARDHTDNNPTAITPSGRRSMLKLAQRMTDN 585

Query: 359  FCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLF 180
            FCAGVC S+VHKWN+L AGNVD DVRVMTRKS+DDPGEPPGVVLSAATSVWLP+SPQ+LF
Sbjct: 586  FCAGVCASAVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGVVLSAATSVWLPVSPQRLF 645

Query: 179  DFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            DFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 646  DFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 704


>XP_007051912.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Theobroma cacao]
          Length = 819

 Score =  925 bits (2391), Expect = 0.0
 Identities = 471/715 (65%), Positives = 558/715 (78%), Gaps = 5/715 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGD 1956
            R+V+DIP  NNMPTG I+Q R+V+ +          +F SPGLSLALQ  N+D       
Sbjct: 17   RIVADIPYSNNMPTGAIAQPRLVSPS------LAKNMFNSPGLSLALQQPNIDNQGDGT- 69

Query: 1955 RFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHRH 1779
                                     +E+ESRSGSDNMDG + DDQD  D PPRKKRYHRH
Sbjct: 70   ---------RMGENFEGSVGRRSREEEHESRSGSDNMDGGSGDDQDAADNPPRKKRYHRH 120

Query: 1778 TPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSIL 1599
            TPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS+L
Sbjct: 121  TPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENSLL 180

Query: 1598 RQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALA 1419
            RQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCALA
Sbjct: 181  RQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCALA 240

Query: 1418 GKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRX 1239
            GKFLGRP+S++   + P M   + L+L V                          N    
Sbjct: 241  GKFLGRPISALATSIAPPMP-NSSLELGVGSNGFGGLSTVPTTLPLGPDFGGGITNAL-- 297

Query: 1238 XXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSF 1059
                         P++  +  +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  S 
Sbjct: 298  ---------PVAPPNRATTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-RSL 347

Query: 1058 EGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFP 879
            EGG+E  N +EY R F PCIG KP GFVTEA+RETGVVIIN +ALVETLM++ R+ +MFP
Sbjct: 348  EGGREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWAEMFP 407

Query: 878  CVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVV 699
            C+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWAVV
Sbjct: 408  CMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 467

Query: 698  DISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            D+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y+PL++
Sbjct: 468  DVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYRPLLS 527

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCAGV 345
            +G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCAGV
Sbjct: 528  SGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNFCAGV 587

Query: 344  CPSSVHKWNRL-SAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLR 168
            C S++HKWN+L +AG+VD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFLR
Sbjct: 588  CASTLHKWNKLNNAGDVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFLR 647

Query: 167  DERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            DERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 648  DERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 702


>XP_016748231.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
            [Gossypium hirsutum]
          Length = 820

 Score =  925 bits (2391), Expect = 0.0
 Identities = 477/721 (66%), Positives = 560/721 (77%), Gaps = 11/721 (1%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGV--ISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXX 1962
            R+V+DIP  NNM TG   I+Q R+++ +       P  +F SPGLSLALQ N +D     
Sbjct: 19   RMVADIPYSNNMATGATAIAQPRLMSPS------LPKNIFNSPGLSLALQPN-IDNQGDH 71

Query: 1961 GDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYH 1785
            G R                         E+ESRSGSDNMDGA+ DDQD  DKPPRKKRYH
Sbjct: 72   GSRIMRESLEGSVGRRSREE--------EHESRSGSDNMDGASGDDQDAADKPPRKKRYH 123

Query: 1784 RHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENS 1605
            RHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS
Sbjct: 124  RHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENS 183

Query: 1604 ILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCA 1425
            +LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCA
Sbjct: 184  LLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCA 243

Query: 1424 LAGKFLGRPVS----SIGDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXX 1260
            LAGKFLGRP+S    SI  PLP  S+++G G +                  +        
Sbjct: 244  LAGKFLGRPISTLATSIAPPLPNSSLELGVGSN--------------GFGALSTVATTLP 289

Query: 1259 XXNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEP 1080
                F               P +  +  +G+DRSVE SM+L+LA+AAM ELV+MAQT+EP
Sbjct: 290  LGPDF-----GGGMSNALVPPSRPTTAVTGLDRSVERSMFLELALAAMNELVKMAQTDEP 344

Query: 1079 LWIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAK 900
            LWI  S EGG+E  N++EY R F PCIG K NGFVTEA+RE+G+VIIN +ALVETLM++ 
Sbjct: 345  LWI-RSLEGGREILNQDEYLRTFTPCIGMKSNGFVTEASRESGMVIINSLALVETLMDSN 403

Query: 899  RYVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHA 720
            R+ +MFPC+IAR +TTDVIS GVGGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHA
Sbjct: 404  RWSEMFPCMIARTSTTDVISGGVGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHA 463

Query: 719  EGVWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHR 543
            EGVWAVVD+SVD N+E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H+
Sbjct: 464  EGVWAVVDVSVDTNRETSGAPSFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQ 523

Query: 542  IYQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMT 366
            +Y PL+ +G+ FGAQ+W+ATLQRQCEC+AILMS+ +P RD TGIT SGR+SMLKLAQRMT
Sbjct: 524  LYHPLLRSGMAFGAQRWVATLQRQCECLAILMSSSVPTRDHTGITASGRRSMLKLAQRMT 583

Query: 365  QNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQK 186
             NFCAGVC S+VHKWN+L+ GNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+
Sbjct: 584  DNFCAGVCASTVHKWNKLNVGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQR 643

Query: 185  LFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQET 6
            LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQ+ GNCVSLLRA AMNA+QSSMLILQET
Sbjct: 644  LFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQEHGNCVSLLRASAMNANQSSMLILQET 703

Query: 5    C 3
            C
Sbjct: 704  C 704


>XP_011035097.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 [Populus
            euphratica]
          Length = 823

 Score =  925 bits (2391), Expect = 0.0
 Identities = 474/721 (65%), Positives = 556/721 (77%), Gaps = 11/721 (1%)
 Frame = -2

Query: 2132 RVVSDIPLNN--MPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXG 1959
            R+V+DIP NN  MPTG I Q R+V+ +         ++F SPGLSLALQ  N+D      
Sbjct: 18   RIVADIPYNNNNMPTGAIVQPRLVSPSI------TKSMFNSPGLSLALQQPNIDGQGDIT 71

Query: 1958 DRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHR 1782
                                      +E+ESRSGSDNMDGA+ DDQD  D PPRKKRYHR
Sbjct: 72   ----------RMSENFETSVGRRSREEEHESRSGSDNMDGASGDDQDAADNPPRKKRYHR 121

Query: 1781 HTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSI 1602
            HTPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQMKTQ+ERHENS+
Sbjct: 122  HTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMKTQLERHENSL 181

Query: 1601 LRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCAL 1422
            LRQENDKLRAENM++R+AMRNPMC+NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCAL
Sbjct: 182  LRQENDKLRAENMSIRDAMRNPMCSNCGGPAIIGDISLEEQHLRIENARLKDELDRVCAL 241

Query: 1421 AGKFLGRPVSSI----GDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXX 1257
            AGKFLGRP+SS+    G P+P  S+++G G +                  V         
Sbjct: 242  AGKFLGRPISSLASSLGPPMPNSSLELGVGSNGFAGLSTVATTLPLGPDFVGGISGALPV 301

Query: 1256 XNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPL 1077
              Q R                      +G+ RS+E SM+L+LA+AAM+ELV+MAQT+EPL
Sbjct: 302  LAQTRPATTGV----------------TGIGRSLERSMFLELALAAMDELVKMAQTDEPL 345

Query: 1076 WIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKR 897
            WI  SF+GG+E  N EEY R   PCIG KP+GFV+EA+RETG+VIIN +ALVETLM++ R
Sbjct: 346  WI-RSFDGGREILNHEEYLRTITPCIGMKPSGFVSEASRETGMVIINSLALVETLMDSNR 404

Query: 896  YVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAE 717
            + +MFPCVIAR +TTDVI++G+GGTRNG+LQLMHAE QVLSPLVP+REVNFLRFCKQHAE
Sbjct: 405  WAEMFPCVIARTSTTDVIANGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFCKQHAE 464

Query: 716  GVWAVVDISVDANQENSDTSKYL--SRRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHR 543
            GVWAVVD+SVD  +E S         RRLPSGCV+QDMPNG  KVTW+EHAEYDE+  H+
Sbjct: 465  GVWAVVDVSVDTIRETSGAPPTFVNCRRLPSGCVVQDMPNGYSKVTWIEHAEYDESQTHQ 524

Query: 542  IYQPLVNAGLGFGAQKWIATLQRQCECIAILMS-NMPIRDQTGITQSGRKSMLKLAQRMT 366
            +Y+PL+++G+GFGAQ+WIATLQRQ EC+AILMS N+P RD T IT SGR+SMLKLAQRMT
Sbjct: 525  LYRPLISSGMGFGAQRWIATLQRQSECLAILMSSNVPSRDHTAITASGRRSMLKLAQRMT 584

Query: 365  QNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQK 186
             NFCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+
Sbjct: 585  ANFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQR 644

Query: 185  LFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQET 6
            LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMN++QSSMLILQET
Sbjct: 645  LFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNSNQSSMLILQET 704

Query: 5    C 3
            C
Sbjct: 705  C 705


>XP_007051913.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X2 [Theobroma cacao]
          Length = 818

 Score =  924 bits (2389), Expect = 0.0
 Identities = 473/717 (65%), Positives = 559/717 (77%), Gaps = 7/717 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTN--NMDXXXXX 1962
            R+V+DIP  NNMPTG I+Q R+V+ +          +F SPGLSLALQ N  N       
Sbjct: 17   RIVADIPYSNNMPTGAIAQPRLVSPS------LAKNMFNSPGLSLALQPNIDNQGDGTRM 70

Query: 1961 GDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYH 1785
            G+ F                        E+ESRSGSDNMDG + DDQD  D PPRKKRYH
Sbjct: 71   GENFEGSVGRRSREE-------------EHESRSGSDNMDGGSGDDQDAADNPPRKKRYH 117

Query: 1784 RHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENS 1605
            RHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS
Sbjct: 118  RHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENS 177

Query: 1604 ILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCA 1425
            +LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCA
Sbjct: 178  LLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCA 237

Query: 1424 LAGKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQF 1245
            LAGKFLGRP+S++   + P M   + L+L V                          N  
Sbjct: 238  LAGKFLGRPISALATSIAPPMP-NSSLELGVGSNGFGGLSTVPTTLPLGPDFGGGITNAL 296

Query: 1244 RXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPS 1065
                           P++  +  +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  
Sbjct: 297  -----------PVAPPNRATTGVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-R 344

Query: 1064 SFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDM 885
            S EGG+E  N +EY R F PCIG KP GFVTEA+RETGVVIIN +ALVETLM++ R+ +M
Sbjct: 345  SLEGGREILNHDEYLRTFTPCIGMKPGGFVTEASRETGVVIINSLALVETLMDSTRWAEM 404

Query: 884  FPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWA 705
            FPC+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWA
Sbjct: 405  FPCMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWA 464

Query: 704  VVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPL 528
            VVD+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y+PL
Sbjct: 465  VVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYRPL 524

Query: 527  VNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCA 351
            +++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT NFCA
Sbjct: 525  LSSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITASGRRSMLKLAQRMTDNFCA 584

Query: 350  GVCPSSVHKWNRL-SAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDF 174
            GVC S++HKWN+L +AG+VD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDF
Sbjct: 585  GVCASTLHKWNKLNNAGDVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDF 644

Query: 173  LRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            LRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 645  LRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 701


>XP_002272264.2 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Vitis vinifera] XP_010661561.1 PREDICTED:
            homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform
            X1 [Vitis vinifera]
          Length = 811

 Score =  924 bits (2387), Expect = 0.0
 Identities = 472/715 (66%), Positives = 556/715 (77%), Gaps = 5/715 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXGD 1956
            R+V+DIP  NNM TG I+Q R+V+ +         ++F SPGLSLALQT+          
Sbjct: 17   RIVADIPYSNNMATGAIAQPRLVSPS------LAKSMFSSPGLSLALQTS---------- 60

Query: 1955 RFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHRH 1779
                                     DE+ESRSGSDNMDGA+ DDQD  D PPRKKRYHRH
Sbjct: 61   -MEGQGEVTRLAENFESGGGRRSREDEHESRSGSDNMDGASGDDQDAADNPPRKKRYHRH 119

Query: 1778 TPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSIL 1599
            TPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQMKTQ+ERHENSIL
Sbjct: 120  TPQQIQELEALFKECPHPDEKQRLELSRRLSLETRQVKFWFQNRRTQMKTQLERHENSIL 179

Query: 1598 RQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCALA 1419
            RQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCALA
Sbjct: 180  RQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCALA 239

Query: 1418 GKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQFRX 1239
            GKFLGRP+SS+   + P+M   + L+L V                          +    
Sbjct: 240  GKFLGRPISSLASSMAPAMP-SSSLELGVGSNGFGGLSTVATTLPLGHDFGGGISSTL-- 296

Query: 1238 XXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPSSF 1059
                         P    +  +G++RS+E SM+L+LA+AAM+ELV+MAQT+EPLW+  S 
Sbjct: 297  ----------PVAPPTSTTGVTGLERSLERSMFLELALAAMDELVKMAQTDEPLWV-RSL 345

Query: 1058 EGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDMFP 879
            EGG+E  N EEY R F PCIG KP+GFVTE+TRETG+VIIN +ALVETLM++ R+ +MFP
Sbjct: 346  EGGREILNLEEYMRTFTPCIGMKPSGFVTESTRETGMVIINSLALVETLMDSNRWAEMFP 405

Query: 878  CVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWAVV 699
            C+IAR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWAVV
Sbjct: 406  CMIARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWAVV 465

Query: 698  DISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPLVN 522
            D+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEYDE+++H++Y+PL+ 
Sbjct: 466  DVSIDTIRETSVAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDESAVHQLYRPLLG 525

Query: 521  AGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTG-ITQSGRKSMLKLAQRMTQNFCAG 348
            +G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T  IT  GR+SMLKLAQRMT NFCAG
Sbjct: 526  SGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAAITAGGRRSMLKLAQRMTDNFCAG 585

Query: 347  VCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFLR 168
            VC S+VHKWN+L AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFLR
Sbjct: 586  VCASTVHKWNKLCAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFLR 645

Query: 167  DERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            DERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 646  DERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 700


>XP_017631289.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like
            [Gossypium arboreum] KHG16285.1 Homeobox-leucine zipper
            ANTHOCYANINLESS 2 -like protein [Gossypium arboreum]
          Length = 820

 Score =  923 bits (2386), Expect = 0.0
 Identities = 477/721 (66%), Positives = 559/721 (77%), Gaps = 11/721 (1%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGV--ISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXX 1962
            R+V+DIP  NNM TG   I+Q R+++ +       P  +F SPGLSLALQ N +D     
Sbjct: 19   RMVADIPYSNNMATGATAIAQPRLMSPS------LPKNIFNSPGLSLALQPN-IDNQGDH 71

Query: 1961 GDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYH 1785
            G R                         E+ESRSGSDNMDGA+ DDQD  DKPPRKKRYH
Sbjct: 72   GSRIMRESLEGSVGRRSREE--------EHESRSGSDNMDGASGDDQDAADKPPRKKRYH 123

Query: 1784 RHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENS 1605
            RHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS
Sbjct: 124  RHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENS 183

Query: 1604 ILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCA 1425
            +LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCA
Sbjct: 184  LLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCA 243

Query: 1424 LAGKFLGRPVS----SIGDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXX 1260
            LAGKFLGRP+S    SI  PLP  S+++G G +                  +        
Sbjct: 244  LAGKFLGRPISTLATSIAPPLPNSSLELGVGSN--------------GFGALSTVATTLP 289

Query: 1259 XXNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEP 1080
                F               P +  +  +G+DRSVE SM+L+LA+AAM ELV+MAQT+EP
Sbjct: 290  LGPDF-----GGGMSNALVPPSRPTTAVTGLDRSVERSMFLELALAAMNELVKMAQTDEP 344

Query: 1079 LWIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAK 900
            LWI  S EGG+E  N++EY R F PCIG K NGFVTEA+RE+G+VIIN +ALVETLM++ 
Sbjct: 345  LWI-RSLEGGREILNQDEYLRTFTPCIGMKSNGFVTEASRESGMVIINSLALVETLMDSN 403

Query: 899  RYVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHA 720
            R+ +MFPC+IAR +TTDVIS GVGGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHA
Sbjct: 404  RWSEMFPCMIARTSTTDVISGGVGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHA 463

Query: 719  EGVWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHR 543
            EGVWAVVD+SVD  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H+
Sbjct: 464  EGVWAVVDVSVDTIRETSGAPSFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQ 523

Query: 542  IYQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMT 366
            +Y PL+ +G+ FGAQ+W+ATLQRQCEC+AILMS+ +P RD TGIT SGR+SMLKLAQRMT
Sbjct: 524  LYHPLLRSGMAFGAQRWVATLQRQCECLAILMSSSVPTRDHTGITASGRRSMLKLAQRMT 583

Query: 365  QNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQK 186
             NFCAGVC S+VHKWN+L+ GNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+
Sbjct: 584  DNFCAGVCASTVHKWNKLNVGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQR 643

Query: 185  LFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQET 6
            LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQET
Sbjct: 644  LFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQET 703

Query: 5    C 3
            C
Sbjct: 704  C 704


>XP_012489878.1 PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2
            [Gossypium raimondii] KJB41233.1 hypothetical protein
            B456_007G096000 [Gossypium raimondii]
          Length = 820

 Score =  920 bits (2377), Expect = 0.0
 Identities = 471/716 (65%), Positives = 557/716 (77%), Gaps = 6/716 (0%)
 Frame = -2

Query: 2132 RVVSDIPL-NNMPTGV--ISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXX 1962
            R+V+DIP  NNM TG   I+Q R+++ +       P  +F SPGLSLALQ N +D     
Sbjct: 19   RMVADIPYSNNMATGATAIAQPRLMSPS------LPKNIFNSPGLSLALQPN-IDNQGDH 71

Query: 1961 GDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYH 1785
            G R                         E+ESRSGSDNMDGA+ DDQD  DKPPRKKRYH
Sbjct: 72   GSRIMRESLEGSVGRRSREE--------EHESRSGSDNMDGASGDDQDAADKPPRKKRYH 123

Query: 1784 RHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENS 1605
            RHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ERHENS
Sbjct: 124  RHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLERHENS 183

Query: 1604 ILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCA 1425
            +LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEEQHLR+ENARLK+ELDRVCA
Sbjct: 184  LLRQENDKLRAENMSIRDAMRNPICTNCGGPAIIGDISLEEQHLRIENARLKDELDRVCA 243

Query: 1424 LAGKFLGRPVSSIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXXNQF 1245
            LAGKFLGRP+S++   + P +   + L+L V                          N  
Sbjct: 244  LAGKFLGRPISTLATSIAPPLP-NSSLELGVGSNGFGALSTVATTLPLAPDFGGGMSNAL 302

Query: 1244 RXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLWIPS 1065
                             +  +  +G+DRSVE SM+L+LA+AAM+ELV+MAQT+EPLWI  
Sbjct: 303  -------------IPASRPTTAVTGLDRSVERSMFLELALAAMDELVKMAQTDEPLWI-R 348

Query: 1064 SFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRYVDM 885
            S EGG+E  N++EY R F PCIG K NGFVTEA+RE+G+VIIN +ALVETLM++ R+ +M
Sbjct: 349  SLEGGREILNQDEYLRTFTPCIGMKSNGFVTEASRESGMVIINSLALVETLMDSNRWSEM 408

Query: 884  FPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEGVWA 705
            FPC+IAR +TTDVISSGVGGTRNGALQLMHAE QVLSPLVP+REVNFLRFCKQHAEGVWA
Sbjct: 409  FPCMIARTSTTDVISSGVGGTRNGALQLMHAELQVLSPLVPVREVNFLRFCKQHAEGVWA 468

Query: 704  VVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIYQPL 528
            VVD+S++  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEY+E+ +H++Y PL
Sbjct: 469  VVDVSIETIRETSGAPSFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYEESQVHQLYHPL 528

Query: 527  VNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQNFCA 351
            + +G+ FGAQ+W+ATLQRQCEC+AILMS+ +P RD TGIT SGR+SMLKLAQRMT NFCA
Sbjct: 529  LRSGMAFGAQRWVATLQRQCECLAILMSSSVPTRDHTGITASGRRSMLKLAQRMTDNFCA 588

Query: 350  GVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLFDFL 171
            GVC S+VHKWN+L+ GNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LFDFL
Sbjct: 589  GVCASTVHKWNKLNVGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLFDFL 648

Query: 170  RDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            RDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 649  RDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSMLILQETC 704


>GAV85584.1 Homeobox domain-containing protein/START domain-containing protein
            [Cephalotus follicularis]
          Length = 827

 Score =  919 bits (2376), Expect = 0.0
 Identities = 474/727 (65%), Positives = 562/727 (77%), Gaps = 17/727 (2%)
 Frame = -2

Query: 2132 RVVSDIPL-------NNMPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDX 1974
            R+V+DIP        NNMPTG I+Q R+V+H+          +F SPGLSLALQ  N+D 
Sbjct: 17   RIVADIPYKNHNNSNNNMPTGAIAQPRLVSHS------LAKNMFNSPGLSLALQQPNIDN 70

Query: 1973 XXXXGDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRK 1797
                                           +E+ESRSGSDNMDG + DDQD  D PPRK
Sbjct: 71   QGDVT----------RMAENFEASIGRRSREEEHESRSGSDNMDGGSGDDQDAADNPPRK 120

Query: 1796 KRYHRHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIER 1617
            KRYHRHTPQQIQELEALFKECPHPDEKQR ELS+RL LE RQVKFWFQNRRTQMKTQ+ER
Sbjct: 121  KRYHRHTPQQIQELEALFKECPHPDEKQRLELSKRLCLETRQVKFWFQNRRTQMKTQLER 180

Query: 1616 HENSILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELD 1437
            HENS+LRQENDKLRAENM++R+AMRNP+C NCGG A++GD+SLEE+HLR+ENARLK+ELD
Sbjct: 181  HENSLLRQENDKLRAENMSIRDAMRNPICTNCGGPAIMGDISLEEEHLRIENARLKDELD 240

Query: 1436 RVCALAGKFLGRPV----SSIGDPLP-PSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXX 1272
            RVCALAGKFLGRP+    +SIG P+P  S+++G G +                 G+    
Sbjct: 241  RVCALAGKFLGRPIPPLTASIGPPMPNSSLELGVGSN--------------GFGGLGSVP 286

Query: 1271 XXXXXXNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQ 1092
                    F               P++  +  +G+DRS+E SM+L+LA+AAM+ELV+MAQ
Sbjct: 287  STLPLGPDF---GGGMSNSLSVVPPNRSGTGVTGLDRSIERSMFLELALAAMDELVKMAQ 343

Query: 1091 TNEPLWIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETL 912
            + EPLWI  S EGG+E  N EEY R F PCIG KP+GFVTEA+RETG+VIIN +ALVETL
Sbjct: 344  SEEPLWI-RSLEGGREILNPEEYLRTFTPCIGLKPHGFVTEASRETGMVIINSLALVETL 402

Query: 911  MEAKRYVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFC 732
            M++ R+ +MFPC+IAR  TTDVISSG+GGTRNG+LQLMHAE QVLSPLVP+REVNFLRFC
Sbjct: 403  MDSNRWAEMFPCMIARTTTTDVISSGMGGTRNGSLQLMHAELQVLSPLVPVREVNFLRFC 462

Query: 731  KQHAEGVWAVVDISVDANQENSDTSK-YLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDE 558
            KQHAEGVWAVVD+S+D  +E S  +  YL+ RRLPSGCV+QDMPNG  KVTWVEHAEYD+
Sbjct: 463  KQHAEGVWAVVDVSIDTIREASPGAPTYLNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDD 522

Query: 557  NSIHRIYQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTG-ITQSGRKSMLK 384
            + +H++Y+PL+  G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T  I+ SGR+SMLK
Sbjct: 523  SQVHQLYRPLLGCGMGFGAQRWVATLQRQCECLAILMSSAVPSRDHTAAISASGRRSMLK 582

Query: 383  LAQRMTQNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWL 204
            LAQRMT NFCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWL
Sbjct: 583  LAQRMTDNFCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWL 642

Query: 203  PISPQKLFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSM 24
            P+SPQ+LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSM
Sbjct: 643  PVSPQRLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDHGNCVSLLRASAMNANQSSM 702

Query: 23   LILQETC 3
            LILQETC
Sbjct: 703  LILQETC 709


>OAY62348.1 hypothetical protein MANES_01G261300 [Manihot esculenta]
          Length = 818

 Score =  918 bits (2372), Expect = 0.0
 Identities = 466/719 (64%), Positives = 555/719 (77%), Gaps = 9/719 (1%)
 Frame = -2

Query: 2132 RVVSDIPLNN--MPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTNNMDXXXXXG 1959
            R+V+DI  ++  MP G I+Q R+++H+         ++F SPGLSLALQ  N+D      
Sbjct: 18   RIVADIAYSSSSMPAGAIAQPRLISHS------LTKSMFNSPGLSLALQQPNIDGQGDVP 71

Query: 1958 DRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT-DKPPRKKRYHR 1782
                                      +E+ESRSGSDN+DGA+ DDQD  D  PRKKRYHR
Sbjct: 72   ----------RMVENFEPNGARRSREEEHESRSGSDNLDGASGDDQDAADNRPRKKRYHR 121

Query: 1781 HTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQMKTQIERHENSI 1602
            HTPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQMKTQ+ERHEN++
Sbjct: 122  HTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMKTQLERHENTL 181

Query: 1601 LRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENARLKEELDRVCAL 1422
            LRQENDKLRAENM++R+AMRNP+C+NCGG A++GD+SLEEQHL +ENARLKEELDRVCAL
Sbjct: 182  LRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGDISLEEQHLTIENARLKEELDRVCAL 241

Query: 1421 AGKFLGRPVS----SIGDPLPPSMQIGAGLDLAVXXXXXXXXXXXXXXGVXXXXXXXXXX 1254
            AGKFLGRP+S    SIG P+P S      L+L V                          
Sbjct: 242  AGKFLGRPISLLANSIGPPMPNS-----SLELGVGNNGFGCLSNAAATVPLGPDFSNALP 296

Query: 1253 NQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEELVRMAQTNEPLW 1074
               +                   +  +G DRS+E SM+L+LA+AAM+ELV++AQT+EPLW
Sbjct: 297  VVTQT--------------RPPTASMTGFDRSLERSMFLELALAAMDELVKLAQTDEPLW 342

Query: 1073 IPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGVALVETLMEAKRY 894
               S EGG+E  N EEY R+F PCIG KP+GFV+EA+RETG+VIING+ALVETLM++ R+
Sbjct: 343  F-RSLEGGREVLNHEEYMRIFTPCIGMKPSGFVSEASRETGMVIINGLALVETLMDSNRW 401

Query: 893  VDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREVNFLRFCKQHAEG 714
             +MFPC+IAR +TTDVIS+G+GGTRNG+LQLMHAE Q LSPLVP+REVNFLRFCKQHAEG
Sbjct: 402  AEMFPCMIARTSTTDVISTGMGGTRNGSLQLMHAELQALSPLVPVREVNFLRFCKQHAEG 461

Query: 713  VWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEHAEYDENSIHRIY 537
            VWAVVD+S+D  +E S    +++ RRLPSGCV+QDMPNG  KVTWVEHAEYDE  IH++Y
Sbjct: 462  VWAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNGYSKVTWVEHAEYDETQIHQLY 521

Query: 536  QPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRKSMLKLAQRMTQN 360
            +PL+++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+SMLKLAQRMT N
Sbjct: 522  RPLISSGMGFGAQRWVATLQRQCECLAILMSSTVPTRDHTAITSSGRRSMLKLAQRMTDN 581

Query: 359  FCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAATSVWLPISPQKLF 180
            FCAGVC S+VHKWN+L+AGNVD DVRVMTRKS+DDPGEPPG+VLSAATSVWLP+SPQ+LF
Sbjct: 582  FCAGVCASTVHKWNKLNAGNVDEDVRVMTRKSVDDPGEPPGIVLSAATSVWLPVSPQRLF 641

Query: 179  DFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNASQSSMLILQETC 3
            DFLRDERLRSEWDILSNGGPMQ+M+HI KGQD GNCVSLLRA AMNA+QSSMLILQETC
Sbjct: 642  DFLRDERLRSEWDILSNGGPMQEMAHIAKGQDQGNCVSLLRASAMNANQSSMLILQETC 700


>XP_007220256.1 hypothetical protein PRUPE_ppa001436mg [Prunus persica] ONI22884.1
            hypothetical protein PRUPE_2G156900 [Prunus persica]
          Length = 829

 Score =  918 bits (2372), Expect = 0.0
 Identities = 471/731 (64%), Positives = 564/731 (77%), Gaps = 21/731 (2%)
 Frame = -2

Query: 2132 RVVSDIPLNN---------MPTGVISQSRVVNHNHHHPYHKPAAVFKSPGLSLALQTN-- 1986
            R+V+DI  NN         MP+  ++Q R+V  +         ++F SPGLSLALQTN  
Sbjct: 18   RIVADISYNNTSSSTHSNNMPSSALAQPRLVTQS------LTKSMFNSPGLSLALQTNAD 71

Query: 1985 NMDXXXXXGDRFXXXXXXXXXXXXXXXXXXXXXXXDEYESRSGSDNMDGANSDDQDT--- 1815
                     + F                        E+ESRSGSDNMDG + DDQD    
Sbjct: 72   GQGDVTRMAENFETNVGRRSREE-------------EHESRSGSDNMDGGSGDDQDAADN 118

Query: 1814 DKPPRKKRYHRHTPQQIQELEALFKECPHPDEKQRNELSRRLVLEARQVKFWFQNRRTQM 1635
              P +KKRYHRHTPQQIQELEALFKECPHPDEKQR ELSRRL LE RQVKFWFQNRRTQM
Sbjct: 119  TNPRKKKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQM 178

Query: 1634 KTQIERHENSILRQENDKLRAENMNVREAMRNPMCNNCGGAAVLGDVSLEEQHLRVENAR 1455
            KTQ+ERHENS+LRQENDKLRAENM++R+AMRNP+C+NCGG A++G++SLEEQHLR+ENAR
Sbjct: 179  KTQLERHENSLLRQENDKLRAENMSIRDAMRNPICSNCGGPAIIGEISLEEQHLRIENAR 238

Query: 1454 LKEELDRVCALAGKFLGRPVSSI----GDPLPPS-MQIGAGLDLAVXXXXXXXXXXXXXX 1290
            LK+ELDRVCALAGKFLGRP+SS+    G PLP S +++G G +                 
Sbjct: 239  LKDELDRVCALAGKFLGRPISSLATSMGPPLPSSTLELGVGSN--------------GFG 284

Query: 1289 GVXXXXXXXXXXNQFRXXXXXXXXXXXXXVPHQQQSISSGVDRSVENSMYLDLAMAAMEE 1110
            G+            F              VPH + S++ G+DRS+E SM+L+LA+AAM+E
Sbjct: 285  GLSSVATSMPVGPDF----GGGIGSAMSVVPHSRPSVT-GLDRSMERSMFLELALAAMDE 339

Query: 1109 LVRMAQTNEPLWIPSSFEGGKETFNREEYTRMFAPCIGSKPNGFVTEATRETGVVIINGV 930
            LV++AQT+EPLW+  S EGG+E  N EEY R F PCIG KPNGFVTEA+RETG+VIIN +
Sbjct: 340  LVKLAQTDEPLWL-RSLEGGREVLNHEEYMRSFTPCIGLKPNGFVTEASRETGMVIINSL 398

Query: 929  ALVETLMEAKRYVDMFPCVIARCNTTDVISSGVGGTRNGALQLMHAEFQVLSPLVPIREV 750
            ALVETLME+ R+++MFPC++AR +TTDVISSG+GGTRNGALQLMHAE QVLSPLVP+REV
Sbjct: 399  ALVETLMESNRWLEMFPCLVARTSTTDVISSGMGGTRNGALQLMHAELQVLSPLVPVREV 458

Query: 749  NFLRFCKQHAEGVWAVVDISVDANQENSDTSKYLS-RRLPSGCVIQDMPNGCCKVTWVEH 573
            NFLRFCKQHAEGVWAVVD+SVD  ++ S    +++ RRLPSGCV+QDMPNG  KVTWVEH
Sbjct: 459  NFLRFCKQHAEGVWAVVDVSVDTIRDTSGAPTFMNCRRLPSGCVVQDMPNGYSKVTWVEH 518

Query: 572  AEYDENSIHRIYQPLVNAGLGFGAQKWIATLQRQCECIAILMSN-MPIRDQTGITQSGRK 396
            AEYDE+ +H++Y+P++++G+GFGAQ+W+ATLQRQCEC+AILMS+ +P RD T IT SGR+
Sbjct: 519  AEYDESQVHQLYRPMLSSGMGFGAQRWVATLQRQCECLAILMSSSVPTRDHTAITASGRR 578

Query: 395  SMLKLAQRMTQNFCAGVCPSSVHKWNRLSAGNVDADVRVMTRKSIDDPGEPPGVVLSAAT 216
            SMLKLAQRMT NFCAGVC S+VHKWN+L+A NVD DVRVMTR+S+DDPGEPPG+VLSAAT
Sbjct: 579  SMLKLAQRMTDNFCAGVCASTVHKWNKLNARNVDEDVRVMTRESLDDPGEPPGIVLSAAT 638

Query: 215  SVWLPISPQKLFDFLRDERLRSEWDILSNGGPMQQMSHITKGQDPGNCVSLLRAGAMNAS 36
            SVWLP+SPQ+LFDFLRDERLRSEWDILSNGGPMQ+M+HI KGQDPGNCVSLLRA AMNA+
Sbjct: 639  SVWLPVSPQRLFDFLRDERLRSEWDILSNGGPMQEMAHIAKGQDPGNCVSLLRARAMNAN 698

Query: 35   QSSMLILQETC 3
            QSSMLILQETC
Sbjct: 699  QSSMLILQETC 709

