BLASTX nr result

ID: Lithospermum23_contig00006151 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00006151
         (3267 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

CDO97241.1 unnamed protein product [Coffea canephora]                 504   e-159
XP_011088187.1 PREDICTED: pathogenesis-related homeodomain prote...   482   e-151
XP_011088190.1 PREDICTED: pathogenesis-related homeodomain prote...   456   e-143
XP_012836886.1 PREDICTED: homeobox protein HAT3.1 [Erythranthe g...   453   e-141
XP_006346339.1 PREDICTED: pathogenesis-related homeodomain prote...   436   e-134
XP_019256173.1 PREDICTED: pathogenesis-related homeodomain prote...   431   e-133
NP_001308700.1 PHD-finger family homeodomain protein [Solanum ly...   431   e-133
XP_016539473.1 PREDICTED: pathogenesis-related homeodomain prote...   432   e-132
XP_015062395.1 PREDICTED: pathogenesis-related homeodomain prote...   431   e-132
XP_019170136.1 PREDICTED: homeobox protein HOX1A-like isoform X2...   436   e-132
XP_009592467.1 PREDICTED: pathogenesis-related homeodomain prote...   423   e-130
XP_016478781.1 PREDICTED: pathogenesis-related homeodomain prote...   422   e-130
XP_009775281.1 PREDICTED: pathogenesis-related homeodomain prote...   419   e-129
OMO73948.1 hypothetical protein CCACVL1_17051 [Corchorus capsula...   418   e-126
XP_003555282.1 PREDICTED: homeobox protein HAT3.1-like isoform X...   414   e-126
XP_006589630.1 PREDICTED: homeobox protein HAT3.1 [Glycine max] ...   407   e-123
ONH91822.1 hypothetical protein PRUPE_8G137800 [Prunus persica] ...   413   e-123
XP_018809403.1 PREDICTED: homeobox protein HAT3.1 [Juglans regia]     416   e-123
KHN06779.1 Homeobox protein HAT3.1 [Glycine soja]                     407   e-123
XP_008338253.1 PREDICTED: homeobox protein HAT3.1-like isoform X...   409   e-122

>CDO97241.1 unnamed protein product [Coffea canephora]
          Length = 881

 Score =  504 bits (1298), Expect = e-159
 Identities = 285/627 (45%), Positives = 365/627 (58%), Gaps = 17/627 (2%)
 Frame = +1

Query: 778  SRKRRNPTVETTPVSARVLRSRSKDKPEAPVPCNIAEQVPDAVEXXXXXXXXXXXXPPVS 957
            SRKR++ +  T PV+ARVLRSRS++K +     ++ E                    PV+
Sbjct: 190  SRKRKSTS--TIPVTARVLRSRSQEKSKESEKKDVVEDAATEAYRRKRGKKKQRRNIPVN 247

Query: 958  EFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIR 1137
            EFSR+R HLRYLLHRI  EQNLIDAYS EGW+ QSLEKIKPEKELQ+AKS I RYKLKIR
Sbjct: 248  EFSRIRAHLRYLLHRIKYEQNLIDAYSGEGWKGQSLEKIKPEKELQRAKSQIFRYKLKIR 307

Query: 1138 DLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQ 1317
            DLF+++DL L EG++PESLFDS+G+IDSEDIFCAKCGSKD++LDNDIILCDG+CERGFHQ
Sbjct: 308  DLFRQIDLLLAEGKLPESLFDSDGQIDSEDIFCAKCGSKDLTLDNDIILCDGACERGFHQ 367

Query: 1318 FCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPE-ADAAA 1494
            FCLEPPL KEDIP   EGWLCPGCDCKVDCI+LL+DF G  LSVLD WE VFPE A AAA
Sbjct: 368  FCLEPPLLKEDIPPDEEGWLCPGCDCKVDCIELLSDFQGSNLSVLDKWEKVFPEEAAAAA 427

Query: 1495 TGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPGF 1674
            +G  ++D S L             + EV+  +  +E SSDESDY SAS     +      
Sbjct: 428  SGMKMDDYSGLPSDDSDDDDYDPDKPEVDNMVLGEESSSDESDYFSASEEPVSAVKA--- 484

Query: 1675 PAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDNQ 1854
               E  +GL                                     GA+  +    G+  
Sbjct: 485  ---EQILGLPSDDSEDDDFDPSAADHGELAKQESSSSDFSSDSEDFGAMFHEKEPLGEEA 541

Query: 1855 GQSKSVGEELEVDVWV----------NTRSLKDEISYVLNSNDSPISAKRNVERLDYKKL 2004
            G   SV  +  + V               SL DE+S++L SND+P+S KR+VERLDYKKL
Sbjct: 542  GHVSSVSTQSNLAVGSIGPIFKVGRDKRHSLSDELSFLLESNDAPVSGKRHVERLDYKKL 601

Query: 2005 HDETYGNXXXXXXXXXXXXNVGNKRRKNVSGR---ISVXXXXXXXXXXDIRPEDGDQE-- 2169
            H+ETYG+             VG +RRK  +G+   +            DI+ E+ +Q+  
Sbjct: 602  HEETYGDTSSDSSDEDYGETVGPRRRKKSTGKAILVPSNEPETIHKGADIKDENCNQKDF 661

Query: 2170 ERRSVKQASKQFDTEYGNKF-LTSHASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQN 2346
            E   V++ +K+F+ E  N   + S   S+E G   +   +P+Q+LG+ + QRL +SF++N
Sbjct: 662  EMTPVEKINKKFEIEGSNNMSVDSPRISTEGGSSGKRTGRPYQRLGDGIVQRLLESFREN 721

Query: 2347 QYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSATMVXXXXXXXXXXXXXXLLAKDK 2526
            QYPK   KE+LA++LGL IQQVSKWFENARWS RHS+ M                   +K
Sbjct: 722  QYPKNGVKESLAKELGLRIQQVSKWFENARWSCRHSSRMDSKMTGTTSINGTCLPEINEK 781

Query: 2527 LKNKGQVVDTEVVLSNDNSIAASPVTN 2607
            +   G+  + E    N+    A P TN
Sbjct: 782  VPKHGEQSNLESATCNEEGKMALPQTN 808


>XP_011088187.1 PREDICTED: pathogenesis-related homeodomain protein isoform X1
            [Sesamum indicum] XP_011088188.1 PREDICTED:
            pathogenesis-related homeodomain protein isoform X1
            [Sesamum indicum]
          Length = 835

 Score =  482 bits (1241), Expect = e-151
 Identities = 272/584 (46%), Positives = 353/584 (60%), Gaps = 15/584 (2%)
 Frame = +1

Query: 745  DSAQPEMEDIG----SRKRRNPTVETTPVSARVLRSRSKDKPEAPVPC-NIAEQVPDAVE 909
            +S Q   ED G    SRKR+   +++   S+ VLRS+S++KP+AP P  N+ E   +  +
Sbjct: 151  NSGQLGTEDRGCSVQSRKRK-AGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEK 209

Query: 910  XXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKE 1089
                          V+EFSR + HLRYLLHRI  EQ+LIDAYS+EGW+ QSL+K+KPEKE
Sbjct: 210  KKRGRKKKPMQKTTVNEFSRTKTHLRYLLHRIKYEQSLIDAYSAEGWKGQSLDKLKPEKE 269

Query: 1090 LQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLD 1269
            LQ+AKSHILRYKLKIR L QRLD+SL  G++PESLFDS GEIDSEDIFCAKCGSKD+ LD
Sbjct: 270  LQRAKSHILRYKLKIRALIQRLDMSLAVGKLPESLFDSHGEIDSEDIFCAKCGSKDLPLD 329

Query: 1270 NDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSV 1449
            NDIILCDG+CERGFHQFCLEPPL KEDIP G+EGW+CPGCDCK+DCID+L DF G K+S 
Sbjct: 330  NDIILCDGACERGFHQFCLEPPLLKEDIPPGDEGWICPGCDCKIDCIDMLKDFQGTKISH 389

Query: 1450 LDSWENVFPEADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYV 1629
             DSWE +FPEA AAA+GKTL++GS               + +  E +   E SSDES+Y 
Sbjct: 390  TDSWEKIFPEAAAAASGKTLDNGSGSSSDDSDDDDYDPDKPDAVEKVEGDESSSDESNYF 449

Query: 1630 SASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1809
            SAS+  + S         E ++GL                                    
Sbjct: 450  SASDDLAASLN------NEKYLGLPSDDSEDDDFDPSALDPDKQAEQESSSSDFTSDSED 503

Query: 1810 LGAVLQD--------NTSEGDNQGQSKSVGEELEVDVWVNTR-SLKDEISYVLNSNDSPI 1962
            LGA+L D        + S    Q QS +  +E  V V    R SLKDE+SY+L ++  P+
Sbjct: 504  LGALLDDTEAGEDLGHISPSSYQNQSSTGSKEENVKVGGTKRQSLKDELSYLLETSGEPV 563

Query: 1963 SAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISV-XXXXXXXXXX 2139
            S +R+VER DYK LHDETYGN                KRR+    +  V           
Sbjct: 564  SGRRHVERWDYKSLHDETYGNSSSDSSDEDFVDTTAPKRRRIDREKTEVTSPNKTPITEN 623

Query: 2140 DIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQKLGEAVTQ 2319
            +++ +D +Q+E + +++ +++      N   T  +SS         +   +++LGEA+TQ
Sbjct: 624  NMKAKDENQKESKHLRERTRK------NIGDTIESSSKVGSASTGTKRSANKRLGEAITQ 677

Query: 2320 RLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRH 2451
            RL+ SF +NQYP+R  KENLA++LGL IQQVSKWFENARWS +H
Sbjct: 678  RLYASFNENQYPERAVKENLAKELGLKIQQVSKWFENARWSFQH 721


>XP_011088190.1 PREDICTED: pathogenesis-related homeodomain protein isoform X2
            [Sesamum indicum]
          Length = 715

 Score =  456 bits (1173), Expect = e-143
 Identities = 260/572 (45%), Positives = 342/572 (59%), Gaps = 15/572 (2%)
 Frame = +1

Query: 745  DSAQPEMEDIG----SRKRRNPTVETTPVSARVLRSRSKDKPEAPVPC-NIAEQVPDAVE 909
            +S Q   ED G    SRKR+   +++   S+ VLRS+S++KP+AP P  N+ E   +  +
Sbjct: 151  NSGQLGTEDRGCSVQSRKRK-AGLKSPVTSSWVLRSKSQEKPKAPEPNENVKEDSANGEK 209

Query: 910  XXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKE 1089
                          V+EFSR + HLRYLLHRI  EQ+LIDAYS+EGW+ QSL+K+KPEKE
Sbjct: 210  KKRGRKKKPMQKTTVNEFSRTKTHLRYLLHRIKYEQSLIDAYSAEGWKGQSLDKLKPEKE 269

Query: 1090 LQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLD 1269
            LQ+AKSHILRYKLKIR L QRLD+SL  G++PESLFDS GEIDSEDIFCAKCGSKD+ LD
Sbjct: 270  LQRAKSHILRYKLKIRALIQRLDMSLAVGKLPESLFDSHGEIDSEDIFCAKCGSKDLPLD 329

Query: 1270 NDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSV 1449
            NDIILCDG+CERGFHQFCLEPPL KEDIP G+EGW+CPGCDCK+DCID+L DF G K+S 
Sbjct: 330  NDIILCDGACERGFHQFCLEPPLLKEDIPPGDEGWICPGCDCKIDCIDMLKDFQGTKISH 389

Query: 1450 LDSWENVFPEADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYV 1629
             DSWE +FPEA AAA+GKTL++GS               + +  E +   E SSDES+Y 
Sbjct: 390  TDSWEKIFPEAAAAASGKTLDNGSGSSSDDSDDDDYDPDKPDAVEKVEGDESSSDESNYF 449

Query: 1630 SASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1809
            SAS+  + S         E ++GL                                    
Sbjct: 450  SASDDLAASLN------NEKYLGLPSDDSEDDDFDPSALDPDKQAEQESSSSDFTSDSED 503

Query: 1810 LGAVLQD--------NTSEGDNQGQSKSVGEELEVDVWVNTR-SLKDEISYVLNSNDSPI 1962
            LGA+L D        + S    Q QS +  +E  V V    R SLKDE+SY+L ++  P+
Sbjct: 504  LGALLDDTEAGEDLGHISPSSYQNQSSTGSKEENVKVGGTKRQSLKDELSYLLETSGEPV 563

Query: 1963 SAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISV-XXXXXXXXXX 2139
            S +R+VER DYK LHDETYGN                KRR+    +  V           
Sbjct: 564  SGRRHVERWDYKSLHDETYGNSSSDSSDEDFVDTTAPKRRRIDREKTEVTSPNKTPITEN 623

Query: 2140 DIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQKLGEAVTQ 2319
            +++ +D +Q+E + +++ +++      N   T  +SS         +   +++LGEA+TQ
Sbjct: 624  NMKAKDENQKESKHLRERTRK------NIGDTIESSSKVGSASTGTKRSANKRLGEAITQ 677

Query: 2320 RLFDSFKQNQYPKRPEKENLARDLGLSIQQVS 2415
            RL+ SF +NQYP+R  KENLA++LGL IQQ++
Sbjct: 678  RLYASFNENQYPERAVKENLAKELGLKIQQIT 709


>XP_012836886.1 PREDICTED: homeobox protein HAT3.1 [Erythranthe guttata]
            XP_012836887.1 PREDICTED: homeobox protein HAT3.1
            [Erythranthe guttata] EYU37611.1 hypothetical protein
            MIMGU_mgv1a001571mg [Erythranthe guttata] EYU37612.1
            hypothetical protein MIMGU_mgv1a001571mg [Erythranthe
            guttata]
          Length = 793

 Score =  453 bits (1165), Expect = e-141
 Identities = 279/673 (41%), Positives = 370/673 (54%), Gaps = 47/673 (6%)
 Frame = +1

Query: 574  AYKPSKLIEDEDDGMQHHANLESPLPESPNHQHL-QPIMANASNAQLGEEETSPL-LNCD 747
            A K   L+E+ ++ +    N E     S NH++L  P+ A + +   G+ E   +    D
Sbjct: 88   AEKQEPLLENVEE-LPGFENTEVASNGSTNHENLGTPLGAASDDPNCGKVEPVQIDFTID 146

Query: 748  SAQPEMED---IGSRKRRNPTVETTPVSARVLRSRSKDKPEAPVPCNI--------AEQV 894
            S Q + ED    G  ++R   V+   +S+  LRS+S+++P+AP P           A++ 
Sbjct: 147  SGQIDNEDGAASGQSRKRKSRVKGPVISSWSLRSKSQERPKAPEPDETVKADETVKADET 206

Query: 895  PDAVEXXXXXXXXXXXXPP------------VSEFSRMRNHLRYLLHRITIEQNLIDAYS 1038
              A E                          V+E+SR R HLRYLLHRI  EQ+LIDAY 
Sbjct: 207  VKADETVKAGSSNGEKKKKGRKKKQVKNNTTVNEYSRTRTHLRYLLHRIKYEQSLIDAYC 266

Query: 1039 SEGWRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEID 1218
            +EGW+ QSLEK+KPEKELQ+AKSHILRYKL+IR LF+ LDLSL  G++P SLFDS+GEID
Sbjct: 267  TEGWKGQSLEKLKPEKELQRAKSHILRYKLRIRALFENLDLSLAVGKLPTSLFDSQGEID 326

Query: 1219 SEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCK 1398
            SEDIFCAKCGSK++ LDNDIILCDG+CERGFHQFCL+PPL KE IP G+EGWLCPGCDCK
Sbjct: 327  SEDIFCAKCGSKELPLDNDIILCDGACERGFHQFCLDPPLLKEQIPPGDEGWLCPGCDCK 386

Query: 1399 VDCIDLLNDFNGVKLSVLDSWENVFPEADAAATGKTLEDGSALXXXXXXXXXXXXXRTEV 1578
            VDCID+L DF G K+S+LDSWE +FPEA AAA+GK L+D S               + + 
Sbjct: 387  VDCIDMLKDFQGTKISILDSWEKIFPEAAAAASGKKLDDCSGSSSDDAEDDDYDPDKPDA 446

Query: 1579 EEDLADQ----------EKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXX 1728
            +E+  D+          E SSDESDY SAS+  +           + + GL         
Sbjct: 447  DENNVDENNADEKVEGDESSSDESDYFSASDGVAAPLNN------DKYEGLPSEDSEDDD 500

Query: 1729 XXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSE-GDNQGQSK--------SVGEE 1881
                                       L A+L++N +E G + GQ+         S  E 
Sbjct: 501  FDPSAPDEDEQVKQDSSGSDFTSDSEDLDALLEENATEPGQDPGQTADQKQPSTGSNDEN 560

Query: 1882 LEVDVWVNTRSLKDEISYVLNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXX 2061
             +V     T SLKDE+ Y++ ++  P++ KR V+RLDYKKL DETYGN            
Sbjct: 561  PKVGRMKRT-SLKDELVYLMETDAQPVAGKRQVKRLDYKKLLDETYGNASSDSSDEDFDD 619

Query: 2062 NVGNKRRK---NVSGRISVXXXXXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFL 2232
                KRRK     S R S           +   E+    +R S +   K  D        
Sbjct: 620  GTTRKRRKIDPEKSERKSRDKTPITKSNTNTTDENQKASKRSSKRPRKKVADGG------ 673

Query: 2233 TSHASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQV 2412
             ++ S +  G     + +P ++LGEA TQRL+ SF +NQYP+R  KENLA +LG++++QV
Sbjct: 674  -TNESPANNGSSTTSKKRPLKRLGEATTQRLYVSFSENQYPQRAAKENLANELGITVRQV 732

Query: 2413 SKWFENARWSSRH 2451
            SKWFENARWS  H
Sbjct: 733  SKWFENARWSYNH 745


>XP_006346339.1 PREDICTED: pathogenesis-related homeodomain protein [Solanum
            tuberosum] XP_006346341.1 PREDICTED: pathogenesis-related
            homeodomain protein [Solanum tuberosum] XP_006346342.1
            PREDICTED: pathogenesis-related homeodomain protein
            [Solanum tuberosum]
          Length = 798

 Score =  436 bits (1122), Expect = e-134
 Identities = 265/600 (44%), Positives = 338/600 (56%), Gaps = 16/600 (2%)
 Frame = +1

Query: 706  QLGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPVSA-RVLRSRSKDKPEAPVPCNI 882
            Q GE   + + N + ++   +  G  ++R  ++  +P+S+ R+LRS+SK+K  A    N 
Sbjct: 38   QSGEACENAVQNLNQSEYREKTPGQPRKRK-SISGSPISSTRLLRSKSKEKSGAS-EANN 95

Query: 883  AEQVPDAVEXXXXXXXXXXXXP--PVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRK 1056
                 DA E                V+EF+R+R HLRYLL RIT EQ LI+AYS EGW+ 
Sbjct: 96   TVVTHDATEEKKRKRRKKKHSKHIAVNEFTRIRGHLRYLLQRITYEQTLIEAYSGEGWKG 155

Query: 1057 QSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFC 1236
            QSLEKIK EKELQ+AK+HI RYKLKIRDLFQRLD  L EG++P SLFD+EGEIDSEDIFC
Sbjct: 156  QSLEKIKLEKELQRAKTHIFRYKLKIRDLFQRLDTLLAEGRLPASLFDNEGEIDSEDIFC 215

Query: 1237 AKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDL 1416
            AKCGS D+  DNDIILCDG+CERGFHQ C+EPPL KEDIP  +EGWLCPGCDCKVDCIDL
Sbjct: 216  AKCGSMDLPADNDIILCDGACERGFHQLCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDL 275

Query: 1417 LNDFNGVKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLA 1593
            LND  G  LSV DSWE V+P EA AAA+G+ L+D S L               +V ++ +
Sbjct: 276  LNDLQGTDLSVTDSWEKVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPETPDVGKNDS 335

Query: 1594 DQEKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXX 1773
            + E SSDESD+ SAS   + +P     P  +  +G+                        
Sbjct: 336  EDESSSDESDFYSASEDLAEAP-----PKDDEILGISSEDSEDDDFNPDDPDKDEPVKTE 390

Query: 1774 XXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKSVGEELEVDVWVNTR---------SLKDE 1926
                           ++  N  +GD QG S SV   +        +         SLKDE
Sbjct: 391  SSSSDFTSDSEDFNLIVDTNRLQGDEQGVSSSVDNSMPNSASQEEKAKVGKAKGNSLKDE 450

Query: 1927 ISYVLNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRK--NVSGR 2100
            +SY++ S+   +SAKR++ERLDYKKLHDETYGN                K RK  N  G 
Sbjct: 451  LSYLMQSDSPLVSAKRHIERLDYKKLHDETYGNGSSESSDEDYDDGPLPKVRKLRNAKGA 510

Query: 2101 ISVXXXXXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEA-GLGKRH 2277
            ++           DI+ + G Q      K + +  D+    K     A +SE+   GKR 
Sbjct: 511  MT----SPSSTPADIKHQSGKQ------KGSGRASDSGISEKLKVGGAGTSESPSSGKR- 559

Query: 2278 RSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
              K H   GE  T+RL++SFK NQYP R  K  L ++LGL+  QVSKWFENAR   RHS+
Sbjct: 560  --KTH---GEVATKRLYESFKDNQYPDRDAKGKLGKELGLTAYQVSKWFENARHCHRHSS 614


>XP_019256173.1 PREDICTED: pathogenesis-related homeodomain protein [Nicotiana
            attenuata] OIS97315.1 homeobox protein hat3.1 [Nicotiana
            attenuata]
          Length = 747

 Score =  431 bits (1109), Expect = e-133
 Identities = 267/654 (40%), Positives = 340/654 (51%), Gaps = 12/654 (1%)
 Frame = +1

Query: 532  ECKHQPEMKGSPISAYKPSKLIEDEDDGMQHHANLESPLPESPNHQHLQPIMANAS-NAQ 708
            + + Q EM     +A  P K+         H+      + E+P  + L     NA  N  
Sbjct: 4    QLEDQTEMSTLGNTAVSPGKVARTT--ARSHNTASAGKMSENPGVEQLGDACGNAGQNLN 61

Query: 709  LGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPVSARVLRSRSKDKPEAPVPCN-IA 885
            L E        C    P     G  ++R  T  T   S R+LRS+SK+K  A    N + 
Sbjct: 62   LSE--------CQEKTP-----GQPRKRKSTSGTPISSTRLLRSKSKEKSGASEANNTVV 108

Query: 886  EQVPDAVEXXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSL 1065
                +  +              V+EF+ +R HLRYLL RI  EQ LI+AYS EGW+ QSL
Sbjct: 109  THEANEEKKRKRRKKKHSKHIAVNEFTSIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSL 168

Query: 1066 EKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKC 1245
            EKIK EKEL++AK+HI RYKLKIRDLFQR+D  LT+G++PESLFD+EGEIDSEDIFCAKC
Sbjct: 169  EKIKLEKELERAKAHIFRYKLKIRDLFQRVDTLLTQGRLPESLFDNEGEIDSEDIFCAKC 228

Query: 1246 GSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLND 1425
            G+KD+  DNDIILCDG+CERGFHQ CLEPPL KEDIP  +EGWLCPGCDCKVDCIDLLND
Sbjct: 229  GAKDLPADNDIILCDGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLND 288

Query: 1426 FNGVKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQE 1602
              G  LS+ DSWE V+P EA AAA+G+ L+D S L               +VE++ +  E
Sbjct: 289  LQGTNLSITDSWEKVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPENPDVEKNDSGDE 348

Query: 1603 KSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXX 1782
             SSDESD+ SAS      P     P  +  +GL                           
Sbjct: 349  SSSDESDFFSASEDLEEVP-----PKDDELLGLPSEDSEDDDYTPDDPDKDEPVKTESSS 403

Query: 1783 XXXXXXXXXLGAVLQDNTSEGDNQGQSKSVG---------EELEVDVWVNTRSLKDEISY 1935
                     LG ++  N   GD  G S SV          EE          SL DE+S 
Sbjct: 404  SDFTSDSEDLGLIVDTNRLPGDELGVSSSVDNSKHSSASQEEKPKGGRAKRNSLNDELSD 463

Query: 1936 VLNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISVXX 2115
            ++ S+   +S KR++ERLDYKKLHDETYGN            +   K R+  S + ++  
Sbjct: 464  LMQSHSPLVSCKRHIERLDYKKLHDETYGNESSDSSDEDFEGDPLPKVREIRSAKAAMTS 523

Query: 2116 XXXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQ 2295
                           D + +   K+ S+  D     K       +SE      H S   +
Sbjct: 524  P---------NSTPADTKYQSGKKKVSRHTDRGLCKKLKIGGMDTSEP-----HSSGKKK 569

Query: 2296 KLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
              GE   +RL++SFK+NQYP R  KE L ++LGL+  QVSKWFENAR   RHS+
Sbjct: 570  TYGEGAIKRLYESFKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSS 623


>NP_001308700.1 PHD-finger family homeodomain protein [Solanum lycopersicum]
          Length = 796

 Score =  431 bits (1109), Expect = e-133
 Identities = 279/685 (40%), Positives = 361/685 (52%), Gaps = 29/685 (4%)
 Frame = +1

Query: 787  RRNPTVETTPVSA-RVLRSRSKDKPEAPVPCNIAEQVPDAVEXXXXXXXXXXXXPPVS-- 957
            R+  ++  +P+S+ R+LRS+SK+K  A    N      DA E              ++  
Sbjct: 63   RKRKSISGSPISSTRLLRSKSKEKSGASEAKNTVV-THDATEEKKRKRRKKKHSKHIAAN 121

Query: 958  EFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIR 1137
            EF+R+R HLRYLL RI  EQ LI+AYS EGW+ QSLEKIK EKELQ+AK+HI RYKLKIR
Sbjct: 122  EFTRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIR 181

Query: 1138 DLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQ 1317
            DLFQRLD  L EG++P SLFD+EGEIDSEDIFCAKCGS D+  DNDIILCDG+CERGFHQ
Sbjct: 182  DLFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQ 241

Query: 1318 FCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFP-EADAAA 1494
             C+EPPL KEDIP  +EGWLCPGCDCKVDCIDLLND  G  LSV DSWE V+P EA AAA
Sbjct: 242  LCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAA 301

Query: 1495 TGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDES--DYVSASNSSSGSPTKP 1668
            +G+ L+D S L               +V ++ ++ E SSDES  D+ SAS   + +PTK 
Sbjct: 302  SGEKLDDISGLPSDDSEDDDYNPEAPDVGKNDSEDESSSDESESDFYSASEDLAEAPTKD 361

Query: 1669 GFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGD 1848
                 +  +GL                                       ++  N   GD
Sbjct: 362  -----DEILGLSSEDSEDDDYNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDTNRLRGD 416

Query: 1849 NQGQSKSVGEELEVDVWVNTR---------SLKDEISYVLNSNDSPISAKRNVERLDYKK 2001
             QG S SV   +   V +  +         SLKDE+SY++ S+   +SAKR++ERLDYKK
Sbjct: 417  EQGVSSSVDNSMPNSVSLKEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKK 476

Query: 2002 LHDETYGNXXXXXXXXXXXXNVGNKRRK--NVSGRISVXXXXXXXXXXDIRPEDGDQEER 2175
            LHDETYGN                K RK  N  G ++           DI+ + G Q   
Sbjct: 477  LHDETYGNGSSDSSDEDYDDGPLPKVRKLRNAKGAMAAPSSTPA----DIKYQSGKQ--- 529

Query: 2176 RSVKQASKQFDTEYGNKFLTSHASSSEA-GLGKRHRSKPHQKLGEAVTQRLFDSFKQNQY 2352
               K +    D+    K       +SE+   GKR      +  GE  T+RL++SFK NQY
Sbjct: 530  ---KGSGHASDSGISEKLKVGGTGTSESPSSGKR------KTYGEVSTKRLYESFKDNQY 580

Query: 2353 PKRPEKENLARDLGLSIQQVSKWFENARWSSRHSATMVXXXXXXXXXXXXXXLLAKDKLK 2532
            P R  KE L ++LGL+  QVSKWFENAR   RHS                      ++  
Sbjct: 581  PDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSPNWKKIMSHK----------VSEESP 630

Query: 2533 NKGQVVDTEVVLSNDNSIAAS---------PVTNLGVKLSQTVSVVVEPLVTEEPSEEKS 2685
            +K Q++  E + +  NSI AS         P   L  +    +    E L+ ++ S +KS
Sbjct: 631  SKSQIIG-EPLGTESNSIIASCNGVEKLEQPKQCLNGEKGHAIDKSEEELLIQDTSGKKS 689

Query: 2686 GALKSRKRKNKKGNQ--PPSSCEKK 2754
                 +     +G++  P S   KK
Sbjct: 690  SEPTKKVHTTNEGSEDTPRSKTSKK 714


>XP_016539473.1 PREDICTED: pathogenesis-related homeodomain protein [Capsicum annuum]
            XP_016539474.1 PREDICTED: pathogenesis-related
            homeodomain protein [Capsicum annuum] XP_016539475.1
            PREDICTED: pathogenesis-related homeodomain protein
            [Capsicum annuum] XP_016539476.1 PREDICTED:
            pathogenesis-related homeodomain protein [Capsicum
            annuum] XP_016539477.1 PREDICTED: pathogenesis-related
            homeodomain protein [Capsicum annuum] XP_016539478.1
            PREDICTED: pathogenesis-related homeodomain protein
            [Capsicum annuum] XP_016539479.1 PREDICTED:
            pathogenesis-related homeodomain protein [Capsicum
            annuum]
          Length = 831

 Score =  432 bits (1110), Expect = e-132
 Identities = 260/593 (43%), Positives = 330/593 (55%), Gaps = 21/593 (3%)
 Frame = +1

Query: 742  CDSAQPEMEDIGSRK------RRNPTVETTPVSA-RVLRSRSKDKPEAPVPCNIAEQVPD 900
            CD+A   +     R+      R+  ++  TP+S+ R+LRS+SK+K  A    N      D
Sbjct: 43   CDNAVQNLNQSECREKTPGQPRKRKSISGTPISSTRLLRSKSKEKSGAS-EANNTVVTHD 101

Query: 901  AVEXXXXXXXXXXXXPP--VSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKI 1074
            A E                V+EF+R+R HLRYLLHRIT EQ LI+AYS EGW+ QSLEKI
Sbjct: 102  AAEEKRRKRRKKKHSKDIAVNEFTRIRGHLRYLLHRITYEQTLIEAYSGEGWKGQSLEKI 161

Query: 1075 KPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSK 1254
            K EKELQ+AK+HI RYKLKIRDLFQRLD  L +G++P SLFD+EGEIDSEDIFCAKCGS 
Sbjct: 162  KLEKELQRAKTHIFRYKLKIRDLFQRLDTLLAQGRLPASLFDNEGEIDSEDIFCAKCGSM 221

Query: 1255 DVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNG 1434
            D+  DNDIILCDG+CERGFHQ C+EPPL KEDIP  +EGWLCPGCDCKVDCIDLLND  G
Sbjct: 222  DLPADNDIILCDGTCERGFHQLCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQG 281

Query: 1435 VKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSS 1611
              LSV DSWE V+P EA AAA+G+ L+D S L               +VE++ ++ E SS
Sbjct: 282  TNLSVTDSWEKVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPENPDVEKNDSEDESSS 341

Query: 1612 DESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1791
            DESD+ SAS   + +P     P  +  + L                              
Sbjct: 342  DESDFYSASEDLAEAP-----PKDDEILALSSEDSEDDDFNPDDPDKDESVKTESSSSDF 396

Query: 1792 XXXXXXLGAVLQDNTSEGDNQGQSKSV--------GEELEVDVWVNTRSL-KDEISYVLN 1944
                     ++  +   GD QG S SV         +E +  V    R+L KDE+SY++ 
Sbjct: 397  TSDSEDFSLIVDTDMLRGDEQGVSSSVDNSMPNSASQEEKAKVGKGKRNLLKDELSYLMQ 456

Query: 1945 SNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRK--NVSGRISVXXX 2118
            S    +SAKR++ERLDYKKL+DETYGN                K RK  N  G ++    
Sbjct: 457  SVSPLVSAKRHIERLDYKKLNDETYGNESSDSSDEEYEGGPSPKVRKFRNAKGAMA---- 512

Query: 2119 XXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQK 2298
                   DI+ + G Q+       +      + G        SS     GKR      + 
Sbjct: 513  SPSSTTADIQYQSGKQKGSGHTSDSGLSEKLKVGGMSTPGSRSS-----GKR------KA 561

Query: 2299 LGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
             GE  T+RL++SFK+N YP R  KE L ++LG++  QVSKWFENAR   RHS+
Sbjct: 562  YGEVATKRLYESFKENNYPNRGAKEKLGKELGMTAYQVSKWFENARHCQRHSS 614


>XP_015062395.1 PREDICTED: pathogenesis-related homeodomain protein [Solanum
            pennellii]
          Length = 799

 Score =  431 bits (1107), Expect = e-132
 Identities = 280/685 (40%), Positives = 363/685 (52%), Gaps = 29/685 (4%)
 Frame = +1

Query: 787  RRNPTVETTPVSA-RVLRSRSKDKPEAPVPCNIAEQVPDAVEXXXXXXXXXXXXPPVS-- 957
            R+  ++  +P+S+ R+LRS+SK+K  A    N      DA E              ++  
Sbjct: 63   RKRKSISGSPISSTRLLRSKSKEKSGASEAKNTVV-THDATEEKKRKRRKKKHSKHIAAN 121

Query: 958  EFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIR 1137
            EF+R+R HLRYLL RI  EQ LI+AYS EGW+ QSLEKIK EKELQ+AK+HI RYKLKIR
Sbjct: 122  EFTRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIR 181

Query: 1138 DLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQ 1317
            DLFQRLD  L EG++P SLFD+EGEIDSEDIFCAKCGS D+  DNDIILCDG+CERGFHQ
Sbjct: 182  DLFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQ 241

Query: 1318 FCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFP-EADAAA 1494
             C+EPPL KEDIP  +EGWLCPGCDCKVDCIDLLND  G  LSV DSWE V+P EA AAA
Sbjct: 242  LCVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAA 301

Query: 1495 TGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDES--DYVSASNSSSGSPTKP 1668
            +G+ L+D S L               +V ++ ++ E SSDES  D+ SAS   + +PTK 
Sbjct: 302  SGEKLDDISGLPSDDSEDDDYNPETPDVGKNDSEDESSSDESESDFYSASEDLAEAPTKD 361

Query: 1669 GFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGD 1848
                 +  +GL                                       ++  N   GD
Sbjct: 362  -----DEILGLSSEDSEDDDFNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDTNRLRGD 416

Query: 1849 NQGQSKSVGEELEVDVWVNTR---------SLKDEISYVLNSNDSPISAKRNVERLDYKK 2001
             QG S SV   +     +  +         SLKDE+SY++ S+   +SAKR++ERLDYKK
Sbjct: 417  EQGVSSSVDNSMPNSASLEEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKK 476

Query: 2002 LHDETYGNXXXXXXXXXXXXNVGNKRRK--NVSGRISVXXXXXXXXXXDIRPEDGDQEER 2175
            LHDETYGN                K RK  N  G ++           DI+ + G Q   
Sbjct: 477  LHDETYGNGSSDSSDEDYDDGPLPKVRKLRNAKGAMA----SPSSTPADIKYQSGKQ--- 529

Query: 2176 RSVKQASKQFDTEYGNKFLTSHASSSEA-GLGKRHRSKPHQKLGEAVTQRLFDSFKQNQY 2352
            + +  AS   D+    K     A +SE+   GKR      +  GE  T+RL++SFK NQY
Sbjct: 530  KGIGHAS---DSGISEKLKVGGAGTSESPSSGKR------KTYGEVSTKRLYESFKDNQY 580

Query: 2353 PKRPEKENLARDLGLSIQQVSKWFENARWSSRHSATMVXXXXXXXXXXXXXXLLAKDKLK 2532
            P R  KE L ++LGL+  QVSKWFENAR   RHS                      ++  
Sbjct: 581  PDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSPN----------WNKIMSQKVSEESP 630

Query: 2533 NKGQVVDTEVVLSNDNSIAAS---------PVTNLGVKLSQTVSVVVEPLVTEEPSEEKS 2685
            +K Q++  E + +  NSI AS         P   L  +    +    E L+ ++ S +KS
Sbjct: 631  SKSQIIG-EPLGTESNSIIASCNGVEKLEQPKQCLNGEKGHAIDKSEEELLIQDTSGKKS 689

Query: 2686 GALKSRKRKNKKGNQ--PPSSCEKK 2754
                 +     +G++  P S   KK
Sbjct: 690  SEPTKKVHTTSQGSEDTPRSKTSKK 714


>XP_019170136.1 PREDICTED: homeobox protein HOX1A-like isoform X2 [Ipomoea nil]
          Length = 995

 Score =  436 bits (1120), Expect = e-132
 Identities = 266/639 (41%), Positives = 359/639 (56%), Gaps = 12/639 (1%)
 Frame = +1

Query: 571  SAYKPSKLIEDEDDGMQ---HHANLESPLPESPNHQHLQPIMANASNAQLGEEETSPLLN 741
            S++K  +L+ + ++ +      A L     E+P          + S       E +  L 
Sbjct: 312  SSFKNLELLHENEEAISIVDRLAELHGDASENPGQ--------DLSKMPRDSNENATQLE 363

Query: 742  CDSAQPEMEDIGSRKRRNPTVETTPVSARVLRSRSKD--KPEAPVPCNIAEQVPDAVEXX 915
            C   +P     G  ++R  T+ +  +S RVLRSR+++  KP  P+  +  +   D  +  
Sbjct: 364  CGDKRPT----GCSRKRKATLGSPVISTRVLRSRTQEEPKPVEPIHASANDSATDEKKRK 419

Query: 916  XXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQ 1095
                        V+EFS +++HLRYLL RI  EQNLIDAYS+EGW+ QSLEK+KPEKELQ
Sbjct: 420  RRKRKHSKQIA-VNEFSGIKSHLRYLLSRIKYEQNLIDAYSAEGWKGQSLEKLKPEKELQ 478

Query: 1096 KAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDND 1275
            +AKS I  YKLKIRDLFQR+D SL++G++PESLFDSEG+IDSEDIFCAKCGS D+  DND
Sbjct: 479  RAKSGIFHYKLKIRDLFQRIDTSLSQGKLPESLFDSEGQIDSEDIFCAKCGSTDLPADND 538

Query: 1276 IILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLD 1455
            IILCDG+CERGFHQ CLEPPL KEDIP G+EGWLCPGCDCKVDC DLL+D  G  LSV D
Sbjct: 539  IILCDGACERGFHQLCLEPPLLKEDIPPGDEGWLCPGCDCKVDCTDLLSDLLGTDLSVTD 598

Query: 1456 SWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVS 1632
            SWE VFP EA AAA+GK L+D S L               EVEE+++  E SSDE+D   
Sbjct: 599  SWEKVFPEEAAAAASGKQLDDISGLPSDGSDDDDYNPDNPEVEENVSQDESSSDENDSSD 658

Query: 1633 AS---NSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1803
            AS    +++      G P+++                                       
Sbjct: 659  ASFDLETTANDDILLGLPSED----------------------------SEDDDFDPDAP 690

Query: 1804 XXLGAVLQDNTSEG---DNQGQSKSVGEELEVDVWVNTRSLKDEISYVLNSNDSPISAKR 1974
                 V+Q+++S G   D++   +   E+++V      + LKDE+SY+L+S+    S KR
Sbjct: 691  DHDEQVMQESSSSGFTSDSEDSGQDQCEKIKVG-GAKQQPLKDEVSYLLHSSTVLASGKR 749

Query: 1975 NVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISVXXXXXXXXXXDIRPE 2154
             VERLDYKKLHDETYG             N   KRRK  S +  +            +  
Sbjct: 750  QVERLDYKKLHDETYGIASSDSSDEDYEDNSPPKRRKKGSDKAGLKSSDQSPLDAMDKNF 809

Query: 2155 DGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQKLGEAVTQRLFDS 2334
              ++ E  + ++ASK+F+   G   + S + SS    GKR     + + GE   +RL ++
Sbjct: 810  KQNEIEHTANRRASKKFN---GEGLVVSESGSS----GKR-----NSRFGEDAIKRLNEA 857

Query: 2335 FKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRH 2451
            FK+N YPKR  KE+LAR+LGL+++QV KWF N+RWS  H
Sbjct: 858  FKENHYPKRNVKESLARELGLTLRQVDKWFGNSRWSFYH 896


>XP_009592467.1 PREDICTED: pathogenesis-related homeodomain protein-like [Nicotiana
            tomentosiformis] XP_009592468.1 PREDICTED:
            pathogenesis-related homeodomain protein-like [Nicotiana
            tomentosiformis]
          Length = 740

 Score =  423 bits (1088), Expect = e-130
 Identities = 256/600 (42%), Positives = 327/600 (54%), Gaps = 11/600 (1%)
 Frame = +1

Query: 691  NASNAQLGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPVSARVLRSRSKDKPEAPV 870
            N    Q G+   + + N + +Q + +  G  ++R  T  T   S R+LRS+SK+K  A  
Sbjct: 34   NRGVEQSGDACENAVQNLNLSQCQEKTPGRPRKRKSTSGTPINSTRLLRSKSKEKSVASE 93

Query: 871  PCN-IAEQVPDAVEXXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEG 1047
              N +A    +  +              V+EF+R+R HLRYLL RI  EQ LI+AYS EG
Sbjct: 94   ANNTVATHEANEEKKRKRRKKKQSKHIAVNEFTRIRGHLRYLLQRIKYEQTLIEAYSGEG 153

Query: 1048 WRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSED 1227
            W+ QSLEKIK EKELQ+AK+HI RYKLKIRDLFQRLD  L +G++P SLFD+EGEIDSED
Sbjct: 154  WKGQSLEKIKLEKELQRAKAHIFRYKLKIRDLFQRLDTLLAQGRLPASLFDNEGEIDSED 213

Query: 1228 IFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDC 1407
            IFCAKC +KD+  DNDIILCDG+CERGFHQ CLEPPL KEDIP  +EGWLCPGCDCKVDC
Sbjct: 214  IFCAKCSAKDLPADNDIILCDGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDC 273

Query: 1408 IDLLNDFNGVKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEE 1584
            IDLLND  G  LSV DSWE V+P EA AA +G+ L+D S L               +VE+
Sbjct: 274  IDLLNDLQGTNLSVTDSWEKVYPKEAAAAESGEKLDDISGLPSDDSEDDDYNPENPDVEK 333

Query: 1585 DLADQEKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXX 1764
            + +  E SSDESD+ SAS      P     P  +  +GL                     
Sbjct: 334  NDSGDESSSDESDFFSASEDLEEVP-----PKDDEILGLPSEDSEDDDYSPDDPDKNEPV 388

Query: 1765 XXXXXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKSVG---------EELEVDVWVNTRSL 1917
                           LG ++  N   GD QG S SV          E+          SL
Sbjct: 389  KAESSSSDFTSDSEDLGLIVDANRLPGDEQGVSSSVDNSRPSSASQEDKPKAGRAKRNSL 448

Query: 1918 KDEISYVLNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSG 2097
            K E+S ++ S+   +S KR++ERLDYKKLHDETYGN                K R+  S 
Sbjct: 449  KVELSDLMLSHSPVVSGKRHIERLDYKKLHDETYGNESSDSSDEDFEGGPSPKVREIRSA 508

Query: 2098 RISVXXXXXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRH 2277
            + ++          D + ++G Q+  R           + G    +   SS     GK+ 
Sbjct: 509  KAAM--TSPSSTPADTKYQNGKQKGSRHTSDRGLCEKLKIGGMDTSEPRSS-----GKK- 560

Query: 2278 RSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
                 +  GE   +RL++SFK+NQYP R  KE L ++LGL+  QVSKWFENAR   RHS+
Sbjct: 561  -----KTYGEGAIKRLYESFKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSS 615


>XP_016478781.1 PREDICTED: pathogenesis-related homeodomain protein-like [Nicotiana
            tabacum] XP_016478782.1 PREDICTED: pathogenesis-related
            homeodomain protein-like [Nicotiana tabacum]
          Length = 740

 Score =  422 bits (1086), Expect = e-130
 Identities = 256/600 (42%), Positives = 326/600 (54%), Gaps = 11/600 (1%)
 Frame = +1

Query: 691  NASNAQLGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPVSARVLRSRSKDKPEAPV 870
            N    Q G+   + + N + +Q + +  G  ++R  T  T   S R+LRS+SK+K  A  
Sbjct: 34   NRGVEQSGDACENAVQNLNLSQCQEKTPGRPRKRKSTSGTPINSTRLLRSKSKEKSVASE 93

Query: 871  PCN-IAEQVPDAVEXXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEG 1047
              N +A    +  +              V+EF+R+R HLRYLL RI  EQ LI+AYS EG
Sbjct: 94   ANNTVATHEANEEKKRKRRKKKQSKHIAVNEFTRIRGHLRYLLQRIKYEQTLIEAYSGEG 153

Query: 1048 WRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSED 1227
            W+ QSLEKIK EKELQ+AK+HI RYKLKIRDLFQRLD  L +G++P SLFD+EGEIDSED
Sbjct: 154  WKGQSLEKIKLEKELQRAKAHIFRYKLKIRDLFQRLDTLLAQGRLPASLFDNEGEIDSED 213

Query: 1228 IFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDC 1407
            IFCAKC +KD+  DNDIILCDG+CERGFHQ CLEPPL KEDIP  +EGWLCPGCDCKVDC
Sbjct: 214  IFCAKCSAKDLPADNDIILCDGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDC 273

Query: 1408 IDLLNDFNGVKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEE 1584
            IDLLND  G  LSV DSWE V+P EA AA  G+ L+D S L               +VE+
Sbjct: 274  IDLLNDLQGTNLSVTDSWEKVYPKEAAAAELGEKLDDISGLPSDDSEDDDYNPENPDVEK 333

Query: 1585 DLADQEKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXX 1764
            + +  E SSDESD+ SAS      P     P  +  +GL                     
Sbjct: 334  NDSGDESSSDESDFFSASEDLEEVP-----PKDDEILGLPSEDSEDDDYSPDDPDKNEPV 388

Query: 1765 XXXXXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKSVG---------EELEVDVWVNTRSL 1917
                           LG ++  N   GD QG S SV          E+          SL
Sbjct: 389  KAESSSSDFTSDSEDLGLIVDANRLPGDEQGVSSSVDNSRPSSASQEDKPKAGRAKRNSL 448

Query: 1918 KDEISYVLNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSG 2097
            K E+S ++ S+   +S KR++ERLDYKKLHDETYGN                K R+  S 
Sbjct: 449  KVELSDLMLSHSPVVSGKRHIERLDYKKLHDETYGNESSDSSDEDFEGGPSPKVREIRSA 508

Query: 2098 RISVXXXXXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRH 2277
            + ++          D + ++G Q+  R           + G    +   SS     GK+ 
Sbjct: 509  KAAM--TSPSSTPADTKYQNGKQKGSRHTSDRGLCEKLKIGGMDTSEPRSS-----GKK- 560

Query: 2278 RSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
                 +  GE   +RL++SFK+NQYP R  KE L ++LGL+  QVSKWFENAR   RHS+
Sbjct: 561  -----KTYGEGAIKRLYESFKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSS 615


>XP_009775281.1 PREDICTED: pathogenesis-related homeodomain protein [Nicotiana
            sylvestris] XP_009775282.1 PREDICTED:
            pathogenesis-related homeodomain protein [Nicotiana
            sylvestris]
          Length = 747

 Score =  419 bits (1078), Expect = e-129
 Identities = 262/653 (40%), Positives = 337/653 (51%), Gaps = 11/653 (1%)
 Frame = +1

Query: 532  ECKHQPEMKGSPISAYKPSKLIEDEDDGMQHHANLESPLPESPNHQHLQPIMANASNAQL 711
            + + Q EM     +A  P K+      G  H+  L   + E+P  + L     NA     
Sbjct: 4    QLEDQTEMSTLGNTAVSPGKVARTTARG--HNTALAGKMSENPGVEQLGDAFENAV---- 57

Query: 712  GEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPVSARVLRSRSKDKPEAPVPCN-IAE 888
               +   L  C    P     G  ++R  T  T   S R+LRS+SK+K  A    N +  
Sbjct: 58   ---QKLNLSECQEKTP-----GQPRKRKSTSGTPISSTRLLRSKSKEKSGASEVNNTVVT 109

Query: 889  QVPDAVEXXXXXXXXXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLE 1068
               +  +              V+EF+ +R HLRYLL RI  EQ LI+AYS EGW+ QSLE
Sbjct: 110  DEANEEKKRKRRKKKHSKHIAVNEFTSIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLE 169

Query: 1069 KIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCG 1248
            KIK EKEL++AK+HI RYKLKIRDLFQR+D  L +G++P SLFD+EGEIDSEDIFCAKCG
Sbjct: 170  KIKLEKELERAKAHIFRYKLKIRDLFQRVDALLAQGRLPASLFDNEGEIDSEDIFCAKCG 229

Query: 1249 SKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDF 1428
            +KD+  DNDIILCDG+CERGFHQ CLEPPL KEDIP  +EGWLCPGCDCKVDCIDLLND 
Sbjct: 230  AKDLPADNDIILCDGACERGFHQLCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDL 289

Query: 1429 NGVKLSVLDSWENVFP-EADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEK 1605
             G  LS+ DSWE V+P EA AAA+G+ L+D S L               +VE++ +  E 
Sbjct: 290  QGTNLSITDSWEKVYPKEAAAAASGEKLDDISGLPSDDSEDDDYNPENPDVEKNDSGDES 349

Query: 1606 SSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1785
            SSDESD+ SAS      P     P  +  + L                            
Sbjct: 350  SSDESDFFSASEDLEEVP-----PKDDEILALPSEDSEDGDYSPDDPDKDEPAKTESSSS 404

Query: 1786 XXXXXXXXLGAVLQDNTSEGDNQGQSKSVG---------EELEVDVWVNTRSLKDEISYV 1938
                    LG ++  N   GD  G S SV          EE          SL +E+S +
Sbjct: 405  DFTSDSEDLGLIVDTNRLPGDELGVSSSVDNSKPSLASQEEKPKGGRAKRNSLNNELSDL 464

Query: 1939 LNSNDSPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISVXXX 2118
            + S    +S KR++ERLDYKKLHDETYGN            +   K R+  S + +    
Sbjct: 465  MLSYSPLVSCKRHIERLDYKKLHDETYGNESSDSSDEDFEGDPLPKVREIRSAKAA---- 520

Query: 2119 XXXXXXXDIRPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQK 2298
                      P D       +  Q+ KQ  + + ++ L            + H S   + 
Sbjct: 521  ---RTSPSSTPAD-------TKYQSGKQKVSRHTDRGLCKQLKIGGMDTSEPHSSGKKKT 570

Query: 2299 LGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSA 2457
             GE   +RL++SFK+NQYP R  KE L ++LGL+  QVSKWFENAR   RHS+
Sbjct: 571  YGEGAIKRLYESFKENQYPDRDAKEKLGKELGLTAHQVSKWFENARHCHRHSS 623


>OMO73948.1 hypothetical protein CCACVL1_17051 [Corchorus capsularis]
          Length = 888

 Score =  418 bits (1074), Expect = e-126
 Identities = 245/563 (43%), Positives = 317/563 (56%), Gaps = 15/563 (2%)
 Frame = +1

Query: 820  SARVLRSRSKDKPEAPVPCN-IAEQVPDAVEXXXXXXXXXXXXPPVSEFSRMRNHLRYLL 996
            S RVLRS+S++KP+A  P N +A+      +                E+SR+R HLRYLL
Sbjct: 320  SDRVLRSKSQEKPDASEPSNNLADVGSSKQKRRTKRKKKRGKGEAADEYSRIRTHLRYLL 379

Query: 997  HRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTEG 1176
            +RI  EQNLIDAYS+EGW+  SLEK+KPEKELQ+A S ILR KLKIRDLFQR+D    EG
Sbjct: 380  NRINYEQNLIDAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLFQRIDSLSAEG 439

Query: 1177 QIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDIP 1356
            ++PESLFDSEG+IDSEDIFCAKCGSKD+S +NDIILCDG+C+RGFHQ+CL+PPL KEDIP
Sbjct: 440  RLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCLQPPLLKEDIP 499

Query: 1357 QGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEADAAATGKTLEDGSALXXX 1536
              +EGWLCPGCDCKVDCI L+N+  G +LS+ D WE VFPEA  A  G+  +    L   
Sbjct: 500  PDDEGWLCPGCDCKVDCIKLVNECQGTRLSISDCWEKVFPEA--APGGQNQDPNFGLPSD 557

Query: 1537 XXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPGFPAK-EHFMGLXXXX 1713
                       +E +E     E SSDESD+ S S            PA  + ++GL    
Sbjct: 558  DSDDNDYNPDGSETDEKDQGDESSSDESDFTSTSGDLE-------VPANVDPYLGLPSDD 610

Query: 1714 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKSVGEELEVD 1893
                                            LGA L+ NTS+ D    S S   + +  
Sbjct: 611  SEDDDFNPDNPDHDDVVKPESSSSDFTSDSEDLGATLEGNTSQKDEGPFSSSALRDSKRG 670

Query: 1894 VWV--NTRSLKDEISYVLNSND-SPISAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXN 2064
                    SL DE++ + +  D S  S KR +ERLDYKKL+DETYGN            +
Sbjct: 671  KAKLGGKASLNDELTELASGEDGSTFSKKRTIERLDYKKLYDETYGNVPSSSSDDENWGD 730

Query: 2065 VG--NKRRKNVS--------GRISVXXXXXXXXXXDIRPEDGDQEERRSVKQASKQFDTE 2214
                 KRRK  +        G +S              P++ + + RR  +Q SK  D +
Sbjct: 731  TAMPRKRRKQTAEAISAPANGNVSASRRALASNNLTQSPKESEHKSRRKTRQTSKLKDAD 790

Query: 2215 YGNKFLTSHASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLG 2394
                 L    +S  +  GK+  S  +++LGEA  QRL+ SFK+NQYP R  KE LA++L 
Sbjct: 791  SSPAELQG-GTSVPSSSGKKAGSSSYRRLGEAEKQRLYSSFKENQYPDRATKECLAKELE 849

Query: 2395 LSIQQVSKWFENARWSSRHSATM 2463
            +++QQVSKWF+N RWS  +S +M
Sbjct: 850  MTLQQVSKWFDNTRWSYHNSPSM 872


>XP_003555282.1 PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max]
            KHN42341.1 Homeobox protein HAT3.1 [Glycine soja]
            KRG91061.1 hypothetical protein GLYMA_20G130800 [Glycine
            max]
          Length = 820

 Score =  414 bits (1065), Expect = e-126
 Identities = 283/777 (36%), Positives = 397/777 (51%), Gaps = 26/777 (3%)
 Frame = +1

Query: 460  QVQVFVTNISSDN-LPPYSENMHSEECKHQP------EMKGSPISAYKPSK---LIEDED 609
            QV V ++N  S+N   P SEN+ SE  +  P      +M+ SP  A   S    L +   
Sbjct: 100  QVSVDLSNDKSENKCKPLSENVQSEPVESIPAFVVDGQMQSSPAQANMSSVNELLDQPSG 159

Query: 610  DGMQHHANLESPLPESPNHQHLQPIMANASNAQLGEEETSPLLNCDSAQPEMEDIGSRKR 789
            D + +  N    +  SP+H          S ++   +  S LL     +  +  +GS   
Sbjct: 160  DVVNNITNCSEKMSNSPSH----------SQSRRKGKRNSKLLK---KKYMLRSLGS--- 203

Query: 790  RNPTVETTPVSARVLRSRSKDKPEAPVPCN--IAEQVPDAVEXXXXXXXXXXXXPPVSE- 960
                      S R LRSR+K+KP+ P P +  +     D V+              +++ 
Sbjct: 204  ----------SGRALRSRTKEKPKEPEPTSNLVDGNSNDGVKRKSGRKKKKRREEGITDQ 253

Query: 961  FSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIRD 1140
            FSR+R+HLRYLL+RI+ E +LIDAYS EGW+  S+EK+KPEKELQ+AKS ILR KLKIRD
Sbjct: 254  FSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRRKLKIRD 313

Query: 1141 LFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQF 1320
            LF+ LD    EG+ PESLFDS GEIDSEDIFCAKC SK++S +NDIILCDG C+RGFHQ 
Sbjct: 314  LFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQL 373

Query: 1321 CLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEADAAATG 1500
            CL+PPL  EDIP G+EGWLCPGCDCK DC+DL+ND  G  LS+ D+WE VFPEA A+  G
Sbjct: 374  CLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEA-ASFAG 432

Query: 1501 KTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPGFPA 1680
              +++   L              ++ +  +   E SSDES+Y SAS    G         
Sbjct: 433  NNMDNNLGLPSDDSDDDDYNPNGSD-DVKIEGDESSSDESEYASASEKLEGG------SH 485

Query: 1681 KEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDNQG- 1857
            ++ ++GL                                    L A  +DNTS G + G 
Sbjct: 486  EDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLAAAFEDNTSPGQDGGI 545

Query: 1858 -QSKSVGEELEVDVWVNTRSLKDEISYVLNSND-----SPISAKRNVERLDYKKLHDETY 2019
              SK  G+       V   S+ DE+S +L  +      +P+S KR+VERLDYKKL++ETY
Sbjct: 546  NSSKKKGK-------VGKLSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKKLYEETY 598

Query: 2020 GNXXXXXXXXXXXXNVGNKRRKNVSGRISVXXXXXXXXXXDIRPE-DGDQEERRSVKQAS 2196
             +                 R+K ++G ++            + P  +       ++K+ +
Sbjct: 599  HSDTSDDEDWNDA--AAPSRKKKLTGNVT-----------PVSPNANASNNSIHTLKRNA 645

Query: 2197 KQFDTEYGNKFLTS--HASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEK 2370
             Q   E  N   T      S      KR  S  H++LGEAV QRL  SFK+NQYP R  K
Sbjct: 646  HQNKVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTK 705

Query: 2371 ENLARDLGLSIQQVSKWFENARWSSRHSATMVXXXXXXXXXXXXXXLLAKDKLKNKGQVV 2550
            E+LA++LGL+ QQV+KWF+N RWS RHS+ M                 A+++ + + + +
Sbjct: 706  ESLAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDG-RAENEGEKQCESM 764

Query: 2551 DTEVVLSNDNSIAASPVTNLGVKLSQTVSVVVEPLVTEEPSEEKS---GALKSRKRK 2712
              EV   N  + ++    +L   LS+   + +  L T  P+  ++     +K+RKRK
Sbjct: 765  SPEVSGKNSKTTSSRKRKHLSEPLSE-AQLDINGLATSSPNVHQTQVGNKMKTRKRK 820


>XP_006589630.1 PREDICTED: homeobox protein HAT3.1 [Glycine max] KRH35711.1
            hypothetical protein GLYMA_10G260400 [Glycine max]
            KRH35712.1 hypothetical protein GLYMA_10G260400 [Glycine
            max]
          Length = 820

 Score =  407 bits (1046), Expect = e-123
 Identities = 264/686 (38%), Positives = 368/686 (53%), Gaps = 18/686 (2%)
 Frame = +1

Query: 460  QVQVFVTNISSDN-LPPYSENMHSEECKHQPEMKGSPISAYKPSKLIEDEDDGMQHHANL 636
            QV V ++N   +N   P SEN+ SE           P+ +  P+ ++E +       AN+
Sbjct: 100  QVTVDLSNDKPENKCKPLSENVQSE-----------PVESI-PAVVVEGQMQSNPSQANM 147

Query: 637  ES--PLPESPNHQHLQPIMANASNAQLGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVET 810
             S   L + P+   +  I +N S     E+ ++   +  S +   ++  S+  +   + +
Sbjct: 148  SSVNELLDQPSGDAVNNISSNCS-----EKMSNSPTHSQSRRKGKKN--SKLLKKYMLRS 200

Query: 811  TPVSARVLRSRSKDKPEAPVPC-NIAEQVPDAVEXXXXXXXXXXXXPPVS-EFSRMRNHL 984
               S R LRSR+K+KP+ P P  N+ +   + V+              ++ +FSR+R+HL
Sbjct: 201  LGSSDRALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHL 260

Query: 985  RYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLS 1164
            RYLL+RI+ E +LIDAYS EGW+  S+EK+KPEKELQ+AKS ILR KLKIRDLFQ LD  
Sbjct: 261  RYLLNRISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSL 320

Query: 1165 LTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQK 1344
              EG+ PESLFDS GEIDSEDIFCAKC SK++S +NDIILCDG C+RGFHQ CL+PP+  
Sbjct: 321  CAEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLT 380

Query: 1345 EDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEADAAATGKTLEDGSA 1524
            EDIP G+EGWLCPGCDCK DC+DL+ND  G  LS+ D+WE VFPEA A+  G  +++ S 
Sbjct: 381  EDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEA-ASFAGNNMDNNSG 439

Query: 1525 LXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLX 1704
            +               + +  +   E SSDES+Y SAS    G         ++ ++GL 
Sbjct: 440  VPSDDSDDDDYNPNGPD-DVKVEGDESSSDESEYASASEKLEGG------SHEDQYLGLP 492

Query: 1705 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKS----- 1869
                                               L A ++DNTS G + G S S     
Sbjct: 493  SEDSDDGDYDPDAPDVECKVNEESSSSDFTSDSEDLAAAIEDNTSPGQDGGISSSKKKGK 552

Query: 1870 VGEELEVDVWVNTRSLKDEISYVLNSND-----SPISAKRNVERLDYKKLHDETYGNXXX 2034
            VG++L         SL DE+S +L  +      +P+S KR+VERLDYKKL++ETY +   
Sbjct: 553  VGKKL---------SLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTS 603

Query: 2035 XXXXXXXXXNVGNKRRKNVSGRISVXXXXXXXXXXDIRPE-DGDQEERRSVKQASKQFDT 2211
                         K  K ++G ++            + P  +       + K+ + Q + 
Sbjct: 604  DDEDWNDTAAPSGK--KKLTGNVT-----------PVSPNGNASNNSIHTPKRNAHQNNV 650

Query: 2212 EYGNKFLTS--HASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLAR 2385
            E  N   T      S      K+  S  H++LGEAV QRL  SFK+NQYP R  KE+LA+
Sbjct: 651  ENTNNSPTKSLEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQ 710

Query: 2386 DLGLSIQQVSKWFENARWSSRHSATM 2463
            +LGL+ QQV+KWF N RWS RHS+ M
Sbjct: 711  ELGLTYQQVAKWFGNTRWSFRHSSQM 736


>ONH91822.1 hypothetical protein PRUPE_8G137800 [Prunus persica] ONH91823.1
            hypothetical protein PRUPE_8G137800 [Prunus persica]
            ONH91824.1 hypothetical protein PRUPE_8G137800 [Prunus
            persica] ONH91825.1 hypothetical protein PRUPE_8G137800
            [Prunus persica] ONH91826.1 hypothetical protein
            PRUPE_8G137800 [Prunus persica]
          Length = 1049

 Score =  413 bits (1061), Expect = e-123
 Identities = 281/705 (39%), Positives = 366/705 (51%), Gaps = 63/705 (8%)
 Frame = +1

Query: 511  SENMHSEECKHQPEMKGSPIS--AYKPSKLIEDEDDGMQHHANLESPLPESPNHQHLQPI 684
            S ++ SE  K + ++   P      K SK +       Q   ++E+   +SP   H +P 
Sbjct: 232  SGSVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTVFEQPGPSIEAMTEDSPIG-HSEPP 290

Query: 685  MANASNAQLGEEETSPL---LNCDSAQPEMED-----------IGSRKRRNPTVETTPV- 819
            + + S + L ++E  PL   +  +S+  ++E            +G + ++NP        
Sbjct: 291  LEDLSKS-LSDKEMEPLPEDVTQNSSLQQLETASKNALKISSCLGPKDKKNPKSRKRKYM 349

Query: 820  ------SARVLRSRS--KDKP-EAPVPCNIAE--------QVPDAVEXXXXXXXXXXXXP 948
                  S RVLRS++  K+KP +  +  N+A          V +  E             
Sbjct: 350  SRSFVRSDRVLRSKTGEKEKPKDLKLSNNVATLESSNSIANVSNGEEKKRKKRKNRRDNR 409

Query: 949  PVS-EFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYK 1125
             ++ EFSR+R HLRYLL+RI  E++LIDAYS EGW+  SLEK+KPEKELQ+A S ILR K
Sbjct: 410  AIADEFSRIRTHLRYLLNRIGYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILRRK 469

Query: 1126 LKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCER 1305
            LKIRDLFQRL+    EG  PESLFDSEG+IDSEDIFC KCGSKDVSLDNDIILCDG+C+R
Sbjct: 470  LKIRDLFQRLESLCAEGMFPESLFDSEGQIDSEDIFCGKCGSKDVSLDNDIILCDGACDR 529

Query: 1306 GFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEAD 1485
            GFHQFCLEPPL  EDIP  +EGWLCPGCDCKVDCIDLLND  G  LSV DSWE VFPEA 
Sbjct: 530  GFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAA 589

Query: 1486 AAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTK 1665
            AAA+    +D   L               E +  +  +E SSDES+Y SAS+      + 
Sbjct: 590  AAASAGENQDNHGLPSDDSDDNDYDPDGPETDNKVQGEESSSDESEYASASDGLETPKSN 649

Query: 1666 PGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEG 1845
                  E ++GL                                    LGA L DN    
Sbjct: 650  -----DEQYLGLPSEDSEDDDYNPYAPDVNEDVKQESSSSDFTSDSEDLGAALDDNIMSS 704

Query: 1846 DNQGQSKSV-----------GEELEVDVWVNTRSLKDEISYVLNS-----NDSPISAKRN 1977
            ++    KS            GE+  +       SLKDE+  +L S       +P+S KR+
Sbjct: 705  EDVEGPKSTSLDDSKPHRGSGEQSSIS-GQKKHSLKDELISLLESGPGQGESAPLSGKRH 763

Query: 1978 VERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKR-RKNVSGRIS-------VXXXXXXXX 2133
            +ERLDYK+LHDE YGN            ++  +R RK  +G+++                
Sbjct: 764  IERLDYKRLHDEAYGNVPTDSSDDEDWNDIATQRKRKKGTGQVANRSPNGKTSNIKNGVI 823

Query: 2134 XXDIRPEDGDQEE--RRSVKQASKQFDT-EYGNKF-LTSHASSSEAGLGKRHRSKPHQKL 2301
              DI+P+  + E   RR   + S   DT    NK    S  S S +G     RS  + +L
Sbjct: 824  TKDIKPDVDENENTPRRMPHRKSNVEDTSNLSNKSPKGSTKSGSTSGRAGSSRS-TYSRL 882

Query: 2302 GEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENAR 2436
            GEA TQRL  SFK+N YP R  KE+LAR+LGL  +QVSKWFENAR
Sbjct: 883  GEAATQRLCKSFKENHYPDRSMKESLARELGLMAKQVSKWFENAR 927


>XP_018809403.1 PREDICTED: homeobox protein HAT3.1 [Juglans regia]
          Length = 1164

 Score =  416 bits (1068), Expect = e-123
 Identities = 277/688 (40%), Positives = 346/688 (50%), Gaps = 26/688 (3%)
 Frame = +1

Query: 772  IGSRKRRNPT-------VETTPVSARVLRSRSKDKPEAPVPCNIAEQVPDAVEXXXXXXX 930
            +G R  R P        + +   S RVLRSR+   P+A    +    V    E       
Sbjct: 477  LGRRDNRTPKSLRKKYMLRSLAASDRVLRSRTHGMPKATGSSSNLANVSTMEEKQRKSKK 536

Query: 931  XXXXXPPVSEFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSH 1110
                     EFSR+R HLRYLL+RI+ EQNLIDAYSSEGW+  SLEK+KPEKELQ+A S 
Sbjct: 537  GRGKRIVADEFSRIRTHLRYLLNRISYEQNLIDAYSSEGWKGGSLEKLKPEKELQRATSE 596

Query: 1111 ILRYKLKIRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCD 1290
            ILR KLKIRDLFQ L    TEG++P SLFDSEGEI SEDIFCAKCGSKD+S DNDIILCD
Sbjct: 597  ILRRKLKIRDLFQHLGSLCTEGRLPGSLFDSEGEICSEDIFCAKCGSKDLSADNDIILCD 656

Query: 1291 GSCERGFHQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENV 1470
            G+C+RGFHQ+CLEPPL  EDIP   +GWLCPGCDCKVDCIDLLN+  G  LS+ DSWE V
Sbjct: 657  GACDRGFHQYCLEPPLLSEDIPPDEKGWLCPGCDCKVDCIDLLNETQGTDLSLADSWEKV 716

Query: 1471 FPEADAAATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSS 1650
            FPEA AA  G   +   +L               + E+ L D E SSDES+Y SA+    
Sbjct: 717  FPEA-AATAGHNPDHNFSLPSDDSDDNDYNPDGQDDEKVLGD-ESSSDESEYASATEELE 774

Query: 1651 GSPTKPGFPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQD 1830
              P        + ++GL                                    L A L D
Sbjct: 775  TPPN------DDQYLGLPSDDSEDDDYNPDAADHSEKVKQESSSSDFTSDSEDLAAALDD 828

Query: 1831 NTSEGDNQGQ-----------SKSVGEELEVDVWVNTRSLKDEISYVLNSNDS-----PI 1962
            N S  D++               S GE  +       +SL DE+  +L S+        +
Sbjct: 829  NRSSRDDEDPMSASLDGVKPFGSSGGERPKPG--GKKQSLNDELLSILESDPGQAGFPTV 886

Query: 1963 SAKRNVERLDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRKNVSGRISVXXXXXXXXXXD 2142
            S KR++ERLDYKKLHDETYGN                 R++  + R             D
Sbjct: 887  SGKRHMERLDYKKLHDETYGNVSTDSSDDEDYNGAAAPRKRKKTTREVAPLSPSGKNMRD 946

Query: 2143 I---RPEDGDQEERRSVKQASKQFDTEYGNKFLTSHASSSEAGLGKRHRSKPHQKLGEAV 2313
            I   R       +RR+ + A+    +    K L  +  S     GKR RS   ++LGEAV
Sbjct: 947  INQNRKVADHTPKRRTRQNANIDGTSNSPTKTLDGYHRSGSG--GKRIRSSTSRRLGEAV 1004

Query: 2314 TQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHSATMVXXXXXXXXX 2493
            TQRL+  FK+NQYP+R  KE+LA++LG++ QQVSKWFENARWS  HS+ M          
Sbjct: 1005 TQRLYKVFKENQYPERVTKESLAQELGITFQQVSKWFENARWSFHHSSHM---------E 1055

Query: 2494 XXXXXLLAKDKLKNKGQVVDTEVVLSNDNSIAASPVTNLGVKLSQTVSVVVEPLVTEEPS 2673
                   +K    +    + T     N     ASP +   V+ S +  +    L T E  
Sbjct: 1056 AGGADSASKAGTPSSQTNMATRDTTCNGAQCEASPRSATTVRES-SGDLRHSELETRESC 1114

Query: 2674 EEKSGALKSRKRKNKKGNQPPSSCEKKF 2757
              KS    SRKR   KG   P + +  F
Sbjct: 1115 RHKSTTPNSRKR---KGRSDPQASDPNF 1139


>KHN06779.1 Homeobox protein HAT3.1 [Glycine soja]
          Length = 849

 Score =  407 bits (1045), Expect = e-123
 Identities = 274/743 (36%), Positives = 387/743 (52%), Gaps = 25/743 (3%)
 Frame = +1

Query: 310  GIEVNHKHICEKLKCASEVDAQNKFEESVTLTTFSQH-ASSNCEVLFDA---EPQVQVFV 477
            G E+    I EK    S +  +N     + L    QH    NC+ +  +   +  V+   
Sbjct: 75   GTELTSSVIEEKSNQVSAIVTENAV---IQLPEPLQHDLQKNCQTVEGSCLEQSTVEKVT 131

Query: 478  TNISSDN----LPPYSENMHSEECKHQPEMKGSPISAYKPSKLIEDEDDGMQHHANLES- 642
             ++S+D       P SEN+ SE           P+ +  P+ ++E +       AN+ S 
Sbjct: 132  VDLSNDKPENKCKPLSENVQSE-----------PVESI-PAVVVEGQMQSNPSQANMSSV 179

Query: 643  -PLPESPNHQHLQPIMANASNAQLGEEETSPLLNCDSAQPEMEDIGSRKRRNPTVETTPV 819
              L + P+   +  I +N S     E+ ++   +  S +   ++  S+  +   + +   
Sbjct: 180  NELLDQPSGDAVNNISSNCS-----EKMSNSPTHSQSRRKGKKN--SKLLKKYMLRSLGS 232

Query: 820  SARVLRSRSKDKPEAPVPC-NIAEQVPDAVEXXXXXXXXXXXXPPVSE-FSRMRNHLRYL 993
            S R LRSR+K+KP+ P P  N+ +   + V+              +++ FSR+R+HLRYL
Sbjct: 233  SDRALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITDQFSRIRSHLRYL 292

Query: 994  LHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLKIRDLFQRLDLSLTE 1173
            L+RI+ E +LIDAYS EGW+  S+EK+KPEKELQ+AKS ILR KLKIRDLFQ LD    E
Sbjct: 293  LNRISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAE 352

Query: 1174 GQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGFHQFCLEPPLQKEDI 1353
            G+ PESLFDS GEIDSEDIFCAKC SK++S +NDIILCDG C+RGFHQ CL+PP+  EDI
Sbjct: 353  GKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDI 412

Query: 1354 PQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEADAAATGKTLEDGSALXX 1533
            P G+EGWLCPGCDCK DC+DL+ND  G  LS+ D+WE VFPEA A+  G  +++ S +  
Sbjct: 413  PPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPEA-ASFAGNNMDNNSGVPS 471

Query: 1534 XXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPGFPAKEHFMGLXXXX 1713
                         + +  +   E SSDES+Y SAS    G         ++ ++GL    
Sbjct: 472  DDSDDDDYNPNGPD-DVKVEGDESSSDESEYASASEKLEGG------SHEDQYLGLPSED 524

Query: 1714 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDNQGQSKS-----VGE 1878
                                            L A ++DNTS G + G S S     VG+
Sbjct: 525  SDDGDYDPDAPDVECKVNEKSSSSDFTSDSEDLAAAIEDNTSPGQDGGISSSKKKGKVGK 584

Query: 1879 ELEVDVWVNTRSLKDEISYVLNSND-----SPISAKRNVERLDYKKLHDETYGNXXXXXX 2043
            +L         SL DE+S +L  +      +P+S KR+VERLDYKKL++ETY +      
Sbjct: 585  KL---------SLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTSDDE 635

Query: 2044 XXXXXXNVGNKRRKNVSGRISVXXXXXXXXXXDIRPE-DGDQEERRSVKQASKQFDTEYG 2220
                      K  K ++G ++            + P  +       + K+ + Q + E  
Sbjct: 636  DWNDTAAPSGK--KKLTGNVT-----------PVSPNGNASNNSIHTPKRNAHQNNVENT 682

Query: 2221 NKFLTS--HASSSEAGLGKRHRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLG 2394
            N   T      S      K+  S  H++LGEAV QRL  SFK+NQYP R  KE+LA++LG
Sbjct: 683  NNSPTKSLEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQELG 742

Query: 2395 LSIQQVSKWFENARWSSRHSATM 2463
            L+ QQV+KWF N RWS RHS+ M
Sbjct: 743  LTYQQVAKWFGNTRWSFRHSSQM 765


>XP_008338253.1 PREDICTED: homeobox protein HAT3.1-like isoform X2 [Malus domestica]
          Length = 1067

 Score =  409 bits (1052), Expect = e-122
 Identities = 282/756 (37%), Positives = 381/756 (50%), Gaps = 52/756 (6%)
 Frame = +1

Query: 631  NLESPLPES----PNHQHLQPIMANASNAQLGEEETSPLLNCDSAQPEMEDIGSRKRRNP 798
            +LE P+ ++    PN + ++P+  + +     E+   P  N      + ++  SRK++  
Sbjct: 316  HLELPIEDAGKSPPNDKEMEPLPEDVTQNFSLEKTEMPSKN---GPKDKQNPKSRKKKYM 372

Query: 799  TVETTPVSARVLRSRSKDKPEAPVPCNIAE-QVPDAVEXXXXXXXXXXXXPPVS------ 957
            + +++  S RVLRS+  +KP  P   N A  +  ++V                S      
Sbjct: 373  S-KSSLGSDRVLRSKIGEKPRDPKLSNNATLESSNSVANVSNVEHKRRKKRKQSQQNRVI 431

Query: 958  --EFSRMRNHLRYLLHRITIEQNLIDAYSSEGWRKQSLEKIKPEKELQKAKSHILRYKLK 1131
              EFSR+R HLRYLL+RI+ E++LIDAYS EGW+  SLEK+KPEKELQ+A   ILR KLK
Sbjct: 432  DDEFSRVRKHLRYLLNRISYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATFEILRRKLK 491

Query: 1132 IRDLFQRLDLSLTEGQIPESLFDSEGEIDSEDIFCAKCGSKDVSLDNDIILCDGSCERGF 1311
            IRDLFQ LDL  +EG  PESLFDSEG+IDSEDIFCAKCGSKDVSL NDIILCDG+C+RGF
Sbjct: 492  IRDLFQHLDLLCSEGMFPESLFDSEGQIDSEDIFCAKCGSKDVSLQNDIILCDGACDRGF 551

Query: 1312 HQFCLEPPLQKEDIPQGNEGWLCPGCDCKVDCIDLLNDFNGVKLSVLDSWENVFPEADAA 1491
            HQFCLEPPL  EDIP  +EGWLCPGCDCKVDC DLLND  G  LSV DSWE VFPEA AA
Sbjct: 552  HQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDLLNDSQGTNLSVTDSWEKVFPEAAAA 611

Query: 1492 ATGKTLEDGSALXXXXXXXXXXXXXRTEVEEDLADQEKSSDESDYVSASNSSSGSPTKPG 1671
            A+G   +    L               E  +++  +E SSDES+Y SAS+          
Sbjct: 612  ASGHNQDHSHGLPSDDSDDNDYDPDGPETNDEVPGEESSSDESEYASASDGLDTPKNND- 670

Query: 1672 FPAKEHFMGLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAVLQDNTSEGDN 1851
                E ++GL                                    LGA L DN    ++
Sbjct: 671  ----EQYLGLPSDDSEDDDYNPDAPEVIEDDKKESSSSDFTSDSEDLGAALDDNNMSAED 726

Query: 1852 QGQSKSVGEELEVDVWVNTRS----------LKDEISYVLN-----SNDSPISAKRNVER 1986
                KS   +    +  +++           LKDE+  +L         +P+S KR++ER
Sbjct: 727  VEGPKSTSLDESGPLRGSSKQSSRRGQKKQPLKDEVLSLLELGPGQGGAAPVSGKRHIER 786

Query: 1987 LDYKKLHDETYGNXXXXXXXXXXXXNVGNKRRK-----------------NVSGRISVXX 2115
            LDYKKLHDETYGN            +    R++                 N++  +    
Sbjct: 787  LDYKKLHDETYGNVPTDSSDDEEWNDTAAPRKRKKGTGQAPMVSPNGDSSNINNGVITND 846

Query: 2116 XXXXXXXXDIRPEDGDQEERRSVKQA---SKQFDT-EYGNKF---LTSHASSSEAGLGKR 2274
                    +  P+   +  + + K+A   SK  DT    NK     T  AS+SE G   R
Sbjct: 847  IKHDLDENENTPKRAPRGNKNTPKRARRKSKVEDTSNLSNKSRNGSTQSASTSEKGGSSR 906

Query: 2275 HRSKPHQKLGEAVTQRLFDSFKQNQYPKRPEKENLARDLGLSIQQVSKWFENARWSSRHS 2454
                 ++KLGEAVTQRL  SFK+N YP R  KE+LA++LG+  +QVSKWFENAR   + S
Sbjct: 907  ---STYRKLGEAVTQRLSKSFKENHYPDRSMKESLAQELGIMAKQVSKWFENARHCLKVS 963

Query: 2455 ATMVXXXXXXXXXXXXXXLLAKDKLKNKGQVVDTEVVLSNDNSIAASPVTNLGVKLSQTV 2634
                               L +D      Q  + E+  ++D      P+T      S + 
Sbjct: 964  VDKSAAGNGTPLPQTNGKQLEQDGTTFGAQ--NKELPRTDD------PMTG-----SSSR 1010

Query: 2635 SVVVEPLVTEEPSEEKSGALKSRKRKNKKGNQPPSS 2742
             +    LVT + S+ K+ +  +RKR+ K  +  P +
Sbjct: 1011 DMKDSELVTPKSSKRKAISPNNRKRERKSDDLDPEN 1046

