BLASTX nr result

ID: Rehmannia30_contig00000467 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia30_contig00000467
         (3544 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum]    1646   0.0  
ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Eryt...  1587   0.0  
gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythra...  1535   0.0  
gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrola...  1482   0.0  
gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrola...  1423   0.0  
ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theo...  1422   0.0  
gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus c...  1420   0.0  
gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olito...  1417   0.0  
gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum]  1416   0.0  
ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibe...  1416   0.0  
ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbrat...  1415   0.0  
ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Caps...  1411   0.0  
gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense]  1409   0.0  
gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum]     1409   0.0  
ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico...  1400   0.0  
ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like ...  1400   0.0  
ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipom...  1399   0.0  
ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Goss...  1399   0.0  
ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Rici...  1399   0.0  
ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico...  1399   0.0  

>ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum]
          Length = 964

 Score = 1646 bits (4263), Expect = 0.0
 Identities = 838/968 (86%), Positives = 885/968 (91%), Gaps = 4/968 (0%)
 Frame = -2

Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142
            MEASCIFCGGVS S+LKS  +RHRP ESISLY N+N +F++SPISHRVW           
Sbjct: 1    MEASCIFCGGVSTSLLKSPALRHRPIESISLYRNRNLVFVASPISHRVWASANNSSNSRS 60

Query: 3141 XXXXXR----EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDP 2974
                      ED  G+DV+N NTN KAAVSEE TR K   VND+++GP SVRALYQ+GDP
Sbjct: 61   ATKRRSRKNREDAGGSDVTNKNTNKKAAVSEE-TRKK---VNDQENGPRSVRALYQSGDP 116

Query: 2973 LGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPM 2794
            LGRR+LGKGVVKWI +GMKAMALDFA+ E QGDFA+LKQRMGPGLTFVIQAQPYLNAVPM
Sbjct: 117  LGRRELGKGVVKWICQGMKAMALDFAMVEMQGDFAELKQRMGPGLTFVIQAQPYLNAVPM 176

Query: 2793 PLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQH 2614
            PLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQHKTLIHNWRETESWKLLKELA+SAQH
Sbjct: 177  PLGLEAICLKTCTHYPTLFDHFQRELRDVLQDLQHKTLIHNWRETESWKLLKELASSAQH 236

Query: 2613 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2434
            RAIARKTSL+KSVHGVLGL + KAKA+QCRIDEFTK MSDLLRIERDAELEFTQ+ELNAV
Sbjct: 237  RAIARKTSLTKSVHGVLGLELVKAKAMQCRIDEFTKQMSDLLRIERDAELEFTQDELNAV 296

Query: 2433 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2254
            PTPD+ S+S +P EFLVSHAQ+EQELCDTICNLNAISTSTGLGGMHLVLFRVE NHRLPP
Sbjct: 297  PTPDDLSSSSRPIEFLVSHAQAEQELCDTICNLNAISTSTGLGGMHLVLFRVERNHRLPP 356

Query: 2253 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2074
            TNLSPGDMVCVR+CD RGAGATS MQGFVNNLGDDGCSISVALES HGDPTFSKLFGK+I
Sbjct: 357  TNLSPGDMVCVRVCDKRGAGATSSMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKSI 416

Query: 2073 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1894
            RIDRIQGLADA+TYERNCEA           KNSS AVVTTIFGD EDI  FE NN+VDW
Sbjct: 417  RIDRIQGLADAITYERNCEALMMLQKKGLQKKNSSRAVVTTIFGDKEDITRFEGNNLVDW 476

Query: 1893 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1714
            +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQ+IS+ VKQGER
Sbjct: 477  SEVELSGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQIISLVVKQGER 536

Query: 1713 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1534
            VLVTAPTNAAVDNMVEKLS+IGANIVRVGNPARISP VASKSLVEIVN RL DFRSEFER
Sbjct: 537  VLVTAPTNAAVDNMVEKLSEIGANIVRVGNPARISPTVASKSLVEIVNSRLGDFRSEFER 596

Query: 1533 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1354
            KKSDLRKDLS+CL+DDSLAAGIRQLLKQLGKTMKKKERET+REILSSA VVL TNIGAAD
Sbjct: 597  KKSDLRKDLSYCLKDDSLAAGIRQLLKQLGKTMKKKERETVREILSSAQVVLTTNIGAAD 656

Query: 1353 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1174
            PMIR LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV
Sbjct: 657  PMIRCLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 716

Query: 1173 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 994
            S LERA+TLHEGVLATKLT QYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK 
Sbjct: 717  SLLERAATLHEGVLATKLTIQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKQ 776

Query: 993  TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 814
            TWITQCPLLLLDTRMP+GSL+VGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGV P+
Sbjct: 777  TWITQCPLLLLDTRMPYGSLTVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVSPA 836

Query: 813  TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 634
            TIVVQSPYV+QVQLLRDRLEEFPLSTGVEVAT+DSFQGREADAV+ISMVRSNNLGAVGFL
Sbjct: 837  TIVVQSPYVAQVQLLRDRLEEFPLSTGVEVATVDSFQGREADAVIISMVRSNNLGAVGFL 896

Query: 633  GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 454
            GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GLSM
Sbjct: 897  GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGDSGGSGLSM 956

Query: 453  NPMLPSVS 430
            NPMLPS+S
Sbjct: 957  NPMLPSIS 964


>ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Erythranthe guttata]
          Length = 961

 Score = 1587 bits (4109), Expect = 0.0
 Identities = 809/968 (83%), Positives = 880/968 (90%), Gaps = 4/968 (0%)
 Frame = -2

Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142
            MEA CI CGGVSAS+LKS  +R   S+S+ LY +K R+FL SPISHR+            
Sbjct: 1    MEALCISCGGVSASLLKSPVVR---SDSVYLYRHKKRVFLGSPISHRILSTARNNSSGSA 57

Query: 3141 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEK-DGPTSVRALYQNG-DPLG 2968
                  ++ +G + +++++    +V+EE+ R KQQQ+N+ K +GPTSVR+LYQNG DPLG
Sbjct: 58   TKRRSNKNKQGKN-NSSDSGVPVSVTEEEMRNKQQQINEGKRNGPTSVRSLYQNGGDPLG 116

Query: 2967 RRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGP-GLTFVIQAQPYLNAVPMP 2791
            RRDLGKGVVKWI +GMKAMAL+FA AE QG+FA+LKQ+MGP GLTFVIQAQPYLNAVPMP
Sbjct: 117  RRDLGKGVVKWISQGMKAMALEFARAEMQGEFAELKQQMGPAGLTFVIQAQPYLNAVPMP 176

Query: 2790 LGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIH-NWRETESWKLLKELATSAQH 2614
            +G+EAICLKTCTHYPTLFDHFQRELRD+L DLQHK+LI   W +T+SWKLLK+LA SAQH
Sbjct: 177  VGLEAICLKTCTHYPTLFDHFQRELRDILQDLQHKSLIPLTWHQTQSWKLLKDLANSAQH 236

Query: 2613 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2434
            RA+ARK  LSKS+HG   L+IDK K+IQCRID+FT+HMS LLRIERD+ELEFT+EELNAV
Sbjct: 237  RAVARKAPLSKSLHG---LSIDKTKSIQCRIDKFTEHMSHLLRIERDSELEFTEEELNAV 293

Query: 2433 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2254
            PTPDEHSTSPKP EFLVSHAQ+EQELCDTICNLNAISTS GLGGMHLVLFR EGNHRLPP
Sbjct: 294  PTPDEHSTSPKPIEFLVSHAQAEQELCDTICNLNAISTSIGLGGMHLVLFRAEGNHRLPP 353

Query: 2253 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2074
            TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES HGDPTFSKLFGKNI
Sbjct: 354  TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKNI 413

Query: 2073 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1894
            RIDRIQGLADALTYERNCEA           +NSS+AVVTTIFGD EDIAWFEDN++VDW
Sbjct: 414  RIDRIQGLADALTYERNCEALMMLQKKGLQKQNSSVAVVTTIFGDKEDIAWFEDNDLVDW 473

Query: 1893 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1714
            +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPG GKTGVLKQLIS+ VK+GER
Sbjct: 474  SEVELDGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGAGKTGVLKQLISLVVKRGER 533

Query: 1713 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1534
            VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LAD++SEF R
Sbjct: 534  VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNSKLADYKSEFGR 593

Query: 1533 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1354
            KKS+LRKDLSHCL+DDSLAAGIRQLLKQLGK +KKKERET++EILSSA VVLATNIGAAD
Sbjct: 594  KKSNLRKDLSHCLKDDSLAAGIRQLLKQLGKAIKKKERETVKEILSSAQVVLATNIGAAD 653

Query: 1353 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1174
            PMIR L+SFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV
Sbjct: 654  PMIRSLDSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 713

Query: 1173 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 994
            S LERASTLHEGV ATKLTTQYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK 
Sbjct: 714  SLLERASTLHEGVFATKLTTQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKP 773

Query: 993  TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 814
            TWITQCPLLLLDTRMP+GSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRP+
Sbjct: 774  TWITQCPLLLLDTRMPYGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPA 833

Query: 813  TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 634
            +IVVQSPYV+QVQLLRDRLEEFP++ GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFL
Sbjct: 834  SIVVQSPYVAQVQLLRDRLEEFPITKGVEVATIDSFQGREADAVIISMVRSNNLGAVGFL 893

Query: 633  GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 454
            GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGG GL+M
Sbjct: 894  GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGSGLAM 953

Query: 453  NPMLPSVS 430
            NPMLPS+S
Sbjct: 954  NPMLPSLS 961


>gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythranthe guttata]
          Length = 876

 Score = 1535 bits (3973), Expect = 0.0
 Identities = 775/878 (88%), Positives = 827/878 (94%), Gaps = 4/878 (0%)
 Frame = -2

Query: 3051 RMKQQQVNDEK-DGPTSVRALYQNG-DPLGRRDLGKGVVKWIGKGMKAMALDFALAETQG 2878
            R KQQQ+N+ K +GPTSVR+LYQNG DPLGRRDLGKGVVKWI +GMKAMAL+FA AE QG
Sbjct: 2    RNKQQQINEGKRNGPTSVRSLYQNGGDPLGRRDLGKGVVKWISQGMKAMALEFARAEMQG 61

Query: 2877 DFADLKQRMGP-GLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLL 2701
            +FA+LKQ+MGP GLTFVIQAQPYLNAVPMP+G+EAICLKTCTHYPTLFDHFQRELRD+L 
Sbjct: 62   EFAELKQQMGPAGLTFVIQAQPYLNAVPMPVGLEAICLKTCTHYPTLFDHFQRELRDILQ 121

Query: 2700 DLQHKTLIH-NWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCR 2524
            DLQHK+LI   W +T+SWKLLK+LA SAQHRA+ARK  LSKS+HG   L+IDK K+IQCR
Sbjct: 122  DLQHKSLIPLTWHQTQSWKLLKDLANSAQHRAVARKAPLSKSLHG---LSIDKTKSIQCR 178

Query: 2523 IDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTI 2344
            ID+FT+HMS LLRIERD+ELEFT+EELNAVPTPDEHSTSPKP EFLVSHAQ+EQELCDTI
Sbjct: 179  IDKFTEHMSHLLRIERDSELEFTEEELNAVPTPDEHSTSPKPIEFLVSHAQAEQELCDTI 238

Query: 2343 CNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 2164
            CNLNAISTS GLGGMHLVLFR EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN
Sbjct: 239  CNLNAISTSIGLGGMHLVLFRAEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 298

Query: 2163 NLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXX 1984
            NLGDDGCSISVALES HGDPTFSKLFGKNIRIDRIQGLADALTYERNCEA          
Sbjct: 299  NLGDDGCSISVALESRHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEALMMLQKKGLQ 358

Query: 1983 XKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKR 1804
             +NSS+AVVTTIFGD EDIAWFEDN++VDW+E EL+GLLDTEFYD+SQQRAIALGLNKKR
Sbjct: 359  KQNSSVAVVTTIFGDKEDIAWFEDNDLVDWSEVELDGLLDTEFYDSSQQRAIALGLNKKR 418

Query: 1803 PVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 1624
            PVLIIQGPPG GKTGVLKQLIS+ VK+GERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN
Sbjct: 419  PVLIIQGPPGAGKTGVLKQLISLVVKRGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 478

Query: 1623 PARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLG 1444
            PARISPAVASKSLVEIVN +LAD++SEF RKKS+LRKDLSHCL+DDSLAAGIRQLLKQLG
Sbjct: 479  PARISPAVASKSLVEIVNSKLADYKSEFGRKKSNLRKDLSHCLKDDSLAAGIRQLLKQLG 538

Query: 1443 KTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPIL 1264
            K +KKKERET++EILSSA VVLATNIGAADPMIR L+SFDLVVIDEAGQAIEPSCWIPIL
Sbjct: 539  KAIKKKERETVKEILSSAQVVLATNIGAADPMIRSLDSFDLVVIDEAGQAIEPSCWIPIL 598

Query: 1263 LGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIAS 1084
            LGKRCILAGDQCQLAPVILSRKALEGGLGVS LERASTLHEGV ATKLTTQYRMNDAIAS
Sbjct: 599  LGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHEGVFATKLTTQYRMNDAIAS 658

Query: 1083 WASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDP 904
            WASKEMYNGLLKSSASV SHLLSDSPLVK TWITQCPLLLLDTRMP+GSLSVGCEEQLDP
Sbjct: 659  WASKEMYNGLLKSSASVTSHLLSDSPLVKPTWITQCPLLLLDTRMPYGSLSVGCEEQLDP 718

Query: 903  AGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEV 724
            AGTGSFYNEGEADIVVQHVFALIYAGVRP++IVVQSPYV+QVQLLRDRLEEFP++ GVEV
Sbjct: 719  AGTGSFYNEGEADIVVQHVFALIYAGVRPASIVVQSPYVAQVQLLRDRLEEFPITKGVEV 778

Query: 723  ATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 544
            ATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT
Sbjct: 779  ATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 838

Query: 543  FLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            FLARLLRHIRYFGRVKHAEPGGSGG GL+MNPMLPS+S
Sbjct: 839  FLARLLRHIRYFGRVKHAEPGGSGGSGLAMNPMLPSLS 876


>gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein isoform 1 [Dorcoceras hygrometricum]
          Length = 939

 Score = 1482 bits (3836), Expect = 0.0
 Identities = 754/964 (78%), Positives = 829/964 (85%)
 Frame = -2

Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142
            ME+SCI CGGVS  + KS G    P ES S Y   NR+ + S I   +W           
Sbjct: 1    MESSCICCGGVSTLLYKSPGNGRHPDESFSPY---NRVLIGSRIPRSIWASASTKRR--- 54

Query: 3141 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRR 2962
                                 K  V  +K   + +Q+ D++    S+   +QNGDPLGR+
Sbjct: 55   -------------TGGKKKEEKVGVVPKKKLGQPRQLGDQR----SLLTEHQNGDPLGRK 97

Query: 2961 DLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGM 2782
            DLGK V+KWI +GMK+MAL  A AE QGD ++ KQRMGPGLTFVI+AQPYLNAVPMP G+
Sbjct: 98   DLGKNVMKWICQGMKSMALAIAKAEMQGDLSEFKQRMGPGLTFVIEAQPYLNAVPMPPGL 157

Query: 2781 EAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIA 2602
            EAICLKTCTHYPTLFDHFQRELRDVL DLQ ++LI +WRETESWKLLKELA SAQHRAIA
Sbjct: 158  EAICLKTCTHYPTLFDHFQRELRDVLQDLQQQSLIVDWRETESWKLLKELANSAQHRAIA 217

Query: 2601 RKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPD 2422
            RKT LS  +HGVLG++++K KAIQ RIDE T+ MS+LLR+ERDAELEFTQEELNAVPTPD
Sbjct: 218  RKTPLS--LHGVLGMDLNKVKAIQRRIDELTQQMSELLRVERDAELEFTQEELNAVPTPD 275

Query: 2421 EHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLS 2242
            E+S+S KPTEFLVSHAQ EQE+CDTICNLNA+STS GLGGMHLVLF+ EGN+RLPPTNLS
Sbjct: 276  ENSSSRKPTEFLVSHAQVEQEMCDTICNLNAVSTSIGLGGMHLVLFKAEGNNRLPPTNLS 335

Query: 2241 PGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDR 2062
            PGDMVCVRICDSRGAGATSC+QGFVNNLG+DGCSISVALES HGDPTFSKLFGKNIRIDR
Sbjct: 336  PGDMVCVRICDSRGAGATSCLQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKNIRIDR 395

Query: 2061 IQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAE 1882
            IQGLAD LTYERNCEA           KN SI VV T+FGD ED+ W EDN +VDWAE E
Sbjct: 396  IQGLADTLTYERNCEALMMLQKKGLHKKNPSITVVATVFGDKEDVVWLEDNKLVDWAEME 455

Query: 1881 LNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVT 1702
            L  LLDTE YD SQQRAIALGLNKKRP+LIIQGPPGTGKT VLK+LIS+ V+QGERVLVT
Sbjct: 456  LGELLDTESYDASQQRAIALGLNKKRPMLIIQGPPGTGKTVVLKELISLVVEQGERVLVT 515

Query: 1701 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSD 1522
            APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LADF+SEFERKKSD
Sbjct: 516  APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNAKLADFKSEFERKKSD 575

Query: 1521 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIR 1342
            LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERET+RE+LSSA VVLATNIGAADP+IR
Sbjct: 576  LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETVREVLSSAQVVLATNIGAADPLIR 635

Query: 1341 WLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLE 1162
             LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGD+CQLAPVILSR+ALEGGLGVS LE
Sbjct: 636  LLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDKCQLAPVILSRRALEGGLGVSLLE 695

Query: 1161 RASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWIT 982
            RA TLHEGVL+T+LTTQYRMNDAIASWASKEMY+G L+SS+ V SHLLSDSP VK TWIT
Sbjct: 696  RAETLHEGVLSTQLTTQYRMNDAIASWASKEMYDGTLESSSRVTSHLLSDSPFVKQTWIT 755

Query: 981  QCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVV 802
            QCPLLLLDTR+P+GSLS+GCEEQ+DPAGTGSFYNEGEADIVVQHV++LIYAGV P++IVV
Sbjct: 756  QCPLLLLDTRLPYGSLSMGCEEQIDPAGTGSFYNEGEADIVVQHVYSLIYAGVIPASIVV 815

Query: 801  QSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSR 622
            QSPYV+QVQLLRDRLEEFP++TGVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSR
Sbjct: 816  QSPYVAQVQLLRDRLEEFPITTGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSR 875

Query: 621  RMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPML 442
            RMNVAITRARKHVAI+CDSSTICHNTFLARLLRHIRY+GRVKHA+PGG GG GLSM PML
Sbjct: 876  RMNVAITRARKHVAIVCDSSTICHNTFLARLLRHIRYYGRVKHADPGGYGGTGLSMTPML 935

Query: 441  PSVS 430
            PS+S
Sbjct: 936  PSLS 939


>gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrolases superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 1008

 Score = 1423 bits (3684), Expect = 0.0
 Identities = 707/890 (79%), Positives = 786/890 (88%)
 Frame = -2

Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920
            S++ ++ K  V E      Q+Q   +K    +VR LYQNGDPLGRRDLGK V++WI +GM
Sbjct: 119  SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178

Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            KAMA DF  AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL
Sbjct: 179  KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELR++L +LQ  +++ +WRETESWKLLKELA SAQHRAIARK +  K V GVLG
Sbjct: 239  FDHFQRELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS
Sbjct: 299  MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG
Sbjct: 359  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC
Sbjct: 419  AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIAVV T+FGD ED+ W E N+  DW EA+L+GLL    +D SQ
Sbjct: 479  EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV APTNAAVDNMVEKL
Sbjct: 539  QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL
Sbjct: 599  SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG
Sbjct: 659  AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L
Sbjct: 719  QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G
Sbjct: 779  TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR
Sbjct: 839  SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA
Sbjct: 899  LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 959  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008


>ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao]
 ref|XP_007029793.2| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao]
          Length = 1008

 Score = 1422 bits (3680), Expect = 0.0
 Identities = 706/890 (79%), Positives = 786/890 (88%)
 Frame = -2

Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920
            S++ ++ K  V E      Q+Q   +K    +VR LYQNGDPLGRRDLGK V++WI +GM
Sbjct: 119  SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178

Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            KAMA DF  AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL
Sbjct: 179  KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELR++L +LQ  +++ +WR+TESWKLLKELA SAQHRAIARK +  K V GVLG
Sbjct: 239  FDHFQRELRNILQELQQNSVVEDWRKTESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS
Sbjct: 299  MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG
Sbjct: 359  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC
Sbjct: 419  AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIAVV T+FGD ED+ W E N+  DW EA+L+GLL    +D SQ
Sbjct: 479  EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV APTNAAVDNMVEKL
Sbjct: 539  QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL
Sbjct: 599  SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG
Sbjct: 659  AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L
Sbjct: 719  QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G
Sbjct: 779  TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR
Sbjct: 839  SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA
Sbjct: 899  LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 959  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008


>gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus capsularis]
          Length = 1011

 Score = 1420 bits (3677), Expect = 0.0
 Identities = 708/890 (79%), Positives = 786/890 (88%)
 Frame = -2

Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920
            ++N +  K  V E     K+ Q   +K    +VR LYQNGDPLGR+DLGK V++WI +GM
Sbjct: 122  NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181

Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL
Sbjct: 182  RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++  K V GVLG
Sbjct: 242  FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELANSAQHRAIARKSTQPKPVQGVLG 301

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S   KP EFLVS
Sbjct: 302  MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG
Sbjct: 362  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC
Sbjct: 422  AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIAVV T+FGD ED+ W E N++ DW E +L+GLL    +D SQ
Sbjct: 482  EALMLLQKNGLQKKNPSIAVVATLFGDKEDMDWLEKNDLADWNETKLDGLLQNGIFDDSQ 541

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            ++AIALGLNKKRPVL++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL
Sbjct: 542  RKAIALGLNKKRPVLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL  CL+DDSL
Sbjct: 602  SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG
Sbjct: 662  AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L
Sbjct: 722  QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            TTQYRMNDAIA WASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G
Sbjct: 782  TTQYRMNDAIAGWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P TI VQSPYV+QVQLLRDR
Sbjct: 842  SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKTIAVQSPYVAQVQLLRDR 901

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA
Sbjct: 902  LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 962  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011


>gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olitorius]
          Length = 1011

 Score = 1417 bits (3667), Expect = 0.0
 Identities = 707/890 (79%), Positives = 785/890 (88%)
 Frame = -2

Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920
            ++N +  K  V E     K+ Q   +K    +VR LYQNGDPLGR+DLGK V++WI +GM
Sbjct: 122  NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181

Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL
Sbjct: 182  RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++  K V GVLG
Sbjct: 242  FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELAHSAQHRAIARKSTQPKPVQGVLG 301

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S   KP EFLVS
Sbjct: 302  MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG
Sbjct: 362  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC
Sbjct: 422  AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIAVV T+FGD ED+ W E N++ DW E  L+GLL    +D SQ
Sbjct: 482  EALMLLQKNGLQKKNLSIAVVATLFGDKEDMDWLEKNDLADWNETMLDGLLQNGIFDDSQ 541

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            ++AIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL
Sbjct: 542  RKAIALGLNKKRPLLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL  CL+DDSL
Sbjct: 602  SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG
Sbjct: 662  AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L
Sbjct: 722  QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            TTQYRMNDAIASWASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G
Sbjct: 782  TTQYRMNDAIASWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P  I VQSPYV+QVQLLRDR
Sbjct: 842  SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKAIAVQSPYVAQVQLLRDR 901

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA
Sbjct: 902  LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 962  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011


>gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum]
          Length = 989

 Score = 1416 bits (3666), Expect = 0.0
 Identities = 723/981 (73%), Positives = 822/981 (83%), Gaps = 18/981 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160
            KMEASC FCG +  S L  Q   +  S   S++L S KNR FL S     S R       
Sbjct: 7    KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66

Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019
                        ++     G G +V N+         ++ KA    +  R  QQQ   ++
Sbjct: 67   GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126

Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839
             GP  VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL
Sbjct: 127  GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186

Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659
            TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T
Sbjct: 187  TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246

Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479
            ESWKLLK+LA+SAQH+AIARK S  KSV GV+G++++KAKAIQ RID+FT  MSDLL IE
Sbjct: 247  ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306

Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299
            RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM
Sbjct: 307  RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366

Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119
            HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES
Sbjct: 367  HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426

Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939
            L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA           KNSS+AVV T+FGD
Sbjct: 427  LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNSSVAVVATLFGD 486

Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759
            NED+ W E+N+M DWAE EL    + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG
Sbjct: 487  NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546

Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579
            +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E
Sbjct: 547  LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606

Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399
            IVN +L+DF SE ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL
Sbjct: 607  IVNNKLSDFLSEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666

Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219
            S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA
Sbjct: 667  STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726

Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039
            PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS 
Sbjct: 727  PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786

Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859
            +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV
Sbjct: 787  TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846

Query: 858  VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679
            VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+
Sbjct: 847  VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906

Query: 678  ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499
            ISMVRSNNLGAVGFLGD+RRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+V
Sbjct: 907  ISMVRSNNLGAVGFLGDNRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966

Query: 498  KHAEPGGSGGYGLSMNPMLPS 436
            KH EPG    +GL M+PMLP+
Sbjct: 967  KHVEPGSFWEFGLGMDPMLPT 987


>ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibethinus]
          Length = 1004

 Score = 1416 bits (3665), Expect = 0.0
 Identities = 703/898 (78%), Positives = 788/898 (87%)
 Frame = -2

Query: 3123 EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGV 2944
            ++G  +  +   ++ K  V E +   +Q+Q   +K    +VR LYQNGDPLGRRDLGK V
Sbjct: 107  DNGSSSKSTPELSSTKILVEELELLKEQKQEKVKKTKALNVRTLYQNGDPLGRRDLGKRV 166

Query: 2943 VKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLK 2764
            V+WI +GMKAMA DF  AE QG+F +L+Q M PGLTFVIQAQPYLNA+P+PLG+EAICLK
Sbjct: 167  VRWISEGMKAMASDFVSAELQGEFLELRQMMEPGLTFVIQAQPYLNAIPIPLGLEAICLK 226

Query: 2763 TCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLS 2584
             CTHYPTLFDHFQRELR+VL +LQH +++ +WRETESWKLLKELA S QHRAIARK +L 
Sbjct: 227  ACTHYPTLFDHFQRELRNVLQELQHNSVVEDWRETESWKLLKELANSVQHRAIARKITLP 286

Query: 2583 KSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP 2404
            K + G+LG+ ++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTP+E   S 
Sbjct: 287  KPIQGILGIGLEKAKAMQGRIDEFTKRMSELLRIERDAELEFTQEELNAVPTPNEGCDSI 346

Query: 2403 KPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVC 2224
            KP EFLVSH Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVC
Sbjct: 347  KPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVC 406

Query: 2223 VRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLAD 2044
            VRICDSRGAGATSC+QGFV+NLG+DGCSISVALES HGDPTFSKLFGK++RIDRIQGLAD
Sbjct: 407  VRICDSRGAGATSCIQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKSVRIDRIQGLAD 466

Query: 2043 ALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLD 1864
            ALTYERNCEA           KN SIAVV T+FGD ED+AW E+N++ DW + EL+G L 
Sbjct: 467  ALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKEDVAWLEENDLADWNQTELDGSLQ 526

Query: 1863 TEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAA 1684
               +D SQQRAI LGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGE VLVTAPTNAA
Sbjct: 527  NRTFDDSQQRAICLGLNKKRPMLVVQGPPGTGKTGLLKEVIALAVQQGETVLVTAPTNAA 586

Query: 1683 VDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLS 1504
            VDNMVEKLSD G +IVRVGNPARIS  VASKSLVEIVN +LAD+R+EFERKKSDLRKDL 
Sbjct: 587  VDNMVEKLSDSGLDIVRVGNPARISSTVASKSLVEIVNSKLADYRAEFERKKSDLRKDLR 646

Query: 1503 HCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFD 1324
            HCL+DDSLAAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR L++FD
Sbjct: 647  HCLKDDSLAAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRLDTFD 706

Query: 1323 LVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLH 1144
            LVVIDEAGQAIEPSCWIPIL GKRCILAGD+CQLAPVILSRKALEGGLGVS LERA+TLH
Sbjct: 707  LVVIDEAGQAIEPSCWIPILKGKRCILAGDRCQLAPVILSRKALEGGLGVSLLERAATLH 766

Query: 1143 EGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLL 964
            EGVLAT LTTQYRMNDAIASWASKEMYNG LKSS SV S+LL DSP VK TWITQCPLLL
Sbjct: 767  EGVLATMLTTQYRMNDAIASWASKEMYNGELKSSPSVASYLLVDSPFVKPTWITQCPLLL 826

Query: 963  LDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVS 784
            LDTRMP+GSLSVGCEE LDPAGTGSFYNEGE DIVVQHVF LIYAGV P+ I VQSPYV+
Sbjct: 827  LDTRMPYGSLSVGCEEHLDPAGTGSFYNEGETDIVVQHVFYLIYAGVSPTAIAVQSPYVA 886

Query: 783  QVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAI 604
            QVQLLRDRL+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAI
Sbjct: 887  QVQLLRDRLDEFPQTAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAI 946

Query: 603  TRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            TRARKHVA++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 947  TRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGASGGSGLGMDPMLPSIS 1004


>ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbratica]
          Length = 1009

 Score = 1415 bits (3662), Expect = 0.0
 Identities = 705/890 (79%), Positives = 782/890 (87%)
 Frame = -2

Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920
            S++ ++ K  V E      Q+Q   +K    +VR LYQNGDPLGRRDLGK VV+WI +GM
Sbjct: 120  SSSFSSTKIIVEELGLLKDQKQQKVKKTKAVNVRTLYQNGDPLGRRDLGKRVVRWISEGM 179

Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            KAMA DF  AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL
Sbjct: 180  KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 239

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELR+VL +LQ  +++ +WRETESW LLKELA SAQHRAIARK    K V GVLG
Sbjct: 240  FDHFQRELRNVLQELQKNSVVEDWRETESWTLLKELANSAQHRAIARKIEQPKPVQGVLG 299

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS
Sbjct: 300  MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 359

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVL RVEGNHRLPPT LSPGDMVCVRICDSRG
Sbjct: 360  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLLRVEGNHRLPPTTLSPGDMVCVRICDSRG 419

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC
Sbjct: 420  AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 479

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIAVV T+FGD ED+ W E N+  DW EA+L+GLL    +D SQ
Sbjct: 480  EALMLLQKNGLQKKNPSIAVVATLFGDTEDVTWLEKNSFADWNEAKLDGLLQNGIFDDSQ 539

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL
Sbjct: 540  QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVTAPTNAAVDNMVEKL 599

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            S+ G NIVRVGNPARIS AVASKSLVEIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL
Sbjct: 600  SNTGLNIVRVGNPARISSAVASKSLVEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 659

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG
Sbjct: 660  AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 719

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPI  GKRCILAGDQCQLAPVILSRKAL+GGLGVS LERA+T+HEGVLAT L
Sbjct: 720  QAIEPSCWIPIFQGKRCILAGDQCQLAPVILSRKALDGGLGVSLLERAATMHEGVLATML 779

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            T+QYRMNDAIASWASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G
Sbjct: 780  TSQYRMNDAIASWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 839

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SLSVGCEE LDP GTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR
Sbjct: 840  SLSVGCEEHLDPVGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 899

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+E P + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA
Sbjct: 900  LDELPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 959

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 960  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1009


>ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Capsicum annuum]
          Length = 989

 Score = 1411 bits (3652), Expect = 0.0
 Identities = 720/981 (73%), Positives = 820/981 (83%), Gaps = 18/981 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160
            KMEASC FCG +  S L  Q   +  S   S++L S KNR FL S     S R       
Sbjct: 7    KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66

Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019
                        ++     G G +V N+         ++ KA    +  R  QQQ   ++
Sbjct: 67   GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126

Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839
             GP  VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL
Sbjct: 127  GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186

Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659
            TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T
Sbjct: 187  TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246

Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479
            ESWKLLK+LA+SAQH+AIARK S  KSV GV+G++++KAKAIQ RID+FT  MSDLL IE
Sbjct: 247  ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306

Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299
            RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM
Sbjct: 307  RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366

Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119
            HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES
Sbjct: 367  HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426

Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939
            L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD
Sbjct: 427  LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486

Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759
            NED+ W E+N+M DWAE EL    + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG
Sbjct: 487  NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546

Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579
            +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E
Sbjct: 547  LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606

Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399
            IVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL
Sbjct: 607  IVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666

Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219
            S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA
Sbjct: 667  STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726

Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039
            PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS 
Sbjct: 727  PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786

Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859
            +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV
Sbjct: 787  TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846

Query: 858  VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679
            VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+
Sbjct: 847  VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906

Query: 678  ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499
            ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V
Sbjct: 907  ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966

Query: 498  KHAEPGGSGGYGLSMNPMLPS 436
            KH EPG    +GL M+PMLP+
Sbjct: 967  KHVEPGSFWEFGLGMDPMLPT 987


>gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense]
          Length = 989

 Score = 1409 bits (3647), Expect = 0.0
 Identities = 720/981 (73%), Positives = 819/981 (83%), Gaps = 18/981 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160
            KMEASC FCG +  S L  Q   +  S   S++L S KNR FL S     S R       
Sbjct: 7    KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66

Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019
                        ++     G G +V N+         ++ KA    +  R  QQQ   ++
Sbjct: 67   GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126

Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839
             GP  VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL
Sbjct: 127  GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186

Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659
            TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T
Sbjct: 187  TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246

Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479
            ESWKLLK+LA+SAQH+AIARK S  KSV GV+G++++KAKAIQ RID+FT  MSDLL IE
Sbjct: 247  ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306

Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299
            RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM
Sbjct: 307  RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366

Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119
            HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES
Sbjct: 367  HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426

Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939
            L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD
Sbjct: 427  LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486

Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759
            NED+ W E+N+M DWAE EL    + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG
Sbjct: 487  NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546

Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579
            +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E
Sbjct: 547  LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606

Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399
            IVN +L+DF +E ERKKSDLRKDL  CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL
Sbjct: 607  IVNNKLSDFLAEIERKKSDLRKDLRCCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666

Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219
            S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA
Sbjct: 667  STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726

Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039
            PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS 
Sbjct: 727  PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786

Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859
            +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV
Sbjct: 787  TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846

Query: 858  VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679
            VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+
Sbjct: 847  VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906

Query: 678  ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499
            ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V
Sbjct: 907  ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966

Query: 498  KHAEPGGSGGYGLSMNPMLPS 436
            KH EPG    +GL M+PMLP+
Sbjct: 967  KHVEPGSFWEFGLGMDPMLPT 987


>gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum]
          Length = 989

 Score = 1409 bits (3647), Expect = 0.0
 Identities = 719/981 (73%), Positives = 819/981 (83%), Gaps = 18/981 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160
            KMEASC FCG +  S L  Q   +  S    ++L S KNR FL S     S R       
Sbjct: 7    KMEASCNFCGSLVPSCLTRQKRSNLSSFIGPVALSSIKNRTFLDSISLTSSIRATASSSG 66

Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019
                        ++     G G +V N+         ++ KA    +  R  QQQ   ++
Sbjct: 67   GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126

Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839
             GP  VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL
Sbjct: 127  GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186

Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659
            TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T
Sbjct: 187  TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246

Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479
            ESWKLLK+LA+SAQH+AIARK S  KSV GV+G++++KAKAIQ RID+FT  MSDLL IE
Sbjct: 247  ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306

Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299
            RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM
Sbjct: 307  RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366

Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119
            HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES
Sbjct: 367  HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426

Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939
            L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD
Sbjct: 427  LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486

Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759
            NED+ W E+N+M DWAE EL    + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG
Sbjct: 487  NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546

Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579
            +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E
Sbjct: 547  LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606

Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399
            IVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL
Sbjct: 607  IVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666

Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219
            S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA
Sbjct: 667  STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726

Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039
            PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS 
Sbjct: 727  PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786

Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859
            +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV
Sbjct: 787  TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846

Query: 858  VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679
            VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+
Sbjct: 847  VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906

Query: 678  ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499
            ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V
Sbjct: 907  ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966

Query: 498  KHAEPGGSGGYGLSMNPMLPS 436
            KH EPG    +GL M+PMLP+
Sbjct: 967  KHVEPGSFWEFGLGMDPMLPT 987


>ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana sylvestris]
          Length = 980

 Score = 1400 bits (3625), Expect = 0.0
 Identities = 708/976 (72%), Positives = 819/976 (83%), Gaps = 11/976 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS---HRV 3178
            KME+ C  CG +S    S L  +  + R +      S++L + KNR+FL S IS   + +
Sbjct: 7    KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66

Query: 3177 WXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVR 2998
                              ++ + +D+ +  T        EK +   Q+  D   GP +VR
Sbjct: 67   QASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPVNVR 124

Query: 2997 ALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQ 2818
            AL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQ
Sbjct: 125  ALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQ 184

Query: 2817 PYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLK 2638
            PYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWKLLK
Sbjct: 185  PYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWKLLK 244

Query: 2637 ELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEF 2458
            +LA SAQH+AIARKTS  K V GV+G++++KAKA+Q RID+FT  MSDLLRIERD+ELEF
Sbjct: 245  DLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEF 304

Query: 2457 TQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRV 2278
            TQEELNAVP P  +S   KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++
Sbjct: 305  TQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKL 364

Query: 2277 EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTF 2098
            EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TF
Sbjct: 365  EGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTF 424

Query: 2097 SKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWF 1918
            SKLFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD ED+AW 
Sbjct: 425  SKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDLAWL 484

Query: 1917 EDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLIS 1738
            E+N M DW+E EL    D + +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS
Sbjct: 485  EENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELIS 544

Query: 1737 IAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLA 1558
            +AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN  LA
Sbjct: 545  LAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNTELA 604

Query: 1557 DFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVL 1378
            DFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVL
Sbjct: 605  DFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVL 664

Query: 1377 ATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRK 1198
            ATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRK
Sbjct: 665  ATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRK 724

Query: 1197 ALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLL 1018
            ALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL
Sbjct: 725  ALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLL 784

Query: 1017 SDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFAL 838
             DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+L
Sbjct: 785  VDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSL 844

Query: 837  IYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSN 658
            IY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSN
Sbjct: 845  IYSGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMVRSN 904

Query: 657  NLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGG 478
            NLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG 
Sbjct: 905  NLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGS 964

Query: 477  SGGYGLSMNPMLPSVS 430
               +GL M+PMLP+ S
Sbjct: 965  FWEFGLGMDPMLPTAS 980


>ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like [Nicotiana tabacum]
          Length = 980

 Score = 1400 bits (3624), Expect = 0.0
 Identities = 708/976 (72%), Positives = 819/976 (83%), Gaps = 11/976 (1%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS---HRV 3178
            KME+ C  CG +S    S L  +  + R +      S++L + KNR+FL S IS   + +
Sbjct: 7    KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66

Query: 3177 WXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVR 2998
                              ++ + +D+ +  T        EK +   Q+  D   GP +VR
Sbjct: 67   QASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPVNVR 124

Query: 2997 ALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQ 2818
            AL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQ
Sbjct: 125  ALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQ 184

Query: 2817 PYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLK 2638
            PYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWKLLK
Sbjct: 185  PYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWKLLK 244

Query: 2637 ELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEF 2458
            +LA SAQH+AIARKTS  K V GV+G++++KAKA+Q RID+FT  MSDLLRIERD+ELEF
Sbjct: 245  DLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEF 304

Query: 2457 TQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRV 2278
            TQEELNAVP P  +S   KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++
Sbjct: 305  TQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKL 364

Query: 2277 EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTF 2098
            EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TF
Sbjct: 365  EGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTF 424

Query: 2097 SKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWF 1918
            SKLFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD ED+AW 
Sbjct: 425  SKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDLAWL 484

Query: 1917 EDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLIS 1738
            E+N M DW+E EL    D + +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS
Sbjct: 485  EENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELIS 544

Query: 1737 IAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLA 1558
            +AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN  LA
Sbjct: 545  LAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNTELA 604

Query: 1557 DFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVL 1378
            DFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVL
Sbjct: 605  DFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVL 664

Query: 1377 ATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRK 1198
            ATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRK
Sbjct: 665  ATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRK 724

Query: 1197 ALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLL 1018
            ALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL
Sbjct: 725  ALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLL 784

Query: 1017 SDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFAL 838
             DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+L
Sbjct: 785  VDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSL 844

Query: 837  IYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSN 658
            IY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSN
Sbjct: 845  IYSGVPPAAIAVQSPYVAQVQLLRDKVDELPMATGVEVATIDSFQGREADAVIISMVRSN 904

Query: 657  NLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGG 478
            NLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG 
Sbjct: 905  NLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGS 964

Query: 477  SGGYGLSMNPMLPSVS 430
               +GL M+PMLP+ S
Sbjct: 965  FWEFGLGMDPMLPTAS 980


>ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipomoea nil]
          Length = 993

 Score = 1399 bits (3622), Expect = 0.0
 Identities = 714/994 (71%), Positives = 815/994 (81%), Gaps = 30/994 (3%)
 Frame = -2

Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKN-------------RLFLSSPISHR 3181
            MEASC+FCGG S S L  +  R R S   S +++                +  +SP+ H 
Sbjct: 1    MEASCVFCGGAS-SFLGIRVRRQRDSLHSSFFASVTPFGGNSSFSRGGGSILFASPLPHC 59

Query: 3180 VWXXXXXXXXXXXXXXXXREDGRGA-----------DVSNNNTNNKAAVS---EEKTRMK 3043
             +                +   R +           + S N  N+  + S   E + R K
Sbjct: 60   RFQVANSNGGGTKAVRTAKRKSRKSGGSSGPGPGPVETSQNLKNSPVSSSVEFERQGRRK 119

Query: 3042 QQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGD--FA 2869
                    + P +V ALYQ+GDPLGRRDLGK VV WI +GMKAMA+DFA AE QG+  F+
Sbjct: 120  PALTRKNTNTPANVAALYQSGDPLGRRDLGKCVVTWISQGMKAMAIDFATAEVQGEGEFS 179

Query: 2868 DLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQH 2689
            +L+Q+MGPGLTFVIQAQPYLNAVPMPLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQ 
Sbjct: 180  ELRQQMGPGLTFVIQAQPYLNAVPMPLGLEAICLKTCTHYPTLFDHFQRELRDVLKDLQS 239

Query: 2688 KTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFT 2509
            K+L+ +WRETESWKLLKELA SAQH+AIARK S  K + GVLG++IDKAKAIQ RID+FT
Sbjct: 240  KSLVQDWRETESWKLLKELACSAQHKAIARKISEPKPIQGVLGMDIDKAKAIQSRIDDFT 299

Query: 2508 KHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP-KPTEFLVSHAQSEQELCDTICNLN 2332
            + MS LLRIERDAELEFTQEELNAVPTP E ++ P KP EFLVSHAQ EQELCDTICNL+
Sbjct: 300  EQMSALLRIERDAELEFTQEELNAVPTPAEENSKPSKPIEFLVSHAQPEQELCDTICNLH 359

Query: 2331 AISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGD 2152
            A+STSTGLGGMHLVLF+V+GNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFVNNLG+
Sbjct: 360  AVSTSTGLGGMHLVLFKVDGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVNNLGE 419

Query: 2151 DGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNS 1972
            DGCSI++ALESL GDPTFSKLFGKN+RIDRIQGLAD LTYERNCEA           KN 
Sbjct: 420  DGCSITLALESLRGDPTFSKLFGKNVRIDRIQGLADTLTYERNCEALMMLKKKGLQKKNP 479

Query: 1971 SIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLI 1792
            SIAVV T+FGD ED+AW E N++ DWA  EL+  +D++ YD SQ+RAIALGLNK+RP+LI
Sbjct: 480  SIAVVATLFGDQEDVAWLEKNDLADWAGVELDASIDSKGYDISQRRAIALGLNKRRPILI 539

Query: 1791 IQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARI 1612
            +QGPPGTGKTG+LK+LIS+AV+QGERVL+TAPTNAAVDNMVEKLSD+  NIVR GNPARI
Sbjct: 540  VQGPPGTGKTGLLKELISLAVQQGERVLITAPTNAAVDNMVEKLSDVAINIVRFGNPARI 599

Query: 1611 SPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMK 1432
            SP V+SKSL EIVN +LA+FR+E  RKK+DLRKDL HCL DDSLAAGIRQLLKQLGK++K
Sbjct: 600  SPVVSSKSLTEIVNTKLAEFRAELHRKKTDLRKDLRHCLNDDSLAAGIRQLLKQLGKSLK 659

Query: 1431 KKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKR 1252
            KKE+ET+RE+LSSA VVLATNIGAADP+IR L++FDLV+IDEA QAIEPS WIPIL GKR
Sbjct: 660  KKEKETVREVLSSAQVVLATNIGAADPLIRQLDTFDLVIIDEAAQAIEPSSWIPILRGKR 719

Query: 1251 CILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASK 1072
            CILAGDQ QLAPVILSRKALEGGLG+S LERA++LHEG+L+TKLTTQYRMNDAIASWASK
Sbjct: 720  CILAGDQFQLAPVILSRKALEGGLGISLLERAASLHEGMLSTKLTTQYRMNDAIASWASK 779

Query: 1071 EMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTG 892
            EMY G LKS   V SHLL DSP VK TWIT+CPLLLLDTRMP+GSLS GCEE LDPAGTG
Sbjct: 780  EMYGGSLKSFPQVASHLLVDSPFVKPTWITRCPLLLLDTRMPYGSLSTGCEEHLDPAGTG 839

Query: 891  SFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATID 712
            SFYNEGEADIVV+HV +L+Y+GV P  I VQSPYV+QVQLLRDRL+E P++TGVEVATID
Sbjct: 840  SFYNEGEADIVVKHVLSLVYSGVSPVAIAVQSPYVAQVQLLRDRLDEIPVTTGVEVATID 899

Query: 711  SFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLAR 532
            SFQGREADAV+ISMVRSNN+GAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNTFLAR
Sbjct: 900  SFQGREADAVIISMVRSNNMGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLAR 959

Query: 531  LLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            LLRHIRYFG VK+AEPG  GG+GL M+PMLP+ +
Sbjct: 960  LLRHIRYFGHVKNAEPGSFGGFGLGMDPMLPTAN 993


>ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Gossypium raimondii]
 gb|KJB44363.1| hypothetical protein B456_007G248100 [Gossypium raimondii]
          Length = 1003

 Score = 1399 bits (3621), Expect = 0.0
 Identities = 699/886 (78%), Positives = 778/886 (87%)
 Frame = -2

Query: 3087 TNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMA 2908
            T     V E     KQ++   +K    +VR LYQNGDPLGRRDLGK VV WI +GMKAMA
Sbjct: 118  TRTNILVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVWWISEGMKAMA 177

Query: 2907 LDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHF 2728
             DFA AE QG+F +L+QRMGPGLTFVIQAQPYLN+VPMPLG+EAICLK CTHYPTLFDHF
Sbjct: 178  SDFASAELQGEFLELRQRMGPGLTFVIQAQPYLNSVPMPLGLEAICLKACTHYPTLFDHF 237

Query: 2727 QRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNID 2548
            QRELR+VL +LQ  +++ +W+ETESWKLLKELA SAQHRAIARK +  K V GVLG++++
Sbjct: 238  QRELRNVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLE 297

Query: 2547 KAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQS 2368
            KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEEL+AVPT DE S S KP EFLVSH Q+
Sbjct: 298  KAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQA 357

Query: 2367 EQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGAT 2188
            +QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRI DSRGAGAT
Sbjct: 358  QQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGAT 417

Query: 2187 SCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXX 2008
            SC+QGFV+NLGDDGCSISVALES HGDPTFSKLFGK++RIDRI GLADALTYERNCEA  
Sbjct: 418  SCIQGFVDNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALM 477

Query: 2007 XXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAI 1828
                     KN SIAVV T+F D ED+ W E+N++ DW+ AEL+GLL    +D SQQRAI
Sbjct: 478  LLQKNGLQKKNPSIAVVATLFADKEDVEWLEENDLADWSPAELDGLLQNGTFDDSQQRAI 537

Query: 1827 ALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIG 1648
            ALGLNKKRPV+++QGPPGTGKTG+LK++I++A +QGERVLVTAPTNAAVDN+VEKLS+ G
Sbjct: 538  ALGLNKKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTG 597

Query: 1647 ANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGI 1468
             NIVRVGNPARIS AVASKSLVEIVN +LAD+R+EFERKKSDLRKDL HCL+DDSLAAGI
Sbjct: 598  LNIVRVGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGI 657

Query: 1467 RQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIE 1288
            RQLLKQLGK +KKKE+ET+RE+LS+A VVL+TN GAADP+IR L++FDLVVIDEAGQAIE
Sbjct: 658  RQLLKQLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIE 717

Query: 1287 PSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQY 1108
            PSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLG+S LERA+TLHEGVLAT L TQY
Sbjct: 718  PSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGVLATMLATQY 777

Query: 1107 RMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSV 928
            RMNDAIASWASKEMY+G LKSS  V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSV
Sbjct: 778  RMNDAIASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSV 837

Query: 927  GCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEF 748
            GCEE LD AGTGSF+NEGEADIVVQHV  LIYAGV P+ I VQSPYV+QVQLLRDRL+EF
Sbjct: 838  GCEEHLDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEF 897

Query: 747  PLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICD 568
            P + G+EVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA++CD
Sbjct: 898  PEADGIEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCD 957

Query: 567  SSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            SSTICHNTFLARLLRHIRY GRVKHAEPG SGG GL M+PMLPS+S
Sbjct: 958  SSTICHNTFLARLLRHIRYVGRVKHAEPGASGGSGLGMDPMLPSIS 1003


>ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Ricinus communis]
 gb|EEF38380.1| DNA-binding protein smubp-2, putative [Ricinus communis]
          Length = 989

 Score = 1399 bits (3621), Expect = 0.0
 Identities = 700/890 (78%), Positives = 781/890 (87%), Gaps = 2/890 (0%)
 Frame = -2

Query: 3093 NNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKA 2914
            N    K AVSEE+    + +VN        V++L+QNGDPLG++DLGK VVKWI +GM+A
Sbjct: 108  NTDGGKLAVSEEREEKVKMKVN--------VKSLHQNGDPLGKKDLGKTVVKWISQGMRA 159

Query: 2913 MALDFALAETQGDFADLKQRMG--PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740
            MA DFA AETQG+F +L+QRM    GLTFVIQAQPY+NAVP+PLG EA+CLK C HYPTL
Sbjct: 160  MAADFASAETQGEFLELRQRMDLEAGLTFVIQAQPYINAVPIPLGFEALCLKACIHYPTL 219

Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560
            FDHFQRELRDVL DLQ K L+ +W+ TESWKLLKELA S QHRA+ARK S  K + GVLG
Sbjct: 220  FDHFQRELRDVLQDLQRKGLVQDWQNTESWKLLKELANSVQHRAVARKVSKPKPLQGVLG 279

Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380
            +N+DKAKAIQ RIDEFTK MS+LL+IERD+ELEFTQEELNAVPTPDE+S   KP EFLVS
Sbjct: 280  MNLDKAKAIQSRIDEFTKTMSELLQIERDSELEFTQEELNAVPTPDENSDPSKPIEFLVS 339

Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200
            H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG
Sbjct: 340  HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 399

Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020
            AGATSCMQGFVNNLG+DGCSISVALES HGDPTFSKLFGK +RIDRI GLADALTYERNC
Sbjct: 400  AGATSCMQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKGVRIDRIHGLADALTYERNC 459

Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840
            EA           KN SIA+V T+FGD+ED+AW E+ ++ +W EA+++G   +E +D SQ
Sbjct: 460  EALMLLQKNGLQKKNPSIAIVATLFGDSEDLAWLEEKDLAEWNEADMDGCFGSERFDDSQ 519

Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660
            +RA+ALGLN+KRP+LIIQGPPGTGK+G+LK+LI  AV QGERVLVTAPTNAAVDNMVEKL
Sbjct: 520  RRAMALGLNQKRPLLIIQGPPGTGKSGLLKELIVRAVHQGERVLVTAPTNAAVDNMVEKL 579

Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480
            S+IG +IVRVGNPARIS AVASKSL EIVN +LA FR EFERKKSDLRKDL HCL DDSL
Sbjct: 580  SNIGLDIVRVGNPARISSAVASKSLSEIVNSKLATFRMEFERKKSDLRKDLRHCLEDDSL 639

Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300
            AAGIRQLLKQLGKTMKKKE+E+++E+LSSA VVLATN GAADP+IR L++FDLVVIDEAG
Sbjct: 640  AAGIRQLLKQLGKTMKKKEKESVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAG 699

Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120
            QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLH+GVLA +L
Sbjct: 700  QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHDGVLALQL 759

Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940
            TTQYRMNDAIASWASKEMY GLLKSS+ V SHLL  SP VK TWITQCPLLLLDTRMP+G
Sbjct: 760  TTQYRMNDAIASWASKEMYGGLLKSSSKVASHLLVHSPFVKPTWITQCPLLLLDTRMPYG 819

Query: 939  SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760
            SL +GCEE LDPAGTGSFYNEGEA+IVVQHV +LIYAGVRP+TI VQSPYV+QVQLLRDR
Sbjct: 820  SLFIGCEEHLDPAGTGSFYNEGEAEIVVQHVISLIYAGVRPTTIAVQSPYVAQVQLLRDR 879

Query: 759  LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580
            L+E P + GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRAR+HVA
Sbjct: 880  LDELPEADGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARRHVA 939

Query: 579  IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430
            ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG  GG GL M+PMLPS+S
Sbjct: 940  VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGSFGGSGLGMDPMLPSIS 989


>ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana attenuata]
 gb|OIT40020.1| regulator of nonsense transcripts 1-like protein [Nicotiana
            attenuata]
          Length = 980

 Score = 1399 bits (3620), Expect = 0.0
 Identities = 709/974 (72%), Positives = 816/974 (83%), Gaps = 9/974 (0%)
 Frame = -2

Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPISHRVWXX 3169
            KME+ C  CG +S    S L  +  + R +      S++L + KNR+FL S IS   +  
Sbjct: 7    KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66

Query: 3168 XXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKD-GPTSVRAL 2992
                          R   +    S            +KT   Q+   +E+D GP +VRAL
Sbjct: 67   QASSSSGTKSLSPRRRKPKNVKTSQIPAVTTKGSVVKKTEKIQECSQEERDSGPVNVRAL 126

Query: 2991 YQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPY 2812
             +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQPY
Sbjct: 127  NENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQPY 186

Query: 2811 LNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKEL 2632
            LNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+L+ +WR+TESWKLLK+L
Sbjct: 187  LNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSLVQDWRDTESWKLLKDL 246

Query: 2631 ATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQ 2452
            A+SAQH+AIARKTS  K V GV+G++++KAKA+Q RID+FT  MSDLLRIERD+ELEFTQ
Sbjct: 247  ASSAQHKAIARKTSQRKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEFTQ 306

Query: 2451 EELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEG 2272
            EELNAVP P  +S   KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++EG
Sbjct: 307  EELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKLEG 366

Query: 2271 NHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSK 2092
            NHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TFSK
Sbjct: 367  NHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTFSK 426

Query: 2091 LFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFED 1912
            LFGKN+RIDRIQGLADALTYERNCEA           KN S+AVV T+FGD ED+AW E+
Sbjct: 427  LFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFLKKNPSVAVVATLFGDKEDLAWLEE 486

Query: 1911 NNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIA 1732
            N M DW+E EL    D + +D SQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS+A
Sbjct: 487  NGMADWSEVELPDSTDRKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELISLA 546

Query: 1731 VKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADF 1552
            VKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN +LADF
Sbjct: 547  VKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLAEIVNTKLADF 606

Query: 1551 RSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLAT 1372
            R+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVLAT
Sbjct: 607  RAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVLAT 666

Query: 1371 NIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKAL 1192
            NIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRKAL
Sbjct: 667  NIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRKAL 726

Query: 1191 EGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSD 1012
            EGGLGVS LERA+ LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL D
Sbjct: 727  EGGLGVSLLERAAGLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLLVD 786

Query: 1011 SPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIY 832
            SP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+LIY
Sbjct: 787  SPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSLIY 846

Query: 831  AGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNL 652
            +GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSNNL
Sbjct: 847  SGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMVRSNNL 906

Query: 651  GAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSG 472
            GAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG   
Sbjct: 907  GAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGSFW 966

Query: 471  GYGLSMNPMLPSVS 430
             +GL M+PMLP+ S
Sbjct: 967  EFGLGMDPMLPTAS 980

