BLASTX nr result
ID: Rehmannia30_contig00000467
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia30_contig00000467 (3544 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum] 1646 0.0 ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Eryt... 1587 0.0 gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythra... 1535 0.0 gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrola... 1482 0.0 gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrola... 1423 0.0 ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theo... 1422 0.0 gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus c... 1420 0.0 gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olito... 1417 0.0 gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum] 1416 0.0 ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibe... 1416 0.0 ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbrat... 1415 0.0 ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Caps... 1411 0.0 gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense] 1409 0.0 gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum] 1409 0.0 ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico... 1400 0.0 ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like ... 1400 0.0 ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipom... 1399 0.0 ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Goss... 1399 0.0 ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Rici... 1399 0.0 ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico... 1399 0.0 >ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum] Length = 964 Score = 1646 bits (4263), Expect = 0.0 Identities = 838/968 (86%), Positives = 885/968 (91%), Gaps = 4/968 (0%) Frame = -2 Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142 MEASCIFCGGVS S+LKS +RHRP ESISLY N+N +F++SPISHRVW Sbjct: 1 MEASCIFCGGVSTSLLKSPALRHRPIESISLYRNRNLVFVASPISHRVWASANNSSNSRS 60 Query: 3141 XXXXXR----EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDP 2974 ED G+DV+N NTN KAAVSEE TR K VND+++GP SVRALYQ+GDP Sbjct: 61 ATKRRSRKNREDAGGSDVTNKNTNKKAAVSEE-TRKK---VNDQENGPRSVRALYQSGDP 116 Query: 2973 LGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPM 2794 LGRR+LGKGVVKWI +GMKAMALDFA+ E QGDFA+LKQRMGPGLTFVIQAQPYLNAVPM Sbjct: 117 LGRRELGKGVVKWICQGMKAMALDFAMVEMQGDFAELKQRMGPGLTFVIQAQPYLNAVPM 176 Query: 2793 PLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQH 2614 PLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQHKTLIHNWRETESWKLLKELA+SAQH Sbjct: 177 PLGLEAICLKTCTHYPTLFDHFQRELRDVLQDLQHKTLIHNWRETESWKLLKELASSAQH 236 Query: 2613 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2434 RAIARKTSL+KSVHGVLGL + KAKA+QCRIDEFTK MSDLLRIERDAELEFTQ+ELNAV Sbjct: 237 RAIARKTSLTKSVHGVLGLELVKAKAMQCRIDEFTKQMSDLLRIERDAELEFTQDELNAV 296 Query: 2433 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2254 PTPD+ S+S +P EFLVSHAQ+EQELCDTICNLNAISTSTGLGGMHLVLFRVE NHRLPP Sbjct: 297 PTPDDLSSSSRPIEFLVSHAQAEQELCDTICNLNAISTSTGLGGMHLVLFRVERNHRLPP 356 Query: 2253 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2074 TNLSPGDMVCVR+CD RGAGATS MQGFVNNLGDDGCSISVALES HGDPTFSKLFGK+I Sbjct: 357 TNLSPGDMVCVRVCDKRGAGATSSMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKSI 416 Query: 2073 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1894 RIDRIQGLADA+TYERNCEA KNSS AVVTTIFGD EDI FE NN+VDW Sbjct: 417 RIDRIQGLADAITYERNCEALMMLQKKGLQKKNSSRAVVTTIFGDKEDITRFEGNNLVDW 476 Query: 1893 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1714 +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQ+IS+ VKQGER Sbjct: 477 SEVELSGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQIISLVVKQGER 536 Query: 1713 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1534 VLVTAPTNAAVDNMVEKLS+IGANIVRVGNPARISP VASKSLVEIVN RL DFRSEFER Sbjct: 537 VLVTAPTNAAVDNMVEKLSEIGANIVRVGNPARISPTVASKSLVEIVNSRLGDFRSEFER 596 Query: 1533 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1354 KKSDLRKDLS+CL+DDSLAAGIRQLLKQLGKTMKKKERET+REILSSA VVL TNIGAAD Sbjct: 597 KKSDLRKDLSYCLKDDSLAAGIRQLLKQLGKTMKKKERETVREILSSAQVVLTTNIGAAD 656 Query: 1353 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1174 PMIR LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV Sbjct: 657 PMIRCLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 716 Query: 1173 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 994 S LERA+TLHEGVLATKLT QYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK Sbjct: 717 SLLERAATLHEGVLATKLTIQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKQ 776 Query: 993 TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 814 TWITQCPLLLLDTRMP+GSL+VGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGV P+ Sbjct: 777 TWITQCPLLLLDTRMPYGSLTVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVSPA 836 Query: 813 TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 634 TIVVQSPYV+QVQLLRDRLEEFPLSTGVEVAT+DSFQGREADAV+ISMVRSNNLGAVGFL Sbjct: 837 TIVVQSPYVAQVQLLRDRLEEFPLSTGVEVATVDSFQGREADAVIISMVRSNNLGAVGFL 896 Query: 633 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 454 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GLSM Sbjct: 897 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGDSGGSGLSM 956 Query: 453 NPMLPSVS 430 NPMLPS+S Sbjct: 957 NPMLPSIS 964 >ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Erythranthe guttata] Length = 961 Score = 1587 bits (4109), Expect = 0.0 Identities = 809/968 (83%), Positives = 880/968 (90%), Gaps = 4/968 (0%) Frame = -2 Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142 MEA CI CGGVSAS+LKS +R S+S+ LY +K R+FL SPISHR+ Sbjct: 1 MEALCISCGGVSASLLKSPVVR---SDSVYLYRHKKRVFLGSPISHRILSTARNNSSGSA 57 Query: 3141 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEK-DGPTSVRALYQNG-DPLG 2968 ++ +G + +++++ +V+EE+ R KQQQ+N+ K +GPTSVR+LYQNG DPLG Sbjct: 58 TKRRSNKNKQGKN-NSSDSGVPVSVTEEEMRNKQQQINEGKRNGPTSVRSLYQNGGDPLG 116 Query: 2967 RRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGP-GLTFVIQAQPYLNAVPMP 2791 RRDLGKGVVKWI +GMKAMAL+FA AE QG+FA+LKQ+MGP GLTFVIQAQPYLNAVPMP Sbjct: 117 RRDLGKGVVKWISQGMKAMALEFARAEMQGEFAELKQQMGPAGLTFVIQAQPYLNAVPMP 176 Query: 2790 LGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIH-NWRETESWKLLKELATSAQH 2614 +G+EAICLKTCTHYPTLFDHFQRELRD+L DLQHK+LI W +T+SWKLLK+LA SAQH Sbjct: 177 VGLEAICLKTCTHYPTLFDHFQRELRDILQDLQHKSLIPLTWHQTQSWKLLKDLANSAQH 236 Query: 2613 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2434 RA+ARK LSKS+HG L+IDK K+IQCRID+FT+HMS LLRIERD+ELEFT+EELNAV Sbjct: 237 RAVARKAPLSKSLHG---LSIDKTKSIQCRIDKFTEHMSHLLRIERDSELEFTEEELNAV 293 Query: 2433 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2254 PTPDEHSTSPKP EFLVSHAQ+EQELCDTICNLNAISTS GLGGMHLVLFR EGNHRLPP Sbjct: 294 PTPDEHSTSPKPIEFLVSHAQAEQELCDTICNLNAISTSIGLGGMHLVLFRAEGNHRLPP 353 Query: 2253 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2074 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES HGDPTFSKLFGKNI Sbjct: 354 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKNI 413 Query: 2073 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1894 RIDRIQGLADALTYERNCEA +NSS+AVVTTIFGD EDIAWFEDN++VDW Sbjct: 414 RIDRIQGLADALTYERNCEALMMLQKKGLQKQNSSVAVVTTIFGDKEDIAWFEDNDLVDW 473 Query: 1893 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1714 +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPG GKTGVLKQLIS+ VK+GER Sbjct: 474 SEVELDGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGAGKTGVLKQLISLVVKRGER 533 Query: 1713 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1534 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LAD++SEF R Sbjct: 534 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNSKLADYKSEFGR 593 Query: 1533 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1354 KKS+LRKDLSHCL+DDSLAAGIRQLLKQLGK +KKKERET++EILSSA VVLATNIGAAD Sbjct: 594 KKSNLRKDLSHCLKDDSLAAGIRQLLKQLGKAIKKKERETVKEILSSAQVVLATNIGAAD 653 Query: 1353 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1174 PMIR L+SFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV Sbjct: 654 PMIRSLDSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 713 Query: 1173 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 994 S LERASTLHEGV ATKLTTQYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK Sbjct: 714 SLLERASTLHEGVFATKLTTQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKP 773 Query: 993 TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 814 TWITQCPLLLLDTRMP+GSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRP+ Sbjct: 774 TWITQCPLLLLDTRMPYGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPA 833 Query: 813 TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 634 +IVVQSPYV+QVQLLRDRLEEFP++ GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFL Sbjct: 834 SIVVQSPYVAQVQLLRDRLEEFPITKGVEVATIDSFQGREADAVIISMVRSNNLGAVGFL 893 Query: 633 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 454 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGG GL+M Sbjct: 894 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGSGLAM 953 Query: 453 NPMLPSVS 430 NPMLPS+S Sbjct: 954 NPMLPSLS 961 >gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythranthe guttata] Length = 876 Score = 1535 bits (3973), Expect = 0.0 Identities = 775/878 (88%), Positives = 827/878 (94%), Gaps = 4/878 (0%) Frame = -2 Query: 3051 RMKQQQVNDEK-DGPTSVRALYQNG-DPLGRRDLGKGVVKWIGKGMKAMALDFALAETQG 2878 R KQQQ+N+ K +GPTSVR+LYQNG DPLGRRDLGKGVVKWI +GMKAMAL+FA AE QG Sbjct: 2 RNKQQQINEGKRNGPTSVRSLYQNGGDPLGRRDLGKGVVKWISQGMKAMALEFARAEMQG 61 Query: 2877 DFADLKQRMGP-GLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLL 2701 +FA+LKQ+MGP GLTFVIQAQPYLNAVPMP+G+EAICLKTCTHYPTLFDHFQRELRD+L Sbjct: 62 EFAELKQQMGPAGLTFVIQAQPYLNAVPMPVGLEAICLKTCTHYPTLFDHFQRELRDILQ 121 Query: 2700 DLQHKTLIH-NWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCR 2524 DLQHK+LI W +T+SWKLLK+LA SAQHRA+ARK LSKS+HG L+IDK K+IQCR Sbjct: 122 DLQHKSLIPLTWHQTQSWKLLKDLANSAQHRAVARKAPLSKSLHG---LSIDKTKSIQCR 178 Query: 2523 IDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTI 2344 ID+FT+HMS LLRIERD+ELEFT+EELNAVPTPDEHSTSPKP EFLVSHAQ+EQELCDTI Sbjct: 179 IDKFTEHMSHLLRIERDSELEFTEEELNAVPTPDEHSTSPKPIEFLVSHAQAEQELCDTI 238 Query: 2343 CNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 2164 CNLNAISTS GLGGMHLVLFR EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN Sbjct: 239 CNLNAISTSIGLGGMHLVLFRAEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 298 Query: 2163 NLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXX 1984 NLGDDGCSISVALES HGDPTFSKLFGKNIRIDRIQGLADALTYERNCEA Sbjct: 299 NLGDDGCSISVALESRHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEALMMLQKKGLQ 358 Query: 1983 XKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKR 1804 +NSS+AVVTTIFGD EDIAWFEDN++VDW+E EL+GLLDTEFYD+SQQRAIALGLNKKR Sbjct: 359 KQNSSVAVVTTIFGDKEDIAWFEDNDLVDWSEVELDGLLDTEFYDSSQQRAIALGLNKKR 418 Query: 1803 PVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 1624 PVLIIQGPPG GKTGVLKQLIS+ VK+GERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN Sbjct: 419 PVLIIQGPPGAGKTGVLKQLISLVVKRGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 478 Query: 1623 PARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLG 1444 PARISPAVASKSLVEIVN +LAD++SEF RKKS+LRKDLSHCL+DDSLAAGIRQLLKQLG Sbjct: 479 PARISPAVASKSLVEIVNSKLADYKSEFGRKKSNLRKDLSHCLKDDSLAAGIRQLLKQLG 538 Query: 1443 KTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPIL 1264 K +KKKERET++EILSSA VVLATNIGAADPMIR L+SFDLVVIDEAGQAIEPSCWIPIL Sbjct: 539 KAIKKKERETVKEILSSAQVVLATNIGAADPMIRSLDSFDLVVIDEAGQAIEPSCWIPIL 598 Query: 1263 LGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIAS 1084 LGKRCILAGDQCQLAPVILSRKALEGGLGVS LERASTLHEGV ATKLTTQYRMNDAIAS Sbjct: 599 LGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHEGVFATKLTTQYRMNDAIAS 658 Query: 1083 WASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDP 904 WASKEMYNGLLKSSASV SHLLSDSPLVK TWITQCPLLLLDTRMP+GSLSVGCEEQLDP Sbjct: 659 WASKEMYNGLLKSSASVTSHLLSDSPLVKPTWITQCPLLLLDTRMPYGSLSVGCEEQLDP 718 Query: 903 AGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEV 724 AGTGSFYNEGEADIVVQHVFALIYAGVRP++IVVQSPYV+QVQLLRDRLEEFP++ GVEV Sbjct: 719 AGTGSFYNEGEADIVVQHVFALIYAGVRPASIVVQSPYVAQVQLLRDRLEEFPITKGVEV 778 Query: 723 ATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 544 ATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT Sbjct: 779 ATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 838 Query: 543 FLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 FLARLLRHIRYFGRVKHAEPGGSGG GL+MNPMLPS+S Sbjct: 839 FLARLLRHIRYFGRVKHAEPGGSGGSGLAMNPMLPSLS 876 >gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Dorcoceras hygrometricum] Length = 939 Score = 1482 bits (3836), Expect = 0.0 Identities = 754/964 (78%), Positives = 829/964 (85%) Frame = -2 Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3142 ME+SCI CGGVS + KS G P ES S Y NR+ + S I +W Sbjct: 1 MESSCICCGGVSTLLYKSPGNGRHPDESFSPY---NRVLIGSRIPRSIWASASTKRR--- 54 Query: 3141 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRR 2962 K V +K + +Q+ D++ S+ +QNGDPLGR+ Sbjct: 55 -------------TGGKKKEEKVGVVPKKKLGQPRQLGDQR----SLLTEHQNGDPLGRK 97 Query: 2961 DLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGM 2782 DLGK V+KWI +GMK+MAL A AE QGD ++ KQRMGPGLTFVI+AQPYLNAVPMP G+ Sbjct: 98 DLGKNVMKWICQGMKSMALAIAKAEMQGDLSEFKQRMGPGLTFVIEAQPYLNAVPMPPGL 157 Query: 2781 EAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIA 2602 EAICLKTCTHYPTLFDHFQRELRDVL DLQ ++LI +WRETESWKLLKELA SAQHRAIA Sbjct: 158 EAICLKTCTHYPTLFDHFQRELRDVLQDLQQQSLIVDWRETESWKLLKELANSAQHRAIA 217 Query: 2601 RKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPD 2422 RKT LS +HGVLG++++K KAIQ RIDE T+ MS+LLR+ERDAELEFTQEELNAVPTPD Sbjct: 218 RKTPLS--LHGVLGMDLNKVKAIQRRIDELTQQMSELLRVERDAELEFTQEELNAVPTPD 275 Query: 2421 EHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLS 2242 E+S+S KPTEFLVSHAQ EQE+CDTICNLNA+STS GLGGMHLVLF+ EGN+RLPPTNLS Sbjct: 276 ENSSSRKPTEFLVSHAQVEQEMCDTICNLNAVSTSIGLGGMHLVLFKAEGNNRLPPTNLS 335 Query: 2241 PGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDR 2062 PGDMVCVRICDSRGAGATSC+QGFVNNLG+DGCSISVALES HGDPTFSKLFGKNIRIDR Sbjct: 336 PGDMVCVRICDSRGAGATSCLQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKNIRIDR 395 Query: 2061 IQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAE 1882 IQGLAD LTYERNCEA KN SI VV T+FGD ED+ W EDN +VDWAE E Sbjct: 396 IQGLADTLTYERNCEALMMLQKKGLHKKNPSITVVATVFGDKEDVVWLEDNKLVDWAEME 455 Query: 1881 LNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVT 1702 L LLDTE YD SQQRAIALGLNKKRP+LIIQGPPGTGKT VLK+LIS+ V+QGERVLVT Sbjct: 456 LGELLDTESYDASQQRAIALGLNKKRPMLIIQGPPGTGKTVVLKELISLVVEQGERVLVT 515 Query: 1701 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSD 1522 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LADF+SEFERKKSD Sbjct: 516 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNAKLADFKSEFERKKSD 575 Query: 1521 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIR 1342 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERET+RE+LSSA VVLATNIGAADP+IR Sbjct: 576 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETVREVLSSAQVVLATNIGAADPLIR 635 Query: 1341 WLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLE 1162 LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGD+CQLAPVILSR+ALEGGLGVS LE Sbjct: 636 LLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDKCQLAPVILSRRALEGGLGVSLLE 695 Query: 1161 RASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWIT 982 RA TLHEGVL+T+LTTQYRMNDAIASWASKEMY+G L+SS+ V SHLLSDSP VK TWIT Sbjct: 696 RAETLHEGVLSTQLTTQYRMNDAIASWASKEMYDGTLESSSRVTSHLLSDSPFVKQTWIT 755 Query: 981 QCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVV 802 QCPLLLLDTR+P+GSLS+GCEEQ+DPAGTGSFYNEGEADIVVQHV++LIYAGV P++IVV Sbjct: 756 QCPLLLLDTRLPYGSLSMGCEEQIDPAGTGSFYNEGEADIVVQHVYSLIYAGVIPASIVV 815 Query: 801 QSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSR 622 QSPYV+QVQLLRDRLEEFP++TGVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSR Sbjct: 816 QSPYVAQVQLLRDRLEEFPITTGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSR 875 Query: 621 RMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPML 442 RMNVAITRARKHVAI+CDSSTICHNTFLARLLRHIRY+GRVKHA+PGG GG GLSM PML Sbjct: 876 RMNVAITRARKHVAIVCDSSTICHNTFLARLLRHIRYYGRVKHADPGGYGGTGLSMTPML 935 Query: 441 PSVS 430 PS+S Sbjct: 936 PSLS 939 >gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] Length = 1008 Score = 1423 bits (3684), Expect = 0.0 Identities = 707/890 (79%), Positives = 786/890 (88%) Frame = -2 Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK V++WI +GM Sbjct: 119 SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178 Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 KAMA DF AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 179 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELR++L +LQ +++ +WRETESWKLLKELA SAQHRAIARK + K V GVLG Sbjct: 239 FDHFQRELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 299 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 359 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 419 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 479 EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV APTNAAVDNMVEKL Sbjct: 539 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 599 SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 659 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L Sbjct: 719 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 779 TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 839 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA Sbjct: 899 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 959 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008 >ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] ref|XP_007029793.2| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] Length = 1008 Score = 1422 bits (3680), Expect = 0.0 Identities = 706/890 (79%), Positives = 786/890 (88%) Frame = -2 Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK V++WI +GM Sbjct: 119 SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178 Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 KAMA DF AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 179 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELR++L +LQ +++ +WR+TESWKLLKELA SAQHRAIARK + K V GVLG Sbjct: 239 FDHFQRELRNILQELQQNSVVEDWRKTESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 299 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 359 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 419 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 479 EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV APTNAAVDNMVEKL Sbjct: 539 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 599 SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 659 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L Sbjct: 719 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 779 TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 839 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA Sbjct: 899 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 959 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008 >gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus capsularis] Length = 1011 Score = 1420 bits (3677), Expect = 0.0 Identities = 708/890 (79%), Positives = 786/890 (88%) Frame = -2 Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920 ++N + K V E K+ Q +K +VR LYQNGDPLGR+DLGK V++WI +GM Sbjct: 122 NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181 Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL Sbjct: 182 RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++ K V GVLG Sbjct: 242 FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELANSAQHRAIARKSTQPKPVQGVLG 301 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S KP EFLVS Sbjct: 302 MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG Sbjct: 362 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC Sbjct: 422 AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIAVV T+FGD ED+ W E N++ DW E +L+GLL +D SQ Sbjct: 482 EALMLLQKNGLQKKNPSIAVVATLFGDKEDMDWLEKNDLADWNETKLDGLLQNGIFDDSQ 541 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 ++AIALGLNKKRPVL++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 542 RKAIALGLNKKRPVLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL CL+DDSL Sbjct: 602 SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG Sbjct: 662 AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L Sbjct: 722 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 TTQYRMNDAIA WASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 782 TTQYRMNDAIAGWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P TI VQSPYV+QVQLLRDR Sbjct: 842 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKTIAVQSPYVAQVQLLRDR 901 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 902 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 962 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011 >gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olitorius] Length = 1011 Score = 1417 bits (3667), Expect = 0.0 Identities = 707/890 (79%), Positives = 785/890 (88%) Frame = -2 Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920 ++N + K V E K+ Q +K +VR LYQNGDPLGR+DLGK V++WI +GM Sbjct: 122 NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181 Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL Sbjct: 182 RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++ K V GVLG Sbjct: 242 FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELAHSAQHRAIARKSTQPKPVQGVLG 301 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S KP EFLVS Sbjct: 302 MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG Sbjct: 362 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC Sbjct: 422 AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIAVV T+FGD ED+ W E N++ DW E L+GLL +D SQ Sbjct: 482 EALMLLQKNGLQKKNLSIAVVATLFGDKEDMDWLEKNDLADWNETMLDGLLQNGIFDDSQ 541 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 ++AIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 542 RKAIALGLNKKRPLLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL CL+DDSL Sbjct: 602 SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG Sbjct: 662 AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L Sbjct: 722 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 TTQYRMNDAIASWASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 782 TTQYRMNDAIASWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P I VQSPYV+QVQLLRDR Sbjct: 842 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKAIAVQSPYVAQVQLLRDR 901 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 902 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 962 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011 >gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum] Length = 989 Score = 1416 bits (3666), Expect = 0.0 Identities = 723/981 (73%), Positives = 822/981 (83%), Gaps = 18/981 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160 KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 7 KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66 Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019 ++ G G +V N+ ++ KA + R QQQ ++ Sbjct: 67 GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126 Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839 GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL Sbjct: 127 GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186 Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659 TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T Sbjct: 187 TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246 Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479 ESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL IE Sbjct: 247 ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306 Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299 RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM Sbjct: 307 RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366 Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119 HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES Sbjct: 367 HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426 Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939 L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KNSS+AVV T+FGD Sbjct: 427 LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNSSVAVVATLFGD 486 Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759 NED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG Sbjct: 487 NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546 Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579 +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E Sbjct: 547 LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606 Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399 IVN +L+DF SE ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL Sbjct: 607 IVNNKLSDFLSEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666 Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219 S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA Sbjct: 667 STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726 Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039 PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS Sbjct: 727 PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786 Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859 +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV Sbjct: 787 TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846 Query: 858 VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679 VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+ Sbjct: 847 VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906 Query: 678 ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499 ISMVRSNNLGAVGFLGD+RRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+V Sbjct: 907 ISMVRSNNLGAVGFLGDNRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966 Query: 498 KHAEPGGSGGYGLSMNPMLPS 436 KH EPG +GL M+PMLP+ Sbjct: 967 KHVEPGSFWEFGLGMDPMLPT 987 >ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibethinus] Length = 1004 Score = 1416 bits (3665), Expect = 0.0 Identities = 703/898 (78%), Positives = 788/898 (87%) Frame = -2 Query: 3123 EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGV 2944 ++G + + ++ K V E + +Q+Q +K +VR LYQNGDPLGRRDLGK V Sbjct: 107 DNGSSSKSTPELSSTKILVEELELLKEQKQEKVKKTKALNVRTLYQNGDPLGRRDLGKRV 166 Query: 2943 VKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLK 2764 V+WI +GMKAMA DF AE QG+F +L+Q M PGLTFVIQAQPYLNA+P+PLG+EAICLK Sbjct: 167 VRWISEGMKAMASDFVSAELQGEFLELRQMMEPGLTFVIQAQPYLNAIPIPLGLEAICLK 226 Query: 2763 TCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLS 2584 CTHYPTLFDHFQRELR+VL +LQH +++ +WRETESWKLLKELA S QHRAIARK +L Sbjct: 227 ACTHYPTLFDHFQRELRNVLQELQHNSVVEDWRETESWKLLKELANSVQHRAIARKITLP 286 Query: 2583 KSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP 2404 K + G+LG+ ++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTP+E S Sbjct: 287 KPIQGILGIGLEKAKAMQGRIDEFTKRMSELLRIERDAELEFTQEELNAVPTPNEGCDSI 346 Query: 2403 KPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVC 2224 KP EFLVSH Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVC Sbjct: 347 KPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVC 406 Query: 2223 VRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLAD 2044 VRICDSRGAGATSC+QGFV+NLG+DGCSISVALES HGDPTFSKLFGK++RIDRIQGLAD Sbjct: 407 VRICDSRGAGATSCIQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKSVRIDRIQGLAD 466 Query: 2043 ALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLD 1864 ALTYERNCEA KN SIAVV T+FGD ED+AW E+N++ DW + EL+G L Sbjct: 467 ALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKEDVAWLEENDLADWNQTELDGSLQ 526 Query: 1863 TEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAA 1684 +D SQQRAI LGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGE VLVTAPTNAA Sbjct: 527 NRTFDDSQQRAICLGLNKKRPMLVVQGPPGTGKTGLLKEVIALAVQQGETVLVTAPTNAA 586 Query: 1683 VDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLS 1504 VDNMVEKLSD G +IVRVGNPARIS VASKSLVEIVN +LAD+R+EFERKKSDLRKDL Sbjct: 587 VDNMVEKLSDSGLDIVRVGNPARISSTVASKSLVEIVNSKLADYRAEFERKKSDLRKDLR 646 Query: 1503 HCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFD 1324 HCL+DDSLAAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR L++FD Sbjct: 647 HCLKDDSLAAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRLDTFD 706 Query: 1323 LVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLH 1144 LVVIDEAGQAIEPSCWIPIL GKRCILAGD+CQLAPVILSRKALEGGLGVS LERA+TLH Sbjct: 707 LVVIDEAGQAIEPSCWIPILKGKRCILAGDRCQLAPVILSRKALEGGLGVSLLERAATLH 766 Query: 1143 EGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLL 964 EGVLAT LTTQYRMNDAIASWASKEMYNG LKSS SV S+LL DSP VK TWITQCPLLL Sbjct: 767 EGVLATMLTTQYRMNDAIASWASKEMYNGELKSSPSVASYLLVDSPFVKPTWITQCPLLL 826 Query: 963 LDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVS 784 LDTRMP+GSLSVGCEE LDPAGTGSFYNEGE DIVVQHVF LIYAGV P+ I VQSPYV+ Sbjct: 827 LDTRMPYGSLSVGCEEHLDPAGTGSFYNEGETDIVVQHVFYLIYAGVSPTAIAVQSPYVA 886 Query: 783 QVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAI 604 QVQLLRDRL+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAI Sbjct: 887 QVQLLRDRLDEFPQTAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAI 946 Query: 603 TRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 TRARKHVA++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 947 TRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGASGGSGLGMDPMLPSIS 1004 >ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbratica] Length = 1009 Score = 1415 bits (3662), Expect = 0.0 Identities = 705/890 (79%), Positives = 782/890 (87%) Frame = -2 Query: 3099 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2920 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK VV+WI +GM Sbjct: 120 SSSFSSTKIIVEELGLLKDQKQQKVKKTKAVNVRTLYQNGDPLGRRDLGKRVVRWISEGM 179 Query: 2919 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 KAMA DF AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 180 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 239 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELR+VL +LQ +++ +WRETESW LLKELA SAQHRAIARK K V GVLG Sbjct: 240 FDHFQRELRNVLQELQKNSVVEDWRETESWTLLKELANSAQHRAIARKIEQPKPVQGVLG 299 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 300 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 359 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVL RVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 360 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLLRVEGNHRLPPTTLSPGDMVCVRICDSRG 419 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 420 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 479 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 480 EALMLLQKNGLQKKNPSIAVVATLFGDTEDVTWLEKNSFADWNEAKLDGLLQNGIFDDSQ 539 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 540 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVTAPTNAAVDNMVEKL 599 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 S+ G NIVRVGNPARIS AVASKSLVEIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 600 SNTGLNIVRVGNPARISSAVASKSLVEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 659 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 660 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 719 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPI GKRCILAGDQCQLAPVILSRKAL+GGLGVS LERA+T+HEGVLAT L Sbjct: 720 QAIEPSCWIPIFQGKRCILAGDQCQLAPVILSRKALDGGLGVSLLERAATMHEGVLATML 779 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 T+QYRMNDAIASWASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 780 TSQYRMNDAIASWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 839 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SLSVGCEE LDP GTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 840 SLSVGCEEHLDPVGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 899 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+E P + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 900 LDELPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 959 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 960 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1009 >ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Capsicum annuum] Length = 989 Score = 1411 bits (3652), Expect = 0.0 Identities = 720/981 (73%), Positives = 820/981 (83%), Gaps = 18/981 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160 KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 7 KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66 Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019 ++ G G +V N+ ++ KA + R QQQ ++ Sbjct: 67 GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126 Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839 GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL Sbjct: 127 GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186 Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659 TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T Sbjct: 187 TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246 Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479 ESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL IE Sbjct: 247 ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306 Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299 RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM Sbjct: 307 RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366 Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119 HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES Sbjct: 367 HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426 Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939 L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD Sbjct: 427 LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486 Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759 NED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG Sbjct: 487 NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546 Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579 +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E Sbjct: 547 LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606 Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399 IVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL Sbjct: 607 IVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666 Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219 S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA Sbjct: 667 STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726 Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039 PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS Sbjct: 727 PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786 Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859 +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV Sbjct: 787 TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846 Query: 858 VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679 VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+ Sbjct: 847 VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906 Query: 678 ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499 ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V Sbjct: 907 ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966 Query: 498 KHAEPGGSGGYGLSMNPMLPS 436 KH EPG +GL M+PMLP+ Sbjct: 967 KHVEPGSFWEFGLGMDPMLPT 987 >gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense] Length = 989 Score = 1409 bits (3647), Expect = 0.0 Identities = 720/981 (73%), Positives = 819/981 (83%), Gaps = 18/981 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160 KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 7 KMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATASSSG 66 Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019 ++ G G +V N+ ++ KA + R QQQ ++ Sbjct: 67 GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126 Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839 GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL Sbjct: 127 GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186 Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659 TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T Sbjct: 187 TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246 Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479 ESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL IE Sbjct: 247 ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306 Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299 RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM Sbjct: 307 RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366 Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119 HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES Sbjct: 367 HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426 Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939 L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD Sbjct: 427 LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486 Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759 NED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG Sbjct: 487 NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546 Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579 +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E Sbjct: 547 LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606 Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399 IVN +L+DF +E ERKKSDLRKDL CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL Sbjct: 607 IVNNKLSDFLAEIERKKSDLRKDLRCCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666 Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219 S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA Sbjct: 667 STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726 Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039 PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS Sbjct: 727 PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786 Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859 +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV Sbjct: 787 TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846 Query: 858 VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679 VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+ Sbjct: 847 VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906 Query: 678 ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499 ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V Sbjct: 907 ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966 Query: 498 KHAEPGGSGGYGLSMNPMLPS 436 KH EPG +GL M+PMLP+ Sbjct: 967 KHVEPGSFWEFGLGMDPMLPT 987 >gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum] Length = 989 Score = 1409 bits (3647), Expect = 0.0 Identities = 719/981 (73%), Positives = 819/981 (83%), Gaps = 18/981 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXXXXX 3160 KMEASC FCG + S L Q + S ++L S KNR FL S S R Sbjct: 7 KMEASCNFCGSLVPSCLTRQKRSNLSSFIGPVALSSIKNRTFLDSISLTSSIRATASSSG 66 Query: 3159 XXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVNDEK 3019 ++ G G +V N+ ++ KA + R QQQ ++ Sbjct: 67 GTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQECIQE 126 Query: 3018 DGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGL 2839 GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM PGL Sbjct: 127 GGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRMEPGL 186 Query: 2838 TFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRET 2659 TFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +WR+T Sbjct: 187 TFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDWRDT 246 Query: 2658 ESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIE 2479 ESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL IE Sbjct: 247 ESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLLHIE 306 Query: 2478 RDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGM 2299 RDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GLGGM Sbjct: 307 RDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGM 366 Query: 2298 HLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES 2119 HLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+ALES Sbjct: 367 HLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLALES 426 Query: 2118 LHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGD 1939 L GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD Sbjct: 427 LQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATLFGD 486 Query: 1938 NEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTG 1759 NED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTGKTG Sbjct: 487 NEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTG 546 Query: 1758 VLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVE 1579 +LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKSL E Sbjct: 547 LLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKSLAE 606 Query: 1578 IVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREIL 1399 IVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++EIL Sbjct: 607 IVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVKEIL 666 Query: 1398 SSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLA 1219 S+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ QLA Sbjct: 667 STAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQFQLA 726 Query: 1218 PVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSA 1039 PVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L SS Sbjct: 727 PVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLTSSP 786 Query: 1038 SVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIV 859 +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEADIV Sbjct: 787 TVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIV 846 Query: 858 VQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVV 679 VQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREADAV+ Sbjct: 847 VQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREADAVI 906 Query: 678 ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRV 499 ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYFG+V Sbjct: 907 ISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYFGKV 966 Query: 498 KHAEPGGSGGYGLSMNPMLPS 436 KH EPG +GL M+PMLP+ Sbjct: 967 KHVEPGSFWEFGLGMDPMLPT 987 >ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana sylvestris] Length = 980 Score = 1400 bits (3625), Expect = 0.0 Identities = 708/976 (72%), Positives = 819/976 (83%), Gaps = 11/976 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS---HRV 3178 KME+ C CG +S S L + + R + S++L + KNR+FL S IS + + Sbjct: 7 KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66 Query: 3177 WXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVR 2998 ++ + +D+ + T EK + Q+ D GP +VR Sbjct: 67 QASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPVNVR 124 Query: 2997 ALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQ 2818 AL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQ Sbjct: 125 ALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQ 184 Query: 2817 PYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLK 2638 PYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWKLLK Sbjct: 185 PYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWKLLK 244 Query: 2637 ELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEF 2458 +LA SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+ELEF Sbjct: 245 DLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEF 304 Query: 2457 TQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRV 2278 TQEELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++ Sbjct: 305 TQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKL 364 Query: 2277 EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTF 2098 EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TF Sbjct: 365 EGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTF 424 Query: 2097 SKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWF 1918 SKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+AW Sbjct: 425 SKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDLAWL 484 Query: 1917 EDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLIS 1738 E+N M DW+E EL D + +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS Sbjct: 485 EENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELIS 544 Query: 1737 IAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLA 1558 +AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN LA Sbjct: 545 LAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNTELA 604 Query: 1557 DFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVL 1378 DFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVL Sbjct: 605 DFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVL 664 Query: 1377 ATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRK 1198 ATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRK Sbjct: 665 ATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRK 724 Query: 1197 ALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLL 1018 ALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL Sbjct: 725 ALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLL 784 Query: 1017 SDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFAL 838 DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+L Sbjct: 785 VDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSL 844 Query: 837 IYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSN 658 IY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSN Sbjct: 845 IYSGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMVRSN 904 Query: 657 NLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGG 478 NLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG Sbjct: 905 NLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGS 964 Query: 477 SGGYGLSMNPMLPSVS 430 +GL M+PMLP+ S Sbjct: 965 FWEFGLGMDPMLPTAS 980 >ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like [Nicotiana tabacum] Length = 980 Score = 1400 bits (3624), Expect = 0.0 Identities = 708/976 (72%), Positives = 819/976 (83%), Gaps = 11/976 (1%) Frame = -2 Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS---HRV 3178 KME+ C CG +S S L + + R + S++L + KNR+FL S IS + + Sbjct: 7 KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66 Query: 3177 WXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVR 2998 ++ + +D+ + T EK + Q+ D GP +VR Sbjct: 67 QASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPVNVR 124 Query: 2997 ALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQ 2818 AL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQ Sbjct: 125 ALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQ 184 Query: 2817 PYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLK 2638 PYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWKLLK Sbjct: 185 PYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWKLLK 244 Query: 2637 ELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEF 2458 +LA SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+ELEF Sbjct: 245 DLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEF 304 Query: 2457 TQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRV 2278 TQEELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++ Sbjct: 305 TQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKL 364 Query: 2277 EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTF 2098 EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TF Sbjct: 365 EGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTF 424 Query: 2097 SKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWF 1918 SKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+AW Sbjct: 425 SKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDLAWL 484 Query: 1917 EDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLIS 1738 E+N M DW+E EL D + +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS Sbjct: 485 EENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELIS 544 Query: 1737 IAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLA 1558 +AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN LA Sbjct: 545 LAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNTELA 604 Query: 1557 DFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVL 1378 DFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVL Sbjct: 605 DFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVL 664 Query: 1377 ATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRK 1198 ATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRK Sbjct: 665 ATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRK 724 Query: 1197 ALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLL 1018 ALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL Sbjct: 725 ALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLL 784 Query: 1017 SDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFAL 838 DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+L Sbjct: 785 VDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSL 844 Query: 837 IYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSN 658 IY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSN Sbjct: 845 IYSGVPPAAIAVQSPYVAQVQLLRDKVDELPMATGVEVATIDSFQGREADAVIISMVRSN 904 Query: 657 NLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGG 478 NLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG Sbjct: 905 NLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGS 964 Query: 477 SGGYGLSMNPMLPSVS 430 +GL M+PMLP+ S Sbjct: 965 FWEFGLGMDPMLPTAS 980 >ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipomoea nil] Length = 993 Score = 1399 bits (3622), Expect = 0.0 Identities = 714/994 (71%), Positives = 815/994 (81%), Gaps = 30/994 (3%) Frame = -2 Query: 3321 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKN-------------RLFLSSPISHR 3181 MEASC+FCGG S S L + R R S S +++ + +SP+ H Sbjct: 1 MEASCVFCGGAS-SFLGIRVRRQRDSLHSSFFASVTPFGGNSSFSRGGGSILFASPLPHC 59 Query: 3180 VWXXXXXXXXXXXXXXXXREDGRGA-----------DVSNNNTNNKAAVS---EEKTRMK 3043 + + R + + S N N+ + S E + R K Sbjct: 60 RFQVANSNGGGTKAVRTAKRKSRKSGGSSGPGPGPVETSQNLKNSPVSSSVEFERQGRRK 119 Query: 3042 QQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGD--FA 2869 + P +V ALYQ+GDPLGRRDLGK VV WI +GMKAMA+DFA AE QG+ F+ Sbjct: 120 PALTRKNTNTPANVAALYQSGDPLGRRDLGKCVVTWISQGMKAMAIDFATAEVQGEGEFS 179 Query: 2868 DLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQH 2689 +L+Q+MGPGLTFVIQAQPYLNAVPMPLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQ Sbjct: 180 ELRQQMGPGLTFVIQAQPYLNAVPMPLGLEAICLKTCTHYPTLFDHFQRELRDVLKDLQS 239 Query: 2688 KTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFT 2509 K+L+ +WRETESWKLLKELA SAQH+AIARK S K + GVLG++IDKAKAIQ RID+FT Sbjct: 240 KSLVQDWRETESWKLLKELACSAQHKAIARKISEPKPIQGVLGMDIDKAKAIQSRIDDFT 299 Query: 2508 KHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP-KPTEFLVSHAQSEQELCDTICNLN 2332 + MS LLRIERDAELEFTQEELNAVPTP E ++ P KP EFLVSHAQ EQELCDTICNL+ Sbjct: 300 EQMSALLRIERDAELEFTQEELNAVPTPAEENSKPSKPIEFLVSHAQPEQELCDTICNLH 359 Query: 2331 AISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGD 2152 A+STSTGLGGMHLVLF+V+GNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFVNNLG+ Sbjct: 360 AVSTSTGLGGMHLVLFKVDGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVNNLGE 419 Query: 2151 DGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNS 1972 DGCSI++ALESL GDPTFSKLFGKN+RIDRIQGLAD LTYERNCEA KN Sbjct: 420 DGCSITLALESLRGDPTFSKLFGKNVRIDRIQGLADTLTYERNCEALMMLKKKGLQKKNP 479 Query: 1971 SIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLI 1792 SIAVV T+FGD ED+AW E N++ DWA EL+ +D++ YD SQ+RAIALGLNK+RP+LI Sbjct: 480 SIAVVATLFGDQEDVAWLEKNDLADWAGVELDASIDSKGYDISQRRAIALGLNKRRPILI 539 Query: 1791 IQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARI 1612 +QGPPGTGKTG+LK+LIS+AV+QGERVL+TAPTNAAVDNMVEKLSD+ NIVR GNPARI Sbjct: 540 VQGPPGTGKTGLLKELISLAVQQGERVLITAPTNAAVDNMVEKLSDVAINIVRFGNPARI 599 Query: 1611 SPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMK 1432 SP V+SKSL EIVN +LA+FR+E RKK+DLRKDL HCL DDSLAAGIRQLLKQLGK++K Sbjct: 600 SPVVSSKSLTEIVNTKLAEFRAELHRKKTDLRKDLRHCLNDDSLAAGIRQLLKQLGKSLK 659 Query: 1431 KKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKR 1252 KKE+ET+RE+LSSA VVLATNIGAADP+IR L++FDLV+IDEA QAIEPS WIPIL GKR Sbjct: 660 KKEKETVREVLSSAQVVLATNIGAADPLIRQLDTFDLVIIDEAAQAIEPSSWIPILRGKR 719 Query: 1251 CILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASK 1072 CILAGDQ QLAPVILSRKALEGGLG+S LERA++LHEG+L+TKLTTQYRMNDAIASWASK Sbjct: 720 CILAGDQFQLAPVILSRKALEGGLGISLLERAASLHEGMLSTKLTTQYRMNDAIASWASK 779 Query: 1071 EMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTG 892 EMY G LKS V SHLL DSP VK TWIT+CPLLLLDTRMP+GSLS GCEE LDPAGTG Sbjct: 780 EMYGGSLKSFPQVASHLLVDSPFVKPTWITRCPLLLLDTRMPYGSLSTGCEEHLDPAGTG 839 Query: 891 SFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATID 712 SFYNEGEADIVV+HV +L+Y+GV P I VQSPYV+QVQLLRDRL+E P++TGVEVATID Sbjct: 840 SFYNEGEADIVVKHVLSLVYSGVSPVAIAVQSPYVAQVQLLRDRLDEIPVTTGVEVATID 899 Query: 711 SFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLAR 532 SFQGREADAV+ISMVRSNN+GAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNTFLAR Sbjct: 900 SFQGREADAVIISMVRSNNMGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLAR 959 Query: 531 LLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 LLRHIRYFG VK+AEPG GG+GL M+PMLP+ + Sbjct: 960 LLRHIRYFGHVKNAEPGSFGGFGLGMDPMLPTAN 993 >ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Gossypium raimondii] gb|KJB44363.1| hypothetical protein B456_007G248100 [Gossypium raimondii] Length = 1003 Score = 1399 bits (3621), Expect = 0.0 Identities = 699/886 (78%), Positives = 778/886 (87%) Frame = -2 Query: 3087 TNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMA 2908 T V E KQ++ +K +VR LYQNGDPLGRRDLGK VV WI +GMKAMA Sbjct: 118 TRTNILVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVWWISEGMKAMA 177 Query: 2907 LDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHF 2728 DFA AE QG+F +L+QRMGPGLTFVIQAQPYLN+VPMPLG+EAICLK CTHYPTLFDHF Sbjct: 178 SDFASAELQGEFLELRQRMGPGLTFVIQAQPYLNSVPMPLGLEAICLKACTHYPTLFDHF 237 Query: 2727 QRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNID 2548 QRELR+VL +LQ +++ +W+ETESWKLLKELA SAQHRAIARK + K V GVLG++++ Sbjct: 238 QRELRNVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLE 297 Query: 2547 KAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQS 2368 KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEEL+AVPT DE S S KP EFLVSH Q+ Sbjct: 298 KAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQA 357 Query: 2367 EQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGAT 2188 +QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRI DSRGAGAT Sbjct: 358 QQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGAT 417 Query: 2187 SCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXX 2008 SC+QGFV+NLGDDGCSISVALES HGDPTFSKLFGK++RIDRI GLADALTYERNCEA Sbjct: 418 SCIQGFVDNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALM 477 Query: 2007 XXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAI 1828 KN SIAVV T+F D ED+ W E+N++ DW+ AEL+GLL +D SQQRAI Sbjct: 478 LLQKNGLQKKNPSIAVVATLFADKEDVEWLEENDLADWSPAELDGLLQNGTFDDSQQRAI 537 Query: 1827 ALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIG 1648 ALGLNKKRPV+++QGPPGTGKTG+LK++I++A +QGERVLVTAPTNAAVDN+VEKLS+ G Sbjct: 538 ALGLNKKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTG 597 Query: 1647 ANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGI 1468 NIVRVGNPARIS AVASKSLVEIVN +LAD+R+EFERKKSDLRKDL HCL+DDSLAAGI Sbjct: 598 LNIVRVGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGI 657 Query: 1467 RQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIE 1288 RQLLKQLGK +KKKE+ET+RE+LS+A VVL+TN GAADP+IR L++FDLVVIDEAGQAIE Sbjct: 658 RQLLKQLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIE 717 Query: 1287 PSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQY 1108 PSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLG+S LERA+TLHEGVLAT L TQY Sbjct: 718 PSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGVLATMLATQY 777 Query: 1107 RMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSV 928 RMNDAIASWASKEMY+G LKSS V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSV Sbjct: 778 RMNDAIASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSV 837 Query: 927 GCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEF 748 GCEE LD AGTGSF+NEGEADIVVQHV LIYAGV P+ I VQSPYV+QVQLLRDRL+EF Sbjct: 838 GCEEHLDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEF 897 Query: 747 PLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICD 568 P + G+EVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA++CD Sbjct: 898 PEADGIEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCD 957 Query: 567 SSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 SSTICHNTFLARLLRHIRY GRVKHAEPG SGG GL M+PMLPS+S Sbjct: 958 SSTICHNTFLARLLRHIRYVGRVKHAEPGASGGSGLGMDPMLPSIS 1003 >ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Ricinus communis] gb|EEF38380.1| DNA-binding protein smubp-2, putative [Ricinus communis] Length = 989 Score = 1399 bits (3621), Expect = 0.0 Identities = 700/890 (78%), Positives = 781/890 (87%), Gaps = 2/890 (0%) Frame = -2 Query: 3093 NNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKA 2914 N K AVSEE+ + +VN V++L+QNGDPLG++DLGK VVKWI +GM+A Sbjct: 108 NTDGGKLAVSEEREEKVKMKVN--------VKSLHQNGDPLGKKDLGKTVVKWISQGMRA 159 Query: 2913 MALDFALAETQGDFADLKQRMG--PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2740 MA DFA AETQG+F +L+QRM GLTFVIQAQPY+NAVP+PLG EA+CLK C HYPTL Sbjct: 160 MAADFASAETQGEFLELRQRMDLEAGLTFVIQAQPYINAVPIPLGFEALCLKACIHYPTL 219 Query: 2739 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2560 FDHFQRELRDVL DLQ K L+ +W+ TESWKLLKELA S QHRA+ARK S K + GVLG Sbjct: 220 FDHFQRELRDVLQDLQRKGLVQDWQNTESWKLLKELANSVQHRAVARKVSKPKPLQGVLG 279 Query: 2559 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2380 +N+DKAKAIQ RIDEFTK MS+LL+IERD+ELEFTQEELNAVPTPDE+S KP EFLVS Sbjct: 280 MNLDKAKAIQSRIDEFTKTMSELLQIERDSELEFTQEELNAVPTPDENSDPSKPIEFLVS 339 Query: 2379 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2200 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG Sbjct: 340 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 399 Query: 2199 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2020 AGATSCMQGFVNNLG+DGCSISVALES HGDPTFSKLFGK +RIDRI GLADALTYERNC Sbjct: 400 AGATSCMQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKGVRIDRIHGLADALTYERNC 459 Query: 2019 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1840 EA KN SIA+V T+FGD+ED+AW E+ ++ +W EA+++G +E +D SQ Sbjct: 460 EALMLLQKNGLQKKNPSIAIVATLFGDSEDLAWLEEKDLAEWNEADMDGCFGSERFDDSQ 519 Query: 1839 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1660 +RA+ALGLN+KRP+LIIQGPPGTGK+G+LK+LI AV QGERVLVTAPTNAAVDNMVEKL Sbjct: 520 RRAMALGLNQKRPLLIIQGPPGTGKSGLLKELIVRAVHQGERVLVTAPTNAAVDNMVEKL 579 Query: 1659 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1480 S+IG +IVRVGNPARIS AVASKSL EIVN +LA FR EFERKKSDLRKDL HCL DDSL Sbjct: 580 SNIGLDIVRVGNPARISSAVASKSLSEIVNSKLATFRMEFERKKSDLRKDLRHCLEDDSL 639 Query: 1479 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1300 AAGIRQLLKQLGKTMKKKE+E+++E+LSSA VVLATN GAADP+IR L++FDLVVIDEAG Sbjct: 640 AAGIRQLLKQLGKTMKKKEKESVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAG 699 Query: 1299 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1120 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLH+GVLA +L Sbjct: 700 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHDGVLALQL 759 Query: 1119 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 940 TTQYRMNDAIASWASKEMY GLLKSS+ V SHLL SP VK TWITQCPLLLLDTRMP+G Sbjct: 760 TTQYRMNDAIASWASKEMYGGLLKSSSKVASHLLVHSPFVKPTWITQCPLLLLDTRMPYG 819 Query: 939 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 760 SL +GCEE LDPAGTGSFYNEGEA+IVVQHV +LIYAGVRP+TI VQSPYV+QVQLLRDR Sbjct: 820 SLFIGCEEHLDPAGTGSFYNEGEAEIVVQHVISLIYAGVRPTTIAVQSPYVAQVQLLRDR 879 Query: 759 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 580 L+E P + GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRAR+HVA Sbjct: 880 LDELPEADGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARRHVA 939 Query: 579 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 430 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG GG GL M+PMLPS+S Sbjct: 940 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGSFGGSGLGMDPMLPSIS 989 >ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana attenuata] gb|OIT40020.1| regulator of nonsense transcripts 1-like protein [Nicotiana attenuata] Length = 980 Score = 1399 bits (3620), Expect = 0.0 Identities = 709/974 (72%), Positives = 816/974 (83%), Gaps = 9/974 (0%) Frame = -2 Query: 3324 KMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPISHRVWXX 3169 KME+ C CG +S S L + + R + S++L + KNR+FL S IS + Sbjct: 7 KMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPNYNI 66 Query: 3168 XXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKD-GPTSVRAL 2992 R + S +KT Q+ +E+D GP +VRAL Sbjct: 67 QASSSSGTKSLSPRRRKPKNVKTSQIPAVTTKGSVVKKTEKIQECSQEERDSGPVNVRAL 126 Query: 2991 YQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPY 2812 +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQAQPY Sbjct: 127 NENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQAQPY 186 Query: 2811 LNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKEL 2632 LNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+L+ +WR+TESWKLLK+L Sbjct: 187 LNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSLVQDWRDTESWKLLKDL 246 Query: 2631 ATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQ 2452 A+SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+ELEFTQ Sbjct: 247 ASSAQHKAIARKTSQRKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELEFTQ 306 Query: 2451 EELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEG 2272 EELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF++EG Sbjct: 307 EELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFKLEG 366 Query: 2271 NHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSK 2092 NHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD TFSK Sbjct: 367 NHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDSTFSK 426 Query: 2091 LFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFED 1912 LFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+AW E+ Sbjct: 427 LFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFLKKNPSVAVVATLFGDKEDLAWLEE 486 Query: 1911 NNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIA 1732 N M DW+E EL D + +D SQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LIS+A Sbjct: 487 NGMADWSEVELPDSTDRKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELISLA 546 Query: 1731 VKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADF 1552 VKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN +LADF Sbjct: 547 VKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLAEIVNTKLADF 606 Query: 1551 RSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLAT 1372 R+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VVLAT Sbjct: 607 RAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVVLAT 666 Query: 1371 NIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKAL 1192 NIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSRKAL Sbjct: 667 NIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSRKAL 726 Query: 1191 EGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSD 1012 EGGLGVS LERA+ LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHLL D Sbjct: 727 EGGLGVSLLERAAGLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHLLVD 786 Query: 1011 SPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIY 832 SP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+LIY Sbjct: 787 SPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFSLIY 846 Query: 831 AGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNL 652 +GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRSNNL Sbjct: 847 SGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMVRSNNL 906 Query: 651 GAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSG 472 GAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG Sbjct: 907 GAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPGSFW 966 Query: 471 GYGLSMNPMLPSVS 430 +GL M+PMLP+ S Sbjct: 967 EFGLGMDPMLPTAS 980