BLASTX nr result
ID: Rehmannia31_contig00002932
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00002932
         (3628 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum]  1646  0.0
ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Eryt...  1587  0.0
gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythra...  1535  0.0
gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrola...  1482  0.0
gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrola...  1423  0.0
ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theo...  1422  0.0
gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus c...  1420  0.0
gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum]  1417  0.0
gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olito...  1417  0.0
ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibe...  1416  0.0
ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbrat...  1415  0.0
ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Caps...  1411  0.0
gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense]  1409  0.0
gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum]  1409  0.0
ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico...  1401  0.0
ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like ...  1400  0.0
ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipom...
1399 0.0 ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nico... 1399 0.0 ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Goss... 1399 0.0 ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Rici... 1399 0.0 >ref|XP_011075757.1| DNA-binding protein SMUBP-2 [Sesamum indicum] Length = 964 Score = 1646 bits (4263), Expect = 0.0 Identities = 838/968 (86%), Positives = 885/968 (91%), Gaps = 4/968 (0%) Frame = -1 Query: 3316 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3137 MEASCIFCGGVS S+LKS +RHRP ESISLY N+N +F++SPISHRVW Sbjct: 1 MEASCIFCGGVSTSLLKSPALRHRPIESISLYRNRNLVFVASPISHRVWASANNSSNSRS 60 Query: 3136 XXXXXR----EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDP 2969 ED G+DV+N NTN KAAVSEE TR K VND+++GP SVRALYQ+GDP Sbjct: 61 ATKRRSRKNREDAGGSDVTNKNTNKKAAVSEE-TRKK---VNDQENGPRSVRALYQSGDP 116 Query: 2968 LGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPM 2789 LGRR+LGKGVVKWI +GMKAMALDFA+ E QGDFA+LKQRMGPGLTFVIQAQPYLNAVPM Sbjct: 117 LGRRELGKGVVKWICQGMKAMALDFAMVEMQGDFAELKQRMGPGLTFVIQAQPYLNAVPM 176 Query: 2788 PLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQH 2609 PLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQHKTLIHNWRETESWKLLKELA+SAQH Sbjct: 177 PLGLEAICLKTCTHYPTLFDHFQRELRDVLQDLQHKTLIHNWRETESWKLLKELASSAQH 236 Query: 2608 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2429 RAIARKTSL+KSVHGVLGL + KAKA+QCRIDEFTK MSDLLRIERDAELEFTQ+ELNAV Sbjct: 237 RAIARKTSLTKSVHGVLGLELVKAKAMQCRIDEFTKQMSDLLRIERDAELEFTQDELNAV 296 Query: 2428 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2249 PTPD+ S+S +P EFLVSHAQ+EQELCDTICNLNAISTSTGLGGMHLVLFRVE NHRLPP Sbjct: 297 PTPDDLSSSSRPIEFLVSHAQAEQELCDTICNLNAISTSTGLGGMHLVLFRVERNHRLPP 356 Query: 2248 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2069 TNLSPGDMVCVR+CD RGAGATS MQGFVNNLGDDGCSISVALES HGDPTFSKLFGK+I Sbjct: 357 TNLSPGDMVCVRVCDKRGAGATSSMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKSI 416 Query: 2068 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1889 
RIDRIQGLADA+TYERNCEA KNSS AVVTTIFGD EDI FE NN+VDW Sbjct: 417 RIDRIQGLADAITYERNCEALMMLQKKGLQKKNSSRAVVTTIFGDKEDITRFEGNNLVDW 476 Query: 1888 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1709 +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQ+IS+ VKQGER Sbjct: 477 SEVELSGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQIISLVVKQGER 536 Query: 1708 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1529 VLVTAPTNAAVDNMVEKLS+IGANIVRVGNPARISP VASKSLVEIVN RL DFRSEFER Sbjct: 537 VLVTAPTNAAVDNMVEKLSEIGANIVRVGNPARISPTVASKSLVEIVNSRLGDFRSEFER 596 Query: 1528 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1349 KKSDLRKDLS+CL+DDSLAAGIRQLLKQLGKTMKKKERET+REILSSA VVL TNIGAAD Sbjct: 597 KKSDLRKDLSYCLKDDSLAAGIRQLLKQLGKTMKKKERETVREILSSAQVVLTTNIGAAD 656 Query: 1348 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1169 PMIR LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV Sbjct: 657 PMIRCLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 716 Query: 1168 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 989 S LERA+TLHEGVLATKLT QYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK Sbjct: 717 SLLERAATLHEGVLATKLTIQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKQ 776 Query: 988 TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 809 TWITQCPLLLLDTRMP+GSL+VGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGV P+ Sbjct: 777 TWITQCPLLLLDTRMPYGSLTVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVSPA 836 Query: 808 TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 629 TIVVQSPYV+QVQLLRDRLEEFPLSTGVEVAT+DSFQGREADAV+ISMVRSNNLGAVGFL Sbjct: 837 TIVVQSPYVAQVQLLRDRLEEFPLSTGVEVATVDSFQGREADAVIISMVRSNNLGAVGFL 896 Query: 628 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 449 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GLSM Sbjct: 897 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGDSGGSGLSM 956 Query: 448 NPMLPSVS 425 NPMLPS+S Sbjct: 957 NPMLPSIS 964 >ref|XP_012850649.1| PREDICTED: DNA-binding protein SMUBP-2 [Erythranthe guttata] Length = 961 Score = 
1587 bits (4109), Expect = 0.0 Identities = 809/968 (83%), Positives = 880/968 (90%), Gaps = 4/968 (0%) Frame = -1 Query: 3316 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3137 MEA CI CGGVSAS+LKS +R S+S+ LY +K R+FL SPISHR+ Sbjct: 1 MEALCISCGGVSASLLKSPVVR---SDSVYLYRHKKRVFLGSPISHRILSTARNNSSGSA 57 Query: 3136 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEK-DGPTSVRALYQNG-DPLG 2963 ++ +G + +++++ +V+EE+ R KQQQ+N+ K +GPTSVR+LYQNG DPLG Sbjct: 58 TKRRSNKNKQGKN-NSSDSGVPVSVTEEEMRNKQQQINEGKRNGPTSVRSLYQNGGDPLG 116 Query: 2962 RRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGP-GLTFVIQAQPYLNAVPMP 2786 RRDLGKGVVKWI +GMKAMAL+FA AE QG+FA+LKQ+MGP GLTFVIQAQPYLNAVPMP Sbjct: 117 RRDLGKGVVKWISQGMKAMALEFARAEMQGEFAELKQQMGPAGLTFVIQAQPYLNAVPMP 176 Query: 2785 LGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIH-NWRETESWKLLKELATSAQH 2609 +G+EAICLKTCTHYPTLFDHFQRELRD+L DLQHK+LI W +T+SWKLLK+LA SAQH Sbjct: 177 VGLEAICLKTCTHYPTLFDHFQRELRDILQDLQHKSLIPLTWHQTQSWKLLKDLANSAQH 236 Query: 2608 RAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAV 2429 RA+ARK LSKS+HG L+IDK K+IQCRID+FT+HMS LLRIERD+ELEFT+EELNAV Sbjct: 237 RAVARKAPLSKSLHG---LSIDKTKSIQCRIDKFTEHMSHLLRIERDSELEFTEEELNAV 293 Query: 2428 PTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPP 2249 PTPDEHSTSPKP EFLVSHAQ+EQELCDTICNLNAISTS GLGGMHLVLFR EGNHRLPP Sbjct: 294 PTPDEHSTSPKPIEFLVSHAQAEQELCDTICNLNAISTSIGLGGMHLVLFRAEGNHRLPP 353 Query: 2248 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNI 2069 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALES HGDPTFSKLFGKNI Sbjct: 354 TNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESRHGDPTFSKLFGKNI 413 Query: 2068 RIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDW 1889 RIDRIQGLADALTYERNCEA +NSS+AVVTTIFGD EDIAWFEDN++VDW Sbjct: 414 RIDRIQGLADALTYERNCEALMMLQKKGLQKQNSSVAVVTTIFGDKEDIAWFEDNDLVDW 473 Query: 1888 AEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGER 1709 +E EL+GLLDTEFYD+SQQRAIALGLNKKRPVLIIQGPPG GKTGVLKQLIS+ VK+GER Sbjct: 474 
SEVELDGLLDTEFYDSSQQRAIALGLNKKRPVLIIQGPPGAGKTGVLKQLISLVVKRGER 533 Query: 1708 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFER 1529 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LAD++SEF R Sbjct: 534 VLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNSKLADYKSEFGR 593 Query: 1528 KKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAAD 1349 KKS+LRKDLSHCL+DDSLAAGIRQLLKQLGK +KKKERET++EILSSA VVLATNIGAAD Sbjct: 594 KKSNLRKDLSHCLKDDSLAAGIRQLLKQLGKAIKKKERETVKEILSSAQVVLATNIGAAD 653 Query: 1348 PMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 1169 PMIR L+SFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV Sbjct: 654 PMIRSLDSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGV 713 Query: 1168 SFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKS 989 S LERASTLHEGV ATKLTTQYRMNDAIASWASKEMYNGLLKSSASV SHLLSDSPLVK Sbjct: 714 SLLERASTLHEGVFATKLTTQYRMNDAIASWASKEMYNGLLKSSASVTSHLLSDSPLVKP 773 Query: 988 TWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPS 809 TWITQCPLLLLDTRMP+GSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRP+ Sbjct: 774 TWITQCPLLLLDTRMPYGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPA 833 Query: 808 TIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFL 629 +IVVQSPYV+QVQLLRDRLEEFP++ GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFL Sbjct: 834 SIVVQSPYVAQVQLLRDRLEEFPITKGVEVATIDSFQGREADAVIISMVRSNNLGAVGFL 893 Query: 628 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSM 449 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGG GL+M Sbjct: 894 GDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGSGLAM 953 Query: 448 NPMLPSVS 425 NPMLPS+S Sbjct: 954 NPMLPSLS 961 >gb|EYU44882.1| hypothetical protein MIMGU_mgv1a001152mg [Erythranthe guttata] Length = 876 Score = 1535 bits (3973), Expect = 0.0 Identities = 775/878 (88%), Positives = 827/878 (94%), Gaps = 4/878 (0%) Frame = -1 Query: 3046 RMKQQQVNDEK-DGPTSVRALYQNG-DPLGRRDLGKGVVKWIGKGMKAMALDFALAETQG 2873 R KQQQ+N+ K +GPTSVR+LYQNG DPLGRRDLGKGVVKWI +GMKAMAL+FA AE QG Sbjct: 2 
RNKQQQINEGKRNGPTSVRSLYQNGGDPLGRRDLGKGVVKWISQGMKAMALEFARAEMQG 61 Query: 2872 DFADLKQRMGP-GLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLL 2696 +FA+LKQ+MGP GLTFVIQAQPYLNAVPMP+G+EAICLKTCTHYPTLFDHFQRELRD+L Sbjct: 62 EFAELKQQMGPAGLTFVIQAQPYLNAVPMPVGLEAICLKTCTHYPTLFDHFQRELRDILQ 121 Query: 2695 DLQHKTLIH-NWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCR 2519 DLQHK+LI W +T+SWKLLK+LA SAQHRA+ARK LSKS+HG L+IDK K+IQCR Sbjct: 122 DLQHKSLIPLTWHQTQSWKLLKDLANSAQHRAVARKAPLSKSLHG---LSIDKTKSIQCR 178 Query: 2518 IDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTI 2339 ID+FT+HMS LLRIERD+ELEFT+EELNAVPTPDEHSTSPKP EFLVSHAQ+EQELCDTI Sbjct: 179 IDKFTEHMSHLLRIERDSELEFTEEELNAVPTPDEHSTSPKPIEFLVSHAQAEQELCDTI 238 Query: 2338 CNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 2159 CNLNAISTS GLGGMHLVLFR EGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN Sbjct: 239 CNLNAISTSIGLGGMHLVLFRAEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVN 298 Query: 2158 NLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXX 1979 NLGDDGCSISVALES HGDPTFSKLFGKNIRIDRIQGLADALTYERNCEA Sbjct: 299 NLGDDGCSISVALESRHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEALMMLQKKGLQ 358 Query: 1978 XKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKR 1799 +NSS+AVVTTIFGD EDIAWFEDN++VDW+E EL+GLLDTEFYD+SQQRAIALGLNKKR Sbjct: 359 KQNSSVAVVTTIFGDKEDIAWFEDNDLVDWSEVELDGLLDTEFYDSSQQRAIALGLNKKR 418 Query: 1798 PVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 1619 PVLIIQGPPG GKTGVLKQLIS+ VK+GERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN Sbjct: 419 PVLIIQGPPGAGKTGVLKQLISLVVKRGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGN 478 Query: 1618 PARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLG 1439 PARISPAVASKSLVEIVN +LAD++SEF RKKS+LRKDLSHCL+DDSLAAGIRQLLKQLG Sbjct: 479 PARISPAVASKSLVEIVNSKLADYKSEFGRKKSNLRKDLSHCLKDDSLAAGIRQLLKQLG 538 Query: 1438 KTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPIL 1259 K +KKKERET++EILSSA VVLATNIGAADPMIR L+SFDLVVIDEAGQAIEPSCWIPIL Sbjct: 539 KAIKKKERETVKEILSSAQVVLATNIGAADPMIRSLDSFDLVVIDEAGQAIEPSCWIPIL 598 Query: 1258 
LGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIAS 1079 LGKRCILAGDQCQLAPVILSRKALEGGLGVS LERASTLHEGV ATKLTTQYRMNDAIAS Sbjct: 599 LGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERASTLHEGVFATKLTTQYRMNDAIAS 658 Query: 1078 WASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDP 899 WASKEMYNGLLKSSASV SHLLSDSPLVK TWITQCPLLLLDTRMP+GSLSVGCEEQLDP Sbjct: 659 WASKEMYNGLLKSSASVTSHLLSDSPLVKPTWITQCPLLLLDTRMPYGSLSVGCEEQLDP 718 Query: 898 AGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEV 719 AGTGSFYNEGEADIVVQHVFALIYAGVRP++IVVQSPYV+QVQLLRDRLEEFP++ GVEV Sbjct: 719 AGTGSFYNEGEADIVVQHVFALIYAGVRPASIVVQSPYVAQVQLLRDRLEEFPITKGVEV 778 Query: 718 ATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 539 ATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT Sbjct: 779 ATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNT 838 Query: 538 FLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 FLARLLRHIRYFGRVKHAEPGGSGG GL+MNPMLPS+S Sbjct: 839 FLARLLRHIRYFGRVKHAEPGGSGGSGLAMNPMLPSLS 876 >gb|KZV41087.1| P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Dorcoceras hygrometricum] Length = 939 Score = 1482 bits (3836), Expect = 0.0 Identities = 754/964 (78%), Positives = 829/964 (85%) Frame = -1 Query: 3316 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKNRLFLSSPISHRVWXXXXXXXXXXX 3137 ME+SCI CGGVS + KS G P ES S Y NR+ + S I +W Sbjct: 1 MESSCICCGGVSTLLYKSPGNGRHPDESFSPY---NRVLIGSRIPRSIWASASTKRR--- 54 Query: 3136 XXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRR 2957 K V +K + +Q+ D++ S+ +QNGDPLGR+ Sbjct: 55 -------------TGGKKKEEKVGVVPKKKLGQPRQLGDQR----SLLTEHQNGDPLGRK 97 Query: 2956 DLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGM 2777 DLGK V+KWI +GMK+MAL A AE QGD ++ KQRMGPGLTFVI+AQPYLNAVPMP G+ Sbjct: 98 DLGKNVMKWICQGMKSMALAIAKAEMQGDLSEFKQRMGPGLTFVIEAQPYLNAVPMPPGL 157 Query: 2776 EAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIA 2597 EAICLKTCTHYPTLFDHFQRELRDVL DLQ ++LI +WRETESWKLLKELA SAQHRAIA Sbjct: 158 
EAICLKTCTHYPTLFDHFQRELRDVLQDLQQQSLIVDWRETESWKLLKELANSAQHRAIA 217 Query: 2596 RKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPD 2417 RKT LS +HGVLG++++K KAIQ RIDE T+ MS+LLR+ERDAELEFTQEELNAVPTPD Sbjct: 218 RKTPLS--LHGVLGMDLNKVKAIQRRIDELTQQMSELLRVERDAELEFTQEELNAVPTPD 275 Query: 2416 EHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLS 2237 E+S+S KPTEFLVSHAQ EQE+CDTICNLNA+STS GLGGMHLVLF+ EGN+RLPPTNLS Sbjct: 276 ENSSSRKPTEFLVSHAQVEQEMCDTICNLNAVSTSIGLGGMHLVLFKAEGNNRLPPTNLS 335 Query: 2236 PGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDR 2057 PGDMVCVRICDSRGAGATSC+QGFVNNLG+DGCSISVALES HGDPTFSKLFGKNIRIDR Sbjct: 336 PGDMVCVRICDSRGAGATSCLQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKNIRIDR 395 Query: 2056 IQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAE 1877 IQGLAD LTYERNCEA KN SI VV T+FGD ED+ W EDN +VDWAE E Sbjct: 396 IQGLADTLTYERNCEALMMLQKKGLHKKNPSITVVATVFGDKEDVVWLEDNKLVDWAEME 455 Query: 1876 LNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVT 1697 L LLDTE YD SQQRAIALGLNKKRP+LIIQGPPGTGKT VLK+LIS+ V+QGERVLVT Sbjct: 456 LGELLDTESYDASQQRAIALGLNKKRPMLIIQGPPGTGKTVVLKELISLVVEQGERVLVT 515 Query: 1696 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSD 1517 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVN +LADF+SEFERKKSD Sbjct: 516 APTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNAKLADFKSEFERKKSD 575 Query: 1516 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIR 1337 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERET+RE+LSSA VVLATNIGAADP+IR Sbjct: 576 LRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETVREVLSSAQVVLATNIGAADPLIR 635 Query: 1336 WLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLE 1157 LN FDLVVIDEAGQAIEPSCWIPILLGKRCILAGD+CQLAPVILSR+ALEGGLGVS LE Sbjct: 636 LLNFFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDKCQLAPVILSRRALEGGLGVSLLE 695 Query: 1156 RASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWIT 977 RA TLHEGVL+T+LTTQYRMNDAIASWASKEMY+G L+SS+ V SHLLSDSP VK TWIT Sbjct: 696 RAETLHEGVLSTQLTTQYRMNDAIASWASKEMYDGTLESSSRVTSHLLSDSPFVKQTWIT 755 Query: 976 
QCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVV 797 QCPLLLLDTR+P+GSLS+GCEEQ+DPAGTGSFYNEGEADIVVQHV++LIYAGV P++IVV Sbjct: 756 QCPLLLLDTRLPYGSLSMGCEEQIDPAGTGSFYNEGEADIVVQHVYSLIYAGVIPASIVV 815 Query: 796 QSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSR 617 QSPYV+QVQLLRDRLEEFP++TGVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSR Sbjct: 816 QSPYVAQVQLLRDRLEEFPITTGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSR 875 Query: 616 RMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPML 437 RMNVAITRARKHVAI+CDSSTICHNTFLARLLRHIRY+GRVKHA+PGG GG GLSM PML Sbjct: 876 RMNVAITRARKHVAIVCDSSTICHNTFLARLLRHIRYYGRVKHADPGGYGGTGLSMTPML 935 Query: 436 PSVS 425 PS+S Sbjct: 936 PSLS 939 >gb|EOY10295.1| P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] Length = 1008 Score = 1423 bits (3684), Expect = 0.0 Identities = 707/890 (79%), Positives = 786/890 (88%) Frame = -1 Query: 3094 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2915 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK V++WI +GM Sbjct: 119 SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178 Query: 2914 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 KAMA DF AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 179 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELR++L +LQ +++ +WRETESWKLLKELA SAQHRAIARK + K V GVLG Sbjct: 239 FDHFQRELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 299 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 359 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418 Query: 2194 
AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 419 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 479 EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV APTNAAVDNMVEKL Sbjct: 539 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 599 SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 659 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L Sbjct: 719 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 779 TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 839 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA Sbjct: 899 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 
++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 959 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008 >ref|XP_017977299.1| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] ref|XP_007029793.2| PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] Length = 1008 Score = 1422 bits (3680), Expect = 0.0 Identities = 706/890 (79%), Positives = 786/890 (88%) Frame = -1 Query: 3094 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2915 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK V++WI +GM Sbjct: 119 SSSCSSTKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGM 178 Query: 2914 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 KAMA DF AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 179 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 238 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELR++L +LQ +++ +WR+TESWKLLKELA SAQHRAIARK + K V GVLG Sbjct: 239 FDHFQRELRNILQELQQNSVVEDWRKTESWKLLKELANSAQHRAIARKITQPKPVQGVLG 298 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 299 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 358 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 359 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRG 418 Query: 2194 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 419 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 478 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 479 EALMLLQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQ 538 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLV 
APTNAAVDNMVEKL Sbjct: 539 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKL 598 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 S+IG NIVRVGNPARIS AVASKSL EIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 599 SNIGLNIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 658 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 659 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 718 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+T+HEGVLAT L Sbjct: 719 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATML 778 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 TTQYRMNDAIA WASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 779 TTQYRMNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 838 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 839 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 898 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVA+TRARKHVA Sbjct: 899 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVA 958 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 959 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1008 >gb|OMO99192.1| putative DNA-binding protein smubp-2 [Corchorus capsularis] Length = 1011 Score = 1420 bits (3677), Expect = 0.0 Identities = 708/890 (79%), Positives = 786/890 (88%) Frame = -1 Query: 3094 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2915 ++N + K V E K+ Q +K +VR LYQNGDPLGR+DLGK V++WI +GM Sbjct: 122 NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181 Query: 2914 
KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL Sbjct: 182 RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++ K V GVLG Sbjct: 242 FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELANSAQHRAIARKSTQPKPVQGVLG 301 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S KP EFLVS Sbjct: 302 MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG Sbjct: 362 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421 Query: 2194 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC Sbjct: 422 AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIAVV T+FGD ED+ W E N++ DW E +L+GLL +D SQ Sbjct: 482 EALMLLQKNGLQKKNPSIAVVATLFGDKEDMDWLEKNDLADWNETKLDGLLQNGIFDDSQ 541 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 ++AIALGLNKKRPVL++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 542 RKAIALGLNKKRPVLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL CL+DDSL Sbjct: 602 SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG Sbjct: 662 AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPIL 
GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L Sbjct: 722 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 TTQYRMNDAIA WASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 782 TTQYRMNDAIAGWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P TI VQSPYV+QVQLLRDR Sbjct: 842 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKTIAVQSPYVAQVQLLRDR 901 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 902 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 962 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011 >gb|PHT30198.1| hypothetical protein CQW23_30230 [Capsicum baccatum] Length = 989 Score = 1417 bits (3667), Expect = 0.0 Identities = 724/984 (73%), Positives = 823/984 (83%), Gaps = 18/984 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXX 3164 K KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 4 KLLKMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATAS 63 Query: 3163 XXXXXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVN 3023 ++ G G +V N+ ++ KA + R QQQ Sbjct: 64 SSGGTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQEC 123 Query: 3022 DEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMG 2843 ++ GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM Sbjct: 124 IQEGGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRME 183 Query: 2842 PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNW 2663 PGLTFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +W Sbjct: 184 PGLTFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDW 243 Query: 2662 
RETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLL 2483 R+TESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL Sbjct: 244 RDTESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLL 303 Query: 2482 RIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGL 2303 IERDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GL Sbjct: 304 HIERDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGL 363 Query: 2302 GGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVA 2123 GGMHLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+A Sbjct: 364 GGMHLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLA 423 Query: 2122 LESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTI 1943 LESL GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KNSS+AVV T+ Sbjct: 424 LESLQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNSSVAVVATL 483 Query: 1942 FGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTG 1763 FGDNED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTG Sbjct: 484 FGDNEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTG 543 Query: 1762 KTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKS 1583 KTG+LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKS Sbjct: 544 KTGLLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKS 603 Query: 1582 LVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIR 1403 L EIVN +L+DF SE ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++ Sbjct: 604 LAEIVNNKLSDFLSEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVK 663 Query: 1402 EILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQC 1223 EILS+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ Sbjct: 664 EILSTAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQF 723 Query: 1222 QLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLK 1043 QLAPVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L Sbjct: 724 QLAPVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLT 783 Query: 1042 SSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEA 863 SS +V SHLL DSP VK 
TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEA Sbjct: 784 SSPTVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEA 843 Query: 862 DIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREAD 683 DIVVQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREAD Sbjct: 844 DIVVQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREAD 903 Query: 682 AVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYF 503 AV+ISMVRSNNLGAVGFLGD+RRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYF Sbjct: 904 AVIISMVRSNNLGAVGFLGDNRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYF 963 Query: 502 GRVKHAEPGGSGGYGLSMNPMLPS 431 G+VKH EPG +GL M+PMLP+ Sbjct: 964 GKVKHVEPGSFWEFGLGMDPMLPT 987 >gb|OMO56477.1| hypothetical protein COLO4_35630 [Corchorus olitorius] Length = 1011 Score = 1417 bits (3667), Expect = 0.0 Identities = 707/890 (79%), Positives = 785/890 (88%) Frame = -1 Query: 3094 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2915 ++N + K V E K+ Q +K +VR LYQNGDPLGR+DLGK V++WI +GM Sbjct: 122 NSNVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGM 181 Query: 2914 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 +AMALDFA AE QG+F +L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAI LK CTHYPTL Sbjct: 182 RAMALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTL 241 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELR+VL +LQ K+++ +WRETESWK+LKELA SAQHRAIARK++ K V GVLG Sbjct: 242 FDHFQRELRNVLQELQQKSMVEDWRETESWKMLKELAHSAQHRAIARKSTQPKPVQGVLG 301 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 ++++K KA+Q RIDEFTK MS+LL+IERDAELEFTQEELNAVPTPDE S KP EFLVS Sbjct: 302 MDLEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVS 361 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRICD+RG Sbjct: 362 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRG 421 Query: 2194 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 
AGAT+CMQGFV+NLG+DGCSISVALES HGDPTFSKLFGK +RIDRIQGLADALTYERNC Sbjct: 422 AGATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNC 481 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIAVV T+FGD ED+ W E N++ DW E L+GLL +D SQ Sbjct: 482 EALMLLQKNGLQKKNLSIAVVATLFGDKEDMDWLEKNDLADWNETMLDGLLQNGIFDDSQ 541 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 ++AIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 542 RKAIALGLNKKRPLLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKL 601 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 SD G NIVRVGNPARIS AVASKSLVEIVN +LA+FR+EFERKKSDLRKDL CL+DDSL Sbjct: 602 SDTGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSL 661 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGKT+KKKE+ET+REILSSA VVL+TN GAADP+IR L +FDLVVIDEAG Sbjct: 662 AAGIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAG 721 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLHEGVL T L Sbjct: 722 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLL 781 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 TTQYRMNDAIASWASKEMYNG LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 782 TTQYRMNDAIASWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYG 841 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SLSVGCEE LDPAGTGSFYNEGEADIVVQHVF LIYAGV P I VQSPYV+QVQLLRDR Sbjct: 842 SLSVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKAIAVQSPYVAQVQLLRDR 901 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 902 LDEFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 961 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 962 
VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSIS 1011 >ref|XP_022718654.1| DNA-binding protein SMUBP-2-like [Durio zibethinus] Length = 1004 Score = 1416 bits (3665), Expect = 0.0 Identities = 703/898 (78%), Positives = 788/898 (87%) Frame = -1 Query: 3118 EDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGV 2939 ++G + + ++ K V E + +Q+Q +K +VR LYQNGDPLGRRDLGK V Sbjct: 107 DNGSSSKSTPELSSTKILVEELELLKEQKQEKVKKTKALNVRTLYQNGDPLGRRDLGKRV 166 Query: 2938 VKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLK 2759 V+WI +GMKAMA DF AE QG+F +L+Q M PGLTFVIQAQPYLNA+P+PLG+EAICLK Sbjct: 167 VRWISEGMKAMASDFVSAELQGEFLELRQMMEPGLTFVIQAQPYLNAIPIPLGLEAICLK 226 Query: 2758 TCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLS 2579 CTHYPTLFDHFQRELR+VL +LQH +++ +WRETESWKLLKELA S QHRAIARK +L Sbjct: 227 ACTHYPTLFDHFQRELRNVLQELQHNSVVEDWRETESWKLLKELANSVQHRAIARKITLP 286 Query: 2578 KSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP 2399 K + G+LG+ ++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTP+E S Sbjct: 287 KPIQGILGIGLEKAKAMQGRIDEFTKRMSELLRIERDAELEFTQEELNAVPTPNEGCDSI 346 Query: 2398 KPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVC 2219 KP EFLVSH Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVC Sbjct: 347 KPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVC 406 Query: 2218 VRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLAD 2039 VRICDSRGAGATSC+QGFV+NLG+DGCSISVALES HGDPTFSKLFGK++RIDRIQGLAD Sbjct: 407 VRICDSRGAGATSCIQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKSVRIDRIQGLAD 466 Query: 2038 ALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLD 1859 ALTYERNCEA KN SIAVV T+FGD ED+AW E+N++ DW + EL+G L Sbjct: 467 ALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKEDVAWLEENDLADWNQTELDGSLQ 526 Query: 1858 TEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAA 1679 +D SQQRAI LGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGE VLVTAPTNAA Sbjct: 527 NRTFDDSQQRAICLGLNKKRPMLVVQGPPGTGKTGLLKEVIALAVQQGETVLVTAPTNAA 586 Query: 1678 
VDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLS 1499 VDNMVEKLSD G +IVRVGNPARIS VASKSLVEIVN +LAD+R+EFERKKSDLRKDL Sbjct: 587 VDNMVEKLSDSGLDIVRVGNPARISSTVASKSLVEIVNSKLADYRAEFERKKSDLRKDLR 646 Query: 1498 HCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFD 1319 HCL+DDSLAAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR L++FD Sbjct: 647 HCLKDDSLAAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRLDTFD 706 Query: 1318 LVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLH 1139 LVVIDEAGQAIEPSCWIPIL GKRCILAGD+CQLAPVILSRKALEGGLGVS LERA+TLH Sbjct: 707 LVVIDEAGQAIEPSCWIPILKGKRCILAGDRCQLAPVILSRKALEGGLGVSLLERAATLH 766 Query: 1138 EGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLL 959 EGVLAT LTTQYRMNDAIASWASKEMYNG LKSS SV S+LL DSP VK TWITQCPLLL Sbjct: 767 EGVLATMLTTQYRMNDAIASWASKEMYNGELKSSPSVASYLLVDSPFVKPTWITQCPLLL 826 Query: 958 LDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVS 779 LDTRMP+GSLSVGCEE LDPAGTGSFYNEGE DIVVQHVF LIYAGV P+ I VQSPYV+ Sbjct: 827 LDTRMPYGSLSVGCEEHLDPAGTGSFYNEGETDIVVQHVFYLIYAGVSPTAIAVQSPYVA 886 Query: 778 QVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAI 599 QVQLLRDRL+EFP + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAI Sbjct: 887 QVQLLRDRLDEFPQTAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAI 946 Query: 598 TRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 TRARKHVA++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 947 TRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGASGGSGLGMDPMLPSIS 1004 >ref|XP_021282320.1| DNA-binding protein SMUBP-2 [Herrania umbratica] Length = 1009 Score = 1415 bits (3662), Expect = 0.0 Identities = 705/890 (79%), Positives = 782/890 (87%) Frame = -1 Query: 3094 SNNNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGM 2915 S++ ++ K V E Q+Q +K +VR LYQNGDPLGRRDLGK VV+WI +GM Sbjct: 120 SSSFSSTKIIVEELGLLKDQKQQKVKKTKAVNVRTLYQNGDPLGRRDLGKRVVRWISEGM 179 Query: 2914 KAMALDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 KAMA DF AE QG+F 
+L+QRMGPGLTFVIQAQPYLNA+P+PLG+EAICLK CTHYPTL Sbjct: 180 KAMASDFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTL 239 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELR+VL +LQ +++ +WRETESW LLKELA SAQHRAIARK K V GVLG Sbjct: 240 FDHFQRELRNVLQELQKNSVVEDWRETESWTLLKELANSAQHRAIARKIEQPKPVQGVLG 299 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 ++++KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEELNAVPTPDE S S KP EFLVS Sbjct: 300 MDLEKAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVS 359 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVL RVEGNHRLPPT LSPGDMVCVRICDSRG Sbjct: 360 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLLRVEGNHRLPPTTLSPGDMVCVRICDSRG 419 Query: 2194 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 AGATSCMQGFV+NLG+DGCSISVALES HGDPTFSK FGKN+RIDRIQGLADALTYERNC Sbjct: 420 AGATSCMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNC 479 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIAVV T+FGD ED+ W E N+ DW EA+L+GLL +D SQ Sbjct: 480 EALMLLQKNGLQKKNPSIAVVATLFGDTEDVTWLEKNSFADWNEAKLDGLLQNGIFDDSQ 539 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 QRAIALGLNKKRP+L++QGPPGTGKTG+LK++I++AV+QGERVLVTAPTNAAVDNMVEKL Sbjct: 540 QRAIALGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVTAPTNAAVDNMVEKL 599 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 S+ G NIVRVGNPARIS AVASKSLVEIVN +LAD+ +EFERKKSDLRKDL HCL+DDSL Sbjct: 600 SNTGLNIVRVGNPARISSAVASKSLVEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSL 659 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGK +KKKE+ET+RE+LSSA VVL+TN GAADP+IR +++FDLVVIDEAG Sbjct: 660 AAGIRQLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAG 719 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPI GKRCILAGDQCQLAPVILSRKAL+GGLGVS LERA+T+HEGVLAT L Sbjct: 720 
QAIEPSCWIPIFQGKRCILAGDQCQLAPVILSRKALDGGLGVSLLERAATMHEGVLATML 779 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 T+QYRMNDAIASWASKEMY+G LKSS SV SHLL DSP VK TWITQCPLLLLDTRMP+G Sbjct: 780 TSQYRMNDAIASWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYG 839 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SLSVGCEE LDP GTGSFYNEGEADIVVQHVF LIYAGV P+ I VQSPYV+QVQLLRDR Sbjct: 840 SLSVGCEEHLDPVGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDR 899 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+E P + GVEVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA Sbjct: 900 LDELPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVA 959 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG SGG GL M+PMLPS+S Sbjct: 960 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSIS 1009 >ref|XP_016564094.1| PREDICTED: DNA-binding protein SMUBP-2 [Capsicum annuum] Length = 989 Score = 1411 bits (3653), Expect = 0.0 Identities = 721/984 (73%), Positives = 821/984 (83%), Gaps = 18/984 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXX 3164 K KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 4 KLLKMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATAS 63 Query: 3163 XXXXXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVN 3023 ++ G G +V N+ ++ KA + R QQQ Sbjct: 64 SSGGTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQEC 123 Query: 3022 DEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMG 2843 ++ GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM Sbjct: 124 IQEGGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRME 183 Query: 2842 PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNW 2663 PGLTFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +W Sbjct: 184 PGLTFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDW 243 Query: 2662 RETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLL 2483 
R+TESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL Sbjct: 244 RDTESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLL 303 Query: 2482 RIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGL 2303 IERDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GL Sbjct: 304 HIERDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGL 363 Query: 2302 GGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVA 2123 GGMHLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+A Sbjct: 364 GGMHLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLA 423 Query: 2122 LESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTI 1943 LESL GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+ Sbjct: 424 LESLQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATL 483 Query: 1942 FGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTG 1763 FGDNED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTG Sbjct: 484 FGDNEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTG 543 Query: 1762 KTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKS 1583 KTG+LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKS Sbjct: 544 KTGLLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKS 603 Query: 1582 LVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIR 1403 L EIVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++ Sbjct: 604 LAEIVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVK 663 Query: 1402 EILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQC 1223 EILS+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ Sbjct: 664 EILSTAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQF 723 Query: 1222 QLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLK 1043 QLAPVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L Sbjct: 724 QLAPVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLT 783 Query: 1042 SSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEA 863 SS +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEA Sbjct: 784 
SSPTVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEA 843 Query: 862 DIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREAD 683 DIVVQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREAD Sbjct: 844 DIVVQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREAD 903 Query: 682 AVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYF 503 AV+ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYF Sbjct: 904 AVIISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYF 963 Query: 502 GRVKHAEPGGSGGYGLSMNPMLPS 431 G+VKH EPG +GL M+PMLP+ Sbjct: 964 GKVKHVEPGSFWEFGLGMDPMLPT 987 >gb|PHU23400.1| hypothetical protein BC332_08507 [Capsicum chinense] Length = 989 Score = 1409 bits (3648), Expect = 0.0 Identities = 721/984 (73%), Positives = 820/984 (83%), Gaps = 18/984 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXX 3164 K KMEASC FCG + S L Q + S S++L S KNR FL S S R Sbjct: 4 KLLKMEASCNFCGSLVPSCLTRQKRSNLSSFIGSVALSSIKNRTFLDSISLTSSIRATAS 63 Query: 3163 XXXXXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVN 3023 ++ G G +V N+ ++ KA + R QQQ Sbjct: 64 SSGGTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQEC 123 Query: 3022 DEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMG 2843 ++ GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM Sbjct: 124 IQEGGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRME 183 Query: 2842 PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNW 2663 PGLTFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +W Sbjct: 184 PGLTFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDW 243 Query: 2662 RETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLL 2483 R+TESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL Sbjct: 244 RDTESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLL 303 Query: 2482 RIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGL 2303 IERDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GL Sbjct: 304 
HIERDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGL 363 Query: 2302 GGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVA 2123 GGMHLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+A Sbjct: 364 GGMHLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLA 423 Query: 2122 LESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTI 1943 LESL GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+ Sbjct: 424 LESLQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATL 483 Query: 1942 FGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTG 1763 FGDNED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTG Sbjct: 484 FGDNEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTG 543 Query: 1762 KTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKS 1583 KTG+LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKS Sbjct: 544 KTGLLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKS 603 Query: 1582 LVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIR 1403 L EIVN +L+DF +E ERKKSDLRKDL CL+DDSLAAGIRQLLKQLGK++KKKE+ET++ Sbjct: 604 LAEIVNNKLSDFLAEIERKKSDLRKDLRCCLKDDSLAAGIRQLLKQLGKSIKKKEKETVK 663 Query: 1402 EILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQC 1223 EILS+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ Sbjct: 664 EILSTAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQF 723 Query: 1222 QLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLK 1043 QLAPVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L Sbjct: 724 QLAPVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLT 783 Query: 1042 SSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEA 863 SS +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEA Sbjct: 784 SSPTVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEA 843 Query: 862 DIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREAD 683 DIVVQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREAD Sbjct: 844 DIVVQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREAD 903 Query: 682 
AVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYF 503 AV+ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYF Sbjct: 904 AVIISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYF 963 Query: 502 GRVKHAEPGGSGGYGLSMNPMLPS 431 G+VKH EPG +GL M+PMLP+ Sbjct: 964 GKVKHVEPGSFWEFGLGMDPMLPT 987 >gb|PHT87733.1| hypothetical protein T459_09839 [Capsicum annuum] Length = 989 Score = 1409 bits (3648), Expect = 0.0 Identities = 720/984 (73%), Positives = 820/984 (83%), Gaps = 18/984 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSASILKSQGIRHRPS--ESISLYSNKNRLFLSS---PISHRVWXX 3164 K KMEASC FCG + S L Q + S ++L S KNR FL S S R Sbjct: 4 KLLKMEASCNFCGSLVPSCLTRQKRSNLSSFIGPVALSSIKNRTFLDSISLTSSIRATAS 63 Query: 3163 XXXXXXXXXXXXXXRED-----GRGADVSNNN--------TNNKAAVSEEKTRMKQQQVN 3023 ++ G G +V N+ ++ KA + R QQQ Sbjct: 64 SSGGTKAVTTRRRKPKNVGTTGGSGKNVKNSEIPAVTTKGSSGKAIEKVQVKRKNQQQEC 123 Query: 3022 DEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMG 2843 ++ GP VRAL+QNGDPLGR+DLGK VV+W+ +GM+AMALDFA AE QG+FA+LKQRM Sbjct: 124 IQEGGPVDVRALHQNGDPLGRKDLGKCVVRWLSQGMRAMALDFATAEMQGEFAELKQRME 183 Query: 2842 PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNW 2663 PGLTFVIQAQPYLNAVPMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+ + +W Sbjct: 184 PGLTFVIQAQPYLNAVPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSSVQDW 243 Query: 2662 RETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLL 2483 R+TESWKLLK+LA+SAQH+AIARK S KSV GV+G++++KAKAIQ RID+FT MSDLL Sbjct: 244 RDTESWKLLKDLASSAQHKAIARKGSQPKSVPGVMGMDLEKAKAIQSRIDDFTNRMSDLL 303 Query: 2482 RIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGL 2303 IERDAELEFTQEELNAVP PD +S + KP EFLVSHAQ EQELCDTICNL A+STS GL Sbjct: 304 HIERDAELEFTQEELNAVPAPDVNSEAQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGL 363 Query: 2302 GGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVA 2123 GGMHLVLF++EGNHRLPP NLSPGDMVCVRICDSRGAGATSCMQGFV+NLG+DGCSIS+A Sbjct: 364 GGMHLVLFKLEGNHRLPPANLSPGDMVCVRICDSRGAGATSCMQGFVHNLGEDGCSISLA 423 Query: 2122 
LESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTI 1943 LESL GD TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+ Sbjct: 424 LESLQGDTTFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFRKKNPSVAVVATL 483 Query: 1942 FGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTG 1763 FGDNED+ W E+N+M DWAE EL + + +D SQ++AIALGLNK RP++IIQGPPGTG Sbjct: 484 FGDNEDLKWLEENDMADWAEVELPDSTNKKSFDASQRKAIALGLNKNRPIMIIQGPPGTG 543 Query: 1762 KTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKS 1583 KTG+LK+LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARIS +VASKS Sbjct: 544 KTGLLKELISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGINIVRVGNPARISSSVASKS 603 Query: 1582 LVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIR 1403 L EIVN +L+DF +E ERKKSDLRKDL +CL+DDSLAAGIRQLLKQLGK++KKKE+ET++ Sbjct: 604 LAEIVNNKLSDFLAEIERKKSDLRKDLRYCLKDDSLAAGIRQLLKQLGKSIKKKEKETVK 663 Query: 1402 EILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQC 1223 EILS+AHVVLATNIGAADP+IR L++FDLV+IDEAGQAIEPS WIPILLGKRCILAGDQ Sbjct: 664 EILSTAHVVLATNIGAADPLIRRLDAFDLVIIDEAGQAIEPSSWIPILLGKRCILAGDQF 723 Query: 1222 QLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLK 1043 QLAPVILSRKALEGGLGVS LERA+TLH+G+L+TKLTTQYRMNDAIASWASKEMY G L Sbjct: 724 QLAPVILSRKALEGGLGVSLLERAATLHDGMLSTKLTTQYRMNDAIASWASKEMYGGSLT 783 Query: 1042 SSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEA 863 SS +V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSFYNEGEA Sbjct: 784 SSPTVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEA 843 Query: 862 DIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREAD 683 DIVVQHVF+LIYAGV P+ I VQSPYV+QVQLLRD+++E P++TGV+VATIDSFQGREAD Sbjct: 844 DIVVQHVFSLIYAGVPPAAIAVQSPYVAQVQLLRDKIDEIPMATGVDVATIDSFQGREAD 903 Query: 682 AVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYF 503 AV+ISMVRSNNLGAVGFLGD+RRMNVAITRA KHVA++CDSSTICHNT+LARLLRHIRYF Sbjct: 904 AVIISMVRSNNLGAVGFLGDNRRMNVAITRASKHVAVVCDSSTICHNTYLARLLRHIRYF 963 Query: 502 GRVKHAEPGGSGGYGLSMNPMLPS 431 G+VKH EPG +GL M+PMLP+ Sbjct: 964 
GKVKHVEPGSFWEFGLGMDPMLPT 987 >ref|XP_009771939.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana sylvestris] Length = 980 Score = 1401 bits (3626), Expect = 0.0 Identities = 709/979 (72%), Positives = 820/979 (83%), Gaps = 11/979 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS--- 3182 K KME+ C CG +S S L + + R + S++L + KNR+FL S IS Sbjct: 4 KLLKMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPN 63 Query: 3181 HRVWXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPT 3002 + + ++ + +D+ + T EK + Q+ D GP Sbjct: 64 YNIQASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPV 121 Query: 3001 SVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVI 2822 +VRAL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVI Sbjct: 122 NVRALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVI 181 Query: 2821 QAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWK 2642 QAQPYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWK Sbjct: 182 QAQPYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWK 241 Query: 2641 LLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAE 2462 LLK+LA SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+E Sbjct: 242 LLKDLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSE 301 Query: 2461 LEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVL 2282 LEFTQEELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVL Sbjct: 302 LEFTQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVL 361 Query: 2281 FRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGD 2102 F++EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD Sbjct: 362 FKLEGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGD 421 Query: 2101 PTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDI 1922 TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+ Sbjct: 422 STFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDL 481 Query: 1921 AWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQ 1742 AW E+N M DW+E EL D 
+ +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+ Sbjct: 482 AWLEENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKE 541 Query: 1741 LISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNG 1562 LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN Sbjct: 542 LISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNT 601 Query: 1561 RLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAH 1382 LADFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA Sbjct: 602 ELADFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQ 661 Query: 1381 VVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVIL 1202 VVLATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVIL Sbjct: 662 VVLATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVIL 721 Query: 1201 SRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMS 1022 SRKALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V S Sbjct: 722 SRKALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVAS 781 Query: 1021 HLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHV 842 HLL DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHV Sbjct: 782 HLLVDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHV 841 Query: 841 FALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMV 662 F+LIY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMV Sbjct: 842 FSLIYSGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMV 901 Query: 661 RSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAE 482 RSNNLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH E Sbjct: 902 RSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVE 961 Query: 481 PGGSGGYGLSMNPMLPSVS 425 PG +GL M+PMLP+ S Sbjct: 962 PGSFWEFGLGMDPMLPTAS 980 >ref|XP_016474118.1| PREDICTED: DNA-binding protein SMUBP-2-like [Nicotiana tabacum] Length = 980 Score = 1400 bits (3625), Expect = 0.0 Identities = 709/979 (72%), Positives = 820/979 (83%), Gaps = 11/979 (1%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPIS--- 3182 
K KME+ C CG +S S L + + R + S++L + KNR+FL S IS Sbjct: 4 KLLKMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPN 63 Query: 3181 HRVWXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKDGPT 3002 + + ++ + +D+ + T EK + Q+ D GP Sbjct: 64 YNIQASSSSGTKSLSPRRRKPKNVKTSDIPSVTTKGSLGKKTEKNQECSQEERDS--GPV 121 Query: 3001 SVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVI 2822 +VRAL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVI Sbjct: 122 NVRALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVI 181 Query: 2821 QAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWK 2642 QAQPYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL +LQ K+L+ +WR+TESWK Sbjct: 182 QAQPYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQNLQRKSLVQDWRDTESWK 241 Query: 2641 LLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAE 2462 LLK+LA SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+E Sbjct: 242 LLKDLAISAQHKAIARKTSQPKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSE 301 Query: 2461 LEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVL 2282 LEFTQEELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVL Sbjct: 302 LEFTQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVL 361 Query: 2281 FRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGD 2102 F++EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD Sbjct: 362 FKLEGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGD 421 Query: 2101 PTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDI 1922 TFSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+ Sbjct: 422 STFSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFQKKNPSVAVVATLFGDKEDL 481 Query: 1921 AWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQ 1742 AW E+N M DW+E EL D + +DTSQ++AIALGLNK RP++IIQGPPGTGKTG+LK+ Sbjct: 482 AWLEENGMADWSEVELPDSTDRKSFDTSQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKE 541 Query: 1741 LISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNG 1562 LIS+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN Sbjct: 542 LISLAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLTEIVNT 
601 Query: 1561 RLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAH 1382 LADFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA Sbjct: 602 ELADFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQ 661 Query: 1381 VVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVIL 1202 VVLATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVIL Sbjct: 662 VVLATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVIL 721 Query: 1201 SRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMS 1022 SRKALEGGLGVS LERA++LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V S Sbjct: 722 SRKALEGGLGVSLLERAASLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVAS 781 Query: 1021 HLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHV 842 HLL DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHV Sbjct: 782 HLLVDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHV 841 Query: 841 FALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMV 662 F+LIY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMV Sbjct: 842 FSLIYSGVPPAAIAVQSPYVAQVQLLRDKVDELPMATGVEVATIDSFQGREADAVIISMV 901 Query: 661 RSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAE 482 RSNNLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH E Sbjct: 902 RSNNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVE 961 Query: 481 PGGSGGYGLSMNPMLPSVS 425 PG +GL M+PMLP+ S Sbjct: 962 PGSFWEFGLGMDPMLPTAS 980 >ref|XP_019184191.1| PREDICTED: DNA-binding protein SMUBP-2 [Ipomoea nil] Length = 993 Score = 1399 bits (3622), Expect = 0.0 Identities = 714/994 (71%), Positives = 815/994 (81%), Gaps = 30/994 (3%) Frame = -1 Query: 3316 MEASCIFCGGVSASILKSQGIRHRPSESISLYSNKN-------------RLFLSSPISHR 3176 MEASC+FCGG S S L + R R S S +++ + +SP+ H Sbjct: 1 MEASCVFCGGAS-SFLGIRVRRQRDSLHSSFFASVTPFGGNSSFSRGGGSILFASPLPHC 59 Query: 3175 VWXXXXXXXXXXXXXXXXREDGRGA-----------DVSNNNTNNKAAVS---EEKTRMK 3038 + + R + + S N N+ + S E + R K Sbjct: 60 RFQVANSNGGGTKAVRTAKRKSRKSGGSSGPGPGPVETSQNLKNSPVSSSVEFERQGRRK 119 Query: 3037 
QQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGD--FA 2864 + P +V ALYQ+GDPLGRRDLGK VV WI +GMKAMA+DFA AE QG+ F+ Sbjct: 120 PALTRKNTNTPANVAALYQSGDPLGRRDLGKCVVTWISQGMKAMAIDFATAEVQGEGEFS 179 Query: 2863 DLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQH 2684 +L+Q+MGPGLTFVIQAQPYLNAVPMPLG+EAICLKTCTHYPTLFDHFQRELRDVL DLQ Sbjct: 180 ELRQQMGPGLTFVIQAQPYLNAVPMPLGLEAICLKTCTHYPTLFDHFQRELRDVLKDLQS 239 Query: 2683 KTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFT 2504 K+L+ +WRETESWKLLKELA SAQH+AIARK S K + GVLG++IDKAKAIQ RID+FT Sbjct: 240 KSLVQDWRETESWKLLKELACSAQHKAIARKISEPKPIQGVLGMDIDKAKAIQSRIDDFT 299 Query: 2503 KHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSP-KPTEFLVSHAQSEQELCDTICNLN 2327 + MS LLRIERDAELEFTQEELNAVPTP E ++ P KP EFLVSHAQ EQELCDTICNL+ Sbjct: 300 EQMSALLRIERDAELEFTQEELNAVPTPAEENSKPSKPIEFLVSHAQPEQELCDTICNLH 359 Query: 2326 AISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGD 2147 A+STSTGLGGMHLVLF+V+GNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFVNNLG+ Sbjct: 360 AVSTSTGLGGMHLVLFKVDGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVNNLGE 419 Query: 2146 DGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNS 1967 DGCSI++ALESL GDPTFSKLFGKN+RIDRIQGLAD LTYERNCEA KN Sbjct: 420 DGCSITLALESLRGDPTFSKLFGKNVRIDRIQGLADTLTYERNCEALMMLKKKGLQKKNP 479 Query: 1966 SIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLI 1787 SIAVV T+FGD ED+AW E N++ DWA EL+ +D++ YD SQ+RAIALGLNK+RP+LI Sbjct: 480 SIAVVATLFGDQEDVAWLEKNDLADWAGVELDASIDSKGYDISQRRAIALGLNKRRPILI 539 Query: 1786 IQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARI 1607 +QGPPGTGKTG+LK+LIS+AV+QGERVL+TAPTNAAVDNMVEKLSD+ NIVR GNPARI Sbjct: 540 VQGPPGTGKTGLLKELISLAVQQGERVLITAPTNAAVDNMVEKLSDVAINIVRFGNPARI 599 Query: 1606 SPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMK 1427 SP V+SKSL EIVN +LA+FR+E RKK+DLRKDL HCL DDSLAAGIRQLLKQLGK++K Sbjct: 600 SPVVSSKSLTEIVNTKLAEFRAELHRKKTDLRKDLRHCLNDDSLAAGIRQLLKQLGKSLK 659 Query: 1426 KKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKR 1247 KKE+ET+RE+LSSA 
VVLATNIGAADP+IR L++FDLV+IDEA QAIEPS WIPIL GKR Sbjct: 660 KKEKETVREVLSSAQVVLATNIGAADPLIRQLDTFDLVIIDEAAQAIEPSSWIPILRGKR 719 Query: 1246 CILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASK 1067 CILAGDQ QLAPVILSRKALEGGLG+S LERA++LHEG+L+TKLTTQYRMNDAIASWASK Sbjct: 720 CILAGDQFQLAPVILSRKALEGGLGISLLERAASLHEGMLSTKLTTQYRMNDAIASWASK 779 Query: 1066 EMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTG 887 EMY G LKS V SHLL DSP VK TWIT+CPLLLLDTRMP+GSLS GCEE LDPAGTG Sbjct: 780 EMYGGSLKSFPQVASHLLVDSPFVKPTWITRCPLLLLDTRMPYGSLSTGCEEHLDPAGTG 839 Query: 886 SFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATID 707 SFYNEGEADIVV+HV +L+Y+GV P I VQSPYV+QVQLLRDRL+E P++TGVEVATID Sbjct: 840 SFYNEGEADIVVKHVLSLVYSGVSPVAIAVQSPYVAQVQLLRDRLDEIPVTTGVEVATID 899 Query: 706 SFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLAR 527 SFQGREADAV+ISMVRSNN+GAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNTFLAR Sbjct: 900 SFQGREADAVIISMVRSNNMGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLAR 959 Query: 526 LLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 LLRHIRYFG VK+AEPG GG+GL M+PMLP+ + Sbjct: 960 LLRHIRYFGHVKNAEPGSFGGFGLGMDPMLPTAN 993 >ref|XP_019259161.1| PREDICTED: DNA-binding protein SMUBP-2 [Nicotiana attenuata] gb|OIT40020.1| regulator of nonsense transcripts 1-like protein [Nicotiana attenuata] Length = 980 Score = 1399 bits (3621), Expect = 0.0 Identities = 710/977 (72%), Positives = 817/977 (83%), Gaps = 9/977 (0%) Frame = -1 Query: 3328 KRSKMEASCIFCGGVSA---SILKSQGIRHRPS-----ESISLYSNKNRLFLSSPISHRV 3173 K KME+ C CG +S S L + + R + S++L + KNR+FL S IS Sbjct: 4 KLLKMESLCNSCGSISTLAPSCLTLRFYKKRSNLSSFFGSVTLSNPKNRIFLDSSISFPN 63 Query: 3172 WXXXXXXXXXXXXXXXXREDGRGADVSNNNTNNKAAVSEEKTRMKQQQVNDEKD-GPTSV 2996 + R + S +KT Q+ +E+D GP +V Sbjct: 64 YNIQASSSSGTKSLSPRRRKPKNVKTSQIPAVTTKGSVVKKTEKIQECSQEERDSGPVNV 123 Query: 2995 RALYQNGDPLGRRDLGKGVVKWIGKGMKAMALDFALAETQGDFADLKQRMGPGLTFVIQA 2816 RAL +NGDP+GR+DLGK VV+WI +GMKAMA DFA AE QG+F ++KQRM PGLTFVIQA Sbjct: 124 RALNENGDPMGRKDLGKCVVRWISQGMKAMATDFATAEMQGEFTEVKQRMEPGLTFVIQA 
183 Query: 2815 QPYLNAVPMPLGMEAICLKTCTHYPTLFDHFQRELRDVLLDLQHKTLIHNWRETESWKLL 2636 QPYLNA+PMPLG+EAICLK CTHYPTLFD+FQRELRDVL DLQ K+L+ +WR+TESWKLL Sbjct: 184 QPYLNAIPMPLGLEAICLKACTHYPTLFDNFQRELRDVLQDLQRKSLVQDWRDTESWKLL 243 Query: 2635 KELATSAQHRAIARKTSLSKSVHGVLGLNIDKAKAIQCRIDEFTKHMSDLLRIERDAELE 2456 K+LA+SAQH+AIARKTS K V GV+G++++KAKA+Q RID+FT MSDLLRIERD+ELE Sbjct: 244 KDLASSAQHKAIARKTSQRKFVPGVMGMDLEKAKAMQSRIDDFTNRMSDLLRIERDSELE 303 Query: 2455 FTQEELNAVPTPDEHSTSPKPTEFLVSHAQSEQELCDTICNLNAISTSTGLGGMHLVLFR 2276 FTQEELNAVP P +S KP EFLVSHAQ EQELCDTICNL A+STS GLGGMHLVLF+ Sbjct: 304 FTQEELNAVPAPVLNSEEQKPFEFLVSHAQPEQELCDTICNLTAVSTSIGLGGMHLVLFK 363 Query: 2275 VEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGDDGCSISVALESLHGDPT 2096 +EGNHRLPPTNLSPGDMVCVR CDSRGAGATSCMQGFV+NLG+DG SIS+ALESLHGD T Sbjct: 364 LEGNHRLPPTNLSPGDMVCVRTCDSRGAGATSCMQGFVHNLGEDGRSISLALESLHGDST 423 Query: 2095 FSKLFGKNIRIDRIQGLADALTYERNCEAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAW 1916 FSKLFGKN+RIDRIQGLADALTYERNCEA KN S+AVV T+FGD ED+AW Sbjct: 424 FSKLFGKNVRIDRIQGLADALTYERNCEALMMLQKKGFLKKNPSVAVVATLFGDKEDLAW 483 Query: 1915 FEDNNMVDWAEAELNGLLDTEFYDTSQQRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLI 1736 E+N M DW+E EL D + +D SQ++AIALGLNK RP++IIQGPPGTGKTG+LK+LI Sbjct: 484 LEENGMADWSEVELPDSTDRKSFDASQRKAIALGLNKNRPIMIIQGPPGTGKTGMLKELI 543 Query: 1735 SIAVKQGERVLVTAPTNAAVDNMVEKLSDIGANIVRVGNPARISPAVASKSLVEIVNGRL 1556 S+AVKQGERVLVTAPTNAAVDNMVEKLSDIG NIVRVGNPARISPAVASKSL EIVN +L Sbjct: 544 SLAVKQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPAVASKSLAEIVNTKL 603 Query: 1555 ADFRSEFERKKSDLRKDLSHCLRDDSLAAGIRQLLKQLGKTMKKKERETIREILSSAHVV 1376 ADFR+E ERKKSDLR+DL +CL+DDSLAAGIRQLLKQLGK++K++E+ET++EILSSA VV Sbjct: 604 ADFRAEIERKKSDLRRDLRYCLKDDSLAAGIRQLLKQLGKSIKREEKETVKEILSSAQVV 663 Query: 1375 LATNIGAADPMIRWLNSFDLVVIDEAGQAIEPSCWIPILLGKRCILAGDQCQLAPVILSR 1196 LATNIGAADP+IR L++FDLV+IDEAGQAIEPSCWIPILLGKRCILAGDQ QLAPVILSR Sbjct: 664 LATNIGAADPLIRRLDTFDLVIIDEAGQAIEPSCWIPILLGKRCILAGDQFQLAPVILSR 723 Query: 1195 KALEGGLGVSFLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYNGLLKSSASVMSHL 1016 
KALEGGLGVS LERA+ LH+G+L+TKLTTQYRMN+AIASWASKEMY+G L SS +V SHL Sbjct: 724 KALEGGLGVSLLERAAGLHDGMLSTKLTTQYRMNNAIASWASKEMYDGSLISSPTVASHL 783 Query: 1015 LSDSPLVKSTWITQCPLLLLDTRMPFGSLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFA 836 L DSP VK TW+TQCPLLLLDTRMP+GSLSVGCEE LDPAGTGSF+NEGEADIVVQHVF+ Sbjct: 784 LVDSPFVKPTWVTQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFFNEGEADIVVQHVFS 843 Query: 835 LIYAGVRPSTIVVQSPYVSQVQLLRDRLEEFPLSTGVEVATIDSFQGREADAVVISMVRS 656 LIY+GV P+ I VQSPYV+QVQLLRD+++E P++TGVEVATIDSFQGREADAV+ISMVRS Sbjct: 844 LIYSGVPPAAIAVQSPYVAQVQLLRDKIDELPMATGVEVATIDSFQGREADAVIISMVRS 903 Query: 655 NNLGAVGFLGDSRRMNVAITRARKHVAIICDSSTICHNTFLARLLRHIRYFGRVKHAEPG 476 NNLGAVGFLGDSRRMNVAITRARKHVA++CDSSTICHNT+LARLLRHIRYFG+VKH EPG Sbjct: 904 NNLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTYLARLLRHIRYFGKVKHVEPG 963 Query: 475 GSGGYGLSMNPMLPSVS 425 +GL M+PMLP+ S Sbjct: 964 SFWEFGLGMDPMLPTAS 980 >ref|XP_012492340.1| PREDICTED: DNA-binding protein SMUBP-2 [Gossypium raimondii] gb|KJB44363.1| hypothetical protein B456_007G248100 [Gossypium raimondii] Length = 1003 Score = 1399 bits (3621), Expect = 0.0 Identities = 699/886 (78%), Positives = 778/886 (87%) Frame = -1 Query: 3082 TNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKAMA 2903 T V E KQ++ +K +VR LYQNGDPLGRRDLGK VV WI +GMKAMA Sbjct: 118 TRTNILVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVWWISEGMKAMA 177 Query: 2902 LDFALAETQGDFADLKQRMGPGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTLFDHF 2723 DFA AE QG+F +L+QRMGPGLTFVIQAQPYLN+VPMPLG+EAICLK CTHYPTLFDHF Sbjct: 178 SDFASAELQGEFLELRQRMGPGLTFVIQAQPYLNSVPMPLGLEAICLKACTHYPTLFDHF 237 Query: 2722 QRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLGLNID 2543 QRELR+VL +LQ +++ +W+ETESWKLLKELA SAQHRAIARK + K V GVLG++++ Sbjct: 238 QRELRNVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLE 297 Query: 2542 KAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVSHAQS 2363 KAKA+Q RIDEFTK MS+LLRIERDAELEFTQEEL+AVPT DE S S KP EFLVSH Q+ Sbjct: 298 KAKAMQGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQA 357 Query: 2362 
EQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGAT 2183 +QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPT LSPGDMVCVRI DSRGAGAT Sbjct: 358 QQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGAT 417 Query: 2182 SCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNCEAXX 2003 SC+QGFV+NLGDDGCSISVALES HGDPTFSKLFGK++RIDRI GLADALTYERNCEA Sbjct: 418 SCIQGFVDNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALM 477 Query: 2002 XXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQQRAI 1823 KN SIAVV T+F D ED+ W E+N++ DW+ AEL+GLL +D SQQRAI Sbjct: 478 LLQKNGLQKKNPSIAVVATLFADKEDVEWLEENDLADWSPAELDGLLQNGTFDDSQQRAI 537 Query: 1822 ALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKLSDIG 1643 ALGLNKKRPV+++QGPPGTGKTG+LK++I++A +QGERVLVTAPTNAAVDN+VEKLS+ G Sbjct: 538 ALGLNKKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTG 597 Query: 1642 ANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSLAAGI 1463 NIVRVGNPARIS AVASKSLVEIVN +LAD+R+EFERKKSDLRKDL HCL+DDSLAAGI Sbjct: 598 LNIVRVGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGI 657 Query: 1462 RQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAGQAIE 1283 RQLLKQLGK +KKKE+ET+RE+LS+A VVL+TN GAADP+IR L++FDLVVIDEAGQAIE Sbjct: 658 RQLLKQLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIE 717 Query: 1282 PSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKLTTQY 1103 PSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLG+S LERA+TLHEGVLAT L TQY Sbjct: 718 PSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGVLATMLATQY 777 Query: 1102 RMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFGSLSV 923 RMNDAIASWASKEMY+G LKSS V SHLL DSP VK TWITQCPLLLLDTRMP+GSLSV Sbjct: 778 RMNDAIASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSV 837 Query: 922 GCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDRLEEF 743 GCEE LD AGTGSF+NEGEADIVVQHV LIYAGV P+ I VQSPYV+QVQLLRDRL+EF Sbjct: 838 GCEEHLDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEF 897 Query: 742 PLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVAIICD 563 P + 
G+EVATIDSFQGREADAV+ISMVRSN LGAVGFLGDSRRMNVAITRARKHVA++CD Sbjct: 898 PEADGIEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCD 957 Query: 562 SSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 SSTICHNTFLARLLRHIRY GRVKHAEPG SGG GL M+PMLPS+S Sbjct: 958 SSTICHNTFLARLLRHIRYVGRVKHAEPGASGGSGLGMDPMLPSIS 1003 >ref|XP_002524012.1| PREDICTED: DNA-binding protein SMUBP-2 [Ricinus communis] gb|EEF38380.1| DNA-binding protein smubp-2, putative [Ricinus communis] Length = 989 Score = 1399 bits (3621), Expect = 0.0 Identities = 700/890 (78%), Positives = 781/890 (87%), Gaps = 2/890 (0%) Frame = -1 Query: 3088 NNTNNKAAVSEEKTRMKQQQVNDEKDGPTSVRALYQNGDPLGRRDLGKGVVKWIGKGMKA 2909 N K AVSEE+ + +VN V++L+QNGDPLG++DLGK VVKWI +GM+A Sbjct: 108 NTDGGKLAVSEEREEKVKMKVN--------VKSLHQNGDPLGKKDLGKTVVKWISQGMRA 159 Query: 2908 MALDFALAETQGDFADLKQRMG--PGLTFVIQAQPYLNAVPMPLGMEAICLKTCTHYPTL 2735 MA DFA AETQG+F +L+QRM GLTFVIQAQPY+NAVP+PLG EA+CLK C HYPTL Sbjct: 160 MAADFASAETQGEFLELRQRMDLEAGLTFVIQAQPYINAVPIPLGFEALCLKACIHYPTL 219 Query: 2734 FDHFQRELRDVLLDLQHKTLIHNWRETESWKLLKELATSAQHRAIARKTSLSKSVHGVLG 2555 FDHFQRELRDVL DLQ K L+ +W+ TESWKLLKELA S QHRA+ARK S K + GVLG Sbjct: 220 FDHFQRELRDVLQDLQRKGLVQDWQNTESWKLLKELANSVQHRAVARKVSKPKPLQGVLG 279 Query: 2554 LNIDKAKAIQCRIDEFTKHMSDLLRIERDAELEFTQEELNAVPTPDEHSTSPKPTEFLVS 2375 +N+DKAKAIQ RIDEFTK MS+LL+IERD+ELEFTQEELNAVPTPDE+S KP EFLVS Sbjct: 280 MNLDKAKAIQSRIDEFTKTMSELLQIERDSELEFTQEELNAVPTPDENSDPSKPIEFLVS 339 Query: 2374 HAQSEQELCDTICNLNAISTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 2195 H Q++QELCDTICNLNA+STSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG Sbjct: 340 HGQAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTNLSPGDMVCVRICDSRG 399 Query: 2194 AGATSCMQGFVNNLGDDGCSISVALESLHGDPTFSKLFGKNIRIDRIQGLADALTYERNC 2015 AGATSCMQGFVNNLG+DGCSISVALES HGDPTFSKLFGK +RIDRI GLADALTYERNC Sbjct: 400 AGATSCMQGFVNNLGEDGCSISVALESRHGDPTFSKLFGKGVRIDRIHGLADALTYERNC 459 Query: 2014 EAXXXXXXXXXXXKNSSIAVVTTIFGDNEDIAWFEDNNMVDWAEAELNGLLDTEFYDTSQ 1835 EA KN SIA+V T+FGD+ED+AW E+ ++ +W EA+++G +E +D SQ Sbjct: 
460 EALMLLQKNGLQKKNPSIAIVATLFGDSEDLAWLEEKDLAEWNEADMDGCFGSERFDDSQ 519 Query: 1834 QRAIALGLNKKRPVLIIQGPPGTGKTGVLKQLISIAVKQGERVLVTAPTNAAVDNMVEKL 1655 +RA+ALGLN+KRP+LIIQGPPGTGK+G+LK+LI AV QGERVLVTAPTNAAVDNMVEKL Sbjct: 520 RRAMALGLNQKRPLLIIQGPPGTGKSGLLKELIVRAVHQGERVLVTAPTNAAVDNMVEKL 579 Query: 1654 SDIGANIVRVGNPARISPAVASKSLVEIVNGRLADFRSEFERKKSDLRKDLSHCLRDDSL 1475 S+IG +IVRVGNPARIS AVASKSL EIVN +LA FR EFERKKSDLRKDL HCL DDSL Sbjct: 580 SNIGLDIVRVGNPARISSAVASKSLSEIVNSKLATFRMEFERKKSDLRKDLRHCLEDDSL 639 Query: 1474 AAGIRQLLKQLGKTMKKKERETIREILSSAHVVLATNIGAADPMIRWLNSFDLVVIDEAG 1295 AAGIRQLLKQLGKTMKKKE+E+++E+LSSA VVLATN GAADP+IR L++FDLVVIDEAG Sbjct: 640 AAGIRQLLKQLGKTMKKKEKESVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAG 699 Query: 1294 QAIEPSCWIPILLGKRCILAGDQCQLAPVILSRKALEGGLGVSFLERASTLHEGVLATKL 1115 QAIEPSCWIPIL GKRCILAGDQCQLAPVILSRKALEGGLGVS LERA+TLH+GVLA +L Sbjct: 700 QAIEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHDGVLALQL 759 Query: 1114 TTQYRMNDAIASWASKEMYNGLLKSSASVMSHLLSDSPLVKSTWITQCPLLLLDTRMPFG 935 TTQYRMNDAIASWASKEMY GLLKSS+ V SHLL SP VK TWITQCPLLLLDTRMP+G Sbjct: 760 TTQYRMNDAIASWASKEMYGGLLKSSSKVASHLLVHSPFVKPTWITQCPLLLLDTRMPYG 819 Query: 934 SLSVGCEEQLDPAGTGSFYNEGEADIVVQHVFALIYAGVRPSTIVVQSPYVSQVQLLRDR 755 SL +GCEE LDPAGTGSFYNEGEA+IVVQHV +LIYAGVRP+TI VQSPYV+QVQLLRDR Sbjct: 820 SLFIGCEEHLDPAGTGSFYNEGEAEIVVQHVISLIYAGVRPTTIAVQSPYVAQVQLLRDR 879 Query: 754 LEEFPLSTGVEVATIDSFQGREADAVVISMVRSNNLGAVGFLGDSRRMNVAITRARKHVA 575 L+E P + GVEVATIDSFQGREADAV+ISMVRSNNLGAVGFLGDSRRMNVAITRAR+HVA Sbjct: 880 LDELPEADGVEVATIDSFQGREADAVIISMVRSNNLGAVGFLGDSRRMNVAITRARRHVA 939 Query: 574 IICDSSTICHNTFLARLLRHIRYFGRVKHAEPGGSGGYGLSMNPMLPSVS 425 ++CDSSTICHNTFLARLLRHIRYFGRVKHAEPG GG GL M+PMLPS+S Sbjct: 940 VVCDSSTICHNTFLARLLRHIRYFGRVKHAEPGSFGGSGLGMDPMLPSIS 989