BLASTX nr result
ID: Phellodendron21_contig00001294
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00001294 (3586 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_006437411.1 hypothetical protein CICLE_v10030616mg [Citrus cl...  1642   0.0
XP_006484692.1 PREDICTED: DNA-binding protein SMUBP-2 [Citrus si...  1642   0.0
EOY10295.1 P-loop containing nucleoside triphosphate hydrolases ...  1473   0.0
XP_017977299.1 PREDICTED: DNA-binding protein SMUBP-2 [Theobroma...  1471   0.0
OMO99192.1 putative DNA-binding protein smubp-2 [Corchorus capsu...  1453   0.0
OMO56477.1 hypothetical protein COLO4_35630 [Corchorus olitorius]    1449   0.0
XP_016697684.1 PREDICTED: DNA-binding protein SMUBP-2-like [Goss...  1438   0.0
XP_016671666.1 PREDICTED: DNA-binding protein SMUBP-2-like [Goss...  1437   0.0
XP_012492340.1 PREDICTED: DNA-binding protein SMUBP-2 [Gossypium...  1437   0.0
KHG05926.1 DNA-binding SMUBP-2 [Gossypium arboreum]                  1436   0.0
OAY44532.1 hypothetical protein MANES_08G158300 [Manihot esculenta]  1431   0.0
XP_017627332.1 PREDICTED: DNA-binding protein SMUBP-2 [Gossypium...  1423   0.0
XP_011009226.1 PREDICTED: DNA-binding protein SMUBP-2 isoform X1...  1419   0.0
XP_012070287.1 PREDICTED: DNA-binding protein SMUBP-2 [Jatropha ...  1416   0.0
XP_002319231.2 hypothetical protein POPTR_0013s07150g [Populus t...  1415   0.0
XP_002264216.1 PREDICTED: DNA-binding protein SMUBP-2 [Vitis vin...  1409   0.0
XP_002524012.1 PREDICTED: DNA-binding protein SMUBP-2 [Ricinus c...  1405   0.0
XP_018828127.1 PREDICTED: DNA-binding protein SMUBP-2 [Juglans r...  1402   0.0
XP_010063606.1 PREDICTED: DNA-binding protein SMUBP-2 [Eucalyptu...
1394 0.0 GAV70650.1 AAA_11 domain-containing protein/AAA_12 domain-contai... 1384 0.0 >XP_006437411.1 hypothetical protein CICLE_v10030616mg [Citrus clementina] ESR50651.1 hypothetical protein CICLE_v10030616mg [Citrus clementina] Length = 1010 Score = 1642 bits (4253), Expect = 0.0 Identities = 850/995 (85%), Positives = 878/995 (88%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCGS VPVT RK+LALNVRRFNSSV H APL+FS CSS RS+CLFIGYKSSS F F+QP Sbjct: 16 FCGSRSVPVTTRKTLALNVRRFNSSVWHPAPLKFSVCSSVRSICLFIGYKSSSSFEFFQP 75 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXXPRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSRVXXX 595 QQFV Y PRRKSS FSKS+IQ NVS + Sbjct: 76 QQFVPYNSSSSSSSTKSSTTFKKKPRRKSSGFSKSKIQRTKTLSGPNSSTKANVSSLVEK 135 Query: 596 XXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQG 775 NV+ALSQNG+PLGRRELGK VVRWICQGMRAMASDFASAE+QG Sbjct: 136 SSGEKQQEQPKKSDNAVNVQALSQNGNPLGRRELGKGVVRWICQGMRAMASDFASAEIQG 195 Query: 776 EFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQE 955 EFSEL Q MGPGLTFVI AQPYLNAIPMPVGLEA+CLKA THYPTLFDHFQRELRDVLQE Sbjct: 196 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 255 Query: 956 LQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRID 1135 LQ K LVQDWHETESWKLLKELANSA HRAIVRKVTQPKPVQGVLGMDLER KTIQ R+D Sbjct: 256 LQQKLLVQDWHETESWKLLKELANSAQHRAIVRKVTQPKPVQGVLGMDLERVKTIQSRLD 315 Query: 1136 EFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICN 1315 EFT++MSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHG+APQELCDTICN Sbjct: 316 EFTQRMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGRAPQELCDTICN 375 Query: 1316 LFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL 1495 LFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL Sbjct: 376 LFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL 435 Query: 1496 GEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKR 1675 GEDGC+ISVALESRHGDPTFSKLFG++VRIDRIQGLADTLTYERNCEALMLLQK+GLHKR Sbjct: 436 
GEDGCTISVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEALMLLQKNGLHKR 495 Query: 1676 NPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPL 1855 NPSIAAVVTLFGDKEDV WL ENDLAD SEVKLDG++G KTFDDSQKKAIALGLNKKRPL Sbjct: 496 NPSIAAVVTLFGDKEDVTWLEENDLADWSEVKLDGIMG-KTFDDSQKKAIALGLNKKRPL 554 Query: 1856 LIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPA 2035 LIIQGPPGTGKTGLLKE+IARAVQQGERVLVTAPTNAAVDNMVEKLSD+GLNIVRVGNPA Sbjct: 555 LIIQGPPGTGKTGLLKEIIARAVQQGERVLVTAPTNAAVDNMVEKLSDVGLNIVRVGNPA 614 Query: 2036 RISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXX 2215 RISP VASKSL EIV SKLASF+AEFERKKSDLRKDL QCLKDDSLAAGIR Sbjct: 615 RISPAVASKSLGEIVKSKLASFVAEFERKKSDLRKDLRQCLKDDSLAAGIRQLLKQLGKT 674 Query: 2216 XXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQG 2395 SSAQVVL TNTGAADPLIRRLDTFDLVVIDEA QAIE SC IPILQG Sbjct: 675 LKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAAQAIEPSCLIPILQG 734 Query: 2396 KRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWA 2575 KRCILAGDQCQLAPVILSRKALEG LGVSLLERAATLHEG LATKLTTQYRMNDAIASWA Sbjct: 735 KRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLATKLTTQYRMNDAIASWA 794 Query: 2576 SKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAG 2755 SKEMYGGSL+SSSTVA+HLLVDTPFVKPTWITQCPLLLLDTR+ YGSLSLGCEEHLD AG Sbjct: 795 SKEMYGGSLISSSTVASHLLVDTPFVKPTWITQCPLLLLDTRLPYGSLSLGCEEHLDLAG 854 Query: 2756 TGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVAT 2935 TGSFYNEGEAEIVV HVFSLI AGVSPSAIAVQSPYVAQVQLLR+RLDELPEAAGVEVAT Sbjct: 855 TGSFYNEGEAEIVVHHVFSLICAGVSPSAIAVQSPYVAQVQLLRERLDELPEAAGVEVAT 914 Query: 2936 IDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL 3115 IDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL Sbjct: 915 IDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL 974 Query: 3116 ARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 ARLLRHIRYFGRVKHAEPGSFG SGL M+PMLPSI Sbjct: 975 ARLLRHIRYFGRVKHAEPGSFGGSGLGMDPMLPSI 1009 >XP_006484692.1 PREDICTED: DNA-binding protein SMUBP-2 [Citrus sinensis] Length = 1010 Score = 
1642 bits (4252), Expect = 0.0 Identities = 850/995 (85%), Positives = 877/995 (88%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCGS VPVT RK+LALNVRRFNSSV H APL+FS CSS RS+CLFIGYKSSS F F+QP Sbjct: 16 FCGSRSVPVTTRKTLALNVRRFNSSVWHPAPLKFSVCSSVRSICLFIGYKSSSSFEFFQP 75 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXXPRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSRVXXX 595 QQFV Y PRRKSS FSKS+IQ NVS V Sbjct: 76 QQFVPYNSSSSSSSTKSSTTFKKKPRRKSSGFSKSKIQKTKTLSGPNSSTKANVSSVVEK 135 Query: 596 XXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQG 775 NV+ALSQNG+PLGRRELGK VVRWICQGMRAMASDFASAE+QG Sbjct: 136 SSGEKQQEQPKKSDNAVNVQALSQNGNPLGRRELGKGVVRWICQGMRAMASDFASAEIQG 195 Query: 776 EFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQE 955 EFSEL Q MGPGLTFVI AQPYLNAIPMPVGLEA+CLKA THYPTLFDHFQRELRDVLQE Sbjct: 196 EFSELRQRMGPGLTFVIEAQPYLNAIPMPVGLEAVCLKAGTHYPTLFDHFQRELRDVLQE 255 Query: 956 LQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRID 1135 LQ K LVQDWHETESWKLLKELANSA HRAIVRKVTQPKPVQGVLGMDLER KTIQ R+D Sbjct: 256 LQQKLLVQDWHETESWKLLKELANSAQHRAIVRKVTQPKPVQGVLGMDLERVKTIQSRLD 315 Query: 1136 EFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICN 1315 EFT++MSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHG+APQELCDTICN Sbjct: 316 EFTQRMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGRAPQELCDTICN 375 Query: 1316 LFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL 1495 LF VSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL Sbjct: 376 LFVVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNL 435 Query: 1496 GEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKR 1675 GEDGC+ISVALESRHGDPTFSKLFG++VRIDRIQGLADTLTYERNCEALMLLQK+GLHKR Sbjct: 436 GEDGCTISVALESRHGDPTFSKLFGKSVRIDRIQGLADTLTYERNCEALMLLQKNGLHKR 495 Query: 1676 NPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPL 1855 NPSIAAVVTLFGDKEDV WL ENDLAD SEVKLDG++G KTFDDSQKKAIALGLNKKRPL Sbjct: 496 NPSIAAVVTLFGDKEDVTWLEENDLADWSEVKLDGIMG-KTFDDSQKKAIALGLNKKRPL 554 Query: 1856 
LIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPA 2035 LIIQGPPGTGKTGLLKE+IARAVQQGERVLVTAPTNAAVDNMVEKLSD+GLNIVRVGNPA Sbjct: 555 LIIQGPPGTGKTGLLKEIIARAVQQGERVLVTAPTNAAVDNMVEKLSDVGLNIVRVGNPA 614 Query: 2036 RISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXX 2215 RISP VASKSL EIV SKLASF+AEFERKKSDLRKDL QCLKDDSLAAGIR Sbjct: 615 RISPAVASKSLGEIVKSKLASFVAEFERKKSDLRKDLRQCLKDDSLAAGIRQLLKQLGKT 674 Query: 2216 XXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQG 2395 SSAQVVL TNTGAADPLIRRLDTFDLVVIDEA QAIE SC IPILQG Sbjct: 675 LKKKEKETVKEVLSSAQVVLATNTGAADPLIRRLDTFDLVVIDEAAQAIEPSCLIPILQG 734 Query: 2396 KRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWA 2575 KRCILAGDQCQLAPVILSRKALEG LGVSLLERAATLHEG LATKLTTQYRMNDAIASWA Sbjct: 735 KRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLATKLTTQYRMNDAIASWA 794 Query: 2576 SKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAG 2755 SKEMYGGSL+SSSTVA+HLLVDTPFVKPTWITQCPLLLLDTR+ YGSLSLGCEEHLD AG Sbjct: 795 SKEMYGGSLISSSTVASHLLVDTPFVKPTWITQCPLLLLDTRLPYGSLSLGCEEHLDLAG 854 Query: 2756 TGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVAT 2935 TGSFYNEGEAEIVV HVFSLI AGVSPSAIAVQSPYVAQVQLLR+RLDELPEAAGVEVAT Sbjct: 855 TGSFYNEGEAEIVVHHVFSLICAGVSPSAIAVQSPYVAQVQLLRERLDELPEAAGVEVAT 914 Query: 2936 IDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL 3115 IDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL Sbjct: 915 IDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFL 974 Query: 3116 ARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 ARLLRHIRYFGRVKHAEPGSFG SGL M+PMLPSI Sbjct: 975 ARLLRHIRYFGRVKHAEPGSFGGSGLGMDPMLPSI 1009 >EOY10295.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein isoform 1 [Theobroma cacao] Length = 1008 Score = 1473 bits (3813), Expect = 0.0 Identities = 771/1004 (76%), Positives = 828/1004 (82%), Gaps = 9/1004 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSS-FRSVCLFIGYKSSSLFAFYQ 412 FCGS +P T ++LAL+V+R SS + PL FS SS +S+CLF+G+K + +Q Sbjct: 8 
FCGS--IPSTTTRTLALSVQR--SSFSSSLPLSFSSSSSPVKSICLFVGHKYNYPSTKFQ 63 Query: 413 PQQFVCYXXXXXXXXXXXXXXXXXX-PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR-- 583 +Q VC PR KS+V SK +I S Sbjct: 64 SKQLVCNGSSSSSRSSRKFTTATKKKPRSKSNVASKPKISENDNDGISSKSTSKPSSSCS 123 Query: 584 -----VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMAS 748 V NVR L QNGDPLGRR+LGK V+RWI +GM+AMAS Sbjct: 124 STKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGMKAMAS 183 Query: 749 DFASAEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQ 928 DF +AE+QGEF EL Q MGPGLTFVI AQPYLNAIP+P+GLEAICLKACTHYPTLFDHFQ Sbjct: 184 DFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTLFDHFQ 243 Query: 929 RELRDVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLER 1108 RELR++LQELQ S+V+DW ETESWKLLKELANSA HRAI RK+TQPKPVQGVLGMDLE+ Sbjct: 244 RELRNILQELQQNSVVEDWRETESWKLLKELANSAQHRAIARKITQPKPVQGVLGMDLEK 303 Query: 1109 FKTIQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAP 1288 K +QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDE SDSSKPIEFLVSHGQA Sbjct: 304 AKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVSHGQAQ 363 Query: 1289 QELCDTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATS 1468 QELCDTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGA ATS Sbjct: 364 QELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 423 Query: 1469 CIQGFVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALML 1648 C+QGFV NLGEDGCSISVALESRHGDPTFSK FG+NVRIDRIQGLAD LTYERNCEALML Sbjct: 424 CMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNCEALML 483 Query: 1649 LQKHGLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIA 1828 LQK+GL K+NPSIA V TLFGDKEDV WL +N AD +E KLDG+L + TFDDSQ++AIA Sbjct: 484 LQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQQRAIA 543 Query: 1829 LGLNKKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGL 2008 LGLNKKRP+L++QGPPGTGKTGLLKE+IA AVQQGERVLV APTNAAVDNMVEKLS+IGL Sbjct: 544 LGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKLSNIGL 603 Query: 2009 NIVRVGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIR 
2188 NIVRVGNPARIS VASKSL EIV SKLA +LAEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 604 NIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSLAAGIR 663 Query: 2189 XXXXXXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIET 2368 SSAQVVL TNTGAADPLIRR+DTFDLVVIDEAGQAIE Sbjct: 664 QLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAGQAIEP 723 Query: 2369 SCWIPILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYR 2548 SCWIPILQGKRCILAGDQCQLAPVILSRKALEG LGVSLLERAAT+HEG LAT LTTQYR Sbjct: 724 SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATMLTTQYR 783 Query: 2549 MNDAIASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLG 2728 MNDAIA WASKEMY G L SS +V +HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS+G Sbjct: 784 MNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVG 843 Query: 2729 CEEHLDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELP 2908 CEEHLDPAGTGSFYNEGEA+IVVQHVF LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE P Sbjct: 844 CEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFP 903 Query: 2909 EAAGVEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDS 3088 EAAGVEVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVA+TRA KHVAVVCDS Sbjct: 904 EAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVAVVCDS 963 Query: 3089 STICHNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 STICHNTFLARLLRHIRYFGRVKHAEPG+ G SGL M+PMLPSI Sbjct: 964 STICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSI 1007 >XP_017977299.1 PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] XP_007029793.2 PREDICTED: DNA-binding protein SMUBP-2 [Theobroma cacao] Length = 1008 Score = 1471 bits (3809), Expect = 0.0 Identities = 770/1004 (76%), Positives = 828/1004 (82%), Gaps = 9/1004 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSS-FRSVCLFIGYKSSSLFAFYQ 412 FCGS +P T ++LAL+V+R SS + PL FS SS +S+CLF+G+K + +Q Sbjct: 8 FCGS--IPSTTTRTLALSVQR--SSFSSSLPLSFSSSSSPVKSICLFVGHKYNYPSTKFQ 63 Query: 413 PQQFVCYXXXXXXXXXXXXXXXXXX-PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR-- 583 +Q VC PR KS+V SK +I S Sbjct: 64 
SKQLVCNGSSSSSRSSRKFTTATKKKPRSKSNVASKPKISENDNDGISSKSTSKPSSSCS 123 Query: 584 -----VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMAS 748 V NVR L QNGDPLGRR+LGK V+RWI +GM+AMAS Sbjct: 124 STKIIVEELGLLKNQKQEKVKKTKAVNVRTLYQNGDPLGRRDLGKRVIRWISEGMKAMAS 183 Query: 749 DFASAEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQ 928 DF +AE+QGEF EL Q MGPGLTFVI AQPYLNAIP+P+GLEAICLKACTHYPTLFDHFQ Sbjct: 184 DFVTAELQGEFLELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAICLKACTHYPTLFDHFQ 243 Query: 929 RELRDVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLER 1108 RELR++LQELQ S+V+DW +TESWKLLKELANSA HRAI RK+TQPKPVQGVLGMDLE+ Sbjct: 244 RELRNILQELQQNSVVEDWRKTESWKLLKELANSAQHRAIARKITQPKPVQGVLGMDLEK 303 Query: 1109 FKTIQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAP 1288 K +QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDE SDSSKPIEFLVSHGQA Sbjct: 304 AKAMQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDEGSDSSKPIEFLVSHGQAQ 363 Query: 1289 QELCDTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATS 1468 QELCDTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGA ATS Sbjct: 364 QELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATS 423 Query: 1469 CIQGFVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALML 1648 C+QGFV NLGEDGCSISVALESRHGDPTFSK FG+NVRIDRIQGLAD LTYERNCEALML Sbjct: 424 CMQGFVDNLGEDGCSISVALESRHGDPTFSKFFGKNVRIDRIQGLADALTYERNCEALML 483 Query: 1649 LQKHGLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIA 1828 LQK+GL K+NPSIA V TLFGDKEDV WL +N AD +E KLDG+L + TFDDSQ++AIA Sbjct: 484 LQKNGLQKKNPSIAVVATLFGDKEDVTWLEKNSYADWNEAKLDGLLQNGTFDDSQQRAIA 543 Query: 1829 LGLNKKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGL 2008 LGLNKKRP+L++QGPPGTGKTGLLKE+IA AVQQGERVLV APTNAAVDNMVEKLS+IGL Sbjct: 544 LGLNKKRPILVVQGPPGTGKTGLLKEVIALAVQQGERVLVAAPTNAAVDNMVEKLSNIGL 603 Query: 2009 NIVRVGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIR 2188 NIVRVGNPARIS VASKSL EIV SKLA +LAEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 604 NIVRVGNPARISSAVASKSLAEIVNSKLADYLAEFERKKSDLRKDLRHCLKDDSLAAGIR 663 Query: 2189 
XXXXXXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIET 2368 SSAQVVL TNTGAADPLIRR+DTFDLVVIDEAGQAIE Sbjct: 664 QLLKQLGKALKKKEKETVREVLSSAQVVLSTNTGAADPLIRRMDTFDLVVIDEAGQAIEP 723 Query: 2369 SCWIPILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYR 2548 SCWIPILQGKRCILAGDQCQLAPVILSRKALEG LGVSLLERAAT+HEG LAT LTTQYR Sbjct: 724 SCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATMHEGVLATMLTTQYR 783 Query: 2549 MNDAIASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLG 2728 MNDAIA WASKEMY G L SS +V +HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS+G Sbjct: 784 MNDAIAGWASKEMYDGELKSSPSVGSHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVG 843 Query: 2729 CEEHLDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELP 2908 CEEHLDPAGTGSFYNEGEA+IVVQHVF LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE P Sbjct: 844 CEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFP 903 Query: 2909 EAAGVEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDS 3088 EAAGVEVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVA+TRA KHVAVVCDS Sbjct: 904 EAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAVTRARKHVAVVCDS 963 Query: 3089 STICHNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 STICHNTFLARLLRHIRYFGRVKHAEPG+ G SGL M+PMLPSI Sbjct: 964 STICHNTFLARLLRHIRYFGRVKHAEPGTSGGSGLGMDPMLPSI 1007 >OMO99192.1 putative DNA-binding protein smubp-2 [Corchorus capsularis] Length = 1011 Score = 1453 bits (3761), Expect = 0.0 Identities = 768/1007 (76%), Positives = 822/1007 (81%), Gaps = 12/1007 (1%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCGS V K+LAL+V + SS + PL FS S+ +S+CLF+ +K S A + Sbjct: 8 FCGS--VSSITTKTLALSVPK--SSTFSSLPLSFSSSSAVKSICLFVSHKYSYPSAKFPW 63 Query: 416 QQFVC---YXXXXXXXXXXXXXXXXXXPRRKSSVFSKSRIQXXXXXXXXXXXXXV----- 571 +Q VC PR KS+V +K +I Sbjct: 64 KQLVCNGSISKSSSSQSSSKSTATKKKPRSKSNVGNKPKISKEKKSGIVISSESTSKPNS 123 Query: 572 NVSR----VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRA 739 NVS V NVR L QNGDPLGR++LGK+V+RWI +GMRA Sbjct: 124 NVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGMRA 183 Query: 740 
MASDFASAEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFD 919 MA DFASAE+QGEF EL Q MGPGLTFVI AQPYLNAIP+P+GLEAI LKACTHYPTLFD Sbjct: 184 MALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTLFD 243 Query: 920 HFQRELRDVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMD 1099 HFQRELR+VLQELQ KS+V+DW ETESWK+LKELANSA HRAI RK TQPKPVQGVLGMD Sbjct: 244 HFQRELRNVLQELQQKSMVEDWRETESWKMLKELANSAQHRAIARKSTQPKPVQGVLGMD 303 Query: 1100 LERFKTIQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHG 1279 LE+ K +QGRIDEFTK MSELL+IERDAELEFTQEELNAVPTPDE S+ SKPIEFLVSHG Sbjct: 304 LEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVSHG 363 Query: 1280 QAPQELCDTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAC 1459 QA QELCDTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICD+RGA Sbjct: 364 QAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRGAG 423 Query: 1460 ATSCIQGFVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEA 1639 AT+C+QGFV NLGEDGCSISVALESRHGDPTFSKLFG+ VRIDRIQGLAD LTYERNCEA Sbjct: 424 ATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNCEA 483 Query: 1640 LMLLQKHGLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKK 1819 LMLLQK+GL K+NPSIA V TLFGDKED+ WL +NDLAD +E KLDG+L + FDDSQ+K Sbjct: 484 LMLLQKNGLQKKNPSIAVVATLFGDKEDMDWLEKNDLADWNETKLDGLLQNGIFDDSQRK 543 Query: 1820 AIALGLNKKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSD 1999 AIALGLNKKRP+L++QGPPGTGKTGLLKE+IA AVQQGERVLVTAPTNAAVDNMVEKLSD Sbjct: 544 AIALGLNKKRPVLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKLSD 603 Query: 2000 IGLNIVRVGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAA 2179 GLNIVRVGNPARIS VASKSL EIV SKLA+F AEFERKKSDLRKDL CLKDDSLAA Sbjct: 604 TGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSLAA 663 Query: 2180 GIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQA 2359 GIR SSAQVVL TNTGAADPLIRRL TFDLVVIDEAGQA Sbjct: 664 GIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAGQA 723 Query: 2360 IETSCWIPILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTT 2539 IE 
SCWIPILQGKRCILAGDQCQLAPVILSRKALEG LGVSLLERAATLHEG L T LTT Sbjct: 724 IEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLLTT 783 Query: 2540 QYRMNDAIASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSL 2719 QYRMNDAIA WASKEMY G L SS +VA+HLLVD+PFVKPTWITQCPLLLLDTRM YGSL Sbjct: 784 QYRMNDAIAGWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSL 843 Query: 2720 SLGCEEHLDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLD 2899 S+GCEEHLDPAGTGSFYNEGEA+IVVQHVF LIYAGVSP IAVQSPYVAQVQLLRDRLD Sbjct: 844 SVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKTIAVQSPYVAQVQLLRDRLD 903 Query: 2900 ELPEAAGVEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVV 3079 E PEAAGVEVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVV Sbjct: 904 EFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVV 963 Query: 3080 CDSSTICHNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 CDSSTICHNTFLARLLRHIRYFGRVKHAEPG+ G SGL M+PMLPSI Sbjct: 964 CDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSI 1010 >OMO56477.1 hypothetical protein COLO4_35630 [Corchorus olitorius] Length = 1011 Score = 1449 bits (3750), Expect = 0.0 Identities = 768/1007 (76%), Positives = 825/1007 (81%), Gaps = 12/1007 (1%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCGS + +TA +LAL+ ++ SS L + PL FS S+ +S+CLF+ +K S A + Sbjct: 8 FCGS-VSSITAN-TLALSFQK--SSTLSSLPLSFSSSSAVKSICLFVSHKYSYPSAKFPW 63 Query: 416 QQFVC---YXXXXXXXXXXXXXXXXXXPRRKSSVFSKSRIQXXXXXXXXXXXXXV----- 571 +Q VC PR KS+V +K +I Sbjct: 64 KQLVCNGSISKSSSSQSSSKSTATKKKPRSKSNVGNKPKISKDKKSGIVISSESTSKPNS 123 Query: 572 NVSR----VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRA 739 NVS V NVR L QNGDPLGR++LGK+V+RWI +GMRA Sbjct: 124 NVSGTKLIVEEMGLLKKKNQQKVKKTKAVNVRTLYQNGDPLGRKDLGKTVIRWISEGMRA 183 Query: 740 MASDFASAEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFD 919 MA DFASAE+QGEF EL Q MGPGLTFVI AQPYLNAIP+P+GLEAI LKACTHYPTLFD Sbjct: 184 MALDFASAELQGEFPELRQRMGPGLTFVIQAQPYLNAIPIPLGLEAISLKACTHYPTLFD 243 Query: 920 
HFQRELRDVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMD 1099 HFQRELR+VLQELQ KS+V+DW ETESWK+LKELA+SA HRAI RK TQPKPVQGVLGMD Sbjct: 244 HFQRELRNVLQELQQKSMVEDWRETESWKMLKELAHSAQHRAIARKSTQPKPVQGVLGMD 303 Query: 1100 LERFKTIQGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHG 1279 LE+ K +QGRIDEFTK MSELL+IERDAELEFTQEELNAVPTPDE S+ SKPIEFLVSHG Sbjct: 304 LEKVKAMQGRIDEFTKWMSELLQIERDAELEFTQEELNAVPTPDEGSNPSKPIEFLVSHG 363 Query: 1280 QAPQELCDTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGAC 1459 QA QELCDTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICD+RGA Sbjct: 364 QAQQELCDTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDNRGAG 423 Query: 1460 ATSCIQGFVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEA 1639 AT+C+QGFV NLGEDGCSISVALESRHGDPTFSKLFG+ VRIDRIQGLAD LTYERNCEA Sbjct: 424 ATACMQGFVDNLGEDGCSISVALESRHGDPTFSKLFGKTVRIDRIQGLADALTYERNCEA 483 Query: 1640 LMLLQKHGLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKK 1819 LMLLQK+GL K+N SIA V TLFGDKED+ WL +NDLAD +E LDG+L + FDDSQ+K Sbjct: 484 LMLLQKNGLQKKNLSIAVVATLFGDKEDMDWLEKNDLADWNETMLDGLLQNGIFDDSQRK 543 Query: 1820 AIALGLNKKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSD 1999 AIALGLNKKRPLL++QGPPGTGKTGLLKE+IA AVQQGERVLVTAPTNAAVDNMVEKLSD Sbjct: 544 AIALGLNKKRPLLVVQGPPGTGKTGLLKEIIALAVQQGERVLVTAPTNAAVDNMVEKLSD 603 Query: 2000 IGLNIVRVGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAA 2179 GLNIVRVGNPARIS VASKSL EIV SKLA+F AEFERKKSDLRKDL CLKDDSLAA Sbjct: 604 TGLNIVRVGNPARISSAVASKSLVEIVNSKLANFRAEFERKKSDLRKDLRLCLKDDSLAA 663 Query: 2180 GIRXXXXXXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQA 2359 GIR SSAQVVL TNTGAADPLIRRL TFDLVVIDEAGQA Sbjct: 664 GIRQLLKQLGKTLKKKEKETVREILSSAQVVLSTNTGAADPLIRRLKTFDLVVIDEAGQA 723 Query: 2360 IETSCWIPILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTT 2539 IE SCWIPILQGKRCILAGDQCQLAPVILSRKALEG LGVSLLERAATLHEG L T LTT Sbjct: 724 IEPSCWIPILQGKRCILAGDQCQLAPVILSRKALEGGLGVSLLERAATLHEGVLTTLLTT 783 Query: 2540 QYRMNDAIASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSL 2719 QYRMNDAIASWASKEMY G L SS 
+VA+HLLVD+PFVKPTWITQCPLLLLDTRM YGSL Sbjct: 784 QYRMNDAIASWASKEMYNGELKSSPSVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSL 843 Query: 2720 SLGCEEHLDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLD 2899 S+GCEEHLDPAGTGSFYNEGEA+IVVQHVF LIYAGVSP AIAVQSPYVAQVQLLRDRLD Sbjct: 844 SVGCEEHLDPAGTGSFYNEGEADIVVQHVFYLIYAGVSPKAIAVQSPYVAQVQLLRDRLD 903 Query: 2900 ELPEAAGVEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVV 3079 E PEAAGVEVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVV Sbjct: 904 EFPEAAGVEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVV 963 Query: 3080 CDSSTICHNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 CDSSTICHNTFLARLLRHIRYFGRVKHAEPG+ G SGL M+PMLPSI Sbjct: 964 CDSSTICHNTFLARLLRHIRYFGRVKHAEPGNSGGSGLGMDPMLPSI 1010 >XP_016697684.1 PREDICTED: DNA-binding protein SMUBP-2-like [Gossypium hirsutum] Length = 1000 Score = 1438 bits (3722), Expect = 0.0 Identities = 758/998 (75%), Positives = 816/998 (81%), Gaps = 3/998 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCG+ +P T K+LAL VR+ SS L + P S SS +S+CLF+G + S +Q Sbjct: 8 FCGN--IPSTTTKALALTVRK--SSFLSSLPFSSS-PSSLKSICLFVGRRYSFPSTKFQS 62 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXX---PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSRV 586 +Q VC PR KS + S +I V + + Sbjct: 63 KQLVCNGGGESSGSHGSSKFATTTKKKPRSKSYIGSNPKISKSENKSTSKPNDSVTRTNI 122 Query: 587 XXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAE 766 NVR L QNGDPLGRR+LGK VV WI +GM+AMASDFASAE Sbjct: 123 LVEELGLFKKQKVQKTKAL-NVRTLYQNGDPLGRRDLGKRVVWWISEGMKAMASDFASAE 181 Query: 767 VQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDV 946 +QGEF EL Q MGPGLTFVI AQPYLN+IP+P+GLEAICLKACTHYPTLFDHFQRELR+V Sbjct: 182 LQGEFLELRQRMGPGLTFVIQAQPYLNSIPIPLGLEAICLKACTHYPTLFDHFQRELRNV 241 Query: 947 LQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQG 1126 LQELQ S+VQDW ETESWKLLKELANSA HRAI RKVT PKPVQGVLGMDLE+ K +QG Sbjct: 242 LQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLEKAKAMQG 301 Query: 1127 
RIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDT 1306 RIDEFTKQMSELLRIERDAELEFTQEEL+AVPT DE SDSSKPIEFLVSHGQA QELCDT Sbjct: 302 RIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQAQQELCDT 361 Query: 1307 ICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFV 1486 ICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRI DSRGA ATSCIQGFV Sbjct: 362 ICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGATSCIQGFV 421 Query: 1487 HNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGL 1666 NLG+DGCSISVALESRHGDPTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+GL Sbjct: 422 DNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKNGL 481 Query: 1667 HKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKK 1846 K+NPSIA V TLFGDKEDV WL ENDLAD S +LDG+L + TFDDSQ++AIALGLNKK Sbjct: 482 QKKNPSIAVVATLFGDKEDVEWLEENDLADWSPAELDGLLQNGTFDDSQQRAIALGLNKK 541 Query: 1847 RPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVG 2026 RP++++QGPPGTGKTG+LKE+IA A QQGERVLVTAPTNAAVDN+VEKLS+ GLNIVRVG Sbjct: 542 RPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTGLNIVRVG 601 Query: 2027 NPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXX 2206 NPARIS VASKSL EIV SKLA + AEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 602 NPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGIRQLLKQL 661 Query: 2207 XXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPI 2386 S+AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWIPI Sbjct: 662 GKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWIPI 721 Query: 2387 LQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIA 2566 LQGKRCILAGDQCQLAPVILSRKALEG LG+SLLERAATLHEG LAT L TQYRMNDAIA Sbjct: 722 LQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGVLATMLATQYRMNDAIA 781 Query: 2567 SWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLD 2746 SW+SKEMY G L SS VA+HLLV +PFVKPTWITQCPLLLLDTRM YGSLS+GCEEHLD Sbjct: 782 SWSSKEMYDGELKSSPLVASHLLVGSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLD 841 Query: 2747 PAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVE 2926 AGTGSF+NEGEA+IVVQHV 
LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE PEA G+E Sbjct: 842 LAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFPEADGIE 901 Query: 2927 VATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHN 3106 VATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTICHN Sbjct: 902 VATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHN 961 Query: 3107 TFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 TFLARLLRHIRY GRVKHAEPG+FG SGL M+PMLPSI Sbjct: 962 TFLARLLRHIRYVGRVKHAEPGAFGGSGLGMDPMLPSI 999 >XP_016671666.1 PREDICTED: DNA-binding protein SMUBP-2-like [Gossypium hirsutum] Length = 1003 Score = 1437 bits (3721), Expect = 0.0 Identities = 760/1000 (76%), Positives = 815/1000 (81%), Gaps = 5/1000 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCG+ +P T K+LAL VR+ SS L + P S SS +S+CLF+G + S +Q Sbjct: 8 FCGN--IPSTTTKALALTVRK--SSFLSSLPFSSS-PSSLKSICLFVGRRYSFPSTKFQS 62 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXX---PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR- 583 +Q VC PR KS + S +I V + Sbjct: 63 KQLVCNGGGESSGSHGSSKFATTTKKKPRSKSYIGSNPKISKSENKSTSKPNDSVTRTNI 122 Query: 584 -VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFAS 760 V NVR L QNGDPLGRR+LGK VV+WI +GM+AMASDFAS Sbjct: 123 LVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVKWISEGMKAMASDFAS 182 Query: 761 AEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELR 940 AE+QGEF EL Q MGPGLTFVI AQPYLN+IP+P+GLEAICLKACTHYPTLFDHFQRELR Sbjct: 183 AELQGEFLELRQRMGPGLTFVIQAQPYLNSIPIPLGLEAICLKACTHYPTLFDHFQRELR 242 Query: 941 DVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTI 1120 +VLQELQ S+VQDW ETESWKLLKELANSA HRAI RKVT PKPVQGVLGMDLE+ KT+ Sbjct: 243 NVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLEKAKTM 302 Query: 1121 QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELC 1300 QGRIDEFTKQMSELLRIERDAELEFTQEEL+AVPT DE SDSSKPIEFLVSHGQA QELC Sbjct: 303 QGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQAQQELC 362 Query: 1301 DTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQG 1480 DTICNL 
AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRI DSRGA ATSCIQG Sbjct: 363 DTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGATSCIQG 422 Query: 1481 FVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKH 1660 FV NLG+DGCSISVALESRHGDPTFSKLFG+ VRIDRI GLAD LTYERNCEALMLLQK+ Sbjct: 423 FVDNLGDDGCSISVALESRHGDPTFSKLFGKRVRIDRIHGLADALTYERNCEALMLLQKN 482 Query: 1661 GLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLN 1840 GL K+NPSIA V TLFGDKEDV WL ENDLAD +LDG+L + TFDDSQ++AI LGLN Sbjct: 483 GLQKKNPSIAVVATLFGDKEDVEWLEENDLADWRPAELDGLLQNGTFDDSQQRAITLGLN 542 Query: 1841 KKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVR 2020 KKRP++++QGPPGTGKTG+LKE+IA A QQGERVLVTAPTNAAVDN+VEKLS+ GLNIVR Sbjct: 543 KKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTGLNIVR 602 Query: 2021 VGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXX 2200 VGNPARIS VASKSL EIV SKLA + AEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 603 VGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGIRQLLK 662 Query: 2201 XXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWI 2380 S+AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWI Sbjct: 663 QLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWI 722 Query: 2381 PILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDA 2560 PILQGKRCILAGDQ QLAPVILSRKALEG LGVSLLERAATLHEG LAT L TQYRMNDA Sbjct: 723 PILQGKRCILAGDQWQLAPVILSRKALEGGLGVSLLERAATLHEGVLATMLATQYRMNDA 782 Query: 2561 IASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEH 2740 IASWASKEMY G L SS VA+HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS+GCEEH Sbjct: 783 IASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEH 842 Query: 2741 LDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAG 2920 LD AGTGSF+NEGEA+IVVQHV LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE PEA G Sbjct: 843 LDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFPEADG 902 Query: 2921 VEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTIC 3100 +EVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTIC Sbjct: 903 
IEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTIC 962 Query: 3101 HNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 HNTFLARLLRHIRY GRVKHAEPG+FG SGL M+PMLPSI Sbjct: 963 HNTFLARLLRHIRYVGRVKHAEPGAFGGSGLGMDPMLPSI 1002 >XP_012492340.1 PREDICTED: DNA-binding protein SMUBP-2 [Gossypium raimondii] KJB44363.1 hypothetical protein B456_007G248100 [Gossypium raimondii] Length = 1003 Score = 1437 bits (3720), Expect = 0.0 Identities = 759/1000 (75%), Positives = 815/1000 (81%), Gaps = 5/1000 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCG+ +P T K+LAL VR+ SS L + P S SS +S+CLF+G + S +Q Sbjct: 8 FCGN--IPSTTTKALALTVRK--SSFLSSLPFSSS-PSSLKSICLFVGRRYSFPSTKFQS 62 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXX---PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR- 583 +Q VC PR KS + S +I V + Sbjct: 63 KQLVCNGGGESSGSHGSSKFATTTKKKPRSKSYIGSNPKISKSENKSTSKPNDSVTRTNI 122 Query: 584 -VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFAS 760 V NVR L QNGDPLGRR+LGK VV WI +GM+AMASDFAS Sbjct: 123 LVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVWWISEGMKAMASDFAS 182 Query: 761 AEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELR 940 AE+QGEF EL Q MGPGLTFVI AQPYLN++PMP+GLEAICLKACTHYPTLFDHFQRELR Sbjct: 183 AELQGEFLELRQRMGPGLTFVIQAQPYLNSVPMPLGLEAICLKACTHYPTLFDHFQRELR 242 Query: 941 DVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTI 1120 +VLQELQ S+VQDW ETESWKLLKELANSA HRAI RKVT PKPVQGVLGMDLE+ K + Sbjct: 243 NVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLEKAKAM 302 Query: 1121 QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELC 1300 QGRIDEFTKQMSELLRIERDAELEFTQEEL+AVPT DE SDSSKPIEFLVSHGQA QELC Sbjct: 303 QGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQAQQELC 362 Query: 1301 DTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQG 1480 DTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRI DSRGA ATSCIQG Sbjct: 363 DTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGATSCIQG 422 Query: 1481 
FVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKH 1660 FV NLG+DGCSISVALESRHGDPTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+ Sbjct: 423 FVDNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKN 482 Query: 1661 GLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLN 1840 GL K+NPSIA V TLF DKEDV WL ENDLAD S +LDG+L + TFDDSQ++AIALGLN Sbjct: 483 GLQKKNPSIAVVATLFADKEDVEWLEENDLADWSPAELDGLLQNGTFDDSQQRAIALGLN 542 Query: 1841 KKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVR 2020 KKRP++++QGPPGTGKTG+LKE+IA A QQGERVLVTAPTNAAVDN+VEKLS+ GLNIVR Sbjct: 543 KKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTGLNIVR 602 Query: 2021 VGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXX 2200 VGNPARIS VASKSL EIV SKLA + AEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 603 VGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGIRQLLK 662 Query: 2201 XXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWI 2380 S+AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWI Sbjct: 663 QLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWI 722 Query: 2381 PILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDA 2560 PILQGKRCILAGDQCQLAPVILSRKALEG LG+SLLERAATLHEG LAT L TQYRMNDA Sbjct: 723 PILQGKRCILAGDQCQLAPVILSRKALEGGLGISLLERAATLHEGVLATMLATQYRMNDA 782 Query: 2561 IASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEH 2740 IASWASKEMY G L SS VA+HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS+GCEEH Sbjct: 783 IASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEH 842 Query: 2741 LDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAG 2920 LD AGTGSF+NEGEA+IVVQHV LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE PEA G Sbjct: 843 LDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFPEADG 902 Query: 2921 VEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTIC 3100 +EVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTIC Sbjct: 903 IEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTIC 962 Query: 3101 HNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 HNTFLARLLRHIRY GRVKHAEPG+ G SGL M+PMLPSI 
Sbjct: 963 HNTFLARLLRHIRYVGRVKHAEPGASGGSGLGMDPMLPSI 1002 >KHG05926.1 DNA-binding SMUBP-2 [Gossypium arboreum] Length = 1003 Score = 1436 bits (3718), Expect = 0.0 Identities = 759/1000 (75%), Positives = 816/1000 (81%), Gaps = 5/1000 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCG+ +P T K+LAL VR+ SS L + P S SS +S+CLF+G + S +Q Sbjct: 8 FCGN--IPSTTTKALALTVRK--SSFLSSLPFSSS-PSSLKSICLFVGRRYSFPSTKFQS 62 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXX---PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR- 583 +Q VC PR KS + S +I V + Sbjct: 63 KQLVCNGGGESSGSHGSSKFATTTKKKPRSKSYIGSNPKISKSENKSTSKPNDSVTRTNI 122 Query: 584 -VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFAS 760 V NVR L QNGDPLGRR+LGK VV+WI +GM+AMASDFAS Sbjct: 123 LVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVKWISEGMKAMASDFAS 182 Query: 761 AEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELR 940 AE+QGEF EL Q MGPGLTFVI AQPYLN+IP+P+GLEAICLKACTHYPTLFDHFQRELR Sbjct: 183 AELQGEFLELRQRMGPGLTFVIQAQPYLNSIPIPLGLEAICLKACTHYPTLFDHFQRELR 242 Query: 941 DVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTI 1120 +VLQELQ S+VQDW ETESWKLLKELANSA HRAI RKVT PKPVQGVLGMDLE+ KT+ Sbjct: 243 NVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLEKAKTM 302 Query: 1121 QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELC 1300 QGRIDEFTKQMSELLRIERDAELEFTQEEL+AVPT DE SDSSKPIEFLVSHGQA QELC Sbjct: 303 QGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQAQQELC 362 Query: 1301 DTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQG 1480 DTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRI DSRGA ATSCIQG Sbjct: 363 DTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGATSCIQG 422 Query: 1481 FVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKH 1660 FV NLG+DGCSISVALESRHGDPTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+ Sbjct: 423 FVDNLGDDGCSISVALESRHGDPTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKN 482 Query: 1661 GLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLN 1840 GL K+NPSIA V TLFGDKEDV WL ENDLAD 
+LDG+L + TFDDSQ++AI LGLN Sbjct: 483 GLQKKNPSIAVVATLFGDKEDVEWLEENDLADWRPAELDGLLQNGTFDDSQQRAITLGLN 542 Query: 1841 KKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVR 2020 KKRP++++QGPPGTGKTG+LKE+IA A QQGERVLVTAPTNAAVDN+VEKLS+ GLNIVR Sbjct: 543 KKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTGLNIVR 602 Query: 2021 VGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXX 2200 VGNPARIS VASKSL EIV SKLA + AEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 603 VGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGIRQLLK 662 Query: 2201 XXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWI 2380 S+AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWI Sbjct: 663 QLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWI 722 Query: 2381 PILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDA 2560 PILQGKRCILAGDQ QLAPVILSRKALEG LGVSLLERAATLHEG LAT L TQYRMNDA Sbjct: 723 PILQGKRCILAGDQWQLAPVILSRKALEGGLGVSLLERAATLHEGVLATMLATQYRMNDA 782 Query: 2561 IASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEH 2740 IASWASKEMY G L SS VA+HLLVD+PFVKPTWIT+CPLLLLDTRM YGSLS+GCEEH Sbjct: 783 IASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITKCPLLLLDTRMPYGSLSVGCEEH 842 Query: 2741 LDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAG 2920 LD AGTGSF+NEGEA+IVVQHV LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE PEA G Sbjct: 843 LDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFPEADG 902 Query: 2921 VEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTIC 3100 +EVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTIC Sbjct: 903 IEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTIC 962 Query: 3101 HNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 HNTFLARLLRHIRY GRVKHAEPG+FG SGL M+PMLPSI Sbjct: 963 HNTFLARLLRHIRYVGRVKHAEPGAFGGSGLGMDPMLPSI 1002 >OAY44532.1 hypothetical protein MANES_08G158300 [Manihot esculenta] Length = 981 Score = 1431 bits (3705), Expect = 0.0 Identities = 729/860 (84%), Positives = 775/860 (90%), Gaps = 2/860 (0%) Frame = +2 Query: 647 
NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMG--PGLTF 820 NVRAL+QNGDPLGRR+LGKSVV+WI QGMRAMA+DFASAE QGEFSEL Q MG GLTF Sbjct: 121 NVRALNQNGDPLGRRDLGKSVVKWISQGMRAMATDFASAETQGEFSELRQRMGLEAGLTF 180 Query: 821 VIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETES 1000 VI AQPY+NA+P+P+GLEA+CLKACTHYPTLFDHFQRELRDVLQELQ K L+Q+W +TES Sbjct: 181 VIQAQPYINAVPIPLGLEALCLKACTHYPTLFDHFQRELRDVLQELQRKGLIQNWQQTES 240 Query: 1001 WKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERD 1180 WKLLKELANS HRA+ RKV+Q +P+QGVLGMDLE+ K IQGRIDEFTK+MSELLRIERD Sbjct: 241 WKLLKELANSVQHRAVARKVSQARPLQGVLGMDLEKAKAIQGRIDEFTKKMSELLRIERD 300 Query: 1181 AELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHL 1360 AELEFTQEELNAVPT DE+SD+SKPIEFLVSHGQA QELCDTICNL+A STSTGLGGMHL Sbjct: 301 AELEFTQEELNAVPTRDESSDASKPIEFLVSHGQAQQELCDTICNLYADSTSTGLGGMHL 360 Query: 1361 VLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRH 1540 V+FRVEGNHRLPPTTLSPGDMVCVRICDSRGA ATSCIQGFV+NLGEDGCSISVALESRH Sbjct: 361 VVFRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATSCIQGFVNNLGEDGCSISVALESRH 420 Query: 1541 GDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKE 1720 GDPTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+GL K+NPSIA V TLFGDK Sbjct: 421 GDPTFSKLFGKSVRIDRIYGLADALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKR 480 Query: 1721 DVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLL 1900 DV WL EN LAD E +DG L S FDDSQ+KAIA GLNKKRPLLIIQGPPGTGK+GLL Sbjct: 481 DVTWLEENHLADWHEADMDGSLESTMFDDSQQKAIARGLNKKRPLLIIQGPPGTGKSGLL 540 Query: 1901 KELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIV 2080 KE+I RAV QGERVLVTAPTNAAVDNMVEKLS+IGL+IVRVGNPARIS VASKSL EIV Sbjct: 541 KEIIVRAVHQGERVLVTAPTNAAVDNMVEKLSNIGLDIVRVGNPARISSTVASKSLSEIV 600 Query: 2081 YSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSS 2260 SKLA+F EFERKKSDLRKDL CLKDDSLAAGIR SS Sbjct: 601 NSKLATFRMEFERKKSDLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKETMKEVLSS 660 Query: 2261 AQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPV 2440 AQVVL 
TNTGAA+PLIRRLDTFDLVVIDEAGQAIE SCWIPILQG+RCILAGDQCQLAPV Sbjct: 661 AQVVLATNTGAAEPLIRRLDTFDLVVIDEAGQAIEPSCWIPILQGRRCILAGDQCQLAPV 720 Query: 2441 ILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTV 2620 ILSRKALEG LGVSLLERAATLHEG LATKLTTQYRMNDAIASWASKEMYGG L SSS V Sbjct: 721 ILSRKALEGGLGVSLLERAATLHEGVLATKLTTQYRMNDAIASWASKEMYGGLLKSSSKV 780 Query: 2621 AAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQ 2800 A+HLLVD+ FVKPTWITQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEAEIVV+ Sbjct: 781 ASHLLVDSAFVKPTWITQCPLLLLDTRMTYGSLSVGCEEHLDPAGTGSFYNEGEAEIVVE 840 Query: 2801 HVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 2980 HVFSLIY+GV P++IAVQSPYVAQVQLLR+RLDELPEAAG+EVATIDSFQGREADAVIIS Sbjct: 841 HVFSLIYSGVRPTSIAVQSPYVAQVQLLRERLDELPEAAGIEVATIDSFQGREADAVIIS 900 Query: 2981 MVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 3160 MVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH Sbjct: 901 MVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 960 Query: 3161 AEPGSFGESGLDMNPMLPSI 3220 AEPGSFG SGL M+PMLPSI Sbjct: 961 AEPGSFGGSGLGMDPMLPSI 980 >XP_017627332.1 PREDICTED: DNA-binding protein SMUBP-2 [Gossypium arboreum] Length = 1003 Score = 1423 bits (3684), Expect = 0.0 Identities = 755/1000 (75%), Positives = 811/1000 (81%), Gaps = 5/1000 (0%) Frame = +2 Query: 236 FCGSGIVPVTARKSLALNVRRFNSSVLHAAPLQFSFCSSFRSVCLFIGYKSSSLFAFYQP 415 FCG+ +P T K+LAL VR+ SS L + P S SS +S+CLF+G + S +Q Sbjct: 8 FCGN--IPSTTTKALALTVRK--SSFLSSLPFSSS-PSSLKSICLFVGRRYSFPSTKFQS 62 Query: 416 QQFVCYXXXXXXXXXXXXXXXXXX---PRRKSSVFSKSRIQXXXXXXXXXXXXXVNVSR- 583 +Q VC PR KS + S +I V + Sbjct: 63 KQLVCNGGGESSGSHGSSKFATTTKKKPRSKSYIGSNPKISKSENKSTSKPNDSVTRTNI 122 Query: 584 -VXXXXXXXXXXXXXXXXXXXXNVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFAS 760 V NVR L QNGDPLGRR+LGK VV+WI +GM+AMASDFAS Sbjct: 123 LVEELGLFKKQKEQKVQKTKALNVRTLYQNGDPLGRRDLGKRVVKWISEGMKAMASDFAS 182 Query: 761 AEVQGEFSELGQLMGPGLTFVIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELR 940 AE+QGEF EL Q MGPGLTFVI 
AQPYLN+IP+P+GLEAICLKACTHYPTLFDHFQRELR Sbjct: 183 AELQGEFLELRQRMGPGLTFVIQAQPYLNSIPIPLGLEAICLKACTHYPTLFDHFQRELR 242 Query: 941 DVLQELQHKSLVQDWHETESWKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTI 1120 +VLQELQ S+VQDW ETESWKLLKELANSA HRAI RKVT PKPVQGVLGMDLE+ KT+ Sbjct: 243 NVLQELQQNSMVQDWKETESWKLLKELANSAQHRAIARKVTPPKPVQGVLGMDLEKAKTM 302 Query: 1121 QGRIDEFTKQMSELLRIERDAELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELC 1300 QGRIDEFTKQMSELLRIERDAELEFTQEEL+AVPT DE SDSSKPIEFLVSHGQA QELC Sbjct: 303 QGRIDEFTKQMSELLRIERDAELEFTQEELDAVPTLDEGSDSSKPIEFLVSHGQAQQELC 362 Query: 1301 DTICNLFAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQG 1480 DTICNL AVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRI DSRGA ATSCIQG Sbjct: 363 DTICNLNAVSTSTGLGGMHLVLFRVEGNHRLPPTTLSPGDMVCVRISDSRGAGATSCIQG 422 Query: 1481 FVHNLGEDGCSISVALESRHGDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKH 1660 FV NLG+DGCSISVALESRHGDPTFS LF + V I RI GLAD LTYERNCEALMLLQK+ Sbjct: 423 FVDNLGDDGCSISVALESRHGDPTFSNLFVKIVLIYRIHGLADALTYERNCEALMLLQKN 482 Query: 1661 GLHKRNPSIAAVVTLFGDKEDVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLN 1840 GL K+NPSIA V TLFGDKEDV WL ENDLAD +LDG+L + TFDDSQ++AI LGLN Sbjct: 483 GLQKKNPSIAVVATLFGDKEDVEWLEENDLADWRPAELDGLLQNGTFDDSQQRAITLGLN 542 Query: 1841 KKRPLLIIQGPPGTGKTGLLKELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVR 2020 KKRP++++QGPPGTGKTG+LKE+IA A QQGERVLVTAPTNAAVDN+VEKLS+ GLNIVR Sbjct: 543 KKRPVMVVQGPPGTGKTGMLKEVIALAAQQGERVLVTAPTNAAVDNLVEKLSNTGLNIVR 602 Query: 2021 VGNPARISPVVASKSLDEIVYSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXX 2200 VGNPARIS VASKSL EIV SKLA + AEFERKKSDLRKDL CLKDDSLAAGIR Sbjct: 603 VGNPARISSAVASKSLVEIVNSKLADYRAEFERKKSDLRKDLRHCLKDDSLAAGIRQLLK 662 Query: 2201 XXXXXXXXXXXXXXXXXXSSAQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWI 2380 S+AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWI Sbjct: 663 QLGKALKKKEKETVREVLSNAQVVLSTNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWI 722 Query: 2381 PILQGKRCILAGDQCQLAPVILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDA 2560 PILQGKRCILAGDQ QLAPVILSRKALEG LGVSLLERAATLHEG LAT L TQYRMNDA Sbjct: 723 
PILQGKRCILAGDQWQLAPVILSRKALEGGLGVSLLERAATLHEGVLATMLATQYRMNDA 782 Query: 2561 IASWASKEMYGGSLVSSSTVAAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEH 2740 IASWASKEMY G L SS VA+HLLVD+PFVKPTWIT+CPLLLLDTRM YGSLS+GCEEH Sbjct: 783 IASWASKEMYDGELKSSPLVASHLLVDSPFVKPTWITKCPLLLLDTRMPYGSLSVGCEEH 842 Query: 2741 LDPAGTGSFYNEGEAEIVVQHVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAG 2920 LD AGTGSF+NEGEA+IVVQHV LIYAGVSP+AIAVQSPYVAQVQLLRDRLDE PEA G Sbjct: 843 LDLAGTGSFFNEGEADIVVQHVLYLIYAGVSPTAIAVQSPYVAQVQLLRDRLDEFPEADG 902 Query: 2921 VEVATIDSFQGREADAVIISMVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTIC 3100 +EVATIDSFQGREADAVIISMVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTIC Sbjct: 903 IEVATIDSFQGREADAVIISMVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTIC 962 Query: 3101 HNTFLARLLRHIRYFGRVKHAEPGSFGESGLDMNPMLPSI 3220 HNTFLARLLRHIRY GRVKHAEPG+FG SGL M+PMLPSI Sbjct: 963 HNTFLARLLRHIRYVGRVKHAEPGAFGGSGLGMDPMLPSI 1002 >XP_011009226.1 PREDICTED: DNA-binding protein SMUBP-2 isoform X1 [Populus euphratica] Length = 983 Score = 1419 bits (3672), Expect = 0.0 Identities = 723/858 (84%), Positives = 768/858 (89%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVI 826 +V L +NGDPLGR++LGKSVV+WI Q MRAMA +FASAE QGEF+EL Q MGPGLTFV+ Sbjct: 126 SVCTLKENGDPLGRKDLGKSVVKWISQAMRAMAREFASAEAQGEFTELRQRMGPGLTFVM 185 Query: 827 AAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETESWK 1006 AQPYLNA+PMP+GLEAICLKACTHYPTLFDHFQRELR+VLQ+L+ K LVQDW +TESWK Sbjct: 186 QAQPYLNAVPMPLGLEAICLKACTHYPTLFDHFQRELREVLQDLKRKGLVQDWQQTESWK 245 Query: 1007 LLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERDAE 1186 LLKELANSA HRAI RK TQ KP+QGVLGMDLE+ K IQGRI+EFT QMSELLRIERDAE Sbjct: 246 LLKELANSAQHRAIARKATQSKPLQGVLGMDLEKAKAIQGRINEFTNQMSELLRIERDAE 305 Query: 1187 LEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHLVL 1366 LEFTQEELNAVPT DE+SDSSKPIEFLVSHGQ QELCDTICNL+AVSTSTGLGGMHLVL Sbjct: 306 LEFTQEELNAVPTLDESSDSSKPIEFLVSHGQGQQELCDTICNLYAVSTSTGLGGMHLVL 365 Query: 1367 FRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRHGD 1546 
FRVEGNHRLPPTTLSPG+MVCVRICDSRGA ATSC+QGFV+NLGEDGCSISVALESRHGD Sbjct: 366 FRVEGNHRLPPTTLSPGEMVCVRICDSRGAGATSCLQGFVNNLGEDGCSISVALESRHGD 425 Query: 1547 PTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKEDV 1726 PTFSKL G++VRIDRI GLAD +TYERNCEALMLLQK GLHK+NPSIA V TLFGDKEDV Sbjct: 426 PTFSKLSGKSVRIDRIHGLADAVTYERNCEALMLLQKKGLHKKNPSIAVVATLFGDKEDV 485 Query: 1727 AWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLLKE 1906 AWL ENDLA E LD LG K FDDSQ++AI LGLNKKRP LIIQGPPGTGK+GLLKE Sbjct: 486 AWLEENDLASWDEADLDEHLG-KPFDDSQRRAITLGLNKKRPFLIIQGPPGTGKSGLLKE 544 Query: 1907 LIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIVYS 2086 LIA AV +GERVLVTAPTNAAVDNMVEKLS+IGLNIVRVGNPARIS VASKSL +IV S Sbjct: 545 LIALAVGKGERVLVTAPTNAAVDNMVEKLSNIGLNIVRVGNPARISSAVASKSLGDIVNS 604 Query: 2087 KLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQ 2266 KLA+F EFERKKSDLRKDLS CLKDDSLAAGIR SSAQ Sbjct: 605 KLAAFRTEFERKKSDLRKDLSHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVREVLSSAQ 664 Query: 2267 VVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPVIL 2446 VVL TNTGAADPLIRRLD FDLVV+DEAGQAIE SCWIPILQGKRCILAGDQCQLAPVIL Sbjct: 665 VVLATNTGAADPLIRRLDAFDLVVMDEAGQAIEPSCWIPILQGKRCILAGDQCQLAPVIL 724 Query: 2447 SRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTVAA 2626 SRKALEG LGVSLLERA+TLHEG LATKLTTQYRMNDAIASWASKEMY G L SSSTVA+ Sbjct: 725 SRKALEGGLGVSLLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYSGLLKSSSTVAS 784 Query: 2627 HLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQHV 2806 HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEA+IVVQHV Sbjct: 785 HLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIVVQHV 844 Query: 2807 FSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 2986 SLI++GV P+AIAVQSPYVAQVQLLR+RLDELPEA GVE+ATIDSFQGREADAVIISMV Sbjct: 845 SSLIFSGVRPTAIAVQSPYVAQVQLLRERLDELPEADGVEIATIDSFQGREADAVIISMV 904 Query: 2987 RSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 3166 RSNTL AVGFLGDS+R NVAITRA KHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE Sbjct: 905 
RSNTLGAVGFLGDSKRTNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 964 Query: 3167 PGSFGESGLDMNPMLPSI 3220 PGSFG SG DMNPMLPSI Sbjct: 965 PGSFGGSGFDMNPMLPSI 982 >XP_012070287.1 PREDICTED: DNA-binding protein SMUBP-2 [Jatropha curcas] KDP39578.1 hypothetical protein JCGZ_02598 [Jatropha curcas] Length = 981 Score = 1416 bits (3666), Expect = 0.0 Identities = 720/860 (83%), Positives = 770/860 (89%), Gaps = 2/860 (0%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMG--PGLTF 820 NV++L QNGDPLGRR+LGK+VV+WI QGMRAMA+DFA+AE QGEF EL Q MG GLTF Sbjct: 121 NVKSLHQNGDPLGRRDLGKNVVKWISQGMRAMANDFAAAETQGEFLELRQRMGLEAGLTF 180 Query: 821 VIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETES 1000 VI AQPY+NA+P+P+GLEA+CLKAC HYPTLFDHFQRELR VLQ+LQ K LVQDW +TES Sbjct: 181 VIQAQPYINAVPIPLGLEALCLKACAHYPTLFDHFQRELRAVLQDLQSKGLVQDWRKTES 240 Query: 1001 WKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERD 1180 WKLLKELANS HRA+ RKV+QPKP+QGVLGM LE+ K IQGRIDEFTK MSELLRIERD Sbjct: 241 WKLLKELANSVQHRAVARKVSQPKPLQGVLGMKLEKAKAIQGRIDEFTKSMSELLRIERD 300 Query: 1181 AELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHL 1360 AELEFTQEELNAVPTPDE+S+SSKPIEFLVSHGQA QELCDTICNL+AVSTSTGLGGMHL Sbjct: 301 AELEFTQEELNAVPTPDESSNSSKPIEFLVSHGQAQQELCDTICNLYAVSTSTGLGGMHL 360 Query: 1361 VLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRH 1540 VLFRVEGNHRLPPTTLSPGDMVCVR CDSRGA ATSC+QGFV+NLGEDGCSI +ALESRH Sbjct: 361 VLFRVEGNHRLPPTTLSPGDMVCVRTCDSRGAGATSCMQGFVNNLGEDGCSICLALESRH 420 Query: 1541 GDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKE 1720 GD TFSKLFG++VRIDRIQGLAD LTYERNCEALMLLQK+GL K+NPSIA V TLFGDKE Sbjct: 421 GDSTFSKLFGKSVRIDRIQGLADALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKE 480 Query: 1721 DVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLL 1900 +VAWL EN LA+ +E +DG GS FD++Q++A+ALGLNKKRPLLIIQGPPGTGK+GLL Sbjct: 481 EVAWLEENHLAEWAETDVDGSSGSLMFDEAQQRALALGLNKKRPLLIIQGPPGTGKSGLL 540 Query: 1901 KELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIV 2080 
KELI RAV QGERVLVTAPTNAAVDNMVEKLS IGL+IVRVGNPARIS VASKSL EIV Sbjct: 541 KELIVRAVDQGERVLVTAPTNAAVDNMVEKLSTIGLDIVRVGNPARISSAVASKSLSEIV 600 Query: 2081 YSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSS 2260 SK+A+F EFERKKSDLRKDL CLKDDSLA+GIR SS Sbjct: 601 NSKMATFCMEFERKKSDLRKDLRHCLKDDSLASGIRQLLKQLGKSLKKKEKETVKEVLSS 660 Query: 2261 AQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPV 2440 AQVVL TNTGAADPLIRRLD FDLVVIDEAGQAIE SCWIPILQGKRCILAGDQCQLAPV Sbjct: 661 AQVVLATNTGAADPLIRRLDKFDLVVIDEAGQAIEPSCWIPILQGKRCILAGDQCQLAPV 720 Query: 2441 ILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTV 2620 ILSRKA EG LG+SLLERAA+LHEG LATKLTTQYRMNDAIASWASKEMYGG L SSS V Sbjct: 721 ILSRKASEGGLGISLLERAASLHEGILATKLTTQYRMNDAIASWASKEMYGGLLRSSSEV 780 Query: 2621 AAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQ 2800 A+HLLVD+PFVKPTW+TQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEAEIVVQ Sbjct: 781 ASHLLVDSPFVKPTWLTQCPLLLLDTRMPYGSLSIGCEEHLDPAGTGSFYNEGEAEIVVQ 840 Query: 2801 HVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 2980 HV SLIYAGV P+ IAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS Sbjct: 841 HVISLIYAGVRPTTIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 900 Query: 2981 MVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 3160 MVRSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH Sbjct: 901 MVRSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 960 Query: 3161 AEPGSFGESGLDMNPMLPSI 3220 AEPGSFG SGL M+PMLPSI Sbjct: 961 AEPGSFGGSGLGMDPMLPSI 980 >XP_002319231.2 hypothetical protein POPTR_0013s07150g [Populus trichocarpa] EEE95154.2 hypothetical protein POPTR_0013s07150g [Populus trichocarpa] Length = 983 Score = 1415 bits (3663), Expect = 0.0 Identities = 723/858 (84%), Positives = 766/858 (89%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVI 826 +V L +NGDPLGR++LGKSVV+WI Q MRAMA +FASAE QGEF+EL Q MGPGLTFVI Sbjct: 126 SVCTLKENGDPLGRKDLGKSVVKWISQAMRAMAREFASAEAQGEFTELRQRMGPGLTFVI 185 Query: 827 
AAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETESWK 1006 AQPYLNA+PMP+GLEAICLKACTHYPTLFDHFQRELR+VLQ+L+ K LVQDW +TESWK Sbjct: 186 QAQPYLNAVPMPLGLEAICLKACTHYPTLFDHFQRELREVLQDLKRKGLVQDWQKTESWK 245 Query: 1007 LLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERDAE 1186 LLKELANSA HRAI RK TQ KP+QGVLGM+LE+ K IQGRI+EFT QMSELLRIERDAE Sbjct: 246 LLKELANSAQHRAIARKATQSKPLQGVLGMNLEKAKAIQGRINEFTNQMSELLRIERDAE 305 Query: 1187 LEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHLVL 1366 LEFTQEELNAVPT DE+SDSSKPIEFLVSHGQ QELCDTICNL+AVSTSTGLGGMHLVL Sbjct: 306 LEFTQEELNAVPTLDESSDSSKPIEFLVSHGQGQQELCDTICNLYAVSTSTGLGGMHLVL 365 Query: 1367 FRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRHGD 1546 FRVEGNHRLPPTTLSPGDMVCVRICDSRGA ATS +QGFV+NLGEDGCSISVALESRHGD Sbjct: 366 FRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATSSLQGFVNNLGEDGCSISVALESRHGD 425 Query: 1547 PTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKEDV 1726 PTFSKL G++VRIDRI GLAD +TYERNCEALMLLQK GLHK+NPSIA V TLFGDKEDV Sbjct: 426 PTFSKLSGKSVRIDRIHGLADAVTYERNCEALMLLQKKGLHKKNPSIAVVATLFGDKEDV 485 Query: 1727 AWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLLKE 1906 AWL ENDLA E D LG K FDDSQ++AI LGLNKKRP LIIQGPPGTGK+GLLKE Sbjct: 486 AWLEENDLASWDEADFDEHLG-KPFDDSQRRAITLGLNKKRPFLIIQGPPGTGKSGLLKE 544 Query: 1907 LIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIVYS 2086 LIA AV +GERVLVTAPTNAAVDNMVEKLS+IGLNIVRVGNPARIS VASKSL +IV S Sbjct: 545 LIALAVGKGERVLVTAPTNAAVDNMVEKLSNIGLNIVRVGNPARISSAVASKSLGDIVNS 604 Query: 2087 KLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQ 2266 KLA+F EFERKKSDLRKDLS CLKDDSLAAGIR SSAQ Sbjct: 605 KLAAFRTEFERKKSDLRKDLSHCLKDDSLAAGIRQLLKQLGKTLKKKEKETVREVLSSAQ 664 Query: 2267 VVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPVIL 2446 VVL TNTGAADPLIRRLD FDLVV+DEAGQAIE SCWIPILQGKRCILAGDQCQLAPVIL Sbjct: 665 VVLATNTGAADPLIRRLDAFDLVVMDEAGQAIEPSCWIPILQGKRCILAGDQCQLAPVIL 724 Query: 2447 SRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTVAA 2626 SRKALEG LGVSLLERA+TLHEG 
LATKLTTQYRMNDAIASWASKEMY G L SSSTVA+ Sbjct: 725 SRKALEGGLGVSLLERASTLHEGVLATKLTTQYRMNDAIASWASKEMYSGLLKSSSTVAS 784 Query: 2627 HLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQHV 2806 HLLVDTPFVKPTWITQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEA+IVVQHV Sbjct: 785 HLLVDTPFVKPTWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIVVQHV 844 Query: 2807 FSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 2986 SLI++GV P+AIAVQSPYVAQVQLLR+RLDELPEA GVE+ATIDSFQGREADAVIISMV Sbjct: 845 SSLIFSGVRPTAIAVQSPYVAQVQLLRERLDELPEADGVEIATIDSFQGREADAVIISMV 904 Query: 2987 RSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 3166 RSNTL AVGFLGDS+R NVAITRA KHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE Sbjct: 905 RSNTLGAVGFLGDSKRTNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 964 Query: 3167 PGSFGESGLDMNPMLPSI 3220 PGSFG SG DMNPMLPSI Sbjct: 965 PGSFGGSGFDMNPMLPSI 982 >XP_002264216.1 PREDICTED: DNA-binding protein SMUBP-2 [Vitis vinifera] Length = 953 Score = 1409 bits (3648), Expect = 0.0 Identities = 720/858 (83%), Positives = 767/858 (89%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVI 826 +VR L QNGDPLGRREL + VVRWI QGMR MA DFASAE+QGEF+EL Q MGPGL+FVI Sbjct: 95 SVRTLYQNGDPLGRRELRRCVVRWISQGMRGMALDFASAELQGEFAELRQRMGPGLSFVI 154 Query: 827 AAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETESWK 1006 AQPYLNAIPMP+G EAICLKACTHYPTLFDHFQRELRDVLQ+ Q KS QDW ET+SW+ Sbjct: 155 QAQPYLNAIPMPLGHEAICLKACTHYPTLFDHFQRELRDVLQDHQRKSQFQDWRETQSWQ 214 Query: 1007 LLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERDAE 1186 LLKELANSA HRAI RKV+QPKP++GVLGM+L++ K IQ RIDEFTK+MSELL+IERD+E Sbjct: 215 LLKELANSAQHRAISRKVSQPKPLKGVLGMELDKAKAIQSRIDEFTKRMSELLQIERDSE 274 Query: 1187 LEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHLVL 1366 LEFTQEELNAVPTPDE+SDSSKPIEFLVSHGQA QELCDTICNL AVST GLGGMHLVL Sbjct: 275 LEFTQEELNAVPTPDESSDSSKPIEFLVSHGQAQQELCDTICNLNAVSTFIGLGGMHLVL 334 Query: 1367 FRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRHGD 1546 F+VEGNHRLPPTTLSPGDMVCVRICDSRGA 
ATSC+QGFV +LG+DGCSISVALESRHGD Sbjct: 335 FKVEGNHRLPPTTLSPGDMVCVRICDSRGAGATSCMQGFVDSLGKDGCSISVALESRHGD 394 Query: 1547 PTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKEDV 1726 PTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+GL K+NPSIA V TLFGDKEDV Sbjct: 395 PTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKNGLQKKNPSIAVVATLFGDKEDV 454 Query: 1727 AWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLLKE 1906 AWL ENDL D +EV LD +L S +DDSQ++AIALGLNKKRP+LIIQGPPGTGKT LLKE Sbjct: 455 AWLEENDLVDWAEVGLDELLESGAYDDSQRRAIALGLNKKRPILIIQGPPGTGKTVLLKE 514 Query: 1907 LIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIVYS 2086 LIA AVQQGERVLVTAPTNAAVDNMVEKLS+IG+NIVRVGNPARIS VASKSL EIV S Sbjct: 515 LIALAVQQGERVLVTAPTNAAVDNMVEKLSNIGVNIVRVGNPARISSAVASKSLGEIVNS 574 Query: 2087 KLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQ 2266 KL +FL EFERKKSDLRKDL CLKDDSLAAGIR SSAQ Sbjct: 575 KLENFLTEFERKKSDLRKDLRHCLKDDSLAAGIRQLLKQLGKALKKKEKETVKEVLSSAQ 634 Query: 2267 VVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPVIL 2446 VVL TNTGAADP+IRRLD FDLV+IDEAGQAIE SCWIPILQGKRCI+AGDQCQLAPVIL Sbjct: 635 VVLATNTGAADPVIRRLDAFDLVIIDEAGQAIEPSCWIPILQGKRCIIAGDQCQLAPVIL 694 Query: 2447 SRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTVAA 2626 SRKALEG LGVSLLERAATLHE LATKLTTQYRMNDAIASWASKEMYGGSL SSS+V + Sbjct: 695 SRKALEGGLGVSLLERAATLHEEVLATKLTTQYRMNDAIASWASKEMYGGSLKSSSSVFS 754 Query: 2627 HLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQHV 2806 HLLVD+PFVKP WITQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEA+IVVQHV Sbjct: 755 HLLVDSPFVKPAWITQCPLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIVVQHV 814 Query: 2807 FSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 2986 SLI AGVSP+AIAVQSPYVAQVQLLRDRLDE+PEA GVEVATIDSFQGREADAVIISMV Sbjct: 815 LSLISAGVSPTAIAVQSPYVAQVQLLRDRLDEIPEAVGVEVATIDSFQGREADAVIISMV 874 Query: 2987 RSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 3166 RSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTICHNTFLARLLRHIRY GRVKHAE Sbjct: 875 
RSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYIGRVKHAE 934 Query: 3167 PGSFGESGLDMNPMLPSI 3220 PG+FG SGL MNPMLP I Sbjct: 935 PGTFGGSGLGMNPMLPFI 952 >XP_002524012.1 PREDICTED: DNA-binding protein SMUBP-2 [Ricinus communis] EEF38380.1 DNA-binding protein smubp-2, putative [Ricinus communis] Length = 989 Score = 1405 bits (3637), Expect = 0.0 Identities = 714/860 (83%), Positives = 764/860 (88%), Gaps = 2/860 (0%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMG--PGLTF 820 NV++L QNGDPLG+++LGK+VV+WI QGMRAMA+DFASAE QGEF EL Q M GLTF Sbjct: 129 NVKSLHQNGDPLGKKDLGKTVVKWISQGMRAMAADFASAETQGEFLELRQRMDLEAGLTF 188 Query: 821 VIAAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETES 1000 VI AQPY+NA+P+P+G EA+CLKAC HYPTLFDHFQRELRDVLQ+LQ K LVQDW TES Sbjct: 189 VIQAQPYINAVPIPLGFEALCLKACIHYPTLFDHFQRELRDVLQDLQRKGLVQDWQNTES 248 Query: 1001 WKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERD 1180 WKLLKELANS HRA+ RKV++PKP+QGVLGM+L++ K IQ RIDEFTK MSELL+IERD Sbjct: 249 WKLLKELANSVQHRAVARKVSKPKPLQGVLGMNLDKAKAIQSRIDEFTKTMSELLQIERD 308 Query: 1181 AELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHL 1360 +ELEFTQEELNAVPTPDENSD SKPIEFLVSHGQA QELCDTICNL AVSTSTGLGGMHL Sbjct: 309 SELEFTQEELNAVPTPDENSDPSKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHL 368 Query: 1361 VLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRH 1540 VLFRVEGNHRLPPT LSPGDMVCVRICDSRGA ATSC+QGFV+NLGEDGCSISVALESRH Sbjct: 369 VLFRVEGNHRLPPTNLSPGDMVCVRICDSRGAGATSCMQGFVNNLGEDGCSISVALESRH 428 Query: 1541 GDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKE 1720 GDPTFSKLFG+ VRIDRI GLAD LTYERNCEALMLLQK+GL K+NPSIA V TLFGD E Sbjct: 429 GDPTFSKLFGKGVRIDRIHGLADALTYERNCEALMLLQKNGLQKKNPSIAIVATLFGDSE 488 Query: 1721 DVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLL 1900 D+AWL E DLA+ +E +DG GS+ FDDSQ++A+ALGLN+KRPLLIIQGPPGTGK+GLL Sbjct: 489 DLAWLEEKDLAEWNEADMDGCFGSERFDDSQRRAMALGLNQKRPLLIIQGPPGTGKSGLL 548 Query: 1901 KELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIV 
2080 KELI RAV QGERVLVTAPTNAAVDNMVEKLS+IGL+IVRVGNPARIS VASKSL EIV Sbjct: 549 KELIVRAVHQGERVLVTAPTNAAVDNMVEKLSNIGLDIVRVGNPARISSAVASKSLSEIV 608 Query: 2081 YSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSS 2260 SKLA+F EFERKKSDLRKDL CL+DDSLAAGIR SS Sbjct: 609 NSKLATFRMEFERKKSDLRKDLRHCLEDDSLAAGIRQLLKQLGKTMKKKEKESVKEVLSS 668 Query: 2261 AQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPV 2440 AQVVL TNTGAADPLIRRLDTFDLVVIDEAGQAIE SCWIPILQGKRCILAGDQCQLAPV Sbjct: 669 AQVVLATNTGAADPLIRRLDTFDLVVIDEAGQAIEPSCWIPILQGKRCILAGDQCQLAPV 728 Query: 2441 ILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTV 2620 ILSRKALEG LGVSLLERAATLH+G LA +LTTQYRMNDAIASWASKEMYGG L SSS V Sbjct: 729 ILSRKALEGGLGVSLLERAATLHDGVLALQLTTQYRMNDAIASWASKEMYGGLLKSSSKV 788 Query: 2621 AAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQ 2800 A+HLLV +PFVKPTWITQCPLLLLDTRM YGSL +GCEEHLDPAGTGSFYNEGEAEIVVQ Sbjct: 789 ASHLLVHSPFVKPTWITQCPLLLLDTRMPYGSLFIGCEEHLDPAGTGSFYNEGEAEIVVQ 848 Query: 2801 HVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 2980 HV SLIYAGV P+ IAVQSPYVAQVQLLRDRLDELPEA GVEVATIDSFQGREADAVIIS Sbjct: 849 HVISLIYAGVRPTTIAVQSPYVAQVQLLRDRLDELPEADGVEVATIDSFQGREADAVIIS 908 Query: 2981 MVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 3160 MVRSN L AVGFLGDSRRMNVAITRA +HVAVVCDSSTICHNTFLARLLRHIRYFGRVKH Sbjct: 909 MVRSNNLGAVGFLGDSRRMNVAITRARRHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 968 Query: 3161 AEPGSFGESGLDMNPMLPSI 3220 AEPGSFG SGL M+PMLPSI Sbjct: 969 AEPGSFGGSGLGMDPMLPSI 988 >XP_018828127.1 PREDICTED: DNA-binding protein SMUBP-2 [Juglans regia] Length = 957 Score = 1402 bits (3628), Expect = 0.0 Identities = 712/858 (82%), Positives = 766/858 (89%), Gaps = 1/858 (0%) Frame = +2 Query: 650 VRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVIA 829 VR L++NGDPLGRR+LGKSVVRWI QGM+AMA+DFA E+QGEFSEL Q MGPGLTFVI Sbjct: 99 VRGLNENGDPLGRRDLGKSVVRWIRQGMKAMATDFALTEMQGEFSELRQRMGPGLTFVIE 158 Query: 830 AQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETESWKL 1009 
AQPYL AIPMP+GLEA+CLKACTHYPTLFDHFQRELRDVLQ+LQ+KSLV W+ETESWKL Sbjct: 159 AQPYLTAIPMPLGLEALCLKACTHYPTLFDHFQRELRDVLQDLQNKSLVHSWYETESWKL 218 Query: 1010 LKELANSAHHRAIVRKVTQPKP-VQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERDAE 1186 LKELANS HRA+ RKV QPK ++GVLG++LE+ K IQ RIDEFTK+MSELLRIERDAE Sbjct: 219 LKELANSVQHRAVARKVLQPKKYLKGVLGIELEKVKAIQSRIDEFTKRMSELLRIERDAE 278 Query: 1187 LEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHLVL 1366 LEFTQEEL+AVPTPDENSD+SKPIEFLVSHGQA QELCDTICNL AVSTSTGLGGMHLVL Sbjct: 279 LEFTQEELDAVPTPDENSDASKPIEFLVSHGQAQQELCDTICNLNAVSTSTGLGGMHLVL 338 Query: 1367 FRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRHGD 1546 FRVEGNHRLPPTTLSPGDMVCVRICDSRGA ATSC+QGFV+NLGEDGCSI VALESRHGD Sbjct: 339 FRVEGNHRLPPTTLSPGDMVCVRICDSRGAGATSCMQGFVNNLGEDGCSIIVALESRHGD 398 Query: 1547 PTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKEDV 1726 PTFSKLFG++VRIDRI GLAD LTYERNCEALMLLQK+GL K+NPSIA TLFGD+ D+ Sbjct: 399 PTFSKLFGKSVRIDRIHGLADALTYERNCEALMLLQKNGLQKKNPSIAVAATLFGDEGDI 458 Query: 1727 AWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLLKE 1906 AWL EN+L D +E + DGML + +DDSQ++AIALGLNKKRP+LIIQGPPGTGKTGLLKE Sbjct: 459 AWLEENNLIDWAEEEFDGMLRTGAYDDSQRRAIALGLNKKRPVLIIQGPPGTGKTGLLKE 518 Query: 1907 LIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIVYS 2086 +IA AV QGERVLVTAPTNAAVDNMVEKLS+IGL IVRVGNPARIS VASKSL +IV S Sbjct: 519 IIALAVAQGERVLVTAPTNAAVDNMVEKLSNIGLEIVRVGNPARISKTVASKSLGKIVNS 578 Query: 2087 KLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQ 2266 KL +F EFERKKSDLR+DL CL+DDSLAAGIR SSA+ Sbjct: 579 KLVNFRMEFERKKSDLRRDLRHCLRDDSLAAGIRQLLKQLGKSLKKKEKETVKEVLSSAK 638 Query: 2267 VVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPVIL 2446 VVL TNTGAADPLIRRLD+FDLVVIDEA QAIE SCWI ILQGKRCILAGDQCQLAPVIL Sbjct: 639 VVLATNTGAADPLIRRLDSFDLVVIDEAAQAIEPSCWIAILQGKRCILAGDQCQLAPVIL 698 Query: 2447 SRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTVAA 2626 SRKALEG LGVSLLERAATLH+G LATKLTTQYRMNDAI+SWASKEMYGGSL SS TV++ Sbjct: 699 
SRKALEGGLGVSLLERAATLHDGILATKLTTQYRMNDAISSWASKEMYGGSLKSSLTVSS 758 Query: 2627 HLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQHV 2806 HLLVD PFVKPTWITQCPLLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEA+IVVQHV Sbjct: 759 HLLVDAPFVKPTWITQCPLLLLDTRMTYGSLSVGCEEHLDPAGTGSFYNEGEADIVVQHV 818 Query: 2807 FSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 2986 FSLIY+GVSP+AI VQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV Sbjct: 819 FSLIYSGVSPAAIVVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 878 Query: 2987 RSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 3166 RSN L AVGFLGDSRRMNVA+TRA KHVAVVCDSSTICHNTFLARLL HIRYFGRVKHA+ Sbjct: 879 RSNNLGAVGFLGDSRRMNVALTRARKHVAVVCDSSTICHNTFLARLLHHIRYFGRVKHAD 938 Query: 3167 PGSFGESGLDMNPMLPSI 3220 PG G SGL NPMLPSI Sbjct: 939 PGGLGGSGLGTNPMLPSI 956 >XP_010063606.1 PREDICTED: DNA-binding protein SMUBP-2 [Eucalyptus grandis] KCW90988.1 hypothetical protein EUGRSUZ_A02997 [Eucalyptus grandis] Length = 968 Score = 1394 bits (3609), Expect = 0.0 Identities = 708/858 (82%), Positives = 762/858 (88%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVI 826 +V AL QNGDPLG R+LGKSVVRWICQ MRAMASDFA+AEVQGEFSE+ Q MGPGLTFVI Sbjct: 110 SVGALHQNGDPLGWRDLGKSVVRWICQAMRAMASDFAAAEVQGEFSEVRQRMGPGLTFVI 169 Query: 827 AAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQDWHETESWK 1006 AQPYLNAIPMP+GLEAICLKACTHYPTLFDHFQRELRDVLQ L+ +S+V +W TESWK Sbjct: 170 QAQPYLNAIPMPLGLEAICLKACTHYPTLFDHFQRELRDVLQGLERQSVVPNWRGTESWK 229 Query: 1007 LLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERDAE 1186 LLKELA+SA H+AI RK +QPKPVQGVLG+DLE+ K+IQ RID+FT MSELL IERDAE Sbjct: 230 LLKELASSAQHKAIARKASQPKPVQGVLGLDLEKVKSIQRRIDDFTTNMSELLCIERDAE 289 Query: 1187 LEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHLVL 1366 LEFTQEEL+AVP PD NSD+SKPIEFLVSHGQA QELCDTICNL+AVSTSTGLGGMHLVL Sbjct: 290 LEFTQEELDAVPMPDTNSDASKPIEFLVSHGQAQQELCDTICNLYAVSTSTGLGGMHLVL 349 Query: 1367 FRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRHGD 1546 
FRVEGNHRLPPTTLSPGDM+CVR+CDSRGA TSC+QGF+HNLGEDG SISVALESRHGD Sbjct: 350 FRVEGNHRLPPTTLSPGDMICVRVCDSRGASTTSCMQGFIHNLGEDGSSISVALESRHGD 409 Query: 1547 PTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKEDV 1726 PTFSKLFG+ +RIDRIQGLAD LTYERNCEALMLLQK+GLHK+NP+IA V TLFGD EDV Sbjct: 410 PTFSKLFGKTLRIDRIQGLADVLTYERNCEALMLLQKNGLHKKNPAIAVVATLFGDTEDV 469 Query: 1727 AWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLLKE 1906 A L N L + +E +L+G+ TFDDSQ+KAIALGLNK+RPLLIIQGPPGTGKT LLKE Sbjct: 470 ACLEFNQLVNWAEAELEGLSSYGTFDDSQRKAIALGLNKRRPLLIIQGPPGTGKTCLLKE 529 Query: 1907 LIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIVYS 2086 LI +AVQQGERVL+TAPTNAAVDNMVEKLSDIGL++VR+GNPARIS VASKSL EIV + Sbjct: 530 LIVQAVQQGERVLMTAPTNAAVDNMVEKLSDIGLDVVRMGNPARISESVASKSLGEIVNA 589 Query: 2087 KLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSSAQ 2266 +L SF EFERKK+DLRKDL CLKDDSLAAGIR + AQ Sbjct: 590 RLESFQTEFERKKADLRKDLRHCLKDDSLAAGIRQLLKQLGKAFKKKEKETVKEVLTGAQ 649 Query: 2267 VVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPVIL 2446 VVL TN+GAADPLIRRLD+FDLVVIDEAGQAIE SCWIP+LQGKRCILAGDQCQLAPV+L Sbjct: 650 VVLATNSGAADPLIRRLDSFDLVVIDEAGQAIEPSCWIPMLQGKRCILAGDQCQLAPVVL 709 Query: 2447 SRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTVAA 2626 SRKALEG LGVSL+ERAA LHEG LAT L TQYRMNDAIASWASKEMY G L SSSTV++ Sbjct: 710 SRKALEGGLGVSLMERAANLHEGILATLLITQYRMNDAIASWASKEMYEGLLKSSSTVSS 769 Query: 2627 HLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQHV 2806 HLLVD+PFVKPTWITQCPLLLLDTRM YGSLS GCEEHLDP GTGS YNEGEA+IVV HV Sbjct: 770 HLLVDSPFVKPTWITQCPLLLLDTRMPYGSLSAGCEEHLDPTGTGSLYNEGEADIVVHHV 829 Query: 2807 FSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 2986 FSLIYAGVSP AIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV Sbjct: 830 FSLIYAGVSPRAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIISMV 889 Query: 2987 RSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 3166 RSNTL AVGFLGDSRRMNVAITRA KHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE Sbjct: 890 
RSNTLGAVGFLGDSRRMNVAITRARKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKHAE 949 Query: 3167 PGSFGESGLDMNPMLPSI 3220 PGSFG SGL M+PMLPSI Sbjct: 950 PGSFGGSGLGMDPMLPSI 967 >GAV70650.1 AAA_11 domain-containing protein/AAA_12 domain-containing protein [Cephalotus follicularis] Length = 936 Score = 1384 bits (3582), Expect = 0.0 Identities = 703/860 (81%), Positives = 754/860 (87%), Gaps = 2/860 (0%) Frame = +2 Query: 647 NVRALSQNGDPLGRRELGKSVVRWICQGMRAMASDFASAEVQGEFSELGQLMGPGLTFVI 826 NVR L QNGDPLGRR++GKSVV WICQGM+AMA DF +AE QGEF E+ Q MGPGLTFVI Sbjct: 79 NVRTLYQNGDPLGRRDVGKSVVHWICQGMKAMAIDFDAAETQGEFCEVRQRMGPGLTFVI 138 Query: 827 AAQPYLNAIPMPVGLEAICLKACTHYPTLFDHFQRELRDVLQELQHKSLVQD--WHETES 1000 AQPYLNA+PMP+GLEAICLK CTHYPTLFDHFQRELRDVLQ HK + D W ETES Sbjct: 139 QAQPYLNAVPMPLGLEAICLKVCTHYPTLFDHFQRELRDVLQ---HKKRLADTDWRETES 195 Query: 1001 WKLLKELANSAHHRAIVRKVTQPKPVQGVLGMDLERFKTIQGRIDEFTKQMSELLRIERD 1180 WKLLKELANSA HRAI RK +QPKPV+GVLGM+ ++ K IQ RID+FT +MSELLRIERD Sbjct: 196 WKLLKELANSAQHRAIARKASQPKPVKGVLGMNFDKAKAIQTRIDDFTNRMSELLRIERD 255 Query: 1181 AELEFTQEELNAVPTPDENSDSSKPIEFLVSHGQAPQELCDTICNLFAVSTSTGLGGMHL 1360 AELEFTQEELNA+PTPDE+S + KPIEFLVSHGQA QELCDTICNL VSTSTGLGGM L Sbjct: 256 AELEFTQEELNAIPTPDESSGTLKPIEFLVSHGQAQQELCDTICNLNVVSTSTGLGGMQL 315 Query: 1361 VLFRVEGNHRLPPTTLSPGDMVCVRICDSRGACATSCIQGFVHNLGEDGCSISVALESRH 1540 VLFRVEGNHRLPPTTLSPGDMVCVR CDSRGA TSC+QGFV+NLG+DGCSISVALESR Sbjct: 316 VLFRVEGNHRLPPTTLSPGDMVCVRTCDSRGAGTTSCMQGFVNNLGDDGCSISVALESRC 375 Query: 1541 GDPTFSKLFGRNVRIDRIQGLADTLTYERNCEALMLLQKHGLHKRNPSIAAVVTLFGDKE 1720 GDPTFSKLFG+NVRIDRI GLAD LTYER+CEALM+LQK+GL K+NPSIA V TLFGDKE Sbjct: 376 GDPTFSKLFGKNVRIDRIPGLADALTYERDCEALMMLQKNGLQKKNPSIAVVATLFGDKE 435 Query: 1721 DVAWLVENDLADLSEVKLDGMLGSKTFDDSQKKAIALGLNKKRPLLIIQGPPGTGKTGLL 1900 D +WL EN LAD +E +LDG LGS +FDDSQ++AI LGLNKKRP+LI+QGPPGTGKTGLL Sbjct: 436 DFSWLKENHLADFAEAELDGQLGSGSFDDSQRRAITLGLNKKRPVLIVQGPPGTGKTGLL 495 Query: 1901 KELIARAVQQGERVLVTAPTNAAVDNMVEKLSDIGLNIVRVGNPARISPVVASKSLDEIV 2080 KEL+ +VQQGERVLVTAPTNA VDN+VEKLS GL 
IVRVGNPARISP VASKSL EIV Sbjct: 496 KELVVLSVQQGERVLVTAPTNAGVDNIVEKLSKTGLKIVRVGNPARISPAVASKSLSEIV 555 Query: 2081 YSKLASFLAEFERKKSDLRKDLSQCLKDDSLAAGIRXXXXXXXXXXXXXXXXXXXXXXSS 2260 SK SF AEFERKK+DLRKDL CLKDDSLAAGIR + Sbjct: 556 NSKFESFRAEFERKKTDLRKDLRHCLKDDSLAAGIRQLLKQLGKTLKKKEKEIVKEVLAG 615 Query: 2261 AQVVLGTNTGAADPLIRRLDTFDLVVIDEAGQAIETSCWIPILQGKRCILAGDQCQLAPV 2440 AQVVL TNTGAADPLIRRLDTFDLVVIDEA QAIE SCWIPILQGKRCILAGDQCQLAPV Sbjct: 616 AQVVLATNTGAADPLIRRLDTFDLVVIDEAAQAIEPSCWIPILQGKRCILAGDQCQLAPV 675 Query: 2441 ILSRKALEGRLGVSLLERAATLHEGALATKLTTQYRMNDAIASWASKEMYGGSLVSSSTV 2620 ILSR ALEG LGVSLLERAATLHEG LAT LTTQYRMNDAIA WASKEMYGG L SS TV Sbjct: 676 ILSRSALEGGLGVSLLERAATLHEGVLATILTTQYRMNDAIACWASKEMYGGLLKSSPTV 735 Query: 2621 AAHLLVDTPFVKPTWITQCPLLLLDTRMAYGSLSLGCEEHLDPAGTGSFYNEGEAEIVVQ 2800 A+HLL+D+PFVKPTWITQC LLLLDTRM YGSLS+GCEEHLDPAGTGSFYNEGEA+IVVQ Sbjct: 736 ASHLLIDSPFVKPTWITQCSLLLLDTRMPYGSLSVGCEEHLDPAGTGSFYNEGEADIVVQ 795 Query: 2801 HVFSLIYAGVSPSAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 2980 HVF L+YAGVSP+AIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS Sbjct: 796 HVFFLVYAGVSPTAIAVQSPYVAQVQLLRDRLDELPEAAGVEVATIDSFQGREADAVIIS 855 Query: 2981 MVRSNTLAAVGFLGDSRRMNVAITRACKHVAVVCDSSTICHNTFLARLLRHIRYFGRVKH 3160 MVRSNTL AVGFLGDSRRMNVAITRA KH+AVVCDSSTICHNTFLARLLR+IRYFGRVKH Sbjct: 856 MVRSNTLGAVGFLGDSRRMNVAITRARKHIAVVCDSSTICHNTFLARLLRYIRYFGRVKH 915 Query: 3161 AEPGSFGESGLDMNPMLPSI 3220 A+PG+FG SGL M+PMLPSI Sbjct: 916 ADPGTFGGSGLGMDPMLPSI 935