BLASTX nr result

ID: Zingiber25_contig00023784 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zingiber25_contig00023784
         (3245 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_001051931.1| Os03g0853700 [Oryza sativa Japonica Group] g...   857   0.0  
ref|XP_006650917.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   838   0.0  
ref|XP_002463498.1| hypothetical protein SORBIDRAFT_01g000820 [S...   835   0.0  
ref|XP_004981007.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   833   0.0  
ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding fact...   826   0.0  
ref|XP_003568557.1| PREDICTED: GC-rich sequence DNA-binding fact...   820   0.0  
ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding fact...   810   0.0  
dbj|BAK05949.1| predicted protein [Hordeum vulgare subsp. vulgare]    809   0.0  
gb|EAY92631.1| hypothetical protein OsI_14375 [Oryza sativa Indi...   808   0.0  
ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding fact...   803   0.0  
ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [A...   791   0.0  
ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Popu...   770   0.0  
tpg|DAA52554.1| TPA: hypothetical protein ZEAMMB73_777539 [Zea m...   768   0.0  
gb|EMT06523.1| GC-rich sequence DNA-binding factor-like protein ...   766   0.0  
gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein,...   766   0.0  
ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding fact...   761   0.0  
ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-...   757   0.0  
ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citr...   750   0.0  
gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus nota...   741   0.0  
ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putativ...   741   0.0  

>ref|NP_001051931.1| Os03g0853700 [Oryza sativa Japonica Group] gi|29126331|gb|AAO66523.1|
            expressed protein [Oryza sativa Japonica Group]
            gi|108712159|gb|ABF99954.1| expressed protein [Oryza
            sativa Japonica Group] gi|113550402|dbj|BAF13845.1|
            Os03g0853700 [Oryza sativa Japonica Group]
            gi|125588681|gb|EAZ29345.1| hypothetical protein
            OsJ_13411 [Oryza sativa Japonica Group]
          Length = 955

 Score =  857 bits (2213), Expect = 0.0
 Identities = 486/974 (49%), Positives = 642/974 (65%), Gaps = 43/974 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSE-SDDANAEEKSVPSPS-TKSQTLTLXXXXXXXXXXXXRLSFADDEE 194
            MSS R KNFRRR++ ++DA  ++ S   P+ TK+QT  +            RLSF +DE+
Sbjct: 1    MSSHR-KNFRRRTDDAEDAYGDDSSNSKPTATKTQTPPVPKPRSPRRQGASRLSFVEDED 59

Query: 195  EDND--------RRPS----RIPSSSAGAASVHRLTSSKDRSKASRLASSI-----PSNV 323
            +D+         RRP+    +  ++S  AA++HRLT ++DR K+S   ++      PSN 
Sbjct: 60   DDDAEEGPLSQRRRPAATVRQARTASPAAATLHRLTPARDRLKSSTAVAAAVPAPKPSNF 119

Query: 324  QPQVGEYTKERLLELQKNARPL-GSISRSQRPP----------------AVPEPKPRKSD 452
            Q   GEYT ERL ELQKNARPL GS+ R+  PP                A P P    + 
Sbjct: 120  QSHAGEYTPERLRELQKNARPLPGSLMRAPPPPPPPTAEAPRQRLPGAAASPAPATNTTA 179

Query: 453  RPAEPVIVLKGFLKQASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPT 632
               EPV++LKG +K  S     Q  +  +    N           ++   G       P 
Sbjct: 180  AAVEPVVILKGLVKPMS-----QASIGPRNPSQNEDKDEDESEEEEEEEEG-------PV 227

Query: 633  IPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG-MPSSRPSADGSSDEEDTDFQERISLF 809
            IPD  TI+AI               D+ISLDGG + SSR +A GSSDE+D + + RI+++
Sbjct: 228  IPDRATIEAIRAKRQQLQQPRHAAPDYISLDGGGVLSSREAAGGSSDEDDDETRGRIAMY 287

Query: 810  GIKADDKLK-KGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGL 986
              K+D +   KGVF  I+ R        ++ GFR+ +                 QFRKGL
Sbjct: 288  AEKSDSQRSTKGVFGVINNRGPAASLGVINDGFREVEDEKDDDEDEEERKWEEEQFRKGL 347

Query: 987  GKRIDDTSSQRV-NYSVAPIPLHPQPSVY---PGVAHQTSASMTSASYGASRSAEVLSIS 1154
            G+R+DD S+QR  N   AP+ + PQPS Y   P      S  +   S  AS SAE LSI+
Sbjct: 348  GRRVDDASAQRAANGGPAPVQVQPQPSGYSIDPRYQPSFSGVLPGTSIFASGSAEFLSIA 407

Query: 1155 QQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQ 1334
            QQA+VAS+A+QE I +LKE+HK T ++LV+TDT++TE+L+E+SSLE  L++A+ K+ +MQ
Sbjct: 408  QQADVASKALQENIRKLKETHKTTVDALVKTDTHLTEALSEISSLESGLQDAERKFVYMQ 467

Query: 1335 QLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAA 1514
            +LR++ISVMCDFLNDKAF IEELEE MQKLHE R  AV ERRA D+AD+ + +E+AVNAA
Sbjct: 468  ELRNYISVMCDFLNDKAFYIEELEEHMQKLHENRVTAVSERRAADLADESSVIEAAVNAA 527

Query: 1515 IAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRK 1694
            ++VLSKGSSSAY+              RES++LP ELDEFGRDIN++KRMD  RR E R+
Sbjct: 528  VSVLSKGSSSAYLSAASNAAQAAAAAARESSNLPPELDEFGRDINMQKRMDLKRREEDRR 587

Query: 1695 LRKARAESKRIASMEMD-NMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEE 1871
             RK R+ESKR++S     N   IEGELSTDESDSES+AY+SSR+EL++TA+ +FSDA+EE
Sbjct: 588  RRKIRSESKRLSSEGRSANNEHIEGELSTDESDSESSAYLSSRDELLKTADLVFSDAAEE 647

Query: 1872 YANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEW 2051
            Y++L+IVK+ FE WK QY  +YRDA+V++S PS+F+PYVRLELLKWDPL++ TDFF MEW
Sbjct: 648  YSSLRIVKDKFEGWKTQYPLAYRDAHVALSAPSVFTPYVRLELLKWDPLHETTDFFGMEW 707

Query: 2052 HKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFA 2231
            HK+LF+YG        +P++ D +LIP +VEKVALPILHH I HCWDIL+TQRTK AV A
Sbjct: 708  HKILFDYGEQNSESGTDPNNVDKDLIPVLVEKVALPILHHRIMHCWDILSTQRTKNAVDA 767

Query: 2232 TNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMA 2411
             NMVISY+P SSKAL +LLA +++RL EAI D++VP W S++T+ VPGA+Q+AA++FG+A
Sbjct: 768  INMVISYLPTSSKALHQLLAAVNSRLTEAIADISVPAWGSMVTRTVPGASQYAAHRFGVA 827

Query: 2412 VRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGI 2591
            +RLL+N+CLWK+I + PV               PH+KSI+ + HDAI R ERI A L G+
Sbjct: 828  IRLLKNVCLWKDIFAKPVLEKLALEELLKGKILPHMKSIILDAHDAIARAERISALLKGV 887

Query: 2592 WSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDK 2771
            WS P      SQKLQP +D + ELG KLE+RH  G+S EETRGLARRLK++LV LNEYDK
Sbjct: 888  WSSP------SQKLQPFIDLVVELGNKLERRHMSGISEEETRGLARRLKDILVELNEYDK 941

Query: 2772 ARAILRTFQLKEAL 2813
            ARAIL+TFQ++EAL
Sbjct: 942  ARAILKTFQIREAL 955


>ref|XP_006650917.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Oryza brachyantha]
          Length = 960

 Score =  838 bits (2164), Expect = 0.0
 Identities = 484/979 (49%), Positives = 639/979 (65%), Gaps = 48/979 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSE-SDDANAEEKSVPSPS-TKSQTLTLXXXXXXXXXXXXRLSFADDEE 194
            MSS R KNFRRR++ S+DAN ++ S   P+ TK+Q                RLSFADDE+
Sbjct: 1    MSSHR-KNFRRRTDDSEDANGDDSSNARPAATKAQPRPAPKPRSPRRQGASRLSFADDED 59

Query: 195  EDNDRRPSRIPSSSAGAASVHRLTSSKDRSK------------------ASRLASSIPSN 320
            ED D     +      AA+V +  ++   +                   A+ + +  PSN
Sbjct: 60   ED-DAEEGPLSQRRRPAATVRQARTAPPAAXXXXXXXXXXXXXXXAPAVAAAVPAPKPSN 118

Query: 321  VQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEPKPRK-------SDRPA----- 461
             Q   GEYT ERL ELQKNARPL GS+ R+  PP      PR+       S  PA     
Sbjct: 119  FQSHAGEYTPERLRELQKNARPLPGSLMRAPPPPPPSAEAPRQRLAGAAASPVPATNTTA 178

Query: 462  -----EPVIVLKGFLK---QASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTG 617
                 EP++VLKG +K   QAS G        L  +E +           +++  G    
Sbjct: 179  AAVAVEPMVVLKGLVKPMSQASIGPRNP----LPNEEKDEDESEEEEEEEEEAEEG---- 230

Query: 618  AKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG-MPSSRPSADGSSDEEDTDFQE 794
               P IPD  TI+AI               D+ISLDGG + SSR +A GSSDEED + + 
Sbjct: 231  ---PVIPDRATIEAIRAKRQQLHQPRHPFPDYISLDGGGVLSSRDAAAGSSDEEDDETRG 287

Query: 795  RISLFGIKADDKLK-KGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQ 971
            RI+++  K+D +   KGVF +I+ R        ++  FR+ + +               Q
Sbjct: 288  RIAMYAEKSDSQRSTKGVFAAINNRGPAASLGVINDSFREVEDDKDDDEDEEERRWEEEQ 347

Query: 972  FRKGLGKRIDDTSSQRV-NYSVAPIPLHPQPSVY---PGVAHQTSASMTSASYGASRSAE 1139
            FRKGLG+R+DD S+QR  N   AP+ + PQPS Y   P      +  +  AS  AS S E
Sbjct: 348  FRKGLGRRVDDASAQRAANGGPAPVQVQPQPSGYSVDPRYQPSFTGVLPGASVFASGSTE 407

Query: 1140 VLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDK 1319
             LSI+QQA+VAS+A+++ I +LKE+HK T ++LV+TDT+++E+L+E+S+LE  L++A+ K
Sbjct: 408  FLSIAQQADVASKALKDNIRKLKETHKTTVDALVKTDTHLSEALSEISNLESGLQDAEKK 467

Query: 1320 YNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVES 1499
            + +MQ+LR++ISVMCDFLNDKAF IEELEE MQKLHE R  AV ERRA D+AD+ + +E+
Sbjct: 468  FVYMQELRNYISVMCDFLNDKAFYIEELEEHMQKLHENRVTAVSERRAADLADESSIIET 527

Query: 1500 AVNAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRR 1679
            AVNAA++VLSKGSSSAY+              +ES+++  ELDEFGRDIN++KRMD  RR
Sbjct: 528  AVNAAVSVLSKGSSSAYLSAASNAAQAAAAAAKESSNMLPELDEFGRDINMQKRMDLKRR 587

Query: 1680 AESRKLRKARAESKRIASM-EMDNMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFS 1856
             E R+ RK R+ESKR+ S  +  N   IEGELSTDESDSES+AY+SSR+EL++TA+ +FS
Sbjct: 588  EEDRRRRKIRSESKRLPSTGKSANDEHIEGELSTDESDSESSAYLSSRDELLKTADLVFS 647

Query: 1857 DASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDF 2036
            DA+EEY++L+IVK+ FE WK QY  +YRDA+V++S PS+F+PYVRLELLKWDPL++ TDF
Sbjct: 648  DAAEEYSSLRIVKDKFEGWKTQYPLAYRDAHVALSAPSVFTPYVRLELLKWDPLHETTDF 707

Query: 2037 FDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTK 2216
            FDM WHK+LF+YG+       +P+DAD NLIP +VEKVALPILH  I HCWDIL+TQRTK
Sbjct: 708  FDMGWHKILFDYGVQNNESATDPNDADMNLIPVLVEKVALPILHQRIMHCWDILSTQRTK 767

Query: 2217 GAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAY 2396
             AV A NM ISY+P SSKAL +LLA +++RL EAI D++VP W S++T+VVPGA+Q+AA+
Sbjct: 768  NAVDAVNMAISYLPTSSKALHQLLATVNSRLTEAIADISVPAWGSMVTRVVPGASQYAAH 827

Query: 2397 KFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIA 2576
            +FG+AVRLL+N+CLWK+I + PV               PH+KSI+ ++HDAI R ERI A
Sbjct: 828  RFGVAVRLLKNVCLWKDIFAKPVLEKLALEDLLRGKILPHMKSIILDVHDAIARAERISA 887

Query: 2577 SLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSL 2756
            SL G+WS P      SQKLQP +D + ELG KLE+RH  G+S EETRGLARRLKN+LV L
Sbjct: 888  SLSGVWSSP------SQKLQPFIDLVVELGNKLERRHMSGISEEETRGLARRLKNILVEL 941

Query: 2757 NEYDKARAILRTFQLKEAL 2813
            NEYDKARAIL+TFQL+EAL
Sbjct: 942  NEYDKARAILKTFQLREAL 960


>ref|XP_002463498.1| hypothetical protein SORBIDRAFT_01g000820 [Sorghum bicolor]
            gi|241917352|gb|EER90496.1| hypothetical protein
            SORBIDRAFT_01g000820 [Sorghum bicolor]
          Length = 1094

 Score =  835 bits (2158), Expect = 0.0
 Identities = 484/990 (48%), Positives = 646/990 (65%), Gaps = 59/990 (5%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESD-DANAEEKSVPSPST----KSQTLTLXXXXXXXXXXXX-RLSFA 182
            MSS R KNFRRR++ D DAN +  S   PST    K++TLT+             RLSFA
Sbjct: 137  MSSSR-KNFRRRADDDEDANGDGGSHTKPSTATSTKTKTLTVPKPKSPPRRQGASRLSFA 195

Query: 183  DDEEEDNDR---------------RPSRIPSSSAGAASVHRLTSSKDRSKAS------RL 299
            DDE+ED+                 RP+R  S +AGA  +HRLT ++DR ++S        
Sbjct: 196  DDEDEDDAEEGPFAQRRRPPTASVRPARTASPAAGA--LHRLTPARDRIRSSPAPAVAAA 253

Query: 300  ASSIPSNVQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEPKPRKSDRP------ 458
            ++  PSN Q   GEYT ERL ELQKNARPL GS+ RSQ  P  P  +PR    P      
Sbjct: 254  SAPKPSNFQSHAGEYTPERLRELQKNARPLPGSLLRSQ--PQTPATEPRSQKLPGIPASS 311

Query: 459  ---------AEPVIVLKGFLKQAS--------PGRDKQEGVVLKRQETNXXXXXXXXXXX 587
                     AE V++LKG +K  S        P  DK+E    + +E             
Sbjct: 312  TPATTTAAAAETVVILKGLVKPMSEASIGPRIPKHDKEEDKSEEEEE------------G 359

Query: 588  DDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG-MPSSRPSADGS 764
            D+ + G       P IPD  TI AI               D+ISLDGG + SSR   D S
Sbjct: 360  DEEDEG-------PVIPDRATIDAIRAKRQQRQQPRHAAPDYISLDGGGVLSSRGGGDES 412

Query: 765  SDEEDTDFQERISLFGIKADDKLK--KGVFESIDQRLTITDERKMDGGFRKGDINIXXXX 938
            SDE+D + ++RI+++  K  D L+  K VF  I  R   T    +  G R  + +     
Sbjct: 413  SDEDDNETRDRIAMYTDKPSDGLRSTKSVFGGISNRGPATSLGTLSDGNRMVEDDRDDDD 472

Query: 939  XXXXXXXXXXQFRKGLGKRIDDTSSQR-VNYSVAPIPLHPQPSVYPGVAH---QTSASMT 1106
                      QFRKGLG+R+DD S+QR  N   A + + PQP  YP  +H     S+ + 
Sbjct: 473  DEEERRWEEEQFRKGLGRRMDDASTQRSANGVPAAMHVQPQPFGYPVGSHYQPSLSSVVP 532

Query: 1107 SASYGASRSAEVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSS 1286
            +AS  AS +AE LSI+QQA+VA++A+Q+ I +L+E+HK T ++LV+TDT++ E+L+E+SS
Sbjct: 533  AASVFASGTAEFLSIAQQADVANKALQDNIRKLRETHKTTVSALVKTDTHLNEALSEISS 592

Query: 1287 LEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRAD 1466
            LE  L++A+ ++ +MQ+LRD++SVMCDFLNDKAFLIEELEE +QKLHE RALA+ ERRA 
Sbjct: 593  LESGLQDAEKRFVYMQELRDYVSVMCDFLNDKAFLIEELEENIQKLHENRALAISERRAA 652

Query: 1467 DIADDDNEVESAVNAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDI 1646
            D+AD+   +E+AVNAA+++LSKGSSSAY+              RES++LP ELDEFGRDI
Sbjct: 653  DLADESGVIEAAVNAAVSILSKGSSSAYLSAASNAAQAAAAAARESSNLPPELDEFGRDI 712

Query: 1647 NLKKRMDFTRRAESRKLRKARAESKRIAS-MEMDNMLQIEGELSTDESDSESNAYISSRN 1823
            N++KRMD  RR E+R+ RK ++E+KR+AS ++   + +IEGELSTDESDSES AY+SSR+
Sbjct: 713  NMQKRMDLKRREENRRRRKTQSETKRLASAVKNKGIEKIEGELSTDESDSESTAYVSSRD 772

Query: 1824 ELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELL 2003
            E ++ A+ +F+DA EEY++L+ VK+ FE WK QY S+YRDA+V++S PS+F+P+VRLELL
Sbjct: 773  EFLKAADHVFNDAKEEYSSLRTVKDKFEGWKTQYPSAYRDAHVALSAPSVFTPFVRLELL 832

Query: 2004 KWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEH 2183
            KWDPL++ TDFFDM+WHK+LF+YG+ A       +D+D  ++P +VEKVALPILHH I+H
Sbjct: 833  KWDPLHETTDFFDMDWHKVLFDYGMQANESPSGSNDSD--VVPVLVEKVALPILHHRIKH 890

Query: 2184 CWDILNTQRTKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITK 2363
            CWD+L+TQRT+ AV A+ MVI Y+P SSK L +LLA + +RL EAI DL+VP W S++T+
Sbjct: 891  CWDVLSTQRTRNAVDASRMVIGYLPTSSKDLHQLLASVRSRLTEAIADLSVPAWGSMVTR 950

Query: 2364 VVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIH 2543
             VPGA+Q+AAY+FG+A+RLL+N+CLWK+IL+  V               PH+KSI+ ++H
Sbjct: 951  TVPGASQYAAYRFGVAIRLLKNVCLWKDILAEHVVEKLALDELLRGKILPHMKSIILDVH 1010

Query: 2544 DAIMRTERIIASLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGL 2723
            DAI R ERI ASL  +W         SQKLQP VD + ELG KLE+RH  G+S EETRGL
Sbjct: 1011 DAITRAERIAASLSEVWP------KQSQKLQPFVDLVVELGNKLERRHTSGISEEETRGL 1064

Query: 2724 ARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
            ARRLKN+LVSLNEYDKARAIL+TFQL+EAL
Sbjct: 1065 ARRLKNVLVSLNEYDKARAILKTFQLREAL 1094


>ref|XP_004981007.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Setaria italica]
          Length = 954

 Score =  833 bits (2151), Expect = 0.0
 Identities = 481/976 (49%), Positives = 642/976 (65%), Gaps = 45/976 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESD-DANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXX-RLSFADDEE 194
            MSS R KNFRRR++ + DAN +  S P P+TK+QTLT+             RLSFADDE+
Sbjct: 1    MSSHR-KNFRRRADDEEDANGDGGSHPKPATKTQTLTVPKPKSPPRRQGASRLSFADDED 59

Query: 195  EDNDRR----PSRIPSSSA--------GAASVHRLTSSKDRSKASRLASSI------PSN 320
            +D+       P R P++S          AAS+HRLT +++R ++S  A+        PSN
Sbjct: 60   DDDAEEGPLAPRRRPTASVRPARTASPAAASLHRLTPARERHRSSPAAAIAAVSAPKPSN 119

Query: 321  VQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEPKPRK------SDRPA------ 461
             Q   GEYT ERL ELQKNARPL GS+ R+  P   PEP+ ++      S  P       
Sbjct: 120  FQSHAGEYTPERLRELQKNARPLPGSLMRAPPPTLAPEPRSQRLAGAPASSTPTTSTAAA 179

Query: 462  -EPVIVLKGFLK---QASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFP 629
             EPV++LKG +K   +AS G  K     L++++ +           D+           P
Sbjct: 180  TEPVVILKGLVKPMAEASIGPRKP----LQKEDEDKSEEEEGGDEEDEG----------P 225

Query: 630  TIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGGMPSSRPSADG-SSDEEDTDFQERISL 806
             IPD  TI+AI               D+ISLDGG   S  +A G SSDE+D +   RI++
Sbjct: 226  VIPDRATIEAIRAKRQQMQQPRHAAPDYISLDGGGVLSSKNAGGESSDEDDNETGGRIAM 285

Query: 807  FGIKADDKLK--KGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRK 980
            +  K+ D L+  KGVF  I+ R        +  G R+ + N+              QFRK
Sbjct: 286  YTDKSTDGLRSTKGVFGGINNRGPAASLGALSDGIREVEDNMDDDDDEEERRWEEEQFRK 345

Query: 981  GLGKRIDDTSSQRV-NYSVAPIPLHPQPSVYP-GVAHQTSAS--MTSASYGASRSAEVLS 1148
            GLG+R+DD S+QR  N + A   + PQ   Y  G  HQ S S  + +AS  AS S E LS
Sbjct: 346  GLGRRVDDASAQRTANGAPASAQVQPQAFGYSVGSHHQPSLSGAVPAASVFASGSVEFLS 405

Query: 1149 ISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNF 1328
            I+QQA+VA++A+QE I +L+E+HK T ++LV+T+T++ E+L+E+SSL+  LK+A+ K+ +
Sbjct: 406  IAQQADVANKALQENIRKLRETHKTTVSALVKTETHLNEALSEISSLDSGLKDAEKKFVY 465

Query: 1329 MQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVN 1508
            MQ+LR +ISVMCDFLNDKAF IEELEE MQKLHE RALA+ ERRA D+AD+   +E+AV+
Sbjct: 466  MQELRHYISVMCDFLNDKAFYIEELEEHMQKLHENRALAISERRAADLADESGVIEAAVD 525

Query: 1509 AAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAES 1688
            AA+++LSKGSSSAY+              RES++LP ELDEFGRDINL+KRMD  RR E+
Sbjct: 526  AAVSILSKGSSSAYLSAASNAAQAAAAAARESSNLPPELDEFGRDINLQKRMDLKRREEN 585

Query: 1689 RKLRKARAESKRIASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDAS 1865
            R+ RKA++ESKR+AS   +N ++ IEGE+STDESDSES AY+SSR+EL++TA+ +FSDAS
Sbjct: 586  RRRRKAKSESKRLASAVKNNDIEKIEGEISTDESDSESTAYVSSRDELLRTADVVFSDAS 645

Query: 1866 EEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDM 2045
            EEY++L+IVK+ FE WK QY S+YRDA+V++S PS+F+PYVRLELLKWDPL+   DFFDM
Sbjct: 646  EEYSSLQIVKDKFEGWKTQYPSAYRDAHVALSAPSVFTPYVRLELLKWDPLHKTIDFFDM 705

Query: 2046 EWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAV 2225
            +WHK+LF+Y +    +       D +++P +VEKVALPILHH I+HCWD+L+++RT+ AV
Sbjct: 706  DWHKVLFDYDV-KDNESASGGSTDTDVVPVLVEKVALPILHHRIKHCWDVLSSKRTENAV 764

Query: 2226 FATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFG 2405
             A  MVI Y+PASSK L +LLA + +RL +AI DL+VP W S++T+ VPGA Q+AAY+FG
Sbjct: 765  DAIRMVIGYLPASSKDLHQLLASVKSRLTQAIADLSVPAWGSMVTRTVPGATQYAAYRFG 824

Query: 2406 MAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLV 2585
            +A RLLRN+CLWK+IL+  V               PH+KSI+ + HDAI R ERI ASL 
Sbjct: 825  VATRLLRNVCLWKDILADHVVEELALDGLLTGKILPHMKSIILDFHDAITRAERIAASLS 884

Query: 2586 GIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEY 2765
            G+WS        SQKLQP V+ + ELG KLE+RH  G+S EETRGLARRLKN+L  LNEY
Sbjct: 885  GVWS------KQSQKLQPFVNLVVELGNKLERRHTSGISEEETRGLARRLKNILAGLNEY 938

Query: 2766 DKARAILRTFQLKEAL 2813
            DKARAI + FQL+EA+
Sbjct: 939  DKARAISKNFQLREAI 954


>ref|XP_004135116.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 920

 Score =  826 bits (2133), Expect = 0.0
 Identities = 470/959 (49%), Positives = 613/959 (63%), Gaps = 28/959 (2%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPSPS----------TKSQTLTLXXXXXXXXXXXXR 170
            MS  RA+NFRRR++ +D + E K   +PS          + + ++               
Sbjct: 1    MSGSRARNFRRRADDNDDDDEPKGSTAPSISASNASSKPSSTSSVVATKPKKANPQGLKL 60

Query: 171  LSFADDEEEDNDRRPSRIPSSS---------AGAASVHRLTSSKDR-SKASRLASSIPSN 320
            LSFA DEE D   RPS   SSS         A  +S H++T+ KDR + +S +++S+PSN
Sbjct: 61   LSFASDEENDAPLRPSSSKSSSSKKPSSARLAKPSSTHKITALKDRIAHSSSISASVPSN 120

Query: 321  VQPQVGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQA 500
            VQPQ G YTKE L ELQKN R L S     RP +  +P        AEPVIVLKG LK A
Sbjct: 121  VQPQAGVYTKEALRELQKNTRTLAS----SRPSSESKPS-------AEPVIVLKGLLKPA 169

Query: 501  SPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXX 680
                   E V    +E              DS+G S        IPD  TI AI      
Sbjct: 170  -------EQVPDSAREAKESSSEDDEAGRKDSSGSS--------IPDQATINAIRAKRER 214

Query: 681  XXXXXXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKGVFESID 860
                     D+ISLD G   S  +A G   +E+ +F  RI++ G K +   KKGVFE +D
Sbjct: 215  MRQAGVAAPDYISLDAG---SNRTAPGELSDEEAEFPGRIAMIGGKLESS-KKGVFEEVD 270

Query: 861  QRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVAP 1040
                   E+ +DG   + +I                QFRKGLGKR+DD S++  + SV  
Sbjct: 271  -------EQGIDGA--RTNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPV 321

Query: 1041 IP-LHPQPSVYP------GVAHQTSASMTSASYGASRSAEVLSISQQAEVASRAMQETIN 1199
            +P + PQ  +YP       V   ++A+    S   S+  + LSISQQAE+A  AMQE++ 
Sbjct: 322  VPSVQPQNLIYPTTIGYSSVPSMSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESMG 381

Query: 1200 RLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLND 1379
            RLKES++ T  S+++TD N++ SL +++ LEK+L  A DK+ FMQ+LRDF+SV+CDFL  
Sbjct: 382  RLKESYRRTAMSVLKTDENLSASLLKITDLEKALSAAGDKFMFMQKLRDFVSVICDFLQH 441

Query: 1380 KAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLSK-GSSSAYVX 1556
            KA  IEELEEQMQKLHE+RA  VVERR  D  D+  E+E+AV AAI++L+K GSS+  V 
Sbjct: 442  KAPFIEELEEQMQKLHEERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMVT 501

Query: 1557 XXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASM 1736
                         RE A+LP +LDEFGRD+NL+KRMD  RRAE+RK R+++ +SKR+ASM
Sbjct: 502  AATSAAQAAIALSREQANLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASM 561

Query: 1737 EMDNMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWK 1916
            E+D   ++EGE STDESDS+S AY S+R+ L+QTAE+IFSDA+EE++ L +VK+ FE WK
Sbjct: 562  EVDGHQKVEGESSTDESDSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWK 621

Query: 1917 NQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQD 2096
              Y ++YRDAY+S+S+P++FSPYVRLELLKWDPL+++ DFFDM WH LLFNYG+P  G D
Sbjct: 622  RDYSATYRDAYMSLSIPAIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSD 681

Query: 2097 FEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKAL 2276
            F P+DADANL+PE+VEKVALPILHHEI HCWD+L+T+ T+ A FAT+++ +YVP SS+AL
Sbjct: 682  FAPNDADANLVPELVEKVALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEAL 741

Query: 2277 RELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILS 2456
             ELL VI TRL+ AI DL VP W+S++TK VP AA+ AAY+FGM+VRL+RNICLWK I++
Sbjct: 742  TELLVVIRTRLSGAIEDLTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEIIA 801

Query: 2457 MPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQ 2636
            +P+               PHV+SI  NIHDA+ RTERIIASL G+W+G  +    S KLQ
Sbjct: 802  LPILEKLALEELLYGKVLPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQ 861

Query: 2637 PLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
            PLVD +  LG  LEK+H  G++  ET GLARRLK MLV LNEYD AR I +TF LKEAL
Sbjct: 862  PLVDYVLLLGRTLEKKHISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 920


>ref|XP_003568557.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Brachypodium
            distachyon]
          Length = 954

 Score =  820 bits (2119), Expect = 0.0
 Identities = 490/977 (50%), Positives = 641/977 (65%), Gaps = 46/977 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSE-SDDANAEEKSVPS--PSTKSQTLTLXXXXXXXXXXXX-RLSFADD 188
            MSS R KNFRRR++ +D A  E+  +PS   +TK+Q+  +             RLSFAD+
Sbjct: 1    MSSHR-KNFRRRTDDADGAKGEDAGLPSRPAATKTQSPAVPKPVSPRRQQGASRLSFADE 59

Query: 189  EEEDND---------RRPSRIPSS----SAGAASVHRLTSSKDRSKASRLASSI-----P 314
            E+ED+          RRPS    S    S  A+++HRLT +KDR K+S   S+      P
Sbjct: 60   EDEDDAEEGPFAQQRRRPSASVRSTRTASPAASALHRLTPAKDRLKSSPAISAAVPAPKP 119

Query: 315  SNVQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEPKPRK------------SDR 455
            SN Q   GEYT ERL ELQKNAR L GS+ R   P    E + ++            S  
Sbjct: 120  SNFQSHAGEYTPERLRELQKNARSLPGSLMRPPPPALAAESRHQRFAGTAASPASGTSAV 179

Query: 456  PAEPVIVLKGFLK---QASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKF 626
              EPV+VLKG +K   QAS G  K      K  E+            ++  G ++   K 
Sbjct: 180  ATEPVVVLKGLVKPMAQASIGPRKPLQNEDKSDES------------EEEEGNNVD--KG 225

Query: 627  PTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG-MPSSRPSADGSSDEEDTDFQERIS 803
            P IPD  TI+AI               DFISLDGG + SSR +  GSSDEED + Q RI+
Sbjct: 226  PLIPDKATIEAIRAKRQQLQQPRHAAPDFISLDGGGVLSSRDAVGGSSDEEDNEMQGRIA 285

Query: 804  LFGIKADD--KLKKGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFR 977
            ++  K+ D  +  KGVF  I+ R        ++ GFR+ + +               QF+
Sbjct: 286  MYTEKSSDGHRSSKGVFHGINNRGPAASLGVINDGFREPEDDKDDDEEEEERKWEEEQFK 345

Query: 978  KGLGKRIDDTSSQRV-NYSVAPIPLHPQPSVYPGVAH-QTSAS--MTSASYGASRSAEVL 1145
            K LG+R+DD+S+Q+V N + AP  + PQPS Y G  H QTS S  +  AS  AS SAE L
Sbjct: 346  KALGRRMDDSSAQKVANGAPAPKQVQPQPSGYLGGPHYQTSVSGVVPGASVFASGSAEFL 405

Query: 1146 SISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYN 1325
            SISQQA+VAS+A+QE I +LKE+HK T   LVRTD ++ E+L+E+SSLE SL++A+ K+ 
Sbjct: 406  SISQQADVASKALQENIRKLKETHKATVGGLVRTDAHLNEALSEISSLESSLQDAEKKFV 465

Query: 1326 FMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAV 1505
            +MQ+LR++ISV+CDFLNDKAF IEELEE MQKLHE RALAV ERRA D+AD+ + +E+AV
Sbjct: 466  YMQELRNYISVVCDFLNDKAFFIEELEEHMQKLHENRALAVSERRAADLADESSVIEAAV 525

Query: 1506 NAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAE 1685
            NAAI+VLSKGSSSA +              RE+++LP +LDEFGRDINL+KRMD  RR E
Sbjct: 526  NAAISVLSKGSSSANLSSASNAAQAAAAAARETSNLPPQLDEFGRDINLQKRMDLKRREE 585

Query: 1686 SRKLRKARAESKRIASM-EMDNMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFSDA 1862
            +RK RKAR+ESKR++S  +  +  QIEGELSTDESD++S+AY+SSR+EL++TA+ +FSDA
Sbjct: 586  NRKRRKARSESKRLSSTGKSVSSEQIEGELSTDESDTDSSAYLSSRDELLKTADVVFSDA 645

Query: 1863 SEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFD 2042
            +EEY++L IVK+ FE WK QY S+YRDA+ ++S PS+F+PYVRLELLKWDPL++ T FF 
Sbjct: 646  AEEYSSLAIVKDKFEGWKTQYPSAYRDAHAALSAPSVFTPYVRLELLKWDPLHETTGFFG 705

Query: 2043 MEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGA 2222
            MEW ++L +YG+  K    + +DAD NL+P +VEKVALPILHH + HCWDIL+TQRTK  
Sbjct: 706  MEWPEILLDYGVQNKDSP-DLNDADVNLVPVLVEKVALPILHHRVMHCWDILSTQRTKNV 764

Query: 2223 VFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKF 2402
            V+A N V+ ++P SS AL +LLA ++ RL  AI DL+VP W S++T+ VPGAAQ+AAY+F
Sbjct: 765  VYAVNTVMDFLPTSSTALHQLLASVYNRLAGAIADLSVPAWGSMVTRAVPGAAQYAAYRF 824

Query: 2403 GMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASL 2582
            G+A RLL+N+C WKN LS  V               PH+KSI+ ++HDAI RTERI ASL
Sbjct: 825  GVATRLLKNVCSWKNTLSEDV-VEKLALELLMGKILPHMKSIILDVHDAITRTERIAASL 883

Query: 2583 VGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNE 2762
              IWS P      S+KLQP  D + EL  KLE+RH  G+S EET GLARRLKN++V+LNE
Sbjct: 884  SVIWSSP------SKKLQPFTDLVLELSKKLERRHMSGISEEETHGLARRLKNIMVALNE 937

Query: 2763 YDKARAILRTFQLKEAL 2813
            YDKAR IL++F L+EAL
Sbjct: 938  YDKARNILKSFHLREAL 954


>ref|XP_004159322.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Cucumis
            sativus]
          Length = 889

 Score =  810 bits (2091), Expect = 0.0
 Identities = 458/942 (48%), Positives = 603/942 (64%), Gaps = 11/942 (1%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRLSFADDEEED 200
            MS  RA+NFRRR++ +D + E K   +PS  +   +                F +     
Sbjct: 1    MSGSRARNFRRRADDNDDDDEPKGSTAPSISASNASSKPSSTSSVVATKPKKFQE----- 55

Query: 201  NDRRPSRIPSSS--AGAASVHRLTSSKDR-SKASRLASSIPSNVQPQVGEYTKERLLELQ 371
                    PSS+  A  +S H++T+ KDR + +S +++S+PSNVQPQ G YTKE L ELQ
Sbjct: 56   --------PSSARLAKPSSTHKITALKDRIAHSSSISASVPSNVQPQAGVYTKEALRELQ 107

Query: 372  KNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQASPGRDKQEGVVLKRQET 551
            KN R L S     RP +  +P        AEPVIVLKG LK A    D          E 
Sbjct: 108  KNTRTLAS----SRPSSESKPS-------AEPVIVLKGLLKPAEQVPDSAREAKESSSE- 155

Query: 552  NXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG 731
                        DD  G   +G+   +IPD  TI AI               D+ISLD G
Sbjct: 156  ------------DDEAGKDSSGS---SIPDQATINAIRAKRERMRQAGVAAPDYISLDAG 200

Query: 732  MPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKGVFESIDQRLTITDERKMDGGFRK 911
               S  +A G   +E+ +F  RI++ G K +   KKGVFE +D       E+ +DG   +
Sbjct: 201  ---SNRTAPGELSDEEAEFPGRIAMIGGKLESS-KKGVFEEVD-------EQGIDGA--R 247

Query: 912  GDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVAPIP-LHPQPSVYP----- 1073
             +I                QFRKGLGKR+DD S++  + SV  +P + PQ  +YP     
Sbjct: 248  TNIIEHSDEDEEEKIWEEEQFRKGLGKRMDDGSTRVESTSVPVVPSVQPQNLIYPTTIGY 307

Query: 1074 -GVAHQTSASMTSASYGASRSAEVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTD 1250
              V   ++A+    S   S+  + LSISQQAE+A  AMQE++ RLKES++ T  S+++TD
Sbjct: 308  SSVPSVSTATSIGGSVSISQGLDGLSISQQAEIAKTAMQESMGRLKESYRRTAMSVLKTD 367

Query: 1251 TNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHE 1430
             N++ SL +++ LEK+L  A DK+ FMQ+LRDF+SV+CDFL  KA  IEELEEQMQKLHE
Sbjct: 368  ENLSASLLKITDLEKALSAAGDKFIFMQKLRDFVSVICDFLQHKAPFIEELEEQMQKLHE 427

Query: 1431 KRALAVVERRADDIADDDNEVESAVNAAIAVLSK-GSSSAYVXXXXXXXXXXXXXXRESA 1607
            +RA  VVERR  D  D+  E+E+AV AAI++L+K GSS+  +              RE A
Sbjct: 428  ERASTVVERRVADNDDEMVEIETAVKAAISILNKKGSSNEMITAATSAAQAAIALSREQA 487

Query: 1608 DLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASMEMDNMLQIEGELSTDES 1787
            +LP +LDEFGRD+NL+KRMD  RRAE+RK R+++ +SKR+ASME+D   ++EGE STDES
Sbjct: 488  NLPTKLDEFGRDLNLQKRMDMKRRAEARKRRRSQYDSKRLASMEVDGHQKVEGESSTDES 547

Query: 1788 DSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVP 1967
            DS+S AY S+R+ L+QTAE+IFSDA+EE++ L +VK+ FE WK  Y ++YRDAY+S+S+P
Sbjct: 548  DSDSAAYQSNRDLLLQTAEQIFSDAAEEFSQLSVVKQRFEAWKRDYSATYRDAYMSLSIP 607

Query: 1968 SLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEK 2147
            ++FSPYVRLELLKWDPL+++ DFFDM WH LLFNYG+P  G DF P+DADANL+PE+VEK
Sbjct: 608  AIFSPYVRLELLKWDPLHESADFFDMNWHSLLFNYGMPEDGSDFAPNDADANLVPELVEK 667

Query: 2148 VALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITD 2327
            VALPILHHEI HCWD+L+T+ T+ A FAT+++ +YVP SS+AL ELL VI TRL+ AI D
Sbjct: 668  VALPILHHEIAHCWDMLSTRETRNAAFATSLITNYVPPSSEALTELLVVIRTRLSGAIED 727

Query: 2328 LNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXX 2507
            L VP W+S++TK VP AA+ AAY+FGM+VRL+RNICLWK I+++P+              
Sbjct: 728  LTVPTWNSLVTKAVPNAARIAAYRFGMSVRLMRNICLWKEIIALPILEKLALEELLYGKV 787

Query: 2508 XPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRH 2687
             PHV+SI  NIHDA+ RTERIIASL G+W+G  +    S KLQPLVD +  LG  LEK+H
Sbjct: 788  LPHVRSITANIHDAVTRTERIIASLAGVWTGSGIIGDRSHKLQPLVDYVLLLGRTLEKKH 847

Query: 2688 ALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
              G++  ET GLARRLK MLV LNEYD AR I +TF LKEAL
Sbjct: 848  ISGIAESETSGLARRLKKMLVELNEYDNARDIAKTFHLKEAL 889


>dbj|BAK05949.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 958

 Score =  809 bits (2089), Expect = 0.0
 Identities = 473/979 (48%), Positives = 621/979 (63%), Gaps = 48/979 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPS--PSTKSQTLTLXXXXXXXXXXXXRLSFADDEE 194
            MSS R KNFRRR++ DD    E + P+  PS+K+Q                RLSFAD+EE
Sbjct: 1    MSSHR-KNFRRRTDDDDGGKAEDAGPASRPSSKAQP-----PPAPPKPRTSRLSFADEEE 54

Query: 195  EDND-----------RRPSRIPS----SSAGAASVHRLTSSKDRSKASRLASSI---PSN 320
            +++D           RRPS   S    +S  AA++HR+T ++DR ++S    +    PSN
Sbjct: 55   DEDDAEEGPFAQHRTRRPSASVSQARTASPAAAALHRVTPARDRVRSSPAVVAPVPKPSN 114

Query: 321  VQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEP--KPRKSDR------------ 455
             Q   GEYT ERL ELQKNARPL GS+ R+  PP  P P  +PR                
Sbjct: 115  FQSHAGEYTPERLRELQKNARPLPGSLMRAPAPPPPPPPAAEPRHQRLAGAAASSSAAPT 174

Query: 456  ------PAEPVIVLKGFLKQASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTG 617
                  PAEPV+VLKG +K  +     Q  +  +R   N           +D   G   G
Sbjct: 175  TAGKAVPAEPVVVLKGLVKPMA-----QASIGPRRPLPNEVQDGDSEEEAEDDGDGEEKG 229

Query: 618  AKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG--MPSSRPSADGSSDEEDTDFQ 791
               P IPD  TI+AI               DFISLDGG  + S + +A GSSDE+D + +
Sbjct: 230  ---PLIPDKATIEAIRAKRQQLQQPRHAAPDFISLDGGGVLSSRKGAAGGSSDEDDNEIE 286

Query: 792  ERISLFGIKADD--KLKKGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXX 965
             RI+++  K  D  +  KGVF+ I+ R        M   F + + +              
Sbjct: 287  GRIAMYSEKQSDGQRSSKGVFQGINNRGPAASLGVMKDRFMEVEDDEVDDEEEEERKWEE 346

Query: 966  XQFRKGLGKRIDDTSS-QRVN--YSVAPIPLHPQPSVYPGVAHQTSASMTSASYGASRSA 1136
             Q +K LG R+DD+SS QR     S A   + PQPS  P      S  +  AS  AS SA
Sbjct: 347  AQVKKALGNRMDDSSSHQRATNGVSAARQQVQPQPSGGPHYQPSFSGVVPGASVFASGSA 406

Query: 1137 EVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADD 1316
            E LSISQQA+VA +A+QE I +L+E+HK T +SL RTDT++ E+L+E+SSLE  L++A+ 
Sbjct: 407  EFLSISQQADVAGKALQENIRKLRETHKTTVDSLARTDTHLNEALSEISSLESGLQDAEK 466

Query: 1317 KYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVE 1496
            K+ +MQ+LR++ISVMCDFLNDKAF IEELEE MQKLHE RALAV ERRA D AD+   +E
Sbjct: 467  KFVYMQELRNYISVMCDFLNDKAFFIEELEEHMQKLHENRALAVSERRAADFADESAVIE 526

Query: 1497 SAVNAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTR 1676
            +AV+AAI+VLSKG SSA +              RES++LP ELDEFGRDINL+KRMD  R
Sbjct: 527  AAVSAAISVLSKGPSSANLSAATHAAQAAAAAARESSNLPPELDEFGRDINLQKRMDLKR 586

Query: 1677 RAESRKLRKARAESKRIASMEMDNMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFS 1856
            R E+R+ RKAR+ESKR++S        IEGELSTDESD++++AY+SSR+EL++TA+ +F 
Sbjct: 587  REENRRRRKARSESKRLSSARKSVTEHIEGELSTDESDTDTSAYLSSRDELLKTADAVFG 646

Query: 1857 DASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDF 2036
            DA+EEY++L IVK+ FE WK QY  +YRDA+VS+S PS+F+PYVRLELL WDPL++ T F
Sbjct: 647  DAAEEYSSLTIVKDKFEGWKTQYPLAYRDAHVSLSAPSVFTPYVRLELLNWDPLHETTSF 706

Query: 2037 FDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTK 2216
            FDM+W  +L  YG+  +    +P+D D NLI  + EKVALP+LHH I+HCWDIL+TQRT+
Sbjct: 707  FDMQWTNVLVGYGVQDE-DSADPNDLDLNLIQVLAEKVALPVLHHRIKHCWDILSTQRTQ 765

Query: 2217 GAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAY 2396
             AV AT MVI+YVP +SKAL +LLA++ +RL EAI D++VP W S++T+ VPGAA++AAY
Sbjct: 766  HAVDATFMVINYVPLTSKALHQLLAMVCSRLTEAIADVSVPAWGSMLTRAVPGAAEYAAY 825

Query: 2397 KFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIA 2576
            +FG+A RLL+N+CLWK +L+                  PH+KSI+  +HDAI R ER+ A
Sbjct: 826  RFGVATRLLKNVCLWKKVLAGDALERLAVEELLIGKILPHMKSIILEVHDAITRAERVAA 885

Query: 2577 SLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSL 2756
            SL G+WS P      ++KLQP  D + EL  KL+ RH  GVS EE RGLARRLKN+LV+L
Sbjct: 886  SLSGVWSSP------NKKLQPFTDFVLELSNKLKSRHISGVSEEEIRGLARRLKNILVAL 939

Query: 2757 NEYDKARAILRTFQLKEAL 2813
            NEYDKAR IL+TFQ++EAL
Sbjct: 940  NEYDKARNILKTFQIREAL 958


>gb|EAY92631.1| hypothetical protein OsI_14375 [Oryza sativa Indica Group]
          Length = 930

 Score =  808 bits (2088), Expect = 0.0
 Identities = 469/974 (48%), Positives = 619/974 (63%), Gaps = 43/974 (4%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSE-SDDANAEEKSVPSPS-TKSQTLTLXXXXXXXXXXXXRLSFADDEE 194
            MSS R KNFRRR++ ++DA  ++ S   P+ TK+QT  +            RLSF +DE+
Sbjct: 1    MSSHR-KNFRRRTDDAEDAYGDDSSNSKPTATKTQTPPVPKPRSPRRQGASRLSFVEDED 59

Query: 195  EDND--------RRPS----RIPSSSAGAASVHRLTSSKDRSKASRLASSI-----PSNV 323
            +D+         RRP+    +  ++S  AA++HRLT ++DR K+S   ++      PSN 
Sbjct: 60   DDDAEEGPLSQRRRPAATVRQARTASPAAATLHRLTPARDRLKSSPAVAAAVPAPKPSNF 119

Query: 324  QPQVGEYTKERLLELQKNARPL-GSISRSQRPP----------------AVPEPKPRKSD 452
            Q   GEYT ERL ELQKNARPL GS+ R+  PP                A P P    + 
Sbjct: 120  QSHAGEYTPERLRELQKNARPLPGSLMRAPPPPPPPTAEAPRQRLPGAAASPAPATNTTA 179

Query: 453  RPAEPVIVLKGFLKQASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPT 632
               EPV++LKG +K  S     Q  +  +    N           ++   G       P 
Sbjct: 180  AAVEPVVILKGLVKPMS-----QASIGPRNPSQNEDKDEDESEEEEEEEEG-------PV 227

Query: 633  IPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG-MPSSRPSADGSSDEEDTDFQERISLF 809
            IPD  TI+AI               D+ISLDGG + SSR +A GSSDE+D + + RI+++
Sbjct: 228  IPDRATIEAIRAKRQQLQQPRHAAPDYISLDGGGVLSSREAAGGSSDEDDDETRGRIAMY 287

Query: 810  GIKADDKLK-KGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGL 986
              K+D +   KGVF  I+ R        ++ GFR+ +                 QFRKGL
Sbjct: 288  AEKSDSQRSTKGVFGVINNRGPAASLGVINDGFREVEDEKDDDEDEEERKWEEEQFRKGL 347

Query: 987  GKRIDDTSSQRV-NYSVAPIPLHPQPSVY---PGVAHQTSASMTSASYGASRSAEVLSIS 1154
            G+R+DD S+QR  N   AP+ + PQPS Y   P      S  +   S  AS SAE LSI+
Sbjct: 348  GRRVDDASTQRAANGGPAPVQVQPQPSGYSIDPRYQPSFSGVLPGTSIFASGSAEFLSIA 407

Query: 1155 QQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQ 1334
            QQA+VAS+A+QE I +LKE+H+ T ++LV+TDT++TE+L+E+SSLE  L++A+ K+ +MQ
Sbjct: 408  QQADVASKALQENIRKLKETHRTTVDALVKTDTHLTEALSEISSLESGLQDAERKFVYMQ 467

Query: 1335 QLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAA 1514
            +LR++ISVMCDFLNDKAF IEELEE MQKLHE R                          
Sbjct: 468  ELRNYISVMCDFLNDKAFYIEELEEHMQKLHENRQYLS---------------------- 505

Query: 1515 IAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRK 1694
               LSKGSSSAY+              RES++LP ELDEFGRDIN++KRMD  RR E R+
Sbjct: 506  ---LSKGSSSAYLSAASNAAQAAAAAARESSNLPPELDEFGRDINMQKRMDLKRREEDRR 562

Query: 1695 LRKARAESKRIASMEMD-NMLQIEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEE 1871
             RK R+ESKR++S     N   IEGELSTDESDSES+AY+SSR+EL++TA+ +FSDA+EE
Sbjct: 563  RRKIRSESKRLSSEGRSANNEHIEGELSTDESDSESSAYLSSRDELLKTADLVFSDAAEE 622

Query: 1872 YANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEW 2051
            Y++L+IVK+ FE WK QY  +YRDA+V++S PS+F+PYVRLELLKWDPL++ TDFF MEW
Sbjct: 623  YSSLRIVKDKFEGWKTQYPLAYRDAHVALSAPSVFTPYVRLELLKWDPLHETTDFFGMEW 682

Query: 2052 HKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFA 2231
            HK+LF+YG        +P++ D +LIP +VEKVALPILHH I HCWDIL+TQRTK AV A
Sbjct: 683  HKILFDYGEQNSESGTDPNNVDKDLIPVLVEKVALPILHHRIMHCWDILSTQRTKNAVDA 742

Query: 2232 TNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMA 2411
             NMVISY+P SSKAL +LLA +++RL EAI D++VP W S++T+ VPGA+Q+AA++FG+A
Sbjct: 743  INMVISYLPTSSKALHQLLAAVNSRLTEAIADISVPAWGSMVTRTVPGASQYAAHRFGVA 802

Query: 2412 VRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGI 2591
            +RLL+N+CLWK+I + PV               PH+KSI+ + HDAI R ERI A L G+
Sbjct: 803  IRLLKNVCLWKDIFAKPVLEKLALEELLKGKILPHMKSIILDAHDAIARAERISALLKGV 862

Query: 2592 WSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDK 2771
            WS P      SQKLQP +D + ELG KLE+RH  G+S EETRGLARRLK++LV LNEYDK
Sbjct: 863  WSSP------SQKLQPFIDLVVELGNKLERRHMSGISEEETRGLARRLKDILVELNEYDK 916

Query: 2772 ARAILRTFQLKEAL 2813
            ARAIL+TFQ++EAL
Sbjct: 917  ARAILKTFQIREAL 930


>ref|XP_002278714.2| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Vitis
            vinifera]
          Length = 913

 Score =  803 bits (2075), Expect = 0.0
 Identities = 473/955 (49%), Positives = 613/955 (64%), Gaps = 26/955 (2%)
 Frame = +3

Query: 27   SIRAKNFRRRSE---SDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRL-SFADDEE 194
            S R +NFRRR++   +DD N +   +  P++K  T T             +L SFADDEE
Sbjct: 2    SSRPRNFRRRADDDDNDDTNGDGPPLIKPTSKPSTTTATTAAAAKPKKPPKLLSFADDEE 61

Query: 195  EDNDRR--------PSRIPSSSA-----GAASVHRLTSSKDRSKASRLASSIPSNVQPQV 335
             ++  R        PSR   +S+      ++S H++T++KDR   S  ++S+PSNVQPQ 
Sbjct: 62   NESPSRSSSRSTQPPSRPSKTSSRFTKLSSSSSHKITTTKDRLTPS--SASLPSNVQPQA 119

Query: 336  GEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQASPGRD 515
            G YTKE L ELQKN R L S SR    PA  EPKP       EPVIVLKG +K  S   D
Sbjct: 120  GTYTKEALRELQKNTRTLAS-SR----PASSEPKPS-----LEPVIVLKGLVKPISAAED 169

Query: 516  KQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXXX 695
                 V+  +              D             +IPD  TI AI           
Sbjct: 170  ----AVIDEENVEEEPESKDKGGRD-------------SIPDQATINAIRAKRERLRQSR 212

Query: 696  XXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKGVFESIDQRLTI 875
                D+ISLDGG  S+  +A+G SDEE  +FQ RI++FG K +   KKGVFE +D     
Sbjct: 213  AAAPDYISLDGG--SNHGAAEGLSDEEP-EFQGRIAMFGEKPESG-KKGVFEDVD----- 263

Query: 876  TDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVAPIP-LH 1052
              ER M+GGF+K   +               QFRKGLGKR+DD SS+ V+ SV  +  + 
Sbjct: 264  --ERGMEGGFKKDAHD--SDDEEEEKIWEEEQFRKGLGKRMDDGSSRVVSSSVPVVQKVQ 319

Query: 1053 PQPSVYPGVAHQTSASMTSA------SYGASRSAEVLSISQQAEVASRAMQETINRLKES 1214
             Q  +Y  V   TS    SA      + G     + +S+SQQAE+A +A+ E + RLKES
Sbjct: 320  QQKFMYSSVTAYTSVPGVSAPLNIGGAVGPLPGFDAMSLSQQAELAKKALHENLRRLKES 379

Query: 1215 HKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAFLI 1394
            H  T +SL RTD N++ SL+ +++LEKSL  A +K+ FMQ LRDF+SV+CDFL  KA  I
Sbjct: 380  HGRTMSSLTRTDENLSSSLSNITTLEKSLTAAGEKFIFMQXLRDFVSVICDFLQHKAPFI 439

Query: 1395 EELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLSK-GSSSAYVXXXXXX 1571
            EELEEQMQKLHE+RA A++ERRA D  D+  E++++V+AA++V +K GS+ A V      
Sbjct: 440  EELEEQMQKLHEERASAILERRAAD-NDEMMEIQASVDAAMSVFTKSGSNEAMVAAARTA 498

Query: 1572 XXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASMEMDNM 1751
                    RE  +LPV+LDE+GRDINL+K MD  RR+E+R+ ++ R ++KR+  +E ++ 
Sbjct: 499  AQAASAAMREQTNLPVKLDEYGRDINLQKCMDKNRRSEARQRKRDRWDAKRMTFLENESS 558

Query: 1752 LQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYL 1928
             Q IEGE STDESDSE+ AY S+R+ L+QTAE+IF DA+EEY+ L  VKE  ERWK QY 
Sbjct: 559  HQKIEGESSTDESDSETTAYQSNRDLLLQTAEQIFGDAAEEYSQLSAVKERIERWKKQYS 618

Query: 1929 SSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPD 2108
            SSYRDAY+S+SVP++FSPYVRLELLKWDPLY+  DF DM+WH LLFNYGL   G DF PD
Sbjct: 619  SSYRDAYMSLSVPAIFSPYVRLELLKWDPLYEEADFDDMKWHSLLFNYGLSEDGNDFSPD 678

Query: 2109 DADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKALRELL 2288
            DADANL+PE+VE+VALPILHHE+ HCWDI +T+ TK AV ATN+VI Y+PASS+AL ELL
Sbjct: 679  DADANLVPELVERVALPILHHELAHCWDIFSTRETKNAVSATNLVIRYIPASSEALGELL 738

Query: 2289 AVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVX 2468
            AV+H RL +A+T+  VP W+ ++ K VP AA+ AAY+FGM++RL+RNICLWK+IL++PV 
Sbjct: 739  AVVHKRLYKALTNFMVPPWNILVMKAVPNAARVAAYRFGMSIRLMRNICLWKDILALPVL 798

Query: 2469 XXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQPLVD 2648
                          PH+++I  ++HDAI RTERII+SL G+W+GP VT   S KLQPLVD
Sbjct: 799  EKLVLDQLLSGQVLPHIENIASDVHDAITRTERIISSLSGVWAGPSVTGERSNKLQPLVD 858

Query: 2649 CISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
             +  LG +LEKRH  GV+  +T  LARRLK MLV LNEYDKAR I RTF LKEAL
Sbjct: 859  YVLRLGKRLEKRHLPGVTESDTSRLARRLKRMLVELNEYDKARDISRTFHLKEAL 913


>ref|XP_006838726.1| hypothetical protein AMTR_s00002p00252610 [Amborella trichopoda]
            gi|548841232|gb|ERN01295.1| hypothetical protein
            AMTR_s00002p00252610 [Amborella trichopoda]
          Length = 946

 Score =  791 bits (2043), Expect = 0.0
 Identities = 460/971 (47%), Positives = 609/971 (62%), Gaps = 44/971 (4%)
 Frame = +3

Query: 33   RAKNFRRRSESD-DANAEEKSVPSPST----KSQTLTLXXXXXXXXXXXXRLSFADD--- 188
            +++NFRRR + D D N E+   P  S     K+Q  T              LSFA D   
Sbjct: 3    KSRNFRRRGDVDNDRNGEDNDAPPLSKPLSPKTQKPTTKEKKGRNSQGSKLLSFAGDGEA 62

Query: 189  ---------------------EEEDN-----DRRPSRIPSSSAGAASVHRLTSSKDRSKA 290
                                 +EED       R   + P  S+   S H++ + KDR+  
Sbjct: 63   PQKNQSERSGPKPPQRNLLSFDEEDGGSPNIQRSIRKKPGLSSSHGSSHKIIAGKDRTSI 122

Query: 291  SRLASSIPSNVQPQVGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPV 470
               + S+PSNVQPQ G+YTKE+LLELQKN + LG              KP    +PAEPV
Sbjct: 123  Q--SPSVPSNVQPQAGQYTKEKLLELQKNTKTLGG------------SKPPSETKPAEPV 168

Query: 471  IVLKGFLKQASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDP-- 644
            IVLKG +K     R  ++  V +  E +           + S G    G     +  P  
Sbjct: 169  IVLKGLVKPILEERKSEKTQVRESMENDREKFSREKEEAESSLGKMGIGQPKEEVGSPVL 228

Query: 645  --ETIKAIXXXXXXXXXXXXXXXDFISLDGGMPSSRPSADG-SSDEEDTDFQERISLFGI 815
               TI AI               D+ISLD G   S   +DG  S +++++FQ RI+L G 
Sbjct: 229  DQATINAIKAKRERLRQARMAP-DYISLDSGGARSMRDSDGLGSSDDESEFQGRIALLG- 286

Query: 816  KADDKLKKGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKR 995
            + ++  +KGVFE+ D+++      + +      D                 QFRK LGKR
Sbjct: 287  EGNNSSRKGVFENADEKVFELKREERETEVDDDD--------EEDKKWEEEQFRKALGKR 338

Query: 996  IDDTSSQRVNYSVAPIPLHP--QPSVYPGVAHQTSAS--MTSASYGASRSAEVLSISQQA 1163
            +DD S++    SVA        Q SVY G ++  ++S  +++   G +RS E ++ SQQA
Sbjct: 339  MDDNSNRGSVQSVASAGSVKAVQSSVYSGGSYHGASSGLVSNLGVGVTRSVEFMTTSQQA 398

Query: 1164 EVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLR 1343
            EVA++A+++++ RLKESH  T +S+VRTD N++ SL+ +  LEKSL  A +KY FMQ+LR
Sbjct: 399  EVATQALRDSMARLKESHDRTISSIVRTDNNLSASLSNIIDLEKSLSAAGEKYLFMQKLR 458

Query: 1344 DFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAV 1523
            DF+SV+CDFL DKA  IEELEEQMQ+LHE+RA A+V+RRADD AD+  E+E+AVNAAI+V
Sbjct: 459  DFVSVICDFLQDKAPFIEELEEQMQRLHEERASAIVQRRADDDADEMAEIEAAVNAAISV 518

Query: 1524 LSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRK 1703
             +KG S   V              +E ++LPVELDEFGRD+NL+KRMD  RRAE+RK RK
Sbjct: 519  FNKGGS---VSSAASAAQAASLAAKEQSNLPVELDEFGRDVNLQKRMDSKRRAEARKRRK 575

Query: 1704 ARAESKRIASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYAN 1880
            A +ESKRI ++   +  Q IEGE STDESDS+S AY SS +EL+QTA EIFSDA++E++N
Sbjct: 576  AWSESKRIRTVGDGSSYQRIEGESSTDESDSDSTAYRSSCDELLQTASEIFSDAADEFSN 635

Query: 1881 LKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKL 2060
            L +VK  FE WK QYL +YRDAY+S++  ++FSPYVRLELLKWDPLY  TDF DM WH L
Sbjct: 636  LSVVKVRFEGWKRQYLPTYRDAYMSMNASAIFSPYVRLELLKWDPLYKYTDFDDMRWHSL 695

Query: 2061 LFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNM 2240
            LF+YG+ A    +E DD+DA+LIP++VEKVALPILHH+I HCWD+L+T+ TK AV AT +
Sbjct: 696  LFDYGIKAGASGYESDDSDADLIPKLVEKVALPILHHDIAHCWDMLSTKETKNAVSATKL 755

Query: 2241 VISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRL 2420
            +I Y+PASS+AL+ELL  + TRL+EA++ L VP WS+++   VP AAQ AAY+FG +VRL
Sbjct: 756  LIDYIPASSEALQELLVSVRTRLSEAVSKLKVPTWSTLVINAVPQAAQIAAYRFGTSVRL 815

Query: 2421 LRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSG 2600
            ++NICLWK+I+++PV               PHV++IMPNIHDAI RTER++ASL G+W+G
Sbjct: 816  MKNICLWKDIIALPVLEQLVLDELLCARVLPHVRNIMPNIHDAITRTERVVASLAGVWTG 875

Query: 2601 PEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARA 2780
             ++    S KLQPLVD +  LG  LEK+HALGVS EET GLARRLK MLV LNEYDK RA
Sbjct: 876  RDLIGDRSSKLQPLVDYLMSLGKTLEKKHALGVSTEETTGLARRLKCMLVELNEYDKGRA 935

Query: 2781 ILRTFQLKEAL 2813
            ILRTFQL+EAL
Sbjct: 936  ILRTFQLREAL 946


>ref|XP_006379383.1| hypothetical protein POPTR_0008s00320g [Populus trichocarpa]
            gi|550332058|gb|ERP57180.1| hypothetical protein
            POPTR_0008s00320g [Populus trichocarpa]
          Length = 972

 Score =  770 bits (1989), Expect = 0.0
 Identities = 450/991 (45%), Positives = 599/991 (60%), Gaps = 61/991 (6%)
 Frame = +3

Query: 24   SSIRAKNFRRRSESDD----ANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRLSFADDE 191
            SS +++NFRRR + DD    AN       + +T S T                LSFA+DE
Sbjct: 3    SSSKSRNFRRRGDVDDEKTDANTNNTDTNAKATPSTTRKPPPPQSTKPKPKKLLSFAEDE 62

Query: 192  EEDNDRRPSRIPSSSAG--------AASVHRLTSSKDRSKASRLASSIPSNVQPQVGEYT 347
            E++  +  +RIPSS +         ++S H+LT S+DR   +    +  SNVQPQ G YT
Sbjct: 63   EDE--QAVTRIPSSKSKPKPKPKPTSSSSHKLTVSQDRLPPTTSYLTTASNVQPQAGTYT 120

Query: 348  KERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQA-SPGRDKQE 524
            KE LLELQ+N R L   +++  P +  EPK           I+LKG LK + SP  +   
Sbjct: 121  KEALLELQRNTRTLAKSTKTTTPASASEPK-----------IILKGLLKPSFSPSPNPNP 169

Query: 525  GVVLKRQETNXXXXXXXXXXXDDSNG-------------GSLTGAKFPTIPDPETIKAIX 665
                  Q+ +           D  NG             G  T   +   PD +TIK I 
Sbjct: 170  NYSSNHQQQDDADDQSEDENEDKDNGADDAQNRLASMGLGKSTSDDYSCFPDEDTIKKIR 229

Query: 666  XXXXXXXXXXXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKG- 842
                          D+ISLD G      +  G   +E+ +F+ RI++ G    D    G 
Sbjct: 230  AKRERLRQSRAAAPDYISLDSGS-----NHQGGFSDEEPEFRTRIAMIGTMTKDTATHGG 284

Query: 843  VFESI-DQRLTITDERKM------------------DGGFRKGDINIXXXXXXXXXXXXX 965
            VF++  D      D+R +                  DG        +             
Sbjct: 285  VFDAAADDDEDDDDDRSIKAKALAMMGTHHHHAVVDDGNVAAAASVVHDEEDEEDRIWEE 344

Query: 966  XQFRKGLGKRIDDTSSQRVNYSVAP---------IPLHPQPSVYPGVAHQTSASMTSASY 1118
             QFRKGLGKR+DD S+   N ++A          IP+ PQ    PG     S      ++
Sbjct: 345  EQFRKGLGKRMDDASAPIANRALASTAGAAASSTIPMQPQQRPTPGYG---SIPSIGGAF 401

Query: 1119 GASRSAEVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKS 1298
            G+S+  +VLSI QQA++A +A+Q+ + RLKESH  T + L +TD N++ SL  V++LEKS
Sbjct: 402  GSSQGLDVLSIPQQADIAKKALQDNLRRLKESHGRTISLLSKTDENLSASLMNVTALEKS 461

Query: 1299 LKEADDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIAD 1478
            +  A +K+ FMQ+LRDF+SV+C+FL  KA LIEELEE+MQKLHE++A  ++ERR  D  D
Sbjct: 462  ISAAGEKFIFMQKLRDFVSVICEFLQHKATLIEELEERMQKLHEEQASLILERRTADNED 521

Query: 1479 DDNEVESAVNAAIAVLS-KGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLK 1655
            +  EVE+AV AA++V S +G+S+A +              ++ A+LPV+LDEFGRDINL+
Sbjct: 522  EMMEVEAAVKAAMSVFSARGNSAATIDAAKSAAAAALVALKDQANLPVKLDEFGRDINLQ 581

Query: 1656 KRMDFTRRAESRKLRKARAESKRIASMEMDNMLQ-IEGELSTDESDSESN---AYISSRN 1823
            KRMD  +RA++R+ RKAR +SKR++ ME+D+  Q IEGELSTDESDS+S    AY S+R+
Sbjct: 582  KRMDMEKRAKARQRRKARFDSKRLSYMEVDSSDQKIEGELSTDESDSDSEKNAAYQSTRD 641

Query: 1824 ELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELL 2003
             L++TAEEIFSDASEEY+ L +VKE FE WK +Y +SYRDAY+S+S P++FSPYVRLELL
Sbjct: 642  LLLRTAEEIFSDASEEYSQLSVVKERFETWKKEYFASYRDAYMSLSAPAIFSPYVRLELL 701

Query: 2004 KWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEH 2183
            KWDPL++ +DFFDM+WH LLFNYGLP  G D  PDD DANL+P +VEK+A+PIL+HEI H
Sbjct: 702  KWDPLHEDSDFFDMKWHSLLFNYGLPEDGSDLNPDDVDANLVPGLVEKIAIPILYHEIAH 761

Query: 2184 CWDILNTQRTKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITK 2363
            CWD+L+TQ TK A+ AT++VI+YVPA+S+AL ELLA I TRL +A+    VP WS ++ K
Sbjct: 762  CWDMLSTQETKNAISATSLVINYVPATSEALSELLAAIRTRLADAVASTVVPTWSLLVLK 821

Query: 2364 VVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIH 2543
             VP AAQ AAY+FGM+VRL+RNICLWK+IL++PV               PHV+SI  N+H
Sbjct: 822  AVPSAAQVAAYRFGMSVRLMRNICLWKDILALPVLEKLVLDELLCGKVLPHVRSIASNVH 881

Query: 2544 DAIMRTERIIASLVGIWSGPEVTLG-TSQKLQPLVDCISELGGKLEKRHALGVSLEETRG 2720
            DA+ RTERI+ASL   W+GP  T   +S KLQPLVD I  +G  LEKRH  GV+  ET G
Sbjct: 882  DAVTRTERIVASLSRAWAGPSATSDHSSHKLQPLVDFILSIGMTLEKRHVSGVTETETSG 941

Query: 2721 LARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
            LARRLK MLV LN+YD AR + RTF LKEAL
Sbjct: 942  LARRLKKMLVELNDYDNARDMARTFHLKEAL 972


>tpg|DAA52554.1| TPA: hypothetical protein ZEAMMB73_777539 [Zea mays]
          Length = 935

 Score =  768 bits (1982), Expect = 0.0
 Identities = 459/987 (46%), Positives = 618/987 (62%), Gaps = 56/987 (5%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSE-SDDANAEEKSVPSPST----KSQTLTLXXXXXXXXXXXX-RLSFA 182
            MSS R KNFRRR + ++DAN +  S P PST    K++TLT+             RLSFA
Sbjct: 1    MSSHR-KNFRRRGDDAEDANGDGGSHPKPSTTTATKTKTLTVPKPKSPPRRQGASRLSFA 59

Query: 183  DDEEEDNDR---------------RPSRIPSSSAGAASVHRLTSSKDRSKAS------RL 299
            DDE+ED+                 RP+R  S +AGA  +HRLT +++R K+S       +
Sbjct: 60   DDEDEDDAEAGPFAQRRLPPTASVRPARTASPAAGA--LHRLTPARERIKSSPAPAGAAV 117

Query: 300  ASSIPSNVQPQVGEYTKERLLELQKNARPL-GSISRSQRPPAVPEPKPRK------SDRP 458
            ++  PSN Q   GEYT ERL ELQKNARPL GS+ R+Q      EP+ +K      S  P
Sbjct: 118  SAPKPSNFQSHAGEYTPERLRELQKNARPLPGSLLRAQPRAPATEPRSQKLSGTPASSTP 177

Query: 459  A-------EPVIVLKGFLKQAS--------PGRDKQEGVVLKRQETNXXXXXXXXXXXDD 593
            A       E V+VLKG +K  S        P  DK+E    K +E             D+
Sbjct: 178  ATTTAAATETVVVLKGLVKPMSEASIGPRIPKHDKEED---KSEEEG---------KGDE 225

Query: 594  SNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLD-GGMPSSRPSADGSSD 770
             + G       P IPD  TI+AI               D+ISLD GG+ SSR +A  SSD
Sbjct: 226  EDEG-------PVIPDRATIEAIRAKRQQRQQPRHAAPDYISLDAGGVLSSRNAAGESSD 278

Query: 771  EEDTDFQERISLFGIKADD--KLKKGVFESIDQRLTITDERKMDGGFRK-GDINIXXXXX 941
            E+D +  +RI+++  K  D  +  KGVF  I  R   T       G R   D        
Sbjct: 279  EDDNEITDRIAMYTDKPGDGPRSTKGVFSGISNRGPATSLGAFSDGSRNVEDDRDDDDDE 338

Query: 942  XXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVAPIPLHPQPSVYPGVAHQTSASMTSASYG 1121
                     QFRKGLG+R+DD     V+                    +   S  +    
Sbjct: 339  EEERKWEEEQFRKGLGRRMDDAFYSEVS--------------------KWGTSCYAGPAT 378

Query: 1122 ASRSAEVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSL 1301
            A    + LSI+QQA+VA++A+Q+ I +L+E+HK T ++LV+TDT++ E+L+E+SSLE  L
Sbjct: 379  AIWIPKFLSIAQQADVANKALQDNIRKLRETHKTTVSALVKTDTHLNEALSEISSLESGL 438

Query: 1302 KEADDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADD 1481
            ++A+ ++ +MQ+LRD+ISVMCDFLNDKAFLIEELEE +Q+LHEKRALA+ ERRA D+AD+
Sbjct: 439  QDAEKRFVYMQELRDYISVMCDFLNDKAFLIEELEENIQQLHEKRALAISERRAADLADE 498

Query: 1482 DNEVESAVNAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKR 1661
               +E+AV+AA+++LSKGSSS  +              R S++L  ELDEFGRDIN++KR
Sbjct: 499  SGVIEAAVSAAVSILSKGSSSTCLSAASNAAQAAAAAARGSSNLQPELDEFGRDINMQKR 558

Query: 1662 MDFTRRAESRKLRKARAESKRIASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQT 1838
            MD  RR E R+ RK ++E+KR+AS   +  ++ IEGELSTDESDSES AY+SSR+E ++ 
Sbjct: 559  MDLKRREEDRRRRKTQSETKRLASAAKNKDIEKIEGELSTDESDSESTAYVSSRDEFLKA 618

Query: 1839 AEEIFSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPL 2018
            A+ +F DA EEY++L+IVK+ FE WK QY S+YRDA+V++S PS+FSPYVRLELLKWDPL
Sbjct: 619  ADHVFIDAKEEYSSLRIVKDKFEGWKAQYPSAYRDAHVALSAPSVFSPYVRLELLKWDPL 678

Query: 2019 YDATDFFDMEWHKLLFNYGLPAKGQDFEPDDA--DANLIPEIVEKVALPILHHEIEHCWD 2192
            ++ TDFFDM+WHK+LF+YG+    QD E      D++++P +VEKVALPILHH IE CWD
Sbjct: 679  HETTDFFDMDWHKVLFDYGV----QDDESPSGSNDSDVVPVLVEKVALPILHHRIERCWD 734

Query: 2193 ILNTQRTKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVP 2372
            +L+TQ T+ AV A+ MVI Y+P SSK L  LLA + +RL +A+ DL+VP W S++T+ VP
Sbjct: 735  VLSTQGTRKAVEASRMVIGYLPTSSKDLHRLLAAVSSRLTQAVADLSVPAWGSMVTRTVP 794

Query: 2373 GAAQFAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAI 2552
            GA+Q+AAY+FG+AVRLL+N+CLWK+IL+  V               PH+KSI+ ++HDAI
Sbjct: 795  GASQYAAYRFGVAVRLLKNVCLWKDILADHVVEKLALDELLRGKILPHMKSIILDVHDAI 854

Query: 2553 MRTERIIASLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARR 2732
             R ER+ A+L  +W         +QKL+P  D ++ELG KLE+RHA G+S +ETRGLARR
Sbjct: 855  TRAERVAAALSEVWP------KQNQKLRPFADLVAELGNKLERRHASGISEDETRGLARR 908

Query: 2733 LKNMLVSLNEYDKARAILRTFQLKEAL 2813
            LKN+L  LNEYDKARAI + F L+EAL
Sbjct: 909  LKNILAVLNEYDKARAISKAFHLREAL 935


>gb|EMT06523.1| GC-rich sequence DNA-binding factor-like protein [Aegilops tauschii]
          Length = 845

 Score =  766 bits (1979), Expect = 0.0
 Identities = 432/861 (50%), Positives = 556/861 (64%), Gaps = 34/861 (3%)
 Frame = +3

Query: 333  VGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDR------------------- 455
            + EYT ERL ELQKNARPL     +  P   P P P    R                   
Sbjct: 2    LAEYTPERLRELQKNARPLPWEPHAVLPAPPPPPPPAAESRHQRPAGAAASTSSAPAAAG 61

Query: 456  ---PAEPVIVLKGFLK---QASPGRDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTG 617
               PAEPV+VLKG +K   QAS G   +   +    + N           D+  G     
Sbjct: 62   KAVPAEPVVVLKGLVKPMAQASIGPSPRP--LPNEVQDNDSEEEAEDDGEDEEKG----- 114

Query: 618  AKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGG--MPSSRPSADGSSDEEDTDFQ 791
               P IPD  TI+AI               DFISLDGG  + S R +A GSSDE+D + +
Sbjct: 115  ---PLIPDKATIEAIRAKRQQLQQPRHAAPDFISLDGGGVLSSRRDAAGGSSDEDDNEME 171

Query: 792  ERISLFGIKADD--KLKKGVFESIDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXX 965
             RI+++  K  D  +  KGVF+ I+ R        M   F + + +              
Sbjct: 172  GRIAMYSQKTSDGQRSSKGVFQGINNRGPAASLGAMKDRFMEVEDDEVDDEEEEERKWEE 231

Query: 966  XQFRKGLGKRIDDTSSQRVNYSVAPI--PLHPQPSVYPGVAHQT---SASMTSASYGASR 1130
             Q +K LG R+DD+S+QR    V      + PQPS Y G  H     S  +  AS  AS 
Sbjct: 232  AQVKKALGNRMDDSSAQRATNGVPASRQQVQPQPSGYSGGPHYQPSFSGVVPGASVFASG 291

Query: 1131 SAEVLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEA 1310
            SAE LSISQQA+VAS+A+QE I +LKESHK T +SL RTDT++ E+L+E+SSLE  L++A
Sbjct: 292  SAEFLSISQQADVASKALQENIRKLKESHKTTVDSLARTDTHLNEALSEISSLEGGLQDA 351

Query: 1311 DDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNE 1490
            + K+ +MQ+LR++ISVMCDFLNDKAF IEELEE MQKLHE RALAV ERRA D AD+   
Sbjct: 352  EKKFVYMQELRNYISVMCDFLNDKAFFIEELEEHMQKLHENRALAVSERRAADFADESGV 411

Query: 1491 VESAVNAAIAVLSKGSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDF 1670
            +E+AV+AAI+VLSKG SSA +              RESA+LP ELDEFGRDINL+KRMD 
Sbjct: 412  IEAAVSAAISVLSKGPSSANLSAASHAAQAAATAARESANLPPELDEFGRDINLQKRMDL 471

Query: 1671 TRRAESRKLRKARAESKRIASMEMDNMLQIEGELSTDESDSESNAYISSRNELIQTAEEI 1850
             RR E+R+ RKAR+ESKR++S        IEGELSTDESD++++AY+SSR+EL++TA+ +
Sbjct: 472  KRREENRRQRKARSESKRLSSARKSATEHIEGELSTDESDTDTSAYLSSRDELLKTADAV 531

Query: 1851 FSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDAT 2030
            FSDA+EEY++L IVK+ FE WK QY  +YRDA+VS+SVPS+F+PYVRLELL WDPL++ T
Sbjct: 532  FSDAAEEYSSLTIVKDKFEGWKTQYPLAYRDAHVSLSVPSVFTPYVRLELLNWDPLHETT 591

Query: 2031 DFFDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQR 2210
             FFDM+W  +L  YG+  +    +P+D D NLI  + EKVALP+LHH I+HCWDIL+TQR
Sbjct: 592  SFFDMQWTNVLVGYGVQDE-DSADPNDLDLNLIQVLAEKVALPVLHHRIKHCWDILSTQR 650

Query: 2211 TKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFA 2390
            T+ AV AT MVI+YVP +SKAL +LLA + +RL EAI D++VP W S++T+ VPGAA++A
Sbjct: 651  TQHAVDATFMVINYVPVTSKALHQLLATVCSRLTEAIADVSVPAWGSMLTRAVPGAAEYA 710

Query: 2391 AYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERI 2570
            AY+FG+A RLL+N+CLWK +L++                 PH+KSI+  +HDAI R ERI
Sbjct: 711  AYRFGVATRLLKNVCLWKKVLAVDALEKLALDELLIGKILPHMKSIILEVHDAITRAERI 770

Query: 2571 IASLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLV 2750
             ASL G+WS P      ++KLQP  D + EL  KL+ RH  GVS EE RGLARRLKN+LV
Sbjct: 771  AASLSGVWSSP------NKKLQPFTDLVLELSNKLKSRHISGVSEEEIRGLARRLKNILV 824

Query: 2751 SLNEYDKARAILRTFQLKEAL 2813
            +LNEYDKAR IL+TFQ++EAL
Sbjct: 825  ALNEYDKARNILKTFQIREAL 845


>gb|EOY19310.1| GC-rich sequence DNA-binding factor-like protein, putative isoform 1
            [Theobroma cacao] gi|508727414|gb|EOY19311.1| GC-rich
            sequence DNA-binding factor-like protein, putative
            isoform 1 [Theobroma cacao]
          Length = 934

 Score =  766 bits (1978), Expect = 0.0
 Identities = 461/977 (47%), Positives = 593/977 (60%), Gaps = 47/977 (4%)
 Frame = +3

Query: 24   SSIRAKNFRRRSES--DDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXR----LSFAD 185
            S+IRA+NFRRR +   DD N +  +   P+  S T+T             +    LSFAD
Sbjct: 3    SAIRARNFRRRGDDIDDDGNDDNNT---PNIASATVTATKKPSSSKPTAKKPPKLLSFAD 59

Query: 186  DEEEDNDRRPSR-----------IPSSSAGAASVHRLTSSKDRSKASRLASSIPSNVQPQ 332
            DE E+   +PS              S  +   S H++TS+KD     +  S++PSNVQPQ
Sbjct: 60   DENEEETTKPSSNRNRDKEREKPFSSRVSKPLSAHKITSTKD----CKTPSTLPSNVQPQ 115

Query: 333  VGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQASPG- 509
             G YTKE LLELQKN R L +            P  R S   +EP IVLKG LK  S   
Sbjct: 116  AGTYTKEALLELQKNMRTLAA------------PSSRASSVSSEPKIVLKGLLKPQSQNL 163

Query: 510  ---RDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXX 680
               RD      L++ +T                 G      F   PD  TI AI      
Sbjct: 164  NSERDNDPPEKLQKDDTESRLATMA--------AGKGVDLDFSAFPDQATIDAIKAKKDR 215

Query: 681  XXXXXXXXX-DFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKGVFESI 857
                      D+ISLD G        +  SD+E+ +F  R  LFG    +  KKGVFE I
Sbjct: 216  VRKSFARPAPDYISLDRGSNLGGAMEEELSDDEEPEFPGR--LFG----ESGKKGVFEVI 269

Query: 858  DQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSS-------- 1013
            ++R      RK DG   + D +               QFRKGLGKR+DD+S+        
Sbjct: 270  EERAVGVGLRK-DGIHDEDDDD-----NEEEKMWEEEQFRKGLGKRMDDSSNRVVSSSNN 323

Query: 1014 ---------------QRVNYSVAPIPLHPQPSVYPGVAHQTSASMTSASYGASRSAEVLS 1148
                           QR  YS     +    S+ P V+    +S+  A+ GAS+  +V S
Sbjct: 324  SGGVGMVHNMQQQHQQRYGYST----MGSYGSMMPSVSPAPPSSIVGAA-GASQGLDVTS 378

Query: 1149 ISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNF 1328
            ISQQAE+  +A+QE + RLKESH  T +SL + D N++ SL  +++LEKSL  A +K+ F
Sbjct: 379  ISQQAEITKKALQENVRRLKESHDRTISSLTKADENLSASLFNITALEKSLSAAGEKFIF 438

Query: 1329 MQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVN 1508
            MQ+LRDF+SV+C+FL  KA LIEELEE MQKL+E+RAL+V+ERR+ +  D+  EVE+AV 
Sbjct: 439  MQKLRDFVSVICEFLQHKAPLIEELEEHMQKLNEERALSVLERRSANNDDEMVEVEAAVT 498

Query: 1509 AAIAVLSK-GSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAE 1685
            AA+ V S+ G+S+A +              R   +LPV+LDEFGRD+N +K +D  RRAE
Sbjct: 499  AAMLVFSECGNSAAMIEVAANAAQAAAAAIRGQVNLPVKLDEFGRDVNRQKHLDMERRAE 558

Query: 1686 SRKLRKARAESKRIASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDA 1862
            +R+ RKAR +SKR++SME+D+  Q IEGE STDESDSES AY S+R+ L+QTA+EIF DA
Sbjct: 559  ARQRRKARFDSKRLSSMEIDSSYQKIEGESSTDESDSESTAYRSNRDMLLQTADEIFGDA 618

Query: 1863 SEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFD 2042
            SEEY+ L +VKE FERWK  Y SSYRDAY+S+S+P++FSPYVRLELLKWDPL+   DF D
Sbjct: 619  SEEYSQLSLVKERFERWKKDYSSSYRDAYMSLSIPAIFSPYVRLELLKWDPLHVDEDFSD 678

Query: 2043 MEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGA 2222
            M+WH LLFNYG P  G  F PDDADANL+P +VEKVALP+LHHEI HCWD+L+ Q TK A
Sbjct: 679  MKWHNLLFNYGFPEDGS-FAPDDADANLVPALVEKVALPVLHHEISHCWDMLSMQETKNA 737

Query: 2223 VFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKF 2402
            V AT+++I YVPASS+AL ELL  I TRL+EA+ D+ VP WS ++ K VP AA+ AAY+F
Sbjct: 738  VSATSLIIDYVPASSEALAELLVTIRTRLSEAVADIMVPTWSPLVMKAVPNAARVAAYRF 797

Query: 2403 GMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASL 2582
            GM+VRL+RNICLWK IL++P+               PHV++I  ++HDA+ RTERI+ASL
Sbjct: 798  GMSVRLMRNICLWKEILALPILEKLALDELLYGKILPHVRNITSDVHDAVTRTERIVASL 857

Query: 2583 VGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNE 2762
             G+W+G  V   +S+KLQPLVD +  LG  LE+RHA GV+   T GLARRLK MLV LNE
Sbjct: 858  SGVWAGTNVIQDSSRKLQPLVDYVLLLGKTLERRHASGVTESGTGGLARRLKKMLVELNE 917

Query: 2763 YDKARAILRTFQLKEAL 2813
            YD AR I R F LKEAL
Sbjct: 918  YDSARDIARRFHLKEAL 934


>ref|XP_004298307.1| PREDICTED: GC-rich sequence DNA-binding factor 1-like [Fragaria vesca
            subsp. vesca]
          Length = 914

 Score =  761 bits (1965), Expect = 0.0
 Identities = 457/964 (47%), Positives = 598/964 (62%), Gaps = 33/964 (3%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXR-------LSF 179
            MSS R KNFRRR + DD    +    +PST S   +L                    LSF
Sbjct: 1    MSSARPKNFRRRIDDDD----DDDADTPSTTSTLKSLSKPSSSAAKPKKPQSQAPKLLSF 56

Query: 180  ADDEEEDNDRRPSRIPSSS-----------AGAASVHRLTSSKDR---SKASRLASSIPS 317
             DDEE   +  PSR  SSS           A  +S H+LT++KDR   S +S  ++S+PS
Sbjct: 57   VDDEE---NATPSRSSSSSSKRDKSSSSRLAKPSSAHKLTAAKDRLVNSTSSTASASLPS 113

Query: 318  NVQPQVGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQ 497
            NVQPQ G YTKE L ELQKN R L S   S    A            AEP IVL+G +K 
Sbjct: 114  NVQPQAGTYTKEALRELQKNTRTLASSRTSSAAAA------------AEPTIVLRGSIKP 161

Query: 498  ASPG-RDKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXX 674
            A     D   G                     DS+     G+K    PD  TI+AI    
Sbjct: 162  ADASIADAVNGA-----------------RELDSDDEEQQGSK-DRYPDQATIEAIRKKR 203

Query: 675  XXXXXXXXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDKLKKGVFES 854
                       DFI+LD G  S+  +A+G SDEE  +F+ RI++FG K ++K  KGVFE 
Sbjct: 204  ERLRKSKPAAPDFIALDSG--SNHGAAEGLSDEEP-EFRNRIAMFGEKMENK--KGVFED 258

Query: 855  IDQRLTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRID-DTSSQRVNYS 1031
            +D       +  +DGG R+  + +              QFRKGLGKR+D D +S  V+ S
Sbjct: 259  VD-------DTGVDGGLRRESVVVEDDEDEEEKIWEEEQFRKGLGKRVDNDGASLGVSAS 311

Query: 1032 VAPI-PLHPQPSV-YPGVAH----QTSASMTS--ASYGASRSAEVLSISQQAEVASRAMQ 1187
            V  +    PQP   Y  +A     Q+ A + S   + GAS+ +  LSI++Q+E+A +A+ 
Sbjct: 312  VPRVHSAAPQPKASYNSIAGYSLAQSLAGVASIGGATGASQGSNALSINEQSEIAQKALL 371

Query: 1188 ETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCD 1367
            E + +LKESH  T  SL + + +++ SL  ++ LEKSL  AD+KY FMQ+LRDF+S +CD
Sbjct: 372  ENVRKLKESHGRTKMSLTKANESLSASLLNITDLEKSLSAADEKYKFMQELRDFVSTICD 431

Query: 1368 FLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLSK-GSSS 1544
            FL DKA LIEELEE+MQK  ++RA A+ ERR  D  D+  EVE+AVNAA+++ SK G+S+
Sbjct: 432  FLQDKAPLIEELEEEMQKQRDERASAIFERRIADNDDEMMEVEAAVNAAMSIFSKEGTSA 491

Query: 1545 AYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKR 1724
              +              RE  +LPV+LDEFGRD+NLKKR+D   RAE+R+ R+ R E+KR
Sbjct: 492  GVIAVAKSAAQAASAAVREQKNLPVKLDEFGRDMNLKKRLDMKGRAEARQRRRKRYEAKR 551

Query: 1725 IASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEW 1901
             +SM++D+  + +EGE STDESD ES  Y S R  ++ TA+++FSDA+EEY+ L +VKE 
Sbjct: 552  ESSMDVDSPDRTVEGESSTDESDGESKEYESHRQLVLGTADQVFSDAAEEYSQLSLVKER 611

Query: 1902 FERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLP 2081
            FE+WK +Y SSYRDAY+S+SVP +FSPYVRLELLKWDPL + TDF  M WH+LL NYG+P
Sbjct: 612  FEKWKREYRSSYRDAYMSLSVPIIFSPYVRLELLKWDPLRENTDFVKMSWHELLENYGVP 671

Query: 2082 AKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPA 2261
              G DF  DDADANLIP +VEKVALPILHH+I HCWDIL+T+ TK AV AT++V  YV +
Sbjct: 672  EDGSDFASDDADANLIPALVEKVALPILHHQIVHCWDILSTRETKNAVAATSLVTDYV-S 730

Query: 2262 SSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLW 2441
            SS+AL +LL  I TRL +A++ L VP WS ++ K VP AA+ AAY+FGM+VRL++NICLW
Sbjct: 731  SSEALEDLLVAIRTRLADAVSKLMVPTWSPLVLKAVPNAARIAAYRFGMSVRLMKNICLW 790

Query: 2442 KNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGT 2621
            K IL++PV               PH++SI  ++HDA+ RTER+IASL G+WSG +VT   
Sbjct: 791  KEILALPVLEKLAINELLCGKVIPHIRSIAADVHDAVTRTERVIASLSGVWSGSDVTGDR 850

Query: 2622 SQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQL 2801
            S+KLQ LVD +  LG  +EK+H+LGV+  ET GLARRLK MLV LNEYDKAR + RTF L
Sbjct: 851  SRKLQSLVDYVLTLGKTIEKKHSLGVTQSETGGLARRLKKMLVELNEYDKARDVARTFHL 910

Query: 2802 KEAL 2813
            KEAL
Sbjct: 911  KEAL 914


>ref|XP_006468681.1| PREDICTED: PAX3- and PAX7-binding protein 1-like [Citrus sinensis]
          Length = 913

 Score =  757 bits (1954), Expect = 0.0
 Identities = 441/955 (46%), Positives = 591/955 (61%), Gaps = 24/955 (2%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRLSFADDEEED 200
            MSS RA+NFRRR++ D+ N ++ + PS +T + T                LSFADDEEE 
Sbjct: 1    MSSSRARNFRRRADDDEDNNDDNT-PSAATTTAT----KKPPSSSKPKKLLSFADDEEEK 55

Query: 201  ND-----RRPSRIPSSSAGAASVHRLTSSKDRSKASRLASS--IPSNVQPQVGEYTKERL 359
            ++     R  +R  S  +  +S H++T+SK+R  +S  +SS  + SNVQ Q G YT+E L
Sbjct: 56   SEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSNVQAQAGTYTEEYL 115

Query: 360  LELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLK---------QASPGR 512
            LEL+KN + L +          P  KP     PAEPV+VL+G +K         Q  P R
Sbjct: 116  LELRKNTKTLKA----------PSSKP-----PAEPVVVLRGSIKPEDSNLTRVQQKPSR 160

Query: 513  DKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXX 692
            D  +     + ET              S G      +   I D   IKAI          
Sbjct: 161  DSSDSDSDHKAETEKRFA---------SLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQS 211

Query: 693  XXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIK-ADDKLKKGVFESIDQRL 869
                 D+I LDGG  S R  A+GSSDEE  +F  R+++FG + A  K KKGVFE  D   
Sbjct: 212  GAKAPDYIPLDGGSSSLRGDAEGSSDEEP-EFPRRVAMFGERTASGKKKKGVFEDDD--- 267

Query: 870  TITDER----KMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVA 1037
               DER    +++  +   D ++              Q RKGLGKRIDD S +    + +
Sbjct: 268  VDEDERPVVARVENDYEYVDEDVMWEEE---------QVRKGLGKRIDDGSVRVGANTSS 318

Query: 1038 PIPLHPQPSVYPGVAHQTSASMTSASYGASRSAEVLSISQQAEVASRAMQETINRLKESH 1217
             + +  Q   +      T       + GAS+  + +SI+Q+AE A +A+Q  +NRLKESH
Sbjct: 319  SVAMPQQQQQFSYSTTVTPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLKESH 378

Query: 1218 KITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAFLIE 1397
              T +SL +TD +++ SL +++ LE SL  A +K+ FMQ+LRD++SV+CDFL DKA  IE
Sbjct: 379  ARTMSSLKKTDEDLSSSLLKITDLESSLSAAGEKFIFMQKLRDYVSVICDFLQDKAPYIE 438

Query: 1398 ELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLSK--GSSSAYVXXXXXX 1571
             LE +MQKL+++RA A++ERRA D  D+  EVE+A+ AA  V+     S+S  +      
Sbjct: 439  TLEAEMQKLNKERASAILERRAADNDDEMTEVEAAIKAATLVIGDRGNSASKLIAASSAA 498

Query: 1572 XXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASMEMDNM 1751
                    +E  +LPV+LDEFGRD+NL+KR D  RRAESR+ R+ R + K+++SM+ D  
Sbjct: 499  QAAAAAAVKEQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDADIS 558

Query: 1752 LQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYL 1928
             Q +EGE +TDESDSE+ AY S+R EL++TAE IFSDA+EEY+ L +VKE FE+WK  Y 
Sbjct: 559  SQKLEGESTTDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKRDYS 618

Query: 1929 SSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPD 2108
            SSYRDAY+S+S P++ SPYVRLELLKWDPL++  DF +M+WH LLFNYGLP  G+DF  D
Sbjct: 619  SSYRDAYMSLSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDFAHD 678

Query: 2109 DADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKALRELL 2288
            DADANL+P +VEKVALPILHH+I +CWD+L+T+ TK AV AT +V++YVP SS+AL++LL
Sbjct: 679  DADANLVPTLVEKVALPILHHDIAYCWDMLSTRETKNAVSATILVMAYVPTSSEALKDLL 738

Query: 2289 AVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVX 2468
              IHTRL EA+ ++ VP WSS+    VP AA+ AAY+FG++VRL+RNICLWK + ++P+ 
Sbjct: 739  VAIHTRLAEAVANIAVPTWSSLAMSAVPNAARIAAYRFGVSVRLMRNICLWKEVFALPIL 798

Query: 2469 XXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQPLVD 2648
                          PHV+SI  N+HDAI RTERI+ASL G+W+GP VT     KLQPLVD
Sbjct: 799  EKLALDELLCRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQPLVD 858

Query: 2649 CISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
             +  L   LEK+H  GV+  ET GLARRLK MLV LNEYD AR I RTF LKEAL
Sbjct: 859  FMLSLAKTLEKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913


>ref|XP_006448500.1| hypothetical protein CICLE_v10014191mg [Citrus clementina]
            gi|557551111|gb|ESR61740.1| hypothetical protein
            CICLE_v10014191mg [Citrus clementina]
          Length = 913

 Score =  750 bits (1937), Expect = 0.0
 Identities = 442/958 (46%), Positives = 590/958 (61%), Gaps = 27/958 (2%)
 Frame = +3

Query: 21   MSSIRAKNFRRRSESDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRLSFADDEEED 200
            MSS RA+NFRRR++ D+ N ++ +   PS  + T T              LSFADDEEE 
Sbjct: 1    MSSSRARNFRRRADDDEDNNDDNT---PSVATTTATKKPPSSSKPKKL--LSFADDEEEK 55

Query: 201  ND-----RRPSRIPSSSAGAASVHRLTSSKDRSKASRLASS--IPSNVQPQVGEYTKERL 359
            ++     R  +R  S  +  +S H++T+SK+R  +S  +SS  + SNVQ Q G YT+E L
Sbjct: 56   SEIPTSNRDRTRPSSRLSKPSSSHKITASKERQSSSATSSSTSLLSNVQAQAGTYTEEYL 115

Query: 360  LELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLK---------QASPGR 512
            LEL+KN + L +          P  KP     PAEPV+VL+G +K         Q  P R
Sbjct: 116  LELRKNTKTLKA----------PSSKP-----PAEPVVVLRGSIKPEDSNLTRVQQKPSR 160

Query: 513  DKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXX 692
            D  +     + ET              S G      +   I D   IKAI          
Sbjct: 161  DSSDSDSDHKAETEKRFA---------SLGVGKIAVQSGVIYDEAEIKAIRAKKDRLRQS 211

Query: 693  XXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISLFGIK-ADDKLKKGVFESIDQRL 869
                 D+I LDGG  S R  A+GSSDEE  +F  R+++FG + A  K KKGVFE  D   
Sbjct: 212  GAKAPDYIPLDGGSSSLRGDAEGSSDEEP-EFPRRVAMFGERTASGKKKKGVFEDDD--- 267

Query: 870  TITDER----KMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQ---RVNY 1028
               DER    +++  +   D ++              Q RKGLGKRIDD+S +     + 
Sbjct: 268  VDEDERPVVARVENDYEYVDEDVMWEEE---------QVRKGLGKRIDDSSVRVGANTSS 318

Query: 1029 SVAPIPLHPQPSVYPGVAHQTSASMTSASYGASRSAEVLSISQQAEVASRAMQETINRLK 1208
            SVA +P   Q   YP     T       + GAS+  + +SI+Q+AE A +A+Q  +NRLK
Sbjct: 319  SVA-MPQQQQQFSYPTTV--TPIPSIGGAIGASQGLDTMSIAQKAESAMKALQTNVNRLK 375

Query: 1209 ESHKITTNSLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAF 1388
            ESH  T +SL +TD +++ SL +++ LE SL  A +++ FMQ+LRD++SV+CDFL DKA 
Sbjct: 376  ESHARTMSSLKKTDEDLSSSLLKITDLESSLSAAGERFIFMQKLRDYVSVICDFLQDKAP 435

Query: 1389 LIEELEEQMQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLSK--GSSSAYVXXX 1562
             IE LE +MQKL+++RA A++ERRA D  D+  EVE+A+ AA   +     S+S      
Sbjct: 436  YIETLEAEMQKLNKERASAILERRAADNDDEMTEVEAAIKAATLFIGDRGNSASKLTAAS 495

Query: 1563 XXXXXXXXXXXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASMEM 1742
                       +E  +LPV+LDEFGRD+NL+KR D  RRAESR+ R+ R + K+++SM+ 
Sbjct: 496  SAAQAAAAAAIKEQTNLPVKLDEFGRDMNLQKRRDMERRAESRQHRRTRFDLKQLSSMDA 555

Query: 1743 DNMLQ-IEGELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWKN 1919
            D   Q +EGE +TDESDSE+ AY S+R EL++TAE IFSDA+EEY+ L +VKE FE+WK 
Sbjct: 556  DISSQKLEGESTTDESDSETEAYQSNREELLKTAEHIFSDAAEEYSQLSVVKERFEKWKR 615

Query: 1920 QYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQDF 2099
             Y SSYRDAY+S+S P++ SPYVRLELLKWDPL++  DF +M+WH LLFNYGLP  G+DF
Sbjct: 616  DYSSSYRDAYMSLSTPAIMSPYVRLELLKWDPLHEDADFSEMKWHNLLFNYGLPKDGEDF 675

Query: 2100 EPDDADANLIPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKALR 2279
              DDADANL+P +VEKVALPILHH+I +CWD+L+T+ TK  V AT +V++YVP SS+AL+
Sbjct: 676  AHDDADANLVPTLVEKVALPILHHDIAYCWDMLSTRETKNVVSATILVMAYVPTSSEALK 735

Query: 2280 ELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILSM 2459
            +LL  IHTRL EA+ ++ VP WS +    VP +A+ AAY+FG++VRL+RNICLWK + ++
Sbjct: 736  DLLVAIHTRLAEAVANIAVPTWSPLAMSAVPNSARIAAYRFGVSVRLMRNICLWKEVFAL 795

Query: 2460 PVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQP 2639
            P+               PHV+SI  N+HDAI RTERI+ASL G+W+GP VT     KLQP
Sbjct: 796  PILEKLALDELLCRKVLPHVRSIASNVHDAISRTERIVASLSGVWAGPSVTGSCCHKLQP 855

Query: 2640 LVDCISELGGKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
            LVD +  L   LEK+H  GV+  ET GLARRLK MLV LNEYD AR I RTF LKEAL
Sbjct: 856  LVDFMLSLAKTLEKKHLPGVTESETAGLARRLKKMLVELNEYDNARDIARTFHLKEAL 913


>gb|EXB53993.1| GC-rich sequence DNA-binding factor 1 [Morus notabilis]
          Length = 952

 Score =  741 bits (1912), Expect = 0.0
 Identities = 461/983 (46%), Positives = 587/983 (59%), Gaps = 54/983 (5%)
 Frame = +3

Query: 27   SIRAKNFRRRSESDD--------ANAEEKSVPSPSTKSQTLT--LXXXXXXXXXXXXR-- 170
            S RA+NFRRR+  DD         ++  K+ PS +T + T T  L            R  
Sbjct: 2    SNRARNFRRRTGGDDDDDDNYNIKDSNAKNGPSTTTATTTTTKSLLKPSSTSASKPKRPP 61

Query: 171  ------LSFADDEEEDNDRRP-----SRIPSSSAGAA---SVHRLTSSKDR------SKA 290
                  LSFADDE+ +   R      S++ SSS+  +   S H++T+ KDR      S  
Sbjct: 62   NQSTKLLSFADDEDNETPSRSKPSSSSKLSSSSSRLSKPTSSHKMTALKDRLPHSSSSSP 121

Query: 291  SRLASSIPSNVQPQVGEYTKERLLELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPV 470
            S  + S+PSNVQPQ G YTKE L ELQKN R L S   S                 +EPV
Sbjct: 122  SSSSLSLPSNVQPQAGTYTKEALRELQKNTRTLASSKPS-----------------SEPV 164

Query: 471  IVLKGFLKQASPGR--------DKQEGVVLKRQETNXXXXXXXXXXXDDSNGGSLTGAKF 626
            IVLKG LK +   +        ++ E   LK +              D  N      +  
Sbjct: 165  IVLKGLLKPSELAKSDWKLDSEEEDEPDELKERRGELASMEIGAKGRDRDNS-----SPE 219

Query: 627  PTIPDPETIKAIXXXXXXXXXXXXXXXDFISLDGGMPSSRPSADGSSDEEDTDFQERISL 806
            P IPD  TI AI               DFI+LD G  S+   A+G SDEE  + Q RI++
Sbjct: 220  PLIPDQATINAIRAKRERLRQSRAAAPDFIALDAG--SNHGEAEGLSDEEPEN-QTRIAM 276

Query: 807  FGIKADDKLKKGVFES-IDQR-LTITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRK 980
            FG KA+   KKGVFE  ID R + +   R+  G   +   N               QFRK
Sbjct: 277  FGEKAEGP-KKGVFEDDIDDRGIELGLLRRKQGVLEE---NHEDDEDEEDKIWEEEQFRK 332

Query: 981  GLGK-RIDDTSSQRVNYSVAPIPLHPQPSVYPGVAHQT---SASMTSASYGASRSAE--- 1139
            GLGK RIDD     V   V  +    Q      V  QT   SAS+     G+S  +    
Sbjct: 333  GLGKTRIDDGGKNSV---VPVVKRETQQKFVSSVGSQTLPPSASIGGTFGGSSGGSSTGL 389

Query: 1140 ---VLSISQQAEVASRAMQETINRLKESHKITTNSLVRTDTNITESLTEVSSLEKSLKEA 1310
               ++  SQQAE+A  A+ + + RLKE+H     SL + D N+++SL  +++LEKSL  A
Sbjct: 390  GLGMMPFSQQAEIALNAIDDNVRRLKETHDQDLVSLNKADKNLSDSLLNITALEKSLSAA 449

Query: 1311 DDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQMQKLHEKRALAVVERRADDIADDDNE 1490
            D+KY F Q+LRDFIS++CDFL  KA  IEELE+QMQKLHEK A A+VERR  +  D+  E
Sbjct: 450  DEKYKFTQKLRDFISIICDFLQHKAPFIEELEDQMQKLHEKHASAIVERRTANNDDEMME 509

Query: 1491 VESAVNAAIAVLSK-GSSSAYVXXXXXXXXXXXXXXRESADLPVELDEFGRDINLKKRMD 1667
            VE+ VNAA+++ SK GS+   V              RE  +LPV+LDEFGRD+NL+KRM+
Sbjct: 510  VEAEVNAAMSIFSKKGSNVDVVAAAKSAAQAASAALREQGNLPVKLDEFGRDMNLQKRME 569

Query: 1668 FTRRAESRKLRKARAESKRIASMEMDNMLQ-IEGELSTDESDSESNAYISSRNELIQTAE 1844
               RAE+R+ RKAR +SKR++SM++D   Q +EGE STDESDSES A+ S R  L+QTA 
Sbjct: 570  MKGRAEARQCRKARFDSKRLSSMDVDGPYQRMEGESSTDESDSESTAFESHRELLLQTAA 629

Query: 1845 EIFSDASEEYANLKIVKEWFERWKNQYLSSYRDAYVSISVPSLFSPYVRLELLKWDPLYD 2024
             IFSDASEEY+ L +VKE FE WK +Y S+Y DAY+S+S PS+FSPYVRLELLKWDPL++
Sbjct: 630  HIFSDASEEYSQLSVVKERFEEWKREYSSTYSDAYMSLSAPSIFSPYVRLELLKWDPLHE 689

Query: 2025 ATDFFDMEWHKLLFNYGLPAKGQDFEPDDADANLIPEIVEKVALPILHHEIEHCWDILNT 2204
             TDF +M WH LL +YG+P  G  F PDDADANL+PE+VEKVAL ILHHEI HCWD+L+T
Sbjct: 690  KTDFLNMNWHSLLMDYGVPEDGGGFAPDDADANLVPELVEKVALRILHHEIVHCWDMLST 749

Query: 2205 QRTKGAVFATNMVISYVPASSKALRELLAVIHTRLNEAITDLNVPVWSSVITKVVPGAAQ 2384
              T+ AV AT++V  YVPASS+AL +LL  I TRL +A+ +L VP WS  + + VP AA+
Sbjct: 750  LETRNAVAATSLVTDYVPASSEALADLLVAIRTRLADAVANLTVPTWSPPVLQAVPNAAR 809

Query: 2385 FAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXXXXXXXXXXPHVKSIMPNIHDAIMRTE 2564
             AAY+FG++VRL++NICLWK IL++PV               PHV+SI  N+HDAI RTE
Sbjct: 810  LAAYRFGVSVRLMKNICLWKEILALPVLEKLALDELLCGKVLPHVRSIAANVHDAIPRTE 869

Query: 2565 RIIASLVGIWSGPEVTLGTSQKLQPLVDCISELGGKLEKRHALGVSLEETRGLARRLKNM 2744
            +I+ASL G+W+GP VT   S+KLQPLVD +  L   LEK+H  GV+  ET GLARRLK M
Sbjct: 870  KIVASLSGVWAGPSVTGDRSRKLQPLVDYLMLLRKILEKKHESGVTESETSGLARRLKKM 929

Query: 2745 LVSLNEYDKARAILRTFQLKEAL 2813
            LV LNEYDKAR I RTF LKEAL
Sbjct: 930  LVELNEYDKARDIARTFHLKEAL 952


>ref|XP_002513154.1| gc-rich sequence DNA-binding factor, putative [Ricinus communis]
            gi|223548165|gb|EEF49657.1| gc-rich sequence DNA-binding
            factor, putative [Ricinus communis]
          Length = 885

 Score =  741 bits (1912), Expect = 0.0
 Identities = 430/949 (45%), Positives = 568/949 (59%), Gaps = 19/949 (2%)
 Frame = +3

Query: 24   SSIRAKNFRRRSESDDANAEEKSVPSPSTKSQTLTLXXXXXXXXXXXXRLSFADDEEEDN 203
            +S +++NFRRR + ++ N    +  +PS  S+  +              LSFADDEEED 
Sbjct: 3    TSSKSRNFRRRGDENEDNESNSNTTNPSYSSRKSSSKPKKL--------LSFADDEEEDE 54

Query: 204  DR-RPSRIPSSSAGAASVHRLTSSKDRSKASRLASSIPSNVQ------PQVGEYTKERLL 362
            +  RPS+   S     S H+LT+ KDR  +S   S+  +N        PQ G YTKE LL
Sbjct: 55   ETPRPSKQKPSKT--KSSHKLTAPKDRLSSSSTTSTTSTNTNSNNVLLPQAGTYTKEALL 112

Query: 363  ELQKNARPLGSISRSQRPPAVPEPKPRKSDRPAEPVIVLKGFLKQASPGRDKQEGVVLKR 542
            ELQK  R L       +P + P P P  S   +EP I+LKG LK   P    Q+      
Sbjct: 113  ELQKKTRTLA------KPSSKPPPPPPSS---SEPKIILKGLLKPTLPQTLNQQDA---- 159

Query: 543  QETNXXXXXXXXXXXDDSNGGSLTGAKFPTIPDPETIKAIXXXXXXXXXXXXXXXDFISL 722
                           D      +    +  IPD +TIK I               D+ISL
Sbjct: 160  ---------------DPPQDEIIIDEDYSLIPDEDTIKKIRAKRERLRQSRATAPDYISL 204

Query: 723  DGGMPSSRPSADGSSDEEDTDFQERISLFGIKADDK-LKKGVFESID---------QRLT 872
            DGG  +S    D  SDEE  +F+ RI++ G K +       VF+  D         +   
Sbjct: 205  DGGAATS----DAFSDEEP-EFRNRIAMIGKKDNTTPTTHAVFQDFDNGNDSHVIAEETV 259

Query: 873  ITDERKMDGGFRKGDINIXXXXXXXXXXXXXXQFRKGLGKRIDDTSSQRVNYSVAPIPLH 1052
            + DE + D  + +                   QFRK LGKR+DD SS     S+ P P  
Sbjct: 260  VNDEDEEDKIWEE------------------EQFRKALGKRMDDPSSSTP--SLFPTPST 299

Query: 1053 PQPSVYPGVAHQTSASMTSASYGASRSAEVLSISQQAEVASRAMQETINRLKESHKITTN 1232
               +      H         ++G +   + LS+ QQ+ +A +A+ + + RLKESH  T +
Sbjct: 300  STITTTNNHRHSHIVPTIGGAFGPTPGLDALSVPQQSHIARKALLDNLTRLKESHNRTVS 359

Query: 1233 SLVRTDTNITESLTEVSSLEKSLKEADDKYNFMQQLRDFISVMCDFLNDKAFLIEELEEQ 1412
            SL + D N++ SL  +++LEKSL  A +K+ FMQ+LRDF+SV+C+FL  KA  IEELEEQ
Sbjct: 360  SLTKADENLSASLMNITALEKSLSAAGEKFIFMQKLRDFVSVICEFLQHKAPYIEELEEQ 419

Query: 1413 MQKLHEKRALAVVERRADDIADDDNEVESAVNAAIAVLS-KGSSSAYVXXXXXXXXXXXX 1589
            MQ LHE+RA A++ERR  D  D+  EV++A+ AA  V S +GS+ A +            
Sbjct: 420  MQTLHEQRASAILERRTADNDDEMMEVKTALEAAKKVFSARGSNEAAITAAMNAAQDASA 479

Query: 1590 XXRESADLPVELDEFGRDINLKKRMDFTRRAESRKLRKARAESKRIASMEMDNMLQ-IEG 1766
              +E  +LPV+LDEFGRDIN +KR+D  RRAE+R+ RKA+   K+++S+E+D   Q +EG
Sbjct: 480  SMKEQINLPVKLDEFGRDINQQKRLDMKRRAEARQRRKAQ---KKLSSVEVDGSNQKVEG 536

Query: 1767 ELSTDESDSESNAYISSRNELIQTAEEIFSDASEEYANLKIVKEWFERWKNQYLSSYRDA 1946
            E STDESDSES AY S+R+ L+QTA++IF DASEEY  L +VK+ FE WK +Y +SYRDA
Sbjct: 537  ESSTDESDSESAAYQSNRDLLLQTADQIFGDASEEYCQLSVVKQRFENWKKEYSTSYRDA 596

Query: 1947 YVSISVPSLFSPYVRLELLKWDPLYDATDFFDMEWHKLLFNYGLPAKGQDFEPDDADANL 2126
            Y+SIS P++FSPYVRLELLKWDPL++   FF M+WH LL +YGLP  G D  P+DADANL
Sbjct: 597  YMSISAPAIFSPYVRLELLKWDPLHEDAGFFHMKWHSLLSDYGLPQDGSDLSPEDADANL 656

Query: 2127 IPEIVEKVALPILHHEIEHCWDILNTQRTKGAVFATNMVISYVPASSKALRELLAVIHTR 2306
            +PE+VEKVA+PILHHEI HCWD+L+T+ TK AVFATN+V  YVPASS+AL ELL  I TR
Sbjct: 657  VPELVEKVAIPILHHEIAHCWDMLSTRETKNAVFATNLVTDYVPASSEALAELLLAIRTR 716

Query: 2307 LNEAITDLNVPVWSSVITKVVPGAAQFAAYKFGMAVRLLRNICLWKNILSMPVXXXXXXX 2486
            L +A+  + VP WS +  K VP AAQ AAY+FGM+VRL++NICLWK+ILS+PV       
Sbjct: 717  LTDAVVSIMVPTWSPIELKAVPRAAQIAAYRFGMSVRLMKNICLWKDILSLPVLEKLALD 776

Query: 2487 XXXXXXXXPHVKSIMPNIHDAIMRTERIIASLVGIWSGPEVTLGTSQKLQPLVDCISELG 2666
                    PH++S+  N+HDA+ RTERIIASL G+W+G  VT   S KLQPLVDC+  LG
Sbjct: 777  DLLCRKVLPHLQSVASNVHDAVTRTERIIASLSGVWAGTSVTASRSHKLQPLVDCVMSLG 836

Query: 2667 GKLEKRHALGVSLEETRGLARRLKNMLVSLNEYDKARAILRTFQLKEAL 2813
             +L+ +H LG S  E  GLARRLK MLV LN+YDKAR I R F L+EAL
Sbjct: 837  KRLKDKHPLGASEIEVSGLARRLKKMLVELNDYDKAREIARMFSLREAL 885

