BLASTX nr result

ID: Akebia24_contig00007163 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00007163
         (2120 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007210347.1| hypothetical protein PRUPE_ppa001704mg [Prun...   522   e-145
ref|XP_004309494.1| PREDICTED: uncharacterized protein LOC101290...   499   e-138
gb|AER42647.1| GTL1 [Populus tremula x Populus alba]                  497   e-137
ref|XP_006476231.1| PREDICTED: trihelix transcription factor GTL...   491   e-136
ref|XP_006439158.1| hypothetical protein CICLE_v10018915mg [Citr...   490   e-135
ref|XP_006476232.1| PREDICTED: trihelix transcription factor GTL...   484   e-134
ref|XP_002518968.1| transcription factor, putative [Ricinus comm...   483   e-133
ref|XP_006851901.1| hypothetical protein AMTR_s00041p00147950 [A...   479   e-132
ref|XP_002300534.2| hypothetical protein POPTR_0001s45870g [Popu...   476   e-131
ref|NP_174594.1| trihelix transcription factor GTL1 [Arabidopsis...   439   e-120
ref|XP_006854553.1| hypothetical protein AMTR_s00030p00088210 [A...   439   e-120
ref|XP_006415119.1| hypothetical protein EUTSA_v10007000mg [Eutr...   432   e-118
emb|CAE02791.2| OSJNBa0011L07.15 [Oryza sativa Japonica Group] g...   432   e-118
ref|XP_002893757.1| hypothetical protein ARALYDRAFT_473497 [Arab...   432   e-118
gb|AAG51283.1|AC027035_6 trihelix DNA-binding protein (GTL1) [Ar...   426   e-116
sp|Q9C882.2|GTL1_ARATH RecName: Full=Trihelix transcription fact...   426   e-116
ref|XP_002464376.1| hypothetical protein SORBIDRAFT_01g017120 [S...   426   e-116
emb|CAA05995.1| GTL1 [Arabidopsis thaliana]                           425   e-116
ref|XP_006415118.1| hypothetical protein EUTSA_v10007000mg [Eutr...   417   e-113
ref|XP_002448251.1| hypothetical protein SORBIDRAFT_06g023980 [S...   416   e-113

>ref|XP_007210347.1| hypothetical protein PRUPE_ppa001704mg [Prunus persica]
            gi|462406082|gb|EMJ11546.1| hypothetical protein
            PRUPE_ppa001704mg [Prunus persica]
          Length = 776

 Score =  522 bits (1345), Expect = e-145
 Identities = 347/732 (47%), Positives = 408/732 (55%), Gaps = 86/732 (11%)
 Frame = +3

Query: 147  AVVAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMER-----GVLSGG 311
            A + E ASPISSR P     +S NLDEL+ +S               E      GV S G
Sbjct: 46   AQLVEAASPISSRPP---ASASVNLDELMTLSGAAAAAEDALAASRDEADRGGGGVGSSG 102

Query: 312  NRWPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHK 491
            NRWPRQET+ALLKIRSEMD +FRDATLKGPLW+DVSRKLAELGY RSAKKCKEKFENVHK
Sbjct: 103  NRWPRQETLALLKIRSEMDVSFRDATLKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHK 162

Query: 492  YYKRTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXS-MGIG-- 662
            YYKRTKEGRAGRQDGKSY+FFS+LEALH                        + + IG  
Sbjct: 163  YYKRTKEGRAGRQDGKSYKFFSELEALHGTTAATSSVNVSASPSIHVTHASPNPVSIGFS 222

Query: 663  ------------TVITTSGRIQPNSETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSAS 806
                        T+     +  P +    P++ P Q       P+D++       S S+S
Sbjct: 223  NPMPISSFRMSPTIPVMPSQQPPATFPVMPSSQPPQTAATTATPMDINFS-----SNSSS 277

Query: 807  TAVGLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESS-----SNRKMMGFFEGLMKQ 971
            ++ G                 +EGEPS+   RKRKRG +S     S RKMM FFE LMKQ
Sbjct: 278  SSPGT--------DDEDDDDDVEGEPSS---RKRKRGGASTSGSGSTRKMMEFFEVLMKQ 326

Query: 972  VIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAII 1151
            V+++QE MQQRFLE IEKREQDR IREEAWK QEMARL             SA+RDAAII
Sbjct: 327  VMQKQETMQQRFLEVIEKREQDRTIREEAWKRQEMARLTREHELMSQERAISASRDAAII 386

Query: 1152 AFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXK------------ 1295
            +FLQKITGQ+IQLPP V +                                         
Sbjct: 387  SFLQKITGQTIQLPPPVNVHSAPPPPVPPSVPVVTPLAQQSVQPPIQTSYHQTTPQQQQP 446

Query: 1296 ------KEIIRHQPTTTEL-VVAAIPEQQ-HPPQE---MXXXXXXSLDQ-SSSRWPKAEV 1439
                  +++  HQ  +  L VV A+PEQQ  PPQE          SL+  SSSRWPKAEV
Sbjct: 447  PQQQHGQQVRHHQQQSQNLQVVMAVPEQQVQPPQENIASGGGAGGSLEPASSSRWPKAEV 506

Query: 1440 LALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKES 1619
            LALI LRSGL+SRYQ+AGPKGPLWEEISAGM RMGY RSSKRCKEKWENINKYFKKVKES
Sbjct: 507  LALIKLRSGLESRYQEAGPKGPLWEEISAGMGRMGYKRSSKRCKEKWENINKYFKKVKES 566

Query: 1620 NKKRPEDAKTCPYFHQLDDLYRKKMLGGVG----SSSFVNQNNEQQ------QLDSSTND 1769
            NKKRPEDAKTCPYFH+LD LYRK++LGG G    SSS  NQN  +Q      QL++  +D
Sbjct: 567  NKKRPEDAKTCPYFHELDALYRKRILGGGGGGGSSSSLGNQNRLEQPQQHQLQLENPKSD 626

Query: 1770 SNPTA---NLQAIMAXXXXXETNEA-------ENKNIDDNNGGSIE----------VKKP 1889
            S       +L+A  +     +T EA       ENKN D +   ++E           KKP
Sbjct: 627  SATQPQDRSLEAQPSVPVMPQTQEAVVATDQSENKNGDQS--ANVENLFGEATDEAAKKP 684

Query: 1890 EDIMME-------HHAQHSVMDFYEKLEEPNRDNQDHXXXXXXXXXXXXXXXXXXXXXXR 2048
            EDI+ E        H Q   +D Y+++EE N DN                         R
Sbjct: 685  EDIVKELMQQEVHDHLQQLAVDDYDRIEEANSDNI-MDQEEDMEDDDIDEEDDEEMEEER 743

Query: 2049 KMGYKIQFQRPN 2084
            KM YKI+FQ+PN
Sbjct: 744  KMAYKIEFQKPN 755


>ref|XP_004309494.1| PREDICTED: uncharacterized protein LOC101290918 [Fragaria vesca
            subsp. vesca]
          Length = 769

 Score =  499 bits (1284), Expect = e-138
 Identities = 335/729 (45%), Positives = 400/729 (54%), Gaps = 85/729 (11%)
 Frame = +3

Query: 147  AVVAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPR 326
            A V E ASPISSR P  G  S+ NLDEL+ +S                 G  SGGNRWPR
Sbjct: 38   AHVVEEASPISSRPPAGGAISAVNLDELMTLSGAAADVAADQGGGG---GGGSGGNRWPR 94

Query: 327  QETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRT 506
            QET+ALLKIRSEMD AFRDATLKGPLW+DVSRKLAELGY R+AKKCKEKFENVHKYYKRT
Sbjct: 95   QETLALLKIRSEMDVAFRDATLKGPLWEDVSRKLAELGYKRNAKKCKEKFENVHKYYKRT 154

Query: 507  KEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTV-----I 671
            KEGRAGRQDGKSY+FFS+LEALH                        S+G G++     I
Sbjct: 155  KEGRAGRQDGKSYKFFSELEALHGSPSPNVSASPPVHVTTAAAAPV-SIGFGSISNPMPI 213

Query: 672  TTSGRIQPNSET----------ATPNAPPTQIGLPRMN--PLDLSGGVQIGVSGSASTAV 815
            ++      N+ T          A P  P +Q  LP     P+D++       S S+S++ 
Sbjct: 214  SSFRMTGGNTSTVPIMSTQATGAIPIMPSSQPPLPASTAAPMDINFS-----SNSSSSSH 268

Query: 816  GLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESS-----SNRKMMGFFEGLMKQVIE 980
            G                 + GEP    +RKRKRG SS     S R+MM FFE LMKQV++
Sbjct: 269  G-------EDEDYEDDDEVAGEPPANTSRKRKRGTSSRESGGSTRRMMEFFEILMKQVMQ 321

Query: 981  RQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFL 1160
            +QE MQQRFLE IEKREQDR IREEAWK QEMARL             SA+RDAAIIAFL
Sbjct: 322  KQETMQQRFLEVIEKREQDRNIREEAWKRQEMARLTREHELMTQERAISASRDAAIIAFL 381

Query: 1161 QKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKK-------------- 1298
            QKITGQ+IQLPP + +                            ++              
Sbjct: 382  QKITGQTIQLPPPLNVHNAPPPPVPPSVSVHVTPVSAPPPPLAQQQSLHQVHQHQPQQQT 441

Query: 1299 ---EIIRHQPTTT---------ELVVAAIPEQQ-HPPQEMXXXXXXSLDQ--SSSRWPKA 1433
               +I RHQ  T+           VV A+PEQQ    QE+        +   SSSRWPKA
Sbjct: 442  PPTQISRHQIRTSIPPTPSANQTEVVMAVPEQQVAQSQEIVVGSGGGFEATTSSSRWPKA 501

Query: 1434 EVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVK 1613
            EVLALI LRSGL++RYQ+AGPKGPLWEEISAGM RMGY R+ KRCKEKWENINKYFKKVK
Sbjct: 502  EVLALIKLRSGLETRYQEAGPKGPLWEEISAGMQRMGYKRNPKRCKEKWENINKYFKKVK 561

Query: 1614 ESNKKRPEDAKTCPYFHQLDDLYRKKMLGG---VGSSSFVNQNNEQQQLDSSTNDSNPTA 1784
            ESNK RPEDAKTCPYFH+LD LYRK++LGG    GSSS +     QQQ   +       +
Sbjct: 562  ESNKVRPEDAKTCPYFHELDALYRKRILGGGPSGGSSSSLGNQTVQQQPQQAPPQPKLDS 621

Query: 1785 NLQAIMAXXXXXE-----TNEAENKNIDDN---------NGGSIEVKKPEDIMME----- 1907
              Q  +A     +     T+++ NK+ DD+         +      KKPEDI+ E     
Sbjct: 622  AAQGTVANTQQTQGTVAATDQSVNKSGDDSPNLQKNLFGDAPEEAAKKPEDIVKELMGQQ 681

Query: 1908 --HH---------AQHSVMDFYEKLEEPNRD-NQDHXXXXXXXXXXXXXXXXXXXXXXRK 2051
              HH          Q  V++ Y+++EE + D N D                       RK
Sbjct: 682  QQHHHQVLNQQGVQQQLVVEDYDRVEEGDSDVNLDQ----DEEEDEEDEEDEEMDDESRK 737

Query: 2052 MGYKIQFQR 2078
            M YKI+FQ+
Sbjct: 738  MDYKIEFQK 746


>gb|AER42647.1| GTL1 [Populus tremula x Populus alba]
          Length = 795

 Score =  497 bits (1279), Expect = e-137
 Identities = 329/744 (44%), Positives = 398/744 (53%), Gaps = 100/744 (13%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSG----NLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRW 320
            V E ASPISSR P     +SG    NLDE + +S                 G ++ GNRW
Sbjct: 50   VVEEASPISSRPPATAATTSGGGLMNLDEFMRLSGGGGAEEDIAGEEADRTGGIASGNRW 109

Query: 321  PRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYK 500
            PRQET+ALL+IRSEMDAAFRDATLKGPLW+DVSRKLAE+GY RSAKKCKEKFENVHKYYK
Sbjct: 110  PRQETLALLQIRSEMDAAFRDATLKGPLWEDVSRKLAEMGYKRSAKKCKEKFENVHKYYK 169

Query: 501  RTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTS 680
            RTK+GRAGRQDGKSYRFFSQLEAL                            IGT  T+S
Sbjct: 170  RTKDGRAGRQDGKSYRFFSQLEALQNTGGGGGGVSASISNVSGVAPQL----IGTATTSS 225

Query: 681  GRIQPNS-----ETATPNAPPTQIGLPRMN-----PLDLSGGVQIGVSGSASTAVGLAFX 830
              + P S        TP  P +Q+  P  N     P DL  G  +  + +A   VG++F 
Sbjct: 226  LDVAPVSVGIPMPIRTP-PPSSQVPQPASNIGSMFPPDL--GATVAPTAAAGAPVGISFS 282

Query: 831  XXXXXXXXXXXXXLEGEPSNT-----------NTRKRKRGESSSNR----KMMGFFEGLM 965
                          + E                +RKRKR   SS++    +MM FFEGLM
Sbjct: 283  SNESSSSQSSEDDDDDEDGGILGGQTSAMGAGTSRKRKRASLSSSKGETHRMMEFFEGLM 342

Query: 966  KQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAA 1145
            KQV+++QEAMQQRFLE IEKREQDRMIR+EAWK QEMAR +            SA+RDAA
Sbjct: 343  KQVMQKQEAMQQRFLEAIEKREQDRMIRDEAWKRQEMARSSREHEIMAQERSISASRDAA 402

Query: 1146 IIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTT 1325
            I+AFLQKITGQ+I +P  V+I                             +  ++ QP  
Sbjct: 403  IVAFLQKITGQTIHVPTPVSI---APPVSQPPPPTQPQQVQIAQLVTVSTQPPLQPQPMP 459

Query: 1326 TELVV----AAIPEQQH--------------------------PPQEMXXXXXXS--LDQ 1409
               V       +P+QQH                          P Q++      S   + 
Sbjct: 460  LSQVTPQQNKQLPQQQHHQQQQHQQVHHPRQPPSISSDIVMAVPEQQIAPLELGSGGSEP 519

Query: 1410 SSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENI 1589
            +SSRWPK EVLALI LRSGL++RYQ+AGPKGPLWEEISAGM R+GY RSSKRCKEKWENI
Sbjct: 520  ASSRWPKPEVLALIKLRSGLETRYQEAGPKGPLWEEISAGMLRLGYKRSSKRCKEKWENI 579

Query: 1590 NKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKML----GGVGSSS---FVNQNNEQQQ 1748
            NKYFKKVKESNKKRPEDAKTCPYFH+LD LYRKK+L    GG G++S   F +QN  Q+Q
Sbjct: 580  NKYFKKVKESNKKRPEDAKTCPYFHELDALYRKKILGSSSGGAGNTSTSGFDSQNRPQKQ 639

Query: 1749 ----LDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDNNGGSIEV------------ 1880
                 +S   D  P    QA+       +T   E++N    NG S++V            
Sbjct: 640  QHQPQESLELDPMPPPMQQAV-----PQQTQATESQN---KNGASVDVQASNTDLAGSPL 691

Query: 1881 --------KKPEDIMME--------HHAQHSVMDFYEKLEEPNRDNQDHXXXXXXXXXXX 2012
                    KKPEDI+ E           Q  ++D Y+K+EE + +N +            
Sbjct: 692  GEGNEGAEKKPEDIVKELIKQQGTQQQQQQLMVDDYDKMEEGDSENVNEDEYDEEDDGDE 751

Query: 2013 XXXXXXXXXXXRKMGYKIQFQRPN 2084
                       RKM YKI+FQR N
Sbjct: 752  DEEEDEALQEERKMAYKIEFQRQN 775


>ref|XP_006476231.1| PREDICTED: trihelix transcription factor GTL1-like isoform X1 [Citrus
            sinensis]
          Length = 797

 Score =  491 bits (1264), Expect = e-136
 Identities = 330/746 (44%), Positives = 404/746 (54%), Gaps = 104/746 (13%)
 Frame = +3

Query: 159  EGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGV-LSGGNRWPRQET 335
            E ASPISSR P     S+ NLDEL+ +S             E +RG  +S GNRWP QET
Sbjct: 51   EAASPISSRPPA----SASNLDELMRLSGGDDD--------EGDRGGGVSSGNRWPSQET 98

Query: 336  IALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKEG 515
            +ALLKIRS+MDAAFRDAT+KGPLW+DVSRKLAELGY RSAKKCKEKFENVHKYYKRTKEG
Sbjct: 99   LALLKIRSDMDAAFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEG 158

Query: 516  RAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXX--------------SM 653
            RAGRQDGKSY+FF+QLEALH                                      S+
Sbjct: 159  RAGRQDGKSYKFFTQLEALHSSPTSTSTSTATTSNVSASLPKPMTTVADTSTLDVAPVSV 218

Query: 654  GIGTVITTSGRIQPNSETAT----PNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTA--- 812
            GI   I++S RI  +  T T     +   T I  P  + +++ G V   V  +A+T+   
Sbjct: 219  GIPMPISSSVRIPTSPITLTCFPYHDLRSTLIPAPS-SAVNVPGSVTTPVPPTATTSTTP 277

Query: 813  VGLAFXXXXXXXXXXXXXX------LEGEPSNT-------NTRKRKRGESSSNRKMMGFF 953
            VG++F                     EG+PSNT         RKRKR  SSS+R MM FF
Sbjct: 278  VGISFSSKSSSSSPETEDDDDDVMDFEGQPSNTAGTSNRGRNRKRKRQTSSSHR-MMAFF 336

Query: 954  EGLMKQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSAT 1133
            EGLMKQV+++QEAMQQ FLE IEKRE+DRMIREEAWK QEM+RL             SA+
Sbjct: 337  EGLMKQVMQKQEAMQQSFLEVIEKRERDRMIREEAWKRQEMSRLAREHELMAQERAISAS 396

Query: 1134 RDAAIIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXX------K 1295
            RDA+II FLQKITGQ+IQLPP++ I                                  +
Sbjct: 397  RDASIINFLQKITGQTIQLPPAITIPAAPPPPPPQPQSPSQAVPVATNTTQSHHMPPPER 456

Query: 1296 KEIIRH--------------------QPTTT------ELVVAAIPEQQHPPQE-MXXXXX 1394
            ++I +H                    QP+ T        VV A+PEQQ PP +       
Sbjct: 457  RDIQQHHHRHQQIQSSAAEAVTARHQQPSGTVSTSIHSQVVMAVPEQQVPPSDHQEIGSG 516

Query: 1395 XSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKE 1574
             +L+ +SSRWPK EVLALI LRSGL+ RYQ+AGPKGPLWEEIS GM RMGYNR++KRCKE
Sbjct: 517  GNLEPASSRWPKVEVLALIKLRSGLEHRYQEAGPKGPLWEEISVGMQRMGYNRNAKRCKE 576

Query: 1575 KWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNNEQQQLD 1754
            KWENINKYFKKVKESNK+RPEDAKTCPYFH+LD LYRKK++ GVG +S  +QN  +++  
Sbjct: 577  KWENINKYFKKVKESNKRRPEDAKTCPYFHELDALYRKKII-GVGGTSTSSQNRPEERHQ 635

Query: 1755 SSTNDS------NPTAN--------LQAIMAXXXXXETNEAENKNIDDNN---------- 1862
            S +         NP  N        L A +        +E +N N   +N          
Sbjct: 636  SQSEQQHQQENVNPVTNPQESSINVLPAPLLITQAHSDSENKNGNAQASNVGVTGSLFGE 695

Query: 1863 GGSIEVKKPEDIMMEHHAQHS------------VMDFYEKLEEPNRDNQDHXXXXXXXXX 2006
            G     KKPEDI+ E   Q              V D ++K+EE N  ++           
Sbjct: 696  GNLGASKKPEDIVKELMNQQGTQQKQQQPQASIVDDQFDKVEESNMGSES-DNMEYEEED 754

Query: 2007 XXXXXXXXXXXXXRKMGYKIQFQRPN 2084
                         +   YK++FQR N
Sbjct: 755  EREDDEESEEDSNKMANYKVEFQRQN 780


>ref|XP_006439158.1| hypothetical protein CICLE_v10018915mg [Citrus clementina]
            gi|557541420|gb|ESR52398.1| hypothetical protein
            CICLE_v10018915mg [Citrus clementina]
          Length = 794

 Score =  490 bits (1261), Expect = e-135
 Identities = 330/745 (44%), Positives = 404/745 (54%), Gaps = 103/745 (13%)
 Frame = +3

Query: 159  EGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGV-LSGGNRWPRQET 335
            E ASPISSR P     S+ NLDEL+ +S             E +RG  +S GNRWP QET
Sbjct: 50   EAASPISSRPPA----SASNLDELMRLSGGDDD--------EGDRGGGVSSGNRWPSQET 97

Query: 336  IALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKEG 515
            +ALLKIRS+MDAAFRDAT+KGPLW+DVSRKLAELGY RSAKKCKEKFENVHKYYKRTKEG
Sbjct: 98   LALLKIRSDMDAAFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEG 157

Query: 516  RAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXX--------------SM 653
            RAGRQDGKSY+FFSQLEAL+                                      S+
Sbjct: 158  RAGRQDGKSYKFFSQLEALYSSPTSTSTSTATTSNVSASLPKPVTTVADTSTLDVAPVSV 217

Query: 654  GIGTVITTSGRIQPNSETAT----PNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTA--- 812
            GI   I++S RI  +  T T     +   T I  P  + +++ G V   V  +A+T+   
Sbjct: 218  GIPMPISSSVRIPTSPITLTCFPYHDLRSTLIP-PPSSAVNVPGSVTTPVPPTATTSTTP 276

Query: 813  VGLAFXXXXXXXXXXXXXX-----LEGEPSNT-------NTRKRKRGESSSNRKMMGFFE 956
            VG++F                    EG+PSNT         RKRKR  SSS+R MM FFE
Sbjct: 277  VGISFSSKSSSSPETEDDDEDVMDFEGQPSNTAGTSSRGRNRKRKRQTSSSHR-MMAFFE 335

Query: 957  GLMKQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATR 1136
            GLMKQV+++QEAMQQ FLE IEKRE+DRMIREEAWK QEM+RL             SA+R
Sbjct: 336  GLMKQVMQKQEAMQQSFLEVIEKRERDRMIREEAWKRQEMSRLAREHELMAQERAISASR 395

Query: 1137 DAAIIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXX------KK 1298
            DA+II FLQKITGQ+IQLPP++ I                                  ++
Sbjct: 396  DASIINFLQKITGQTIQLPPAITIPAAPPPPPPQPQSPSQAVPVATNTTQSHHMPPPERR 455

Query: 1299 EIIRH--------------------QPTTT------ELVVAAIPEQQHPPQE-MXXXXXX 1397
            +I +H                    QP+ T        VV A+PEQQ PP +        
Sbjct: 456  DIQQHHHRHQQIQSSAAEAVTARHQQPSGTVSTSIPSQVVMAVPEQQVPPSDHQEIGSGG 515

Query: 1398 SLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEK 1577
            +L+ +SSRWPK EVLALI LRSGL+ RYQ+AGPKGPLWEEIS GM RMGYNR++KRCKEK
Sbjct: 516  NLEPASSRWPKVEVLALIKLRSGLEHRYQEAGPKGPLWEEISVGMQRMGYNRNAKRCKEK 575

Query: 1578 WENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNNEQQQLDS 1757
            WENINKYFKKVKESNK+RPEDAKTCPYFH+LD LYRKK++ GVG +S  +QN  +++  S
Sbjct: 576  WENINKYFKKVKESNKRRPEDAKTCPYFHELDALYRKKII-GVGGTSTSSQNRPEERHQS 634

Query: 1758 STNDS------NPTAN--------LQAIMAXXXXXETNEAENKNIDDNN----------G 1865
             +         NP  N        L A +        +E +N N   +N          G
Sbjct: 635  QSEQQHQQENVNPVTNPQESSINVLPAPLLITQAHSDSENKNGNAQASNVGVTGSLFGEG 694

Query: 1866 GSIEVKKPEDIMMEHHAQHS------------VMDFYEKLEEPNRDNQDHXXXXXXXXXX 2009
                 KKPEDI+ E   Q              V D ++K+EE N  ++            
Sbjct: 695  NLGASKKPEDIVKELMNQQGTQQKQQQPQASIVDDQFDKVEESNMGSES--DNMEYEEDE 752

Query: 2010 XXXXXXXXXXXXRKMGYKIQFQRPN 2084
                        +   YK++FQR N
Sbjct: 753  RDDDEESEEDSNKMANYKVEFQRQN 777


>ref|XP_006476232.1| PREDICTED: trihelix transcription factor GTL1-like isoform X2 [Citrus
            sinensis]
          Length = 706

 Score =  484 bits (1247), Expect = e-134
 Identities = 301/615 (48%), Positives = 365/615 (59%), Gaps = 68/615 (11%)
 Frame = +3

Query: 159  EGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGV-LSGGNRWPRQET 335
            E ASPISSR P     S+ NLDEL+ +S             E +RG  +S GNRWP QET
Sbjct: 51   EAASPISSRPPA----SASNLDELMRLSGGDDD--------EGDRGGGVSSGNRWPSQET 98

Query: 336  IALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKEG 515
            +ALLKIRS+MDAAFRDAT+KGPLW+DVSRKLAELGY RSAKKCKEKFENVHKYYKRTKEG
Sbjct: 99   LALLKIRSDMDAAFRDATVKGPLWEDVSRKLAELGYKRSAKKCKEKFENVHKYYKRTKEG 158

Query: 516  RAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXX--------------SM 653
            RAGRQDGKSY+FF+QLEALH                                      S+
Sbjct: 159  RAGRQDGKSYKFFTQLEALHSSPTSTSTSTATTSNVSASLPKPMTTVADTSTLDVAPVSV 218

Query: 654  GIGTVITTSGRIQPNSETAT----PNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTA--- 812
            GI   I++S RI  +  T T     +   T I  P  + +++ G V   V  +A+T+   
Sbjct: 219  GIPMPISSSVRIPTSPITLTCFPYHDLRSTLIPAPS-SAVNVPGSVTTPVPPTATTSTTP 277

Query: 813  VGLAFXXXXXXXXXXXXXX------LEGEPSNT-------NTRKRKRGESSSNRKMMGFF 953
            VG++F                     EG+PSNT         RKRKR  SSS+R MM FF
Sbjct: 278  VGISFSSKSSSSSPETEDDDDDVMDFEGQPSNTAGTSNRGRNRKRKRQTSSSHR-MMAFF 336

Query: 954  EGLMKQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSAT 1133
            EGLMKQV+++QEAMQQ FLE IEKRE+DRMIREEAWK QEM+RL             SA+
Sbjct: 337  EGLMKQVMQKQEAMQQSFLEVIEKRERDRMIREEAWKRQEMSRLAREHELMAQERAISAS 396

Query: 1134 RDAAIIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXX------K 1295
            RDA+II FLQKITGQ+IQLPP++ I                                  +
Sbjct: 397  RDASIINFLQKITGQTIQLPPAITIPAAPPPPPPQPQSPSQAVPVATNTTQSHHMPPPER 456

Query: 1296 KEIIRH--------------------QPTTT------ELVVAAIPEQQHPPQE-MXXXXX 1394
            ++I +H                    QP+ T        VV A+PEQQ PP +       
Sbjct: 457  RDIQQHHHRHQQIQSSAAEAVTARHQQPSGTVSTSIHSQVVMAVPEQQVPPSDHQEIGSG 516

Query: 1395 XSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKE 1574
             +L+ +SSRWPK EVLALI LRSGL+ RYQ+AGPKGPLWEEIS GM RMGYNR++KRCKE
Sbjct: 517  GNLEPASSRWPKVEVLALIKLRSGLEHRYQEAGPKGPLWEEISVGMQRMGYNRNAKRCKE 576

Query: 1575 KWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNNEQQQLD 1754
            KWENINKYFKKVKESNK+RPEDAKTCPYFH+LD LYRKK++ GVG +S  +QN  +++  
Sbjct: 577  KWENINKYFKKVKESNKRRPEDAKTCPYFHELDALYRKKII-GVGGTSTSSQNRPEERHQ 635

Query: 1755 SSTNDSNPTANLQAI 1799
            S +   +   N+  +
Sbjct: 636  SQSEQQHQQENVNPV 650


>ref|XP_002518968.1| transcription factor, putative [Ricinus communis]
            gi|223541955|gb|EEF43501.1| transcription factor,
            putative [Ricinus communis]
          Length = 741

 Score =  483 bits (1242), Expect = e-133
 Identities = 317/711 (44%), Positives = 387/711 (54%), Gaps = 67/711 (9%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQE 332
            + E ASPISSR P   G   GNLD+ + +S                    + GNRWPRQE
Sbjct: 37   LVEEASPISSRPPATTG---GNLDDFMRLSGSAADEDELADRA-------TSGNRWPRQE 86

Query: 333  TIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKE 512
            TIALL+IRS+MDAAFRDAT+KGPLW+DVSRKL ELGY RSAKKCKEKFENVHKYYKRTKE
Sbjct: 87   TIALLQIRSDMDAAFRDATVKGPLWEDVSRKLNELGYKRSAKKCKEKFENVHKYYKRTKE 146

Query: 513  GRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGRIQ 692
            GR GRQDGK+YRFF+QLEALH                             T  TT+  + 
Sbjct: 147  GRGGRQDGKTYRFFTQLEALHNTTGATINIVSPSQPISTA---------ATTATTTLDVS 197

Query: 693  PNS-------ETATPNAPPTQIGLPRMNPLDLSG------GVQIGVSGSASTAVGLAFXX 833
            P S        ++  N PP+ +G+  + P   +         + G+S S ST+ G +   
Sbjct: 198  PVSIGIPMPAVSSVRNYPPSTVGISTIFPAVTAPLPPPPPPPRAGISFS-STSNGSSSSP 256

Query: 834  XXXXXXXXXXXXLEGEPSNT---NTRKRKRGESSSN-RKMMGFFEGLMKQVIERQEAMQQ 1001
                         + EPSN    ++RKRKR  S    R+MM FFEGLMK V+++QEAMQQ
Sbjct: 257  SFQDDDDDDDD--DDEPSNIAAGSSRKRKRHSSEGGTRRMMDFFEGLMKHVMQKQEAMQQ 314

Query: 1002 RFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQS 1181
            RFL+ IEKRE DR++REEAWK QEMARL+            SA+RDAAI++F+QKITGQ+
Sbjct: 315  RFLDAIEKRENDRVVREEAWKRQEMARLSREHELMAQERAISASRDAAIVSFIQKITGQT 374

Query: 1182 IQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXX------------------KKEII 1307
            IQLP  V I                                              K ++ 
Sbjct: 375  IQLPSPVTIPAVLQPPPAPQSKPVPLAPIVTVSIQQPPLPQPPSAAAPPPPQQQDKHQVH 434

Query: 1308 RHQPTTTELVVA-AIPEQQHPPQEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQ 1484
            RH    + +    AI   +  PQE+      SL+ SSSRWPKAEVLALI LRSGL+ RYQ
Sbjct: 435  RHHDRQSSISSELAIGVAEGMPQEIGSSR--SLEPSSSRWPKAEVLALIKLRSGLEFRYQ 492

Query: 1485 DAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFH 1664
            +AGPKGPLWEEISAGM RMGY RS+KRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFH
Sbjct: 493  EAGPKGPLWEEISAGMQRMGYKRSAKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFH 552

Query: 1665 QLDDLYRKKML------GGVGSSSFVNQ---NNEQQQLDSSTNDSNPTANLQAIMAXXXX 1817
            +LD LYRKK+L      G + +S F NQ     +QQQ +S+  D    A+   + A    
Sbjct: 553  ELDALYRKKVLVTTAGGGTISTSGFANQITRPEQQQQQESTKPDDRTEASATLLPAQSSQ 612

Query: 1818 XETNEAE----NKNIDDN---NGGSIEVKKPEDIMME----------HHAQHSVMDFYEK 1946
             +         N  +  N    G +    KPEDI+ E            +Q +V D YEK
Sbjct: 613  SQAKGGSGADANTGLPGNLFGEGNAGAGNKPEDIVKELIQPQGPQKQQESQFTVHD-YEK 671

Query: 1947 LEEPNRDNQDH-----XXXXXXXXXXXXXXXXXXXXXXRKMGYKIQFQRPN 2084
            +EE +  + DH                           RKM YKI++QRPN
Sbjct: 672  MEEDDDSDIDHENDEDDVEELDLEDDEEEDDDEVQEEERKMAYKIEYQRPN 722


>ref|XP_006851901.1| hypothetical protein AMTR_s00041p00147950 [Amborella trichopoda]
            gi|548855484|gb|ERN13368.1| hypothetical protein
            AMTR_s00041p00147950 [Amborella trichopoda]
          Length = 673

 Score =  479 bits (1232), Expect = e-132
 Identities = 308/645 (47%), Positives = 352/645 (54%), Gaps = 39/645 (6%)
 Frame = +3

Query: 159  EGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQETI 338
            E ASPISSRAP     S  N +ELV  +             E ERG    GNRWPRQET+
Sbjct: 22   ENASPISSRAP-----SGRNFEELVGPAGGFADEEALVGGEEGERGAT--GNRWPRQETL 74

Query: 339  ALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKEGR 518
            ALLK+R +MDAAFRDATLKGPLW +VSRKLAE G+ RSAKKCKEKFENVHKYYKRTKEGR
Sbjct: 75   ALLKVRQDMDAAFRDATLKGPLWQEVSRKLAEQGFNRSAKKCKEKFENVHKYYKRTKEGR 134

Query: 519  AGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGRIQPN 698
            AGRQDGKSYRFFSQLEALH                                       PN
Sbjct: 135  AGRQDGKSYRFFSQLEALHSSQATPTTSAPPPPQPPPPQ------------------NPN 176

Query: 699  SETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXXXXXXXXXXLEG 878
                 P  P      PR  P   +  +QI    +   A G++                EG
Sbjct: 177  PNNPVPLLPSPMASNPRPQP---TPQLQIPKPAADFPATGISLSSGDSSESDDS----EG 229

Query: 879  -EPSNTNTRKRKRGESSSNR--KMMGFFEGLMKQVIERQEAMQQRFLETIEKREQDRMIR 1049
             E    ++RKRKR  S+     KMM FFEGLMKQV+E+QEAMQQ+FLET+EKREQ RMIR
Sbjct: 230  TETVAKDSRKRKRSNSADQMTTKMMDFFEGLMKQVMEKQEAMQQKFLETMEKREQARMIR 289

Query: 1050 EEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLP-----PSVAIXX 1214
            EEAWK QEMARL             SA+RDAA+I+FLQKITGQ+I +P     P+V +  
Sbjct: 290  EEAWKRQEMARLAREHELVAQERALSASRDAAVISFLQKITGQTIPIPNPPAGPTVPVAP 349

Query: 1215 XXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELVVAAIPEQQHPPQEMXXXXX 1394
                                         I+   P  T     A      PP        
Sbjct: 350  VPVPVPPPPTVTA----------------IVPTTPNPTRPPNPAPTPPPPPPPAAPAGAD 393

Query: 1395 XSL---DQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKR 1565
              L   + SSSRWPKAEV ALI LRSGL+SRYQ+AGPKGPLWEEISAGM R+GYNRS+KR
Sbjct: 394  QDLSGHESSSSRWPKAEVHALIQLRSGLESRYQEAGPKGPLWEEISAGMSRLGYNRSAKR 453

Query: 1566 CKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNNEQQ 1745
            CKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLD LY+KK+ GG  SSS     +  +
Sbjct: 454  CKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDALYKKKIFGGGVSSSGAASESGGE 513

Query: 1746 QLDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDN--------------NGGSIEV- 1880
             L          A+            T + +N N   N              NGGS++  
Sbjct: 514  VLAIMAAPPPVVAS----GGGGGGTTTQQVQNGNGHGNGGGKDGNGGEGNGGNGGSVQTG 569

Query: 1881 --------KKPEDIMME-----HHAQHSVMDFYEKLEEPNRDNQD 1976
                    KKPEDI+ E        Q SVMD Y+KLEEP+ D  D
Sbjct: 570  FYQSGEQSKKPEDIVRELIDLQQQQQQSVMDDYDKLEEPDSDKLD 614


>ref|XP_002300534.2| hypothetical protein POPTR_0001s45870g [Populus trichocarpa]
            gi|550349976|gb|EEE85339.2| hypothetical protein
            POPTR_0001s45870g [Populus trichocarpa]
          Length = 704

 Score =  476 bits (1225), Expect = e-131
 Identities = 304/652 (46%), Positives = 366/652 (56%), Gaps = 69/652 (10%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSG----NLDELVAVSSXXXXXXXXXXXXEMER-GVLSGGNR 317
            V E ASPISSR P     +SG    NLDE + +S             + +R G ++ GNR
Sbjct: 47   VVEEASPISSRPPATAATTSGGGVMNLDEFMRLSGGGGGAEEDIAGEDADRTGGIASGNR 106

Query: 318  WPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYY 497
            WPRQET+ALL+IRSEMDAAFRDATLKGPLW+DVSRKLAE+GY RSAKKCKEKFENVHKYY
Sbjct: 107  WPRQETLALLQIRSEMDAAFRDATLKGPLWEDVSRKLAEMGYKRSAKKCKEKFENVHKYY 166

Query: 498  KRTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITT 677
            KRTKEGRAGRQDGKSYRFFSQLEAL                            IGT  T+
Sbjct: 167  KRTKEGRAGRQDGKSYRFFSQLEALQNTGGGGGVSSSISNVSGVAPQL-----IGTATTS 221

Query: 678  SGRIQPNS-----ETATPNAPPTQIGLPRMN-----PLDLSGGVQIGVSGSASTAVGLAF 827
            S  + P S        TP  P +Q+  P  N     P DL  G  +  + +A   V ++F
Sbjct: 222  SLDVAPVSVGIPMPIRTP-PPSSQVPQPASNIGSMFPPDL--GATVARAAAAGAPVRISF 278

Query: 828  XXXXXXXXXXXXXXLEGEPSNT-----------NTRKRKRGESSSNR----KMMGFFEGL 962
                           + E                +RKRKR   SS++    +MM FFEGL
Sbjct: 279  SSNESSSSQSSEDDDDDEDEGILGGQTSAMGAGTSRKRKRASLSSSKGETHRMMEFFEGL 338

Query: 963  MKQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDA 1142
            MKQV+++QEAMQQRFLE IEKREQDRMIR+EAWK QEMARL+            SA+RDA
Sbjct: 339  MKQVMQKQEAMQQRFLEAIEKREQDRMIRDEAWKRQEMARLSREHEIMAQERSISASRDA 398

Query: 1143 AIIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPT 1322
            AI+AFLQKITGQ+I LP  V+I                            +   ++ QP 
Sbjct: 399  AIVAFLQKITGQTIHLPTPVSIAPLVSQPQPPPPTQPQQVQIAPLVTVSTQPP-LQPQPM 457

Query: 1323 TTELVV----AAIPEQQH--------------------------PPQEMXXXXXXS--LD 1406
                V       +P+QQH                          P Q++      S   +
Sbjct: 458  PLSQVTPQQNKQLPQQQHHQQQQHQQVHHQHQPPSISSEIVMAVPEQQIAPLELGSGGSE 517

Query: 1407 QSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWEN 1586
             +SSRWPK EVLALI LRSGL++RYQ+AGPKGPLWEEISAGM R+GY RSSKRCKEKWEN
Sbjct: 518  PASSRWPKPEVLALIKLRSGLETRYQEAGPKGPLWEEISAGMLRLGYKRSSKRCKEKWEN 577

Query: 1587 INKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKML----GGVGSSS---FVNQNNEQQ 1745
            INKYFKKVKESNKKR EDAKTCPYFH+LD LYRKK+L    GG GS+S   F +Q N  Q
Sbjct: 578  INKYFKKVKESNKKRTEDAKTCPYFHELDALYRKKILGSSSGGAGSTSTSGFDSQINRPQ 637

Query: 1746 QLDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDNNGGSIEVKKPEDIM 1901
            +      +S     +   M      +T   E++N    NG S++V+    ++
Sbjct: 638  KQQHQHQESLELDPMPPPMQQTVPQQTQATESQN---KNGASVDVQASNTVL 686


>ref|NP_174594.1| trihelix transcription factor GTL1 [Arabidopsis thaliana]
            gi|332193452|gb|AEE31573.1| trihelix transcription factor
            GTL1 [Arabidopsis thaliana]
          Length = 669

 Score =  439 bits (1129), Expect = e-120
 Identities = 281/632 (44%), Positives = 349/632 (55%), Gaps = 26/632 (4%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSG--GNRWPR 326
            V E ASPISSR P      + NL+EL+  S+                G  S   GNRWPR
Sbjct: 12   VVEEASPISSRPP------ANNLEELMRFSAAADDGGLGGGGGGGGGGSASSSSGNRWPR 65

Query: 327  QETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRT 506
            +ET+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RS+KKCKEKFENV KYYKRT
Sbjct: 66   EETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENVQKYYKRT 125

Query: 507  KEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGR 686
            KE R GR DGK+Y+FFSQLEAL+                        S     V +    
Sbjct: 126  KETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSPFPVFSQP-- 183

Query: 687  IQPNSETATPNA-------PPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXX 845
             QP ++T  P          P  + LP M P+    GV    S S+STA G+        
Sbjct: 184  -QPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFT--GVTFS-SHSSSTASGMGSDDDDDD 239

Query: 846  XXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEK 1025
                     +   + +++RKRKRG      KMM  FEGL++QV+++Q AMQ+ FLE +EK
Sbjct: 240  MDVD-----QANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEK 294

Query: 1026 REQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVA 1205
            REQ+R+ REEAWK QEMARL             SA+RDAAII+ +QKITG +IQLPPS++
Sbjct: 295  REQERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGHTIQLPPSLS 354

Query: 1206 IXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQ------PTTTELVVAAIPEQ--- 1358
                                         ++ I+         P       A  PEQ   
Sbjct: 355  SQPPPPYQPPPAVTKRVAEPPLSTAQSQSQQPIMAIPQQQILPPPPPSHPHAHQPEQKQQ 414

Query: 1359 QHPPQEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHR 1538
            Q P QEM      S   SSSRWPKAE+LALI+LRSG++ RYQD  PKG LWEEIS  M R
Sbjct: 415  QQPQQEMVMSSEQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKR 474

Query: 1539 MGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSS 1718
            MGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCPYFH+LD LYR K+LG  G SS
Sbjct: 475  MGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYRNKVLGSGGGSS 534

Query: 1719 FVNQNNEQQQLDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDNNGGSIEVKKPEDI 1898
                  +Q+Q    T    P   L  +        T E E   I+++  G+   +KPED+
Sbjct: 535  TSGLPQDQKQ-SPVTAMKPPQEGLVNVQQTHGSASTEEEE--PIEESPQGT---EKPEDL 588

Query: 1899 MMEH--------HAQHSVMDFYEKLEEPNRDN 1970
            +M            Q S++  YEK+EE +  N
Sbjct: 589  VMRELIQQQQQLQQQESMIGEYEKIEESHNYN 620



 Score = 93.2 bits (230), Expect = 4e-16
 Identities = 69/257 (26%), Positives = 114/257 (44%), Gaps = 1/257 (0%)
 Frame = +3

Query: 288  ERGVLSGGNRWPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCK 467
            E+  L   +RWP+ E +AL+ +RS M+  ++D   KG LW+++S  +  +GY R+AK+CK
Sbjct: 426  EQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKRMGYNRNAKRCK 485

Query: 468  EKFENVHKYYKRTKEGRAGR-QDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXX 644
            EK+EN++KYYK+ KE    R QD K+  +F +L+ L+                       
Sbjct: 486  EKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYRNKV------------------- 526

Query: 645  XSMGIGTVITTSGRIQPNSETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLA 824
              +G G   +TSG  Q   ++      P Q GL  +              GSAST     
Sbjct: 527  --LGSGGGSSTSGLPQDQKQSPVTAMKPPQEGLVNVQQ----------THGSASTE---- 570

Query: 825  FXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQR 1004
                            E EP   + +  ++ E    R+++       +Q +++QE+M   
Sbjct: 571  ----------------EEEPIEESPQGTEKPEDLVMRELI-----QQQQQLQQQESMIGE 609

Query: 1005 FLETIEKREQDRMIREE 1055
            + +  E    + M  EE
Sbjct: 610  YEKIEESHNYNNMEEEE 626


>ref|XP_006854553.1| hypothetical protein AMTR_s00030p00088210 [Amborella trichopoda]
            gi|548858239|gb|ERN16020.1| hypothetical protein
            AMTR_s00030p00088210 [Amborella trichopoda]
          Length = 613

 Score =  439 bits (1128), Expect = e-120
 Identities = 275/627 (43%), Positives = 348/627 (55%), Gaps = 19/627 (3%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQE 332
            +AE ASP++ R     GRS   L+ELV   S            E E G + GGNRWPRQE
Sbjct: 47   MAEIASPVNIREK---GRSGSGLEELVGQVSGGYGEEEGFGVEERESGGV-GGNRWPRQE 102

Query: 333  TIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKE 512
            T+ALLKIRS+MDAAFRDATLKGPLW+DVSRKLAELGY RSAKKCKEKFENVHKYYKRTK+
Sbjct: 103  TLALLKIRSDMDAAFRDATLKGPLWEDVSRKLAELGYNRSAKKCKEKFENVHKYYKRTKD 162

Query: 513  GRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGRIQ 692
            GRAGRQDGK+YRFF+QLEAL+                        +    TV+ T+G + 
Sbjct: 163  GRAGRQDGKTYRFFTQLEALN------SNNNNPIPSTNANININTTTSNNTVVATAGILA 216

Query: 693  PNSETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXXXXXXXXXXL 872
             N   AT +   T        P++ + G+    SGS+S +                    
Sbjct: 217  GNQIKATQSTFSTDF------PVNQTAGISFS-SGSSSDSG------------------- 250

Query: 873  EGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEKREQDRMIRE 1052
            +   ++  T KRK G      K+M FFE LMKQVIE+QE +QQ+FL+TIEKRE++R +RE
Sbjct: 251  QKNSNSGETHKRKCG------KIMAFFENLMKQVIEKQEELQQKFLDTIEKREEERAMRE 304

Query: 1053 EAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVAIXXXXXXXX 1232
            EAWK QEMAR++            SA++DAA+IAFLQK +GQ++Q+P S           
Sbjct: 305  EAWKRQEMARVSREQEMLAHERALSASKDAAVIAFLQKFSGQNVQIPTS----------- 353

Query: 1233 XXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELVVAAIPEQQHPPQEMXXXXXXSLDQS 1412
                                   +    P T E     I                  + +
Sbjct: 354  -------------------FPASVPAANPGTQETQANEIEYNHDGGVLAREREVVCFEVA 394

Query: 1413 SSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENIN 1592
            SSRWPKAEV ALI LRSGL+ RY++ GPKGPLWEE+SAGM R+GY+RS+KRCKEKWENIN
Sbjct: 395  SSRWPKAEVHALIKLRSGLEFRYRETGPKGPLWEEVSAGMARLGYSRSAKRCKEKWENIN 454

Query: 1593 KYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNNEQQQLDSSTNDS 1772
            KYFKKVKES+KKRP+DAKTCPYF+QL++LY+K+    + S    N+ NE +       + 
Sbjct: 455  KYFKKVKESDKKRPQDAKTCPYFNQLEELYKKRFKHSIDS----NKKNEGE-------EE 503

Query: 1773 NPTANLQAIMAXXXXXETNEAENKNIDDNNGGSI-------EVKKPEDIM---------- 1901
             P A L  +              + + D+ GG++       E KKPED+M          
Sbjct: 504  RPMAILPPV--------------EQMPDSGGGAVRLFQGHEEAKKPEDLMRGIMNSQQKQ 549

Query: 1902 MEHHAQHSVMDFY--EKLEEPNRDNQD 1976
             +   Q SVMD Y  E L   + D  D
Sbjct: 550  QQQEQQESVMDDYDAENLNATHEDEDD 576


>ref|XP_006415119.1| hypothetical protein EUTSA_v10007000mg [Eutrema salsugineum]
            gi|557092890|gb|ESQ33472.1| hypothetical protein
            EUTSA_v10007000mg [Eutrema salsugineum]
          Length = 667

 Score =  432 bits (1112), Expect = e-118
 Identities = 281/651 (43%), Positives = 355/651 (54%), Gaps = 45/651 (6%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQE 332
            V E ASPISSR P        NL+EL+  S+                   S GNRWPR+E
Sbjct: 18   VVEEASPISSRPP-------ANLEELMRFSAAADDGGGLGGGGGGSSSS-SSGNRWPREE 69

Query: 333  TIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKE 512
            T+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RSAKKCKEKFENV KYYKRTKE
Sbjct: 70   TLALLRIRSDMDSTFRDATLKAPLWEHVSRKLMELGYKRSAKKCKEKFENVQKYYKRTKE 129

Query: 513  GRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGRIQ 692
             R GR DGK+Y+FFSQLEAL+                        S+ +   ++ +  IQ
Sbjct: 130  TRGGRHDGKAYKFFSQLEALNTTPPPLPPPPPPSS----------SLDVAP-LSVANPIQ 178

Query: 693  PNSE--------------------TATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTA 812
            P S+                    T +   PP  +  P M P     GV    S S+STA
Sbjct: 179  PQSQFPVFPLPQTQPQPSQPHLTHTVSFTTPPPPLPPPPMGPT--FPGVTFS-SHSSSTA 235

Query: 813  VGLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESSSNR--KMMGFFEGLMKQVIERQ 986
             G+                ++ + +  ++RKRKRG        KMM  FEGL++QV+E+Q
Sbjct: 236  SGMG---------SDDDDDMDLDEAGPSSRKRKRGNRGGGEGGKMMELFEGLVRQVMEKQ 286

Query: 987  EAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQK 1166
             AMQ+ FL+ +EKREQ+R+ REEAWK QEM+RL             SA+RDAAII+ +QK
Sbjct: 287  AAMQRSFLDALEKREQERLQREEAWKRQEMSRLAREHEVMSQERAASASRDAAIISLIQK 346

Query: 1167 ITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELV--- 1337
            ITG +IQLPPS++                              K +   Q TTT+     
Sbjct: 347  ITGHTIQLPPSLS--------------SSQPPQPPHQPPPPAAKRVEPPQTTTTQSQSQP 392

Query: 1338 VAAIPEQQ-----HPP---------QEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDS 1475
            + AIP+QQ     HPP         QEM      S   SSSRWPKAE+LALI+LRSG++ 
Sbjct: 393  IMAIPQQQILPPPHPPAPPPAPHQQQEMIMSPEQS-SPSSSRWPKAEILALINLRSGMEP 451

Query: 1476 RYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCP 1655
            RYQD  PKG LWEEISA M RMGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCP
Sbjct: 452  RYQDNVPKGLLWEEISASMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCP 511

Query: 1656 YFHQLDDLYRKKMLGGVGSSSFVNQNNEQQQLDSSTNDSNPTANLQAIMAXXXXXETNEA 1835
            YFH+LD LYR K+LG    SS      EQ       +  +       +         +  
Sbjct: 512  YFHRLDLLYRNKVLGNGSGSSTSGLPQEQTSTVQKQSPVSKPPQEGVVNIHQPHGSASSE 571

Query: 1836 ENKNIDDNNGGSIEVKKPEDIMM------EHHAQHSVMDFYEKLEEPNRDN 1970
            E + I+++  G+   +KPEDI+M      +   Q S++  YEK+EE +  N
Sbjct: 572  EEERIEESLQGT---EKPEDIVMRELMQQQQQQQESMIGEYEKIEESHNYN 619


>emb|CAE02791.2| OSJNBa0011L07.15 [Oryza sativa Japonica Group]
            gi|116310385|emb|CAH67396.1| H0115B09.8 [Oryza sativa
            Indica Group] gi|218195298|gb|EEC77725.1| hypothetical
            protein OsI_16822 [Oryza sativa Indica Group]
          Length = 739

 Score =  432 bits (1111), Expect = e-118
 Identities = 276/657 (42%), Positives = 339/657 (51%), Gaps = 63/657 (9%)
 Frame = +3

Query: 303  SGGNRWPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFEN 482
            + GNRWPR+ET+AL++IRSEMDA FRDATLKGPLW++VSRKLAELGY RSAKKCKEKFEN
Sbjct: 95   AAGNRWPREETLALIRIRSEMDATFRDATLKGPLWEEVSRKLAELGYKRSAKKCKEKFEN 154

Query: 483  VHKYYKRTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSM--- 653
            VHKYYKRTKEGRAGRQDGKSYRFF++LEALH                             
Sbjct: 155  VHKYYKRTKEGRAGRQDGKSYRFFTELEALHAAAPQTPQPQQQQQQQLPPVTSSAPAMHA 214

Query: 654  ---------GIGTVITTSGRIQPNSETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSAS 806
                      +  +    G IQP   ++   AP   + LP   P++L G     +SGS S
Sbjct: 215  FAPPVPAPPPMSAMPPPPGPIQPAPISSA--APAVPLELPPQPPINLQGLSFSSMSGSES 272

Query: 807  TAVGLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQ 986
                                 +  E   +  R  KR   +  +++  FFEGL+KQV++RQ
Sbjct: 273  D-------------DESEDDEMTAETGGSQDRLGKRKRGAGGKRLATFFEGLIKQVVDRQ 319

Query: 987  EAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQK 1166
            E MQ+RFLET+EKRE +R  REEAW+ QE+ARLN            +A+RDAAII+FLQ+
Sbjct: 320  EEMQRRFLETMEKREAERTAREEAWRRQEVARLNREQEQLAQERAAAASRDAAIISFLQR 379

Query: 1167 ITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTE----- 1331
            I GQS+Q+PP+  +                            K+   +HQP  T      
Sbjct: 380  IGGQSVQVPPAATVIQMPTPVQLQTPPPV-------------KQPARQHQPQPTPPPPQA 426

Query: 1332 LVVAAIPEQQHPPQ-----------------------------------EMXXXXXXSLD 1406
              + A P QQ PPQ                                   E          
Sbjct: 427  APIPAAPLQQQPPQPQHKETIHHEAVTPRRAPPTSGSSLELVPAAEQHVESGLGGGEGGS 486

Query: 1407 QSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWEN 1586
             SSSRWPK EV ALI LR  LD RYQ+ GPKGPLWEEIS+GM R+GYNRSSKRCKEKWEN
Sbjct: 487  ASSSRWPKTEVQALIQLRMELDMRYQETGPKGPLWEEISSGMRRLGYNRSSKRCKEKWEN 546

Query: 1587 INKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKML--GGVGSSSFVN--------QNN 1736
            INKYFKKVKESNKKRPED+KTCPYFHQLD +YR+K L  GG G +S  N        QN 
Sbjct: 547  INKYFKKVKESNKKRPEDSKTCPYFHQLDVIYRRKHLTGGGGGGASAANVAATAIEHQNP 606

Query: 1737 EQQQLD-SSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDNNGGSIEVKKPEDIMMEHH 1913
             + +++  + ND++   N     A       + A      D + G   +KKPEDI+ E  
Sbjct: 607  NRHEIEGKNINDNDKRKNGGGGGAQVPTSNGDTAPTTATFDVDSG---MKKPEDIVRELS 663

Query: 1914 AQHSVMDFYEKLEEPNRDNQDHXXXXXXXXXXXXXXXXXXXXXXRKMGYKIQFQRPN 2084
             Q        +      D+ D                        KM Y+IQFQRPN
Sbjct: 664  EQPP-----REFTTDETDSDD--------MGDDYTDDGEEGEDDGKMQYRIQFQRPN 707



 Score = 92.4 bits (228), Expect = 7e-16
 Identities = 48/132 (36%), Positives = 76/132 (57%), Gaps = 1/132 (0%)
 Frame = +3

Query: 183 RAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQETIALLKIRSE 362
           RAPP  G S     ELV  +                 G  +  +RWP+ E  AL+++R E
Sbjct: 456 RAPPTSGSSL----ELVPAAEQHVESGLGGG-----EGGSASSSRWPKTEVQALIQLRME 506

Query: 363 MDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKEGRAGR-QDGK 539
           +D  +++   KGPLW+++S  +  LGY RS+K+CKEK+EN++KY+K+ KE    R +D K
Sbjct: 507 LDMRYQETGPKGPLWEEISSGMRRLGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDSK 566

Query: 540 SYRFFSQLEALH 575
           +  +F QL+ ++
Sbjct: 567 TCPYFHQLDVIY 578


>ref|XP_002893757.1| hypothetical protein ARALYDRAFT_473497 [Arabidopsis lyrata subsp.
            lyrata] gi|297339599|gb|EFH70016.1| hypothetical protein
            ARALYDRAFT_473497 [Arabidopsis lyrata subsp. lyrata]
          Length = 667

 Score =  432 bits (1110), Expect = e-118
 Identities = 278/640 (43%), Positives = 354/640 (55%), Gaps = 34/640 (5%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQE 332
            V E ASPISSR P      + NL+EL+  S+                   S GNRWPR+E
Sbjct: 13   VVEEASPISSRPP------ANNLEELMRFSAAADDGGGGGGGGGGSASS-SSGNRWPREE 65

Query: 333  TIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKE 512
            T+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RS+KKCKEKFENV KYYKRTKE
Sbjct: 66   TLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENVQKYYKRTKE 125

Query: 513  GRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGR-- 686
             R GR DGK+Y+FFSQLEAL+                        S     V +      
Sbjct: 126  TRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPTSSSSPFPVFSQPQPQP 185

Query: 687  --IQPNSETATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXXXXXXX 860
              +Q ++ + TP  PP    LP M P     GV    S S+STA G+             
Sbjct: 186  QPLQTHNVSFTPTPPPPP--LPSMVPT--FPGVTFS-SHSSSTASGMGSDDDDDEMDVD- 239

Query: 861  XXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEKREQDR 1040
                +   + +++RKRKRG      KMM  FEGL++QV+++Q AMQ+ FLE +EKREQ+R
Sbjct: 240  ----QANIAGSSSRKRKRGNRGGGGKMMKLFEGLVRQVMQKQAAMQRSFLEALEKREQER 295

Query: 1041 MIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVAIXXXX 1220
            + REEAWK QEMARL             SA+RDAAII+ +QKITG +IQLPPS++     
Sbjct: 296  LDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGHTIQLPPSLSSQPPQ 355

Query: 1221 XXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELVVAAIPEQQ------------- 1361
                                    + ++   QP      + AIP+QQ             
Sbjct: 356  PPPPPYQPPPAVAKRVAEPPLSTAQSQL--QQP------IMAIPQQQILPPPPPPPPHLP 407

Query: 1362 HPP------------QEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGP 1505
            H P            QEM      S   SSSRWPKAE+LALI+LRSG++ RYQD  PKG 
Sbjct: 408  HQPEQKQQQQQQPQQQEMVMSSEQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGL 467

Query: 1506 LWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYR 1685
            LWEEIS  M RMGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCPYFH+LD LYR
Sbjct: 468  LWEEISTSMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYR 527

Query: 1686 KKML--GGVGSSSFVNQNNEQQQLDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDDN 1859
             K+L  GG  S+S + Q+ +Q  + +         N+Q           +  E + I+++
Sbjct: 528  NKVLGSGGGSSTSGLPQDQKQSPVPAMKLPQEGLVNVQ-----QPHGSASSEEEEPIEES 582

Query: 1860 NGGSIEVKKPEDIMME---HHAQHSVMDFYEKLEEPNRDN 1970
              G+   +KPED++M       Q S++  YEK+EE +  N
Sbjct: 583  PQGT---EKPEDLVMRELMQQQQESMIGEYEKIEESHNYN 619


>gb|AAG51283.1|AC027035_6 trihelix DNA-binding protein (GTL1) [Arabidopsis thaliana]
          Length = 594

 Score =  426 bits (1096), Expect = e-116
 Identities = 260/550 (47%), Positives = 316/550 (57%), Gaps = 18/550 (3%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSG--GNRWPR 326
            V E ASPISSR P      + NL+EL+  S+                G  S   GNRWPR
Sbjct: 12   VVEEASPISSRPP------ANNLEELMRFSAAADDGGLGGGGGGGGGGSASSSSGNRWPR 65

Query: 327  QETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRT 506
            +ET+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RS+KKCKEKFENV KYYKRT
Sbjct: 66   EETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENVQKYYKRT 125

Query: 507  KEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGR 686
            KE R GR DGK+Y+FFSQLEAL+                        S     V +    
Sbjct: 126  KETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSPFPVFSQP-- 183

Query: 687  IQPNSETATPNA-------PPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXX 845
             QP ++T  P          P  + LP M P+    GV    S S+STA G+        
Sbjct: 184  -QPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFT--GVTFS-SHSSSTASGMGSDDDDDD 239

Query: 846  XXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEK 1025
                     +   + +++RKRKRG      KMM  FEGL++QV+++Q AMQ+ FLE +EK
Sbjct: 240  MDVD-----QANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEK 294

Query: 1026 REQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVA 1205
            REQ+R+ REEAWK QEMARL             SA+RDAAII+ +QKITG +IQLPPS++
Sbjct: 295  REQERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGHTIQLPPSLS 354

Query: 1206 IXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQ------PTTTELVVAAIPEQ--- 1358
                                         ++ I+         P       A  PEQ   
Sbjct: 355  SQPPPPYQPPPAVTKRVAEPPLSTAQSQSQQPIMAIPQQQILPPPPPSHPHAHQPEQKQQ 414

Query: 1359 QHPPQEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHR 1538
            Q P QEM      S   SSSRWPKAE+LALI+LRSG++ RYQD  PKG LWEEIS  M R
Sbjct: 415  QQPQQEMVMSSEQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKR 474

Query: 1539 MGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSS 1718
            MGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCPYFH+LD LYR K+LG  G SS
Sbjct: 475  MGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYRNKVLGSGGGSS 534

Query: 1719 FVNQNNEQQQ 1748
                  +Q+Q
Sbjct: 535  TSGLPQDQKQ 544


>sp|Q9C882.2|GTL1_ARATH RecName: Full=Trihelix transcription factor GTL1; AltName:
            Full=GT2-LIKE protein 1; Short=AtGTL1; Short=Protein
            GT-2-LIKE1; AltName: Full=Trihelix DNA-binding protein
            GTL1
          Length = 587

 Score =  426 bits (1096), Expect = e-116
 Identities = 260/550 (47%), Positives = 316/550 (57%), Gaps = 18/550 (3%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSG--GNRWPR 326
            V E ASPISSR P      + NL+EL+  S+                G  S   GNRWPR
Sbjct: 12   VVEEASPISSRPP------ANNLEELMRFSAAADDGGLGGGGGGGGGGSASSSSGNRWPR 65

Query: 327  QETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRT 506
            +ET+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RS+KKCKEKFENV KYYKRT
Sbjct: 66   EETLALLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENVQKYYKRT 125

Query: 507  KEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGR 686
            KE R GR DGK+Y+FFSQLEAL+                        S     V +    
Sbjct: 126  KETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSPFPVFSQP-- 183

Query: 687  IQPNSETATPNA-------PPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXX 845
             QP ++T  P          P  + LP M P+    GV    S S+STA G+        
Sbjct: 184  -QPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFT--GVTFS-SHSSSTASGMGSDDDDDD 239

Query: 846  XXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEK 1025
                     +   + +++RKRKRG      KMM  FEGL++QV+++Q AMQ+ FLE +EK
Sbjct: 240  MDVD-----QANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEK 294

Query: 1026 REQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVA 1205
            REQ+R+ REEAWK QEMARL             SA+RDAAII+ +QKITG +IQLPPS++
Sbjct: 295  REQERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGHTIQLPPSLS 354

Query: 1206 IXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQ------PTTTELVVAAIPEQ--- 1358
                                         ++ I+         P       A  PEQ   
Sbjct: 355  SQPPPPYQPPPAVTKRVAEPPLSTAQSQSQQPIMAIPQQQILPPPPPSHPHAHQPEQKQQ 414

Query: 1359 QHPPQEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHR 1538
            Q P QEM      S   SSSRWPKAE+LALI+LRSG++ RYQD  PKG LWEEIS  M R
Sbjct: 415  QQPQQEMVMSSEQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKR 474

Query: 1539 MGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSS 1718
            MGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCPYFH+LD LYR K+LG  G SS
Sbjct: 475  MGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYRNKVLGSGGGSS 534

Query: 1719 FVNQNNEQQQ 1748
                  +Q+Q
Sbjct: 535  TSGLPQDQKQ 544


>ref|XP_002464376.1| hypothetical protein SORBIDRAFT_01g017120 [Sorghum bicolor]
            gi|241918230|gb|EER91374.1| hypothetical protein
            SORBIDRAFT_01g017120 [Sorghum bicolor]
          Length = 807

 Score =  426 bits (1096), Expect = e-116
 Identities = 270/575 (46%), Positives = 318/575 (55%), Gaps = 47/575 (8%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEM---ERGVLSG--GNR 317
            +AE  SPISSR P      +   DEL A  +                 E G   G  GNR
Sbjct: 47   LAEAPSPISSRPPASSSAPAQQYDELGASGAGAVLGFDAEGLAAAAAGEEGASGGSAGNR 106

Query: 318  WPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYY 497
            WPRQET+ LLKIRS+MDAAFRDATLKGPLW+ VSRKLA+ GY RSAKKCKEKFENVHKYY
Sbjct: 107  WPRQETLELLKIRSDMDAAFRDATLKGPLWEQVSRKLADKGYSRSAKKCKEKFENVHKYY 166

Query: 498  KRTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITT 677
            KRTKE RAGR DGK+YRFF+QLEALH                                  
Sbjct: 167  KRTKESRAGRNDGKTYRFFTQLEALHG--------------------------------- 193

Query: 678  SGRIQPNSETAT--PNAPPTQIGLP-RMNPLDLSGGVQIGVSG--SASTAVGLAFXXXXX 842
            +G   P S  A+  P A P+ + +P    P  L+GGV +   G  S ST+    +     
Sbjct: 194  TGGAAPASSVASQVPPAGPSAVRVPAEPPPAVLAGGVGMPTMGYPSFSTSNTEDYTDEDD 253

Query: 843  XXXXXXXXXLEGEPSNTNTR-KRKR----GESSS---NRKMMGFFEGLMKQVIERQEAMQ 998
                     + G     + R KRKR    G S++   + KMM FFEGLMKQV+ERQEAMQ
Sbjct: 254  SDDEGTQELVGGGGGGADERGKRKRVSEGGASAAGGGSGKMMRFFEGLMKQVMERQEAMQ 313

Query: 999  QRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQ 1178
            QRFLE IEKREQDRMIREEAW+ QEM RL             +A+RDAA+++F+QKITGQ
Sbjct: 314  QRFLEAIEKREQDRMIREEAWRRQEMTRLAREQEILAQERAMAASRDAAVLSFIQKITGQ 373

Query: 1179 SIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELVVAAIPEQ 1358
            +I +P   A                                    QP  ++       +Q
Sbjct: 374  TIPMPSIAAPTINAMPPPPPSHPKPPPPQPHPTPIASASPAPPPPQPPASQTPPPQQQQQ 433

Query: 1359 QHPPQ-----EMXXXXXXSLD-----------------------QSSSRWPKAEVLALIS 1454
            Q PP      +       S+D                        +SSRWPKAEV ALI 
Sbjct: 434  QKPPMPASTPQAPAPQQQSMDIVMTTAETTPRADTPVHEGSSGGATSSRWPKAEVHALIQ 493

Query: 1455 LRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKESNKKRP 1634
            LRS LD+RYQ+AGPKGPLWEEISAGM R+GYNR++KRCKEKWENINKYFKKVKESNKKRP
Sbjct: 494  LRSNLDTRYQEAGPKGPLWEEISAGMRRLGYNRNAKRCKEKWENINKYFKKVKESNKKRP 553

Query: 1635 EDAKTCPYFHQLDDLYRKK-MLGGVGSSSFVNQNN 1736
            ED+KTCPYFHQLD LYR K  L   G+ + V+  N
Sbjct: 554  EDSKTCPYFHQLDALYRNKAALSSSGAGAVVHAVN 588


>emb|CAA05995.1| GTL1 [Arabidopsis thaliana]
          Length = 594

 Score =  425 bits (1092), Expect = e-116
 Identities = 259/550 (47%), Positives = 315/550 (57%), Gaps = 18/550 (3%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSG--GNRWPR 326
            V E ASPISSR P      + NL+EL+  S+                G  S   GNRWPR
Sbjct: 12   VVEEASPISSRPP------ANNLEELMRFSAAADDGGLGGGGGGGGGGSASSSSGNRWPR 65

Query: 327  QETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRT 506
            +ET+ LL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RS+KKCKEKFENV KYYKRT
Sbjct: 66   EETLVLLRIRSDMDSTFRDATLKAPLWEHVSRKLLELGYKRSSKKCKEKFENVQKYYKRT 125

Query: 507  KEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGR 686
            KE R GR DGK+Y+FFSQLEAL+                        S     V +    
Sbjct: 126  KETRGGRHDGKAYKFFSQLEALNTTPPSSSLDVTPLSVANPILMPSSSSSPFPVFSQP-- 183

Query: 687  IQPNSETATPNA-------PPTQIGLPRMNPLDLSGGVQIGVSGSASTAVGLAFXXXXXX 845
             QP ++T  P          P  + LP M P+    GV    S S+STA G+        
Sbjct: 184  -QPQTQTQPPQTHNVSFTPTPPPLPLPSMGPIFT--GVTFS-SHSSSTASGMGSDDDDDD 239

Query: 846  XXXXXXXXLEGEPSNTNTRKRKRGESSSNRKMMGFFEGLMKQVIERQEAMQQRFLETIEK 1025
                     +   + +++RKRKRG      KMM  FEGL++QV+++Q AMQ+ FLE +EK
Sbjct: 240  MDVD-----QANIAGSSSRKRKRGNRGGGGKMMELFEGLVRQVMQKQAAMQRSFLEALEK 294

Query: 1026 REQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQKITGQSIQLPPSVA 1205
            REQ+R+ REEAWK QEMARL             SA+RDAAII+ +QKITG +IQLPPS++
Sbjct: 295  REQERLDREEAWKRQEMARLAREHEVMSQERAASASRDAAIISLIQKITGHTIQLPPSLS 354

Query: 1206 IXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQ------PTTTELVVAAIPEQ--- 1358
                                         ++ I+         P       A  PEQ   
Sbjct: 355  SQPPPPYQPPPAVTKRVAEPPLSTAQSQSQQPIMAIPQQQILPPPPPSHPHAHQPEQKQQ 414

Query: 1359 QHPPQEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHR 1538
            Q P QEM      S   SSSRWPKAE+LALI+LRSG++ RYQD  PKG LWEEIS  M R
Sbjct: 415  QQPQQEMVMSSEQSSLPSSSRWPKAEILALINLRSGMEPRYQDNVPKGLLWEEISTSMKR 474

Query: 1539 MGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSS 1718
            MGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCPYFH+LD LYR K+LG  G SS
Sbjct: 475  MGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCPYFHRLDLLYRNKVLGSGGGSS 534

Query: 1719 FVNQNNEQQQ 1748
                  +Q+Q
Sbjct: 535  TSGLPQDQKQ 544


>ref|XP_006415118.1| hypothetical protein EUTSA_v10007000mg [Eutrema salsugineum]
            gi|557092889|gb|ESQ33471.1| hypothetical protein
            EUTSA_v10007000mg [Eutrema salsugineum]
          Length = 588

 Score =  417 bits (1072), Expect = e-113
 Identities = 264/562 (46%), Positives = 323/562 (57%), Gaps = 40/562 (7%)
 Frame = +3

Query: 153  VAEGASPISSRAPPVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGGNRWPRQE 332
            V E ASPISSR P        NL+EL+  S+                   S GNRWPR+E
Sbjct: 18   VVEEASPISSRPP-------ANLEELMRFSAAADDGGGLGGGGGGSSSS-SSGNRWPREE 69

Query: 333  TIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCKEKFENVHKYYKRTKE 512
            T+ALL+IRS+MD+ FRDATLK PLW+ VSRKL ELGY RSAKKCKEKFENV KYYKRTKE
Sbjct: 70   TLALLRIRSDMDSTFRDATLKAPLWEHVSRKLMELGYKRSAKKCKEKFENVQKYYKRTKE 129

Query: 513  GRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXXSMGIGTVITTSGRIQ 692
             R GR DGK+Y+FFSQLEAL+                        S+ +   ++ +  IQ
Sbjct: 130  TRGGRHDGKAYKFFSQLEALNTTPPPLPPPPPPSS----------SLDVAP-LSVANPIQ 178

Query: 693  PNSE--------------------TATPNAPPTQIGLPRMNPLDLSGGVQIGVSGSASTA 812
            P S+                    T +   PP  +  P M P     GV    S S+STA
Sbjct: 179  PQSQFPVFPLPQTQPQPSQPHLTHTVSFTTPPPPLPPPPMGPT--FPGVTFS-SHSSSTA 235

Query: 813  VGLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGESSSNR--KMMGFFEGLMKQVIERQ 986
             G+                ++ + +  ++RKRKRG        KMM  FEGL++QV+E+Q
Sbjct: 236  SGMG---------SDDDDDMDLDEAGPSSRKRKRGNRGGGEGGKMMELFEGLVRQVMEKQ 286

Query: 987  EAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXXSATRDAAIIAFLQK 1166
             AMQ+ FL+ +EKREQ+R+ REEAWK QEM+RL             SA+RDAAII+ +QK
Sbjct: 287  AAMQRSFLDALEKREQERLQREEAWKRQEMSRLAREHEVMSQERAASASRDAAIISLIQK 346

Query: 1167 ITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEIIRHQPTTTELV--- 1337
            ITG +IQLPPS++                              K +   Q TTT+     
Sbjct: 347  ITGHTIQLPPSLS--------------SSQPPQPPHQPPPPAAKRVEPPQTTTTQSQSQP 392

Query: 1338 VAAIPEQQ-----HPP---------QEMXXXXXXSLDQSSSRWPKAEVLALISLRSGLDS 1475
            + AIP+QQ     HPP         QEM      S   SSSRWPKAE+LALI+LRSG++ 
Sbjct: 393  IMAIPQQQILPPPHPPAPPPAPHQQQEMIMSPEQS-SPSSSRWPKAEILALINLRSGMEP 451

Query: 1476 RYQDAGPKGPLWEEISAGMHRMGYNRSSKRCKEKWENINKYFKKVKESNKKRPEDAKTCP 1655
            RYQD  PKG LWEEISA M RMGYNR++KRCKEKWENINKY+KKVKESNKKRP+DAKTCP
Sbjct: 452  RYQDNVPKGLLWEEISASMKRMGYNRNAKRCKEKWENINKYYKKVKESNKKRPQDAKTCP 511

Query: 1656 YFHQLDDLYRKKMLG-GVGSSS 1718
            YFH+LD LYR K+LG G GSS+
Sbjct: 512  YFHRLDLLYRNKVLGNGSGSST 533


>ref|XP_002448251.1| hypothetical protein SORBIDRAFT_06g023980 [Sorghum bicolor]
            gi|241939434|gb|EES12579.1| hypothetical protein
            SORBIDRAFT_06g023980 [Sorghum bicolor]
          Length = 770

 Score =  416 bits (1069), Expect = e-113
 Identities = 288/745 (38%), Positives = 351/745 (47%), Gaps = 104/745 (13%)
 Frame = +3

Query: 162  GASPISSRAP----------PVGGRSSGNLDELVAVSSXXXXXXXXXXXXEMERGVLSGG 311
            G  P+SSR P          P   +   + DEL  VS                    SGG
Sbjct: 36   GPMPLSSRPPSSAQPQPQPQPQPQQPRSSYDELAVVSGTAGAGGFDDEMMGSGGSCGSGG 95

Query: 312  --------NRWPRQETIALLKIRSEMDAAFRDATLKGPLWDDVSRKLAELGYVRSAKKCK 467
                    NRWPR+ET AL++IRSEMDA FRDATLKGPLW+DVSRKLA+LGY RSAKKCK
Sbjct: 96   GGSSGASSNRWPREETQALIRIRSEMDATFRDATLKGPLWEDVSRKLADLGYKRSAKKCK 155

Query: 468  EKFENVHKYYKRTKEGRAGRQDGKSYRFFSQLEALHXXXXXXXXXXXXXXXXXXXXXXXX 647
            EKFENVHKYYKRTKEGRAGRQDGKSYRFF +LEALH                        
Sbjct: 156  EKFENVHKYYKRTKEGRAGRQDGKSYRFFDELEALHAAAPQPQPQPQPPQMQQQQLPPAT 215

Query: 648  SMG-----------IGTVITTSGRIQPNS-ETATP-----NAPPTQIGLPRMNPLDLSGG 776
            +             + ++   +G +QP    +A P     +  P ++      PL+L G 
Sbjct: 216  TAPAPLHAFAAPPPMSSMPPPTGPMQPAPISSAAPAVVQVHQAPVELPPAAHQPLNLQGF 275

Query: 777  VQIGVSGSASTAVGLAFXXXXXXXXXXXXXXLEGEPSNTNTRKRKRGE----SSSNRKMM 944
                +S S S                      E   S     KRKRG+    S S++KMM
Sbjct: 276  SFSSMSDSESD-----------DESEDDDMTAETGGSQDRLGKRKRGDGGGASGSSKKMM 324

Query: 945  GFFEGLMKQVIERQEAMQQRFLETIEKREQDRMIREEAWKHQEMARLNXXXXXXXXXXXX 1124
             FFEGLM+QV++RQE MQ+RFLET+EKRE +R  REEAW+ QE+ARLN            
Sbjct: 325  TFFEGLMQQVVDRQEEMQRRFLETMEKREAERTAREEAWRRQEVARLNREQEQLAQERAA 384

Query: 1125 SATRDAAIIAFLQKITGQSIQLPPSVAIXXXXXXXXXXXXXXXXXXXXXXXXXXXXKKEI 1304
            +A+RDAAIIAFLQ+I GQS+Q   +V +                             +  
Sbjct: 385  AASRDAAIIAFLQRIGGQSVQPATAVVVPMPAPVPVHTPPPPKQQSRQQQPPPPPSPQAT 444

Query: 1305 IRHQPTTTELVVAAIPEQQHPPQEMXXXXXXSLDQ------------------------- 1409
             + +P      ++A P QQ PPQ+         D                          
Sbjct: 445  PQSKP------ISAAPLQQQPPQKQPKDTSSQQDAGTPRSAPPTSGASLELVPVAEHHVD 498

Query: 1410 -----------SSSRWPKAEVLALISLRSGLDSRYQDAGPKGPLWEEISAGMHRMGYNRS 1556
                       SSSRWPK EV ALI LR  LD RYQ+ GPKGPLWE+IS+GM R+GYNRS
Sbjct: 499  SGLGGGDGGAASSSRWPKTEVHALIQLRMDLDMRYQETGPKGPLWEDISSGMRRLGYNRS 558

Query: 1557 SKRCKEKWENINKYFKKVKESNKKRPEDAKTCPYFHQLDDLYRKKMLGGVGSSSFVNQNN 1736
            SKRCKEKWENINKY+KKVKESNKKRPED+KTCPYFHQL+ +Y +K L    +SS      
Sbjct: 559  SKRCKEKWENINKYYKKVKESNKKRPEDSKTCPYFHQLEAIYSRKHLRAAAASS------ 612

Query: 1737 EQQQLDSSTNDSNPTANLQAIMAXXXXXETNEAENKNIDD---NNGGS------------ 1871
                     N +                  +E E KNI+D   NNGGS            
Sbjct: 613  ---------NAAAAAVAPPPAYPDQLNPSRHEIEGKNINDDKRNNGGSGGGTQVPSSNGE 663

Query: 1872 --------------IEVKKPEDIMMEHHAQHSVMDFYEKLEEPNRDNQDHXXXXXXXXXX 2009
                            +KKPEDI+ E +            E+P R+              
Sbjct: 664  TTAPTTTPAAFDADTGMKKPEDIVRELN------------EQPPREFTTEDETDSDDMGD 711

Query: 2010 XXXXXXXXXXXXRKMGYKIQFQRPN 2084
                         KM Y+IQFQRPN
Sbjct: 712  EYTDDGEEGEDDGKMQYRIQFQRPN 736

