BLASTX nr result

ID: Rheum21_contig00001502 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00001502
         (1618 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270792.1| PREDICTED: uncharacterized protein LOC100261...   286   2e-74
gb|EXB93201.1| hypothetical protein L484_024539 [Morus notabilis]     270   1e-69
ref|XP_002521956.1| DNA binding protein, putative [Ricinus commu...   270   2e-69
ref|XP_006442722.1| hypothetical protein CICLE_v10020621mg [Citr...   269   2e-69
gb|EMJ06633.1| hypothetical protein PRUPE_ppa007786mg [Prunus pe...   267   8e-69
ref|XP_006487736.1| PREDICTED: uncharacterized protein LOC102616...   267   1e-68
gb|EOY10989.1| AT hook motif DNA-binding family protein [Theobro...   266   2e-68
ref|XP_006442721.1| hypothetical protein CICLE_v10020621mg [Citr...   265   3e-68
ref|XP_003521778.1| PREDICTED: putative DNA-binding protein ESCA...   261   8e-67
ref|XP_003554726.1| PREDICTED: putative DNA-binding protein ESCA...   255   3e-65
ref|XP_006577325.1| PREDICTED: putative DNA-binding protein ESCA...   254   1e-64
ref|XP_004494753.1| PREDICTED: putative DNA-binding protein ESCA...   253   2e-64
ref|XP_003626475.1| hypothetical protein MTR_7g116320 [Medicago ...   252   3e-64
ref|XP_006604863.1| PREDICTED: putative DNA-binding protein ESCA...   250   1e-63
gb|ESW19221.1| hypothetical protein PHAVU_006G106500g [Phaseolus...   246   2e-62
ref|XP_004139392.1| PREDICTED: uncharacterized protein LOC101221...   243   1e-61
ref|NP_187109.2| AT hook motif DNA-binding family protein [Arabi...   243   2e-61
ref|XP_006297823.1| hypothetical protein CARUB_v10013864mg [Caps...   240   1e-60
ref|XP_002884453.1| hypothetical protein ARALYDRAFT_477717 [Arab...   237   9e-60
ref|XP_002332023.1| predicted protein [Populus trichocarpa] gi|5...   237   9e-60

>ref|XP_002270792.1| PREDICTED: uncharacterized protein LOC100261576 [Vitis vinifera]
            gi|296087886|emb|CBI35169.3| unnamed protein product
            [Vitis vinifera]
          Length = 357

 Score =  286 bits (732), Expect = 2e-74
 Identities = 166/382 (43%), Positives = 214/382 (56%), Gaps = 11/382 (2%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNP-------HATSGVPTSVSQVTNGGFLPNHXXXXXX 1395
            MEP++T L  Y+H HQ  P     P       H T     + +   + G LP        
Sbjct: 1    MEPNDTRLTSYFHHHQQQPQPPPPPPPQPQPHHQTQNPVAATAASPSNGLLPPSERPPLG 60

Query: 1394 XXXXXALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXX 1215
                   Y  SV     SPP E VR+KRGRPRKY T E                      
Sbjct: 61   -------YHHSVPSAVTSPP-ETVRRKRGRPRKYGTSEQGLSAKKSPSSSVPVP------ 106

Query: 1214 XXXSRKRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQ 1035
                +K++Q L G            QL S  N GQ F PH+I V  GEDVAQKI  F+QQ
Sbjct: 107  ----KKKEQGLGGSSKKS-------QLVSLGNAGQSFTPHVITVASGEDVAQKIMFFMQQ 155

Query: 1034 SRRELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACL 855
            S+RE+CI+SASGS+ N SL QPATSGG++ Y+G F+ILSL+GS++ T+ G R+GGLS CL
Sbjct: 156  SKREICIMSASGSISNASLRQPATSGGNVAYEGRFEILSLTGSYVRTEIGGRTGGLSVCL 215

Query: 854  SASNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVAN 675
            S ++G+I+GGG+ GPL A GPV+VI+ TFL+D+KKD + G K D             V+N
Sbjct: 216  SNTDGEIIGGGVGGPLKAAGPVQVIVGTFLVDSKKDTSTGLKADASPKFTSPVGGASVSN 275

Query: 674  MGYRPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGFNYDF 504
            +G+R  +E   R  V G DDHQ +GG  FMIQ +G+QM  +  +DWR+GPD R    YD 
Sbjct: 276  VGFRSAVESSGRIPVMGNDDHQGIGGSHFMIQSRGMQMAPTRPTDWRSGPDARINVGYDL 335

Query: 503  TGRTGHGTHES-ENGDFEHLPD 441
             GR G G  +S ENGD+E +PD
Sbjct: 336  AGRGGRGACQSPENGDYEQIPD 357


>gb|EXB93201.1| hypothetical protein L484_024539 [Morus notabilis]
          Length = 357

 Score =  270 bits (691), Expect = 1e-69
 Identities = 160/376 (42%), Positives = 209/376 (55%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP+E  L+ YYH  Q         H  S    + +  TNG   P H            +
Sbjct: 1    MEPNENQLSSYYHHPQP-------HHHQSPTAAAAASPTNGLLPPTHSGDGSHM-----V 48

Query: 1373 YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
            YP SV   A + P E  ++KRGRPRKY TPE                          +K 
Sbjct: 49   YPHSVPSSAVTSPLEPSKRKRGRPRKYGTPEQALAAKKAATTLSHASAKE-------KKD 101

Query: 1193 DQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELCI 1014
                             +QLG+  N GQ F PH+INV  GEDV QKI +F+ QS+RE+CI
Sbjct: 102  HSGGAASPSYSASASKKSQLGALGNVGQGFTPHVINVSAGEDVGQKIMMFMHQSKREICI 161

Query: 1013 LSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQI 834
            LSASG++ N SL QPATSGG+ITY+G FDI+S SGS+I T+ G R+GGLS CLS+++GQI
Sbjct: 162  LSASGTISNASLRQPATSGGNITYEGRFDIISCSGSYIRTELGGRTGGLSVCLSSTDGQI 221

Query: 833  VGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDI-XXXXXXXXXXXPVANMGYRPV 657
            +GGG+ GPL A GPV+VI+ TFLIDTKKD  AG KGD               +++G+R  
Sbjct: 222  IGGGVGGPLKAAGPVQVIVGTFLIDTKKDINAGVKGDASGINLPSPVGVTSPSSVGFRSA 281

Query: 656  IEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGFNYDFTGRTGH 486
            ++   R  V G D+ Q +GG  FMIQ +G+ +  S  ++WR GPD R    Y+ +GR G 
Sbjct: 282  VDPSGRNAVRGNDEQQAIGGSHFMIQPRGMHVTPSRPTEWRPGPDARSTGGYELSGRAGL 341

Query: 485  GTHES-ENGDFEHLPD 441
              H+S ENGD+  +PD
Sbjct: 342  APHQSPENGDYVQMPD 357


>ref|XP_002521956.1| DNA binding protein, putative [Ricinus communis]
            gi|223538760|gb|EEF40360.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 364

 Score =  270 bits (689), Expect = 2e-69
 Identities = 159/378 (42%), Positives = 208/378 (55%), Gaps = 7/378 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP++T    ++H H HL +S+    +T+  P + +     G LP              +
Sbjct: 1    MEPNDT--QHHHHPHHHLSSSSYFTTSTTPAPATTTPSPTNGLLPPPPHDTGGGGGTHMV 58

Query: 1373 YPQSVA---GKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXS 1203
            YP SV        S P E  R+KRGRPRKY TPE                          
Sbjct: 59   YPHSVGPSTAAVSSAPVESPRRKRGRPRKYGTPEQALAAKKTASSSSNAVAARE------ 112

Query: 1202 RKRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRE 1023
            R+                   QL +  N GQ F PH+I+V  GEDVAQKI +F+QQ RRE
Sbjct: 113  RREAAAASSPSYSGFSSRKSQQLVALGNAGQGFTPHVISVSAGEDVAQKIMLFMQQCRRE 172

Query: 1022 LCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASN 843
            +CILSASGS+ N SL QPATSGG+ITY+G F+I+SLSGS++ T+ G R+GGLS CLS S+
Sbjct: 173  MCILSASGSISNASLRQPATSGGNITYEGRFEIISLSGSYVRTEIGGRAGGLSVCLSNSD 232

Query: 842  GQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDI-XXXXXXXXXXXPVANMGY 666
            GQI+GGG+ GPLIA GPV+VII TF++D KKD  +G K D              ++N+G+
Sbjct: 233  GQIIGGGIGGPLIAGGPVQVIIGTFVVDNKKDVGSGGKVDASSSKLPSPGGGASMSNIGF 292

Query: 665  RPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHSDWRTGPDTRGGFNYDFTGRT 492
            R   +   R+T  G DDHQ MGG  FMI  +G+       DW +G + R    ++ TGR 
Sbjct: 293  RTPTDTSGRHTFRGNDDHQTMGGNPFMIPPRGMH------DWSSGSEARVNATFELTGRR 346

Query: 491  GHGTHES-ENGDFEHLPD 441
            GHG  +S ENGD+E  PD
Sbjct: 347  GHGARQSPENGDYEQYPD 364


>ref|XP_006442722.1| hypothetical protein CICLE_v10020621mg [Citrus clementina]
            gi|557544984|gb|ESR55962.1| hypothetical protein
            CICLE_v10020621mg [Citrus clementina]
          Length = 379

 Score =  269 bits (688), Expect = 2e-69
 Identities = 164/386 (42%), Positives = 214/386 (55%), Gaps = 15/386 (3%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLP-------NHXXXXXX 1395
            MEP++T      + + H P +A     TSG   +       G LP       N+      
Sbjct: 1    MEPNDTQQLQQLNSYFHHPTAA----TTSGAAATTGPSPTNGLLPSQHQHHNNNNNNDGG 56

Query: 1394 XXXXXALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXX 1215
                  +YP SVA  A +   E  +KKRGRPRKY TPE                      
Sbjct: 57   GGGGGMVYPHSVASSAMTSTLEPAKKKRGRPRKYGTPEQALAAKKTAAYSNSKGKREQRE 116

Query: 1214 XXXSRKRDQQLIGXXXXXXXXXXXA---QLGSEANNGQCFMPHIINVQPGEDVAQKIRIF 1044
                 ++ QQL+G               QLG   N GQ F PH+I+V  GEDV QKI +F
Sbjct: 117  L---HQQQQQLLGSGGSGYSYSGAPGKSQLGGIGNLGQGFTPHVISVAAGEDVGQKIMLF 173

Query: 1043 VQQSRRELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLS 864
            +QQSRRE+CILSASGS+ N SL QPATSGG+ITY+G F+I+SLSGS++ TD G R+GGLS
Sbjct: 174  MQQSRREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLSGSYVRTDLGGRTGGLS 233

Query: 863  ACLSASNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGD-IXXXXXXXXXXX 687
             CLS+++GQI+GGG+ GPL A GPV+VI+ TF +++ KD +AG KGD             
Sbjct: 234  VCLSSTDGQIIGGGVGGPLKAAGPVQVIVGTFQVESMKDVSAGLKGDSSGSKLASPVAGA 293

Query: 686  PVANMGYRPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGF 516
             V+++G+R  IE   R  V G DD Q +GG  FMIQ  G  +  +  +DWR   DTR   
Sbjct: 294  SVSSVGFRSPIESYGRNPVRGNDDFQTIGGTHFMIQPYGNHVSPTQAADWRGSLDTRSSA 353

Query: 515  NYDFTGRTGHGTHES-ENGDFEHLPD 441
             YD TGRTG G ++S ENGD++ + D
Sbjct: 354  GYDMTGRTGRGGNQSPENGDYDQIAD 379


>gb|EMJ06633.1| hypothetical protein PRUPE_ppa007786mg [Prunus persica]
          Length = 355

 Score =  267 bits (683), Expect = 8e-69
 Identities = 157/373 (42%), Positives = 206/373 (55%), Gaps = 2/373 (0%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP+E  L+ Y+   QH   +     A +   T+ +  TNG  LPN             +
Sbjct: 1    MEPNENQLSSYF---QHPTTTTGTGTAATVTATNTASPTNG-LLPN----THSTDGSHMV 52

Query: 1373 YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
            Y  SV   A + P E  ++KRGRPRKY TPE                           K+
Sbjct: 53   YSHSVPSSAVTSPLEPAKRKRGRPRKYGTPEQALAAKKAATTSSHSSSSK-------EKK 105

Query: 1193 DQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELCI 1014
            D                 Q  S  N GQ F PH++ V  GEDV QKI  F+QQS+RE+CI
Sbjct: 106  DHHGSASPSYSGSTKKSQQF-SLGNAGQGFTPHVLTVAAGEDVGQKIMFFMQQSKREICI 164

Query: 1013 LSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQI 834
            LSASG++ N SL QPATSGG+ITY+G F+I+SLSGS++ TD G R+GGLS CLS+++GQI
Sbjct: 165  LSASGTISNASLRQPATSGGNITYEGRFEIISLSGSYVRTDLGGRAGGLSVCLSSTDGQI 224

Query: 833  VGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRPVI 654
            +GGG+ GPL A GPV+VI+ TF++D KKD TAG KGD             + N+ +R  +
Sbjct: 225  IGGGVGGPLKAAGPVQVIVGTFMVDAKKDVTAGVKGD--ASATKLPTAGEMMNVSFRSAV 282

Query: 653  EQPARYTVPGVDDHQNMGGFMIQHQGIQMGSSH-SDWRTGPDTRGGFNYDFTGRTGHGTH 477
            +   R  V G DD Q +GG     QG+ +  S  +DWR GPD RG   Y+ TGR G   H
Sbjct: 283  DSSGRTLVRGNDDQQAIGGSHFMIQGMHVAPSRPTDWRGGPDARGTGAYELTGRAGRAAH 342

Query: 476  ES-ENGDFEHLPD 441
            +S ENGD++ +PD
Sbjct: 343  QSPENGDYDQIPD 355


>ref|XP_006487736.1| PREDICTED: uncharacterized protein LOC102616826 [Citrus sinensis]
          Length = 379

 Score =  267 bits (682), Expect = 1e-68
 Identities = 166/390 (42%), Positives = 216/390 (55%), Gaps = 19/390 (4%)
 Frame = -3

Query: 1553 MEPHETG----LNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLP-------NHXX 1407
            MEP++T     LN Y+H   H  A+      TSG   +       G LP       N+  
Sbjct: 1    MEPNDTQQLQQLNSYFH---HPTATT-----TSGAAATTGPSPTNGLLPSQHQHHNNNNN 52

Query: 1406 XXXXXXXXXALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXX 1227
                      +YP SVA  A +   E  +KKRGRPRKY TPE                  
Sbjct: 53   NDGGGGGGGMVYPHSVASSAMTSTLEPAKKKRGRPRKYGTPEQALAAKKTAAYSNSKGKR 112

Query: 1226 XXXXXXXSRKRDQQLIGXXXXXXXXXXXA---QLGSEANNGQCFMPHIINVQPGEDVAQK 1056
                     ++ QQL+G               QLG   N GQ F PH+I+V  GEDV QK
Sbjct: 113  EQREL---HQQQQQLLGSGGSGSSYSGAPGKSQLGGIGNLGQGFTPHVISVAAGEDVGQK 169

Query: 1055 IRIFVQQSRRELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERS 876
            I +F+QQS+RE+CILSASGS+ N SL QPATSGG+ITY+G F+I+SLSGS++ TD G R+
Sbjct: 170  IMLFMQQSKREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLSGSYVRTDLGGRT 229

Query: 875  GGLSACLSASNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGD-IXXXXXXX 699
            GGLS CLS+++GQI+GGG+ GPL A GPV+VI+ TF +++ KD +AG KGD         
Sbjct: 230  GGLSVCLSSTDGQIIGGGVGGPLKAAGPVQVIVGTFQVESMKDVSAGLKGDSSGSKLASP 289

Query: 698  XXXXPVANMGYRPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDT 528
                 V+++G+R  IE   R  V G DD Q +GG  FMIQ  G  +  +  +DWR   DT
Sbjct: 290  VAGASVSSVGFRSPIESYGRNPVRGNDDFQTIGGTHFMIQPYGNHVSPTQAADWRGSLDT 349

Query: 527  RGGFNYDFTGRTGHGTHES-ENGDFEHLPD 441
            R    YD TGRTG G ++S ENGD++ + D
Sbjct: 350  RSSAGYDMTGRTGRGGNQSPENGDYDQIAD 379


>gb|EOY10989.1| AT hook motif DNA-binding family protein [Theobroma cacao]
          Length = 349

 Score =  266 bits (680), Expect = 2e-68
 Identities = 164/380 (43%), Positives = 217/380 (57%), Gaps = 9/380 (2%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP+ET         QH        + T+ V T+ S  TNG   P+             +
Sbjct: 1    MEPNET--------QQHY----FTTNTTTTVTTTPSP-TNGLLPPSESGGSHHM-----V 42

Query: 1373 YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
            YP  +     SP  E  R+KRGRPRKY TPE                          R++
Sbjct: 43   YPHPMPSAVTSP-LEPARRKRGRPRKYGTPEQALAAKKTASSSSKER----------REQ 91

Query: 1193 DQQ-----LIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSR 1029
             QQ     L G           +QL +  N GQ F PH+INV  GEDV QKI +F+QQS+
Sbjct: 92   QQQQHQLALGGGGASLSGLSKKSQLVALGNAGQGFTPHVINVVAGEDVGQKIMMFMQQSK 151

Query: 1028 RELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSA 849
            RE+CILSASG++ N SL QPATSGG+ITY+G F+I+SLSGS++ T+ G R+GGLS CLS+
Sbjct: 152  REICILSASGTISNASLRQPATSGGNITYEGRFEIISLSGSYVRTETGGRTGGLSVCLSS 211

Query: 848  SNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDI-XXXXXXXXXXXPVANM 672
            ++GQI+GGG+ GPL A GPV+VI+ TF+ID KKD +AG KGD              V+N+
Sbjct: 212  ADGQIIGGGIGGPLKAAGPVQVIVGTFVIDNKKDVSAGAKGDASGSKLPSPVGGTSVSNV 271

Query: 671  GYRPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHSDWRTGPDTRGGFNYDFTG 498
            G+R   E   R  + G DDHQ+ GG  FM+Q +G+ +    S+WR+G D R GF  + TG
Sbjct: 272  GFRSAFETSGRNPIGGNDDHQSFGGSHFMMQPRGMHVAPRPSEWRSGLDDRTGF--ELTG 329

Query: 497  RTGHGTHES-ENGDFEHLPD 441
            +TGHG H+S ENGD++ + D
Sbjct: 330  KTGHGAHQSPENGDYDQIAD 349


>ref|XP_006442721.1| hypothetical protein CICLE_v10020621mg [Citrus clementina]
            gi|557544983|gb|ESR55961.1| hypothetical protein
            CICLE_v10020621mg [Citrus clementina]
          Length = 377

 Score =  265 bits (678), Expect = 3e-68
 Identities = 164/386 (42%), Positives = 215/386 (55%), Gaps = 15/386 (3%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLP-------NHXXXXXX 1395
            MEP++T      + + H P +A     TSG   +       G LP       N+      
Sbjct: 1    MEPNDTQQLQQLNSYFHHPTAA----TTSGAAATTGPSPTNGLLPSQHQHHNNNNNNDGG 56

Query: 1394 XXXXXALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXX 1215
                  +YP SVA  A +   E  +KKRGRPRKY TPE                      
Sbjct: 57   GGGGGMVYPHSVASSAMTSTLEPAKKKRGRPRKYGTPE---QALAAKKTAAYSNSKGKRE 113

Query: 1214 XXXSRKRDQQLI---GXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIF 1044
                 ++ QQL+   G           +QLG   N GQ F PH+I+V  GEDV QKI +F
Sbjct: 114  QRELHQQQQQLLGSGGSGYSYSGAPGKSQLG--GNLGQGFTPHVISVAAGEDVGQKIMLF 171

Query: 1043 VQQSRRELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLS 864
            +QQSRRE+CILSASGS+ N SL QPATSGG+ITY+G F+I+SLSGS++ TD G R+GGLS
Sbjct: 172  MQQSRREICILSASGSISNASLRQPATSGGNITYEGRFEIVSLSGSYVRTDLGGRTGGLS 231

Query: 863  ACLSASNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGD-IXXXXXXXXXXX 687
             CLS+++GQI+GGG+ GPL A GPV+VI+ TF +++ KD +AG KGD             
Sbjct: 232  VCLSSTDGQIIGGGVGGPLKAAGPVQVIVGTFQVESMKDVSAGLKGDSSGSKLASPVAGA 291

Query: 686  PVANMGYRPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGF 516
             V+++G+R  IE   R  V G DD Q +GG  FMIQ  G  +  +  +DWR   DTR   
Sbjct: 292  SVSSVGFRSPIESYGRNPVRGNDDFQTIGGTHFMIQPYGNHVSPTQAADWRGSLDTRSSA 351

Query: 515  NYDFTGRTGHGTHES-ENGDFEHLPD 441
             YD TGRTG G ++S ENGD++ + D
Sbjct: 352  GYDMTGRTGRGGNQSPENGDYDQIAD 377


>ref|XP_003521778.1| PREDICTED: putative DNA-binding protein ESCAROLA-like isoform X1
            [Glycine max]
          Length = 346

 Score =  261 bits (666), Expect = 8e-67
 Identities = 163/377 (43%), Positives = 209/377 (55%), Gaps = 6/377 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQH--QHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXX 1380
            MEP++  L  ++H H  QH       P  T+  PT+       G LPN            
Sbjct: 1    MEPNDNQLTSFFHHHHQQHQHHQPPPPPQTTASPTN-------GLLPN-------ADGSH 46

Query: 1379 ALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSR 1200
             LYP SVA  A S   E  ++KRGRPRKY TPE                           
Sbjct: 47   ILYPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKAATTLSHSFSV--------- 96

Query: 1199 KRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRREL 1020
              D++                LG   N GQ F PH+I+V  GEDV QKI +F+QQSRRE+
Sbjct: 97   --DKKPHSPTFPSSKKSHSFALG---NAGQGFTPHVISVAAGEDVGQKIMLFMQQSRREM 151

Query: 1019 CILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNG 840
            CILSASGS+ N SL QPATSGGSI Y+G F+I+SL+GS++  + G R+GGLS CLS ++G
Sbjct: 152  CILSASGSISNASLRQPATSGGSIAYEGRFEIISLTGSYVRNELGTRTGGLSVCLSNTDG 211

Query: 839  QIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRP 660
            QI+GGG+ GPL A GPV+VI+ TF ID KKD  AG KGDI           PV+++G+R 
Sbjct: 212  QIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDTGAGVKGDISASKLPSPVGEPVSSLGFRQ 271

Query: 659  VIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHS-DWRTGPDTRGGFNYDFTGRTG 489
             ++ P+   + G D+HQ MGG  FMIQ  G+      S DW   PD+R    ++ TGR G
Sbjct: 272  SVDSPSGNPIRGNDEHQAMGGSHFMIQQLGLHGTPPRSTDW-GHPDSR-NTGFELTGRIG 329

Query: 488  HGTHES-ENGDFEHLPD 441
            HG H+S ENG +E +PD
Sbjct: 330  HGAHQSPENGGYEQIPD 346


>ref|XP_003554726.1| PREDICTED: putative DNA-binding protein ESCAROLA-like isoform X1
            [Glycine max]
          Length = 356

 Score =  255 bits (652), Expect = 3e-65
 Identities = 157/375 (41%), Positives = 202/375 (53%), Gaps = 4/375 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP +  L  ++H HQ       + H     P   +     G LPN             L
Sbjct: 1    MEPIDNHLTSFFHHHQQQQQHHQHQHQHPPPPPPTTASPTNGLLPN-------ADGSHML 53

Query: 1373 YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
            YP SVA  A S   E  ++KRGRPRKY TPE                          +  
Sbjct: 54   YPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKAATTSSQSFSADK------KPH 106

Query: 1193 DQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELCI 1014
                               LG   N GQ F PH+I+V  GEDV QKI +F+QQSRRE+CI
Sbjct: 107  SPTFPSSSFTSSKKSLSFALG---NAGQGFTPHVISVAAGEDVGQKIMLFMQQSRREMCI 163

Query: 1013 LSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQI 834
            LSASGS+ N SL QPATSGGSITY+G F+I+SL+GS++  + G R+GGLS CLS ++GQI
Sbjct: 164  LSASGSISNASLRQPATSGGSITYEGRFEIISLTGSYVRNELGTRTGGLSVCLSNTDGQI 223

Query: 833  VGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRPVI 654
            +GGG+ GPL A GPV+VI+ TF ID KKD  AG KGD            PV+++G+R  +
Sbjct: 224  IGGGVGGPLKAAGPVQVIVGTFFIDNKKDNGAGLKGDASASKLPSPVSEPVSSLGFRQSV 283

Query: 653  EQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHS-DWRTGPDTRGGFNYDFTGRTGHG 483
            +  +   + G D+HQ M G  FMIQ  G+      S DW   PD+R    ++ TGRTGHG
Sbjct: 284  DSSSGNPIRGNDEHQAMDGSHFMIQQLGLHGTPPRSTDWGR-PDSR-NTGFELTGRTGHG 341

Query: 482  THES-ENGDFEHLPD 441
             H+S ENG ++ +PD
Sbjct: 342  AHQSPENGGYDQIPD 356


>ref|XP_006577325.1| PREDICTED: putative DNA-binding protein ESCAROLA-like isoform X2
            [Glycine max]
          Length = 343

 Score =  254 bits (648), Expect = 1e-64
 Identities = 163/378 (43%), Positives = 207/378 (54%), Gaps = 7/378 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQH--QHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXX 1380
            MEP++  L  ++H H  QH       P  T+  PT+       G LPN            
Sbjct: 1    MEPNDNQLTSFFHHHHQQHQHHQPPPPPQTTASPTN-------GLLPN-------ADGSH 46

Query: 1379 ALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSR 1200
             LYP SVA  A S   E  ++KRGRPRKY TPE                           
Sbjct: 47   ILYPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKAATTLSHSFSV--------- 96

Query: 1199 KRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRREL 1020
              D++                LG   N GQ F PH+I+V  GEDV QKI +F+QQSRRE+
Sbjct: 97   --DKKPHSPTFPSSKKSHSFALG---NAGQGFTPHVISVAAGEDVGQKIMLFMQQSRREM 151

Query: 1019 CILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNG 840
            CILSASGS+ N SL QPATSGGSI Y+G F+I+SL+GS++  + G R+GGLS CLS ++G
Sbjct: 152  CILSASGSISNASLRQPATSGGSIAYEGRFEIISLTGSYVRNELGTRTGGLSVCLSNTDG 211

Query: 839  QIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRP 660
            QI+GGG+ GPL A GPV+VI+ TF ID KKD  AG KGDI           PV+++G+R 
Sbjct: 212  QIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDTGAGVKGDISASKLPSPVGEPVSSLGFRQ 271

Query: 659  VIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHS-DWRTGPDTRG-GFNYDFTGRT 492
             ++ P+   + G D+HQ MGG  FMIQ  G+      S DW   PD+R  GF       T
Sbjct: 272  SVDSPSGNPIRGNDEHQAMGGSHFMIQQLGLHGTPPRSTDW-GHPDSRNTGFEL-----T 325

Query: 491  GHGTHES-ENGDFEHLPD 441
            GHG H+S ENG +E +PD
Sbjct: 326  GHGAHQSPENGGYEQIPD 343


>ref|XP_004494753.1| PREDICTED: putative DNA-binding protein ESCAROLA-like [Cicer
            arietinum]
          Length = 367

 Score =  253 bits (646), Expect = 2e-64
 Identities = 154/376 (40%), Positives = 203/376 (53%), Gaps = 5/376 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHAT-SGVPTSVSQVTNGGFLPNHXXXXXXXXXXXA 1377
            MEP++  L+ ++H H H      +      G  T+V   T     P +            
Sbjct: 1    MEPNDNQLSSFFHHHNHHHQQQHHQQQQPQGNSTTVVSATTTTAAPTNGLLSNTDGSHI- 59

Query: 1376 LYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRK 1197
            LYP SVA  A S   E  ++KRGRPRKY TPE                           K
Sbjct: 60   LYPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKASTSSFSSTPAGADNSS---K 115

Query: 1196 RDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELC 1017
                                LG   N GQ F  H+I V  GEDV QKI  F+QQ R E+C
Sbjct: 116  NTTHSFSPSSFSSKKSHSLSLG---NAGQGFSAHVIAVAAGEDVGQKIMQFMQQHRGEIC 172

Query: 1016 ILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQ 837
            ILSASGS+ N SL QPA+SGG+ITY+G FDI+SL+GS++  + G RSGGLS CLS S+GQ
Sbjct: 173  ILSASGSISNASLRQPASSGGNITYEGRFDIISLTGSYVRNETGGRSGGLSVCLSNSDGQ 232

Query: 836  IVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRPV 657
            I+GGG+ GPL A GPV+VI+ TF IDT+KD +AG KGD              +N+G+R  
Sbjct: 233  IIGGGVGGPLKAAGPVQVIVGTFFIDTQKDTSAGIKGDASTSKLPSQVGESASNLGFRQA 292

Query: 656  IEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGFNYDFTGRTGH 486
            ++  +   + G D+HQ MGG  FMIQ  G+ +     +DW + PD+R    YD +GRTGH
Sbjct: 293  VDCSSGNPIRGNDEHQAMGGSHFMIQQLGLHVTPPRPTDWGSHPDSR-NVGYDLSGRTGH 351

Query: 485  GTHES-ENGDFEHLPD 441
            G+H+S +NG ++ +PD
Sbjct: 352  GSHQSPDNGGYDQIPD 367


>ref|XP_003626475.1| hypothetical protein MTR_7g116320 [Medicago truncatula]
            gi|355501490|gb|AES82693.1| hypothetical protein
            MTR_7g116320 [Medicago truncatula]
          Length = 367

 Score =  252 bits (644), Expect = 3e-64
 Identities = 152/379 (40%), Positives = 207/379 (54%), Gaps = 8/379 (2%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQH----QHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXX 1386
            M+ ++  L+ ++H H    Q       +   ++ V T+ +  TNG  LPN          
Sbjct: 1    MDSNDNQLSSFFHHHNQQQQQQQQQQHHQQNSTTVTTATASPTNG-LLPN-------TDG 52

Query: 1385 XXALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXX 1206
               LYP SVA  A S   E  ++KRGRPRKY TPE                         
Sbjct: 53   SHILYPHSVASSAVSSQLEPAKRKRGRPRKYGTPEQALAAKKASTSSFSPTPPTLDTTTN 112

Query: 1205 SRKRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRR 1026
            ++                     LG   N GQ F  H+I V  GEDV QKI  F+QQ R 
Sbjct: 113  NKNTHSFSPSSSSFTTKKSHSLSLG---NAGQGFSAHVIAVAAGEDVGQKIMQFMQQHRG 169

Query: 1025 ELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSAS 846
            E+CI+SASGS+ N SL QPA+SGG+I Y+G FDI+SL+GS++  + G RSGGLS CLS S
Sbjct: 170  EICIMSASGSISNASLRQPASSGGNIMYEGRFDIISLTGSYVRNETGGRSGGLSVCLSNS 229

Query: 845  NGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGY 666
            +GQI+GGG+ GPL A GPV+VI+ TF ID KKD +AG KGD            P +++G+
Sbjct: 230  DGQIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDTSAGGKGDPSAGKLPSPVGEPASSLGF 289

Query: 665  RPVIEQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSH-SDWRTGPDTRGGFNYDFTGR 495
            R  ++  +   + G D+HQ MGG  +MIQ  G+ +     ++W T PD+R    YD +GR
Sbjct: 290  RQTVDSSSGNPIRGNDEHQAMGGSHYMIQQLGLHVTPPRTTEWGTHPDSRHA-GYDLSGR 348

Query: 494  TGHGTHES-ENGDFEHLPD 441
            TGHG+H+S ENG ++ +PD
Sbjct: 349  TGHGSHQSPENGGYDQIPD 367


>ref|XP_006604863.1| PREDICTED: putative DNA-binding protein ESCAROLA-like isoform X2
            [Glycine max]
          Length = 361

 Score =  250 bits (639), Expect = 1e-63
 Identities = 158/379 (41%), Positives = 202/379 (53%), Gaps = 8/379 (2%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL 1374
            MEP +  L  ++H HQ       + H     P   +     G LPN             L
Sbjct: 1    MEPIDNHLTSFFHHHQQQQQHHQHQHQHPPPPPPTTASPTNGLLPN-------ADGSHML 53

Query: 1373 YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
            YP SVA  A S   E  ++KRGRPRKY TPE                          +  
Sbjct: 54   YPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKAATTSSQSFSADK------KPH 106

Query: 1193 DQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELCI 1014
                               LG   N GQ F PH+I+V  GEDV QKI +F+QQSRRE+CI
Sbjct: 107  SPTFPSSSFTSSKKSLSFALG---NAGQGFTPHVISVAAGEDVGQKIMLFMQQSRREMCI 163

Query: 1013 LSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQI 834
            LSASGS+ N SL QPATSGGSITY+G F+I+SL+GS++  + G R+GGLS CLS ++GQI
Sbjct: 164  LSASGSISNASLRQPATSGGSITYEGRFEIISLTGSYVRNELGTRTGGLSVCLSNTDGQI 223

Query: 833  VGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYRPVI 654
            +GGG+ GPL A GPV+VI+ TF ID KKD  AG KGD            PV+++G+R  +
Sbjct: 224  IGGGVGGPLKAAGPVQVIVGTFFIDNKKDNGAGLKGDASASKLPSPVSEPVSSLGFRQSV 283

Query: 653  EQPARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHS-DWRTGPDTRG-GF---NYDFTGR 495
            +  +   + G D+HQ M G  FMIQ  G+      S DW   PD+R  GF    +   GR
Sbjct: 284  DSSSGNPIRGNDEHQAMDGSHFMIQQLGLHGTPPRSTDWGR-PDSRNTGFELTGFLSAGR 342

Query: 494  TGHGTHES-ENGDFEHLPD 441
            TGHG H+S ENG ++ +PD
Sbjct: 343  TGHGAHQSPENGGYDQIPD 361


>gb|ESW19221.1| hypothetical protein PHAVU_006G106500g [Phaseolus vulgaris]
          Length = 358

 Score =  246 bits (628), Expect = 2e-62
 Identities = 158/379 (41%), Positives = 206/379 (54%), Gaps = 8/379 (2%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVT---NGGFLPNHXXXXXXXXXX 1383
            MEP++  L  ++H H H P    + H     P + +  T     G LPN           
Sbjct: 1    MEPNDNQLTSFFHHHHHHPHHH-HHHQPQPPPQTAATTTASPTNGLLPN-------ADGS 52

Query: 1382 XALYPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXS 1203
              LYP SVA  A S   E  ++KRGRPRKY TPE                          
Sbjct: 53   HMLYPHSVAS-AVSSQLEPAKRKRGRPRKYGTPEQALAAKKASTASSHSFSADK------ 105

Query: 1202 RKRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRE 1023
            +                     LG   N GQ F PH+I V  GEDV QKI +F+QQSRRE
Sbjct: 106  KPNSPTFPSSSSFTSKKSHSFALG---NAGQGFTPHVIAVAAGEDVGQKIMLFMQQSRRE 162

Query: 1022 LCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASN 843
            +CILSASGS+ N SL QPATSGG+ITY+G F+I+SL+GS++  + G R+GGLS CLS ++
Sbjct: 163  MCILSASGSISNASLRQPATSGGNITYEGRFEIISLTGSYVRNELGTRTGGLSVCLSNTD 222

Query: 842  GQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYR 663
            GQI+GGG+ GPL A GPV+VI+ TF ID KKD++      +           PV+++G+R
Sbjct: 223  GQIIGGGVGGPLKAAGPVQVIVGTFFIDNKKDSSPKVDASV-SKLPPPPVGEPVSSLGFR 281

Query: 662  PVIEQ-PARYTVPGVDDHQNMGG--FMIQHQGIQMGSSHS-DWRTGPDTRGGFNYDFTGR 495
              +E  P    + G D+HQ MGG  FMIQ  G+Q     S DW    D+R   +++ TGR
Sbjct: 282  QSVESPPGGNPIRGNDEHQAMGGSHFMIQQLGLQGTPPRSTDW-ARRDSRNS-SFELTGR 339

Query: 494  TGHGTHES-ENGDFEHLPD 441
            TGHGTH+S ENG +E +PD
Sbjct: 340  TGHGTHQSPENGGYEQIPD 358


>ref|XP_004139392.1| PREDICTED: uncharacterized protein LOC101221844 [Cucumis sativus]
            gi|449520142|ref|XP_004167093.1| PREDICTED:
            uncharacterized protein LOC101229030 [Cucumis sativus]
          Length = 362

 Score =  243 bits (621), Expect = 1e-61
 Identities = 154/378 (40%), Positives = 200/378 (52%), Gaps = 7/378 (1%)
 Frame = -3

Query: 1553 MEPHETGLNPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXA- 1377
            MEP+E  L+ Y+H HQH        H T   PT+ S  TNG   P H             
Sbjct: 1    MEPNENQLSSYFHHHQH-------HHQT---PTTTSP-TNGLLPPTHHLSAAAASSDAGP 49

Query: 1376 --LYPQSVAGKA-PSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXX 1206
              +YP SV   A  S P E  R+KRGRPRKY TPE                         
Sbjct: 50   HVVYPHSVPSAAVSSSPLEPARRKRGRPRKYGTPEEALAAKKAATASSHSSSSKAKKELA 109

Query: 1205 SRKRDQQLIGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRR 1026
            S       +            +QL +  N GQ F PH+INV  GEDV QKI  F+QQ +R
Sbjct: 110  SSS-SLNAVSASSSFSTPSKKSQLAALGNAGQGFAPHVINVAAGEDVGQKIMQFMQQCKR 168

Query: 1025 ELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSAS 846
            E+CILSASGS+ N SL QPA SGG+I Y+G F+I+SL GS++ TD G ++GGLS CLS++
Sbjct: 169  EICILSASGSISNASLRQPAASGGNIAYEGRFEIVSLCGSYVRTDLGGKTGGLSVCLSSA 228

Query: 845  NGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGY 666
             G I+GGG+ GPL A GPV+VI+ TF+ID KK+   G                 ++N+ Y
Sbjct: 229  EGHIIGGGVGGPLKAAGPVQVIVGTFVIDPKKEFGGGKGDGSAVKLPSPIGGTSMSNLRY 288

Query: 665  RPVIEQPARYTVPGVDDHQNMG--GFMIQHQGIQMGSSHS-DWRTGPDTRGGFNYDFTGR 495
               I+      + G D+HQ +G   F++Q +G+ + S  S DWRTG D      YD +GR
Sbjct: 289  GSNIDSGGN-QIRGNDEHQGLGESHFLLQPRGVNLTSPRSTDWRTGLDAT-NTAYDLSGR 346

Query: 494  TGHGTHESENGDFEHLPD 441
            TGH  H  ENGD++ +PD
Sbjct: 347  TGH--HSPENGDYDQIPD 362


>ref|NP_187109.2| AT hook motif DNA-binding family protein [Arabidopsis thaliana]
            gi|119935918|gb|ABM06034.1| At3g04590 [Arabidopsis
            thaliana] gi|225898615|dbj|BAH30438.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332640581|gb|AEE74102.1| AT hook motif DNA-binding
            family protein [Arabidopsis thaliana]
          Length = 411

 Score =  243 bits (619), Expect = 2e-61
 Identities = 162/393 (41%), Positives = 212/393 (53%), Gaps = 30/393 (7%)
 Frame = -3

Query: 1529 NPYYHQ----HQHLPASALNPHAT-SGVPTSVSQVTNGGFLPNHXXXXXXXXXXXAL--Y 1371
            +PY+H     H HLP +     +T + VP+S     NG F P             +L  Y
Sbjct: 33   SPYFHHQLQHHHHLPTTVATTASTGNAVPSS----NNGLFPPQPQPQHQPNDGSSSLAVY 88

Query: 1370 PQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKRD 1191
            P SV   A + P E V++KRGRPRKY+TPE                          ++R+
Sbjct: 89   PHSVPSSAVTAPMEPVKRKRGRPRKYVTPEQALAAKKLASSASSSSAK--------QRRE 140

Query: 1190 QQLI--GXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELC 1017
               +  G           +QLGS    GQCF PHI+N+ PGEDV QKI +F  QS+ ELC
Sbjct: 141  LAAVTGGTVSTNSGSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVVQKIMMFANQSKHELC 200

Query: 1016 ILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQ 837
            +LSASG++ N SL QPA SGG++ Y+G ++ILSLSGS+I T+ G +SGGLS  LSAS+GQ
Sbjct: 201  VLSASGTISNASLRQPAPSGGNLPYEGQYEILSLSGSYIRTEQGGKSGGLSVSLSASDGQ 260

Query: 836  IVGGGLSGPLIALGPVEVIIATFLIDTKKDAT-AGTKGDI---XXXXXXXXXXXPVANMG 669
            I+GG +   L A GPV+VI+ TF +D KKDA  +G KGD                +  MG
Sbjct: 261  IIGGAIGSHLTAAGPVQVILGTFQLDRKKDAAGSGGKGDASNSGSRLTSPVSSGQLLGMG 320

Query: 668  YRPVIEQPARYTVPGVDD------HQ-NMGG---FMIQ-HQGIQMGSSH-SDWR----TG 537
            + P +E   R  + G D+      HQ  +GG   FM+Q  QGI M  S  S+WR    +G
Sbjct: 321  FPPGMESTGRNPMRGNDEQHDHHHHQAGLGGPHHFMMQAPQGIHMTHSRPSEWRGGGNSG 380

Query: 536  PDTRGGFNYDFTGRTGHGTHESENGDFE-HLPD 441
             D RGG  YD +GR GH    SENGD+E  +PD
Sbjct: 381  HDGRGGGGYDLSGRIGH--ESSENGDYEQQIPD 411


>ref|XP_006297823.1| hypothetical protein CARUB_v10013864mg [Capsella rubella]
            gi|482566532|gb|EOA30721.1| hypothetical protein
            CARUB_v10013864mg [Capsella rubella]
          Length = 402

 Score =  240 bits (613), Expect = 1e-60
 Identities = 158/386 (40%), Positives = 211/386 (54%), Gaps = 23/386 (5%)
 Frame = -3

Query: 1529 NPYYH---QHQHLPASALNPHAT-SGVPTSVSQVTNGGFLPNHXXXXXXXXXXXALYPQS 1362
            +PY+H   QH H P +     +T + VP+S     NG F P             A+YP S
Sbjct: 32   SPYFHHQLQHHHYPTAVATSTSTGNAVPSS----NNGLFPPQ--PQPNDGSSSIAVYPHS 85

Query: 1361 VAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKRDQQL 1182
            V   A + P E +++KRGRPRKY+TPE                          R+     
Sbjct: 86   VPSSAVTAPMEPLKRKRGRPRKYVTPEQALAAKKMASSASSSAKER-------RELAAIA 138

Query: 1181 IGXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRELCILSAS 1002
             G           +QLGS    GQ F+PHI+N+ PGEDVAQKI IF  QS+ ELC+LSAS
Sbjct: 139  AGTAPSKSGSSKKSQLGSVGKTGQSFIPHIVNIAPGEDVAQKILIFANQSKHELCVLSAS 198

Query: 1001 GSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASNGQIVGGG 822
            G++ N SL QPA+SGG+++Y+G ++ILSLSGS+I T+ G ++GGLSA LS S+GQI+GG 
Sbjct: 199  GTISNASLRQPASSGGNVSYEGQYEILSLSGSYIRTEQGGKTGGLSASLSGSDGQIIGGA 258

Query: 821  LSGPLIALGPVEVIIATFLIDTKKDAT-AGTKGDI---XXXXXXXXXXXPVANMGYRPVI 654
            +   L A GPV+VI+ TF  D KKDA  +G KGD               P+ +MG+RP +
Sbjct: 259  IGTHLTAAGPVQVILGTFQFDRKKDAAGSGVKGDASNSGNQLTSPASTGPILDMGFRPGM 318

Query: 653  EQPARYTVPGVDD--HQNMGG------FMIQ-HQGIQMGSSH-SDW----RTGPDTRGGF 516
            E   R  + G D+  H +  G      FM+Q  QG+ M  +  S+W     +G D RGG 
Sbjct: 319  ESTGRNPMRGHDEQHHHHQTGLSGSHHFMMQAPQGMHMTHTRPSEWGRGGNSGHDGRGGG 378

Query: 515  NYDFTGRTGHGTHESENGDFE-HLPD 441
             YD +GR GH    SENGD+E  +PD
Sbjct: 379  GYDLSGRLGH--ESSENGDYEQQIPD 402


>ref|XP_002884453.1| hypothetical protein ARALYDRAFT_477717 [Arabidopsis lyrata subsp.
            lyrata] gi|297330293|gb|EFH60712.1| hypothetical protein
            ARALYDRAFT_477717 [Arabidopsis lyrata subsp. lyrata]
          Length = 408

 Score =  237 bits (605), Expect = 9e-60
 Identities = 159/401 (39%), Positives = 217/401 (54%), Gaps = 29/401 (7%)
 Frame = -3

Query: 1556 RMEPHETGLN-PYYH---QHQHLPASALNPHAT-SGVPTSVSQVTNGGFLPNHXXXXXXX 1392
            + + H+  L+ PY+H   QH H P +     +T + VP+S     NG F P         
Sbjct: 22   QQQQHQQRLSSPYFHHQLQHHHHPTTVATTASTGNAVPSS----NNGLFPPQPQPQHQPN 77

Query: 1391 XXXXAL--YPQSVAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXX 1218
                +L  YP SV   A + P E +++KRGRPRKY+TPE                     
Sbjct: 78   DGSSSLAVYPHSVPSSAVTAPMEPLKRKRGRPRKYVTPEQALAAKKMASSASSSSAK--- 134

Query: 1217 XXXXSRKRDQQLI--GXXXXXXXXXXXAQLGSEANNGQCFMPHIINVQPGEDVAQKIRIF 1044
                  +R+   +  G           +QLGS    GQCF PHI+N+ PGEDVAQKI IF
Sbjct: 135  -----ERRELAAVTGGTVSTNSGSSKKSQLGSVGKTGQCFTPHIVNIAPGEDVAQKIMIF 189

Query: 1043 VQQSRRELCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLS 864
              QS+ ELC+LSASG++ N SL QPAT+G ++ ++G ++ILSLSGS+I T+ G ++GGLS
Sbjct: 190  ANQSKHELCVLSASGTISNASLRQPATAGVNLPHEGQYEILSLSGSYIRTEQGGKTGGLS 249

Query: 863  ACLSASNGQIVGGGLSGPLIALGPVEVIIATFLIDTKKDAT-AGTKGDIXXXXXXXXXXX 687
            A LSAS+GQI+GG +   L A GPV+VI+ TF +D KKDA  +G KGD            
Sbjct: 250  ASLSASDGQIIGGAIGTHLTAAGPVQVILGTFQLDRKKDAAGSGGKGDASNSGSRLTSPA 309

Query: 686  PVANM---GYRPVIEQPARYTVPGVDDHQN------MGG---FMIQ-HQGIQMGSSH-SD 549
                +   G+ P +E   R  + G D+ Q+      +GG   FM+Q  QG+ M  S  ++
Sbjct: 310  STGQLLGIGFPPGMESTGRNPMRGNDEQQHHHHQPGLGGPHHFMMQAPQGMHMTHSRPAE 369

Query: 548  WR----TGPDTRGGFNYDFTGRTGHGTHESENGDFE-HLPD 441
            WR    +G D RGG  YD +GR GH    SENGD+E  +PD
Sbjct: 370  WRGGGNSGLDGRGGGGYDLSGRIGH--ESSENGDYEQQIPD 408


>ref|XP_002332023.1| predicted protein [Populus trichocarpa]
            gi|566224869|ref|XP_006370969.1| DNA-binding family
            protein [Populus trichocarpa] gi|550316552|gb|ERP48766.1|
            DNA-binding family protein [Populus trichocarpa]
          Length = 365

 Score =  237 bits (605), Expect = 9e-60
 Identities = 150/375 (40%), Positives = 194/375 (51%), Gaps = 16/375 (4%)
 Frame = -3

Query: 1529 NPYYHQHQHLPASALNPHATSGVPTSVSQVTNGGFLPNHXXXXXXXXXXXALYPQS---- 1362
            +P   QH H  +   +   T+  P+      NG   P+H            LYP S    
Sbjct: 5    DPRQQQHHHFTSYFSSTPTTTNTPSP----PNGLLPPHHPTDSTTPTGSHLLYPHSMGPS 60

Query: 1361 ----VAGKAPSPPSEIVRKKRGRPRKYLTPEXXXXXXXXXXXXXXXXXXXXXXXXXSRKR 1194
                V G      +   ++KRGRPRKY TPE                           ++
Sbjct: 61   TTATVTGGGAPVEATSAKRKRGRPRKYGTPELALAAKKTATSASVAASR--------ERK 112

Query: 1193 DQQLIGXXXXXXXXXXXAQLGSE---ANNGQCFMPHIINVQPGEDVAQKIRIFVQQSRRE 1023
            +Q   G           +   S+      G  F PH+I V  GEDV QKI  F+QQS RE
Sbjct: 113  EQHQAGSSSTTSSFSGSSSKKSQHVLGTAGHGFTPHVITVAAGEDVGQKIIQFLQQSTRE 172

Query: 1022 LCILSASGSVCNVSLLQPATSGGSITYDGSFDILSLSGSFIHTDHGERSGGLSACLSASN 843
            +CILSASGSV NVSL QPATSGG+I+Y+G F+I+SLSGS+I TD G R+GGLS CLS SN
Sbjct: 173  MCILSASGSVMNVSLRQPATSGGNISYEGRFEIISLSGSYIRTDMGGRAGGLSVCLSDSN 232

Query: 842  GQIVGGGLSGPLIALGPVEVIIATFLIDTKKDATAGTKGDIXXXXXXXXXXXPVANMGYR 663
            GQI+GGG+ GPL A GPV+VI+ TF++D KKD +   KGD             V + G+R
Sbjct: 233  GQIIGGGVGGPLKAAGPVQVIVGTFVLDNKKDGSG--KGDASGSKLPSPVKASVPSFGFR 290

Query: 662  PVIEQPARYTVPGVDDHQNMGG---FMIQHQGIQMGSSHS-DWRTGPDTRGGFNYDFTGR 495
              +E   R    G DD   +GG   F +Q   + + S+ + DWR+ PD R    YDFTGR
Sbjct: 291  LPVESSVRNPARGNDDLLTVGGGNPFTMQPSTMHLLSARTMDWRSSPDVRTTAGYDFTGR 350

Query: 494  TGHGTHESE-NGDFE 453
            TGHG  +S  NGD++
Sbjct: 351  TGHGGSQSPVNGDYD 365

