BLASTX nr result

ID: Rehmannia25_contig00004156

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00004156
         (894 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004232457.1| PREDICTED: GATA transcription factor 5-like ...   167   6e-39
ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like ...   165   2e-38
gb|EMJ05895.1| hypothetical protein PRUPE_ppa008278mg [Prunus pe...   157   4e-36
gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis]          157   7e-36
ref|XP_002327771.1| predicted protein [Populus trichocarpa] gi|5...   149   1e-33
gb|EPS59023.1| hypothetical protein M569_15789, partial [Genlise...   147   5e-33
gb|EOX90924.1| GATA transcription factor 5, putative [Theobroma ...   145   2e-32
ref|XP_002310287.2| hypothetical protein POPTR_0007s13700g [Popu...   145   2e-32
gb|ESW29946.1| hypothetical protein PHAVU_002G112000g [Phaseolus...   145   3e-32
ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citr...   144   4e-32
ref|NP_001242253.1| uncharacterized protein LOC100783966 [Glycin...   142   1e-31
ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citr...   141   3e-31
ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like ...   137   4e-30
gb|ADL36694.1| GATA domain class transcription factor [Malus dom...   137   5e-30
ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Viti...   137   7e-30
ref|XP_006572850.1| PREDICTED: uncharacterized protein LOC100783...   134   5e-29
ref|XP_002521500.1| conserved hypothetical protein [Ricinus comm...   132   1e-28
gb|EPS57889.1| hypothetical protein M569_16927, partial [Genlise...   132   2e-28
ref|XP_004234546.1| PREDICTED: GATA transcription factor 6-like ...   131   3e-28
emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]   131   3e-28

>ref|XP_004232457.1| PREDICTED: GATA transcription factor 5-like [Solanum lycopersicum]
          Length = 342

 Score =  167 bits (422), Expect = 6e-39
 Identities = 114/287 (39%), Positives = 140/287 (48%), Gaps = 34/287 (11%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAAVM-------DDF----CGVNGGAGDDFFVDELLDFSNGF--- 273
           M+  +  L+N F PET   M       DDF     G NG +GDDFFVD+LLDFSNGF   
Sbjct: 1   MDCAEWALRNSFVPETPLKMTQNQTFGDDFSAAGAGQNGVSGDDFFVDDLLDFSNGFVEG 60

Query: 274 ------SXXXXXXXXXXXXXXXXXXXXQVTPPPE-------KFSLSAGDDFGSLHESELS 414
                                       V+P  +       K ++S  +DF SL  SE+S
Sbjct: 61  EGDEEEEEGKNQGGEGISVQKPCSVSIAVSPLKKTEIDDKGKVTISVNEDFASLPVSEIS 120

Query: 415 FQGEGLESLEWLSHFVEDSFSDYSL---AGKFPLKPMENRSEPAAKVQERPCFTTPVQTK 585
              + L+SLEWLSHFVE+SFS YSL   AGK P++      E   + +++PCF TPVQTK
Sbjct: 121 VPTDDLDSLEWLSHFVEESFSGYSLAYPAGKLPVEKKTGDGEIPVE-EKKPCFATPVQTK 179

Query: 586 ARTKRARTGVRVRPVLSPSFAEPXXXXXXXXXXXXXFL--PNNPWLVHSQ--NDGASLXX 753
           ARTKR R+ VRV PV S S  E                  P   W ++    +   S   
Sbjct: 180 ARTKRGRSSVRVWPVCSGSLTESSSSSTSSSSTTTMSSSPPTGSWFLYPTPVHSAESPGK 239

Query: 754 XXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                        GG G   QPRRC+HCGV KTPQWRAGP+G KTLC
Sbjct: 240 PLAKKLKKKPASHGGNG-PQQPRRCSHCGVQKTPQWRAGPMGAKTLC 285


>ref|XP_006340696.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum]
          Length = 339

 Score =  165 bits (417), Expect = 2e-38
 Identities = 113/284 (39%), Positives = 138/284 (48%), Gaps = 31/284 (10%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAAVM-------DDF----CGVNGGAGDDFFVDELLDFSNGF--- 273
           M+ V+G L+N F PET   M       DD      G NG +GDDFFVD+LLDFSNGF   
Sbjct: 1   MDCVKGALRNSFVPETPLKMTQNQTFGDDLSAAGAGQNGVSGDDFFVDDLLDFSNGFVEG 60

Query: 274 ----SXXXXXXXXXXXXXXXXXXXXQVTP-------PPEKFSLSAGDDFGSLHESELSFQ 420
                                     V+P         +K ++S  +DF SL  SE+S  
Sbjct: 61  EGEEEEGKNQGGEDISVQKPCSVSISVSPLKKTEIDDKDKVTISVKEDFSSLPVSEISVP 120

Query: 421 GEGLESLEWLSHFVEDSFSDYSL---AGKFPLKPMENRSEPAAKVQERPCFTTPVQTKAR 591
            + L+SLEWLSHFVEDSFS YSL   AGK  ++      E   + +++PCF TPVQTKAR
Sbjct: 121 TDDLDSLEWLSHFVEDSFSGYSLAYPAGKLEVEKKTGDGEIPVE-EKKPCFATPVQTKAR 179

Query: 592 TKRARTGVRVRPVLSPSFAE-PXXXXXXXXXXXXXFLPNNPWLVHSQ--NDGASLXXXXX 762
           TKR RT VR  P  S S  +                 P   W ++    +   S      
Sbjct: 180 TKRGRTSVRFWPACSGSLTDSSSSSTSSSSTTTMSSSPTASWFLYPTPVHSAESPGKPLA 239

Query: 763 XXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                     GG G   QPRRC+HCGV KTPQWRAGP+G KTLC
Sbjct: 240 KKLKKKPAPHGGNG-PQQPRRCSHCGVQKTPQWRAGPMGAKTLC 282


>gb|EMJ05895.1| hypothetical protein PRUPE_ppa008278mg [Prunus persica]
          Length = 338

 Score =  157 bits (398), Expect = 4e-36
 Identities = 112/278 (40%), Positives = 135/278 (48%), Gaps = 25/278 (8%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETA------AVMDDFC-----GVNGGAGDDFFVDELLDFSN--GFS 276
           ME V+  LK     E A      AV DD       G NG A DDF VD+LLDFSN  GF 
Sbjct: 1   MECVEAALKTSIRKEMAVKASSQAVFDDLLWGGVNGQNGVACDDFSVDDLLDFSNEDGFV 60

Query: 277 XXXXXXXXXXXXXXXXXXXXQVTPP-PEKFSLSAGDDFGSLHESELSFQGEGLESLEWLS 453
                               Q  P  PE   LS  ++ G    SELS   + LE+LEWLS
Sbjct: 61  ETEAEEDDKDKVKGFASVPPQKQPQDPENSDLSEKNELGPEPTSELSVPADDLENLEWLS 120

Query: 454 HFVEDSFSDYSL---AGKFPLKPM-ENRSEPAAKVQERPCFTTPVQTKARTKRARTGVRV 621
           HFVEDSF++++    AG  P KP  E R +PAA + E+PCF TPV  KAR+KR RTG RV
Sbjct: 121 HFVEDSFTEFTTSLPAGFIPEKPKTEKRPDPAAPLPEKPCFKTPVPAKARSKRTRTGGRV 180

Query: 622 RPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVH--SQN-----DGASLXXXXXXXXXXX 780
             + SPS  E                P++PWL++  +QN      G              
Sbjct: 181 WSLGSPSLTETSSSSSSSSSSSS---PSSPWLIYPTTQNREPAEAGGEPVGSVEKPPKKP 237

Query: 781 XXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                   ++  PRRC+HCGV KTPQWR GP G KTLC
Sbjct: 238 KRRLVDGSSSQPPRRCSHCGVQKTPQWRTGPNGAKTLC 275


>gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis]
          Length = 393

 Score =  157 bits (396), Expect = 7e-36
 Identities = 116/335 (34%), Positives = 146/335 (43%), Gaps = 44/335 (13%)
 Frame = +1

Query: 22  MLYRTHQHHPFLFTFNPFXXXXXXXXXXXXXXXXXIQG------------------MESV 147
           MLYRTH  HPF F F+PF                                      ME V
Sbjct: 1   MLYRTH--HPFFFHFHPFTRSSSSFPSSSSSYSSSSPSSSKPSTTPSPLSTQVETEMECV 58

Query: 148 QGGLKNGFGPETAA------VMDDFCGVNGGAGDDFFVDELLDFSNGFSXXXXXXXXXXX 309
           +  LK  F  E         V DD   VN     DF VD+LL+FS+              
Sbjct: 59  EAALKTSFRKEMGVRQSPHVVFDDLLDVNVQNVVDFSVDDLLNFSDDDGFVVVEEQDQDG 118

Query: 310 XXXXXXXXXQVTPPPEKFSLSAGDD------FGSLHESELSFQGEGLESLEWLSHFVEDS 471
                    +   P E+ +++  ++        S+  +EL+   E LE+LEWLSHFVE+S
Sbjct: 119 DKDLSSPSQEQNQPAEEEAINDNNNPSTSLFVSSVPTTELTLPAEELENLEWLSHFVEES 178

Query: 472 FSDYS---LAGKFPLKPMENRS---EPAAKVQERPCFTTPVQTKARTKRARTGVRVRPVL 633
           FS++S   LAG    KP E+ +   EP     E+PCFTTP+  KAR+KR RTG RV  + 
Sbjct: 179 FSEFSTSYLAGVSAEKPPEDETFLPEPKRFAPEKPCFTTPIPAKARSKRPRTGGRVWSLG 238

Query: 634 SPSFAEPXXXXXXXXXXXXXFLPNNPWLV---HSQNDGASLXXXXXXXXXXXXX-----E 789
           SPSF E                P +PWL+   HS     S+                   
Sbjct: 239 SPSFIESSSSSTTSSSSSSS--PTSPWLIYATHSHEPACSVQKPAPKKAKKRQAVESFGS 296

Query: 790 GGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
           G G  +A  PRRC+HCGV KTPQWR GPLG KTLC
Sbjct: 297 GSGPASAQPPRRCSHCGVQKTPQWRTGPLGAKTLC 331


>ref|XP_002327771.1| predicted protein [Populus trichocarpa]
           gi|566170906|ref|XP_006383142.1| zinc finger family
           protein [Populus trichocarpa]
           gi|550338722|gb|ERP60939.1| zinc finger family protein
           [Populus trichocarpa]
          Length = 333

 Score =  149 bits (377), Expect = 1e-33
 Identities = 106/276 (38%), Positives = 130/276 (47%), Gaps = 23/276 (8%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAA-----VMDDFCGVN---GGAGDDFFVDELLDFSN--GFSXXX 285
           ME V+G LK  F  E A      V+DDF  VN   G + DDF VDELLDFSN  GF    
Sbjct: 1   MERVEGALKTSFRKEMAVKFSPQVLDDFWPVNVTNGMSSDDFSVDELLDFSNENGFIEDE 60

Query: 286 XXXXXXXXXXXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVE 465
                                    +  +  +DF S   SEL    + L SLEWLSHFVE
Sbjct: 61  ENPCVVSVSHKQETLKEDKNNDRSPY-FAVKEDFVSGPTSELCVPTDDLASLEWLSHFVE 119

Query: 466 DSFSDYSLAGKFPLKP----MENRSEPAAKVQERPCFTTPVQTKARTKRARTGVRVRPVL 633
           DS S+Y+      + P     EN +E    V   PCF TPV  KAR+KR RTGVRV P+ 
Sbjct: 120 DSNSEYAAPFPAIVSPPEPEKENFAEQEKSVLTEPCFKTPVPAKARSKRTRTGVRVWPLG 179

Query: 634 SPSFAEPXXXXXXXXXXXXXFLPNNPWLVHSQN---------DGASLXXXXXXXXXXXXX 786
           SP+  E                P++PWL+H++          +   +             
Sbjct: 180 SPTLTESSTSSSSSTSSSS---PSSPWLIHTKPLLNAEPLWFEKPVVKRMKKKPSFHAAA 236

Query: 787 EGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
            GGG G +   RRC+HCG+ KTPQWRAGP G KTLC
Sbjct: 237 SGGG-GGSHSSRRCSHCGIQKTPQWRAGPNGSKTLC 271


>gb|EPS59023.1| hypothetical protein M569_15789, partial [Genlisea aurea]
          Length = 287

 Score =  147 bits (371), Expect = 5e-33
 Identities = 102/248 (41%), Positives = 119/248 (47%), Gaps = 26/248 (10%)
 Frame = +1

Query: 229 DDFFVDELLDFSNGFSXXXXXXXXXXXXXXXXXXXXQV-TPPPEKFSLSA---------- 375
           D+ FVDELLDFS+ FS                       +  P   S+S           
Sbjct: 1   DELFVDELLDFSHEFSEVEERPGKAEEMGEKVGKAAAAASSSPVSVSVSGSGERVDDDDD 60

Query: 376 ------GDD--------FGSLHESELSFQGEGLESLEWLSHFVEDSFSDYSLAGKFPLKP 513
                 G+D         GSL E+     GEGLESLEWLSHFV+DSFS++SL GK P  P
Sbjct: 61  DDDDNEGNDVLLSRKNYLGSLPETGFPAPGEGLESLEWLSHFVDDSFSEFSLTGKLPPNP 120

Query: 514 MENRSEPAAKVQERPCF-TTPVQTKARTKRARTGVRVRPVLSPSFAEPXXXXXXXXXXXX 690
            E +  P      +P F ++ VQTKARTKRARTG+RV PVLSPSFA+             
Sbjct: 121 PEKK--PTEPENSKPGFVSSQVQTKARTKRARTGIRVWPVLSPSFADSTTTSSSSSSSS- 177

Query: 691 XFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAG 870
                        N  AS               GG V    QPRRC+HCGVTKTPQWRAG
Sbjct: 178 --TTTTTTTTTPLNRTASTAKKKQRAAAEDSAAGGAV--HVQPRRCSHCGVTKTPQWRAG 233

Query: 871 PLGPKTLC 894
           P+G KTLC
Sbjct: 234 PMGSKTLC 241


>gb|EOX90924.1| GATA transcription factor 5, putative [Theobroma cacao]
          Length = 389

 Score =  145 bits (367), Expect = 2e-32
 Identities = 113/333 (33%), Positives = 144/333 (43%), Gaps = 42/333 (12%)
 Frame = +1

Query: 22  MLYRTHQHHPFLFTFNPFXXXXXXXXXXXXXXXXX-IQGMESVQGGLKNGFGPETA---- 186
           MLY+TH HHPF F F  F                  +Q ME V+  LK  F  E A    
Sbjct: 1   MLYQTH-HHPFFFHFRSFTSIPPPLPSTTPSLLPSSLQEMECVEAALKTSFRKEMALKSS 59

Query: 187 --AVMDDFC---GVNGGAGDDFFVDELLDFSN--GF----SXXXXXXXXXXXXXXXXXXX 333
             A ++D     G NG + DDF VD+L DF+N  GF                        
Sbjct: 60  PQAFLEDIWLANGQNGVSSDDFSVDDLFDFTNEEGFLEQQQQPQHEEEEEEEDEGAPISS 119

Query: 334 XQVTPPPEKFS--------LSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDSFSDYSL 489
              +P  +K S         +   D+GSL  SEL+   + + +LEWLSHFVEDSFS++S 
Sbjct: 120 SSSSPKRQKLSQEEHLSNDTTTNFDYGSLPTSELAVPADDVANLEWLSHFVEDSFSEHST 179

Query: 490 AGKFPLKPMENRSEPAAKVQERP-------CFTTPVQTKARTKRARTGVRVRP-VLSPSF 645
           A  +P   +    +  A +   P       CF TPV  KAR+KR RTG RV   V SPS 
Sbjct: 180 A--YPTGTLTENPKLQADILAEPEKPVITTCFKTPVPAKARSKRTRTGGRVWSLVASPSL 237

Query: 646 AEPXXXXXXXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXXX----------EGG 795
            E                P++PWL++  +   S                        +  
Sbjct: 238 TESSSSSTSSSSSSS---PSSPWLLYPNSGSGSTFEPSEPLSVEKPPAKKHKKRPATDST 294

Query: 796 GVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
           G       RRC+HCGVTKTPQWRAGP+G KTLC
Sbjct: 295 GGNGTQPTRRCSHCGVTKTPQWRAGPMGAKTLC 327


>ref|XP_002310287.2| hypothetical protein POPTR_0007s13700g [Populus trichocarpa]
           gi|550334822|gb|EEE90737.2| hypothetical protein
           POPTR_0007s13700g [Populus trichocarpa]
          Length = 376

 Score =  145 bits (366), Expect = 2e-32
 Identities = 107/283 (37%), Positives = 131/283 (46%), Gaps = 28/283 (9%)
 Frame = +1

Query: 130 QGMESVQGGLKNGFGPETAA-----VMDDFCGVN---GGAGDDFFVDELLDFSNGFSXXX 285
           Q ME V+G LK  F  E A      V+DDF  VN   G + DDF V++LLDFSN      
Sbjct: 37  QEMECVEGALKTSFRKEMAMKFSPQVLDDFWAVNVPNGMSSDDFSVEKLLDFSNENDFIE 96

Query: 286 XXXXXXXXXXXXXXXXXQVTPPPEKFS----------LSAGDDFGSLHESELSFQGEGLE 435
                             V+P  E              +  DDF S+  SEL    +   
Sbjct: 97  EEEEEGGDKEKPCVFSVSVSPKQEALEEDKNSDSSPGFAVKDDFFSVPTSELCVPTDDFA 156

Query: 436 SLEWLSHFVEDSFSDYSLAGKFPLKPMENRSEPAAK----VQERPCFTTPVQTKARTKRA 603
           SLEWLSHFVEDS S+Y+      + P E + E   +    V E P F TPV  KAR+KR 
Sbjct: 157 SLEWLSHFVEDSNSEYAAPFPTNVSPPEPKKENPVEQEKLVLEEPLFKTPVPGKARSKRT 216

Query: 604 RTGVRVRPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXX 783
           R GVRV P+ SPS  E                P++PWLV+S+     +            
Sbjct: 217 RNGVRVWPLGSPSLTESSSSSSSTSSSS----PSSPWLVYSK-PCLKVEPVWFEKPVAKK 271

Query: 784 XEGGGVGAAAQ------PRRCTHCGVTKTPQWRAGPLGPKTLC 894
            +   V AAA+       RRC+HCGV KTPQWRAGP G KTLC
Sbjct: 272 MKKPAVEAAAKGCGSNSSRRCSHCGVQKTPQWRAGPNGSKTLC 314


>gb|ESW29946.1| hypothetical protein PHAVU_002G112000g [Phaseolus vulgaris]
          Length = 319

 Score =  145 bits (365), Expect = 3e-32
 Identities = 105/270 (38%), Positives = 134/270 (49%), Gaps = 17/270 (6%)
 Frame = +1

Query: 136 MESVQGGLKNGF--------GPETAAVMDDFCGVNGGAGDDFFVDELLDFSNGFSXXXXX 291
           ME ++  LK+ F         PET   M++F   NG   DDFFVD+LLDFS+        
Sbjct: 1   MECMEAALKSNFRKEMTVELSPET--FMEEFSVQNGTTCDDFFVDDLLDFSH----VEEE 54

Query: 292 XXXXXXXXXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDS 471
                          +  P  E ++     D+ S+  +ELS   + +   EWLSHFVE+S
Sbjct: 55  PEQQKEQDSVCLSLQKENPSQEPYAFKP--DYSSVPTTELSVLADDVADFEWLSHFVEES 112

Query: 472 FSDYSLAGKFPLKPMENRSEPAAKVQ----ERPCFT--TPVQTKARTKRARTGVRVRPVL 633
           FS++S A   P     N +  AAK      E P FT  TPVQTKAR+KR+R GVRV P+ 
Sbjct: 113 FSEFSAA--LPTVTESNPTGLAAKEPKPELESPVFTFKTPVQTKARSKRSRNGVRVWPLG 170

Query: 634 SPSFAEPXXXXXXXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXE---GGGVG 804
           SPSF E                P++P L+++ N   SL             +      VG
Sbjct: 171 SPSFTESSSSSTTTTSSSSSSSPSSPLLIYT-NIPRSLDHVCSEPKPNKPKKKHSSDSVG 229

Query: 805 AAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
             A PRRC+HCGV KTPQWR GPLGPKTLC
Sbjct: 230 TLA-PRRCSHCGVQKTPQWRTGPLGPKTLC 258


>ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citrus clementina]
           gi|568825030|ref|XP_006466892.1| PREDICTED: GATA
           transcription factor 5-like [Citrus sinensis]
           gi|557527548|gb|ESR38798.1| hypothetical protein
           CICLE_v10025844mg [Citrus clementina]
          Length = 381

 Score =  144 bits (363), Expect = 4e-32
 Identities = 115/328 (35%), Positives = 145/328 (44%), Gaps = 37/328 (11%)
 Frame = +1

Query: 22  MLYRTHQHHPFLF-TFNPFXXXXXXXXXXXXXXXXX--IQGMESVQGGLKNGFGPETAA- 189
           M Y+TH  + F F TF P                     Q ME V+  LK     E A  
Sbjct: 1   MPYQTHHLNFFQFHTFTPIHLSATSLLSSPPPPPPPTDFQDMECVEAALKTSLRKEMALK 60

Query: 190 ----VMDDFCGVN---GGAGDDFFVDELLDFSNG--------FSXXXXXXXXXXXXXXXX 324
                +D+ C VN   G A DDFFVD+LLDFSN                           
Sbjct: 61  LSPQAVDEICAVNLPNGVACDDFFVDDLLDFSNDDVVAEQQQLQEPQQEKGEEQKKHTLT 120

Query: 325 XXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDSFSDYSL---AG 495
               Q     E+ +    DD G +  SEL+   + + +LEWLSHFVEDSF++YS    AG
Sbjct: 121 VCSKQDQDLDERLNF---DDLGPIPTSELAVPTDDVANLEWLSHFVEDSFAEYSSPFPAG 177

Query: 496 KFPLKPMENRSEPAAK-VQERPCFTTPVQTKARTKRARTGVRVRPVLSPSFAEPXXXXXX 672
             P+K  EN +EP  K      CF TP+  KAR+KR+RTG+R+  + SPS ++       
Sbjct: 178 TLPVKAKENGAEPEHKPALAIHCFKTPIPAKARSKRSRTGLRIWSLGSPSLSDSSSTSSA 237

Query: 673 XXXXXXXFLPNNPWLVHSQNDG--ASLXXXXXXXXXXXXXE------------GGGVGAA 810
                    P++PW V S N G  ASL             +            GG +   
Sbjct: 238 SSSSS----PSSPWPV-STNPGSLASLRPAEPFIVKPPKKKLKKKSPPEGYNAGGNISWG 292

Query: 811 AQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
              RRC+HCGV KTPQWR GPLG KTLC
Sbjct: 293 QFTRRCSHCGVQKTPQWRTGPLGAKTLC 320


>ref|NP_001242253.1| uncharacterized protein LOC100783966 [Glycine max]
           gi|255637027|gb|ACU18846.1| unknown [Glycine max]
          Length = 352

 Score =  142 bits (359), Expect = 1e-31
 Identities = 108/308 (35%), Positives = 140/308 (45%), Gaps = 17/308 (5%)
 Frame = +1

Query: 22  MLYRTHQ------HHPFLFTFNPFXXXXXXXXXXXXXXXXXIQGMESVQGGLKNGFGPET 183
           MLY+T        HHP   +F+P                   + ME V+  LK+ +  E 
Sbjct: 1   MLYQTPYPQPFQFHHPLPSSFSPLLAVPTTPPPLYLPFPQAEKEMECVEAALKSNYRKEM 60

Query: 184 AAVM------DDFCGVNGGAGDDFFVDELLDFSNGFSXXXXXXXXXXXXXXXXXXXXQVT 345
              +      ++    NG   DDFFV++LLDFS+                          
Sbjct: 61  TLKLSPRTFTEEVSVQNGTTCDDFFVNDLLDFSH------VEEEPEQQEDTPCVSLQHEN 114

Query: 346 PPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDSFSDYSLAGKFPLKPME-- 519
           P  E  +    DD+ S+  SELS   + L  LEWLSHFVEDSFS++S A  FP       
Sbjct: 115 PSHEPCTFK--DDYASVPTSELSVLADDLADLEWLSHFVEDSFSEFSAA--FPTVTENPT 170

Query: 520 ---NRSEPAAKVQERPCFTTPVQTKARTKRARTGVRVRPVLSPSFAEPXXXXXXXXXXXX 690
                +EP  ++   P F TPVQTKAR+KR R G+RV P  SPSF +             
Sbjct: 171 ACLKEAEPEPEIPVFP-FKTPVQTKARSKRTRNGLRVWPFGSPSFTDSSSSSTTSSFSF- 228

Query: 691 XFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAG 870
            F P++P L+++Q    SL             +       A PRRC+HCGV KTPQWR G
Sbjct: 229 -FSPSSPLLIYTQ----SLDHLCSEPNTKKMKKKPSSDTLA-PRRCSHCGVQKTPQWRTG 282

Query: 871 PLGPKTLC 894
           PLGPKTLC
Sbjct: 283 PLGPKTLC 290


>ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citrus clementina]
           gi|557527549|gb|ESR38799.1| hypothetical protein
           CICLE_v10025844mg [Citrus clementina]
          Length = 340

 Score =  141 bits (356), Expect = 3e-31
 Identities = 105/287 (36%), Positives = 133/287 (46%), Gaps = 34/287 (11%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAA-----VMDDFCGVN---GGAGDDFFVDELLDFSNG------- 270
           ME V+  LK     E A       +D+ C VN   G A DDFFVD+LLDFSN        
Sbjct: 1   MECVEAALKTSLRKEMALKLSPQAVDEICAVNLPNGVACDDFFVDDLLDFSNDDVVAEQQ 60

Query: 271 -FSXXXXXXXXXXXXXXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEW 447
                                  Q     E+ +    DD G +  SEL+   + + +LEW
Sbjct: 61  QLQEPQQEKGEEQKKHTLTVCSKQDQDLDERLNF---DDLGPIPTSELAVPTDDVANLEW 117

Query: 448 LSHFVEDSFSDYSL---AGKFPLKPMENRSEPAAK-VQERPCFTTPVQTKARTKRARTGV 615
           LSHFVEDSF++YS    AG  P+K  EN +EP  K      CF TP+  KAR+KR+RTG+
Sbjct: 118 LSHFVEDSFAEYSSPFPAGTLPVKAKENGAEPEHKPALAIHCFKTPIPAKARSKRSRTGL 177

Query: 616 RVRPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVHSQNDG--ASLXXXXXXXXXXXXXE 789
           R+  + SPS ++                P++PW V S N G  ASL             +
Sbjct: 178 RIWSLGSPSLSDSSSTSSASSSSS----PSSPWPV-STNPGSLASLRPAEPFIVKPPKKK 232

Query: 790 ------------GGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                       GG +      RRC+HCGV KTPQWR GPLG KTLC
Sbjct: 233 LKKKSPPEGYNAGGNISWGQFTRRCSHCGVQKTPQWRTGPLGAKTLC 279


>ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp.
           vesca]
          Length = 333

 Score =  137 bits (346), Expect = 4e-30
 Identities = 99/257 (38%), Positives = 126/257 (49%), Gaps = 21/257 (8%)
 Frame = +1

Query: 187 AVMDDFC-GVNGGAG-----DDFFVDELLDFSNGFSXXXXXXXXXXXXXXXXXXXXQVTP 348
           AV DD   G+N   G     +DF VD+LLDFSN                           
Sbjct: 19  AVFDDLLWGLNAQNGGVQNCEDFSVDDLLDFSNDDGFVEQEEQEDDKKDSVLPKKESTVE 78

Query: 349 PPEKFS----LSAGDDFG---SLHESELSFQGEGLESLEWLSHFVEDSFSDYSL---AGK 498
             E  +    +S  ++ G   +   SEL+   + LE+LEWLSHFVEDSFS ++    AG 
Sbjct: 79  EKENSTPSSCVSEKNELGPEPAEPTSELTVPADDLENLEWLSHFVEDSFSGFNASLPAGF 138

Query: 499 FPLKPMENRSEPAAKVQERPCFTTPVQTKARTKRARTGVRVRPVLSPSFAEPXXXXXXXX 678
             +KP E R EP A    +PCF TPV  KAR+KR RTG RV  + SPSF E         
Sbjct: 139 MAVKP-EKRPEPEAL---KPCFKTPVPAKARSKRTRTGGRVWSLGSPSFTETSSSSSSSS 194

Query: 679 XXXXXFLPNNPWLVHSQNDG-----ASLXXXXXXXXXXXXXEGGGVGAAAQPRRCTHCGV 843
                  P++PWL+++   G     +S+             EGGG  ++  PRRC+HCGV
Sbjct: 195 STSSC--PSSPWLIYNPTQGLGGFGSSVEKPQKKPKRPATTEGGG--SSQPPRRCSHCGV 250

Query: 844 TKTPQWRAGPLGPKTLC 894
            KTPQWR GP G KTLC
Sbjct: 251 QKTPQWRTGPNGAKTLC 267


>gb|ADL36694.1| GATA domain class transcription factor [Malus domestica]
          Length = 331

 Score =  137 bits (345), Expect = 5e-30
 Identities = 107/277 (38%), Positives = 133/277 (48%), Gaps = 24/277 (8%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAA--------VMDDFCG----VNG-GAGDDFFVDELLDFSN--G 270
           ME V+  LK     E A         V DDF      VNG  A DDF VD+LLDFSN  G
Sbjct: 1   MECVEAALKTSIRKEMAVKATGPQVVVFDDFLWGGAVVNGQNACDDFSVDDLLDFSNEDG 60

Query: 271 F--SXXXXXXXXXXXXXXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLE 444
           F  +                    +     EK +LS   +      SELS   + LE+LE
Sbjct: 61  FVETEAEEEGDKEKVKGFVSVSLQKQNQETEKSNLSEKIE----PASELSVPADDLENLE 116

Query: 445 WLSHFVEDSFSDYSLA---GKFPLKPM-ENRSEPAAKVQERPCFTTPVQTKARTKRARTG 612
           WLSHFVEDSFS+++ A   G  P KP  E R +      E+PCF TPV  KAR+KR RTG
Sbjct: 117 WLSHFVEDSFSEFTTALPAGFLPEKPKSEKRPDLETPFPEKPCFKTPVPAKARSKRRRTG 176

Query: 613 VRVRPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVH--SQNDGASLXXXXXXXXXXXXX 786
            RV  + SPS  E                P++PW ++  +QN  ++              
Sbjct: 177 GRVWSLGSPSLTESSSSSSSSSSSS----PSSPWTIYPATQNQESAEPVSSVEKPPRKPK 232

Query: 787 EGGGVGAAAQP-RRCTHCGVTKTPQWRAGPLGPKTLC 894
                G+++QP RRC+HCGV KTPQWR GP G KTLC
Sbjct: 233 RRLVDGSSSQPPRRCSHCGVQKTPQWRTGPNGAKTLC 269


>ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Vitis vinifera]
          Length = 338

 Score =  137 bits (344), Expect = 7e-30
 Identities = 102/288 (35%), Positives = 133/288 (46%), Gaps = 35/288 (12%)
 Frame = +1

Query: 136 MESVQGGLKNGF-GPETA-------AVMDDFC---GVNGGAGDDFFVDELLDFSNGFSXX 282
           ME V+  LK+    PE A       A MDD C   G +G +GDDF +D+LLDF+NG    
Sbjct: 1   MECVEKALKSSVVRPELAFKLTQQPACMDDMCMGNGQSGVSGDDFSIDDLLDFTNG--GI 58

Query: 283 XXXXXXXXXXXXXXXXXXQVTPPPE----------KFSLSAGDDFGSLHESELSFQGEGL 432
                              ++P  E            + S  D+F S+  +EL+   + L
Sbjct: 59  GEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVKDEFPSVPATELTVPADDL 118

Query: 433 ESLEWLSHFVEDSFSDYSLAGKFPLKPM--------ENRSEPAAKVQERPCFTTPVQTKA 588
             LEWLSHFVEDSFS+YS    FP   +        EN  EP   +Q + C  TP   KA
Sbjct: 119 ADLEWLSHFVEDSFSEYS--APFPHGTLTEKAQNQTENPPEPETPLQIKSCLKTPFPAKA 176

Query: 589 RTKRARTGVRVRPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVHS------QNDGASLX 750
           R+KRARTG RV  + SPS  E                 ++PWL++       ++  +++ 
Sbjct: 177 RSKRARTGGRVWSMGSPSLTESSSSSSSSSSSSL----SSPWLIYPNTCQNVESFHSAVK 232

Query: 751 XXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                       E  G  A   P RC+HCGV KTPQWR GPLG KTLC
Sbjct: 233 PPAKKHKKRLDPEASG-SAQPTPHRCSHCGVQKTPQWRTGPLGAKTLC 279


>ref|XP_006572850.1| PREDICTED: uncharacterized protein LOC100783966 isoform X1 [Glycine
           max]
          Length = 308

 Score =  134 bits (337), Expect = 5e-29
 Identities = 98/264 (37%), Positives = 126/264 (47%), Gaps = 11/264 (4%)
 Frame = +1

Query: 136 MESVQGGLKNGFGPETAAVM------DDFCGVNGGAGDDFFVDELLDFSNGFSXXXXXXX 297
           ME V+  LK+ +  E    +      ++    NG   DDFFV++LLDFS+          
Sbjct: 1   MECVEAALKSNYRKEMTLKLSPQTFTEEVSVQNGTTCDDFFVNDLLDFSH------VEEE 54

Query: 298 XXXXXXXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDSFS 477
                           P  E  +    DD+ S+  SELS   + L  LEWLSHFVEDSFS
Sbjct: 55  PEQQEDTPCVSLQHENPSHEPCTFK--DDYASVPTSELSVLADDLADLEWLSHFVEDSFS 112

Query: 478 DYSLAGKFPLKPMENRSEPAAKVQERP-----CFTTPVQTKARTKRARTGVRVRPVLSPS 642
           ++S A  FP    EN +    + +  P      F TPVQTKAR+KR R G+RV P  SPS
Sbjct: 113 EFSAA--FPTVT-ENPTACLKEAEPEPEIPVFSFKTPVQTKARSKRTRNGLRVWPFGSPS 169

Query: 643 FAEPXXXXXXXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXEGGGVGAAAQPR 822
           F +                P++P L+++Q    SL             +       A PR
Sbjct: 170 FTDSSSSSTTSSSSSSS--PSSPLLIYTQ----SLDHLCSEPNTKKMKKKPSSDTLA-PR 222

Query: 823 RCTHCGVTKTPQWRAGPLGPKTLC 894
           RC+HCGV KTPQWR GPLGPKTLC
Sbjct: 223 RCSHCGVQKTPQWRTGPLGPKTLC 246


>ref|XP_002521500.1| conserved hypothetical protein [Ricinus communis]
            gi|223539178|gb|EEF40771.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 398

 Score =  132 bits (333), Expect = 1e-28
 Identities = 117/343 (34%), Positives = 140/343 (40%), Gaps = 52/343 (15%)
 Frame = +1

Query: 22   MLYRT--HQHHPFLFTFNPFXXXXXXXXXXXXXXXXXIQGMESVQGGLKNGFGPET---- 183
            MLY+T  H HHPF+F                         ME V+G LK  F  E     
Sbjct: 1    MLYQTTHHYHHPFIFHSPSATTSSSFLFYCHLLLFRWEIEMECVEGALKTSFRKELGFKL 60

Query: 184  ---AAVMDDFCGV---NGGAGDDFFVDELLDFSNGFSXXXXXXXXXXXXXXXXXXXX--- 336
               A  +DD   +   NG + DDF VDELLDFSN                          
Sbjct: 61   SPQAFFVDDLYALSMQNGTSSDDFIVDELLDFSNEEEAAVEREDEEEEEQQQQQKACTAV 120

Query: 337  --------QVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVEDSFSDYSLA 492
                    Q T  PE   +S   D  S   +EL    + L SLEWLSHFVEDS S+YS  
Sbjct: 121  SVSLSPNQQQTQRPEDGKIS---DSTSNFATELCVPADDLASLEWLSHFVEDSNSEYSTP 177

Query: 493  GKFPLKPM--------ENRSEPAAKVQE-----RPCFTTPVQTKARTKRARTGVRVRPVL 633
              FP   +        EN ++P    Q+        F TPVQTKAR+KR RTGVRV P+ 
Sbjct: 178  --FPAAGIVSHENHKEENDNKPFYVTQKPVVLTETFFKTPVQTKARSKRTRTGVRVWPLG 235

Query: 634  SPSFAEP------XXXXXXXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXE-- 789
            SPS  E                      P +P+L+ +   G S              +  
Sbjct: 236  SPSLTESSSSSSYTSSSSSSSSSSSSSSPLSPYLIFT-TQGMSRELTEPICYEKTPIKKL 294

Query: 790  --------GGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                      G G +  PRRC+HCGV KTPQWR GPLG KTLC
Sbjct: 295  KKRFSGEPASGGGGSQPPRRCSHCGVQKTPQWRTGPLGAKTLC 337


>gb|EPS57889.1| hypothetical protein M569_16927, partial [Genlisea aurea]
          Length = 276

 Score =  132 bits (331), Expect = 2e-28
 Identities = 97/229 (42%), Positives = 110/229 (48%), Gaps = 7/229 (3%)
 Frame = +1

Query: 229 DDFFVDELLDFSNG-FSXXXXXXXXXXXXXXXXXXXX-QVTPPPEKFSLSAGDDFGSLHE 402
           DD FVDELLDFS+  FS                     +V   PE      G   G L E
Sbjct: 1   DDLFVDELLDFSSHEFSDGEGEEEMGEGKRENGRDDSSRVLGEPED-----GGVLGGLPE 55

Query: 403 SELSFQGEGLESLEWLSHFVEDSFSDYSLAGKFPLKPMENRSEPAAKVQERPCFTTP-VQ 579
           S  +    GLESLEWLSHFVE+SFSD+SLAGK      E +     K      F  P VQ
Sbjct: 56  SPFAAAAAGLESLEWLSHFVEESFSDFSLAGKLTSDAAEEKIPAPGKA----WFGGPQVQ 111

Query: 580 TKARTKRARTGVRVRPVLS----PSFAEPXXXXXXXXXXXXXFLPNNPWLVHSQNDGASL 747
           TKAR+KRAR G+RV PVLS     + +               F    P  V S+     L
Sbjct: 112 TKARSKRARIGIRVWPVLSTDSTATSSSSSSSSSTTTATATAFDSLTPNAVQSR---VPL 168

Query: 748 XXXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                        E  G  AA QPRRC+HCGVTKTPQWRAGP+G KTLC
Sbjct: 169 LVGKKRMKKRKESEHSGAVAAGQPRRCSHCGVTKTPQWRAGPMGSKTLC 217


>ref|XP_004234546.1| PREDICTED: GATA transcription factor 6-like [Solanum lycopersicum]
          Length = 280

 Score =  131 bits (330), Expect = 3e-28
 Identities = 99/255 (38%), Positives = 115/255 (45%), Gaps = 2/255 (0%)
 Frame = +1

Query: 136 MESVQGGLKNG-FGPETAAVMDDFCGVNGGAGDDFFVDELLDFSNGFSXXXXXXXXXXXX 312
           M S++  LK   F PETA  M      N  + DDFFVD LLD SNGF+            
Sbjct: 1   MNSLEKALKTSYFRPETAMKMTH----NQPSIDDFFVDNLLDLSNGFAEDEIEQLNEHPN 56

Query: 313 XXXXXXXXQVTPPPEKFSLSAGDDFGSLHESELSFQGEGLESLEWLSHFVE-DSFSDYSL 489
                    V+P  +K       DFG     ELS+   GL++LEWLS FVE DS S YSL
Sbjct: 57  GFNTQNLCSVSP--QKKMEDENGDFGC----ELSYPENGLDNLEWLSQFVEEDSHSGYSL 110

Query: 490 AGKFPLKPMENRSEPAAKVQERPCFTTPVQTKARTKRARTGVRVRPVLSPSFAEPXXXXX 669
            GK P+K  +N+S     VQ   CFT PVQTK RTKR R G RV      S +       
Sbjct: 111 IGKLPVK--KNKSVTENPVQVNSCFTVPVQTKPRTKRRRIGGRVWSFTGSSTSSASSSTI 168

Query: 670 XXXXXXXXFLPNNPWLVHSQNDGASLXXXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTK 849
                     P            AS+                      QPRRC+HCGV K
Sbjct: 169 TTTAESIVRFP------------ASVSNRRKMK----------TEKPVQPRRCSHCGVHK 206

Query: 850 TPQWRAGPLGPKTLC 894
           TPQWR GP+G KTLC
Sbjct: 207 TPQWRTGPMGAKTLC 221


>emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]
          Length = 338

 Score =  131 bits (330), Expect = 3e-28
 Identities = 100/288 (34%), Positives = 131/288 (45%), Gaps = 35/288 (12%)
 Frame = +1

Query: 136 MESVQGGLKNGF-GPETA-------AVMDDFC---GVNGGAGDDFFVDELLDFSNGFSXX 282
           ME V+  LK+    PE A       A  DD C   G +G +GDDF +D+LLDF+NG    
Sbjct: 1   MECVEKALKSSVVRPELAFKLTQQPACXDDICMGNGQSGVSGDDFSIDDLLDFTNG--GI 58

Query: 283 XXXXXXXXXXXXXXXXXXQVTPPPE----------KFSLSAGDDFGSLHESELSFQGEGL 432
                              ++P  E            + S  D+F S+  +EL+   + L
Sbjct: 59  GEGLFQEEDEEDEDKGCGSLSPRRELTENDNSNLTTTTFSVKDEFPSVPATELTVPADDL 118

Query: 433 ESLEWLSHFVEDSFSDYSLAGKFPLKPM--------ENRSEPAAKVQERPCFTTPVQTKA 588
             LEWLSHFVEDSFS+YS    FP   +        EN  EP   +Q + C  TP   KA
Sbjct: 119 ADLEWLSHFVEDSFSEYS--APFPPGTLTEKAQNQTENPPEPETPLQIKSCLKTPFPAKA 176

Query: 589 RTKRARTGVRVRPVLSPSFAEPXXXXXXXXXXXXXFLPNNPWLVHS------QNDGASLX 750
           R+KRARTG RV  + SPS  E                 ++PWL++       ++  +++ 
Sbjct: 177 RSKRARTGGRVWSMGSPSLTESSSSSSSSSSSSL----SSPWLIYPNTCQNVESFHSAVK 232

Query: 751 XXXXXXXXXXXXEGGGVGAAAQPRRCTHCGVTKTPQWRAGPLGPKTLC 894
                       E  G  A   P RC+HCGV KT QWR GPLG KTLC
Sbjct: 233 PPAKKHKKRLDPEASG-SAQXTPHRCSHCGVQKTXQWRTGPLGAKTLC 279

