BLASTX nr result

ID: Atropa21_contig00010476 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00010476
         (1111 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like ...   397   e-108
ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like ...   392   e-106
dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum]        392   e-106
ref|XP_002274872.1| PREDICTED: GATA transcription factor 5-like ...   246   9e-63
emb|CBI38005.3| unnamed protein product [Vitis vinifera]              235   3e-59
gb|EPS63160.1| hypothetical protein M569_11630, partial [Genlise...   220   7e-55
gb|ADL36693.1| GATA domain class transcription factor [Malus dom...   220   9e-55
ref|XP_006423461.1| hypothetical protein CICLE_v10028860mg [Citr...   218   3e-54
ref|XP_002512985.1| GATA transcription factor, putative [Ricinus...   218   3e-54
gb|ADL36697.1| GATA domain class transcription factor [Malus dom...   218   5e-54
ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Popu...   217   8e-54
gb|EOX97872.1| GATA transcription factor 5, putative [Theobroma ...   212   2e-52
ref|XP_006487363.1| PREDICTED: GATA transcription factor 5-like ...   211   3e-52
ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Popu...   210   7e-52
ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like ...   207   5e-51
gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis]          203   9e-50
ref|XP_004290341.1| PREDICTED: GATA transcription factor 5-like ...   203   1e-49
gb|ESW03754.1| hypothetical protein PHAVU_011G039600g [Phaseolus...   202   2e-49
ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Viti...   200   7e-49
ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Caps...   196   1e-47

>ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum]
          Length = 325

 Score =  397 bits (1021), Expect = e-108
 Identities = 209/296 (70%), Positives = 215/296 (72%), Gaps = 11/296 (3%)
 Frame = +2

Query: 2   VTGINNGTXXXXXXXXXXXXXXXXXXXXXXQELHEDDEKDLFSKSSQNRNSQDSTFSGME 181
           VTGINNG                        ELHEDDEK  FS SSQNRNSQDSTFSGME
Sbjct: 33  VTGINNGASEDFSVDDLLDFSDKDFKDP---ELHEDDEKTSFSGSSQNRNSQDSTFSGME 89

Query: 182 SFG----ELPIPVDELENLEWLSQFVDDTTSEFSLLCPTGSYKDKIGGFQESRSEPVVQ- 346
           SFG    ELPIPVDE+ENLEWLSQFVDDT SEFSLLCP  S+KDK G F E RSEPVV+ 
Sbjct: 90  SFGSLAGELPIPVDEMENLEWLSQFVDDTPSEFSLLCPAESFKDKTGDFTEFRSEPVVRP 149

Query: 347 ---KMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXXXXXXXXXXXXXXXXX---LFM 508
              KMRVPCF LP PVKPR+KRSRPAG TWSF                          F 
Sbjct: 150 VVKKMRVPCFPLPFPVKPRSKRSRPAGRTWSFPSSTVSGDSSSPTSSSYGSSPFPSGFFT 209

Query: 509 NPVQDGDLFCSVEKPALKKPKKNSLAEPGSGRRCTHCQVQKTPQWRAGPLGPKSLCNACG 688
           NPV DGDLFCSVEKP LKKPKKN  AE GSGRRCTHCQVQKTPQWRAGPLGPK+LCNACG
Sbjct: 210 NPVYDGDLFCSVEKPPLKKPKKNPSAETGSGRRCTHCQVQKTPQWRAGPLGPKTLCNACG 269

Query: 689 VRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVLEMRRKKETGEVIDSGLTSMASNC 856
           VRYKSGRL+PEYRPACSPTFS EVHSNSHRKVLEMRRKKETGEVIDSGL SM S C
Sbjct: 270 VRYKSGRLYPEYRPACSPTFSLEVHSNSHRKVLEMRRKKETGEVIDSGLASMISTC 325


>ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like [Solanum lycopersicum]
          Length = 325

 Score =  392 bits (1008), Expect = e-106
 Identities = 207/296 (69%), Positives = 213/296 (71%), Gaps = 11/296 (3%)
 Frame = +2

Query: 2   VTGINNGTXXXXXXXXXXXXXXXXXXXXXXQELHEDDEKDLFSKSSQNRNSQDSTFSGME 181
           VTGINNG                        ELHEDDEK  FS SSQ RNSQDSTFSGME
Sbjct: 33  VTGINNGASEDFSVDDLLDFSDKDFKDP---ELHEDDEKTSFSGSSQKRNSQDSTFSGME 89

Query: 182 SFG----ELPIPVDELENLEWLSQFVDDTTSEFSLLCPTGSYKDKIGGFQESRSEPVVQ- 346
           SFG    ELPIPVD++ENLEWLSQFVDDT SEFSLLCPT S+KDK GGF ESRSEPVV+ 
Sbjct: 90  SFGSLAGELPIPVDDMENLEWLSQFVDDTPSEFSLLCPTESFKDKTGGFTESRSEPVVRP 149

Query: 347 ---KMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXXXXXXXXXXXXXXXXX---LFM 508
              K RVPCF LP PVKPR+KRSR AG TWSF                          F 
Sbjct: 150 VVKKTRVPCFPLPFPVKPRSKRSRQAGRTWSFPSSAVSGDSSSPTSSSYGSSPFPSGFFT 209

Query: 509 NPVQDGDLFCSVEKPALKKPKKNSLAEPGSGRRCTHCQVQKTPQWRAGPLGPKSLCNACG 688
           NPV DGDLFCSVEKP LKKPKKN   E GSGRRCTHCQVQKTPQWRAGPLGPK+LCNACG
Sbjct: 210 NPVYDGDLFCSVEKPPLKKPKKNPSVETGSGRRCTHCQVQKTPQWRAGPLGPKTLCNACG 269

Query: 689 VRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVLEMRRKKETGEVIDSGLTSMASNC 856
           VRYKSGRLFPEYRPACSPTFS EVHSNSHRKVLEMRRKKETGE IDSGL SM S C
Sbjct: 270 VRYKSGRLFPEYRPACSPTFSLEVHSNSHRKVLEMRRKKETGEGIDSGLASMISTC 325


>dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum]
          Length = 326

 Score =  392 bits (1006), Expect = e-106
 Identities = 199/263 (75%), Positives = 210/263 (79%), Gaps = 8/263 (3%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKSSQNRNSQDSTFSGMESF-GELPIPVDELENLEWLSQFVDDTTSEF 268
           QELHEDDEKD FS SSQ+RNSQ S FS M+SF GELP+PVDELENLEWLSQFVDD+TSEF
Sbjct: 64  QELHEDDEKDSFSGSSQHRNSQVSNFSCMDSFSGELPVPVDELENLEWLSQFVDDSTSEF 123

Query: 269 SLLCPTGSYKDKIGGFQESRSEPVV----QKMRVPCFSLPVPVKPRTKRSRPAGSTWSFX 436
           SLLCP GS+KDK GGFQ SRSEPVV    QK++VPCF LPV  KPRT RSRPAG  WSF 
Sbjct: 124 SLLCPAGSFKDKTGGFQVSRSEPVVRPVVQKLKVPCFPLPVVQKPRTYRSRPAGRKWSFS 183

Query: 437 XXXXXXXXXXXXXXXXXXXXX---LFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSGRR 607
                                   LF NPV DGDLFCSVEKP LKKPKK S AE GSGRR
Sbjct: 184 SPTVSADSCSPTSSSYGSSPFPSVLFSNPVLDGDLFCSVEKPPLKKPKKLSTAETGSGRR 243

Query: 608 CTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVL 787
           CTHCQVQKTPQWRAGPLGPK+LCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVL
Sbjct: 244 CTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVL 303

Query: 788 EMRRKKETGEVIDSGLTSMASNC 856
           EMRRKKE+GEV+DSGL +M S C
Sbjct: 304 EMRRKKESGEVVDSGLATMISTC 326


>ref|XP_002274872.1| PREDICTED: GATA transcription factor 5-like [Vitis vinifera]
          Length = 317

 Score =  246 bits (629), Expect = 9e-63
 Identities = 135/256 (52%), Positives = 162/256 (63%), Gaps = 12/256 (4%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKSSQNR----NSQDSTFSGMESF-----GELPIPVDELENLEWLSQF 244
           +E  E++EKD FS SS  R    NS  S+FSG   F     G L +P D+LE+LEWLS F
Sbjct: 66  EEEEEEEEKDSFSWSSLERVDDDNSNSSSFSGTGDFESLSAGGLAVPADDLEHLEWLSHF 125

Query: 245 VDDTT-SEFSLLCP--TGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPA 415
           VDD++ SE SLLCP  TG+   K         EP    +R P F  P+P KPR+KR R +
Sbjct: 126 VDDSSASELSLLCPAVTGNSPSK-----RCEEEPRPALLRTPLFPTPLPAKPRSKRHRSS 180

Query: 416 GSTWSFXXXXXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPG 595
           G  W+F                      +F N V + + F S+EKP  KKPKK+  A+  
Sbjct: 181 GRAWAFGSHSPSSSPSSSSSSSSTSCL-IFANTVHNMESFYSLEKPPAKKPKKSPSADSQ 239

Query: 596 SGRRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSH 775
             RRC+HC VQKTPQWR GPLGPK+LCNACGVR+KSGRLFPEYRPACSPTFS E+HSNSH
Sbjct: 240 PQRRCSHCLVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSVEIHSNSH 299

Query: 776 RKVLEMRRKKETGEVI 823
           RKVLE+RRKKET E +
Sbjct: 300 RKVLEIRRKKETAEPV 315


>emb|CBI38005.3| unnamed protein product [Vitis vinifera]
          Length = 352

 Score =  235 bits (599), Expect = 3e-59
 Identities = 125/233 (53%), Positives = 149/233 (63%), Gaps = 8/233 (3%)
 Frame = +2

Query: 149 NSQDSTFSGMESF-----GELPIPVDELENLEWLSQFVDDTT-SEFSLLCP--TGSYKDK 304
           NS  S+FSG   F     G L +P D+LE+LEWLS FVDD++ SE SLLCP  TG+   K
Sbjct: 124 NSNSSSFSGTGDFESLSAGGLAVPADDLEHLEWLSHFVDDSSASELSLLCPAVTGNSPSK 183

Query: 305 IGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXXXXXXXXXXXX 484
                    EP    +R P F  P+P KPR+KR R +G  W+F                 
Sbjct: 184 -----RCEEEPRPALLRTPLFPTPLPAKPRSKRHRSSGRAWAFGSHSPSSSPSSSSSSSS 238

Query: 485 XXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSGRRCTHCQVQKTPQWRAGPLGP 664
                +F N V + + F S+EKP  KKPKK+  A+    RRC+HC VQKTPQWR GPLGP
Sbjct: 239 TSCL-IFANTVHNMESFYSLEKPPAKKPKKSPSADSQPQRRCSHCLVQKTPQWRTGPLGP 297

Query: 665 KSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVLEMRRKKETGEVI 823
           K+LCNACGVR+KSGRLFPEYRPACSPTFS E+HSNSHRKVLE+RRKKET E +
Sbjct: 298 KTLCNACGVRFKSGRLFPEYRPACSPTFSVEIHSNSHRKVLEIRRKKETAEPV 350


>gb|EPS63160.1| hypothetical protein M569_11630, partial [Genlisea aurea]
          Length = 312

 Score =  220 bits (561), Expect = 7e-55
 Identities = 126/249 (50%), Positives = 151/249 (60%), Gaps = 17/249 (6%)
 Frame = +2

Query: 113 EKDLFSKSSQNRNSQDSTFSGMESF-----GELPIPVDE-LENLEWLSQFVDDTTSEFSL 274
           ++D  S +    N+  STFSG + F     G L +PV+E L+NLEWLSQF DD+T+  + 
Sbjct: 68  KEDQDSSNKGGSNNSSSTFSGADDFDSLSSGNLHVPVEEDLDNLEWLSQFADDSTAAGAS 127

Query: 275 LCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXX 454
           L P G++  +       +SE  V + R      PVP K R+KR R  G +WS        
Sbjct: 128 LFPIGNFPSRAS----VKSEAAVDE-RAFIIPPPVPRKSRSKRERSNGQSWSLTSPQLSS 182

Query: 455 XXXXXXXXXXXXXXX-----LFMNPV----QDGDLFCSVEKPALKKPKKNSLAEPG--SG 601
                               LF+N      Q+ D F +VEKP  KKPK+    E G  SG
Sbjct: 183 VDSSTASSSSYTSTPPLPILLFLNAAPAAAQEPDWFSTVEKPPAKKPKRKPEPESGGLSG 242

Query: 602 RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRK 781
           RRCTHCQVQKTPQWR GPLGPK+LCNACGVR+KSGRLFPEYRPACSPTFS +VHSNSHRK
Sbjct: 243 RRCTHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSHDVHSNSHRK 302

Query: 782 VLEMRRKKE 808
           VLEMRRKKE
Sbjct: 303 VLEMRRKKE 311


>gb|ADL36693.1| GATA domain class transcription factor [Malus domestica]
          Length = 323

 Score =  220 bits (560), Expect = 9e-55
 Identities = 122/248 (49%), Positives = 146/248 (58%), Gaps = 10/248 (4%)
 Frame = +2

Query: 104 EDDEKDLFSKSSQNRNSQDSTFSGMES--FGELPIPVDELENLEWLSQFVDDTTSEFSLL 277
           E +E+D  S   +  NS +S  +  +S    +L +P D+L  LEW+S FVDD+  + SLL
Sbjct: 66  EGEERDSVSVDDETSNSSNSVLADSDSGLATQLVVPDDDLAELEWVSHFVDDSLPDLSLL 125

Query: 278 CPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXXX 457
              G  K +      S SEP   ++R   F   VPVKPRTKR R A   WS         
Sbjct: 126 HTIGVQKPEALLANRSESEPKPAQLRASLFPFEVPVKPRTKRCRLASRDWSLSSSSSPSS 185

Query: 458 XXXXXXXXXXXXXX-LFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSG-------RRCT 613
                          L  NPVQ   +F  V +PA KK KK    + G G       RRC+
Sbjct: 186 PSSSSGSGLSFSTPCLIFNPVQSMHVF--VGEPAAKKQKKKPAVQTGEGSIGGQFQRRCS 243

Query: 614 HCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVLEM 793
           HCQVQKTPQWR GPLGPK+LCNACGVR+KSGRLFPEYRPACSPTFS +VHSNSHRKVLEM
Sbjct: 244 HCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGDVHSNSHRKVLEM 303

Query: 794 RRKKETGE 817
           R++KE GE
Sbjct: 304 RKRKEVGE 311


>ref|XP_006423461.1| hypothetical protein CICLE_v10028860mg [Citrus clementina]
           gi|557525395|gb|ESR36701.1| hypothetical protein
           CICLE_v10028860mg [Citrus clementina]
          Length = 315

 Score =  218 bits (556), Expect = 3e-54
 Identities = 126/263 (47%), Positives = 154/263 (58%), Gaps = 15/263 (5%)
 Frame = +2

Query: 110 DEKDLFSKSS---QNRNSQDSTFSGMESF--GELPIPVDELENLEWLSQFVDDTT-SEFS 271
           D+KD FS       + NS   +FS  +S    E   PVD+   LEW+SQFVDD++ SE S
Sbjct: 60  DDKDYFSSPDPVDDDNNSNSGSFSSEQSLLTNEFVEPVDDFAELEWVSQFVDDSSCSELS 119

Query: 272 LLCPTGSYKDKIGGFQESRSEPVVQKMRV------PCFSLPVPVKPRTKRSRPAGSTWSF 433
           LL P    + +     E   +PV  K         PCF L VP K RTKR+R +G  WS 
Sbjct: 120 LLYPNYVERTR----SEPNGKPVSNKTSTNPTTTSPCFPLRVPSKARTKRTRRSGRAWS- 174

Query: 434 XXXXXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSG---R 604
                                 +F + VQ+ + F   ++P +KKPKK    + G G   R
Sbjct: 175 --SGSPLSTESTISSSSSTSCLIFTDSVQNIEWFSGFDEPVVKKPKKKPAVQSGGGLFQR 232

Query: 605 RCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKV 784
           RC+HCQ QKTPQWR GPLGPK+LCNACGVRYKSGRLFPEYRPACSPTFS ++HSNSHRKV
Sbjct: 233 RCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKV 292

Query: 785 LEMRRKKETGEVIDSGLTSMASN 853
           LEMRRKKE+    D GL+ M  +
Sbjct: 293 LEMRRKKESAGP-DVGLSHMVQS 314


>ref|XP_002512985.1| GATA transcription factor, putative [Ricinus communis]
           gi|223547996|gb|EEF49488.1| GATA transcription factor,
           putative [Ricinus communis]
          Length = 368

 Score =  218 bits (556), Expect = 3e-54
 Identities = 126/266 (47%), Positives = 151/266 (56%), Gaps = 19/266 (7%)
 Frame = +2

Query: 104 EDDEKDLFSKSSQNR-------NSQDSTFSGMESFGELPIPVDELENLEWLSQFVDDTTS 262
           E++EKD  S SSQ+R       NS  STF       EL +P+++L  LEW+SQFVDD++ 
Sbjct: 100 EEEEKDSLSVSSQDRSGVDDDNNSNSSTFDESFLTSELAVPIEDLAELEWVSQFVDDSSP 159

Query: 263 EFSLLCPTGSYKDKIGG-FQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXX 439
           EFSLL P  S        FQ    +PV        F + +P KPR+KR+RP G TWS   
Sbjct: 160 EFSLLYPLNSEDHHTRNRFQPEHPKPVALTKPSCLFPVKIPAKPRSKRTRPTGRTWSVES 219

Query: 440 XXXXXXXXXXXXXXXXXXXXLFMNP----VQDGDLFCSVEKPALKKPKKNSLAEPGSG-- 601
                                   P    VQ  D   S  +P  KK K+   A+ G    
Sbjct: 220 LLTDSSSSSSSYCSSSPISSSASTPCFVTVQTIDSLPSFCEPPAKKAKRKPAAQTGGATG 279

Query: 602 -----RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHS 766
                RRC+HCQVQKTPQWR GPLG K+LCNACGVRYKSGRLFPEYRPACSPTFS ++HS
Sbjct: 280 LTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSGDIHS 339

Query: 767 NSHRKVLEMRRKKETGEVIDSGLTSM 844
           NSHRKVLE+R+KKE      SGL+ M
Sbjct: 340 NSHRKVLEIRKKKELSGPA-SGLSQM 364


>gb|ADL36697.1| GATA domain class transcription factor [Malus domestica]
          Length = 321

 Score =  218 bits (554), Expect = 5e-54
 Identities = 119/247 (48%), Positives = 144/247 (58%), Gaps = 9/247 (3%)
 Frame = +2

Query: 104 EDDEKDLFSKSSQNRNSQDSTFSGMES--FGELPIPVDELENLEWLSQFVDDTTSEFSLL 277
           E++EK+  S   +  NS        +S    +L +P D+L  LEW+S FVDD+  + SL 
Sbjct: 66  EEEEKESVSVDDEISNSSSLVLPDSDSGLATQLLVPDDDLAELEWVSHFVDDSLPDLSLF 125

Query: 278 CPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXXXX 457
              G+ K +         EP    +R P F   VPVKPRTKR +PA   WS         
Sbjct: 126 HTIGTQKPEALLMNRFEPEPKPVPLRAPLFPFQVPVKPRTKRYKPASRVWSSSSSCSPSS 185

Query: 458 XXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSG-------RRCTH 616
                         +F NPVQ  D+F  V +PA KK KK    + G G       RRC+H
Sbjct: 186 SPCSSGFSFSTPCLIF-NPVQSMDVF--VGEPAAKKQKKKPAVQTGEGSIGGQFQRRCSH 242

Query: 617 CQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKVLEMR 796
           CQVQKTPQWR GPLGPK+LCNACGVR+KSGRLFPEYRPACSPTFS  VHSNSHRKVLEMR
Sbjct: 243 CQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGAVHSNSHRKVLEMR 302

Query: 797 RKKETGE 817
           ++K+ GE
Sbjct: 303 KRKDVGE 309


>ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Populus trichocarpa]
           gi|550341195|gb|EEE85968.2| hypothetical protein
           POPTR_0004s16860g [Populus trichocarpa]
          Length = 327

 Score =  217 bits (552), Expect = 8e-54
 Identities = 125/250 (50%), Positives = 144/250 (57%), Gaps = 13/250 (5%)
 Frame = +2

Query: 104 EDDEKDLFSKSSQNR-----NSQDSTFSGMESFGELPIPVDELENLEWLSQFVDDTTSEF 268
           E++EKD  S SSQ+R     NS  S+FS      EL +P D++  LEW+S FV+D+ S+ 
Sbjct: 66  EEEEKDSLSVSSQDRVDDDFNSNSSSFSDSFLSSELAVPTDDIAELEWVSHFVNDSLSDV 125

Query: 269 SLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLP-VPVKPRTKRSRPAGSTWSFXXXX 445
           SLL P    K +         EP     + P F  P VP K RTKRSR  G TWS     
Sbjct: 126 SLLVPACKGKPESHAKNRFEPEPKPSLAKTPGFFPPRVPSKARTKRSRRTGRTWS-GRSN 184

Query: 446 XXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSL-------AEPGSGR 604
                             +  N VQ  D    + +P +KKPKK          A P   R
Sbjct: 185 QTETPSSSASSTSSMPCLVSANTVQTIDSLSWLSEPPMKKPKKRPAVQTSGITAAPQFQR 244

Query: 605 RCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKV 784
           RC+HCQVQKTPQWR GP G K+LCNACGVRYKSGRLFPEYRPACSPTFS EVHSNSHRKV
Sbjct: 245 RCSHCQVQKTPQWRTGPHGAKTLCNACGVRYKSGRLFPEYRPACSPTFSSEVHSNSHRKV 304

Query: 785 LEMRRKKETG 814
           LEMRRKKE G
Sbjct: 305 LEMRRKKEMG 314


>gb|EOX97872.1| GATA transcription factor 5, putative [Theobroma cacao]
          Length = 322

 Score =  212 bits (540), Expect = 2e-52
 Identities = 119/255 (46%), Positives = 150/255 (58%), Gaps = 17/255 (6%)
 Frame = +2

Query: 95  ELHEDDEKDLFSKSSQNRNSQDSTFSGMESFG-------ELPIPVDELENLEWLSQFVDD 253
           E  E+++KD FS SS+ R + D + S   SF        EL +P DE+  LEW+S FVDD
Sbjct: 55  EFEEEEQKDSFSVSSEERVADDDSNSNSSSFSFDSLLTNELSVPDDEIAGLEWVSHFVDD 114

Query: 254 TTSEFSLLCPTGSYKDKIGGFQES--RSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTW 427
           +  E  +LCP   +K +  G  ++   +EP +  M+ P FS  VP K R+KR++  G TW
Sbjct: 115 SFPELPILCPV--FKPQSDGHAKTLFETEPELVFMKTPSFSSTVPSKARSKRAKSTGRTW 172

Query: 428 SFXXXXXXXXXXXXXXXXXXXXXXLFMNP-VQDGDLFCSVEKPALKKPKKNSLAEPG--- 595
           S                          +  VQ+ DL     +P  KK KK    +     
Sbjct: 173 SVGSMPLSESSSSTITSSSTSSGFSVTSANVQETDLANDFTEPPTKKQKKKPAVQASGLS 232

Query: 596 SG----RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVH 763
           SG    RRC+HCQVQKTPQWR GPLG K+LCNACGVRYKSGRLFPEYRPACSPTFS ++H
Sbjct: 233 SGNPFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSGDIH 292

Query: 764 SNSHRKVLEMRRKKE 808
           SNSHRKVLEMR++KE
Sbjct: 293 SNSHRKVLEMRKRKE 307


>ref|XP_006487363.1| PREDICTED: GATA transcription factor 5-like [Citrus sinensis]
          Length = 316

 Score =  211 bits (538), Expect = 3e-52
 Identities = 125/264 (47%), Positives = 152/264 (57%), Gaps = 16/264 (6%)
 Frame = +2

Query: 110 DEKDLFSKSS---QNRNSQDSTFSGMESF--GELPIPVDELENLEWLSQFVDDTT-SEFS 271
           D+KD FS       + NS   +FS  +S    E   PVD+   LEW+SQFVDD++ SE S
Sbjct: 60  DDKDSFSSPDPVDDDNNSNSGSFSSEQSLLTNEFVEPVDDFAELEWVSQFVDDSSCSELS 119

Query: 272 LLCPTGSYKDKIGGFQESRSEPVVQKMRV-------PCFSLPVPVKPRTKRSRPAGSTWS 430
           LL P    + +     E   +PV  K          PCF L VP K RTKR+R +G  WS
Sbjct: 120 LLYPNYVERTR----SEPDGKPVSNKTSTNPTTTTSPCFPLRVPSKARTKRTRRSGWAWS 175

Query: 431 FXXXXXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSG--- 601
                                  +F + VQ+ + F   ++P  KK KK    + G G   
Sbjct: 176 ---SGSPLSTESTISSSSSTSCLIFTDSVQNIEWFSGFDEPVAKKLKKKPAVQSGGGLFQ 232

Query: 602 RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRK 781
           RRC+HCQ QKTPQWR GPLGPK+LCNACGVRYKSGRLFPEYRPACSPTFS ++HSNSHRK
Sbjct: 233 RRCSHCQTQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRK 292

Query: 782 VLEMRRKKETGEVIDSGLTSMASN 853
           VLEMRRKKE+    D GL+ M  +
Sbjct: 293 VLEMRRKKESAGP-DVGLSHMVQS 315


>ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Populus trichocarpa]
           gi|550331601|gb|EEE87718.2| hypothetical protein
           POPTR_0009s12620g [Populus trichocarpa]
          Length = 329

 Score =  210 bits (535), Expect = 7e-52
 Identities = 123/249 (49%), Positives = 143/249 (57%), Gaps = 14/249 (5%)
 Frame = +2

Query: 104 EDDEKDLFSKSSQNR-----NSQDSTFSGMESFGELPIPVDELENLEWLSQFVDDTTSEF 268
           +++EKD  S SSQ+R     NS  S+FS      EL +P D++  LEW+S FVDD+ S+ 
Sbjct: 68  QEEEKDSISVSSQDRVDDDFNSNSSSFSDSFLASELAVPTDDIAELEWVSHFVDDSVSDV 127

Query: 269 SLLCPT--GSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXX 442
           SLL P   GS K       E  ++P   K     F   VP K RTKRSRP G TWS    
Sbjct: 128 SLLVPACKGSSKRHAKNRFEPETKPTFAKTSC-LFPSRVPSKARTKRSRPTGRTWS-AGS 185

Query: 443 XXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNS-------LAEPGSG 601
                              +  N VQ  D    + +  +K  KK         +A     
Sbjct: 186 NQSETPSSSTSSTSSMPCLVATNTVQTADSLSWLSEQPMKISKKRPAVHTSGLMASTQFQ 245

Query: 602 RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRK 781
           RRC+HCQVQKTPQWR GPLG K+LCNACGVRYKSGRLFPEYRPACSPTFS EVHSNSHRK
Sbjct: 246 RRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTFSSEVHSNSHRK 305

Query: 782 VLEMRRKKE 808
           VLEMRRKKE
Sbjct: 306 VLEMRRKKE 314


>ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp.
           vesca]
          Length = 333

 Score =  207 bits (528), Expect = 5e-51
 Identities = 129/276 (46%), Positives = 149/276 (53%), Gaps = 23/276 (8%)
 Frame = +2

Query: 92  QELHEDDEKD--LFSKSSQNRNSQDSTFSGM------------ESFGELPIPVDELENLE 229
           QE  EDD+KD  L  K S     ++ST S              E   EL +P D+LENLE
Sbjct: 58  QEEQEDDKKDSVLPKKESTVEEKENSTPSSCVSEKNELGPEPAEPTSELTVPADDLENLE 117

Query: 230 WLSQFVDDTTSEFSLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSR 409
           WLS FV+D+ S F+   P G    K     E R EP   K   PCF  PVP K R+KR+R
Sbjct: 118 WLSHFVEDSFSGFNASLPAGFMAVK----PEKRPEPEALK---PCFKTPVPAKARSKRTR 170

Query: 410 PAGSTWSFXXXXXXXXXXXXXXXXXXXXXX----LFMNPVQD-GDLFCSVEKPALKKPKK 574
             G  WS                           L  NP Q  G    SVEKP  KKPK+
Sbjct: 171 TGGRVWSLGSPSFTETSSSSSSSSSTSSCPSSPWLIYNPTQGLGGFGSSVEKPQ-KKPKR 229

Query: 575 NSLAEPGSG----RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSP 742
            +  E G      RRC+HC VQKTPQWR GP G K+LCNACGVRYKSGRL PEYRPACSP
Sbjct: 230 PATTEGGGSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLVPEYRPACSP 289

Query: 743 TFSQEVHSNSHRKVLEMRRKKETGEVIDSGLTSMAS 850
           TFS E+HSN HRKV+E+RRKKE     + GL + A+
Sbjct: 290 TFSSELHSNHHRKVMEIRRKKEGPAGPEPGLMTTAA 325


>gb|EXC35403.1| GATA transcription factor 5 [Morus notabilis]
          Length = 393

 Score =  203 bits (517), Expect = 9e-50
 Identities = 118/265 (44%), Positives = 141/265 (53%), Gaps = 25/265 (9%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKSSQ------------NRNSQDSTFSGMESFGELPIPVDELENLEWL 235
           +E  +D +KDL S S +            N N   S F       EL +P +ELENLEWL
Sbjct: 112 EEQDQDGDKDLSSPSQEQNQPAEEEAINDNNNPSTSLFVSSVPTTELTLPAEELENLEWL 171

Query: 236 SQFVDDTTSEFSLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPA 415
           S FV+++ SEFS     G   +K    +    EP       PCF+ P+P K R+KR R  
Sbjct: 172 SHFVEESFSEFSTSYLAGVSAEKPPEDETFLPEPKRFAPEKPCFTTPIPAKARSKRPRTG 231

Query: 416 GSTWSFXXXXXXXXXXXXXXXXXXXXXXL---FMNPVQDGDLFCSVEKPALKKPKKNSLA 586
           G  WS                            +      +  CSV+KPA KK KK    
Sbjct: 232 GRVWSLGSPSFIESSSSSTTSSSSSSSPTSPWLIYATHSHEPACSVQKPAPKKAKKRQAV 291

Query: 587 EP---GSG-------RRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPAC 736
           E    GSG       RRC+HC VQKTPQWR GPLG K+LCNACGVR+KSGRL PEYRPAC
Sbjct: 292 ESFGSGSGPASAQPPRRCSHCGVQKTPQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPAC 351

Query: 737 SPTFSQEVHSNSHRKVLEMRRKKET 811
           SPTFS ++HSN HRKVLEMRRKKE+
Sbjct: 352 SPTFSSDLHSNHHRKVLEMRRKKES 376


>ref|XP_004290341.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp.
           vesca]
          Length = 353

 Score =  203 bits (516), Expect = 1e-49
 Identities = 124/279 (44%), Positives = 161/279 (57%), Gaps = 28/279 (10%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKSSQNRNSQDSTFSGMESF--GELPIPVDELENLEWLSQFVDDTTSE 265
           +E  E++E+D  S S  +  + +S++   ES    +L +P D++  LEW+S FVDD+ SE
Sbjct: 76  EEEEEEEEEDKDSVSVDSVENSNSSYFTTESTLASQLAVPDDDIAELEWVSHFVDDSASE 135

Query: 266 FSLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFS-----LP--VPVKPRTKRSRPAG-- 418
            SLL P    K +      +RSEP  +++ +         LP  VPVKPR+KR RPA   
Sbjct: 136 LSLLHPVSKLKPE--ALTLNRSEPEARRLALAHDQSTLSWLPSQVPVKPRSKRFRPASRL 193

Query: 419 --STWS---------FXXXXXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKK 565
             S W+                                +  NPV    +F    +PA KK
Sbjct: 194 RSSVWNPLGDSPSLTSSLPSPSSTSSCSSGMSFSTPCLVLTNPVHKVGVFWG--EPAAKK 251

Query: 566 PKKNSLAEPG------SGRRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYR 727
            K+    + G      + RRC+HCQVQKTPQWR GPLGPK+LCNACGVRYKSGRLFPEYR
Sbjct: 252 QKRKPAVQTGDEVVVGTQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPEYR 311

Query: 728 PACSPTFSQEVHSNSHRKVLEMRRKKETGEVIDSGLTSM 844
           PACSPTFS +VHSNSHRKVLEMRR+K+TGE  +SG++ M
Sbjct: 312 PACSPTFSGDVHSNSHRKVLEMRRRKDTGEP-ESGMSKM 349


>gb|ESW03754.1| hypothetical protein PHAVU_011G039600g [Phaseolus vulgaris]
           gi|561004761|gb|ESW03755.1| hypothetical protein
           PHAVU_011G039600g [Phaseolus vulgaris]
          Length = 303

 Score =  202 bits (514), Expect = 2e-49
 Identities = 126/267 (47%), Positives = 159/267 (59%), Gaps = 13/267 (4%)
 Frame = +2

Query: 95  ELHEDDEKDLFSKSSQNR-----NSQDSTFSGMESF--GELPIPVDELENLEWLSQFVDD 253
           E  ED+EKD  S S Q+R     NS  +   G +S   GEL +P D++ +LEW+S FVDD
Sbjct: 64  EEEEDEEKDSSSGSLQDRIEDDSNSNSAACGGGDSVFAGELSVPADDVADLEWVSHFVDD 123

Query: 254 TTSEFSLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSR-PAGSTWS 430
           +  E SLL P  + K ++      R+EP  +  R    S  VP + RT++SR P    WS
Sbjct: 124 SLPELSLLYPVPAAKTRV------RTEPEPRPGRAQTAST-VPKRLRTEKSRKPNARVWS 176

Query: 431 FXXXXXXXXXXXXXXXXXXXXXXLFMNPVQDGDLFCSVEKPALKKPKKNSLAEPGSG--R 604
           F                         +PV  G  F    +P  KK KK + A+ G+   R
Sbjct: 177 FTVSPP-----------------FLCSPVLAGVEF---GEPTAKKQKKKAEAQSGTQFQR 216

Query: 605 RCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKV 784
           RC+HCQVQKTPQWR GPLGPK+LCNACGVR+KSGRLFPEYRPA SPTFS E+HSNSHRKV
Sbjct: 217 RCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPAISPTFSGEIHSNSHRKV 276

Query: 785 LEMRRKKETGE-VIDSG--LTSMASNC 856
           LEMRR+KET E V+++G   T +  +C
Sbjct: 277 LEMRRRKETTEPVLETGSDRTQLVPSC 303


>ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Vitis vinifera]
          Length = 338

 Score =  200 bits (509), Expect = 7e-49
 Identities = 115/260 (44%), Positives = 143/260 (55%), Gaps = 21/260 (8%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKS-------SQNRNSQDSTFSGMESFGELP-----IPVDELENLEWL 235
           +E  ED++K   S S       + N N   +TFS  + F  +P     +P D+L +LEWL
Sbjct: 65  EEDEEDEDKGCGSLSPRGELTENDNSNLTTTTFSVKDEFPSVPATELTVPADDLADLEWL 124

Query: 236 SQFVDDTTSEFSLLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPA 415
           S FV+D+ SE+S   P G+  +K     E+  EP        C   P P K R+KR+R  
Sbjct: 125 SHFVEDSFSEYSAPFPHGTLTEKAQNQTENPPEPETPLQIKSCLKTPFPAKARSKRARTG 184

Query: 416 GSTWSFXXXXXXXXXXXXXXXXXXXXXX---LFMNPVQDGDLFCSVEKPALKKPKK---- 574
           G  WS                          ++ N  Q+ + F S  KP  KK KK    
Sbjct: 185 GRVWSMGSPSLTESSSSSSSSSSSSLSSPWLIYPNTCQNVESFHSAVKPPAKKHKKRLDP 244

Query: 575 --NSLAEPGSGRRCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTF 748
             +  A+P +  RC+HC VQKTPQWR GPLG K+LCNACGVRYKSGRL PEYRPACSPTF
Sbjct: 245 EASGSAQP-TPHRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTF 303

Query: 749 SQEVHSNSHRKVLEMRRKKE 808
           S E+HSN HRKVLEMRRKKE
Sbjct: 304 SSEIHSNHHRKVLEMRRKKE 323


>ref|XP_006280758.1| hypothetical protein CARUB_v10026725mg [Capsella rubella]
           gi|565433824|ref|XP_006280759.1| hypothetical protein
           CARUB_v10026725mg [Capsella rubella]
           gi|482549462|gb|EOA13656.1| hypothetical protein
           CARUB_v10026725mg [Capsella rubella]
           gi|482549463|gb|EOA13657.1| hypothetical protein
           CARUB_v10026725mg [Capsella rubella]
          Length = 342

 Score =  196 bits (499), Expect = 1e-47
 Identities = 114/263 (43%), Positives = 138/263 (52%), Gaps = 9/263 (3%)
 Frame = +2

Query: 92  QELHEDDEKDLFSKSSQNRNSQDSTFSGMESFGELPIPVDELENLEWLSQFVDDTTSEFS 271
           +E  E++E++L           D  FSG     EL +P D+L NLEWLS FV+D+ +E+S
Sbjct: 75  EEEEEEEEEELNDDGDALPRCID--FSGSLPTSELSVPADDLANLEWLSHFVEDSFTEYS 132

Query: 272 LLCPTGSYKDKIGGFQESRSEPVVQKMRVPCFSLPVPVKPRTKRSRPAGSTWSFXXXXXX 451
               TG+  +K       R  PV    +  CF  PVP K R+KR R     WS       
Sbjct: 133 GPNLTGTPTEKPAWLTGDRKHPVTPATQESCFKSPVPAKARSKRHRNGVKAWSLGSSSSS 192

Query: 452 XXXXXXXXXXXXXXXXLFMNPVQDGDLF----CSVEKPALKKPKKNSLAEPGSG-----R 604
                                    DLF     S   P  KK KK S      G     R
Sbjct: 193 GPSSSGSTSSSSSSSGPSSPWFSGADLFEPMVASERPPFPKKHKKRSAESAFCGQLQPQR 252

Query: 605 RCTHCQVQKTPQWRAGPLGPKSLCNACGVRYKSGRLFPEYRPACSPTFSQEVHSNSHRKV 784
           RC+HC VQKTPQWRAGP+G K+LCNACGVRYKSGRL PEYRPACSPTFS E+HSN HRKV
Sbjct: 253 RCSHCGVQKTPQWRAGPMGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKV 312

Query: 785 LEMRRKKETGEVIDSGLTSMASN 853
           +EMRRKKE     ++GL  +  +
Sbjct: 313 MEMRRKKEPTSDSETGLNQLVQS 335


Top