BLASTX nr result

ID: Catharanthus22_contig00007675 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007675
         (1580 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum]        226   3e-56
ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like ...   221   7e-55
ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like ...   219   3e-54
emb|CBI38005.3| unnamed protein product [Vitis vinifera]              201   6e-49
ref|XP_002274872.1| PREDICTED: GATA transcription factor 5-like ...   201   6e-49
ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Popu...   194   9e-47
ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Popu...   193   2e-46
gb|ADL36697.1| GATA domain class transcription factor [Malus dom...   193   2e-46
gb|EPS63160.1| hypothetical protein M569_11630, partial [Genlise...   190   2e-45
gb|ADL36693.1| GATA domain class transcription factor [Malus dom...   189   3e-45
ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like ...   188   6e-45
ref|XP_002512985.1| GATA transcription factor, putative [Ricinus...   185   4e-44
ref|XP_004290341.1| PREDICTED: GATA transcription factor 5-like ...   185   5e-44
gb|EOX97872.1| GATA transcription factor 5, putative [Theobroma ...   184   9e-44
ref|XP_006393827.1| hypothetical protein EUTSA_v10004566mg [Eutr...   181   1e-42
ref|XP_006423461.1| hypothetical protein CICLE_v10028860mg [Citr...   180   1e-42
ref|XP_006487363.1| PREDICTED: GATA transcription factor 5-like ...   178   7e-42
ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citr...   177   9e-42
ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citr...   177   9e-42
gb|ESW07921.1| hypothetical protein PHAVU_009G003800g [Phaseolus...   174   7e-41

>dbj|BAC98494.1| AG-motif binding protein-4 [Nicotiana tabacum]
          Length = 326

 Score =  226 bits (575), Expect = 3e-56
 Identities = 136/257 (52%), Positives = 154/257 (59%), Gaps = 7/257 (2%)
 Frame = -2

Query: 1075 EDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSEL 896
            ED+EK S               S  D F    + EL VP D+LENLEWLSQFVDDS SE 
Sbjct: 68   EDDEKDSFSGSSQHRNSQVSNFSCMDSF----SGELPVPVDELENLEWLSQFVDDSTSEF 123

Query: 895  SMLCPAGSFKDHKTVVNLPANRSEPGV----QKLIGPSFPLPVSRKMRSKRARPNGGRVW 728
            S+LCPAGSFKD         +RSEP V    QKL  P FPLPV +K R+ R+RP  GR W
Sbjct: 124  SLLCPAGSFKDKTG--GFQVSRSEPVVRPVVQKLKVPCFPLPVVQKPRTYRSRP-AGRKW 180

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSPF---VWANPAQDFELFSTVEXXXXXXXXXXXXP 557
                                      PF   +++NP  D +LF +VE             
Sbjct: 181  SFSSPTVSADSCSPTSSSYGSS----PFPSVLFSNPVLDGDLFCSVEKPPLKKPKKLS-- 234

Query: 556  TADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTF 377
            TA+T +G    RRC+HCQVQKTPQWR GP+GPKTLCNACGVRYKSGRLFPEYRPACSPTF
Sbjct: 235  TAETGSG----RRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTF 290

Query: 376  SQEIHSNSHRKVLEMRR 326
            SQE+HSNSHRKVLEMRR
Sbjct: 291  SQEVHSNSHRKVLEMRR 307


>ref|XP_006362004.1| PREDICTED: GATA transcription factor 5-like [Solanum tuberosum]
          Length = 325

 Score =  221 bits (563), Expect = 7e-55
 Identities = 130/254 (51%), Positives = 153/254 (60%), Gaps = 4/254 (1%)
 Frame = -2

Query: 1075 EDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSEL 896
            ED+EK+S               S  + FGSL A EL +P D++ENLEWLSQFVDD+ SE 
Sbjct: 64   EDDEKTSFSGSSQNRNSQDSTFSGMESFGSL-AGELPIPVDEMENLEWLSQFVDDTPSEF 122

Query: 895  SMLCPAGSFKDHKTVVNLPANRSEPGVQKLIG----PSFPLPVSRKMRSKRARPNGGRVW 728
            S+LCPA SFKD     +    RSEP V+ ++     P FPLP   K RSKR+RP  GR W
Sbjct: 123  SLLCPAESFKDKTG--DFTEFRSEPVVRPVVKKMRVPCFPLPFPVKPRSKRSRP-AGRTW 179

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTAD 548
                                     S F + NP  D +LF +VE             +A+
Sbjct: 180  SFPSSTVSGDSSSPTSSSYGSSPFPSGF-FTNPVYDGDLFCSVEKPPLKKPKKNP--SAE 236

Query: 547  TTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQE 368
            T +G    RRC+HCQVQKTPQWR GP+GPKTLCNACGVRYKSGRL+PEYRPACSPTFS E
Sbjct: 237  TGSG----RRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLYPEYRPACSPTFSLE 292

Query: 367  IHSNSHRKVLEMRR 326
            +HSNSHRKVLEMRR
Sbjct: 293  VHSNSHRKVLEMRR 306


>ref|XP_004230938.1| PREDICTED: GATA transcription factor 5-like [Solanum lycopersicum]
          Length = 325

 Score =  219 bits (558), Expect = 3e-54
 Identities = 129/254 (50%), Positives = 150/254 (59%), Gaps = 4/254 (1%)
 Frame = -2

Query: 1075 EDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSEL 896
            ED+EK+S               S  + FGSL A EL +P DD+ENLEWLSQFVDD+ SE 
Sbjct: 64   EDDEKTSFSGSSQKRNSQDSTFSGMESFGSL-AGELPIPVDDMENLEWLSQFVDDTPSEF 122

Query: 895  SMLCPAGSFKDHKTVVNLPANRSEPGVQKLIG----PSFPLPVSRKMRSKRARPNGGRVW 728
            S+LCP  SFKD         +RSEP V+ ++     P FPLP   K RSKR+R   GR W
Sbjct: 123  SLLCPTESFKDKTG--GFTESRSEPVVRPVVKKTRVPCFPLPFPVKPRSKRSR-QAGRTW 179

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTAD 548
                                     S F + NP  D +LF +VE             + +
Sbjct: 180  SFPSSAVSGDSSSPTSSSYGSSPFPSGF-FTNPVYDGDLFCSVEKPPLKKPKKNP--SVE 236

Query: 547  TTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQE 368
            T +G    RRC+HCQVQKTPQWR GP+GPKTLCNACGVRYKSGRLFPEYRPACSPTFS E
Sbjct: 237  TGSG----RRCTHCQVQKTPQWRAGPLGPKTLCNACGVRYKSGRLFPEYRPACSPTFSLE 292

Query: 367  IHSNSHRKVLEMRR 326
            +HSNSHRKVLEMRR
Sbjct: 293  VHSNSHRKVLEMRR 306


>emb|CBI38005.3| unnamed protein product [Vitis vinifera]
          Length = 352

 Score =  201 bits (512), Expect = 6e-49
 Identities = 119/229 (51%), Positives = 133/229 (58%), Gaps = 3/229 (1%)
 Frame = -2

Query: 1003 TDDFGSLLAAELVVPADDLENLEWLSQFVDDSQ-SELSMLCPAGSFKDHKTVVNLPANR- 830
            T DF SL A  L VPADDLE+LEWLS FVDDS  SELS+LCPA          N P+ R 
Sbjct: 133  TGDFESLSAGGLAVPADDLEHLEWLSHFVDDSSASELSLLCPA-------VTGNSPSKRC 185

Query: 829  -SEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXX 653
              EP    L  P FP P+  K RSKR R + GR W                         
Sbjct: 186  EEEPRPALLRTPLFPTPLPAKPRSKRHR-SSGRAWAFGSHSPSSSPSSSSSSSSTSC--- 241

Query: 652  SPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTG 473
               ++AN   + E F ++E                 +  SQ  RRCSHC VQKTPQWRTG
Sbjct: 242  --LIFANTVHNMESFYSLEKPPAKKPKK------SPSADSQPQRRCSHCLVQKTPQWRTG 293

Query: 472  PMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
            P+GPKTLCNACGVR+KSGRLFPEYRPACSPTFS EIHSNSHRKVLE+RR
Sbjct: 294  PLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSVEIHSNSHRKVLEIRR 342


>ref|XP_002274872.1| PREDICTED: GATA transcription factor 5-like [Vitis vinifera]
          Length = 317

 Score =  201 bits (512), Expect = 6e-49
 Identities = 119/229 (51%), Positives = 133/229 (58%), Gaps = 3/229 (1%)
 Frame = -2

Query: 1003 TDDFGSLLAAELVVPADDLENLEWLSQFVDDSQ-SELSMLCPAGSFKDHKTVVNLPANR- 830
            T DF SL A  L VPADDLE+LEWLS FVDDS  SELS+LCPA          N P+ R 
Sbjct: 98   TGDFESLSAGGLAVPADDLEHLEWLSHFVDDSSASELSLLCPA-------VTGNSPSKRC 150

Query: 829  -SEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXX 653
              EP    L  P FP P+  K RSKR R + GR W                         
Sbjct: 151  EEEPRPALLRTPLFPTPLPAKPRSKRHR-SSGRAWAFGSHSPSSSPSSSSSSSSTSC--- 206

Query: 652  SPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTG 473
               ++AN   + E F ++E                 +  SQ  RRCSHC VQKTPQWRTG
Sbjct: 207  --LIFANTVHNMESFYSLEKPPAKKPKK------SPSADSQPQRRCSHCLVQKTPQWRTG 258

Query: 472  PMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
            P+GPKTLCNACGVR+KSGRLFPEYRPACSPTFS EIHSNSHRKVLE+RR
Sbjct: 259  PLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSVEIHSNSHRKVLEIRR 307


>ref|XP_002313763.2| hypothetical protein POPTR_0009s12620g [Populus trichocarpa]
            gi|550331601|gb|EEE87718.2| hypothetical protein
            POPTR_0009s12620g [Populus trichocarpa]
          Length = 329

 Score =  194 bits (493), Expect = 9e-47
 Identities = 117/257 (45%), Positives = 139/257 (54%), Gaps = 6/257 (2%)
 Frame = -2

Query: 1078 QEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSE 899
            +++EEK S+ V            + +    S LA+EL VP DD+  LEW+S FVDDS S+
Sbjct: 67   EQEEEKDSISVSSQDRVDDDFNSNSSSFSDSFLASELAVPTDDIAELEWVSHFVDDSVSD 126

Query: 898  LSMLCPA--GSFKDHKTVVNLPANRSEPGVQKLIGPS---FPLPVSRKMRSKRARPNGGR 734
            +S+L PA  GS K H        NR EP  +     +   FP  V  K R+KR+RP G R
Sbjct: 127  VSLLVPACKGSSKRHAK------NRFEPETKPTFAKTSCLFPSRVPSKARTKRSRPTG-R 179

Query: 733  VWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTV-EXXXXXXXXXXXXP 557
             W                            V  N  Q  +  S + E             
Sbjct: 180  TWSAGSNQSETPSSSTSSTSSMPC-----LVATNTVQTADSLSWLSEQPMKISKKRPAVH 234

Query: 556  TADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTF 377
            T+     +Q  RRCSHCQVQKTPQWRTGP+G KTLCNACGVRYKSGRLFPEYRPACSPTF
Sbjct: 235  TSGLMASTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTF 294

Query: 376  SQEIHSNSHRKVLEMRR 326
            S E+HSNSHRKVLEMRR
Sbjct: 295  SSEVHSNSHRKVLEMRR 311


>ref|XP_002305457.2| hypothetical protein POPTR_0004s16860g [Populus trichocarpa]
            gi|550341195|gb|EEE85968.2| hypothetical protein
            POPTR_0004s16860g [Populus trichocarpa]
          Length = 327

 Score =  193 bits (491), Expect = 2e-46
 Identities = 117/255 (45%), Positives = 136/255 (53%), Gaps = 1/255 (0%)
 Frame = -2

Query: 1087 GCFQEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDS 908
            G  QE+EEK SL V            + +    S L++EL VP DD+  LEW+S FV+DS
Sbjct: 62   GYAQEEEEKDSLSVSSQDRVDDDFNSNSSSFSDSFLSSELAVPTDDIAELEWVSHFVNDS 121

Query: 907  QSELSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVW 728
             S++S+L PA   K      N      +P + K  G  FP  V  K R+KR+R  G R W
Sbjct: 122  LSDVSLLVPACKGKPESHAKNRFEPEPKPSLAKTPG-FFPPRVPSKARTKRSRRTG-RTW 179

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTV-EXXXXXXXXXXXXPTA 551
                                        V AN  Q  +  S + E             T+
Sbjct: 180  SGRSNQTETPSSSASSTSSMPC-----LVSANTVQTIDSLSWLSEPPMKKPKKRPAVQTS 234

Query: 550  DTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQ 371
              T   Q  RRCSHCQVQKTPQWRTGP G KTLCNACGVRYKSGRLFPEYRPACSPTFS 
Sbjct: 235  GITAAPQFQRRCSHCQVQKTPQWRTGPHGAKTLCNACGVRYKSGRLFPEYRPACSPTFSS 294

Query: 370  EIHSNSHRKVLEMRR 326
            E+HSNSHRKVLEMRR
Sbjct: 295  EVHSNSHRKVLEMRR 309


>gb|ADL36697.1| GATA domain class transcription factor [Malus domestica]
          Length = 321

 Score =  193 bits (490), Expect = 2e-46
 Identities = 114/251 (45%), Positives = 138/251 (54%)
 Frame = -2

Query: 1078 QEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSE 899
            +E+EEK S+ V             ++D   S LA +L+VP DDL  LEW+S FVDDS  +
Sbjct: 65   EEEEEKESVSVDDEISNSSSLVLPDSD---SGLATQLLVPDDDLAELEWVSHFVDDSLPD 121

Query: 898  LSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXX 719
            LS+    G+ K    ++N      EP    L  P FP  V  K R+KR +P   RVW   
Sbjct: 122  LSLFHTIGTQKPEALLMN--RFEPEPKPVPLRAPLFPFQVPVKPRTKRYKP-ASRVWSSS 178

Query: 718  XXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTADTTT 539
                                   P +  NP Q  ++F   E             T + + 
Sbjct: 179  SSCSPSSSPCSSGFSFST-----PCLIFNPVQSMDVF-VGEPAAKKQKKKPAVQTGEGSI 232

Query: 538  GSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHS 359
            G Q  RRCSHCQVQKTPQWRTGP+GPKTLCNACGVR+KSGRLFPEYRPACSPTFS  +HS
Sbjct: 233  GGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGAVHS 292

Query: 358  NSHRKVLEMRR 326
            NSHRKVLEMR+
Sbjct: 293  NSHRKVLEMRK 303


>gb|EPS63160.1| hypothetical protein M569_11630, partial [Genlisea aurea]
          Length = 312

 Score =  190 bits (482), Expect = 2e-45
 Identities = 120/260 (46%), Positives = 145/260 (55%), Gaps = 6/260 (2%)
 Frame = -2

Query: 1087 GCFQEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPAD-DLENLEWLSQFVDD 911
            G F+ED++ S+               S  DDF SL +  L VP + DL+NLEWLSQF DD
Sbjct: 65   GLFKEDQDSSN----KGGSNNSSSTFSGADDFDSLSSGNLHVPVEEDLDNLEWLSQFADD 120

Query: 910  SQSELSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRV 731
            S +  + L P G+F    +V      +SE  V +      P PV RK RSKR R NG + 
Sbjct: 121  STAAGASLFPIGNFPSRASV------KSEAAVDERAF-IIPPPVPRKSRSKRERSNG-QS 172

Query: 730  WXXXXXXXXXXXXXXXXXXXXXXXXXSP---FVWANPA--QDFELFSTVEXXXXXXXXXX 566
            W                          P   F+ A PA  Q+ + FSTVE          
Sbjct: 173  WSLTSPQLSSVDSSTASSSSYTSTPPLPILLFLNAAPAAAQEPDWFSTVEKPPAKKPKRK 232

Query: 565  XXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACS 386
                 +  +G  + RRC+HCQVQKTPQWRTGP+GPKTLCNACGVR+KSGRLFPEYRPACS
Sbjct: 233  P----EPESGGLSGRRCTHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACS 288

Query: 385  PTFSQEIHSNSHRKVLEMRR 326
            PTFS ++HSNSHRKVLEMRR
Sbjct: 289  PTFSHDVHSNSHRKVLEMRR 308


>gb|ADL36693.1| GATA domain class transcription factor [Malus domestica]
          Length = 323

 Score =  189 bits (480), Expect = 3e-45
 Identities = 114/251 (45%), Positives = 137/251 (54%)
 Frame = -2

Query: 1078 QEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSE 899
            +E EE+ S+ V            +++D   S LA +LVVP DDL  LEW+S FVDDS  +
Sbjct: 65   EEGEERDSVSVDDETSNSSNSVLADSD---SGLATQLVVPDDDLAELEWVSHFVDDSLPD 121

Query: 898  LSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXX 719
            LS+L   G  K    + N   + SEP   +L    FP  V  K R+KR R    R W   
Sbjct: 122  LSLLHTIGVQKPEALLAN--RSESEPKPAQLRASLFPFEVPVKPRTKRCRL-ASRDWSLS 178

Query: 718  XXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFSTVEXXXXXXXXXXXXPTADTTT 539
                                   P +  NP Q   +F   E             T + + 
Sbjct: 179  SSSSPSSPSSSSGSGLSFST---PCLIFNPVQSMHVF-VGEPAAKKQKKKPAVQTGEGSI 234

Query: 538  GSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHS 359
            G Q  RRCSHCQVQKTPQWRTGP+GPKTLCNACGVR+KSGRLFPEYRPACSPTFS ++HS
Sbjct: 235  GGQFQRRCSHCQVQKTPQWRTGPLGPKTLCNACGVRFKSGRLFPEYRPACSPTFSGDVHS 294

Query: 358  NSHRKVLEMRR 326
            NSHRKVLEMR+
Sbjct: 295  NSHRKVLEMRK 305


>ref|XP_004287842.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp.
            vesca]
          Length = 333

 Score =  188 bits (477), Expect = 6e-45
 Identities = 123/261 (47%), Positives = 139/261 (53%), Gaps = 10/261 (3%)
 Frame = -2

Query: 1078 QEDEEKSSLL------VXXXXXXXXXXXXSETDDFGSLLA---AELVVPADDLENLEWLS 926
            QED++K S+L      V            SE ++ G   A   +EL VPADDLENLEWLS
Sbjct: 61   QEDDKKDSVLPKKESTVEEKENSTPSSCVSEKNELGPEPAEPTSELTVPADDLENLEWLS 120

Query: 925  QFVDDSQSELSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARP 746
             FV+DS S  +   PAG       +   P  R EP   K   P F  PV  K RSKR R 
Sbjct: 121  HFVEDSFSGFNASLPAGF------MAVKPEKRPEPEALK---PCFKTPVPAKARSKRTR- 170

Query: 745  NGGRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELF-STVEXXXXXXXXX 569
             GGRVW                         SP++  NP Q    F S+VE         
Sbjct: 171  TGGRVWSLGSPSFTETSSSSSSSSSTSSCPSSPWLIYNPTQGLGGFGSSVEKPQKKPKRP 230

Query: 568  XXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPAC 389
                T +    SQ  RRCSHC VQKTPQWRTGP G KTLCNACGVRYKSGRL PEYRPAC
Sbjct: 231  A---TTEGGGSSQPPRRCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLVPEYRPAC 287

Query: 388  SPTFSQEIHSNSHRKVLEMRR 326
            SPTFS E+HSN HRKV+E+RR
Sbjct: 288  SPTFSSELHSNHHRKVMEIRR 308


>ref|XP_002512985.1| GATA transcription factor, putative [Ricinus communis]
            gi|223547996|gb|EEF49488.1| GATA transcription factor,
            putative [Ricinus communis]
          Length = 368

 Score =  185 bits (470), Expect = 4e-44
 Identities = 119/259 (45%), Positives = 137/259 (52%), Gaps = 8/259 (3%)
 Frame = -2

Query: 1078 QEDEEKSSLLVXXXXXXXXXXXXSETDDF--GSLLAAELVVPADDLENLEWLSQFVDDSQ 905
            +E+EEK SL V            +        S L +EL VP +DL  LEW+SQFVDDS 
Sbjct: 99   EEEEEKDSLSVSSQDRSGVDDDNNSNSSTFDESFLTSELAVPIEDLAELEWVSQFVDDSS 158

Query: 904  SELSMLCPAGSFKDHKTVVNLPANRSEPGVQK---LIGPS--FPLPVSRKMRSKRARPNG 740
             E S+L P  S +DH T      NR +P   K   L  PS  FP+ +  K RSKR RP G
Sbjct: 159  PEFSLLYPLNS-EDHHT-----RNRFQPEHPKPVALTKPSCLFPVKIPAKPRSKRTRPTG 212

Query: 739  GRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFE-LFSTVEXXXXXXXXXXX 563
             R W                         +        Q  + L S  E           
Sbjct: 213  -RTWSVESLLTDSSSSSSSYCSSSPISSSASTPCFVTVQTIDSLPSFCEPPAKKAKRKPA 271

Query: 562  XPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSP 383
              T   T  +Q  RRCSHCQVQKTPQWRTGP+G KTLCNACGVRYKSGRLFPEYRPACSP
Sbjct: 272  AQTGGATGLTQFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSP 331

Query: 382  TFSQEIHSNSHRKVLEMRR 326
            TFS +IHSNSHRKVLE+R+
Sbjct: 332  TFSGDIHSNSHRKVLEIRK 350


>ref|XP_004290341.1| PREDICTED: GATA transcription factor 5-like [Fragaria vesca subsp.
            vesca]
          Length = 353

 Score =  185 bits (469), Expect = 5e-44
 Identities = 117/266 (43%), Positives = 135/266 (50%), Gaps = 15/266 (5%)
 Frame = -2

Query: 1078 QEDEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSE 899
            +E+E+K S+ V            +E     S LA++L VP DD+  LEW+S FVDDS SE
Sbjct: 81   EEEEDKDSVSVDSVENSNSSYFTTE-----STLASQLAVPDDDIAELEWVSHFVDDSASE 135

Query: 898  LSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPS-------FPLPVSRKMRSKRARPNG 740
            LS+L P    K       L  NRSEP  ++L            P  V  K RSKR RP  
Sbjct: 136  LSLLHPVSKLKPEA----LTLNRSEPEARRLALAHDQSTLSWLPSQVPVKPRSKRFRPAS 191

Query: 739  ---GRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPF-----VWANPAQDFELFSTVEXXXX 584
                 VW                           F     V  NP     +F        
Sbjct: 192  RLRSSVWNPLGDSPSLTSSLPSPSSTSSCSSGMSFSTPCLVLTNPVHKVGVFWGEPAAKK 251

Query: 583  XXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPE 404
                       +   G+Q  RRCSHCQVQKTPQWRTGP+GPKTLCNACGVRYKSGRLFPE
Sbjct: 252  QKRKPAVQTGDEVVVGTQ--RRCSHCQVQKTPQWRTGPLGPKTLCNACGVRYKSGRLFPE 309

Query: 403  YRPACSPTFSQEIHSNSHRKVLEMRR 326
            YRPACSPTFS ++HSNSHRKVLEMRR
Sbjct: 310  YRPACSPTFSGDVHSNSHRKVLEMRR 335


>gb|EOX97872.1| GATA transcription factor 5, putative [Theobroma cacao]
          Length = 322

 Score =  184 bits (467), Expect = 9e-44
 Identities = 114/257 (44%), Positives = 137/257 (53%), Gaps = 3/257 (1%)
 Frame = -2

Query: 1087 GCFQEDEEKSSLLVXXXXXXXXXXXXSETDDFG--SLLAAELVVPADDLENLEWLSQFVD 914
            G F+E+E+K S  V            S +  F   SLL  EL VP D++  LEW+S FVD
Sbjct: 54   GEFEEEEQKDSFSVSSEERVADDDSNSNSSSFSFDSLLTNELSVPDDEIAGLEWVSHFVD 113

Query: 913  DSQSELSMLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPSFPLPVSRKMRSKRARPNGGR 734
            DS  EL +LCP   FK            +EP +  +  PSF   V  K RSKRA+  G R
Sbjct: 114  DSFPELPILCPV--FKPQSDGHAKTLFETEPELVFMKTPSFSSTVPSKARSKRAKSTG-R 170

Query: 733  VWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVWANPAQDFELFST-VEXXXXXXXXXXXXP 557
             W                            V +   Q+ +L +   E             
Sbjct: 171  TWSVGSMPLSESSSSTITSSSTSSGFS---VTSANVQETDLANDFTEPPTKKQKKKPAVQ 227

Query: 556  TADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPACSPTF 377
             +  ++G+   RRCSHCQVQKTPQWRTGP+G KTLCNACGVRYKSGRLFPEYRPACSPTF
Sbjct: 228  ASGLSSGNPFQRRCSHCQVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPACSPTF 287

Query: 376  SQEIHSNSHRKVLEMRR 326
            S +IHSNSHRKVLEMR+
Sbjct: 288  SGDIHSNSHRKVLEMRK 304


>ref|XP_006393827.1| hypothetical protein EUTSA_v10004566mg [Eutrema salsugineum]
           gi|78499690|gb|ABB45844.1| hypothetical protein [Eutrema
           halophilum] gi|557090466|gb|ESQ31113.1| hypothetical
           protein EUTSA_v10004566mg [Eutrema salsugineum]
          Length = 332

 Score =  181 bits (458), Expect = 1e-42
 Identities = 111/226 (49%), Positives = 125/226 (55%), Gaps = 2/226 (0%)
 Frame = -2

Query: 997 DFGSLLAAELVVPADDLENLEWLSQFVDDSQSELSMLCPAGSFKDHKTVVNLPANRSEPG 818
           DFGSL  +EL VPAD+L NLEWLS FVDDS  E S     G+         L  +R  P 
Sbjct: 90  DFGSLPLSELSVPADELANLEWLSHFVDDSFMEYSAPNLTGTSTKPAW---LTGDRKHPV 146

Query: 817 VQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVW 638
                   F  PV  K RSKR R NGG+VW                         SP  W
Sbjct: 147 TPATEESCFNSPVPAKARSKRNR-NGGKVWSLGSSSSSGPSSSSSTSSSSSSGPSSP--W 203

Query: 637 ANPAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTL--RRCSHCQVQKTPQWRTGPMG 464
            + A+  E F+T E             +A++    Q L  RRCSHC +QKTPQWR GPMG
Sbjct: 204 FSGAELPEPFATSEKPPVPKKHKKR--SAESVYSGQPLQQRRCSHCGIQKTPQWRAGPMG 261

Query: 463 PKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
            KTLCNACGVRYKSGRL PEYRPACSPTFS E+HSN HRKV+EMRR
Sbjct: 262 AKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKVMEMRR 307


>ref|XP_006423461.1| hypothetical protein CICLE_v10028860mg [Citrus clementina]
           gi|557525395|gb|ESR36701.1| hypothetical protein
           CICLE_v10028860mg [Citrus clementina]
          Length = 315

 Score =  180 bits (457), Expect = 1e-42
 Identities = 105/222 (47%), Positives = 119/222 (53%), Gaps = 1/222 (0%)
 Frame = -2

Query: 988 SLLAAELVVPADDLENLEWLSQFVDDSQ-SELSMLCPAGSFKDHKTVVNLPANRSEPGVQ 812
           SLL  E V P DD   LEW+SQFVDDS  SELS+L P    +        P +       
Sbjct: 87  SLLTNEFVEPVDDFAELEWVSQFVDDSSCSELSLLYPNYVERTRSEPNGKPVSNKTSTNP 146

Query: 811 KLIGPSFPLPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPFVWAN 632
               P FPL V  K R+KR R   GR W                            ++ +
Sbjct: 147 TTTSPCFPLRVPSKARTKRTR-RSGRAWSSGSPLSTESTISSSSSTSC-------LIFTD 198

Query: 631 PAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTL 452
             Q+ E FS  +              A  + G    RRCSHCQ QKTPQWRTGP+GPKTL
Sbjct: 199 SVQNIEWFSGFDEPVVKKPKKKP---AVQSGGGLFQRRCSHCQTQKTPQWRTGPLGPKTL 255

Query: 451 CNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
           CNACGVRYKSGRLFPEYRPACSPTFS ++HSNSHRKVLEMRR
Sbjct: 256 CNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRR 297


>ref|XP_006487363.1| PREDICTED: GATA transcription factor 5-like [Citrus sinensis]
          Length = 316

 Score =  178 bits (451), Expect = 7e-42
 Identities = 108/227 (47%), Positives = 124/227 (54%), Gaps = 6/227 (2%)
 Frame = -2

Query: 988 SLLAAELVVPADDLENLEWLSQFVDDSQ-SELSMLCP-----AGSFKDHKTVVNLPANRS 827
           SLL  E V P DD   LEW+SQFVDDS  SELS+L P       S  D K V    +N++
Sbjct: 87  SLLTNEFVEPVDDFAELEWVSQFVDDSSCSELSLLYPNYVERTRSEPDGKPV----SNKT 142

Query: 826 EPGVQKLIGPSFPLPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXXSP 647
                    P FPL V  K R+KR R +G   W                           
Sbjct: 143 STNPTTTTSPCFPLRVPSKARTKRTRRSGW-AWSSGSPLSTESTISSSSSTSC------- 194

Query: 646 FVWANPAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPM 467
            ++ +  Q+ E FS  +              A  + G    RRCSHCQ QKTPQWRTGP+
Sbjct: 195 LIFTDSVQNIEWFSGFDEPVAKKLKKKP---AVQSGGGLFQRRCSHCQTQKTPQWRTGPL 251

Query: 466 GPKTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
           GPKTLCNACGVRYKSGRLFPEYRPACSPTFS ++HSNSHRKVLEMRR
Sbjct: 252 GPKTLCNACGVRYKSGRLFPEYRPACSPTFSVDMHSNSHRKVLEMRR 298


>ref|XP_006425559.1| hypothetical protein CICLE_v10025844mg [Citrus clementina]
            gi|557527549|gb|ESR38799.1| hypothetical protein
            CICLE_v10025844mg [Citrus clementina]
          Length = 340

 Score =  177 bits (450), Expect = 9e-42
 Identities = 107/261 (40%), Positives = 127/261 (48%), Gaps = 12/261 (4%)
 Frame = -2

Query: 1072 DEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSELS 893
            +E+K   L                DD G +  +EL VP DD+ NLEWLS FV+DS +E S
Sbjct: 71   EEQKKHTLTVCSKQDQDLDERLNFDDLGPIPTSELAVPTDDVANLEWLSHFVEDSFAEYS 130

Query: 892  MLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPS-----FPLPVSRKMRSKRARPNGGRVW 728
               PAG+         LP    E G +    P+     F  P+  K RSKR+R  G R+W
Sbjct: 131  SPFPAGT---------LPVKAKENGAEPEHKPALAIHCFKTPIPAKARSKRSR-TGLRIW 180

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSP-------FVWANPAQDFELFSTVEXXXXXXXXX 569
                                      P            PA+ F +    +         
Sbjct: 181  SLGSPSLSDSSSTSSASSSSSPSSPWPVSTNPGSLASLRPAEPF-IVKPPKKKLKKKSPP 239

Query: 568  XXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPAC 389
                     +  Q  RRCSHC VQKTPQWRTGP+G KTLCNACGVRYKSGRLFPEYRPAC
Sbjct: 240  EGYNAGGNISWGQFTRRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPAC 299

Query: 388  SPTFSQEIHSNSHRKVLEMRR 326
            SPTFS E+HSN HRKV+EMRR
Sbjct: 300  SPTFSSELHSNHHRKVMEMRR 320


>ref|XP_006425558.1| hypothetical protein CICLE_v10025844mg [Citrus clementina]
            gi|568825030|ref|XP_006466892.1| PREDICTED: GATA
            transcription factor 5-like [Citrus sinensis]
            gi|557527548|gb|ESR38798.1| hypothetical protein
            CICLE_v10025844mg [Citrus clementina]
          Length = 381

 Score =  177 bits (450), Expect = 9e-42
 Identities = 107/261 (40%), Positives = 127/261 (48%), Gaps = 12/261 (4%)
 Frame = -2

Query: 1072 DEEKSSLLVXXXXXXXXXXXXSETDDFGSLLAAELVVPADDLENLEWLSQFVDDSQSELS 893
            +E+K   L                DD G +  +EL VP DD+ NLEWLS FV+DS +E S
Sbjct: 112  EEQKKHTLTVCSKQDQDLDERLNFDDLGPIPTSELAVPTDDVANLEWLSHFVEDSFAEYS 171

Query: 892  MLCPAGSFKDHKTVVNLPANRSEPGVQKLIGPS-----FPLPVSRKMRSKRARPNGGRVW 728
               PAG+         LP    E G +    P+     F  P+  K RSKR+R  G R+W
Sbjct: 172  SPFPAGT---------LPVKAKENGAEPEHKPALAIHCFKTPIPAKARSKRSR-TGLRIW 221

Query: 727  XXXXXXXXXXXXXXXXXXXXXXXXXSP-------FVWANPAQDFELFSTVEXXXXXXXXX 569
                                      P            PA+ F +    +         
Sbjct: 222  SLGSPSLSDSSSTSSASSSSSPSSPWPVSTNPGSLASLRPAEPF-IVKPPKKKLKKKSPP 280

Query: 568  XXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGPKTLCNACGVRYKSGRLFPEYRPAC 389
                     +  Q  RRCSHC VQKTPQWRTGP+G KTLCNACGVRYKSGRLFPEYRPAC
Sbjct: 281  EGYNAGGNISWGQFTRRCSHCGVQKTPQWRTGPLGAKTLCNACGVRYKSGRLFPEYRPAC 340

Query: 388  SPTFSQEIHSNSHRKVLEMRR 326
            SPTFS E+HSN HRKV+EMRR
Sbjct: 341  SPTFSSELHSNHHRKVMEMRR 361


>gb|ESW07921.1| hypothetical protein PHAVU_009G003800g [Phaseolus vulgaris]
          Length = 300

 Score =  174 bits (442), Expect = 7e-41
 Identities = 103/225 (45%), Positives = 125/225 (55%), Gaps = 2/225 (0%)
 Frame = -2

Query: 994 FGSLLAAELVVPADDLENLEWLSQFVDDSQSELSMLCPAGSFKDHKTVVNLPANRSEPGV 815
           + SL +AEL VPA DLE+LEW+S FVDDS  ELS+L P  S + ++ V        EP  
Sbjct: 91  YDSLFSAELAVPAGDLEDLEWVSHFVDDSLPELSLLYPVRSEEVNRRV------EPEPSA 144

Query: 814 QKLIGPSFP--LPVSRKMRSKRARPNGGRVWXXXXXXXXXXXXXXXXXXXXXXXXXSPFV 641
           +K   P FP  + ++ K R+ R R    RVW                            +
Sbjct: 145 KKT--PRFPCEMKITTKARTVRNRKPNARVWSLGP------------------------L 178

Query: 640 WANPAQDFELFSTVEXXXXXXXXXXXXPTADTTTGSQTLRRCSHCQVQKTPQWRTGPMGP 461
            + P+      S+V                    G+Q  RRCSHCQVQKTPQWRTGP+G 
Sbjct: 179 LSLPSSPSSCSSSVTEPPAKKQKKRAEAQP---VGAQVQRRCSHCQVQKTPQWRTGPLGA 235

Query: 460 KTLCNACGVRYKSGRLFPEYRPACSPTFSQEIHSNSHRKVLEMRR 326
           KTLCNACGVRYKSGRLF EYRPACSPTF  +IHSNSHRKVLE+R+
Sbjct: 236 KTLCNACGVRYKSGRLFSEYRPACSPTFCSDIHSNSHRKVLEIRK 280

