BLASTX nr result

ID: Cephaelis21_contig00015259 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00015259
         (1544 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Viti...   254   4e-65
emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]   250   6e-64
emb|CBI17417.3| unnamed protein product [Vitis vinifera]              243   1e-61
ref|XP_002521500.1| conserved hypothetical protein [Ricinus comm...   231   5e-58
gb|ADL36694.1| GATA domain class transcription factor [Malus x d...   230   9e-58

>ref|XP_002272762.1| PREDICTED: GATA transcription factor 5 [Vitis vinifera]
          Length = 338

 Score =  254 bits (649), Expect = 4e-65
 Identities = 152/318 (47%), Positives = 178/318 (55%), Gaps = 18/318 (5%)
 Frame = +3

Query: 354  NGQSGAVENDFFVDDLLDFSNATLSED--PEEQKQDN------------MLEKDGCATTR 491
            NGQSG   +DF +DDLLDF+N  + E    EE ++D             + E D    T 
Sbjct: 35   NGQSGVSGDDFSIDDLLDFTNGGIGEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTT 94

Query: 492  TTAKSGVVLAPKQELDSLPVSGLSVPADDLESLEWLSHFVEDSFSHYSLTCPVPKLPPEA 671
            TT       + K E  S+P + L+VPADDL  LEWLSHFVEDSFS YS   P   L  +A
Sbjct: 95   TT------FSVKDEFPSVPATELTVPADDLADLEWLSHFVEDSFSEYSAPFPHGTLTEKA 148

Query: 672  SKSRSEPEIIMVEVKPSMSITTHVQTKARTKRARTGGRVWXXXXXXXXXXXXXXXXXXXX 851
                  P      ++    + T    KAR+KRARTGGRVW                    
Sbjct: 149  QNQTENPPEPETPLQIKSCLKTPFPAKARSKRARTGGRVWSMGSPSLTESSSSSSSSSSS 208

Query: 852  XXXXXXVPWLLYSSSGLCQTQLTVESXXXXXXXXXXXXXXALIESSSGGAQQ-PRRCSHC 1028
                   PWL+Y ++  CQ    VES               L   +SG AQ  P RCSHC
Sbjct: 209  SLSS---PWLIYPNT--CQN---VESFHSAVKPPAKKHKKRLDPEASGSAQPTPHRCSHC 260

Query: 1029 GVQKTPQWRAGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSNELHSNNHRKVLEMRR 1208
            GVQKTPQWR GPLGAKTLCNACGVR+KSGRLLPEYRPACSPTFS+E+HSN+HRKVLEMRR
Sbjct: 261  GVQKTPQWRTGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMRR 320

Query: 1209 KKEM---ETGLGPPVQSF 1253
            KKE+   E+GL P V SF
Sbjct: 321  KKEVTRPESGLAPAVPSF 338


>emb|CAN64003.1| hypothetical protein VITISV_037635 [Vitis vinifera]
          Length = 338

 Score =  250 bits (639), Expect = 6e-64
 Identities = 151/318 (47%), Positives = 176/318 (55%), Gaps = 18/318 (5%)
 Frame = +3

Query: 354  NGQSGAVENDFFVDDLLDFSNATLSED--PEEQKQDN------------MLEKDGCATTR 491
            NGQSG   +DF +DDLLDF+N  + E    EE ++D             + E D    T 
Sbjct: 35   NGQSGVSGDDFSIDDLLDFTNGGIGEGLFQEEDEEDEDKGCGSLSPRRELTENDNSNLTT 94

Query: 492  TTAKSGVVLAPKQELDSLPVSGLSVPADDLESLEWLSHFVEDSFSHYSLTCPVPKLPPEA 671
            TT       + K E  S+P + L+VPADDL  LEWLSHFVEDSFS YS   P   L  +A
Sbjct: 95   TT------FSVKDEFPSVPATELTVPADDLADLEWLSHFVEDSFSEYSAPFPPGTLTEKA 148

Query: 672  SKSRSEPEIIMVEVKPSMSITTHVQTKARTKRARTGGRVWXXXXXXXXXXXXXXXXXXXX 851
                  P      ++    + T    KAR+KRARTGGRVW                    
Sbjct: 149  QNQTENPPEPETPLQIKSCLKTPFPAKARSKRARTGGRVWSMGSPSLTESSSSSSSSSSS 208

Query: 852  XXXXXXVPWLLYSSSGLCQTQLTVESXXXXXXXXXXXXXXALIESSSGGAQQ-PRRCSHC 1028
                   PWL+Y ++  CQ    VES               L   +SG AQ  P RCSHC
Sbjct: 209  SLSS---PWLIYPNT--CQN---VESFHSAVKPPAKKHKKRLDPEASGSAQXTPHRCSHC 260

Query: 1029 GVQKTPQWRAGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSNELHSNNHRKVLEMRR 1208
            GVQKT QWR GPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFS+E+HSN+HRKVLEMRR
Sbjct: 261  GVQKTXQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMRR 320

Query: 1209 KKEM---ETGLGPPVQSF 1253
            KKE+    +GL P V SF
Sbjct: 321  KKEVTRPXSGLAPAVPSF 338


>emb|CBI17417.3| unnamed protein product [Vitis vinifera]
          Length = 305

 Score =  243 bits (620), Expect = 1e-61
 Identities = 145/317 (45%), Positives = 172/317 (54%), Gaps = 17/317 (5%)
 Frame = +3

Query: 354  NGQSGAVENDFFVDDLLDFSNATLSED--PEEQKQDN------------MLEKDGCATTR 491
            NGQSG   +DF +DDLLDF+N  + E    EE ++D             + E D    T 
Sbjct: 35   NGQSGVSGDDFSIDDLLDFTNGGIGEGLFQEEDEEDEDKGCGSLSPRGELTENDNSNLTT 94

Query: 492  TTAKSGVVLAPKQELDSLPVSGLSVPADDLESLEWLSHFVEDSFSHYSLTCPVPKLPPEA 671
            TT       + K E  S+P + L+VPADDL  LEWLSHFVEDSFS YS   P   L  +A
Sbjct: 95   TT------FSVKDEFPSVPATELTVPADDLADLEWLSHFVEDSFSEYSAPFPHGTLTEKA 148

Query: 672  SKSRSEPEIIMVEVKPSMSITTHVQTKARTKRARTGGRVWXXXXXXXXXXXXXXXXXXXX 851
                  P      ++    + T    KAR+KRARTGGRVW                    
Sbjct: 149  QNQTENPPEPETPLQIKSCLKTPFPAKARSKRARTGGRVWSMGS---------------- 192

Query: 852  XXXXXXVPWLLYSSSGLCQTQLTVESXXXXXXXXXXXXXXALIESSSGGAQQPRRCSHCG 1031
                   P L  SSS    +  +++                  E+S      P RCSHCG
Sbjct: 193  -------PSLTESSSSSSSSSSSLDP-----------------EASGSAQPTPHRCSHCG 228

Query: 1032 VQKTPQWRAGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSNELHSNNHRKVLEMRRK 1211
            VQKTPQWR GPLGAKTLCNACGVR+KSGRLLPEYRPACSPTFS+E+HSN+HRKVLEMRRK
Sbjct: 229  VQKTPQWRTGPLGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSEIHSNHHRKVLEMRRK 288

Query: 1212 KEM---ETGLGPPVQSF 1253
            KE+   E+GL P V SF
Sbjct: 289  KEVTRPESGLAPAVPSF 305


>ref|XP_002521500.1| conserved hypothetical protein [Ricinus communis]
            gi|223539178|gb|EEF40771.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 398

 Score =  231 bits (588), Expect = 5e-58
 Identities = 146/329 (44%), Positives = 173/329 (52%), Gaps = 32/329 (9%)
 Frame = +3

Query: 360  QSGAVENDFFVDDLLDFSN----ATLSEDPEEQKQDNMLEKDGCATTRTTAKSGVVLAPK 527
            Q+G   +DF VD+LLDFSN    A   ED EE++Q    ++  C          V L+P 
Sbjct: 76   QNGTSSDDFIVDELLDFSNEEEAAVEREDEEEEEQQQ--QQKACTAV------SVSLSPN 127

Query: 528  QELDSLPVSG------------LSVPADDLESLEWLSHFVEDSFSHYSLTCPVPKLPP-E 668
            Q+    P  G            L VPADDL SLEWLSHFVEDS S YS   P   +   E
Sbjct: 128  QQQTQRPEDGKISDSTSNFATELCVPADDLASLEWLSHFVEDSNSEYSTPFPAAGIVSHE 187

Query: 669  ASKSRSEPEIIMVEVKPSMSITTH----VQTKARTKRARTGGRVWXXXXXXXXXXXXXXX 836
              K  ++ +   V  KP +   T     VQTKAR+KR RTG RVW               
Sbjct: 188  NHKEENDNKPFYVTQKPVVLTETFFKTPVQTKARSKRTRTGVRVWPLGSPSLTESSSSSS 247

Query: 837  XXXXXXXXXXXV-------PWLLYSSSGLCQTQLTVESXXXXXXXXXXXXXXALIESSSG 995
                               P+L++++ G+ +                         S  G
Sbjct: 248  YTSSSSSSSSSSSSSSPLSPYLIFTTQGMSRELTEPICYEKTPIKKLKKRFSGEPASGGG 307

Query: 996  GAQQPRRCSHCGVQKTPQWRAGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSNELHS 1175
            G+Q PRRCSHCGVQKTPQWR GPLGAKTLCNACGVRFKSGRLLPEYRPACSPTF +ELHS
Sbjct: 308  GSQPPRRCSHCGVQKTPQWRTGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFCSELHS 367

Query: 1176 NNHRKVLEMRRKKE----METGLGPPVQS 1250
            N+HRKVLEMR+KKE    +E GL PP  S
Sbjct: 368  NHHRKVLEMRKKKEVVVQVEPGLVPPAVS 396


>gb|ADL36694.1| GATA domain class transcription factor [Malus x domestica]
          Length = 331

 Score =  230 bits (586), Expect = 9e-58
 Identities = 152/326 (46%), Positives = 181/326 (55%), Gaps = 15/326 (4%)
 Frame = +3

Query: 321  LAFSGDLSGGS--NGQSGAVENDFFVDDLLDFSN------ATLSEDPEEQKQDNMLEKDG 476
            + F   L GG+  NGQ+    +DF VDDLLDFSN          E+ +++K    +    
Sbjct: 26   VVFDDFLWGGAVVNGQNAC--DDFSVDDLLDFSNEDGFVETEAEEEGDKEKVKGFVSVSL 83

Query: 477  CATTRTTAKSGVVLAPKQELDSLPVSGLSVPADDLESLEWLSHFVEDSFSHYSLTCPVPK 656
                + T KS   L+ K E    P S LSVPADDLE+LEWLSHFVEDSFS ++   P   
Sbjct: 84   QKQNQETEKSN--LSEKIE----PASELSVPADDLENLEWLSHFVEDSFSEFTTALPAGF 137

Query: 657  LPPEASKSRSEPEI-IMVEVKPSMSITTHVQTKARTKRARTGGRVWXXXXXXXXXXXXXX 833
            L PE  KS   P++      KP     T V  KAR+KR RTGGRVW              
Sbjct: 138  L-PEKPKSEKRPDLETPFPEKPCFK--TPVPAKARSKRRRTGGRVWSLGSPSLTESSSSS 194

Query: 834  XXXXXXXXXXXXVPWLLYSSSGLCQTQLTVESXXXXXXXXXXXXXXALIESSSGGAQQPR 1013
                         PW +Y ++   Q Q + E                L++ SS  +Q PR
Sbjct: 195  SSSSSSSPSS---PWTIYPAT---QNQESAE-PVSSVEKPPRKPKRRLVDGSS--SQPPR 245

Query: 1014 RCSHCGVQKTPQWRAGPLGAKTLCNACGVRFKSGRLLPEYRPACSPTFSNELHSNNHRKV 1193
            RCSHCGVQKTPQWR GP GAKTLCNACGVR+KSGRLLPEYRPACSPTFS+ELHSN+HRKV
Sbjct: 246  RCSHCGVQKTPQWRTGPNGAKTLCNACGVRYKSGRLLPEYRPACSPTFSSELHSNHHRKV 305

Query: 1194 LEMRRKK------EMETGLGPPVQSF 1253
            +EMRRKK      E  T + P V SF
Sbjct: 306  IEMRRKKEGPGTPEPSTTIPPAVPSF 331