BLASTX nr result

ID: Catharanthus23_contig00002043 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00002043
         (690 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EOY21971.1| DNA binding protein, putative isoform 1 [Theobrom...   109   9e-22
ref|XP_002284441.1| PREDICTED: transcription factor UNE10-like [...   107   3e-21
ref|XP_002514702.1| DNA binding protein, putative [Ricinus commu...   105   2e-20
gb|EOY21974.1| DNA binding protein, putative isoform 4 [Theobrom...   103   5e-20
emb|CBI15153.3| unnamed protein product [Vitis vinifera]              101   3e-19
emb|CAN78817.1| hypothetical protein VITISV_041734 [Vitis vinifera]    98   2e-18
gb|EXC11021.1| Transcription factor UNE10 [Morus notabilis]            90   6e-16
ref|XP_004137596.1| PREDICTED: transcription factor UNE10-like [...    84   5e-14
ref|XP_006440685.1| hypothetical protein CICLE_v10020323mg [Citr...    82   2e-13
gb|EMJ11331.1| hypothetical protein PRUPE_ppa022963mg [Prunus pe...    73   1e-10
gb|EOY21973.1| DNA binding protein, putative isoform 3 [Theobrom...    71   4e-10
ref|XP_004242180.1| PREDICTED: transcription factor PIF7-like [S...    61   4e-07
ref|XP_006591039.1| PREDICTED: transcription factor UNE10-like [...    57   5e-06

>gb|EOY21971.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
           gi|508774716|gb|EOY21972.1| Basic helix-loop-helix
           DNA-binding superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 422

 Score =  109 bits (272), Expect = 9e-22
 Identities = 85/206 (41%), Positives = 103/206 (50%), Gaps = 19/206 (9%)
 Frame = +1

Query: 130 YVVPISNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAP 309
           ++VP+SNYEVAELTWENGQLAMHGL  +G+L              DTLESIVHQAT    
Sbjct: 36  HLVPMSNYEVAELTWENGQLAMHGL--SGLLPTAPPTKPTWGRSNDTLESIVHQATCHKQ 93

Query: 310 PININPI---------SAATASGGG----------VEKRASFVKKRMRSSESDQSGRHNX 432
             N N +         S+  AS  G          V   A+ +KKR R S+SDQ  ++  
Sbjct: 94  KQNFNLLQHDQTRSNRSSIAASSVGNWAESSSRLPVAAAAALLKKRAR-SDSDQCRKN-- 150

Query: 433 XXXXXXXXFNDDQQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFE 612
                     +D+                 +RSACAS SA FCR+ DA   TMMTW S E
Sbjct: 151 ----LSGGIQEDR----------------ADRSACASASAAFCRDNDA---TMMTWASHE 187

Query: 613 SPPGTGSFKTHNNKTTDDDSACHDAS 690
           SP    S KT   KT D+DS+ HD S
Sbjct: 188 SPQ---SMKT---KTADEDSSYHDGS 207


>ref|XP_002284441.1| PREDICTED: transcription factor UNE10-like [Vitis vinifera]
          Length = 423

 Score =  107 bits (268), Expect = 3e-21
 Identities = 76/194 (39%), Positives = 93/194 (47%), Gaps = 7/194 (3%)
 Frame = +1

Query: 130 YVVPISNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAP 309
           ++VP+SNYEVAELTWENGQLAMHGL G   L             GDTLESIVHQAT    
Sbjct: 40  HIVPMSNYEVAELTWENGQLAMHGLGG---LLPTAPTKPTWGRAGDTLESIVHQATCHNQ 96

Query: 310 -------PININPISAATASGGGVEKRASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDD 468
                    N+  + +   S   V+     + K+   S+S   GR+              
Sbjct: 97  NSNFIHHAQNLANMKSTVGSSAHVQTGNQGLMKKRTRSDSAHCGRN-------------- 142

Query: 469 QQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHN 648
                             +RSACAS SATFCR+ +   TTMMTW S ESP      ++  
Sbjct: 143 -------FSTNVHEAERADRSACASASATFCRDNE---TTMMTWPSSESP------RSLK 186

Query: 649 NKTTDDDSACHDAS 690
            KTTD+DSACH  S
Sbjct: 187 AKTTDEDSACHGGS 200


>ref|XP_002514702.1| DNA binding protein, putative [Ricinus communis]
           gi|223546306|gb|EEF47808.1| DNA binding protein,
           putative [Ricinus communis]
          Length = 440

 Score =  105 bits (261), Expect = 2e-20
 Identities = 84/211 (39%), Positives = 99/211 (46%), Gaps = 21/211 (9%)
 Frame = +1

Query: 121 PPYYVVPISNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATA 300
           P  ++VP+ N+E+AELTWENGQ+AMHGL   G +           T  +TLESIVHQAT 
Sbjct: 48  PTTHLVPMPNHEIAELTWENGQIAMHGL--GGFVHPSQTKATWGRT-NETLESIVHQATC 104

Query: 301 GAPPININPIS---------------------AATASGGGVEKRASFVKKRMRSSESDQS 417
               +N N                        A T+SG         +KKR R SES+Q 
Sbjct: 105 HNQNLNSNQQGEKQSHQPTIASSTVASSDGKWAETSSGHQAGMAPLLMKKRTR-SESNQC 163

Query: 418 GRHNXXXXXXXXXFNDDQQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMT 597
            R           FN   +                + SACAS SATFCRE D   TTMMT
Sbjct: 164 AR----------SFNGSTR------------EEHMDLSACASASATFCRESD---TTMMT 198

Query: 598 WTSFESPPGTGSFKTHNNKTTDDDSACHDAS 690
           W SFESPP   S K    KTTD+DSA H  S
Sbjct: 199 WASFESPP--PSLKA---KTTDEDSASHGGS 224


>gb|EOY21974.1| DNA binding protein, putative isoform 4 [Theobroma cacao]
          Length = 397

 Score =  103 bits (257), Expect = 5e-20
 Identities = 83/201 (41%), Positives = 98/201 (48%), Gaps = 19/201 (9%)
 Frame = +1

Query: 145 SNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAPPININ 324
           SNYEVAELTWENGQLAMHGL  +G+L              DTLESIVHQAT      N N
Sbjct: 16  SNYEVAELTWENGQLAMHGL--SGLLPTAPPTKPTWGRSNDTLESIVHQATCHKQKQNFN 73

Query: 325 PI---------SAATASGGG----------VEKRASFVKKRMRSSESDQSGRHNXXXXXX 447
            +         S+  AS  G          V   A+ +KKR R S+SDQ  ++       
Sbjct: 74  LLQHDQTRSNRSSIAASSVGNWAESSSRLPVAAAAALLKKRAR-SDSDQCRKN------L 126

Query: 448 XXXFNDDQQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGT 627
                +D+                 +RSACAS SA FCR+ DA   TMMTW S ESP   
Sbjct: 127 SGGIQEDR----------------ADRSACASASAAFCRDNDA---TMMTWASHESPQ-- 165

Query: 628 GSFKTHNNKTTDDDSACHDAS 690
            S KT   KT D+DS+ HD S
Sbjct: 166 -SMKT---KTADEDSSYHDGS 182


>emb|CBI15153.3| unnamed protein product [Vitis vinifera]
          Length = 385

 Score =  101 bits (251), Expect = 3e-19
 Identities = 74/189 (39%), Positives = 88/189 (46%), Gaps = 7/189 (3%)
 Frame = +1

Query: 145 SNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAP----- 309
           SNYEVAELTWENGQLAMHGL G   L             GDTLESIVHQAT         
Sbjct: 7   SNYEVAELTWENGQLAMHGLGG---LLPTAPTKPTWGRAGDTLESIVHQATCHNQNSNFI 63

Query: 310 --PININPISAATASGGGVEKRASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDDQQMVV 483
               N+  + +   S   V+     + K+   S+S   GR+                   
Sbjct: 64  HHAQNLANMKSTVGSSAHVQTGNQGLMKKRTRSDSAHCGRN------------------- 104

Query: 484 XXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHNNKTTD 663
                        +RSACAS SATFCR+ +   TTMMTW S ESP      ++   KTTD
Sbjct: 105 --FSTNVHEAERADRSACASASATFCRDNE---TTMMTWPSSESP------RSLKAKTTD 153

Query: 664 DDSACHDAS 690
           +DSACH  S
Sbjct: 154 EDSACHGGS 162


>emb|CAN78817.1| hypothetical protein VITISV_041734 [Vitis vinifera]
          Length = 367

 Score = 98.2 bits (243), Expect = 2e-18
 Identities = 76/182 (41%), Positives = 85/182 (46%)
 Frame = +1

Query: 145 SNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAPPININ 324
           SNYEVAELTWENGQLAMHGL G   L             GDTLESIVHQAT   P I   
Sbjct: 40  SNYEVAELTWENGQLAMHGLGG---LLPTAPTKPTWGRAGDTLESIVHQAT---PEIQ-- 91

Query: 325 PISAATASGGGVEKRASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDDQQMVVXXXXXXX 504
                             +KKR R S+S   GR+                          
Sbjct: 92  ----------------GLMKKRTR-SDSAHCGRN---------------------FSTNV 113

Query: 505 XXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHNNKTTDDDSACHD 684
                 +RSACAS SATFCR+ +   TTMMTW S ESP      ++   KTTD+DSACH 
Sbjct: 114 HEAERADRSACASASATFCRDNE---TTMMTWPSSESP------RSLKAKTTDEDSACHG 164

Query: 685 AS 690
            S
Sbjct: 165 GS 166


>gb|EXC11021.1| Transcription factor UNE10 [Morus notabilis]
          Length = 449

 Score = 90.1 bits (222), Expect = 6e-16
 Identities = 82/235 (34%), Positives = 94/235 (40%), Gaps = 19/235 (8%)
 Frame = +1

Query: 34  RQVEAGEEERNRXXXXXXXXXXXXRQTHRPPYYVVPISNYEVAELTWENGQLAMHGLTGA 213
           RQ +   EE NR              T    + VVPISNY+V ELT  NGQL MHGL   
Sbjct: 15  RQEQVEGEEGNRSSHVPNQQNPTTTTTTSSSHLVVPISNYQVKELTPANGQLDMHGL--G 72

Query: 214 GILQXXXXXXXXXXTFGDTLESIVHQATA------------GAPPINI-----NPISAAT 342
           G+L           T G TLESIVHQAT             G  P  I      P+    
Sbjct: 73  GLLPLGPAKPTWGRT-GGTLESIVHQATCHTHDPNVTHHGHGQTPATIGSNIVGPLIGKW 131

Query: 343 ASGGGVEKRASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDDQQMVVXXXXXXXXXXXXX 522
           A   G     + V ++   S+SD  GR+           +    M               
Sbjct: 132 AENSGQAPPPTLVMRKRSRSDSDYGGRN----------LSSSSSM------------QEE 169

Query: 523 ERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHN--NKTTDDDSACH 681
                AS SATFCRE D   TTMMTW SFESP        HN  NKT D+D   H
Sbjct: 170 HGGPSASASATFCRESD---TTMMTWASFESP--------HNLKNKTNDEDFISH 213


>ref|XP_004137596.1| PREDICTED: transcription factor UNE10-like [Cucumis sativus]
           gi|449487081|ref|XP_004157490.1| PREDICTED:
           transcription factor UNE10-like [Cucumis sativus]
          Length = 458

 Score = 83.6 bits (205), Expect = 5e-14
 Identities = 71/230 (30%), Positives = 103/230 (44%), Gaps = 14/230 (6%)
 Frame = +1

Query: 34  RQVEAGEEERNRXXXXXXXXXXXXRQTHRPPYYVVPISNYEVAELTWENGQLAMHGLTGA 213
           RQV+  EEE  R              T     +   ++   + ELTW+NGQLA+HG+ G 
Sbjct: 31  RQVQVEEEEEKRSFHVPAEKNQHSTTTKPLVPFYQQMAKQGITELTWQNGQLALHGIDG- 89

Query: 214 GILQXXXXXXXXXXTFGDTLESIVHQA----------TAGAPPINI-NPISAATASGGGV 360
             LQ             DTLES+V+QA            G P ++    ++ + A+G  V
Sbjct: 90  --LQPTIPPKPTWNRANDTLESVVNQAKLQTQGPNLIQQGEPVVHTGRTLAPSGANGKWV 147

Query: 361 EK---RASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDDQQMVVXXXXXXXXXXXXXERS 531
           E+   +    +KR RS+ SD  G++           ++  Q+               + S
Sbjct: 148 ERGNNQEPTARKRTRST-SDYGGKNVSTSNNNNNNNSNTMQV------------DHGDHS 194

Query: 532 ACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHNNKTTDDDSACH 681
            C S SA FCR+ +   TT+MTW SF+SP    S KT   K+ D+DSACH
Sbjct: 195 VCGSASAAFCRDNE---TTLMTWASFDSP---RSLKT---KSIDEDSACH 235


>ref|XP_006440685.1| hypothetical protein CICLE_v10020323mg [Citrus clementina]
           gi|557542947|gb|ESR53925.1| hypothetical protein
           CICLE_v10020323mg [Citrus clementina]
          Length = 419

 Score = 82.0 bits (201), Expect = 2e-13
 Identities = 76/206 (36%), Positives = 96/206 (46%), Gaps = 24/206 (11%)
 Frame = +1

Query: 145 SNYEVA-ELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQAT-------- 297
           SNYEVA +LTW NGQL+MHGL   GI+           +  DTLESIVHQA         
Sbjct: 48  SNYEVAADLTWGNGQLSMHGL--GGIIPTTPTKPTWGRS-NDTLESIVHQAAITCHNNNN 104

Query: 298 ----------AGAPPININPISAATA-----SGGGVEKRASFVKKRMRSSESDQSGRHNX 432
                       +P  N + + +++      S G V      +KKR R ++SDQ GR+  
Sbjct: 105 NKEITLQLHGQNSPAANRSSMVSSSGTKCSESPGQVPVMPGPLKKRTR-ADSDQCGRN-- 161

Query: 433 XXXXXXXXFNDDQQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFE 612
                   F+  Q+                +RSACAS SAT  RE D   TTMMTW S+E
Sbjct: 162 --------FSSMQE-------------GRGDRSACASASATCFREND---TTMMTWASYE 197

Query: 613 SPPGTGSFKTHNNKTTDDDSACHDAS 690
                 S K+   KTTD+DSA H  S
Sbjct: 198 ------SLKSLKTKTTDEDSASHGRS 217


>gb|EMJ11331.1| hypothetical protein PRUPE_ppa022963mg [Prunus persica]
          Length = 429

 Score = 72.8 bits (177), Expect = 1e-10
 Identities = 67/194 (34%), Positives = 81/194 (41%), Gaps = 15/194 (7%)
 Frame = +1

Query: 145 SNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATA-------- 300
           SNY+V EL  ENGQLAMHGL G   L             GDTLES+VHQAT         
Sbjct: 7   SNYDVRELKLENGQLAMHGLGG---LLPTSQAKHTWGRAGDTLESVVHQATHHKREPNLI 63

Query: 301 --GAPPININPISAA-----TASGGGVEKRASFVKKRMRSSESDQSGRHNXXXXXXXXXF 459
             G  P NI+ + A+     T  GG V     +++KR RS +SD  G +           
Sbjct: 64  HNGQTPANISSMLASSGRTWTDEGGQVPLAEGWMRKRTRS-DSDYHGNNFSGSTTSIHEE 122

Query: 460 NDDQQMVVXXXXXXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFK 639
           + D                    S   S SA  CR+       M TW SFES P   S  
Sbjct: 123 HADPSTCA---------------SPSPSASAKLCRDNQK---IMTTWASFESLPSLKS-- 162

Query: 640 THNNKTTDDDSACH 681
               K+ D+DSA H
Sbjct: 163 ---TKSPDEDSASH 173


>gb|EOY21973.1| DNA binding protein, putative isoform 3 [Theobroma cacao]
          Length = 366

 Score = 70.9 bits (172), Expect = 4e-10
 Identities = 66/185 (35%), Positives = 82/185 (44%), Gaps = 19/185 (10%)
 Frame = +1

Query: 193 MHGLTGAGILQXXXXXXXXXXTFGDTLESIVHQATAGAPPININPIS-----------AA 339
           MHGL+G  +L              DTLESIVHQAT      N N +            AA
Sbjct: 1   MHGLSG--LLPTAPPTKPTWGRSNDTLESIVHQATCHKQKQNFNLLQHDQTRSNRSSIAA 58

Query: 340 TASGGGVEKR--------ASFVKKRMRSSESDQSGRHNXXXXXXXXXFNDDQQMVVXXXX 495
           ++ G   E          A+ +KKR RS +SDQ  ++            +D+        
Sbjct: 59  SSVGNWAESSSRLPVAAAAALLKKRARS-DSDQCRKN------LSGGIQEDRA------- 104

Query: 496 XXXXXXXXXERSACASESATFCREKDAGTTTMMTWTSFESPPGTGSFKTHNNKTTDDDSA 675
                    +RSACAS SA FCR+ DA   TMMTW S ESP    S KT   KT D+DS+
Sbjct: 105 ---------DRSACASASAAFCRDNDA---TMMTWASHESPQ---SMKT---KTADEDSS 146

Query: 676 CHDAS 690
            HD S
Sbjct: 147 YHDGS 151


>ref|XP_004242180.1| PREDICTED: transcription factor PIF7-like [Solanum lycopersicum]
          Length = 414

 Score = 60.8 bits (146), Expect = 4e-07
 Identities = 57/142 (40%), Positives = 63/142 (44%), Gaps = 13/142 (9%)
 Frame = +1

Query: 25  QQIRQVEAGEEERNRXXXXXXXXXXXXRQTHRPPYYVVPISNY-EVAELTWENGQLAMHG 201
           +Q +QV   EEE NR               H     V P+SN  EVAELTWENGQ+AMH 
Sbjct: 12  KQEQQVVEKEEEENRYTRG---------HVHNQQNQVDPMSNKCEVAELTWENGQVAMHR 62

Query: 202 LTGAGILQXXXXXXXXXXTFGDTLESIVHQAT------------AGAPPININPISAATA 345
           L   G               GDTLESIVHQAT             G    NIN       
Sbjct: 63  L---GSNLSNEQTKHTWGKAGDTLESIVHQATFQKQHHSYIMGSDGQNQANIN--REKNV 117

Query: 346 SGGGVEKRASFVKKRMRSSESD 411
           S G  + R   V KRMRSS+SD
Sbjct: 118 SYGAQQTRG--VLKRMRSSDSD 137


>ref|XP_006591039.1| PREDICTED: transcription factor UNE10-like [Glycine max]
          Length = 465

 Score = 57.0 bits (136), Expect = 5e-06
 Identities = 34/80 (42%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
 Frame = +1

Query: 136 VPISNYEVAELTWENGQLAMHGLTGAGILQXXXXXXXXXXTF-----GDTLESIVHQATA 300
           VP+ +YEVAELTWENGQL+MHGL    +            T+       TLESIV+QAT+
Sbjct: 32  VPMLDYEVAELTWENGQLSMHGLGLPRVPVKPPTAATNKYTWEKPRGSGTLESIVNQATS 91

Query: 301 GAPPININPISAATASGGGV 360
            +      P++  +  GGGV
Sbjct: 92  FSHQEKPRPLNGDSGGGGGV 111

