BLASTX nr result

ID: Cephaelis21_contig00016020 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00016020
         (2025 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248...   369   2e-99
ref|XP_002526200.1| transcription factor hy5, putative [Ricinus ...   356   1e-95
ref|XP_002323223.1| predicted protein [Populus trichocarpa] gi|2...   343   1e-91
ref|NP_565946.1| basic helix-loop-helix domain-containing protei...   335   4e-89
gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding...   334   5e-89

>ref|XP_002277884.2| PREDICTED: uncharacterized protein LOC100248184 [Vitis vinifera]
          Length = 768

 Score =  369 bits (946), Expect = 2e-99
 Identities = 256/572 (44%), Positives = 309/572 (54%), Gaps = 70/572 (12%)
 Frame = -3

Query: 1507 SSSSPEPEVCQLSGD---QVSGDRGSGMNQI-----------------SGDRGHDVSGYL 1388
            +S SPE   C +      QVSGDR S ++ I                 S D+  D +  L
Sbjct: 100  NSPSPESGNCGVESSLPCQVSGDRNSDVSSIELGCCDQKLSPPVASQSSSDQNLDGARVL 159

Query: 1387 NVPSPESHGSKGSNDSRVLNCPSPESQGSGNCRXXXXXXXXXXXXXXXXXXVENGVVDSK 1208
            NVPSPES    GS D R  + P   SQGSGN                      N VVD K
Sbjct: 160  NVPSPES----GSCD-RGFSGPE-SSQGSGN-------------GGSGVPGAVNCVVDQK 200

Query: 1207 IKLEELNNSMFNNTLLKRKKESDDSSNFEAKTNKFRKSTVNSVSSDCXXXXXXXXXXXXE 1028
            +KLE+       N++ KRKKE DDS+  E++++KFR+S++ S +++              
Sbjct: 201  VKLEDSGK----NSVPKRKKEQDDSTT-ESRSSKFRRSSICSETANASNDEEE------- 248

Query: 1027 KKRARLMRNRESAQLSRQRKKHYVEELEDKVKSMHSTIQDLNAKISYFMAENATLRQQXX 848
            KK+ARLMRNRESAQLSRQRKKHYVEELE+K++SMHSTIQDL  KIS  MAENA LRQQ  
Sbjct: 249  KKKARLMRNRESAQLSRQRKKHYVEELEEKIRSMHSTIQDLTGKISIIMAENANLRQQFG 308

Query: 847  XXXXXXXXXXXXXXXXXXXXXXXXXXXXPCGVPPYMVKPQGSQVPLVPIPRLKSQQAASA 668
                                             PY+VKPQGSQVPLVPIPRLK Q   SA
Sbjct: 309  GGGMCPPPHAGMYPHPSMAPMAYPWVPCA----PYVVKPQGSQVPLVPIPRLKPQAPVSA 364

Query: 667  PRVSKKAECKKGEGKTKKFAXXXXXXXXXXXXXXXXXVPMVNMSFGGVRETFSGGSGYTE 488
            P+V KK E KK E K+KK                   VP VN+ +GG++ET  G S Y  
Sbjct: 365  PKV-KKTENKKNETKSKKVVSVSLLGMLSFMFLMGCLVPFVNIKYGGIKETVPGRSDYIS 423

Query: 487  ERFYEKHHGRVLTVNGNMSGADYSE-------------KFGENGKEF------------- 386
             RF + H  R+LTV  +++G++Y               + G +G E              
Sbjct: 424  NRFSDMHRRRILTVKDDLNGSNYGMGVGFDDRIHSERGRGGGSGSEVKQKGGGSKPLPGS 483

Query: 385  -------NCSEPLVASLYVPRNDKMVKIDGNLIIHSVLASEKAMESLRDPANKAGS---- 239
                   N SEPLVASLYVPRNDK+VKIDGNLIIHSVLASEKAM S    A K+      
Sbjct: 484  DGYAHSRNASEPLVASLYVPRNDKLVKIDGNLIIHSVLASEKAMASHAALAKKSPKPSVS 543

Query: 238  ------ETSLAVSRELAPVVPVSG-----GRSPHLYRNPSEHQRAL--GSTSVGKENLKS 98
                  ET LA++  LA   PVS      GR PHL+RNP+E  +AL  GS+   KENL+ 
Sbjct: 544  LANDVRETGLAIAGNLATAFPVSEVGRNKGRHPHLFRNPAEQHKALASGSSDTLKENLQP 603

Query: 97   PATDGRLQQWFREGLAGPMLSSGMCTEVFQFD 2
             +TDG+LQQWFREGLAGPMLSSGMCTEVFQFD
Sbjct: 604  TSTDGKLQQWFREGLAGPMLSSGMCTEVFQFD 635


>ref|XP_002526200.1| transcription factor hy5, putative [Ricinus communis]
            gi|223534478|gb|EEF36179.1| transcription factor hy5,
            putative [Ricinus communis]
          Length = 702

 Score =  356 bits (914), Expect = 1e-95
 Identities = 235/516 (45%), Positives = 290/516 (56%), Gaps = 41/516 (7%)
 Frame = -3

Query: 1426 ISGDRGHDVSGYLNVPSPESHGSKGSNDSRVLNCPSP-ESQGSGNCRXXXXXXXXXXXXX 1250
            ISGD  H V+ YLN     S+ +   +    LN  SP  SQGSGN               
Sbjct: 98   ISGD--HHVATYLNSSPSASNSTTTCSSGDQLNVSSPVSSQGSGN-------------GG 142

Query: 1249 XXXXXVENGVVDSKIKLEE--LNNSMFNNTLLKRKKESDDSSNFEAKTNKFRKSTVNSVS 1076
                   N VVD K+KLEE   N+   N +L KRKKE+      + +  K+R+S  ++ +
Sbjct: 143  SGVSDSVNFVVDQKVKLEEEGSNSKNKNGSLSKRKKENGSE---DTRNQKYRRSENSNAN 199

Query: 1075 SDCXXXXXXXXXXXXEKKRARLMRNRESAQLSRQRKKHYVEELEDKVKSMHSTIQDLNAK 896
            + C             K++ARLMRNRESAQLSRQRKKHYVEELEDKVK+MHSTI DLN+K
Sbjct: 200  TQCVSDEDE-------KRKARLMRNRESAQLSRQRKKHYVEELEDKVKTMHSTIADLNSK 252

Query: 895  ISYFMAENATLRQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPCGVPPYMVKPQGSQV 716
            IS+FMAENATLRQQ                                   PY+VK QGSQV
Sbjct: 253  ISFFMAENATLRQQLSGGNGMCPPPMYAPMPYPWVPCA-----------PYVVKAQGSQV 301

Query: 715  PLVPIPRLKSQQAASAPRVSKKAECKKGEGKTKKFAXXXXXXXXXXXXXXXXXVPMVNMS 536
            PLVPIPRLKSQQ  SA + SKK++ KK EGKTKK A                 VP+VN+ 
Sbjct: 302  PLVPIPRLKSQQPVSAAK-SKKSDPKKAEGKTKKVASVSFLGLLFFVLLFGGLVPIVNVK 360

Query: 535  FGGVRETFSGGSGYTEERFYEKHHGRVLTVNGNMSGA-----------DYSEKF------ 407
            FGGV E  +G +G+  ++FY +H GRVL V+G+ +G+           D+   F      
Sbjct: 361  FGGVGE--NGANGFVSDKFYNRHRGRVLRVDGHSNGSHENVDVGFSTGDFDSCFRIQCGS 418

Query: 406  GENG-------------------KEFNCSEPLVASLYVPRNDKMVKIDGNLIIHSVLASE 284
            G NG                   +  N S+PL ASLYVPRNDK+VKIDGNLIIHSVLASE
Sbjct: 419  GRNGCLAEKKGRLEHLPEADELVRRGNNSKPLAASLYVPRNDKLVKIDGNLIIHSVLASE 478

Query: 283  KAMESLRDPANKAGSETSLAVSRELAPVVPVSGGRSPHLYRNPSEHQRAL--GSTSVGKE 110
            +AM S  +P      ET LA+ R+L+P  P   GR  HLY + +E Q+AL  GS+    +
Sbjct: 479  RAMSSNENPEANKSKETGLAIPRDLSP-SPTIPGRYSHLYGHHNERQKALTSGSSDTLND 537

Query: 109  NLKSPATDGRLQQWFREGLAGPMLSSGMCTEVFQFD 2
            + KS A DG+LQQWF EGLAGP+LSSGMC+EVFQFD
Sbjct: 538  HKKSAAADGKLQQWFHEGLAGPLLSSGMCSEVFQFD 573


>ref|XP_002323223.1| predicted protein [Populus trichocarpa] gi|222867853|gb|EEF04984.1|
            predicted protein [Populus trichocarpa]
          Length = 623

 Score =  343 bits (880), Expect = 1e-91
 Identities = 229/501 (45%), Positives = 271/501 (54%), Gaps = 21/501 (4%)
 Frame = -3

Query: 1441 SGMNQISGDRGH-DVSGYLNVPSPESHGS--KGSNDSRVLNCPSPESQGSGNCRXXXXXX 1271
            SG + I GD+G  +V  YLN PSP   GS   G +DSR  +     S GSGN        
Sbjct: 57   SGGSGICGDQGGLEVDKYLN-PSPSEAGSCDSGGSDSRSSDLGPASSHGSGNSGSG---- 111

Query: 1270 XXXXXXXXXXXXVENGVVDSKIKLEELNNSMFNNTLLKRKKESDDSSNFEAKTN-KFRKS 1094
                                                  RKKE  D  N +   N K RK+
Sbjct: 112  --------------------------------------RKKEMGDGENGDVMRNFKSRKA 133

Query: 1093 TVNSVSSDCXXXXXXXXXXXXEKKRARLMRNRESAQLSRQRKKHYVEELEDKVKSMHSTI 914
                VS +              K+RARL+RNRESA LSRQRKKHYVEELEDKV++MHSTI
Sbjct: 134  EGEDVSVNVGGGVVSSEEEE--KRRARLVRNRESAHLSRQRKKHYVEELEDKVRAMHSTI 191

Query: 913  QDLNAKISYFMAENATLRQQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPCGVPPYMVK 734
             DLN K+SYFMAENATLRQQ                                   PY+VK
Sbjct: 192  ADLNGKVSYFMAENATLRQQLNGNSACPPPMYAPMAPYPWVP-----------CAPYVVK 240

Query: 733  PQGSQVPLVPIPRLKSQQAASAPRVSKKAECKKGEGKTKKFAXXXXXXXXXXXXXXXXXV 554
            PQGSQVPLVPIPRLK QQA    + +KK E KKGEGKTKK A                  
Sbjct: 241  PQGSQVPLVPIPRLKPQQAVPMAK-TKKVESKKGEGKTKKVASVSLIGLVFFILLFGGLA 299

Query: 553  PMVNMSFGGVRETFSGGSGYTEERFYEKHHGRVLTVNGNMSGADY--------------- 419
            PMV++ FGGVRE+   G G+  ERF ++H GRVL V+G+ +G+                 
Sbjct: 300  PMVDVKFGGVRESGISGFGFGSERFLDQHKGRVLIVDGHSNGSHENHDSANKGAAEHLPG 359

Query: 418  SEKFGENGKEFNCSEPLVASLYVPRNDKMVKIDGNLIIHSVLASEKAMESLRDPANKAGS 239
            S++FG+ G   N SE LVASLYVPRNDK+VKIDGNLIIHS+LASE+AM S   P      
Sbjct: 360  SDEFGQFG---NASEQLVASLYVPRNDKLVKIDGNLIIHSILASERAMASHESPEVNITK 416

Query: 238  ETSLAVSRELAPVVPVSGGRSPHLYRNPSEHQRAL--GSTSVGKENLKSPATDGRLQQWF 65
            +T+LA+     P V  + GR  H+YR  +E Q+AL  GS    K+NLKS A  G+LQQWF
Sbjct: 417  QTALAI-----PDVGNNRGRHSHVYRTHAERQKALASGSADTSKDNLKSSAAKGKLQQWF 471

Query: 64   REGLAGPMLSSGMCTEVFQFD 2
            REGLAGP+LSSGMCTEVFQFD
Sbjct: 472  REGLAGPLLSSGMCTEVFQFD 492


>ref|NP_565946.1| basic helix-loop-helix domain-containing protein [Arabidopsis
            thaliana] gi|20196934|gb|AAB86455.2| bZIP family
            transcription factor [Arabidopsis thaliana]
            gi|330254811|gb|AEC09905.1| basic helix-loop-helix
            domain-containing protein [Arabidopsis thaliana]
          Length = 721

 Score =  335 bits (858), Expect = 4e-89
 Identities = 222/528 (42%), Positives = 286/528 (54%), Gaps = 29/528 (5%)
 Frame = -3

Query: 1498 SPEPEVCQLSGDQVSGDRGSGMNQISGDRGHDVSGYLNVPSPESHGSKGSNDSRVLNCPS 1319
            +PE E   +SGD +             D+    SG +N  SP     + S     L+ P+
Sbjct: 94   TPESESSGISGDCIVPK--------DADKTITTSGCINRESPRDSDDRCSGADHNLDLPT 145

Query: 1318 P-ESQGSGNCRXXXXXXXXXXXXXXXXXXVENGVVDSKIKLEELNNSMFNNTLLKRKKES 1142
            P  SQGSGNC                     N  VD K+K+EE   +    ++ KRKKE 
Sbjct: 146  PLSSQGSGNC-----GSDVSEATNESSPKSRNVAVDQKVKVEEAATT--TTSITKRKKEI 198

Query: 1141 DDSSNFEAKTNKFRKSTVNSVSSDCXXXXXXXXXXXXEKKRARLMRNRESAQLSRQRKKH 962
            D+    E++ +K+R+S  ++ +S              EKKRARLMRNRESAQLSRQRKKH
Sbjct: 199  DEDLTDESRNSKYRRSGEDADAS-------AVTGEEDEKKRARLMRNRESAQLSRQRKKH 251

Query: 961  YVEELEDKVKSMHSTIQDLNAKISYFMAENATLRQQXXXXXXXXXXXXXXXXXXXXXXXX 782
            YVEELE+KV++MHSTI DLN KISYFMAENATLRQQ                        
Sbjct: 252  YVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMGMYPPMAP 311

Query: 781  XXXXXXPCGVPPYMVKPQGSQVPLVPIPRLKSQQAASAPRVSKKAECKKGEGKTKKFAXX 602
                  PC  PPYMVK QGSQVPL+PIPRLK Q      + +KK+E KK E KTKK A  
Sbjct: 312  MPYPWMPC--PPYMVKQQGSQVPLIPIPRLKPQNTLGTSK-AKKSESKKSEAKTKKVASI 368

Query: 601  XXXXXXXXXXXXXXXVPMVNMSFGGVRETFSGG--SGYTEERFYEKHHGRVLTVNGNMSG 428
                            P+VN+++GG+   F G   S Y  ++ Y +H  RVL  + + +G
Sbjct: 369  SFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQIYSQHRDRVLDTSRSGAG 428

Query: 427  ADYSEKFG-ENGKEF------------------NCSEPLVASLYVPRNDKMVKIDGNLII 305
               S   G   G++                   N SEPLVASL+VPRNDK+VKIDGNLII
Sbjct: 429  TGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKIDGNLII 488

Query: 304  HSVLASEKAMESLRDPANKAGSETSLAVSRELAPVVPVSG-GR----SPHLYRNPSEHQR 140
            +S+LASEKA+ S R  +     +  L +S++  P +P+   GR    + HLYR+ +E Q+
Sbjct: 489  NSILASEKAVAS-RKASESKERKADLMISKDYTPALPLPDVGRTEELAKHLYRSKAEKQK 547

Query: 139  AL--GSTSVGKENLKSPATDGRLQQWFREGLAGPMLSSGMCTEVFQFD 2
            AL  GS    K+ +K+ A +G +QQWFREG+AGPM SSGMCTEVFQFD
Sbjct: 548  ALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEVFQFD 595


>gb|AAM96961.1| putative TGACG-sequence-specific bZIP DNA-binding protein
            [Arabidopsis thaliana] gi|23198400|gb|AAN15727.1|
            putative TGACG-sequence-specific bZIP DNA-binding protein
            [Arabidopsis thaliana]
          Length = 721

 Score =  334 bits (857), Expect = 5e-89
 Identities = 221/528 (41%), Positives = 286/528 (54%), Gaps = 29/528 (5%)
 Frame = -3

Query: 1498 SPEPEVCQLSGDQVSGDRGSGMNQISGDRGHDVSGYLNVPSPESHGSKGSNDSRVLNCPS 1319
            +PE E   +SGD +             D+    SG +N  SP     + S     L+ P+
Sbjct: 94   TPESESSGISGDCIVPK--------DADKTITTSGCINRESPRDSDDRCSGADHNLDLPT 145

Query: 1318 P-ESQGSGNCRXXXXXXXXXXXXXXXXXXVENGVVDSKIKLEELNNSMFNNTLLKRKKES 1142
            P  SQGSGNC                     N  VD K+K+EE   +    ++ KRKKE 
Sbjct: 146  PLSSQGSGNC-----GSDVSEATNESSPKSRNVAVDQKVKVEEAATT--TTSITKRKKEI 198

Query: 1141 DDSSNFEAKTNKFRKSTVNSVSSDCXXXXXXXXXXXXEKKRARLMRNRESAQLSRQRKKH 962
            D+    E++ +K+R+S  ++ +S              EKKRARLMRNRESAQLSRQRKKH
Sbjct: 199  DEDLTDESRNSKYRRSGEDADAS-------AVTGEEDEKKRARLMRNRESAQLSRQRKKH 251

Query: 961  YVEELEDKVKSMHSTIQDLNAKISYFMAENATLRQQXXXXXXXXXXXXXXXXXXXXXXXX 782
            YVEELE+KV++MHSTI DLN KISYFMAENATLRQQ                        
Sbjct: 252  YVEELEEKVRNMHSTITDLNGKISYFMAENATLRQQLGGNGMCPPHLPPPPMGMYPPMAP 311

Query: 781  XXXXXXPCGVPPYMVKPQGSQVPLVPIPRLKSQQAASAPRVSKKAECKKGEGKTKKFAXX 602
                  PC  PPYMVK QGSQVPL+PIPRLK Q      + +KK+E KK E KTKK A  
Sbjct: 312  MPYPWMPC--PPYMVKQQGSQVPLIPIPRLKPQNTLGTSK-AKKSESKKSEAKTKKVASI 368

Query: 601  XXXXXXXXXXXXXXXVPMVNMSFGGVRETFSGG--SGYTEERFYEKHHGRVLTVNGNMSG 428
                            P+VN+++GG+   F G   S Y  ++ Y +H  RVL  + + +G
Sbjct: 369  SFLGLLFCLFLFGALAPIVNVNYGGISGAFYGNYRSNYITDQIYSQHRDRVLDTSRSGAG 428

Query: 427  ADYSEKFG-ENGKEF------------------NCSEPLVASLYVPRNDKMVKIDGNLII 305
               S   G   G++                   N SEPLVASL+VPRNDK+VKIDGNL+I
Sbjct: 429  TGVSNSNGMHRGRDSDRGARKNISATESSVTPGNGSEPLVASLFVPRNDKLVKIDGNLVI 488

Query: 304  HSVLASEKAMESLRDPANKAGSETSLAVSRELAPVVPVSG-GR----SPHLYRNPSEHQR 140
            +S+LASEKA+ S R  +     +  L +S++  P +P+   GR    + HLYR+ +E Q+
Sbjct: 489  NSILASEKAVAS-RKASESKERKADLMISKDYTPALPLPDVGRTEELAKHLYRSKAEKQK 547

Query: 139  AL--GSTSVGKENLKSPATDGRLQQWFREGLAGPMLSSGMCTEVFQFD 2
            AL  GS    K+ +K+ A +G +QQWFREG+AGPM SSGMCTEVFQFD
Sbjct: 548  ALSSGSADTLKDQVKTKAANGEMQQWFREGVAGPMFSSGMCTEVFQFD 595

