BLASTX nr result

ID: Cephaelis21_contig00004494 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00004494
         (906 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002307065.1| predicted protein [Populus trichocarpa] gi|2...   322   6e-86
ref|XP_002515604.1| Williams-Beuren syndrome chromosome region 1...   315   9e-84
ref|XP_002273073.1| PREDICTED: E3 ubiquitin-protein ligase HERC2...   312   6e-83
ref|XP_002873378.1| hypothetical protein ARALYDRAFT_487713 [Arab...   305   7e-81
ref|NP_680156.2| regulator of chromosome condensation repeat-con...   298   9e-79

>ref|XP_002307065.1| predicted protein [Populus trichocarpa] gi|222856514|gb|EEE94061.1|
           predicted protein [Populus trichocarpa]
          Length = 419

 Score =  322 bits (826), Expect = 6e-86
 Identities = 159/264 (60%), Positives = 195/264 (73%), Gaps = 2/264 (0%)
 Frame = -3

Query: 799 IIDPRFGFMFQRWISMAGCENAKDSGKHSAAVWGNGDYGRLGLGNLESQWSPRILGASVF 620
           +I    G+ ++RW+S +  E  K     SAAVWGNGDYGRLG GNL+S W P+++ +S F
Sbjct: 7   LIGGGLGYCYKRWMSSSSSEGRK----RSAAVWGNGDYGRLGYGNLDSMWRPKLMNSSSF 62

Query: 619 GNQNLREIACGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRI 440
            N NL+ I+CGGAHTLFLTE G VY TGLNDFGQLG+S+  +Y   P  V GL KEI++I
Sbjct: 63  HNSNLKSISCGGAHTLFLTETGRVYATGLNDFGQLGVSNNTTYCMEPLEVSGLKKEIVQI 122

Query: 439 SAGYHHSSAITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIRITSLGCEH 260
           SAGYHHS AITVDGELY WGKNSNGQLGLGKKA  +V +P KVECL+G+ I++ +L  EH
Sbjct: 123 SAGYHHSCAITVDGELYTWGKNSNGQLGLGKKAENVVPVPTKVECLSGINIKMVALASEH 182

Query: 259 SVAVTDKGEALSW--XXXXXXXXXXXXXXXGFVKSSSEFTPRLIKVLEGVKVKSVAAGVL 86
           S+AVTD G+ALSW                 GF +SSSE+TPR IK LEGVKVK++AAG+L
Sbjct: 183 SIAVTDGGQALSWGGGGSGRLGHGHQSSLLGFFRSSSEYTPRHIKKLEGVKVKNIAAGLL 242

Query: 85  HSACIDENGLLYIFGERAKEKLGY 14
           HSACIDENG +YIFGE+A +KL +
Sbjct: 243 HSACIDENGSVYIFGEKAVDKLAF 266



 Score = 74.7 bits (182), Expect = 3e-11
 Identities = 60/190 (31%), Positives = 91/190 (47%), Gaps = 21/190 (11%)
 Frame = -3

Query: 727 SGKHSAAV--------WGNGDYGRLGLGNLES---------QWSPRILGASVFGNQNLRE 599
           + +HS AV        WG G  GRLG G+  S         +++PR +         ++ 
Sbjct: 179 ASEHSIAVTDGGQALSWGGGGSGRLGHGHQSSLLGFFRSSSEYTPRHI--KKLEGVKVKN 236

Query: 598 IACGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHS 419
           IA G  H+  + ENG+VY  G     +L   DA + TT P  +  LP     ++ G +H+
Sbjct: 237 IAAGLLHSACIDENGSVYIFGEKAVDKLAFGDANNATT-PSMIGKLPYSQ-EVACGGYHT 294

Query: 418 SAITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIRI----TSLGCEHSVA 251
             IT  GELY WG N NG LG G  +  ++ +P +VE   G  +R      S G +H+ A
Sbjct: 295 CVITSGGELYTWGSNENGCLGNG--SIDVLHIPERVE---GPFLRSPVEKVSCGWKHTAA 349

Query: 250 VTDKGEALSW 221
           +++ G   +W
Sbjct: 350 ISE-GNVFTW 358


>ref|XP_002515604.1| Williams-Beuren syndrome chromosome region 16 protein, putative
           [Ricinus communis] gi|223545242|gb|EEF46749.1|
           Williams-Beuren syndrome chromosome region 16 protein,
           putative [Ricinus communis]
          Length = 374

 Score =  315 bits (807), Expect = 9e-84
 Identities = 159/257 (61%), Positives = 193/257 (75%), Gaps = 2/257 (0%)
 Frame = -3

Query: 772 FQRWISMAGCENAKDSGKHSAAVWGNGDYGRLGLGNLESQWSPRILGASVFGNQNLREIA 593
           ++RW+S        ++GK  AAVWGNGD+GRLG G+L+SQW P++L +S F N +L+ IA
Sbjct: 10  YRRWMS-------SEAGKRYAAVWGNGDFGRLGTGSLDSQWRPKLLLSSCFANHSLKSIA 62

Query: 592 CGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHSSA 413
           CGGAHTLFLTE G VY TGLNDFGQLGIS   SYTT P  V GL KEI++ISAGYHHS A
Sbjct: 63  CGGAHTLFLTETGCVYATGLNDFGQLGISGNLSYTTEPVKVSGLQKEIMQISAGYHHSCA 122

Query: 412 ITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIRITSLGCEHSVAVTDKGE 233
           ITVDGELYMWG+NSNGQLGLGKKA ++V LP KVE LNG+TI++ +LG EHS+AVTD+GE
Sbjct: 123 ITVDGELYMWGRNSNGQLGLGKKAQRIVPLPTKVEYLNGLTIKLVALGSEHSIAVTDRGE 182

Query: 232 ALSW--XXXXXXXXXXXXXXXGFVKSSSEFTPRLIKVLEGVKVKSVAAGVLHSACIDENG 59
           ALSW                  F  S+SE+TPR IK LE VKVKSVAAG+LHSACID NG
Sbjct: 183 ALSWGLGGFGRLGHSNQSGTFRFWTSTSEYTPRSIKKLEEVKVKSVAAGLLHSACIDVNG 242

Query: 58  LLYIFGERAKEKLGYEE 8
            +++FGE + ++ G+ E
Sbjct: 243 SVFVFGEVSVDRTGFGE 259



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 41/130 (31%), Positives = 60/130 (46%), Gaps = 9/130 (6%)
 Frame = -3

Query: 712 AAVWGNGDYGRLGLGNLE---------SQWSPRILGASVFGNQNLREIACGGAHTLFLTE 560
           A  WG G +GRLG  N           S+++PR +         ++ +A G  H+  +  
Sbjct: 183 ALSWGLGGFGRLGHSNQSGTFRFWTSTSEYTPRSI--KKLEEVKVKSVAAGLLHSACIDV 240

Query: 559 NGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHSSAITVDGELYMWG 380
           NG+V+  G     + G  +A      P  V  LP     +S G +H+  +T  GELY WG
Sbjct: 241 NGSVFVFGEVSVDRTGFGEAIGAEI-PSMVGKLPCAN-EVSCGGYHTCVVTSGGELYAWG 298

Query: 379 KNSNGQLGLG 350
            N NG LG+G
Sbjct: 299 VNENGCLGIG 308


>ref|XP_002273073.1| PREDICTED: E3 ubiquitin-protein ligase HERC2 [Vitis vinifera]
           gi|296082630|emb|CBI21635.3| unnamed protein product
           [Vitis vinifera]
          Length = 420

 Score =  312 bits (800), Expect = 6e-83
 Identities = 160/274 (58%), Positives = 199/274 (72%), Gaps = 2/274 (0%)
 Frame = -3

Query: 823 SSTNIAKPIIDPRFGFMFQRWISMAGCENAKDSGKHSAAVWGNGDYGRLGLGNLESQWSP 644
           S   +++  +  + G  F R +S      + +  K  AA+WGNGD+GRLGLG+LESQW P
Sbjct: 4   SCMRVSQSALQSKLGLGFCRRLS------SSEPRKRFAALWGNGDFGRLGLGSLESQWRP 57

Query: 643 RILGASVFGNQNLREIACGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCG 464
            +   S F + +L  IACGGAHTLFLTE+G VY  GLNDFGQLG+S  K+YTT P  V  
Sbjct: 58  AV--CSAFDHHSLVAIACGGAHTLFLTESGCVYAAGLNDFGQLGVSVDKNYTTEPLEVSA 115

Query: 463 LPKEIIRISAGYHHSSAITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIR 284
           LPK+II I+AGY+HS+AIT DGELYMWGKNSNGQLGLGKKAA  VS+P KVECLNG++I+
Sbjct: 116 LPKKIIHIAAGYYHSAAITADGELYMWGKNSNGQLGLGKKAANAVSVPSKVECLNGISIK 175

Query: 283 ITSLGCEHSVAVTDKGEALSW--XXXXXXXXXXXXXXXGFVKSSSEFTPRLIKVLEGVKV 110
           + +LG EHSVA TD+GEALSW                 GF ++SSE+ PRLI+ LEG+KV
Sbjct: 176 MVALGSEHSVAATDQGEALSWGAGGSGRLGHGHESSLLGFFRTSSEYRPRLIRRLEGIKV 235

Query: 109 KSVAAGVLHSACIDENGLLYIFGERAKEKLGYEE 8
           K+VAAG+LHSACIDENG ++IFGERA +K G+ E
Sbjct: 236 KNVAAGLLHSACIDENGSVFIFGERAMDKFGFRE 269



 Score = 72.4 bits (176), Expect = 1e-10
 Identities = 60/239 (25%), Positives = 105/239 (43%), Gaps = 15/239 (6%)
 Frame = -3

Query: 736 AKDSGKHSAAVWGNGDYGRLGLGN---------LESQWSPRILGASVFGNQNLREIACGG 584
           A D G+  A  WG G  GRLG G+           S++ PR++         ++ +A G 
Sbjct: 187 ATDQGE--ALSWGAGGSGRLGHGHESSLLGFFRTSSEYRPRLIRR--LEGIKVKNVAAGL 242

Query: 583 AHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHSSAITV 404
            H+  + ENG+V+  G     + G  +AK+ T  P  +  LP    +++ G +H+  I+ 
Sbjct: 243 LHSACIDENGSVFIFGERAMDKFGFREAKNATA-PSMISELPYSK-QVACGGYHTCVISS 300

Query: 403 DGELYMWGKNSNGQLGLGKKAAKLVSLPRKVE-CLNGVTIRITSLGCEHSVAVTDKGEAL 227
            GEL+ WG N NG LG+G    + +  P ++E   +   +   S G +H+ A+++ G   
Sbjct: 301 SGELFTWGSNENGCLGMG--FMETIHFPERIEGPFSKNPVSQVSCGWKHTAAISE-GNVF 357

Query: 226 SWXXXXXXXXXXXXXXXGFVK-----SSSEFTPRLIKVLEGVKVKSVAAGVLHSACIDE 65
           +W                  +           P++++  E V+   V+ G  H+  I E
Sbjct: 358 TWGWGGSYGTFSDDGHSSGGQLGQGSDVDHIKPKMVEFEESVRALQVSCGFNHTGAILE 416


>ref|XP_002873378.1| hypothetical protein ARALYDRAFT_487713 [Arabidopsis lyrata subsp.
           lyrata] gi|297319215|gb|EFH49637.1| hypothetical protein
           ARALYDRAFT_487713 [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  305 bits (782), Expect = 7e-81
 Identities = 153/258 (59%), Positives = 191/258 (74%), Gaps = 2/258 (0%)
 Frame = -3

Query: 781 GFMFQRWISMAGCENAKDSGKHSAAVWGNGDYGRLGLGNLESQWSPRILGASVFGNQNLR 602
           G   +RW+S        +SGK  AA+WG+GDYGRLGLGNL+SQW+P   G S   + ++R
Sbjct: 35  GVSCRRWLS-------NESGKRFAAMWGSGDYGRLGLGNLDSQWTPA--GCSALSDHSIR 85

Query: 601 EIACGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHH 422
            +ACGGAHTLFLTE   V+ TGLND GQLG+SD KS+  +P  V GL K+I+ ISAGY+H
Sbjct: 86  AVACGGAHTLFLTETRRVFATGLNDCGQLGVSDVKSHAMDPLEVSGLDKDILHISAGYYH 145

Query: 421 SSAITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIRITSLGCEHSVAVTD 242
           S+AITVDGELYMWGKNS+GQLGLGKKAA++V +P KVE L+G+TI+  +LG EHSVAVTD
Sbjct: 146 SAAITVDGELYMWGKNSSGQLGLGKKAARVVRVPTKVEALHGITIQSVALGSEHSVAVTD 205

Query: 241 KGEALSWXXXXXXXXXXXXXXXGF--VKSSSEFTPRLIKVLEGVKVKSVAAGVLHSACID 68
            GE LSW                F  ++S+SEFTPRLIK LEG+KVK+VAAG+LHSAC D
Sbjct: 206 GGEVLSWGGGGSGRLGHGHQSSLFGILRSNSEFTPRLIKELEGIKVKNVAAGLLHSACTD 265

Query: 67  ENGLLYIFGERAKEKLGY 14
           ENG  ++FGER+  K+G+
Sbjct: 266 ENGSAFMFGERSINKMGF 283



 Score = 63.9 bits (154), Expect = 5e-08
 Identities = 51/185 (27%), Positives = 85/185 (45%), Gaps = 18/185 (9%)
 Frame = -3

Query: 721 KHSAAV--------WGNGDYGRLGLGNLES---------QWSPRILGASVFGNQNLREIA 593
           +HS AV        WG G  GRLG G+  S         +++PR++         ++ +A
Sbjct: 198 EHSVAVTDGGEVLSWGGGGSGRLGHGHQSSLFGILRSNSEFTPRLI--KELEGIKVKNVA 255

Query: 592 CGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHSSA 413
            G  H+    ENG+ +  G     ++G    ++ TT P  +  +P     ++ G +H+  
Sbjct: 256 AGLLHSACTDENGSAFMFGERSINKMGFGGVRNATT-PSIISEVPYAE-GVACGGYHTCV 313

Query: 412 ITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVE-CLNGVTIRITSLGCEHSVAVTDKG 236
           +T  GELY WG N NG   LG  +  +   P +VE      T+   S G +H+ A++D  
Sbjct: 314 VTRGGELYTWGSNENG--CLGTDSTYVSHSPVRVEGPFLESTVSQVSCGWKHTAAISD-N 370

Query: 235 EALSW 221
           +  +W
Sbjct: 371 KVFTW 375


>ref|NP_680156.2| regulator of chromosome condensation repeat-containing protein
           [Arabidopsis thaliana] gi|26452773|dbj|BAC43467.1|
           unknown protein [Arabidopsis thaliana]
           gi|28973187|gb|AAO63918.1| putative UVB-resistance
           protein UVR8 [Arabidopsis thaliana]
           gi|332003957|gb|AED91340.1| regulator of chromosome
           condensation repeat-containing protein [Arabidopsis
           thaliana]
          Length = 434

 Score =  298 bits (764), Expect = 9e-79
 Identities = 149/258 (57%), Positives = 188/258 (72%), Gaps = 2/258 (0%)
 Frame = -3

Query: 781 GFMFQRWISMAGCENAKDSGKHSAAVWGNGDYGRLGLGNLESQWSPRILGASVFGNQNLR 602
           G    RW+S        +SGK  AA+WG+GDYGRLGLGNL+SQW+P +   S   + ++ 
Sbjct: 35  GVCCSRWVS-------SESGKRFAAMWGSGDYGRLGLGNLDSQWTPAV--CSALSDHSIT 85

Query: 601 EIACGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHH 422
            +ACGGAHTLFLTE   V+ TGLND GQLG+SD KS+  +P  V GL K+I+ ISAGY+H
Sbjct: 86  AVACGGAHTLFLTETRRVFATGLNDCGQLGVSDVKSHAMDPLEVSGLDKDILHISAGYYH 145

Query: 421 SSAITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVECLNGVTIRITSLGCEHSVAVTD 242
           S+AITVDGELYMWGKNS+GQLGLGKKAA++V +P KVE L+G+TI+  +LG EHSVAVTD
Sbjct: 146 SAAITVDGELYMWGKNSSGQLGLGKKAARVVRVPTKVEALHGITIQSVALGSEHSVAVTD 205

Query: 241 KGEALSWXXXXXXXXXXXXXXXGF--VKSSSEFTPRLIKVLEGVKVKSVAAGVLHSACID 68
            GE LSW                F  ++S+SEFTPRLIK LEG+KV +VAAG+LHSAC D
Sbjct: 206 GGEVLSWGGGGSGRLGHGHQSSLFGILRSNSEFTPRLIKELEGIKVTNVAAGLLHSACTD 265

Query: 67  ENGLLYIFGERAKEKLGY 14
           ENG  ++FGE++  K+G+
Sbjct: 266 ENGSAFMFGEKSINKMGF 283



 Score = 63.2 bits (152), Expect = 8e-08
 Identities = 50/178 (28%), Positives = 81/178 (45%), Gaps = 18/178 (10%)
 Frame = -3

Query: 721 KHSAAV--------WGNGDYGRLGLGNLES---------QWSPRILGASVFGNQNLREIA 593
           +HS AV        WG G  GRLG G+  S         +++PR++         +  +A
Sbjct: 198 EHSVAVTDGGEVLSWGGGGSGRLGHGHQSSLFGILRSNSEFTPRLI--KELEGIKVTNVA 255

Query: 592 CGGAHTLFLTENGNVYGTGLNDFGQLGISDAKSYTTNPHPVCGLPKEIIRISAGYHHSSA 413
            G  H+    ENG+ +  G     ++G    ++ TT P  +  +P     ++ G +H+  
Sbjct: 256 AGLLHSACTDENGSAFMFGEKSINKMGFGGVRNATT-PSIISEVPYAE-EVACGGYHTCV 313

Query: 412 ITVDGELYMWGKNSNGQLGLGKKAAKLVSLPRKVE-CLNGVTIRITSLGCEHSVAVTD 242
           +T  GELY WG N NG   LG  +  +   P +VE      T+   S G +H+ A++D
Sbjct: 314 VTRGGELYTWGSNENG--CLGTDSTYVSHSPVRVEGPFLESTVSQVSCGWKHTAAISD 369


Top