BLASTX nr result

ID: Cephaelis21_contig00001056 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00001056
         (1470 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249...   170   8e-40
ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thali...   163   1e-37
ref|XP_002517633.1| Nucleosome-binding protein, putative [Ricinu...   161   5e-37
gb|AAN60295.1| unknown [Arabidopsis thaliana]                         160   1e-36
ref|NP_001240074.1| uncharacterized protein LOC100776785 [Glycin...   156   1e-35

>ref|XP_002267149.1| PREDICTED: uncharacterized protein LOC100249094 isoform 1 [Vitis
            vinifera]
          Length = 324

 Score =  170 bits (431), Expect = 8e-40
 Identities = 122/306 (39%), Positives = 164/306 (53%), Gaps = 17/306 (5%)
 Frame = -1

Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018
            LPAP SWKK+++PKK GTPRK+EIVF+APTGEEI++R+QL+QYLKSHPG+P+ISEFDW T
Sbjct: 12   LPAPPSWKKMFMPKK-GTPRKNEIVFIAPTGEEINSRKQLEQYLKSHPGNPAISEFDWGT 70

Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKEST------NDGNXXXXXXX 856
            GETPRRSARISEKA             KR R S G+KKD + T       +G        
Sbjct: 71   GETPRRSARISEKA-KATPPAESEPPKKRGRKSSGSKKDGKETEAATEEQEGKKEISMQD 129

Query: 855  KMPVAGANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTE--PKLKVNGQEENSATDV 682
                  AN + +D   +      V    E  +    D N+ E  P   ++G++E   +D 
Sbjct: 130  ADVTEKANAESEDVSKEIQVENGVKTVAEADQVKNLDVNMEEAGPVEAIDGKDEKIQSDT 189

Query: 681  GTEK-ARDEGSTADTVVANQPSE-KVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDS 508
            G  K A  E    +   A    E K ++  E+ A+ +P  EAG  A     E DK+   +
Sbjct: 190  GDSKVAATETEVVNAEEAQGEKEVKKQEVAEAVAVDEPAKEAG--AKVTQKEGDKLETSA 247

Query: 507  VVVNNGA--EGQRTDVVVPSVGEIKGQ----ENDGKLKAQVERE-NNMKGVVIENGKVDQ 349
             V  N A  + +   + +    E+K +    +NDGK K QVE     ++G V ENGKV+Q
Sbjct: 248  TVELNEAVNKDKPNGLGIAPEEEVKEKQEVPDNDGKCKFQVEENGKKLEGDVTENGKVNQ 307

Query: 348  SGPRET 331
                ET
Sbjct: 308  MQRAET 313


>ref|NP_563971.1| methyl-CPG-binding domain 10 [Arabidopsis thaliana]
            gi|75215632|sp|Q9XI36.1|MBD10_ARATH RecName:
            Full=Methyl-CpG-binding domain-containing protein 10;
            Short=AtMBD10; Short=MBD10; AltName:
            Full=Methyl-CpG-binding protein MBD10
            gi|5103831|gb|AAD39661.1|AC007591_26 ESTs gb|H37032,
            gb|R6425, gb|Z34651, gb|N37268, gb|AA713172 and gb|Z34241
            come from this gene [Arabidopsis thaliana]
            gi|20453139|gb|AAM19811.1| At1g15340/F9L1_28 [Arabidopsis
            thaliana] gi|56382007|gb|AAV85722.1| At1g15340
            [Arabidopsis thaliana] gi|332191184|gb|AEE29305.1|
            methyl-CPG-binding domain 10 [Arabidopsis thaliana]
          Length = 384

 Score =  163 bits (412), Expect = 1e-37
 Identities = 100/275 (36%), Positives = 143/275 (52%), Gaps = 4/275 (1%)
 Frame = -1

Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018
            LPAPASWKKL+ PK+AGTPRK+EIVFVAPTGEEIS+R+QL+QYLK+HPG+P ISEF+W+T
Sbjct: 12   LPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPGNPVISEFEWTT 71

Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKESTNDGNXXXXXXXKMPVAG 838
            GETPRRS+RIS+K              K+ R+S+ TKKD +   + N        M V  
Sbjct: 72   GETPRRSSRISQKVKATTPTPDKEPLLKKRRSSL-TKKDNKEAAEKNEEAAVKENMDVDK 130

Query: 837  ANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQE----ENSATDVGTEK 670
              + ++   +K  +   VT+  E  K + E       K+   G++    +   T++   +
Sbjct: 131  DGKTENAEAEKEKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGKEGQTEIAEAE 190

Query: 669  ARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDSVVVNNG 490
               EG  A+    N+ +E V  K ES  +   E+E    +     E  KV          
Sbjct: 191  KEKEGEKAE--AENKEAEVVRDKKESMEVDTSELEKKAGSGEGAEEPSKVEGLKDTEMKE 248

Query: 489  AEGQRTDVVVPSVGEIKGQENDGKLKAQVERENNM 385
            A+   T+  V      +  EN G +  +   E N+
Sbjct: 249  AQEVVTEADVEKKPAEEKTENKGSVTTEANGEQNV 283


>ref|XP_002517633.1| Nucleosome-binding protein, putative [Ricinus communis]
            gi|223543265|gb|EEF44797.1| Nucleosome-binding protein,
            putative [Ricinus communis]
          Length = 331

 Score =  161 bits (407), Expect = 5e-37
 Identities = 113/310 (36%), Positives = 164/310 (52%), Gaps = 26/310 (8%)
 Frame = -1

Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018
            LPAP+SWKK+Y PK+AGTPRKSEI+F++PTGEEI++R+QL+QYLKSHPG+P I+EFDW T
Sbjct: 12   LPAPSSWKKMYFPKRAGTPRKSEIMFISPTGEEINSRKQLEQYLKSHPGNPPIAEFDWGT 71

Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDK---ESTNDGNXXXXXXXKMP 847
            GETPRRSARISEK              +  ++S G+KKD    ES ++            
Sbjct: 72   GETPRRSARISEKVKATPTPEKEPAKKRGRKSSSGSKKDNKETESAHEKGEYEKEIQMQD 131

Query: 846  VAGANEKKDDGGDKPSDTP---AVTQGEEEGKA-HYEDRNVTEPKLKVNGQEENSATDVG 679
              GA ++ ++ G K +D      + +G+++ +A   E+ ++ E   K   ++     D G
Sbjct: 132  ADGAGKENEEAG-KENDVAKEGLIEKGDKKEEAGQTENADIEETAQKQVNKDTGVQEDAG 190

Query: 678  TEKARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGK-DASADTAEQDKVGNDS-- 508
             +KA  E       +  Q  E  E+ L+ K+ +    EAG  +  A+   Q +VG ++  
Sbjct: 191  EDKAGPE--NLQQAMEVQEQENPEEALKKKSAE----EAGSGEGIAENVLQTEVGKENDQ 244

Query: 507  -----------VVVNNGAEGQRTDVVVP-SVGEIKG----QENDGKLKAQVERENNMKGV 376
                          N GA  +  +  VP S  EIK     QE D      + +   M G 
Sbjct: 245  GDKMDIPESVPKEANGGAAKENANGAVPVSEEEIKEKPDLQEKDNTPVDGISK--TMDGE 302

Query: 375  VIENGKVDQS 346
            V ENGKV+Q+
Sbjct: 303  VTENGKVNQT 312


>gb|AAN60295.1| unknown [Arabidopsis thaliana]
          Length = 346

 Score =  160 bits (404), Expect = 1e-36
 Identities = 99/275 (36%), Positives = 140/275 (50%), Gaps = 4/275 (1%)
 Frame = -1

Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018
            LPAPASWKKL+ PK+AGTPRK+EIVFVAPTGEEIS+R+QL+QYLK+HPG+P ISEF+W+T
Sbjct: 12   LPAPASWKKLFYPKRAGTPRKTEIVFVAPTGEEISSRKQLEQYLKAHPGNPVISEFEWTT 71

Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKDKESTNDGNXXXXXXXKMPVAG 838
            GETPRRS+RIS+K               + R S  TKKD +   + N        M V  
Sbjct: 72   GETPRRSSRISQKV----KATPDKEPLLKKRRSSLTKKDNKEAAEKNEEAAVKENMDVDK 127

Query: 837  ANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQE----ENSATDVGTEK 670
              + ++   +K  +   VT+  E  K + E       K+   G++    +   T++   +
Sbjct: 128  DGKTENAEAEKEKEKEGVTEIAEAEKENNEGEKTEAEKVNKEGEKTEAGKEGQTEIAEAE 187

Query: 669  ARDEGSTADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDKVGNDSVVVNNG 490
               EG  A+    N+ +E V  K ES  +   E+E    +     E  KV          
Sbjct: 188  KEKEGEKAE--AENKEAEVVRDKKESMEVDTSELEKKAGSGEGAEEPSKVEGLKDTEMKE 245

Query: 489  AEGQRTDVVVPSVGEIKGQENDGKLKAQVERENNM 385
            A+   T+  V      +  EN G +  +   E N+
Sbjct: 246  AQEVVTEADVEKKPAEEKTENKGSVTTEANGEQNV 280


>ref|NP_001240074.1| uncharacterized protein LOC100776785 [Glycine max]
            gi|255645971|gb|ACU23474.1| unknown [Glycine max]
          Length = 275

 Score =  156 bits (395), Expect = 1e-35
 Identities = 99/232 (42%), Positives = 126/232 (54%), Gaps = 7/232 (3%)
 Frame = -1

Query: 1197 LPAPASWKKLYLPKKAGTPRKSEIVFVAPTGEEISTRRQLDQYLKSHPGSPSISEFDWST 1018
            LPAP+ W KL+ PKK GTPRKSEIVF+APTGEEIST++QL+QYLK+HPG+P ISEFDW T
Sbjct: 19   LPAPSGWNKLFFPKKLGTPRKSEIVFIAPTGEEISTKKQLEQYLKAHPGNPVISEFDWGT 78

Query: 1017 GETPRRSARISEKAXXXXXXXXXXXXXKRARTSVGTKKD-KESTNDGNXXXXXXXKMPVA 841
            GETPRRSARISEK              KRAR S G+KKD KE+ +               
Sbjct: 79   GETPRRSARISEKV-KSTPPADSDTPKKRARKSSGSKKDNKETESASEEGKAKSDTEDPK 137

Query: 840  GANEKKDDGGDKPSDTPAVTQGEEEGKAHYEDRNVTEPKLKVNGQEENSATDVGTEKARD 661
             A E+K++G D  +      Q E   K    D    +P + +   EEN   D   +   D
Sbjct: 138  AAEEEKNEGND--NSNSGGKQLENGDKTEQIDEQAKKPDVDM---EENDLNDTNNKLEND 192

Query: 660  EGS------TADTVVANQPSEKVEQKLESKAIQKPEIEAGKDASADTAEQDK 523
                       + V+A +P  +  QK E +  +K   EA     A+TAE +K
Sbjct: 193  SDEIKNSHVNGENVIAERPEGEEAQKQEVEPAEKVAEEA-----ANTAETEK 239


Top