BLASTX nr result

ID: Cephaelis21_contig00011163 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00011163
         (1586 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   251   5e-64
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   244   5e-62
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 234   5e-59
ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsi...   234   5e-59
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   234   5e-59

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  251 bits (640), Expect = 5e-64
 Identities = 133/246 (54%), Positives = 162/246 (65%), Gaps = 2/246 (0%)
 Frame = -1

Query: 1280 KSQRKASKEVRVVSPYFPKPGKKEELVKTENHLKTSSQKSQQNTRIPVVKISPYFQNTSK 1101
            K   +   +VR VSP F     ++E +K +        K  +   + V  +SPYFQ   K
Sbjct: 362  KPNSRVHIQVRKVSPNFNLSIGQQECMKIK------PLKPCERVGLTVRNVSPYFQKVPK 415

Query: 1100 QGEIGVADP-MIDATE-IIVLPKRSQMAEVIKPTFSAAQKRDEAYERRTSDNLWKPPKSP 927
            Q E   AD  MID       LP++ +       T SAA+KR EAY R+T DN WKPP+S 
Sbjct: 416  QEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSD 475

Query: 926  HNLLQEDHAHDPWRVLVICMLLNRTTGLQASRVINELFTLCPNAQSAASVDPQYIEKVIQ 747
              LLQEDHA DPWRVLVICMLLN TTG Q   VI++ FTLCP+A++A     + IEK+I 
Sbjct: 476  FGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKTEEIEKIIV 535

Query: 746  SLGLHKKRAVMIRRFSEEYLGESWTHVTELHGIGKYAADAYAIFCTGKWDRVKPLDHMLN 567
             LGL KKRAVMI+R S+EYL + WTHVT+LHG+GKYAADAYAIFCTGKWD+V+P DHMLN
Sbjct: 536  PLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPKDHMLN 595

Query: 566  KYWDFL 549
             YWDFL
Sbjct: 596  YYWDFL 601


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  244 bits (623), Expect = 5e-62
 Identities = 135/287 (47%), Positives = 175/287 (60%), Gaps = 26/287 (9%)
 Frame = -1

Query: 1328 KEEINGDTLGKTEIDGKSQRKASKEVRVVSPYFPKPGKKEELVKTENHLKTSSQKSQQNT 1149
            KEE + D++      G++  K   +V +VSPYF    +   + +  + + +SSQ  +   
Sbjct: 150  KEECDSDSVCSQS--GRNCSKVQAKVPIVSPYF----QSSTISQCGSDIVSSSQSGKNYR 203

Query: 1148 R------IPVVKISPYFQNTSKQGEIGVADPM----------------IDATEIIVLPK- 1038
            R        V + SPYFQ ++   +   A P                  D  ++    K 
Sbjct: 204  RGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVSRYFHADGIQVNESQKE 263

Query: 1037 ---RSQMAEVIKPTFSAAQKRDEAYERRTSDNLWKPPKSPHNLLQEDHAHDPWRVLVICM 867
               R +   V+ P+ S +QK DEAY+R+T D  W PP+SP NLLQE H HDPWRVLVICM
Sbjct: 264  KSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVICM 323

Query: 866  LLNRTTGLQASRVINELFTLCPNAQSAASVDPQYIEKVIQSLGLHKKRAVMIRRFSEEYL 687
            LLN+T+G Q   VI +LF LCP+A++A  V+ + IE +I+ LGL KKRA MI+RFS EYL
Sbjct: 324  LLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLEYL 383

Query: 686  GESWTHVTELHGIGKYAADAYAIFCTGKWDRVKPLDHMLNKYWDFLR 546
             ESWTHVT+LHGIGKYAADAYAIFC G WDRVKP DHMLN YW+FLR
Sbjct: 384  QESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFLR 430


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  234 bits (597), Expect = 5e-59
 Identities = 124/280 (44%), Positives = 168/280 (60%), Gaps = 25/280 (8%)
 Frame = -1

Query: 1310 DTLGKTEIDGKSQRKASKEVRVVSPYFPKPGKKEELVKTENHLKTSSQ------KSQQNT 1149
            D++  + I+ +   K   +V  VSPYF    +   + + ++ + +SSQ      K     
Sbjct: 127  DSVSDSHIERQECSKVQAKVPRVSPYF----QASTISQCDSDIVSSSQSGRNYRKGSSKR 182

Query: 1148 RIPVVKISPYFQNTSKQGEIGVADPMI---------------DATEIIVLPKRS----QM 1026
            ++   ++SPYFQ ++   +   A   +               D  ++    K      + 
Sbjct: 183  QVKARRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRK 242

Query: 1025 AEVIKPTFSAAQKRDEAYERRTSDNLWKPPKSPHNLLQEDHAHDPWRVLVICMLLNRTTG 846
              ++ P  S +QK D+ Y R+T DN W PP+SP NLLQEDH HDPWRVLVICMLLN+T+G
Sbjct: 243  TPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSG 302

Query: 845  LQASRVINELFTLCPNAQSAASVDPQYIEKVIQSLGLHKKRAVMIRRFSEEYLGESWTHV 666
             Q   VI++LF LC +A++A  V  + IE +I+ LGL KKR  MI+R S EYL ESWTHV
Sbjct: 303  AQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHV 362

Query: 665  TELHGIGKYAADAYAIFCTGKWDRVKPLDHMLNKYWDFLR 546
            T+LHG+GKYAADAYAIFC G WDRVKP DHMLN YWD+LR
Sbjct: 363  TQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLR 402


>ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
            gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
            thaliana] gi|332641100|gb|AEE74621.1| methyl-CpG-binding
            domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score =  234 bits (597), Expect = 5e-59
 Identities = 137/325 (42%), Positives = 182/325 (56%)
 Frame = -1

Query: 1520 IEVQVASPYFVNSMCEEEVLKRGKLLQNTSKKLHQKTGTSRTLGAPEAVLLSCTSKELKE 1341
            +EV+  SPYF  S   ++  K G      S  +  K G S+       V     +  + +
Sbjct: 144  VEVRRVSPYFQGSTVSQQS-KEGC----DSDSVCSKEGCSKVQAKVPRVSPYFQASTISQ 198

Query: 1340 TVDFKEEINGDTLGKTEIDGKSQRKASKEVRVVSPYFPKPGKKEELVKTENHLKTSSQKS 1161
                 + ++    G+    G S+R+   +VR VSPYF +    E+  +    L+   +  
Sbjct: 199  CDS--DIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSEQPNQAPKGLRNYFK-- 252

Query: 1160 QQNTRIPVVKISPYFQNTSKQGEIGVADPMIDATEIIVLPKRSQMAEVIKPTFSAAQKRD 981
                   VVK+S YF     Q      +            +  +   ++ P  S +QK D
Sbjct: 253  -------VVKVSRYFHADGIQVNESQKEKS----------RNVRKTPIVSPVLSLSQKTD 295

Query: 980  EAYERRTSDNLWKPPKSPHNLLQEDHAHDPWRVLVICMLLNRTTGLQASRVINELFTLCP 801
            + Y R+T DN W PP+SP NLLQEDH HDPWRVLVICMLLN+T+G Q   VI++LF LC 
Sbjct: 296  DVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCT 355

Query: 800  NAQSAASVDPQYIEKVIQSLGLHKKRAVMIRRFSEEYLGESWTHVTELHGIGKYAADAYA 621
            +A++A  V  + IE +I+ LGL KKR  MI+R S EYL ESWTHVT+LHG+GKYAADAYA
Sbjct: 356  DAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYA 415

Query: 620  IFCTGKWDRVKPLDHMLNKYWDFLR 546
            IFC G WDRVKP DHMLN YWD+LR
Sbjct: 416  IFCNGNWDRVKPNDHMLNYYWDYLR 440


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  234 bits (597), Expect = 5e-59
 Identities = 137/325 (42%), Positives = 182/325 (56%)
 Frame = -1

Query: 1520 IEVQVASPYFVNSMCEEEVLKRGKLLQNTSKKLHQKTGTSRTLGAPEAVLLSCTSKELKE 1341
            +EV+  SPYF  S   ++  K G      S  +  K G S+       V     +  + +
Sbjct: 118  VEVRRVSPYFQGSTVSQQS-KEGC----DSDSVCSKEGCSKVQAKVPRVSPYFQASTISQ 172

Query: 1340 TVDFKEEINGDTLGKTEIDGKSQRKASKEVRVVSPYFPKPGKKEELVKTENHLKTSSQKS 1161
                 + ++    G+    G S+R+   +VR VSPYF +    E+  +    L+   +  
Sbjct: 173  CDS--DIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSEQPNQAPKGLRNYFK-- 226

Query: 1160 QQNTRIPVVKISPYFQNTSKQGEIGVADPMIDATEIIVLPKRSQMAEVIKPTFSAAQKRD 981
                   VVK+S YF     Q      +            +  +   ++ P  S +QK D
Sbjct: 227  -------VVKVSRYFHADGIQVNESQKEKS----------RNVRKTPIVSPVLSLSQKTD 269

Query: 980  EAYERRTSDNLWKPPKSPHNLLQEDHAHDPWRVLVICMLLNRTTGLQASRVINELFTLCP 801
            + Y R+T DN W PP+SP NLLQEDH HDPWRVLVICMLLN+T+G Q   VI++LF LC 
Sbjct: 270  DVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCT 329

Query: 800  NAQSAASVDPQYIEKVIQSLGLHKKRAVMIRRFSEEYLGESWTHVTELHGIGKYAADAYA 621
            +A++A  V  + IE +I+ LGL KKR  MI+R S EYL ESWTHVT+LHG+GKYAADAYA
Sbjct: 330  DAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYA 389

Query: 620  IFCTGKWDRVKPLDHMLNKYWDFLR 546
            IFC G WDRVKP DHMLN YWD+LR
Sbjct: 390  IFCNGNWDRVKPNDHMLNYYWDYLR 414


Top