BLASTX nr result

ID: Scutellaria23_contig00007103 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00007103
         (2039 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   258   4e-66
ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsi...   244   8e-62
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   243   1e-61
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 238   3e-60
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   235   4e-59

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  258 bits (659), Expect = 4e-66
 Identities = 168/439 (38%), Positives = 222/439 (50%), Gaps = 46/439 (10%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSY-SSNEEVHFSDGQLKE----AELEVL 1291
            EE  G +     + K+K N   +K+N   +G  Y +   +   SD Q+++     E+ V 
Sbjct: 184  EEQGGANVVSRGDGKEKANKRKRKKN---DGAIYPNKTRDTVSSDAQMRDIVKLTEINVA 240

Query: 1290 TSANHGSGGAREMEK-------------VSLDDLFSRFVYTGGKSYNFSTK-------FK 1171
            +  N  +   +   K             +S +D+ S++ Y      NF  K       + 
Sbjct: 241  SDGNMATDDCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYP 300

Query: 1170 KTERGGMDKMEEGNMKTVKDD----LVVEDNAPLCTADPVGSDNA-------ISPHNSCQ 1024
               +  ++K EE   K  K+      + E+ A    A P G+  +       ++P  + +
Sbjct: 301  MVVKN-IEKYEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIE 359

Query: 1023 NVKKGLRSDMECAALTKTRTSAEGARKEVRVVSPYFGNAXXXXXXXXXXXXTESKK--LH 850
            N K   R  ++                 VR VSP F  +               ++  L 
Sbjct: 360  NEKPNSRVHIQ-----------------VRKVSPNFNLSIGQQECMKIKPLKPCERVGLT 402

Query: 849  VRKISPYFSSTKKEEENK--------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLR 694
            VR +SPYF    K+EE +        N         K K+P + S  L+AA KR EAY R
Sbjct: 403  VRNVSPYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRR 462

Query: 693  KAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAA 514
            K PDNTW PPRS F LLQEDH  DPWRVLVICMLLN T GKQ   V+ +FF LCPDAKAA
Sbjct: 463  KTPDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAA 522

Query: 513  TEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTG 334
            TE  TE+IE++   LGL  KR   IQRLSQEYL + WTHVT+L GVGKYAADAYAIFCTG
Sbjct: 523  TEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTG 582

Query: 333  KWERVRPVDHMLVKYWEFL 277
            KW++VRP DHML  YW+FL
Sbjct: 583  KWDQVRPKDHMLNYYWDFL 601


>ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
            gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
            thaliana] gi|332641100|gb|AEE74621.1| methyl-CpG-binding
            domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score =  244 bits (622), Expect = 8e-62
 Identities = 164/443 (37%), Positives = 225/443 (50%), Gaps = 47/443 (10%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 1276
            ++ + +   R   D D I  + + +++ +  +    N ++   D       L+      H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 1275 GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----D 1111
              G + E +   SLDDLFS FVY G          ++ +R     +   N+ + +    D
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 1110 DLVVEDN---APLCTADPVGSDNAISPH----NSCQNVKKGLRSDMECAALTKTRTSAEG 952
            D  V D+      C+   V     +SP+       Q  K+G  SD  C+    ++  A+ 
Sbjct: 126  DDSVSDSHIERQECSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKV 184

Query: 951  ARKEVRVVSPYFGNAXXXXXXXXXXXXTES----------KKLHVRKISPYFS-STKKEE 805
             R     VSPYF  +            ++S          +++ VR++SPYF  ST  E+
Sbjct: 185  PR-----VSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQ 239

Query: 804  ENKNTANSTN------------------------QTRKAKKPKQCSPVLTAARKRDEAYL 697
             N+      N                        ++R  +K    SPVL+ ++K D+ YL
Sbjct: 240  PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 299

Query: 696  RKAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKA 517
            RK PDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T G Q   V+ + F LC DAK 
Sbjct: 300  RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 359

Query: 516  ATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCT 337
            ATEV  E+IE + + LGL  KR   IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC 
Sbjct: 360  ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 419

Query: 336  GKWERVRPVDHMLVKYWEFLRDR 268
            G W+RV+P DHML  YW++LR R
Sbjct: 420  GNWDRVKPNDHMLNYYWDYLRIR 442


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  243 bits (621), Expect = 1e-61
 Identities = 163/430 (37%), Positives = 220/430 (51%), Gaps = 47/430 (10%)
 Frame = -3

Query: 1416 DKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANHGSGGAREMEKV-S 1240
            D D I  + + +++ +  +    N ++   D       L+      H  G + E +   S
Sbjct: 7    DSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----HDDGCSLEKDNSNS 61

Query: 1239 LDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----DDLVVEDN---APL 1081
            LDDLFS FVY G          ++ +R     +   N+ + +    DD  V D+      
Sbjct: 62   LDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDDDDSVSDSHIERQE 112

Query: 1080 CTADPVGSDNAISPH----NSCQNVKKGLRSDMECAALTKTRTSAEGARKEVRVVSPYFG 913
            C+   V     +SP+       Q  K+G  SD  C+    ++  A+  R     VSPYF 
Sbjct: 113  CSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKVPR-----VSPYFQ 166

Query: 912  NAXXXXXXXXXXXXTES----------KKLHVRKISPYFS-STKKEEENKNTANSTN--- 775
             +            ++S          +++ VR++SPYF  ST  E+ N+      N   
Sbjct: 167  ASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFK 226

Query: 774  ---------------------QTRKAKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRS 658
                                 ++R  +K    SPVL+ ++K D+ YLRK PDNTW+PPRS
Sbjct: 227  VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRS 286

Query: 657  PFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVT 478
            P NLLQEDH+ DPWRVLVICMLLN T G Q   V+ + F LC DAK ATEV  E+IE + 
Sbjct: 287  PCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLI 346

Query: 477  RSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHML 298
            + LGL  KR   IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC G W+RV+P DHML
Sbjct: 347  KPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHML 406

Query: 297  VKYWEFLRDR 268
              YW++LR R
Sbjct: 407  NYYWDYLRIR 416


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  238 bits (608), Expect = 3e-60
 Identities = 151/405 (37%), Positives = 212/405 (52%), Gaps = 9/405 (2%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 1276
            ++ + +   R   D D I  + + +++ +  +    N ++   D       L+      H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 1275 GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----- 1114
              G + E +   SLDDLFS FVY G          ++ +R     +   N+ + +     
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 1113 DDLVVEDNAPLCTADPVGSD-NAISPHNSCQNVKKGLRSDMECAALTKTRTSAEGARKEV 937
            DD V + +        V +    +SP+     + +   SD+  ++ +        ++++V
Sbjct: 126  DDSVSDSHIERQECSKVQAKVPRVSPYFQASTISQ-CDSDIVSSSQSGRNYRKGSSKRQV 184

Query: 936  RV--VSPYFGNAXXXXXXXXXXXXTESKKLHVRKISPYFSSTKKEEENKNTANSTNQTRK 763
            +   VSPYF  +              +    V K+S YF +   +        S N    
Sbjct: 185  KARRVSPYFQESTVSEQPNQAPKGLRNY-FKVVKVSRYFHADGIQVNESQKEKSRN---- 239

Query: 762  AKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNL 583
             +K    SPVL+ ++K D+ YLRK PDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN 
Sbjct: 240  VRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNK 299

Query: 582  TGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESW 403
            T G Q   V+ + F LC DAK ATEV  E+IE + + LGL  KR   IQRLS EYL ESW
Sbjct: 300  TSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESW 359

Query: 402  THVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFLRDR 268
            THVT+L GVGKYAADAYAIFC G W+RV+P DHML  YW++LR R
Sbjct: 360  THVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLRIR 404


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  235 bits (599), Expect = 4e-59
 Identities = 149/365 (40%), Positives = 191/365 (52%), Gaps = 40/365 (10%)
 Frame = -3

Query: 1242 SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVKDDLVVEDNAPLCTADPV 1063
            +LDDLFS FVY G +         KT    +      +  +V +  +   +   C+   V
Sbjct: 75   NLDDLFSGFVYKGVRRRKMDDFGSKTTSNLLSPQIADDDDSVAESHIERQD---CSEFHV 131

Query: 1062 GSDNAISPHNSCQNVKKGLRSDMECAALTKTRTSAEGARK---EVRVVSPYFGNAXXXXX 892
                 +SP+     V +  +S  EC + +    S     K   +V +VSPYF ++     
Sbjct: 132  -EVRRVSPYFQGSTVSQ--QSKEECDSDSVCSQSGRNCSKVQAKVPIVSPYFQSSTISQC 188

Query: 891  XXXXXXXTESKKLH----------VRKISPYFS-STKKEEENK----------------- 796
                   ++S K +          VR+ SPYF  ST  E+ ++                 
Sbjct: 189  GSDIVSSSQSGKNYRRGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVSR 248

Query: 795  ---------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRSPFNLL 643
                     N +     TR  K P   SP L+ ++K DEAY RK PD TW+PPRSP NLL
Sbjct: 249  YFHADGIQVNESQKEKSTRVRKTPV-VSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLL 307

Query: 642  QEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGL 463
            QE H+ DPWRVLVICMLLN T G Q   V+ + F LCPDAK ATEV+  +IE + + LGL
Sbjct: 308  QEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGL 367

Query: 462  YNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWE 283
              KR   IQR S EYL ESWTHVT+L G+GKYAADAYAIFC G W+RV+P DHML  YWE
Sbjct: 368  QKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWE 427

Query: 282  FLRDR 268
            FLR R
Sbjct: 428  FLRIR 432

