BLASTX nr result

ID: Scutellaria22_contig00007084 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00007084
         (1695 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   263   1e-67
ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsi...   252   2e-64
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   252   2e-64
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 244   7e-62
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   241   3e-61

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  263 bits (671), Expect = 1e-67
 Identities = 169/439 (38%), Positives = 228/439 (51%), Gaps = 46/439 (10%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSY-SSNEEVHFSDGQLKE----AELEVL 514
            EE  G +     + K+K N + +K+N   +G  Y +   +   SD Q+++     E+ V 
Sbjct: 184  EEQGGANVVSRGDGKEKANKRKRKKN---DGAIYPNKTRDTVSSDAQMRDIVKLTEINVA 240

Query: 515  TSANHGSGGAREMEK-------------VSLDDLFSRFVYTGGKSYNFSTK-------FK 634
            +  N  +   +   K             +S +D+ S++ Y      NF  K       + 
Sbjct: 241  SDGNMATDDCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYP 300

Query: 635  KTERGGMDKMEEENMKTVKDD----LVVEDNAPLCTADPVGSDNA-------ISPHNSCQ 781
               +  ++K EE   K  K+      + E+ A    A P G+  +       ++P  + +
Sbjct: 301  MVVKN-IEKYEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIE 359

Query: 782  NVKKGLKSDMECAALTKTRTSAKGARKEVRVVSPYFGNAVTKVKITTKQ--RKTESKKLH 955
            N K   +  ++                 VR VSP F  ++ + +    +  +  E   L 
Sbjct: 360  NEKPNSRVHIQ-----------------VRKVSPNFNLSIGQQECMKIKPLKPCERVGLT 402

Query: 956  VRKISPYFSSTKKEEENK--------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLR 1111
            VR +SPYF    K+EE +        N         K K+P + S  L+AA KR EAY R
Sbjct: 403  VRNVSPYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRR 462

Query: 1112 KTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAA 1291
            KTPDNTW PPRS F LLQEDH  DPWRVLVICMLLN T GKQ   V+ +FF LCPDAKAA
Sbjct: 463  KTPDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAA 522

Query: 1292 TEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTG 1471
            TE  TE+IE++   LGL  KR   IQRLSQEYL + WTHVT+L GVGKYAADAYAIFCTG
Sbjct: 523  TEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTG 582

Query: 1472 KWERVRPVDHMLVKYWEFL 1528
            KW++VRP DHML  YW+FL
Sbjct: 583  KWDQVRPKDHMLNYYWDFL 601


>ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
            gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
            thaliana] gi|332641100|gb|AEE74621.1| methyl-CpG-binding
            domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score =  252 bits (644), Expect = 2e-64
 Identities = 170/443 (38%), Positives = 228/443 (51%), Gaps = 47/443 (10%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 529
            ++ + +   R   D D I    + +++ +  +    N ++   D       L+      H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 530  GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVK----D 694
              G + E +   SLDDLFS FVY G          ++ +R     +   N+ + +    D
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 695  DLVVEDN---APLCTADPVGSDNAISPH----NSCQNVKKGLKSDMECAALTKTRTSAKG 853
            D  V D+      C+   V     +SP+       Q  K+G  SD  C+    ++  AK 
Sbjct: 126  DDSVSDSHIERQECSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKV 184

Query: 854  ARKEVRVVSPYF---------GNAVTKVKITTKQRKTESKK-LHVRKISPYFS-STKKEE 1000
             R     VSPYF          + V+  +     RK  SK+ + VR++SPYF  ST  E+
Sbjct: 185  PR-----VSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQ 239

Query: 1001 ENKNTANSTN------------------------QTRKAKKPKQCSPVLTAARKRDEAYL 1108
             N+      N                        ++R  +K    SPVL+ ++K D+ YL
Sbjct: 240  PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 299

Query: 1109 RKTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKA 1288
            RKTPDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T G Q   V+ + F LC DAK 
Sbjct: 300  RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 359

Query: 1289 ATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCT 1468
            ATEV  E+IE + + LGL  KR   IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC 
Sbjct: 360  ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 419

Query: 1469 GKWERVRPVDHMLVKYWEFLRDR 1537
            G W+RV+P DHML  YW++LR R
Sbjct: 420  GNWDRVKPNDHMLNYYWDYLRIR 442


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  252 bits (643), Expect = 2e-64
 Identities = 169/430 (39%), Positives = 223/430 (51%), Gaps = 47/430 (10%)
 Frame = +2

Query: 389  DKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANHGSGGAREMEKV-S 565
            D D I    + +++ +  +    N ++   D       L+      H  G + E +   S
Sbjct: 7    DSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----HDDGCSLEKDNSNS 61

Query: 566  LDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVK----DDLVVEDN---APL 724
            LDDLFS FVY G          ++ +R     +   N+ + +    DD  V D+      
Sbjct: 62   LDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDDDDSVSDSHIERQE 112

Query: 725  CTADPVGSDNAISPH----NSCQNVKKGLKSDMECAALTKTRTSAKGARKEVRVVSPYF- 889
            C+   V     +SP+       Q  K+G  SD  C+    ++  AK  R     VSPYF 
Sbjct: 113  CSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKVPR-----VSPYFQ 166

Query: 890  --------GNAVTKVKITTKQRKTESKK-LHVRKISPYFS-STKKEEENKNTANSTN--- 1030
                     + V+  +     RK  SK+ + VR++SPYF  ST  E+ N+      N   
Sbjct: 167  ASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFK 226

Query: 1031 ---------------------QTRKAKKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRS 1147
                                 ++R  +K    SPVL+ ++K D+ YLRKTPDNTW+PPRS
Sbjct: 227  VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRS 286

Query: 1148 PFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVT 1327
            P NLLQEDH+ DPWRVLVICMLLN T G Q   V+ + F LC DAK ATEV  E+IE + 
Sbjct: 287  PCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLI 346

Query: 1328 RSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHML 1507
            + LGL  KR   IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC G W+RV+P DHML
Sbjct: 347  KPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHML 406

Query: 1508 VKYWEFLRDR 1537
              YW++LR R
Sbjct: 407  NYYWDYLRIR 416


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  244 bits (622), Expect = 7e-62
 Identities = 156/404 (38%), Positives = 215/404 (53%), Gaps = 8/404 (1%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 529
            ++ + +   R   D D I    + +++ +  +    N ++   D       L+      H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 530  GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGM--DKMEEENMKTVKDDL 700
              G + E +   SLDDLFS FVY G +      + K+ + G +    +    +    DD 
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKGVR------RRKRDDFGSITTSNLVSPQIADDDDDS 128

Query: 701  VVEDNAPLCTADPVGSD-NAISPHNSCQNVKKGLKSDMECAALTKT-RTSAKGARK---E 865
            V + +        V +    +SP+     + +    D +  + +++ R   KG+ K   +
Sbjct: 129  VSDSHIERQECSKVQAKVPRVSPYFQASTISQ---CDSDIVSSSQSGRNYRKGSSKRQVK 185

Query: 866  VRVVSPYFGNAVTKVKITTKQRKTESKKLHVRKISPYFSSTKKEEENKNTANSTNQTRKA 1045
             R VSPYF  +    +   +  K       V K+S YF +   +        S N     
Sbjct: 186  ARRVSPYFQESTVSEQ-PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRN----V 240

Query: 1046 KKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLT 1225
            +K    SPVL+ ++K D+ YLRKTPDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T
Sbjct: 241  RKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKT 300

Query: 1226 GGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWT 1405
             G Q   V+ + F LC DAK ATEV  E+IE + + LGL  KR   IQRLS EYL ESWT
Sbjct: 301  SGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWT 360

Query: 1406 HVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFLRDR 1537
            HVT+L GVGKYAADAYAIFC G W+RV+P DHML  YW++LR R
Sbjct: 361  HVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLRIR 404


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  241 bits (616), Expect = 3e-61
 Identities = 147/363 (40%), Positives = 198/363 (54%), Gaps = 38/363 (10%)
 Frame = +2

Query: 563  SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVKDDLVVEDNAPLCTADPV 742
            +LDDLFS FVY G +         KT    +     ++  +V +  +   +   C+   V
Sbjct: 75   NLDDLFSGFVYKGVRRRKMDDFGSKTTSNLLSPQIADDDDSVAESHIERQD---CSEFHV 131

Query: 743  GSDNAISPHNSCQNVKKGLKSDMECAAL-TKTRTSAKGARKEVRVVSPYF--------GN 895
                 +SP+     V +  K + +  ++ +++  +    + +V +VSPYF        G+
Sbjct: 132  -EVRRVSPYFQGSTVSQQSKEECDSDSVCSQSGRNCSKVQAKVPIVSPYFQSSTISQCGS 190

Query: 896  AVTKVKITTK--QRKTESKKLHVRKISPYFS-STKKEEENK------------------- 1009
             +     + K  +R +  ++  VR+ SPYF  ST  E+ ++                   
Sbjct: 191  DIVSSSQSGKNYRRGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVSRYF 250

Query: 1010 -------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRSPFNLLQE 1168
                   N +     TR  K P   SP L+ ++K DEAY RKTPD TW+PPRSP NLLQE
Sbjct: 251  HADGIQVNESQKEKSTRVRKTPV-VSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQE 309

Query: 1169 DHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYN 1348
             H+ DPWRVLVICMLLN T G Q   V+ + F LCPDAK ATEV+  +IE + + LGL  
Sbjct: 310  HHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQK 369

Query: 1349 KRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFL 1528
            KR   IQR S EYL ESWTHVT+L G+GKYAADAYAIFC G W+RV+P DHML  YWEFL
Sbjct: 370  KRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429

Query: 1529 RDR 1537
            R R
Sbjct: 430  RIR 432


Top