BLASTX nr result
ID: Scutellaria23_contig00007103
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00007103
         (2039 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   258   4e-66
ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsi...   244   8e-62
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   243   1e-61
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 238   3e-60
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   235   4e-59

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
 gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis]
          Length = 608

 Score = 258 bits (659), Expect = 4e-66
 Identities = 168/439 (38%), Positives = 222/439 (50%), Gaps = 46/439 (10%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSY-SSNEEVHFSDGQLKE----AELEVL 1291
            EE G + + K+K N +K+N +G Y + + SD Q+++ E+ V
Sbjct: 184  EEQGGANVVSRGDGKEKANKRKRKKN---DGAIYPNKTRDTVSSDAQMRDIVKLTEINVA 240

Query: 1290 TSANHGSGGAREMEK-------------VSLDDLFSRFVYTGGKSYNFSTK-------FK 1171
            + N + + K +S +D+ S++ Y NF K +
Sbjct: 241  SDGNMATDDCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYP 300

Query: 1170 KTERGGMDKMEEGNMKTVKDD----LVVEDNAPLCTADPVGSDNA-------ISPHNSCQ 1024
            + ++K EE K K+ + E+ A A P G+ + ++P + +
Sbjct: 301  MVVKN-IEKYEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIE 359

Query: 1023 NVKKGLRSDMECAALTKTRTSAEGARKEVRVVSPYFGNAXXXXXXXXXXXXTESKK--LH 850
            N K R ++ VR VSP F + ++ L
Sbjct: 360  NEKPNSRVHIQ-----------------VRKVSPNFNLSIGQQECMKIKPLKPCERVGLT 402

Query: 849  VRKISPYFSSTKKEEENK--------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLR 694
            VR +SPYF K+EE + N K K+P + S L+AA KR EAY R
Sbjct: 403  VRNVSPYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRR 462

Query: 693  KAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAA 514
            K PDNTW PPRS F LLQEDH DPWRVLVICMLLN T GKQ V+ +FF LCPDAKAA
Sbjct: 463  KTPDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAA 522

Query: 513  TEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTG 334
            TE TE+IE++ LGL KR IQRLSQEYL + WTHVT+L GVGKYAADAYAIFCTG
Sbjct: 523  TEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTG 582

Query: 333  KWERVRPVDHMLVKYWEFL 277
            KW++VRP DHML YW+FL
Sbjct: 583  KWDQVRPKDHMLNYYWDFL 601

>ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
 gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana]
 gi|332641100|gb|AEE74621.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score = 244 bits (622), Expect = 8e-62
 Identities = 164/443 (37%), Positives = 225/443 (50%), Gaps = 47/443 (10%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 1276
            ++ + + R D D I + + +++ + + N ++ D L+ H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 1275 GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----D 1111
            G + E + SLDDLFS FVY G ++ +R + N+ + + D
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 1110 DLVVEDN---APLCTADPVGSDNAISPH----NSCQNVKKGLRSDMECAALTKTRTSAEG 952
            D V D+ C+ V +SP+ Q K+G SD C+ ++ A+
Sbjct: 126  DDSVSDSHIERQECSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKV 184

Query: 951  ARKEVRVVSPYFGNAXXXXXXXXXXXXTES----------KKLHVRKISPYFS-STKKEE 805
            R VSPYF + ++S +++ VR++SPYF ST E+
Sbjct: 185  PR-----VSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQ 239

Query: 804  ENKNTANSTN------------------------QTRKAKKPKQCSPVLTAARKRDEAYL 697
            N+ N ++R +K SPVL+ ++K D+ YL
Sbjct: 240  PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 299

Query: 696  RKAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKA 517
            RK PDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T G Q V+ + F LC DAK
Sbjct: 300  RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 359

Query: 516  ATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCT 337
            ATEV E+IE + + LGL KR IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC
Sbjct: 360  ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 419

Query: 336  GKWERVRPVDHMLVKYWEFLRDR 268
            G W+RV+P DHML YW++LR R
Sbjct: 420  GNWDRVKPNDHMLNYYWDYLRIR 442

>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score = 243 bits (621), Expect = 1e-61
 Identities = 163/430 (37%), Positives = 220/430 (51%), Gaps = 47/430 (10%)
 Frame = -3

Query: 1416 DKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANHGSGGAREMEKV-S 1240
            D D I + + +++ + + N ++ D L+ H G + E + S
Sbjct: 7    DSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----HDDGCSLEKDNSNS 61

Query: 1239 LDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----DDLVVEDN---APL 1081
            LDDLFS FVY G ++ +R + N+ + + DD V D+
Sbjct: 62   LDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDDDDSVSDSHIERQE 112

Query: 1080 CTADPVGSDNAISPH----NSCQNVKKGLRSDMECAALTKTRTSAEGARKEVRVVSPYFG 913
            C+ V +SP+ Q K+G SD C+ ++ A+ R VSPYF
Sbjct: 113  CSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKVPR-----VSPYFQ 166

Query: 912  NAXXXXXXXXXXXXTES----------KKLHVRKISPYFS-STKKEEENKNTANSTN--- 775
            + ++S +++ VR++SPYF ST E+ N+ N
Sbjct: 167  ASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFK 226

Query: 774  ---------------------QTRKAKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRS 658
            ++R +K SPVL+ ++K D+ YLRK PDNTW+PPRS
Sbjct: 227  VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRS 286

Query: 657  PFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVT 478
            P NLLQEDH+ DPWRVLVICMLLN T G Q V+ + F LC DAK ATEV E+IE +
Sbjct: 287  PCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLI 346

Query: 477  RSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHML 298
            + LGL KR IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC G W+RV+P DHML
Sbjct: 347  KPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHML 406

Query: 297  VKYWEFLRDR 268
            YW++LR R
Sbjct: 407  NYYWDYLRIR 416

>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score = 238 bits (608), Expect = 3e-60
 Identities = 151/405 (37%), Positives = 212/405 (52%), Gaps = 9/405 (2%)
 Frame = -3

Query: 1455 EESTGMDSARVLEDKDKINSNSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 1276
            ++ + + R D D I + + +++ + + N ++ D L+ H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 1275 GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVK----- 1114
            G + E + SLDDLFS FVY G ++ +R + N+ + +
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 1113 DDLVVEDNAPLCTADPVGSD-NAISPHNSCQNVKKGLRSDMECAALTKTRTSAEGARKEV 937
            DD V + + V + +SP+ + + SD+ ++ + ++++V
Sbjct: 126  DDSVSDSHIERQECSKVQAKVPRVSPYFQASTISQ-CDSDIVSSSQSGRNYRKGSSKRQV 184

Query: 936  RV--VSPYFGNAXXXXXXXXXXXXTESKKLHVRKISPYFSSTKKEEENKNTANSTNQTRK 763
            + VSPYF + + V K+S YF + + S N
Sbjct: 185  KARRVSPYFQESTVSEQPNQAPKGLRNY-FKVVKVSRYFHADGIQVNESQKEKSRN---- 239

Query: 762  AKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNL 583
            +K SPVL+ ++K D+ YLRK PDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN
Sbjct: 240  VRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNK 299

Query: 582  TGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESW 403
            T G Q V+ + F LC DAK ATEV E+IE + + LGL KR IQRLS EYL ESW
Sbjct: 300  TSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESW 359

Query: 402  THVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFLRDR 268
            THVT+L GVGKYAADAYAIFC G W+RV+P DHML YW++LR R
Sbjct: 360  THVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLRIR 404

>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
 gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 235 bits (599), Expect = 4e-59
 Identities = 149/365 (40%), Positives = 191/365 (52%), Gaps = 40/365 (10%)
 Frame = -3

Query: 1242 SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEGNMKTVKDDLVVEDNAPLCTADPV 1063
            +LDDLFS FVY G + KT + + +V + + + C+ V
Sbjct: 75   NLDDLFSGFVYKGVRRRKMDDFGSKTTSNLLSPQIADDDDSVAESHIERQD---CSEFHV 131

Query: 1062 GSDNAISPHNSCQNVKKGLRSDMECAALTKTRTSAEGARK---EVRVVSPYFGNAXXXXX 892
            +SP+ V + +S EC + + S K +V +VSPYF ++
Sbjct: 132  -EVRRVSPYFQGSTVSQ--QSKEECDSDSVCSQSGRNCSKVQAKVPIVSPYFQSSTISQC 188

Query: 891  XXXXXXXTESKKLH----------VRKISPYFS-STKKEEENK----------------- 796
            ++S K + VR+ SPYF ST E+ ++
Sbjct: 189  GSDIVSSSQSGKNYRRGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVSR 248

Query: 795  ---------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLRKAPDNTWIPPRSPFNLL 643
            N + TR K P SP L+ ++K DEAY RK PD TW+PPRSP NLL
Sbjct: 249  YFHADGIQVNESQKEKSTRVRKTPV-VSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLL 307

Query: 642  QEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGL 463
            QE H+ DPWRVLVICMLLN T G Q V+ + F LCPDAK ATEV+ +IE + + LGL
Sbjct: 308  QEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGL 367

Query: 462  YNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWE 283
            KR IQR S EYL ESWTHVT+L G+GKYAADAYAIFC G W+RV+P DHML YWE
Sbjct: 368  QKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWE 427

Query: 282  FLRDR 268
            FLR R
Sbjct: 428  FLRIR 432