BLASTX nr result
ID: Scutellaria22_contig00007084
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00007084
         (1695 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   263   1e-67
ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsi...   252   2e-64
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   252   2e-64
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 244   7e-62
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   241   3e-61

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
 gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis]
          Length = 608

 Score = 263 bits (671), Expect = 1e-67
 Identities = 169/439 (38%), Positives = 228/439 (51%), Gaps = 46/439 (10%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSY-SSNEEVHFSDGQLKE----AELEVL 514
            EE G + + K+K N + +K+N +G Y + + SD Q+++ E+ V
Sbjct: 184  EEQGGANVVSRGDGKEKANKRKRKKN---DGAIYPNKTRDTVSSDAQMRDIVKLTEINVA 240

Query: 515  TSANHGSGGAREMEK-------------VSLDDLFSRFVYTGGKSYNFSTK-------FK 634
            + N + + K +S +D+ S++ Y NF K +
Sbjct: 241  SDGNMATDDCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYP 300

Query: 635  KTERGGMDKMEEENMKTVKDD----LVVEDNAPLCTADPVGSDNA-------ISPHNSCQ 781
            + ++K EE K K+ + E+ A A P G+ + ++P + +
Sbjct: 301  MVVKN-IEKYEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIE 359

Query: 782  NVKKGLKSDMECAALTKTRTSAKGARKEVRVVSPYFGNAVTKVKITTKQ--RKTESKKLH 955
            N K + ++ VR VSP F ++ + + + + E L
Sbjct: 360  NEKPNSRVHIQ-----------------VRKVSPNFNLSIGQQECMKIKPLKPCERVGLT 402

Query: 956  VRKISPYFSSTKKEEENK--------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLR 1111
            VR +SPYF K+EE + N K K+P + S L+AA KR EAY R
Sbjct: 403  VRNVSPYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRR 462

Query: 1112 KTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAA 1291
            KTPDNTW PPRS F LLQEDH DPWRVLVICMLLN T GKQ V+ +FF LCPDAKAA
Sbjct: 463  KTPDNTWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAA 522

Query: 1292 TEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTG 1471
            TE TE+IE++ LGL KR IQRLSQEYL + WTHVT+L GVGKYAADAYAIFCTG
Sbjct: 523  TEAKTEEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTG 582

Query: 1472 KWERVRPVDHMLVKYWEFL 1528
            KW++VRP DHML YW+FL
Sbjct: 583  KWDQVRPKDHMLNYYWDFL 601

>ref|NP_974253.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
 gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana]
 gi|332641100|gb|AEE74621.1| methyl-CpG-binding domain protein 4 [Arabidopsis thaliana]
          Length = 445

 Score = 252 bits (644), Expect = 2e-64
 Identities = 170/443 (38%), Positives = 228/443 (51%), Gaps = 47/443 (10%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 529
            ++ + + R D D I + +++ + + N ++ D L+ H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 530  GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVK----D 694
            G + E + SLDDLFS FVY G ++ +R + N+ + + D
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDD 125

Query: 695  DLVVEDN---APLCTADPVGSDNAISPH----NSCQNVKKGLKSDMECAALTKTRTSAKG 853
            D V D+ C+ V +SP+ Q K+G SD C+ ++ AK
Sbjct: 126  DDSVSDSHIERQECSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKV 184

Query: 854  ARKEVRVVSPYF---------GNAVTKVKITTKQRKTESKK-LHVRKISPYFS-STKKEE 1000
            R VSPYF + V+ + RK SK+ + VR++SPYF ST E+
Sbjct: 185  PR-----VSPYFQASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQ 239

Query: 1001 ENKNTANSTN------------------------QTRKAKKPKQCSPVLTAARKRDEAYL 1108
            N+ N ++R +K SPVL+ ++K D+ YL
Sbjct: 240  PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYL 299

Query: 1109 RKTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKA 1288
            RKTPDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T G Q V+ + F LC DAK
Sbjct: 300  RKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKT 359

Query: 1289 ATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCT 1468
            ATEV E+IE + + LGL KR IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC
Sbjct: 360  ATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCN 419

Query: 1469 GKWERVRPVDHMLVKYWEFLRDR 1537
            G W+RV+P DHML YW++LR R
Sbjct: 420  GNWDRVKPNDHMLNYYWDYLRIR 442

>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score = 252 bits (643), Expect = 2e-64
 Identities = 169/430 (39%), Positives = 223/430 (51%), Gaps = 47/430 (10%)
 Frame = +2

Query: 389  DKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANHGSGGAREMEKV-S 565
            D D I + +++ + + N ++ D L+ H G + E + S
Sbjct: 7    DSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----HDDGCSLEKDNSNS 61

Query: 566  LDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVK----DDLVVEDN---APL 724
            LDDLFS FVY G ++ +R + N+ + + DD V D+
Sbjct: 62   LDDLFSGFVYKG---------VRRRKRDDFGSITTSNLVSPQIADDDDDSVSDSHIERQE 112

Query: 725  CTADPVGSDNAISPH----NSCQNVKKGLKSDMECAALTKTRTSAKGARKEVRVVSPYF- 889
            C+ V +SP+ Q K+G SD C+ ++ AK R VSPYF
Sbjct: 113  CSEFHV-EVRRVSPYFQGSTVSQQSKEGCDSDSVCSKEGCSKVQAKVPR-----VSPYFQ 166

Query: 890  --------GNAVTKVKITTKQRKTESKK-LHVRKISPYFS-STKKEEENKNTANSTN--- 1030
            + V+ + RK SK+ + VR++SPYF ST E+ N+ N
Sbjct: 167  ASTISQCDSDIVSSSQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFK 226

Query: 1031 ---------------------QTRKAKKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRS 1147
            ++R +K SPVL+ ++K D+ YLRKTPDNTW+PPRS
Sbjct: 227  VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRS 286

Query: 1148 PFNLLQEDHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVT 1327
            P NLLQEDH+ DPWRVLVICMLLN T G Q V+ + F LC DAK ATEV E+IE +
Sbjct: 287  PCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLI 346

Query: 1328 RSLGLYNKRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHML 1507
            + LGL KR IQRLS EYL ESWTHVT+L GVGKYAADAYAIFC G W+RV+P DHML
Sbjct: 347  KPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHML 406

Query: 1508 VKYWEFLRDR 1537
            YW++LR R
Sbjct: 407  NYYWDYLRIR 416

>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score = 244 bits (622), Expect = 7e-62
 Identities = 156/404 (38%), Positives = 215/404 (53%), Gaps = 8/404 (1%)
 Frame = +2

Query: 350  EESTGMDSARVLEDKDKINSKSKKQNWRINGDSYSSNEEVHFSDGQLKEAELEVLTSANH 529
            ++ + + R D D I + +++ + + N ++ D L+ H
Sbjct: 20   DDDSSVMMTRRRPDSDFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTNLVLQC-----H 74

Query: 530  GSGGAREMEKV-SLDDLFSRFVYTGGKSYNFSTKFKKTERGGM--DKMEEENMKTVKDDL 700
            G + E + SLDDLFS FVY G + + K+ + G + + + DD
Sbjct: 75   DDGCSLEKDNSNSLDDLFSGFVYKGVR------RRKRDDFGSITTSNLVSPQIADDDDDS 128

Query: 701  VVEDNAPLCTADPVGSD-NAISPHNSCQNVKKGLKSDMECAALTKT-RTSAKGARK---E 865
            V + + V + +SP+ + + D + + +++ R KG+ K +
Sbjct: 129  VSDSHIERQECSKVQAKVPRVSPYFQASTISQ---CDSDIVSSSQSGRNYRKGSSKRQVK 185

Query: 866  VRVVSPYFGNAVTKVKITTKQRKTESKKLHVRKISPYFSSTKKEEENKNTANSTNQTRKA 1045
            R VSPYF + + + K V K+S YF + + S N
Sbjct: 186  ARRVSPYFQESTVSEQ-PNQAPKGLRNYFKVVKVSRYFHADGIQVNESQKEKSRN----V 240

Query: 1046 KKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRSPFNLLQEDHFFDPWRVLVICMLLNLT 1225
            +K SPVL+ ++K D+ YLRKTPDNTW+PPRSP NLLQEDH+ DPWRVLVICMLLN T
Sbjct: 241  RKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKT 300

Query: 1226 GGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYNKRGAGIQRLSQEYLDESWT 1405
            G Q V+ + F LC DAK ATEV E+IE + + LGL KR IQRLS EYL ESWT
Sbjct: 301  SGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWT 360

Query: 1406 HVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFLRDR 1537
            HVT+L GVGKYAADAYAIFC G W+RV+P DHML YW++LR R
Sbjct: 361  HVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYLRIR 404

>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
 gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score = 241 bits (616), Expect = 3e-61
 Identities = 147/363 (40%), Positives = 198/363 (54%), Gaps = 38/363 (10%)
 Frame = +2

Query: 563  SLDDLFSRFVYTGGKSYNFSTKFKKTERGGMDKMEEENMKTVKDDLVVEDNAPLCTADPV 742
            +LDDLFS FVY G + KT + ++ +V + + + C+ V
Sbjct: 75   NLDDLFSGFVYKGVRRRKMDDFGSKTTSNLLSPQIADDDDSVAESHIERQD---CSEFHV 131

Query: 743  GSDNAISPHNSCQNVKKGLKSDMECAAL-TKTRTSAKGARKEVRVVSPYF--------GN 895
            +SP+ V + K + + ++ +++ + + +V +VSPYF G+
Sbjct: 132  -EVRRVSPYFQGSTVSQQSKEECDSDSVCSQSGRNCSKVQAKVPIVSPYFQSSTISQCGS 190

Query: 896  AVTKVKITTK--QRKTESKKLHVRKISPYFS-STKKEEENK------------------- 1009
            + + K +R + ++ VR+ SPYF ST E+ ++
Sbjct: 191  DIVSSSQSGKNYRRGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVSRYF 250

Query: 1010 -------NTANSTNQTRKAKKPKQCSPVLTAARKRDEAYLRKTPDNTWIPPRSPFNLLQE 1168
            N + TR K P SP L+ ++K DEAY RKTPD TW+PPRSP NLLQE
Sbjct: 251  HADGIQVNESQKEKSTRVRKTPV-VSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQE 309

Query: 1169 DHFFDPWRVLVICMLLNLTGGKQAGKVLPNFFQLCPDAKAATEVDTEKIEEVTRSLGLYN 1348
            H+ DPWRVLVICMLLN T G Q V+ + F LCPDAK ATEV+ +IE + + LGL
Sbjct: 310  HHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQK 369

Query: 1349 KRGAGIQRLSQEYLDESWTHVTELTGVGKYAADAYAIFCTGKWERVRPVDHMLVKYWEFL 1528
            KR IQR S EYL ESWTHVT+L G+GKYAADAYAIFC G W+RV+P DHML YWEFL
Sbjct: 370  KRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429

Query: 1529 RDR 1537
            R R
Sbjct: 430  RIR 432