BLASTX nr result
ID: Scutellaria23_contig00014362
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00014362
         (1535 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score   E
Sequences producing significant alignments:                         (bits) Value

emb|CBI28962.3| unnamed protein product [Vitis vinifera]             617   e-174
ref|XP_002885468.1| SET domain-containing protein [Arabidopsis l...  593   e-167
ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2 [Arabi...  592   e-167
dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]       589   e-166
ref|XP_003540318.1| PREDICTED: histone-lysine N-methyltransferas...  588   e-165

>emb|CBI28962.3| unnamed protein product [Vitis vinifera]
          Length = 464

 Score = 617 bits (1590), Expect = e-174
 Identities = 298/449 (66%), Positives = 360/449 (80%), Gaps = 1/449 (0%)
 Frame = +2

Query: 38   EISSLLSSPPQTQVQEYFEALVAERQCAGLKVKIERDHGKGVYSTAVFQEDDLILKDQIL 217
            EIS+LL PP Q+QEYF+ L+ RQ GLKVK + + GKGVY+ + F E +L+LKDQ+L
Sbjct: 14   EISALLKPPPAHQLQEYFDNLIRTRQYLGLKVKHDGEFGKGVYADSDFGEGELVLKDQML 73

Query: 218  VGVQHLSNKIDCLVCSFCFQFIGSIELQIGRKLYLEELGISAADEXXXXXXXXXXXXKTH 397
            VG QH SNKI+CLVC FCF+FIGSIELQIGR+LYL+ LG+S + K
Sbjct: 74   VGAQHSSNKINCLVCGFCFRFIGSIELQIGRRLYLQGLGVSTNHDELGECASSSSKDKVP 133

Query: 398  IHQNTIQSLMDGSLRFLYSEKFPLPSVVPCLGGCKEAYYCSKSCAKSDWDSSHSLFCIGH 577
            + + ++SLM+G L Y ++FPLPS + C GGC EAYYCSK CA++DW+SSHSL C G
Sbjct: 134  LPKGVVESLMNGELALPYPKEFPLPSAIACSGGCGEAYYCSKLCAEADWESSHSLLCTGE 193

Query: 578  GSSSPNKEALSKFMKHANETNDIFILAAKAISFTILRYRKLKADYVEQQGNDTSN-PCNS 754
            S S +EALSKF++HANETNDIF+LAAK I FTILRY+KLK ++++Q TS +
Sbjct: 194  KSESICREALSKFIQHANETNDIFLLAAKVICFTILRYKKLKKAHLKEQEKYTSAIVLKN 253

Query: 755  CIFPLLVEAWKPMSVGFKRRWWDCVALPEDVDSCDEAGFRMQLKDLAFESLQLLKEAIYD 934
            PLL+EAWKP+S+GFK+RWWDC+ALP+DV SCDEA FR Q+K+LAF SL+LLKEAI+
Sbjct: 254  GDLPLLLEAWKPISMGFKKRWWDCIALPDDVHSCDEAAFRAQIKELAFTSLKLLKEAIFC 313

Query: 935  KECAPLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLYIDSLPPSQKEECEKITKSFLDA 1114
            K C PLFSL+IYGHIIGMFELNNLDLVVASPVEDYFLYID LP QK++ E+IT+ FLDA
Sbjct: 314  KGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLYIDDLPYPQKKKAEEITRQFLDA 373

Query: 1115 LGEDYSVYCQGTAFYPLQSCMNHSCIPNAKAFKREEDRDGQATILALRPISKEEEITISY 1294
            LG+DYSV CQGTAF+PLQSCMNHSC PNAKAFKREEDRDGQATI+ALRPI KEEE+TISY
Sbjct: 374  LGDDYSVPCQGTAFFPLQSCMNHSCYPNAKAFKREEDRDGQATIIALRPIFKEEEVTISY 433

Query: 1295 IDEDLPYEERQQRLADYGFRCECPRCIEE 1381
            IDEDLP++ERQ LADYGFRC+CP+C+EE
Sbjct: 434  IDEDLPFDERQALLADYGFRCKCPKCLEE 462

>ref|XP_002885468.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331308|gb|EFH61727.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score = 593 bits (1529), Expect = e-167
 Identities = 290/465 (62%), Positives = 360/465 (77%), Gaps = 15/465 (3%)
 Frame = +2

Query: 38   EISSLLSSPPQTQVQEYFEALVAERQCAGLKVKIERDHGKGVYSTAVFQEDDLILKDQIL 217
            +I++LLS P Q QEYF L+ R+C G++VK+ GKGVY + FQED+LILKDQIL
Sbjct: 10   KIAALLSPLPTLQQQEYFNKLITSRRCNGIEVKLNETIGKGVYVNSEFQEDELILKDQIL 69

Query: 218  VGVQHLSNKIDCLVCSFCFQFIGSIELQIGRKLYLEELGISAA----------DEXXXXX 367
            VG+QH SNK+DCLVCSFCF+F+GSIE QIGRKLY + LG+S DE
Sbjct: 70   VGIQHSSNKVDCLVCSFCFRFVGSIEKQIGRKLYFKNLGVSGCCDGDSSESGEDECVKYN 129

Query: 368  XXXXXXXKTHIHQNT-----IQSLMDGSLRFLYSEKFPLPSVVPCLGGCKEAYYCSKSCA 532
            + NT + SLM+G + Y++ FPLPS + C GGC+EA+YCS+SCA
Sbjct: 130  GNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPYTDMFPLPSPLSCPGGCQEAFYCSESCA 189

Query: 533  KSDWDSSHSLFCIGHGSSSPNKEALSKFMKHANETNDIFILAAKAISFTILRYRKLKADY 712
            ++DW+SSHSL C G S S ++EAL +F+KHAN+TNDIF+LAAKAI+FTILRYRKLKA++
Sbjct: 190  EADWESSHSLLCTGEKSESNSREALGEFIKHANDTNDIFLLAAKAIAFTILRYRKLKAEH 249

Query: 713  VEQQGNDTSNPCNSCIFPLLVEAWKPMSVGFKRRWWDCVALPEDVDSCDEAGFRMQLKDL 892
            V+++ S P S LL+EAWKP+S+G+KRRWWDC+ALP+DVD DE FRMQ+K+L
Sbjct: 250  VDKKAKQ-SEPKQS----LLLEAWKPVSIGYKRRWWDCIALPDDVDLSDEGAFRMQIKNL 304

Query: 893  AFESLQLLKEAIYDKECAPLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLYIDSLPPSQ 1072
            A SL+LLK AI+DKEC LFSL+IYG+IIGMFELNNLDLVVASPVEDYFLYID LP ++
Sbjct: 305  ACTSLELLKTAIFDKECEALFSLEIYGNIIGMFELNNLDLVVASPVEDYFLYIDDLPDAE 364

Query: 1073 KEECEKITKSFLDALGEDYSVYCQGTAFYPLQSCMNHSCIPNAKAFKREEDRDGQATILA 1252
            KEE E+IT+ FLDALG++YS CQGTAF+PLQSCMNHSC PNAKAFKREED+DGQA I+A
Sbjct: 365  KEEAEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNHSCCPNAKAFKREEDKDGQAVIIA 424

Query: 1253 LRPISKEEEITISYIDEDLPYEERQQRLADYGFRCECPRCIEEAS 1387
            LR ISK EE+TISYIDE+LPY+ERQ LADYGF C+C +C+E++S
Sbjct: 425  LRRISKNEEVTISYIDEELPYKERQALLADYGFSCKCSKCLEDSS 469

>ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
 gi|75251251|sp|Q5PP37.1|ATXR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR2; AltName: Full=Protein SET DOMAIN GROUP 36; AltName: Full=Trithorax-related protein 2; Short=TRX-related protein 2
 gi|56236050|gb|AAV84481.1| At3g21820 [Arabidopsis thaliana]
 gi|59958344|gb|AAX12882.1| At3g21820 [Arabidopsis thaliana]
 gi|62320769|dbj|BAD95436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643034|gb|AEE76555.1| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
          Length = 473

 Score = 592 bits (1527), Expect = e-167
 Identities = 290/465 (62%), Positives = 362/465 (77%), Gaps = 13/465 (2%)
 Frame = +2

Query: 32   AAEISSLLSSPPQTQVQEYFEALVAERQCAGLKVKIERDHGKGVYSTAVFQEDDLILKDQ 211
            AA++++LL+ P Q+QEYF L+ R+C G++VK GKGVY+ + F ED+LILKD+
Sbjct: 12   AADVAALLAPLPTPQLQEYFNKLITSRRCNGIEVKNNGTIGKGVYANSEFDEDELILKDE 71

Query: 212  ILVGVQHLSNKIDCLVCSFCFQFIGSIELQIGRKLYLEELGISAA--------DEXXXXX 367
            ILVG+QH SNK+DCLVCSFCF+FIGSIE QIGRKLY + LG+S DE
Sbjct: 72   ILVGIQHSSNKVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGVSGCCDDDSSEEDECVKYN 131

Query: 368  XXXXXXXKTHIHQNT-----IQSLMDGSLRFLYSEKFPLPSVVPCLGGCKEAYYCSKSCA 532
            + NT + SLM+G + +++KFPLPS + C GGC+EA+YCS+SCA
Sbjct: 132  GNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSPLSCPGGCQEAFYCSESCA 191

Query: 533  KSDWDSSHSLFCIGHGSSSPNKEALSKFMKHANETNDIFILAAKAISFTILRYRKLKADY 712
            +DW+SSHSL C G S S ++EAL +F+KHAN+TNDIF+LAAKAI+FTILRYRKLKA++
Sbjct: 192  AADWESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLAAKAIAFTILRYRKLKAEH 251

Query: 713  VEQQGNDTSNPCNSCIFPLLVEAWKPMSVGFKRRWWDCVALPEDVDSCDEAGFRMQLKDL 892
            V+++ S P S LL+EAWKP+S+G+KRRWWDC+ALP+DVD DE FRMQ+K+L
Sbjct: 252  VDKKAKQ-SEPKQS----LLLEAWKPVSIGYKRRWWDCIALPDDVDPTDEGAFRMQIKNL 306

Query: 893  AFESLQLLKEAIYDKECAPLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLYIDSLPPSQ 1072
            A SL+LLK AI+DKEC LFSL+IYG+IIGMFELNNLDLVVASPVEDYFLYID LP ++
Sbjct: 307  ACTSLELLKIAIFDKECEALFSLEIYGNIIGMFELNNLDLVVASPVEDYFLYIDDLPDAE 366

Query: 1073 KEECEKITKSFLDALGEDYSVYCQGTAFYPLQSCMNHSCIPNAKAFKREEDRDGQATILA 1252
            KEE E+IT+ FLDALG++YS CQGTAF+PLQSCMNHSC PNAKAFKREEDRDGQA I+A
Sbjct: 367  KEETEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNHSCCPNAKAFKREEDRDGQAVIIA 426

Query: 1253 LRPISKEEEITISYIDEDLPYEERQQRLADYGFRCECPRCIEEAS 1387
            LR ISK EE+TISYIDE+LPY+ERQ LADYGF C+C +C+E++S
Sbjct: 427  LRRISKNEEVTISYIDEELPYKERQALLADYGFSCKCSKCLEDSS 471

>dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]
          Length = 565

 Score = 589 bits (1518), Expect = e-166
 Identities = 290/469 (61%), Positives = 363/469 (77%), Gaps = 17/469 (3%)
 Frame = +2

Query: 32   AAEISSLLSSPPQTQVQEYFEALVAERQCAGLKVKIERDHGKGVYSTAVFQEDDLILKDQ 211
            AA++++LL+ P Q+QEYF L+ R+C G++VK GKGVY+ + F ED+LILKD+
Sbjct: 100  AADVAALLAPLPTPQLQEYFNKLITSRRCNGIEVKNNGTIGKGVYANSEFDEDELILKDE 159

Query: 212  ILVGVQHLSNKIDCLVCSFCFQFIGSIELQIGRKLYLEELGISAA--------DEXXXXX 367
            ILVG+QH SNK+DCLVCSFCF+FIGSIE QIGRKLY + LG+S DE
Sbjct: 160  ILVGIQHSSNKVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGVSGCCDDDSSEEDECVKYN 219

Query: 368  XXXXXXXKTHIHQNT-----IQSLMDGSLRFLYSEKFPLPSVVPCLGGCKEAYYCSKSCA 532
            + NT + SLM+G + +++KFPLPS + C GGC+EA+YCS+SCA
Sbjct: 220  GNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSPLSCPGGCQEAFYCSESCA 279

Query: 533  KSDWDSSHSLFCIGHGSSSPNKEALSKFMKHANETNDIFILAAKAISFTILRYRKLKADY 712
            +DW+SSHSL C G S S ++EAL +F+KHAN+TNDIF+LAAKAI+FTILRYRKLKA++
Sbjct: 280  AADWESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLAAKAIAFTILRYRKLKAEH 339

Query: 713  VEQQGNDTSNPCNSCIFPLLVEAWKPMSVGFKRRWWDCVALPEDVDSCDEAGFRMQLKDL 892
            V+++ S P S LL+EAWKP+S+G+KRRWWDC+ALP+DVD DE FRMQ+K+L
Sbjct: 340  VDKKAKQ-SEPKQS----LLLEAWKPVSIGYKRRWWDCIALPDDVDPTDEGAFRMQIKNL 394

Query: 893  AFESLQLLKEAIYDKECA----PLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLYIDSL 1060
            A SL+LLK AI+DKEC P+FSL+IYG+IIGMFELNNLDLVVASPVEDYFLYID L
Sbjct: 395  ACTSLELLKIAIFDKECEARIPPMFSLEIYGNIIGMFELNNLDLVVASPVEDYFLYIDDL 454

Query: 1061 PPSQKEECEKITKSFLDALGEDYSVYCQGTAFYPLQSCMNHSCIPNAKAFKREEDRDGQA 1240
            P ++KEE E+IT+ FLDALG++YS CQGTAF+PLQSCMNHSC PNAKAFKREEDRDGQA
Sbjct: 455  PDAEKEETEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNHSCCPNAKAFKREEDRDGQA 514

Query: 1241 TILALRPISKEEEITISYIDEDLPYEERQQRLADYGFRCECPRCIEEAS 1387
            I+ALR ISK EE+TISYIDE+LPY+ERQ LADYGF C+C +C+E++S
Sbjct: 515  VIIALRRISKNEEVTISYIDEELPYKERQALLADYGFSCKCSKCLEDSS 563

>ref|XP_003540318.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Glycine max]
          Length = 484

 Score = 588 bits (1517), Expect = e-165
 Identities = 296/471 (62%), Positives = 354/471 (75%), Gaps = 21/471 (4%)
 Frame = +2

Query: 32   AAEISSLLSSPPQTQVQEYFEALVAERQCAGLKVKIERDHGKGVYSTAVFQEDDLILKDQ 211
            A EIS+LLS P QVQ+Y+ L+ R C+G+KVK + + GKG+Y+ F+E +L+LKD
Sbjct: 12   ATEISALLSPPSPLQVQKYYHDLLTARGCSGIKVKQDGNFGKGLYADMDFKEGELVLKDP 71

Query: 212  ILVGVQHLSNKIDCLVCSFCFQFIGSIELQIGRKLYLEELG------------------I 337
            +LVG QH NKIDCLVCSFCF FIGSIELQIGR+LY++ L +
Sbjct: 72   MLVGAQHPLNKIDCLVCSFCFCFIGSIELQIGRRLYMQHLRANESHGCEVGSSSKHCHEM 131

Query: 338  SAADEXXXXXXXXXXXXKTHIH--QNTIQSLMDGSLRFLYSEKFPLPSVVPCLGGCKEAY 511
            ++DE KT + + ++SLM+G L +SEKF LP VPC GGC EAY
Sbjct: 132  DSSDEEESTQQCTSGSSKTKVPLPEGIVESLMNGQLVLPFSEKFSLPPAVPCPGGCGEAY 191

Query: 512  YCSKSCAKSDWDSSHSLFCIGHGSSSPNKEALSKFMKHANETNDIFILAAKAISFTILRY 691
            YCS SCA++DW SSHSL C G S S +EAL KF+KHANETNDIF+LAAKAIS T+L Y
Sbjct: 192  YCSMSCAEADWGSSHSLLCTGESSDSARREALLKFIKHANETNDIFLLAAKAISSTMLMY 251

Query: 692  RKLKADYVEQQ-GNDTSNPCNSCIFPLLVEAWKPMSVGFKRRWWDCVALPEDVDSCDEAG 868
            RKLKA +E+Q ++TS N C +L+EAWKP+S+G KRRWWDC+ALP+DVDS DEA
Sbjct: 252  RKLKAVSLEEQMKHNTSCVSNHCNLSILLEAWKPISMGHKRRWWDCIALPDDVDSSDEAS 311

Query: 869  FRMQLKDLAFESLQLLKEAIYDKECAPLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLY 1048
            FR+Q+K LAFESLQLLK AI+DKEC PLFSL+IYG+IIGMFELNNLDLVVASPVEDYFLY
Sbjct: 312  FRLQIKMLAFESLQLLKTAIFDKECEPLFSLEIYGNIIGMFELNNLDLVVASPVEDYFLY 371

Query: 1049 IDSLPPSQKEECEKITKSFLDALGEDYSVYCQGTAFYPLQSCMNHSCIPNAKAFKREEDR 1228
            ID L KEE EKIT+ LDALGE+YS+YC+GTAF+PLQSC+NHSC PNAKAFKREED+
Sbjct: 372  IDDLTYPNKEEAEKITQPVLDALGEEYSIYCEGTAFFPLQSCLNHSCCPNAKAFKREEDK 431

Query: 1229 DGQATILALRPISKEEEITISYIDEDLPYEERQQRLADYGFRCECPRCIEE 1381
            DGQATI+A R I K EEITISY+DEDL +EERQ LADYGFRC C +CIEE
Sbjct: 432  DGQATIIAQRSICKGEEITISYVDEDLTFEERQASLADYGFRCRCSKCIEE 482