BLASTX nr result
ID: Salvia21_contig00019104
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00019104 (1644 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

emb|CBI28962.3| unnamed protein product [Vitis vinifera]                585  e-164
ref|XP_003540318.1| PREDICTED: histone-lysine N-methyltransferas...    567  e-159
ref|XP_002885468.1| SET domain-containing protein [Arabidopsis l...    552  e-154
ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2 [Arabi...    549  e-154
dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]          545  e-152

>emb|CBI28962.3| unnamed protein product [Vitis vinifera]
Length = 464

Score = 585 bits (1508), Expect = e-164
Identities = 285/413 (69%), Positives = 332/413 (80%)
Frame = -3

Query: 1591 GVYSEVDFREDDLILKDQMLVGAQHSSNKVNCMVCSFCFQFIGSIELQIGRKLYLEELGI 1412
GVY++ DF E +L+LKDQMLVGAQHSSNK+NC+VC FCF+FIGSIELQIGR+LYL+ LG+
Sbjct: 54 GVYADSDFGEGELVLKDQMLVGAQHSSNKINCLVCGFCFRFIGSIELQIGRRLYLQGLGV 113

Query: 1411 SADXXXXXXXXXXXXXXXEKTRLSHNTIQSLMDGCLRLPYSENFPLPPVFPCLGGCKEAY 1232
S + K L ++SLM+G L LPY + FPLP C GGC EAY
Sbjct: 114 STNHDELGECASSSSKD--KVPLPKGVVESLMNGELALPYPKEFPLPSAIACSGGCGEAY 171

Query: 1231 YCSKSCAQADWDSFHALLCIGHGSSSPNREALLKFMKHADETNDIFFPAAKVISSTILRY 1052
YCSK CA+ADW+S H+LLC G S S REAL KF++HA+ETNDIF AAKVI TILRY
Sbjct: 172 YCSKLCAEADWESSHSLLCTGEKSESICREALSKFIQHANETNDIFLLAAKVICFTILRY 231

Query: 1051 RKLKAARVEQQGKHVMSNPHDICIFPLLLEAWKPVSMGFKRRWWDCIALPDDVDSCDEAD 872
+KLK A +++Q K+ + PLLLEAWKP+SMGFK+RWWDCIALPDDV SCDEA
Sbjct: 232 KKLKKAHLKEQEKYTSAIVLKNGDLPLLLEAWKPISMGFKKRWWDCIALPDDVHSCDEAA 291

Query: 871 FRMQIKDLAFESLQLLKEAIYDRECAPLFSLDIYGHIIGMFELNNLDLVVASPVEDYFLY 692
FR QIK+LAF SL+LLKEAI+ + C PLFSL+IYGHIIGMFELNNLDLVVASPVEDYFLY
Sbjct: 292 FRAQIKELAFTSLKLLKEAIFCKGCEPLFSLEIYGHIIGMFELNNLDLVVASPVEDYFLY 351

Query: 691 IDDLPSSQKEEAEKTTKPFLDALGEDYSVSCEGTAFFPLQSCMNHSCIPNAKAFKREEDR 512
IDDLP QK++AE+ T+ FLDALG+DYSV C+GTAFFPLQSCMNHSC PNAKAFKREEDR
Sbjct: 352 IDDLPYPQKKKAEEITRQFLDALGDDYSVPCQGTAFFPLQSCMNHSCYPNAKAFKREEDR 411

Query: 511 DGRATILALRPISKEEEITISYIDEDLPYEERQLLLADYGFTCKCPRCVEEAP 353
DG+ATI+ALRPI KEEE+TISYIDEDLP++ERQ LLADYGF CKCP+C+EE P
Sbjct: 412 DGQATIIALRPIFKEEEVTISYIDEDLPFDERQALLADYGFRCKCPKCLEEEP 464

>ref|XP_003540318.1| PREDICTED: histone-lysine N-methyltransferase ATXR2-like [Glycine max]
Length = 484

Score = 567 bits (1461), Expect = e-159
Identities = 283/431 (65%), Positives = 330/431 (76%), Gaps = 18/431 (4%)
Frame = -3

Query: 1591 GVYSEVDFREDDLILKDQMLVGAQHSSNKVNCMVCSFCFQFIGSIELQIGRKLYLE---- 1424
G+Y+++DF+E +L+LKD MLVGAQH NK++C+VCSFCF FIGSIELQIGR+LY++
Sbjct: 54 GLYADMDFKEGELVLKDPMLVGAQHPLNKIDCLVCSFCFCFIGSIELQIGRRLYMQHLRA 113

Query: 1423 ------ELGISA------DXXXXXXXXXXXXXXXEKTR--LSHNTIQSLMDGCLRLPYSE 1286
E+G S+ D KT+ L ++SLM+G L LP+SE
Sbjct: 114 NESHGCEVGSSSKHCHEMDSSDEEESTQQCTSGSSKTKVPLPEGIVESLMNGQLVLPFSE 173

Query: 1285 NFPLPPVFPCLGGCKEAYYCSKSCAQADWDSFHALLCIGHGSSSPNREALLKFMKHADET 1106
F LPP PC GGC EAYYCS SCA+ADW S H+LLC G S S REALLKF+KHA+ET
Sbjct: 174 KFSLPPAVPCPGGCGEAYYCSMSCAEADWGSSHSLLCTGESSDSARREALLKFIKHANET 233

Query: 1105 NDIFFPAAKVISSTILRYRKLKAARVEQQGKHVMSNPHDICIFPLLLEAWKPVSMGFKRR 926
NDIF AAK ISST+L YRKLKA +E+Q KH S + C +LLEAWKP+SMG KRR
Sbjct: 234 NDIFLLAAKAISSTMLMYRKLKAVSLEEQMKHNTSCVSNHCNLSILLEAWKPISMGHKRR 293

Query: 925 WWDCIALPDDVDSCDEADFRMQIKDLAFESLQLLKEAIYDRECAPLFSLDIYGHIIGMFE 746
WWDCIALPDDVDS DEA FR+QIK LAFESLQLLK AI+D+EC PLFSL+IYG+IIGMFE
Sbjct: 294 WWDCIALPDDVDSSDEASFRLQIKMLAFESLQLLKTAIFDKECEPLFSLEIYGNIIGMFE 353

Query: 745 LNNLDLVVASPVEDYFLYIDDLPSSQKEEAEKTTKPFLDALGEDYSVSCEGTAFFPLQSC 566
LNNLDLVVASPVEDYFLYIDDL KEEAEK T+P LDALGE+YS+ CEGTAFFPLQSC
Sbjct: 354 LNNLDLVVASPVEDYFLYIDDLTYPNKEEAEKITQPVLDALGEEYSIYCEGTAFFPLQSC 413

Query: 565 MNHSCIPNAKAFKREEDRDGRATILALRPISKEEEITISYIDEDLPYEERQLLLADYGFT 386
+NHSC PNAKAFKREED+DG+ATI+A R I K EEITISY+DEDL +EERQ LADYGF
Sbjct: 414 LNHSCCPNAKAFKREEDKDGQATIIAQRSICKGEEITISYVDEDLTFEERQASLADYGFR 473

Query: 385 CKCPRCVEEAP 353
C+C +C+EE P
Sbjct: 474 CRCSKCIEEEP 484

>ref|XP_002885468.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297331308|gb|EFH61727.1| SET domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 471

Score = 552 bits (1422), Expect = e-154
Identities = 274/425 (64%), Positives = 331/425 (77%), Gaps = 13/425 (3%)
Frame = -3

Query: 1591 GVYSEVDFREDDLILKDQMLVGAQHSSNKVNCMVCSFCFQFIGSIELQIGRKLYLEELGI 1412
GVY +F+ED+LILKDQ+LVG QHSSNKV+C+VCSFCF+F+GSIE QIGRKLY + LG+
Sbjct: 50 GVYVNSEFQEDELILKDQILVGIQHSSNKVDCLVCSFCFRFVGSIEKQIGRKLYFKNLGV 109

Query: 1411 SA--------DXXXXXXXXXXXXXXXEKTRLSHNT-----IQSLMDGCLRLPYSENFPLP 1271
S + SHNT + SLM+G + LPY++ FPLP
Sbjct: 110 SGCCDGDSSESGEDECVKYNGNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPYTDMFPLP 169

Query: 1270 PVFPCLGGCKEAYYCSKSCAQADWDSFHALLCIGHGSSSPNREALLKFMKHADETNDIFF 1091
C GGC+EA+YCS+SCA+ADW+S H+LLC G S S +REAL +F+KHA++TNDIF
Sbjct: 170 SPLSCPGGCQEAFYCSESCAEADWESSHSLLCTGEKSESNSREALGEFIKHANDTNDIFL 229

Query: 1090 PAAKVISSTILRYRKLKAARVEQQGKHVMSNPHDICIFPLLLEAWKPVSMGFKRRWWDCI 911
AAK I+ TILRYRKLKA V+++ K S P LLLEAWKPVS+G+KRRWWDCI
Sbjct: 230 LAAKAIAFTILRYRKLKAEHVDKKAKQ--SEPKQ----SLLLEAWKPVSIGYKRRWWDCI 283

Query: 910 ALPDDVDSCDEADFRMQIKDLAFESLQLLKEAIYDRECAPLFSLDIYGHIIGMFELNNLD 731
ALPDDVD DE FRMQIK+LA SL+LLK AI+D+EC LFSL+IYG+IIGMFELNNLD
Sbjct: 284 ALPDDVDLSDEGAFRMQIKNLACTSLELLKTAIFDKECEALFSLEIYGNIIGMFELNNLD 343

Query: 730 LVVASPVEDYFLYIDDLPSSQKEEAEKTTKPFLDALGEDYSVSCEGTAFFPLQSCMNHSC 551
LVVASPVEDYFLYIDDLP ++KEEAE+ T+PFLDALG++YS C+GTAFFPLQSCMNHSC
Sbjct: 344 LVVASPVEDYFLYIDDLPDAEKEEAEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNHSC 403

Query: 550 IPNAKAFKREEDRDGRATILALRPISKEEEITISYIDEDLPYEERQLLLADYGFTCKCPR 371
PNAKAFKREED+DG+A I+ALR ISK EE+TISYIDE+LPY+ERQ LLADYGF+CKC +
Sbjct: 404 CPNAKAFKREEDKDGQAVIIALRRISKNEEVTISYIDEELPYKERQALLADYGFSCKCSK 463

Query: 370 CVEEA 356
C+E++
Sbjct: 464 CLEDS 468

>ref|NP_188819.2| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
gi|75251251|sp|Q5PP37.1|ATXR2_ARATH RecName: Full=Histone-lysine N-methyltransferase ATXR2; AltName: Full=Protein SET DOMAIN GROUP 36; AltName: Full=Trithorax-related protein 2; Short=TRX-related protein 2
gi|56236050|gb|AAV84481.1| At3g21820 [Arabidopsis thaliana]
gi|59958344|gb|AAX12882.1| At3g21820 [Arabidopsis thaliana]
gi|62320769|dbj|BAD95436.1| hypothetical protein [Arabidopsis thaliana]
gi|332643034|gb|AEE76555.1| histone-lysine N-methyltransferase ATXR2 [Arabidopsis thaliana]
Length = 473

Score = 549 bits (1414), Expect = e-154
Identities = 273/423 (64%), Positives = 330/423 (78%), Gaps = 11/423 (2%)
Frame = -3

Query: 1591 GVYSEVDFREDDLILKDQMLVGAQHSSNKVNCMVCSFCFQFIGSIELQIGRKLYLEELGI 1412
GVY+ +F ED+LILKD++LVG QHSSNKV+C+VCSFCF+FIGSIE QIGRKLY + LG+
Sbjct: 54 GVYANSEFDEDELILKDEILVGIQHSSNKVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGV 113

Query: 1411 S------ADXXXXXXXXXXXXXXXEKTRLSHNT-----IQSLMDGCLRLPYSENFPLPPV 1265
S + + SHNT + SLM+G + LP+++ FPLP
Sbjct: 114 SGCCDDDSSEEDECVKYNGNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSP 173

Query: 1264 FPCLGGCKEAYYCSKSCAQADWDSFHALLCIGHGSSSPNREALLKFMKHADETNDIFFPA 1085
C GGC+EA+YCS+SCA ADW+S H+LLC G S S +REAL +F+KHA++TNDIF A
Sbjct: 174 LSCPGGCQEAFYCSESCAAADWESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLA 233

Query: 1084 AKVISSTILRYRKLKAARVEQQGKHVMSNPHDICIFPLLLEAWKPVSMGFKRRWWDCIAL 905
AK I+ TILRYRKLKA V+++ K S P LLLEAWKPVS+G+KRRWWDCIAL
Sbjct: 234 AKAIAFTILRYRKLKAEHVDKKAKQ--SEPKQ----SLLLEAWKPVSIGYKRRWWDCIAL 287

Query: 904 PDDVDSCDEADFRMQIKDLAFESLQLLKEAIYDRECAPLFSLDIYGHIIGMFELNNLDLV 725
PDDVD DE FRMQIK+LA SL+LLK AI+D+EC LFSL+IYG+IIGMFELNNLDLV
Sbjct: 288 PDDVDPTDEGAFRMQIKNLACTSLELLKIAIFDKECEALFSLEIYGNIIGMFELNNLDLV 347

Query: 724 VASPVEDYFLYIDDLPSSQKEEAEKTTKPFLDALGEDYSVSCEGTAFFPLQSCMNHSCIP 545
VASPVEDYFLYIDDLP ++KEE E+ T+PFLDALG++YS C+GTAFFPLQSCMNHSC P
Sbjct: 348 VASPVEDYFLYIDDLPDAEKEETEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNHSCCP 407

Query: 544 NAKAFKREEDRDGRATILALRPISKEEEITISYIDEDLPYEERQLLLADYGFTCKCPRCV 365
NAKAFKREEDRDG+A I+ALR ISK EE+TISYIDE+LPY+ERQ LLADYGF+CKC +C+
Sbjct: 408 NAKAFKREEDRDGQAVIIALRRISKNEEVTISYIDEELPYKERQALLADYGFSCKCSKCL 467

Query: 364 EEA 356
E++
Sbjct: 468 EDS 470

>dbj|BAB02844.1| unnamed protein product [Arabidopsis thaliana]
Length = 565

Score = 545 bits (1405), Expect = e-152
Identities = 273/427 (63%), Positives = 331/427 (77%), Gaps = 15/427 (3%)
Frame = -3

Query: 1591 GVYSEVDFREDDLILKDQMLVGAQHSSNKVNCMVCSFCFQFIGSIELQIGRKLYLEELGI 1412
GVY+ +F ED+LILKD++LVG QHSSNKV+C+VCSFCF+FIGSIE QIGRKLY + LG+
Sbjct: 142 GVYANSEFDEDELILKDEILVGIQHSSNKVDCLVCSFCFRFIGSIEKQIGRKLYFKNLGV 201

Query: 1411 S------ADXXXXXXXXXXXXXXXEKTRLSHNT-----IQSLMDGCLRLPYSENFPLPPV 1265
S + + SHNT + SLM+G + LP+++ FPLP
Sbjct: 202 SGCCDDDSSEEDECVKYNGNEEQCGGSSSSHNTLPEGVVSSLMNGEMALPHTDKFPLPSP 261

Query: 1264 FPCLGGCKEAYYCSKSCAQADWDSFHALLCIGHGSSSPNREALLKFMKHADETNDIFFPA 1085
C GGC+EA+YCS+SCA ADW+S H+LLC G S S +REAL +F+KHA++TNDIF A
Sbjct: 262 LSCPGGCQEAFYCSESCAAADWESSHSLLCTGERSESISREALGEFIKHANDTNDIFLLA 321

Query: 1084 AKVISSTILRYRKLKAARVEQQGKHVMSNPHDICIFPLLLEAWKPVSMGFKRRWWDCIAL 905
AK I+ TILRYRKLKA V+++ K S P LLLEAWKPVS+G+KRRWWDCIAL
Sbjct: 322 AKAIAFTILRYRKLKAEHVDKKAKQ--SEPKQ----SLLLEAWKPVSIGYKRRWWDCIAL 375

Query: 904 PDDVDSCDEADFRMQIKDLAFESLQLLKEAIYDRECA----PLFSLDIYGHIIGMFELNN 737
PDDVD DE FRMQIK+LA SL+LLK AI+D+EC P+FSL+IYG+IIGMFELNN
Sbjct: 376 PDDVDPTDEGAFRMQIKNLACTSLELLKIAIFDKECEARIPPMFSLEIYGNIIGMFELNN 435

Query: 736 LDLVVASPVEDYFLYIDDLPSSQKEEAEKTTKPFLDALGEDYSVSCEGTAFFPLQSCMNH 557
LDLVVASPVEDYFLYIDDLP ++KEE E+ T+PFLDALG++YS C+GTAFFPLQSCMNH
Sbjct: 436 LDLVVASPVEDYFLYIDDLPDAEKEETEEITRPFLDALGDEYSDCCQGTAFFPLQSCMNH 495

Query: 556 SCIPNAKAFKREEDRDGRATILALRPISKEEEITISYIDEDLPYEERQLLLADYGFTCKC 377
SC PNAKAFKREEDRDG+A I+ALR ISK EE+TISYIDE+LPY+ERQ LLADYGF+CKC
Sbjct: 496 SCCPNAKAFKREEDRDGQAVIIALRRISKNEEVTISYIDEELPYKERQALLADYGFSCKC 555

Query: 376 PRCVEEA 356
+C+E++
Sbjct: 556 SKCLEDS 562