BLASTX nr result
ID: Angelica23_contig00018001
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00018001
         (1142 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like ...   413   e-113
ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus ...   388   e-105
ref|XP_002305239.1| SET domain protein [Populus trichocarpa] gi|...   387   e-105
emb|CBI27360.3| unnamed protein product [Vitis vinifera]              374   e-101
ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like ...   370   e-100

>ref|XP_002269094.2| PREDICTED: protein SET DOMAIN GROUP 40-like [Vitis vinifera]
          Length = 504

 Score = 413 bits (1062), Expect = e-113
 Identities = 198/344 (57%), Positives = 254/344 (73%), Gaps = 4/344 (1%)
 Frame = +2

Query: 50   VEDAIWAAEKAIGKAKSEWKEAILLMNDLKIKNKLQSLRAWLWASGTISSRTLHIPWDEA 229
            V+DAIW  E+AI KA+ EWK+AI LM +LK+K +LQ+ RAWLWAS T+SSRT+HIPWD+A
Sbjct: 142  VDDAIWVTERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDA 201

Query: 230  GCLCPVGDLFNYAAPGEELGECDDLYTHGNASCMSTSSC---RVTESPVVEHCDD-SVRL 397
            GCLCPVGD +NYAAPGEE    +DL    N S +  SS      T +   E  D  S RL
Sbjct: 202  GCLCPVGDFYNYAAPGEEPCGWEDLKGSRNESSLQDSSFWNKDATSNSDAEQDDVLSQRL 261

Query: 398  TDGGYEKDIGSYCFYARKSYRKGQQVLLSYGTYTNLELLEHYGFILNRNPNDKAFIPLEP 577
            TDGGY++D+ +YCFYARK+Y+KG+QVLLSYGTYTNLELLEHYGF+L+ NPNDKAFIPLEP
Sbjct: 262  TDGGYKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEP 321

Query: 578  EMYSCCSWPHNSLFIHQDGKPSFGLLSTIRLWATPPNQRKSIGHIALSGSQISKENELFT 757
            E+Y+  SWP +SL+IHQ+GKPSF LLS +RLWATP +QR+S+GH+  SG+Q+S ENE+F
Sbjct: 322  EVYASSSWPKDSLYIHQNGKPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFV 381

Query: 758  MEWIAKKCHFILKNFGTTIKEDNLLLATIDNNCVSKSTMEFEEMPSMIKFEIQGFLNVVD 937
            MEWIAK CH +L+N  T+++ED+LLL  +D        ME          E   FL   D
Sbjct: 382  MEWIAKSCHVVLENLPTSVEEDSLLLCALDKMQDPDLPMEVGNALRSSGVEFSAFLEAHD 441

Query: 938  VPVGEIGGNIDLCRQVKNSVDRWKLAVEWRVRYKKILVDCISHC 1069
            + +G+    + L  + + S++RWKLAV+WR+R+K+ILVDCIS C
Sbjct: 442  LKIGDGNVGLLLSEKARRSMERWKLAVQWRLRHKRILVDCISRC 485

>ref|XP_002532790.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
 gi|223527460|gb|EEF29592.1| Protein SET DOMAIN GROUP, putative [Ricinus communis]
          Length = 510

 Score = 388 bits (996), Expect = e-105
 Identities = 193/352 (54%), Positives = 255/352 (72%), Gaps = 7/352 (1%)
 Frame = +2

Query: 50   VEDAIWAAEKAIGKAKSEWKEAILLMNDLKIKNKLQSLRAWLWASGTISSRTLHIPWDEA 229
            V+DAIW AEKAI KA+ + KEA  LM +L++K +  +LRAW+WA  TISSRT+HIPWDEA
Sbjct: 148  VDDAIWTAEKAISKAELDRKEAYSLMQELRLKPQFLTLRAWIWACATISSRTMHIPWDEA 207

Query: 230  GCLCPVGDLFNYAAPGEELGECDDLYTHGNASCM---STSSCRVTESPVVEHCDDSVR-L 397
            GCLCPVGD FNYAAPGEE    ++  +   ASC+   S SS R T +   E  D  ++ L
Sbjct: 208  GCLCPVGDFFNYAAPGEESSSPENDESWKPASCLEDASLSSERSTSNFCSETFDVQLKSL 267

Query: 398  TDGGYEKDIGSYCFYARKSYRKGQQVLLSYGTYTNLELLEHYGFILNRNPNDKAFIPLEP 577
            TDGG+++D  +YCFYAR++Y+KG QVLLSYGTYTNLELLEHYGF+LN NPNDK FIPLE
Sbjct: 268  TDGGFDEDKAAYCFYARQNYKKGAQVLLSYGTYTNLELLEHYGFLLNENPNDKVFIPLEL 327

Query: 578  EMYSCCSWPHNSLFIHQDGKPSFGLLSTIRLWATPPNQRKSIGHIALSGSQISKENELFT 757
             M S  +WP  S++IHQDGKPSF LL  +RLWATP N+R+S+GH+A SGSQ+S ENE+
Sbjct: 328  SMQSSNTWPKESMYIHQDGKPSFSLLCALRLWATPSNRRRSMGHLAYSGSQLSVENEVSI 387

Query: 758  MEWIAKKCHFILKNFGTTIKEDNLLLATIDNNCVSKSTMEFEEMPSMIKFEIQGFL---N 928
            ++WI++KCH +LK   TT++ED+LLL+ ID      S +E  +M    + +   F+   N
Sbjct: 388  LKWISRKCHAVLKKLPTTVEEDSLLLSAIDKIQNCHSPLELGKMLHGFEGQASAFVEAHN 447

Query: 929  VVDVPVGEIGGNIDLCRQVKNSVDRWKLAVEWRVRYKKILVDCISHCIGRID 1084
            ++++ +G    +  LC + K S++RWKLAV+WR+ YKK L+DCIS+C   ID
Sbjct: 448  LLNIKIGT--ESTMLCGKAKRSMERWKLAVKWRLSYKKTLIDCISYCTEVID 497

>ref|XP_002305239.1| SET domain protein [Populus trichocarpa]
 gi|222848203|gb|EEE85750.1| SET domain protein [Populus trichocarpa]
          Length = 518

 Score = 387 bits (995), Expect = e-105
 Identities = 191/343 (55%), Positives = 245/343 (71%), Gaps = 5/343 (1%)
 Frame = +2

Query: 56   DAIWAAEKAIGKAKSEWKEAILLMNDLKIKNKLQSLRAWLWASGTISSRTLHIPWDEAGC 235
            D + + +KA+ KAKSEWKEA  LM+ LK+K +L + RAW+WAS TISSR LHIPWDEAGC
Sbjct: 162  DVLASFKKAVSKAKSEWKEANSLMDALKLKPQLLTFRAWIWASATISSRALHIPWDEAGC 221

Query: 236  LCPVGDLFNYAAPGEELGECDDLYTHGNASCMSTSSC---RVTESPVVEHCDDSV-RLTD 403
            LCPVGDLFNYAAPGEE  + +++    NAS +  SS      T+  + +  D  + RLTD
Sbjct: 222  LCPVGDLFNYAAPGEESNDLENVVHWMNASSLEDSSLSNGETTDDFIGDQPDIGLERLTD 281

Query: 404  GGYEKDIGSYCFYARKSYRKGQQVLLSYGTYTNLELLEHYGFILNRNPNDKAFIPLEPEM 583
            GG+++++ +YCFYARK+Y+KG QVLL YGTYTNLELLEHYGF+LN NPNDK FIPLEP M
Sbjct: 282  GGFDENMAAYCFYARKNYKKGTQVLLGYGTYTNLELLEHYGFLLNENPNDKVFIPLEPSM 341

Query: 584  YSCCSWPHNSLFIHQDGKPSFGLLSTIRLWATPPNQRKSIGHIALSGSQISKENELFTME 763
            YS  SWP  S++IHQDGKPSF LLS +RLWATPPNQR+SI H+  SGS++S  NE+  ++
Sbjct: 342  YSFISWPKVSMYIHQDGKPSFALLSALRLWATPPNQRRSISHLVYSGSRLSVYNEISVLK 401

Query: 764  WIAKKCHFILKNFGTTIKEDNLLLATIDN-NCVSKSTMEFEEMPSMIKFEIQGFLNVVDV 940
            WI+K C  IL N  T I+ED+LLL+TI+      K T    E+      E + FL   D+
Sbjct: 402  WISKNCAMILSNLPTVIEEDSLLLSTINKIENFDKPT----ELVCTSGGEARAFLEASDL 457

Query: 941  PVGEIGGNIDLCRQVKNSVDRWKLAVEWRVRYKKILVDCISHC 1069
              G+ G  +    + K  ++RWKLAV+WR+ YKK L+DCIS+C
Sbjct: 458  QKGKNGSELMFSGKTKRVIERWKLAVQWRISYKKTLIDCISYC 500

>emb|CBI27360.3| unnamed protein product [Vitis vinifera]
          Length = 449

 Score = 374 bits (959), Expect = e-101
 Identities = 182/340 (53%), Positives = 230/340 (67%)
 Frame = +2

Query: 50   VEDAIWAAEKAIGKAKSEWKEAILLMNDLKIKNKLQSLRAWLWASGTISSRTLHIPWDEA 229
            V+DAIW  E+AI KA+ EWK+AI LM +LK+K +LQ+ RAWLWAS T+SSRT+HIPWD+A
Sbjct: 142  VDDAIWVTERAILKAELEWKKAIPLMEELKLKPQLQNFRAWLWASSTVSSRTMHIPWDDA 201

Query: 230  GCLCPVGDLFNYAAPGEELGECDDLYTHGNASCMSTSSCRVTESPVVEHCDDSVRLTDGG 409
            GCLCPVGD +NYAAPGEE    +DL        +S                   RLTDGG
Sbjct: 202  GCLCPVGDFYNYAAPGEEPCGWEDLKDAEQDDVLSQ------------------RLTDGG 243

Query: 410  YEKDIGSYCFYARKSYRKGQQVLLSYGTYTNLELLEHYGFILNRNPNDKAFIPLEPEMYS 589
            Y++D+ +YCFYARK+Y+KG+QVLLSYGTYTNLELLEHYGF+L+ NPNDKAFIPLEPE+Y+
Sbjct: 244  YKEDLAAYCFYARKNYKKGEQVLLSYGTYTNLELLEHYGFLLDENPNDKAFIPLEPEVYA 303

Query: 590  CCSWPHNSLFIHQDGKPSFGLLSTIRLWATPPNQRKSIGHIALSGSQISKENELFTMEWI 769
              SWP +SL+IHQ+GKPSF LLS +RLWATP +QR+S+GH+  SG+Q+S ENE+F MEWI
Sbjct: 304  SSSWPKDSLYIHQNGKPSFALLSALRLWATPASQRRSVGHLVYSGTQLSSENEIFVMEWI 363

Query: 770  AKKCHFILKNFGTTIKEDNLLLATIDNNCVSKSTMEFEEMPSMIKFEIQGFLNVVDVPVG 949
            AK CH +L+N  T+++ED+LLL
Sbjct: 364  AKSCHVVLENLPTSVEEDSLLL-------------------------------------- 385

Query: 950  EIGGNIDLCRQVKNSVDRWKLAVEWRVRYKKILVDCISHC 1069
                          S++RWKLAV+WR+R+K+ILVDCIS C
Sbjct: 386  --------------SMERWKLAVQWRLRHKRILVDCISRC 411

>ref|XP_003544959.1| PREDICTED: protein SET DOMAIN GROUP 40-like [Glycine max]
          Length = 475

 Score = 370 bits (950), Expect = e-100
 Identities = 179/344 (52%), Positives = 237/344 (68%), Gaps = 4/344 (1%)
 Frame = +2

Query: 50   VEDAIWAAEKAIGKAKSEWKEAILLMNDLKIKNKLQSLRAWLWASGTISSRTLHIPWDEA 229
            V++A+W  EKA+ KAKSEWKEA  LM DL  K +  + +AW+WA+ TISSRTLHIPWDEA
Sbjct: 146  VDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATISSRTLHIPWDEA 205

Query: 230  GCLCPVGDLFNYAAPGEELGECDDLYTHGNASCMSTSSCRVTESPVVEHCDD----SVRL 397
            GCLCPVGDLFNY APG E    +DL                      +H +     S RL
Sbjct: 206  GCLCPVGDLFNYDAPGIEPSGIEDL----------------------DHAEQLDSHSWRL 243

Query: 398  TDGGYEKDIGSYCFYARKSYRKGQQVLLSYGTYTNLELLEHYGFILNRNPNDKAFIPLEP 577
            TDGG+E+D  +YCFYAR+ Y+KG QVLL YGTYTNLELLEHYGF+L  NPNDK FIPLEP
Sbjct: 244  TDGGFEEDANAYCFYAREHYKKGDQVLLCYGTYTNLELLEHYGFLLQENPNDKVFIPLEP 303

Query: 578  EMYSCCSWPHNSLFIHQDGKPSFGLLSTIRLWATPPNQRKSIGHIALSGSQISKENELFT 757
             +YS  SW   SL+IH +GKPSF LL+ +RLWATP N+R+S+GH+  SGS++S +NE+F
Sbjct: 304  ALYSSTSWSKESLYIHHNGKPSFALLAALRLWATPQNRRRSVGHLVYSGSRVSTDNEIFI 363

Query: 758  MEWIAKKCHFILKNFGTTIKEDNLLLATIDNNCVSKSTMEFEEMPSMIKFEIQGFLNVVD 937
            M+W++K C  +L+N  T+++ED LLL  +DN+    + ME  ++ S  + E   FL   +
Sbjct: 364  MKWLSKTCDAVLRNLPTSLEEDTLLLNAMDNSQDFSTFMEITKLVSS-REETYTFLETHN 422

Query: 938  VPVGEIGGNIDLCRQVKNSVDRWKLAVEWRVRYKKILVDCISHC 1069
            +       ++ L R+ + S+DRWKLAV+WR++YKK++ DCIS+C
Sbjct: 423  MKDTHSFTDVILSRKARRSMDRWKLAVQWRLKYKKVIFDCISYC 466