BLASTX nr result
ID: Angelica23_contig00003209
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Angelica23_contig00003209 (3774 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodo... 1416 0.0 emb|CBI22504.3| unnamed protein product [Vitis vinifera] 446 e-122 ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit... 446 e-122 ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|2... 420 e-114 ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Gly... 407 e-111 >sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHP gi|666128|gb|AAA62237.1| homeodomain protein [Petroselinum crispum] Length = 1088 Score = 1416 bits (3665), Expect = 0.0 Identities = 753/1040 (72%), Positives = 794/1040 (76%), Gaps = 28/1040 (2%) Frame = +1 Query: 628 SVGAKIVRSAEDSTKLLSCKDFAEDMKLCDSGSMQLHDESSIGISLIPKQATLSHTH--E 801 S +IVRS ED TKL+ C DFAED+KL DS MQ ESSIGI LIPKQ T+SH H E Sbjct: 50 SCWCEIVRSPEDLTKLVPCNDFAEDIKLFDSDPMQQEAESSIGIPLIPKQVTMSHNHDHE 109 Query: 802 SGSEMVNNEVMQENHVIATEYTYKKSEFDRINMEQKETIPEEVIHNSFLEFSTSPIDIQS 981 SGSEMV+NEVMQENHVIATE TY+KS+FDRINM QKET+PEEVIH SFLE STS IDI Sbjct: 110 SGSEMVSNEVMQENHVIATENTYQKSDFDRINMGQKETMPEEVIHKSFLESSTSSIDILL 169 Query: 982 RNHNSDQSGLPPENAAKDCKPIQLGHRSDDATKNSGLEELVIGQKTVARSPSQLVXXXXX 1161 NHNS QSGLPPENA DCK +QLGHRSDDA KNSGL ELVIGQK VA+SPSQLV Sbjct: 170 NNHNSYQSGLPPENAVTDCKQVQLGHRSDDAIKNSGLVELVIGQKNVAKSPSQLVETGKR 229 Query: 1162 XXXXXXKVQTGLEQLVPGQKTAAKSSSQLGDTGKRSRGRPRKVHNSPTSFMENINMEQKE 1341 KVQTGLEQLV GQKTAAKSSSQLGDTGKRSRGRPRKV NSPTSF+ENINMEQKE Sbjct: 230 
GRGRPRKVQTGLEQLVIGQKTAAKSSSQLGDTGKRSRGRPRKVQNSPTSFLENINMEQKE 289 Query: 1342 TISEQVTQNSFLESSTFPIDNQSRTYNSDQSGLPPENAAKDCKHIQFGHQSDDATKIYGL 1521 TI EQVTQNS LES T P DNQSRTYNSDQS LPPENAAK+C H QFGHQSDD TKI G Sbjct: 290 TIPEQVTQNSILESLTIPTDNQSRTYNSDQSELPPENAAKNCNHAQFGHQSDDTTKISGF 349 Query: 1522 EELVIGQETGAKSPSQLVDAXXXXXXXXXXXXXXLEQLVPGQKTAAKSFSQLGDTGKRSR 1701 +ELVIGQET AKSPSQLVDA LEQLVP Q+TAAKS SQLGDTGKRSR Sbjct: 350 KELVIGQETVAKSPSQLVDAGKRGRGRPRKVQTGLEQLVPVQETAAKSSSQLGDTGKRSR 409 Query: 1702 GRPRKVQNSPTSLGGSVNVLPEKRKDSQXXXXXXXXXXXXXXXXXXXXPDFSNFVAEEGA 1881 GRPRKVQ+SPTSLGG+V V+PEK KDSQ PD +N VA+EGA Sbjct: 410 GRPRKVQDSPTSLGGNVKVVPEKGKDSQELSVNSSRSLRSRSQEKSIEPDVNNIVADEGA 469 Query: 1882 DXXXXXXXXXXXXXXXXVDEFSRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKP 2061 D VDEF RIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKP Sbjct: 470 DREKPRKKRKKRMEENRVDEFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKP 529 Query: 2062 EKELKRAKAEIFGRKLKIRDLFQHLDLSRSEGRLPEFLFDSQGEIDSEDIFCAKCGSKDV 2241 EKELKRAKAEIFGRKLKIRDLFQ LDL+RSEGRLPE LFDS+GEIDSEDIFCAKCGSKDV Sbjct: 530 EKELKRAKAEIFGRKLKIRDLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDV 589 Query: 2242 TLSNDIILCDGACDRGFHQFCLDPPLLKEYIPPDDEGWLCPGCKCKIDCIKLLNDSQETN 2421 TLSNDIILCDGACDRGFHQFCLDPPLLKEYIPPDDEGWLCPGC+CKIDCIKLLNDSQETN Sbjct: 590 TLSNDIILCDGACDRGFHQFCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETN 649 Query: 2422 ILLSDSWETIFXXXXXXXXSGKNVDDNSGXXXXXXXXXXXXXXXXXXXXKVQGDDSSTDE 2601 ILL DSWE +F SGKN+DDNSG KVQGDDSSTDE Sbjct: 650 ILLGDSWEKVFAEEAAAAASGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDE 709 Query: 2602 SDYQSASDDMQVLPQKESNCGLXXXXXXXXXXXXXALVTDQMFKDSSCSDFTSDSEDFTG 2781 SDYQS SDDMQV+ QK S GL LVTDQM+KDSSCSDFTSDSEDFTG Sbjct: 710 SDYQSESDDMQVIRQKNSR-GLPSDDSEDDEYDPSGLVTDQMYKDSSCSDFTSDSEDFTG 768 Query: 2782 VIDDCKHTGKAQVSLTSTPHHVRNNEEGCGHPELGDTAPLYPRRQVESLDYKKLHD---- 2949 V DD K TGKAQ L STP HVRNNEEGCGHPE GDTAPLYPRRQVESLDYKKL+D Sbjct: 769 VFDDYKDTGKAQGPLASTPDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDIEFS 828 Query: 2950 ----------------------EEYGNTSSDSSDEDYMVTSSPDKKNSDKEATVLLNFGS 3063 
EEYGNTSSDSSDEDYMVTSSPDK NSDKEAT + Sbjct: 829 KMCDILDILSSQLDVIICTGNQEEYGNTSSDSSDEDYMVTSSPDKNNSDKEATAM----- 883 Query: 3064 VTTVHGKESSDLDLDKKASESTHNRRTVKNFAVEGTXXXXXXXXXXXAAPVTSSKSTSKT 3243 G+ES DL+LD+KA ESTHNRR +K FAVEGT AAPV SKSTSKT Sbjct: 884 ---ERGRESGDLELDQKARESTHNRRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSKT 940 Query: 3244 LFGEHATQRLLQSFKENQYPQRAVKESLAAELALSVQQVSRWFNNTRWSFRHSSRFASNV 3423 L GEHATQRLLQSFKENQYPQRAVKESLAAELALSV+QVS WFNN RWSFRHSSR S+V Sbjct: 941 LHGEHATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIGSDV 1000 Query: 3424 AEFASNEGTTHQKSINMSGSSLKSVLDNATCSSEVKKKEQDMESLGLTEGCDRYMTLNMV 3603 A+F SN+ T QKSI+MSG SLKSVLD+AT SE++KKEQD SLGLTEGCDRYMTLNMV Sbjct: 1001 AKFDSND-TPRQKSIDMSGPSLKSVLDSAT-YSEIEKKEQDTASLGLTEGCDRYMTLNMV 1058 Query: 3604 ADEGNGHTPCITETREEITQ 3663 ADEGN HTPCI ETREE T+ Sbjct: 1059 ADEGNVHTPCIAETREEKTE 1078 >emb|CBI22504.3| unnamed protein product [Vitis vinifera] Length = 977 Score = 446 bits (1147), Expect = e-122 Identities = 250/515 (48%), Positives = 313/515 (60%), Gaps = 27/515 (5%) Frame = +1 Query: 1936 DEFSRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKI 2115 DEF+RIR HLRYLL+R+ YE+N +DAYS EGWKGQS++K+KPEKEL+RA +EI RKL+I Sbjct: 218 DEFARIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQI 277 Query: 2116 RDLFQHLDLSRSEGRLPEFLFDSQGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFH 2295 RDLFQHLD +EGR PE LFDS+G+IDSEDIFCAKC SKD++ NDIILCDGACDRGFH Sbjct: 278 RDLFQHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFH 337 Query: 2296 QFCLDPPLLKEYIPPDDEGWLCPGCKCKIDCIKLLNDSQETNILLSDSWETIFXXXXXXX 2475 QFCL+PPLLKE IPPDDEGWLCP C CK+DC+ LLNDSQ T + + DSWE +F Sbjct: 338 QFCLEPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVF---PEAA 394 Query: 2476 XSGKNVDDNSGXXXXXXXXXXXXXXXXXXXXKVQGDDSS------------TDESDYQSA 2619 +G N D+NSG K QGD SS +DESD+ SA Sbjct: 395 AAGNNQDNNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSA 454 Query: 2620 SDDMQVLPQKESNCGLXXXXXXXXXXXXXA-LVTDQMFKDSSCSDFTSDSEDFTGVIDDC 2796 SDDM V P E GL A + +Q+ + SS SDFTSDSEDFT +D Sbjct: 455 
SDDMVVSPNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDR- 513 Query: 2797 KHTGKAQVSLTSTPHHVRNNEEGCGHPELG--------DTAPLYPRRQVESLDYKKLHDE 2952 ++ + L R ++ L D APL +R VE LDYKKLHDE Sbjct: 514 RNFSDNEDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDE 573 Query: 2953 EYGNTSSDSS-DEDYMVTSSPDKKN--SDKEATVLLNFGSVTTVHGKESSDLDLDKKASE 3123 YGN SSDSS DED+ P K+ S A+V N + T +G + D+ D +A+ Sbjct: 574 AYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 3124 STHNRRTVKNFAVEGTXXXXXXXXXXXAAPVTSSKSTSKTLF---GEHATQRLLQSFKEN 3294 T RRT + E T +P ++ + + ++ + GE T+RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 3295 QYPQRAVKESLAAELALSVQQVSRWFNNTRWSFRH 3399 QYP RA+KE LA EL ++ +QVS+WF N RWSFRH Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRH 728 >ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera] Length = 968 Score = 446 bits (1147), Expect = e-122 Identities = 250/515 (48%), Positives = 313/515 (60%), Gaps = 27/515 (5%) Frame = +1 Query: 1936 DEFSRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKI 2115 DEF+RIR HLRYLL+R+ YE+N +DAYS EGWKGQS++K+KPEKEL+RA +EI RKL+I Sbjct: 218 DEFARIRKHLRYLLNRMSYEQNLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQI 277 Query: 2116 RDLFQHLDLSRSEGRLPEFLFDSQGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFH 2295 RDLFQHLD +EGR PE LFDS+G+IDSEDIFCAKC SKD++ NDIILCDGACDRGFH Sbjct: 278 RDLFQHLDSLCAEGRFPESLFDSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFH 337 Query: 2296 QFCLDPPLLKEYIPPDDEGWLCPGCKCKIDCIKLLNDSQETNILLSDSWETIFXXXXXXX 2475 QFCL+PPLLKE IPPDDEGWLCP C CK+DC+ LLNDSQ T + + DSWE +F Sbjct: 338 QFCLEPPLLKEEIPPDDEGWLCPACDCKVDCMDLLNDSQGTKLSVIDSWEKVF---PEAA 394 Query: 2476 XSGKNVDDNSGXXXXXXXXXXXXXXXXXXXXKVQGDDSS------------TDESDYQSA 2619 +G N D+NSG K QGD SS +DESD+ SA Sbjct: 395 AAGNNQDNNSGFSSDDSEDNDYDPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSA 454 Query: 2620 SDDMQVLPQKESNCGLXXXXXXXXXXXXXA-LVTDQMFKDSSCSDFTSDSEDFTGVIDDC 2796 SDDM V P E GL A + +Q+ + SS SDFTSDSEDFT +D Sbjct: 455 
SDDMVVSPNNEQCLGLPSDDSEDDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDR- 513 Query: 2797 KHTGKAQVSLTSTPHHVRNNEEGCGHPELG--------DTAPLYPRRQVESLDYKKLHDE 2952 ++ + L R ++ L D APL +R VE LDYKKLHDE Sbjct: 514 RNFSDNEDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDE 573 Query: 2953 EYGNTSSDSS-DEDYMVTSSPDKKN--SDKEATVLLNFGSVTTVHGKESSDLDLDKKASE 3123 YGN SSDSS DED+ P K+ S A+V N + T +G + D+ D +A+ Sbjct: 574 AYGNVSSDSSDDEDWTENVIPRKRKNLSGNVASVSPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 3124 STHNRRTVKNFAVEGTXXXXXXXXXXXAAPVTSSKSTSKTLF---GEHATQRLLQSFKEN 3294 T RRT + E T +P ++ + + ++ + GE T+RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 3295 QYPQRAVKESLAAELALSVQQVSRWFNNTRWSFRH 3399 QYP RA+KE LA EL ++ +QVS+WF N RWSFRH Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRH 728 >ref|XP_002300247.1| predicted protein [Populus trichocarpa] gi|222847505|gb|EEE85052.1| predicted protein [Populus trichocarpa] Length = 930 Score = 420 bits (1080), Expect = e-114 Identities = 240/531 (45%), Positives = 315/531 (59%), Gaps = 26/531 (4%) Frame = +1 Query: 1936 DEFSRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKI 2115 DE+SRIR LRYLL+R+ YE++ + AYSGEGWKG SL+K+KPEKEL+RA +EI RK+KI Sbjct: 379 DEYSRIRARLRYLLNRMSYEQSLITAYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKI 438 Query: 2116 RDLFQHLDLSRSEGRLPEFLFDSQGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFH 2295 RDLFQH+D EGR P LFDS+G+IDSEDIFCAKCGSKD+T NDIILCDGACDRGFH Sbjct: 439 RDLFQHIDSLCGEGRFPASLFDSEGQIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFH 498 Query: 2296 QFCLDPPLLKEYIPPDDEGWLCPGCKCKIDCIKLLNDSQETNILLSDSWETIFXXXXXXX 2475 QFCL PPLL+E IPP DEGWLCPGC CK+DCI LLNDSQ TNI +SD W+ +F Sbjct: 499 QFCLVPPLLREDIPPGDEGWLCPGCDCKVDCIDLLNDSQGTNISISDRWDNVF-PEAAAV 557 Query: 2476 XSGKNVDDNSGXXXXXXXXXXXXXXXXXXXXKVQGDDSSTDESDYQSASDDMQVLPQKES 2655 SG+ +D N G K Q ++SS+DESD+ SASD+ + P + Sbjct: 558 ASGQKLDYNFGLSSDDSDDNDYDPDGPDIDEKSQ-EESSSDESDFSSASDEFEAPPDDKQ 616 Query: 2656 NCGLXXXXXXXXXXXXXALVTDQMFK-DSSCSDFTSDSEDFTGVIDDCKHTGKAQVSLTS 2832 GL A V ++ K +SS SDFTSDSED ++ + + + 
Sbjct: 617 YLGLPSDDSEDDDYDPDAPVLEEKLKQESSSSDFTSDSEDLDATLNGDGLSLGDEYHMPI 676 Query: 2833 TPHHVRNNE--------------------EGCGHPELGDTAPLYPRRQVESLDYKKLHDE 2952 PH N E H E +AP+ +R +E LDYKKL+DE Sbjct: 677 EPHEDSNGRRSRFGGKKNHSLNSKLLSMLEPDSHQE--KSAPVSGKRNIERLDYKKLYDE 734 Query: 2953 EYGNTSSDSSDEDYMVTSSPDK--KNSDKEATVLLNFGSVTTVHGKESSDLDLDKKASES 3126 YGN + SSD+D+ T +P K KN+ A + N + T +G S +++ + K +E Sbjct: 735 TYGNICT-SSDDDFTDTVAPRKRRKNTGDVAMGIANGDASVTENGLNSKNMNQELKKNEH 793 Query: 3127 THNRRTVKNFAVEGTXXXXXXXXXXXAAPVTSSKSTSKTLF---GEHATQRLLQSFKENQ 3297 T + RT +N + + T + +SSK + + GE TQ+L FKEN+ Sbjct: 794 T-SGRTHQNSSFQDTNVSPAKTHVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENR 852 Query: 3298 YPQRAVKESLAAELALSVQQVSRWFNNTRWSFRHSSRFASNVAEFASNEGT 3450 YP +A K SLA EL ++ +QV++WF N RWSF HSS ++ AE AS +G+ Sbjct: 853 YPDQAAKASLAEELGITFEQVNKWFMNARWSFNHSSPEGTSKAESASGKGS 903 >ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like [Glycine max] Length = 820 Score = 407 bits (1047), Expect = e-111 Identities = 238/523 (45%), Positives = 315/523 (60%), Gaps = 18/523 (3%) Frame = +1 Query: 1936 DEFSRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKI 2115 D+FSRIR+HLRYLL+RI YE + +DAYSGEGWKG S++K+KPEKEL+RAK+EI RKLKI Sbjct: 252 DQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRRKLKI 311 Query: 2116 RDLFQHLDLSRSEGRLPEFLFDSQGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFH 2295 RDLF++LD +EG+ PE LFDS GEIDSEDIFCAKC SK+++ +NDIILCDG CDRGFH Sbjct: 312 RDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFH 371 Query: 2296 QFCLDPPLLKEYIPPDDEGWLCPGCKCKIDCIKLLNDSQETNILLSDSWETIFXXXXXXX 2475 Q CLDPPLL E IPP DEGWLCPGC CK DC+ L+NDS T++ +SD+WE +F Sbjct: 372 QLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVF--PEAAS 429 Query: 2476 XSGKNVDDNSGXXXXXXXXXXXXXXXXXXXXKVQGDDSSTDESDYQSASDDMQVLPQKES 2655 +G N+D+N G K++GD+SS+DES+Y SAS+ ++ ++ Sbjct: 430 FAGNNMDNNLG-LPSDDSDDDDYNPNGSDDVKIEGDESSSDESEYASASEKLEGGSHEDQ 488 Query: 2656 NCGLXXXXXXXXXXXXXALVTD-QMFKDSSCSDFTSDSEDFT------------GVIDDC 2796 GL A D ++ ++SS SDFTSDSED G I+ Sbjct: 
489 YLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLAAAFEDNTSPGQDGGINSS 548 Query: 2797 KHTGK-AQVSLTSTPHHVRNNEEGCGHPELGDTAPLYPRRQVESLDYKKLHDEEYGNTSS 2973 K GK ++S+ + + G G P P+ +R VE LDYKKL++E Y + +S Sbjct: 549 KKKGKVGKLSMADELSSLLEPDSGQGGP-----TPVSGKRHVERLDYKKLYEETYHSDTS 603 Query: 2974 DSSDEDYMVTSSPDKKNSDKEATVLLNFGSVTTVHGKESSDLDLDKKASESTHN-RRTVK 3150 D DED+ ++P +K K+ T G+VT V ++ ++ S H +R Sbjct: 604 D--DEDWNDAAAPSRK---KKLT-----GNVTPVSPNANA-------SNNSIHTLKRNAH 646 Query: 3151 NFAVEGTXXXXXXXXXXXAAPVTSSK---STSKTLFGEHATQRLLQSFKENQYPQRAVKE 3321 VE T + + K S++ GE QRL +SFKENQYP R+ KE Sbjct: 647 QNKVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTKE 706 Query: 3322 SLAAELALSVQQVSRWFNNTRWSFRHSSRFASNVAEFASNEGT 3450 SLA EL L+ QQV++WF+NTRWSFRHSS+ +N AS E T Sbjct: 707 SLAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEAT 749
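The per-hit statistics in the report above (bit score, E-value, identities) can be pulled out of the text with a small parser. The following is a minimal sketch, not part of the original report. The names `STATS_RE`, `parse_evalue`, and `parse_hit_stats` are hypothetical, and the pattern assumes the legacy layout shown here (`Score = ... bits (...), Expect = ... Identities = a/b (p%)`). Legacy BLAST prints very small E-values without the leading digit, so `e-122` is read as `1e-122`.

```python
import re

# Pattern for the per-hit statistics as printed in legacy BLAST text output.
# Assumption: the fields appear in the order Score, Expect, Identities.
STATS_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert a BLAST E-value string to float ('e-122' means 1e-122)."""
    text = text.rstrip(",")
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_stats(block):
    """Return a dict of statistics for one hit, or None if no match is found."""
    m = STATS_RE.search(block)
    if m is None:
        return None
    ident, length = int(m["ident"]), int(m["length"])
    return {
        "bits": float(m["bits"]),
        "raw_score": int(m["raw"]),
        "evalue": parse_evalue(m["evalue"]),
        "identities": ident,
        "align_length": length,
        # Unrounded percentage computed from the identities fraction.
        "percent_identity": 100.0 * ident / length,
    }


# Statistics line copied from the Vitis vinifera hit in the report.
example = (
    "Score = 446 bits (1147), Expect = e-122 "
    "Identities = 250/515 (48%), Positives = 313/515 (60%), Gaps = 27/515 (5%)"
)
stats = parse_hit_stats(example)
print(stats)
```

The sketch keeps the raw identities fraction alongside the computed percentage, so the whole-number percentages printed in the report can be checked against the unrounded value.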