BLASTX nr result
ID: Scutellaria22_contig00028554
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00028554
         (550 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002516533.1| serine-threonine protein kinase, plant-type,...   166   2e-39
ref|NP_197965.1| Protein kinase family protein with leucine-rich...   163   1e-38
dbj|BAH19649.1| AT5G25930 [Arabidopsis thaliana]                      163   1e-38
ref|XP_002872190.1| leucine-rich repeat family protein [Arabidop...   161   5e-38
ref|XP_002324752.1| predicted protein [Populus trichocarpa] gi|2...   161   7e-38

>ref|XP_002516533.1| serine-threonine protein kinase, plant-type, putative [Ricinus communis]
 gi|223544353|gb|EEF45874.1| serine-threonine protein kinase, plant-type, putative [Ricinus communis]
          Length = 1026

Score = 166 bits (421), Expect = 2e-39
Identities = 82/148 (55%), Positives = 106/148 (71%), Gaps = 1/148 (0%)
Frame = -3

Query: 443 SSMPFLVISQFS-TAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIF 267
           +S PF VISQ + T E++ LLN+K + G P LQSW +++SPC WPEI CSD GSVT +
Sbjct: 21  TSTPFNVISQITNTQEQSILLNIKQQLGNPPSLQSWTTSTSPCTWPEISCSDDGSVTALG 80

Query: 266 LKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPA 87
           L+ NI IP + L NLTVLDLA+N G FP + +C+ L+ LDLSQN FVG +P
Sbjct: 81  LRDKNITVAIPARICDLKNLTVLDLAYNYIPGGFPTFLYNCSSLERLDLSQNYFVGTVPD 140

Query: 86  DIDRLKALQYLDLAANNFTGDVPPSIGN 3
           DIDRL L+ +DL+ANNF+GD+PP+IGN
Sbjct: 141 DIDRLSNLKSIDLSANNFSGDIPPAIGN 168

Score = 62.8 bits (151), Expect = 3e-08
Identities = 48/147 (32%), Positives = 70/147 (47%), Gaps = 2/147 (1%)
Frame = -3

Query: 440 SMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFLK 261
           +M +L++S S + + L L W +L S N S P I V + +
Sbjct: 435 NMTYLMLSNNSFSGK---LPSSLAWNLSRLELSNNKFSGP-----IPTGISSWVNLVVFE 486

Query: 260 GYN--IEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPA 87
           N + G IP +++LS+L L L N G P+ I+S L L+LS+N G IPA
Sbjct: 487 ASNNLLSGEIPVEVTSLSHLNTLLLDGNQLLGQLPSKIISWKTLNTLNLSRNALSGQIPA 546

Query: 86  DIDRLKALQYLDLAANNFTGDVPPSIG 6
           I L L YLDL+ N+ +G +P G
Sbjct: 547 AIGSLPDLLYLDLSQNHLSGQIPSEFG 573

Score = 62.8 bits (151), Expect = 3e-08
Identities = 33/90 (36%), Positives = 51/90 (56%)
Frame = -3

Query: 272 IFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNI 93
           + L G + G++P + + L L+L+ N +G PAAI S L YLDLSQN G I
Sbjct: 509 LLLDGNQLLGQLPSKIISWKTLNTLNLSRNALSGQIPAAIGSLPDLLYLDLSQNHLSGQI 568

Query: 92  PADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           P++ +L + L+L++N F+G +P N
Sbjct: 569 PSEFGQLNLIS-LNLSSNQFSGQIPDKFDN 597

Score = 56.2 bits (134), Expect = 3e-06
Identities = 29/92 (31%), Positives = 49/92 (53%)
Frame = -3

Query: 281 VTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFV 102
           +T ++++ N+ G IP+S++ LS+L LDL+ N G P + L YL L N
Sbjct: 221 LTFLWIRDANLIGSIPESLANLSSLETLDLSINKLEGSIPDGLFLLKNLTYLYLFHNQLS 280

Query: 101 GNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           G++P ++ L ++ +DL NN G + G
Sbjct: 281 GDMPKKVEALNLVE-VDLGINNLIGSISEDFG 311

>ref|NP_197965.1| Protein kinase family protein with leucine-rich repeat domain [Arabidopsis thaliana]
 gi|5107831|gb|AAD40144.1|AF149413_25 contains similarity to protein kinase domains (Pfam F00069, Score=162.6, E=6.8e-45, N=1) and leucien rich repeats (Pfam PF00560, Score=210.7, E=2.2e-59, N=10) [Arabidopsis thaliana]
 gi|28393326|gb|AAO42089.1| putative receptor protein kinase [Arabidopsis thaliana]
 gi|224589685|gb|ACN59374.1| leucine-rich repeat receptor-like protein kinase [Arabidopsis thaliana]
 gi|332006119|gb|AED93502.1| Protein kinase family protein with leucine-rich repeat domain [Arabidopsis thaliana]
          Length = 1005

Score = 163 bits (413), Expect = 1e-38
Identities = 81/147 (55%), Positives = 107/147 (72%), Gaps = 1/147 (0%)
Frame = -3

Query: 443 SSMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFL 264
           +S+P V SQF+ +++TLLNLK + G+P L+ WN+TSSPC+W EI C+ G+VTGI
Sbjct: 14  TSIPLSVFSQFN--DQSTLLNLKRDLGDPPSLRLWNNTSSPCNWSEITCT-AGNVTGINF 70

Query: 263 KGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPAD 84
           K N G +P ++ LSNL LDL+FN F G+FP + +CTKLQYLDLSQN+ G++P D
Sbjct: 71  KNQNFTGTVPTTICDLSNLNFLDLSFNYFAGEFPTVLYNCTKLQYLDLSQNLLNGSLPVD 130

Query: 83  IDRLK-ALQYLDLAANNFTGDVPPSIG 6
           IDRL L YLDLAAN F+GD+P S+G
Sbjct: 131 IDRLSPELDYLDLAANGFSGDIPKSLG 157

Score = 65.5 bits (158), Expect = 5e-09
Identities = 37/93 (39%), Positives = 52/93 (55%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++T +L + G IP S+SA +NL LDL+ N TG P +I + TKLQ L+L N
Sbjct: 260 NLTEFYLFANGLTGEIPKSISA-TNLVFLDLSANNLTGSIPVSIGNLTKLQVLNLFNNKL 318

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           G IP I +L L+ + N TG++P IG
Sbjct: 319 TGEIPPVIGKLPGLKEFKIFNNKLTGEIPAEIG 351

Score = 60.5 bits (145), Expect = 2e-07
Identities = 36/90 (40%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Frame = -3

Query: 269 FLKGYN-IEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNI 93
           F G N G P +++LSNL + L N TG+ P I+S L L LS+N G I
Sbjct: 477 FKAGNNQFSGEFPKELTSLSNLISIFLDENDLTGELPDEIISWKSLITLSLSKNKLSGEI 536

Query: 92  PADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           P + L L LDL+ N F+G +PP IG+
Sbjct: 537 PRALGLLPRLLNLDLSENQFSGGIPPEIGS 566

Score = 60.1 bits (144), Expect = 2e-07
Identities = 31/102 (30%), Positives = 54/102 (52%)
Frame = -3

Query: 311 PEIRCSDGGSVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQ 132
           PE C GG + G+ + N+ G IP+S+ L + L N F+G FP+ I + + +
Sbjct: 371 PENLCK-GGKLQGVVVYSNNLTGEIPESLGDCGTLLTVQLQNNDFSGKFPSRIWNASSMY 429

Query: 131 YLDLSQNVFVGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           L +S N F G +P ++ + +++ N F+G++P IG
Sbjct: 430 SLQVSNNSFTGELPENV--AWNMSRIEIDNNRFSGEIPKKIG 469

Score = 59.3 bits (142), Expect = 4e-07
Identities = 32/94 (34%), Positives = 53/94 (56%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++ IFL ++ G +PD + + +L L L+ N +G+ P A+ +L LDLS+N F
Sbjct: 497 NLISIFLDENDLTGELPDEIISWKSLITLSLSKNKLSGEIPRALGLLPRLLNLDLSENQF 556

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           G IP +I LK L ++++N TG +P + N
Sbjct: 557 SGGIPPEIGSLK-LTTFNVSSNRLTGGIPEQLDN 589

>dbj|BAH19649.1| AT5G25930 [Arabidopsis thaliana]
          Length = 835

Score = 163 bits (413), Expect = 1e-38
Identities = 81/147 (55%), Positives = 107/147 (72%), Gaps = 1/147 (0%)
Frame = -3

Query: 443 SSMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFL 264
           +S+P V SQF+ +++TLLNLK + G+P L+ WN+TSSPC+W EI C+ G+VTGI
Sbjct: 14  TSIPLSVFSQFN--DQSTLLNLKRDLGDPPSLRLWNNTSSPCNWSEITCT-AGNVTGINF 70

Query: 263 KGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPAD 84
           K N G +P ++ LSNL LDL+FN F G+FP + +CTKLQYLDLSQN+ G++P D
Sbjct: 71  KNQNFTGTVPTTICDLSNLNFLDLSFNYFAGEFPTVLYNCTKLQYLDLSQNLLNGSLPVD 130

Query: 83  IDRLK-ALQYLDLAANNFTGDVPPSIG 6
           IDRL L YLDLAAN F+GD+P S+G
Sbjct: 131 IDRLSPELDYLDLAANGFSGDIPKSLG 157

Score = 65.5 bits (158), Expect = 5e-09
Identities = 37/93 (39%), Positives = 52/93 (55%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++T +L + G IP S+SA +NL LDL+ N TG P +I + TKLQ L+L N
Sbjct: 260 NLTEFYLFANGLTGEIPKSISA-TNLVFLDLSANNLTGSIPVSIGNLTKLQVLNLFNNKL 318

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           G IP I +L L+ + N TG++P IG
Sbjct: 319 TGEIPPVIGKLPGLKEFKIFNNKLTGEIPAEIG 351

Score = 60.5 bits (145), Expect = 2e-07
Identities = 36/90 (40%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Frame = -3

Query: 269 FLKGYN-IEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNI 93
           F G N G P +++LSNL + L N TG+ P I+S L L LS+N G I
Sbjct: 477 FKAGNNQFSGEFPKELTSLSNLISIFLDENDLTGELPDEIISWKSLITLSLSKNKLSGEI 536

Query: 92  PADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           P + L L LDL+ N F+G +PP IG+
Sbjct: 537 PRALGLLPRLLNLDLSENQFSGGIPPEIGS 566

Score = 60.1 bits (144), Expect = 2e-07
Identities = 31/102 (30%), Positives = 54/102 (52%)
Frame = -3

Query: 311 PEIRCSDGGSVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQ 132
           PE C GG + G+ + N+ G IP+S+ L + L N F+G FP+ I + + +
Sbjct: 371 PENLCK-GGKLQGVVVYSNNLTGEIPESLGDCGTLLTVQLQNNDFSGKFPSRIWNASSMY 429

Query: 131 YLDLSQNVFVGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           L +S N F G +P ++ + +++ N F+G++P IG
Sbjct: 430 SLQVSNNSFTGELPENV--AWNMSRIEIDNNRFSGEIPKKIG 469

Score = 59.3 bits (142), Expect = 4e-07
Identities = 32/94 (34%), Positives = 53/94 (56%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++ IFL ++ G +PD + + +L L L+ N +G+ P A+ +L LDLS+N F
Sbjct: 497 NLISIFLDENDLTGELPDEIISWKSLITLSLSKNKLSGEIPRALGLLPRLLNLDLSENQF 556

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           G IP +I LK L ++++N TG +P + N
Sbjct: 557 SGGIPPEIGSLK-LTTFNVSSNRLTGGIPEQLDN 589

>ref|XP_002872190.1| leucine-rich repeat family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297318027|gb|EFH48449.1| leucine-rich repeat family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1005

Score = 161 bits (408), Expect = 5e-38
Identities = 81/147 (55%), Positives = 106/147 (72%), Gaps = 1/147 (0%)
Frame = -3

Query: 443 SSMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFL 264
           +S+P V SQ + +++TLLN+K + G+P LQ WN+TSSPC+W EI C+ G+VTGI
Sbjct: 14  TSIPLSVFSQSN--DQSTLLNVKRDLGDPPSLQLWNNTSSPCNWSEITCT-AGNVTGINF 70

Query: 263 KGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPAD 84
           K N G +P ++ LSNL LDL+FN F G+FP + +CTKLQYLDLSQN+F G++P D
Sbjct: 71  KNQNFTGTVPTTICDLSNLNFLDLSFNYFAGEFPTVLYNCTKLQYLDLSQNLFNGSLPVD 130

Query: 83  IDRLK-ALQYLDLAANNFTGDVPPSIG 6
           IDRL L YLDLAAN F GD+P +IG
Sbjct: 131 IDRLSPELDYLDLAANAFAGDIPKNIG 157

Score = 71.6 bits (174), Expect = 7e-11
Identities = 39/83 (46%), Positives = 50/83 (60%)
Frame = -3

Query: 254 NIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPADIDR 75
           N+ GRIPD + L NLT L L N TG+ P +I S T + +LDLS N G+IP I
Sbjct: 246 NLTGRIPDVLFGLKNLTELYLYANDLTGEIPKSI-SATNMVFLDLSANNLTGSIPVSIGN 304

Query: 74  LKALQYLDLAANNFTGDVPPSIG 6
           L L+ L+L N TG++PP IG
Sbjct: 305 LTKLEVLNLFNNELTGEIPPVIG 327

Score = 63.9 bits (154), Expect = 2e-08
Identities = 34/93 (36%), Positives = 53/93 (56%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++T ++L ++ G IP S+SA +N+ LDL+ N TG P +I + TKL+ L+L N
Sbjct: 260 NLTELYLYANDLTGEIPKSISA-TNMVFLDLSANNLTGSIPVSIGNLTKLEVLNLFNNEL 318

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           G IP I +L L+ + N TG++P G
Sbjct: 319 TGEIPPVIGKLPELKEFKIFTNKLTGEIPAEFG 351

Score = 62.8 bits (151), Expect = 3e-08
Identities = 37/90 (41%), Positives = 50/90 (55%), Gaps = 1/90 (1%)
Frame = -3

Query: 269 FLKGYN-IEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNI 93
           F G N G IP +++LSNL + L N TG+ P I+S L L LS+N G I
Sbjct: 477 FKAGNNRFSGEIPKELTSLSNLLSIFLDENDLTGELPDDIISWKSLITLSLSKNKLSGKI 536

Query: 92  PADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           P + L L LDL+ N F+G++PP IG+
Sbjct: 537 PRALGLLPRLLNLDLSENQFSGEIPPEIGS 566

Score = 59.7 bits (143), Expect = 3e-07
Identities = 33/94 (35%), Positives = 53/94 (56%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           ++ IFL ++ G +PD + + +L L L+ N +G P A+ +L LDLS+N F
Sbjct: 497 NLLSIFLDENDLTGELPDDIISWKSLITLSLSKNKLSGKIPRALGLLPRLLNLDLSENQF 556

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           G IP +I LK L L++++N TG +P + N
Sbjct: 557 SGEIPPEIGSLK-LTTLNVSSNRLTGGIPEQLDN 589

>ref|XP_002324752.1| predicted protein [Populus trichocarpa]
 gi|222866186|gb|EEF03317.1| predicted protein [Populus trichocarpa]
          Length = 1019

Score = 161 bits (407), Expect = 7e-38
Identities = 81/146 (55%), Positives = 102/146 (69%)
Frame = -3

Query: 440 SMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFLK 261
           S+PF VISQ AE+ LLNLK + G P +QSWNS+SSPC+WP++ C +G +VTG+ L
Sbjct: 16  SLPFKVISQDVNAEKTILLNLKQQLGNPSSIQSWNSSSSPCEWPDVYCVEG-AVTGLDLG 74

Query: 260 GYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPADI 81
           NI IP S+ L NLT L+L +N G FP + +C KL+ LDLSQN FVG IP DI
Sbjct: 75  NKNITQTIPASVCDLKNLTYLNLNWNYIPGGFPKLLYNCKKLEELDLSQNYFVGPIPDDI 134

Query: 80  DRLKALQYLDLAANNFTGDVPPSIGN 3
           DRL +L+YL L NNFTG++PP IGN
Sbjct: 135 DRLSSLRYLYLQGNNFTGNIPPQIGN 160

Score = 66.6 bits (161), Expect = 2e-09
Identities = 35/93 (37%), Positives = 56/93 (60%)
Frame = -3

Query: 281 VTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFV 102
           ++ + L G G++P ++ + +LT L+L+ N +G P I S L+YLDLSQN F
Sbjct: 498 LSNLLLDGNQFSGQLPSTIPSWKSLTSLNLSRNGLSGQIPREIGSLPDLRYLDLSQNHFS 557

Query: 101 GNIPADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           G IP + +LK L +L+L++NN +G +P N
Sbjct: 558 GEIPPEFGQLK-LIFLNLSSNNLSGKIPDQFDN 589

Score = 65.5 bits (158), Expect = 5e-09
Identities = 49/147 (33%), Positives = 75/147 (51%), Gaps = 2/147 (1%)
Frame = -3

Query: 440 SMPFLVISQFSTAERATLLNLKLEWGEPKLLQSWNSTSSPCDWPEIRCSDGGSVTGIFLK 261
           +M +L++S+ S + L KL W +L + N S P I V + +
Sbjct: 427 NMTYLMLSENSFSGG---LPSKLAWNLSRLELNNNRFSGP-----IPPGVSSWVNLVVFE 478

Query: 260 GYN--IEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVFVGNIPA 87
           N G IP +++L +L+ L L N F+G P+ I S L L+LS+N G IP
Sbjct: 479 ASNNLFSGEIPVEITSLPHLSNLLLDGNQFSGQLPSTIPSWKSLTSLNLSRNGLSGQIPR 538

Query: 86  DIDRLKALQYLDLAANNFTGDVPPSIG 6
           +I L L+YLDL+ N+F+G++PP G
Sbjct: 539 EIGSLPDLRYLDLSQNHFSGEIPPEFG 565

Score = 62.4 bits (150), Expect = 4e-08
Identities = 37/93 (39%), Positives = 53/93 (56%)
Frame = -3

Query: 284 SVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQYLDLSQNVF 105
           S+ + L G ++EG+IP + L NLT L L N +G+ P I+ L +DL+ N
Sbjct: 236 SLVHLDLAGNDLEGKIPGGLFLLKNLTNLYLFKNKLSGEIP-QIVETLNLVEIDLAMNHL 294

Query: 104 VGNIPADIDRLKALQYLDLAANNFTGDVPPSIG 6
           G+I D +LK LQ L L N+ +G+VP SIG
Sbjct: 295 NGSITQDFGKLKKLQLLSLFENHLSGEVPASIG 327

Score = 61.2 bits (147), Expect = 1e-07
Identities = 34/103 (33%), Positives = 56/103 (54%)
Frame = -3

Query: 311 PEIRCSDGGSVTGIFLKGYNIEGRIPDSMSALSNLTVLDLAFNLFTGDFPAAILSCTKLQ 132
           PE C+ GG + G N+ G++P S+ ++L + L N F+G+ PA I + +
Sbjct: 371 PENLCA-GGVLQGAVAFENNLSGQVPQSLGNCNSLRTVQLYSNNFSGEIPAGIWTAFNMT 429

Query: 131 YLDLSQNVFVGNIPADIDRLKALQYLDLAANNFTGDVPPSIGN 3
           YL LS+N F G +P+ + L L+L N F+G +PP + +
Sbjct: 430 YLMLSENSFSGGLPSKL--AWNLSRLELNNNRFSGPIPPGVSS 470