BLASTX nr result
ID: Mentha26_contig00004759
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha26_contig00004759 (670 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU22784.1| hypothetical protein MIMGU_mgv1a003476mg [Mimulus... 177 2e-42 gb|EYU28219.1| hypothetical protein MIMGU_mgv1a004573mg [Mimulus... 175 1e-41 emb|CAB95829.1| hypothetical protein [Cicer arietinum] 152 9e-35 ref|XP_004506377.1| PREDICTED: patellin-2-like [Cicer arietinum] 152 9e-35 ref|XP_004299816.1| PREDICTED: patellin-3-like [Fragaria vesca s... 152 1e-34 gb|EPS65885.1| hypothetical protein M569_08890, partial [Genlise... 148 1e-33 ref|XP_007020114.1| SEC14 cytosolic factor family protein / phos... 147 2e-33 emb|CBI39339.3| unnamed protein product [Vitis vinifera] 147 4e-33 ref|XP_002267428.1| PREDICTED: patellin-5 isoform 1 [Vitis vinif... 147 4e-33 ref|XP_006585750.1| PREDICTED: patellin-3-like isoform X2 [Glyci... 146 5e-33 ref|XP_007224786.1| hypothetical protein PRUPE_ppa023884mg, part... 146 5e-33 dbj|BAE71201.1| putative cytosolic factor [Trifolium pratense] 146 5e-33 ref|XP_003531832.1| PREDICTED: patellin-3-like isoform X1 [Glyci... 146 5e-33 ref|XP_006591943.1| PREDICTED: patellin-2-like isoform X3 [Glyci... 145 1e-32 ref|XP_006591942.1| PREDICTED: patellin-2-like isoform X2 [Glyci... 145 1e-32 ref|XP_004250845.1| PREDICTED: patellin-5-like [Solanum lycopers... 145 1e-32 ref|XP_003540055.1| PREDICTED: patellin-2-like isoform X1 [Glyci... 145 1e-32 ref|XP_007131297.1| hypothetical protein PHAVU_011G002100g [Phas... 144 2e-32 ref|XP_006859149.1| hypothetical protein AMTR_s00070p00121170 [A... 143 6e-32 ref|XP_007020115.1| SEC14 cytosolic factor family protein / phos... 143 6e-32 >gb|EYU22784.1| hypothetical protein MIMGU_mgv1a003476mg [Mimulus guttatus] Length = 583 Score = 177 bits (450), Expect = 2e-42 Identities = 85/114 (74%), Positives = 90/114 (78%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 + E EFTTAD A EE IKP CKH VELPITEA T VWE RV+GW+V+YGAEFVPS EGGY Sbjct: 470 DAEQEFTTADPATEEIIKPACKHIVELPITEAGTFVWEARVIGWDVSYGAEFVPSAEGGY 529 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKPSE 344 TWIV KSRKIGP DE VV CSFK+GETGKVVLTFDNQTS YR K K SE Sbjct: 530 TWIVQKSRKIGPVDETVVSCSFKVGETGKVVLTFDNQTSKKKKLLYRSKTKASE 583 >gb|EYU28219.1| hypothetical protein MIMGU_mgv1a004573mg [Mimulus guttatus] Length = 520 Score = 175 bits (443), Expect = 1e-41 Identities = 88/116 (75%), Positives = 94/116 (81%), Gaps = 2/116 (1%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEA-CTLVWEVRVVGWEVAYGAEFVPSVEGG 179 EGE EF TAD+A EETIKP CKH +ELPITE TLVWEVRVVGW+V+YGAEFVPS EGG Sbjct: 405 EGEEEFATADAATEETIKPACKHTIELPITEGGTTLVWEVRVVGWDVSYGAEFVPSAEGG 464 Query: 180 YTWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFK-IKPSE 344 YTWIV KSRKIG ADEQ + CSFK GETGK+VLTFDNQTS YR K IKPSE Sbjct: 465 YTWIVQKSRKIGAADEQSLSCSFKSGETGKLVLTFDNQTSKKKKLVYRTKTIKPSE 520 >emb|CAB95829.1| hypothetical protein [Cicer arietinum] Length = 482 Score = 152 bits (384), Expect = 9e-35 Identities = 77/112 (68%), Positives = 83/112 (74%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EG+ EFTTAD A E TIKP KHAVE PI E TLVWEVRVVGW+V+YGAEFVPS E GY Sbjct: 368 EGDQEFTTADPATEVTIKPATKHAVEFPIPEKSTLVWEVRVVGWDVSYGAEFVPSAEDGY 427 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IV K+RKI PADE V+ +FKIGE GKVVLT DNQTS YR K P Sbjct: 428 TVIVQKNRKIAPADETVINNTFKIGEPGKVVLTIDNQTSKKKKLLYRSKTIP 479 >ref|XP_004506377.1| PREDICTED: patellin-2-like [Cicer arietinum] Length = 612 Score = 152 bits (384), Expect = 9e-35 Identities = 77/112 (68%), Positives = 83/112 (74%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EG+ EFTTAD A E TIKP KHAVE PI E TLVWEVRVVGW+V+YGAEFVPS E GY Sbjct: 498 EGDQEFTTADPATEVTIKPATKHAVEFPIPEKSTLVWEVRVVGWDVSYGAEFVPSAEDGY 557 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IV K+RKI PADE V+ +FKIGE GKVVLT DNQTS YR K P Sbjct: 558 TVIVQKNRKIAPADETVINNTFKIGEPGKVVLTIDNQTSKKKKLLYRSKTIP 609 >ref|XP_004299816.1| PREDICTED: patellin-3-like [Fragaria vesca subsp. vesca] Length = 603 Score = 152 bits (383), Expect = 1e-34 Identities = 71/113 (62%), Positives = 85/113 (75%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFTTAD E T+KP CKH VE+P++E+ LVWEVRVV W+V+YGAEFVPS E GY Sbjct: 489 EGECEFTTADPVTEVTVKPACKHTVEIPVSESGVLVWEVRVVAWDVSYGAEFVPSAEDGY 548 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKPS 341 T I+ K+RK+ P DE V+ S+KIGE GKVVLT DNQ+S YR KIKP+ Sbjct: 549 TIILQKTRKVAPTDEPVISNSYKIGEAGKVVLTIDNQSSKKKKLLYRSKIKPA 601 >gb|EPS65885.1| hypothetical protein M569_08890, partial [Genlisea aurea] Length = 554 Score = 148 bits (374), Expect = 1e-33 Identities = 70/112 (62%), Positives = 89/112 (79%), Gaps = 1/112 (0%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EG+ EF++ADSA+EE IKP KHA+E+P++E T+VWEVRV+G +V YGAEFVP EG Y Sbjct: 442 EGDAEFSSADSAVEEIIKPASKHAIEIPVSEPGTVVWEVRVIGSDVIYGAEFVPDAEGSY 501 Query: 183 TWIVHKSRKIGPADE-QVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIK 335 TWIV K+RK+ P++E Q++ SFKIGETGKVV+TF+NQ S YRFKIK Sbjct: 502 TWIVQKARKMSPSEEQQLIFYSFKIGETGKVVVTFENQNSKKKKLVYRFKIK 553 >ref|XP_007020114.1| SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein isoform 1 [Theobroma cacao] gi|508725442|gb|EOY17339.1| SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein isoform 1 [Theobroma cacao] Length = 625 Score = 147 bits (372), Expect = 2e-33 Identities = 73/112 (65%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFT AD+ E TIKP+ KH VE PITE C LVWE+RVVGW+V YGAEFVPS E GY Sbjct: 511 EGEQEFTVADAVTEVTIKPSTKHTVEFPITEKCNLVWELRVVGWDVNYGAEFVPSAEDGY 570 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IV K+RK+ ADE V+ SFK G+ GKVVLT DNQTS YR K KP Sbjct: 571 TVIVSKTRKVTTADEAVISDSFKTGDPGKVVLTVDNQTSKKKKLLYRSKTKP 622 >emb|CBI39339.3| unnamed protein product [Vitis vinifera] Length = 274 Score = 147 bits (370), Expect = 4e-33 Identities = 67/114 (58%), Positives = 82/114 (71%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 +G+TEF+ D TIKP CKH +E P +E C L+WE+RV+GW+V YGAEFVP+VEGGY Sbjct: 161 DGDTEFSICDPVTLVTIKPGCKHVIEFPYSEPCQLIWELRVIGWDVTYGAEFVPTVEGGY 220 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKPSE 344 T IV K+RKI P DE V+ SFKIGE GKV+LT DNQTS YR K +P + Sbjct: 221 TVIVQKARKIAPTDEPVISNSFKIGEPGKVILTIDNQTSKKKKLLYRSKTQPCD 274 >ref|XP_002267428.1| PREDICTED: patellin-5 isoform 1 [Vitis vinifera] Length = 606 Score = 147 bits (370), Expect = 4e-33 Identities = 67/114 (58%), Positives = 82/114 (71%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 +G+TEF+ D TIKP CKH +E P +E C L+WE+RV+GW+V YGAEFVP+VEGGY Sbjct: 493 DGDTEFSICDPVTLVTIKPGCKHVIEFPYSEPCQLIWELRVIGWDVTYGAEFVPTVEGGY 552 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKPSE 344 T IV K+RKI P DE V+ SFKIGE GKV+LT DNQTS YR K +P + Sbjct: 553 TVIVQKARKIAPTDEPVISNSFKIGEPGKVILTIDNQTSKKKKLLYRSKTQPCD 606 >ref|XP_006585750.1| PREDICTED: patellin-3-like isoform X2 [Glycine max] gi|571472879|ref|XP_006585751.1| PREDICTED: patellin-3-like isoform X3 [Glycine max] Length = 540 Score = 146 bits (369), Expect = 5e-33 Identities = 71/112 (63%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E E EFTT+D E TIKP KHAVE P++E VWE+RVVGW+V+YGAEFVP E GY Sbjct: 426 EAEQEFTTSDPVTEVTIKPATKHAVEFPVSEKSHAVWEIRVVGWDVSYGAEFVPGAEDGY 485 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IV K+RKIGPADE V+ +FKIGE GK+VLT DNQTS YR K KP Sbjct: 486 TVIVQKNRKIGPADETVITNAFKIGEPGKIVLTIDNQTSKKKKLLYRSKTKP 537 >ref|XP_007224786.1| hypothetical protein PRUPE_ppa023884mg, partial [Prunus persica] gi|462421722|gb|EMJ25985.1| hypothetical protein PRUPE_ppa023884mg, partial [Prunus persica] Length = 584 Score = 146 bits (369), Expect = 5e-33 Identities = 70/112 (62%), Positives = 83/112 (74%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFTT+D E T+KP KH VE+P++E LVWEVRVVGW+V+YGAEFVPS E GY Sbjct: 470 EGEQEFTTSDPVTEITVKPATKHTVEIPVSENGLLVWEVRVVGWDVSYGAEFVPSAEDGY 529 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T I+ K+RK+ PADE V+ S+KIGE GKVVLT DNQ+S YR K KP Sbjct: 530 TIILQKTRKVAPADEPVISNSYKIGEAGKVVLTIDNQSSKKKKLLYRSKTKP 581 >dbj|BAE71201.1| putative cytosolic factor [Trifolium pratense] Length = 607 Score = 146 bits (369), Expect = 5e-33 Identities = 72/112 (64%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFTTAD A E TIKP KHAVE PI+E TLVWEVRVV W V YGAEFVPS E GY Sbjct: 493 EGEQEFTTADPATEVTIKPATKHAVEFPISEKSTLVWEVRVVDWSVNYGAEFVPSAEDGY 552 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T I+ K+RK+ PADE ++ +FKIGE GKV+LT DNQ+S YR K P Sbjct: 553 TVIIQKNRKVAPADETIISNTFKIGEPGKVILTIDNQSSKKKKLLYRSKTIP 604 >ref|XP_003531832.1| PREDICTED: patellin-3-like isoform X1 [Glycine max] Length = 576 Score = 146 bits (369), Expect = 5e-33 Identities = 71/112 (63%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E E EFTT+D E TIKP KHAVE P++E VWE+RVVGW+V+YGAEFVP E GY Sbjct: 462 EAEQEFTTSDPVTEVTIKPATKHAVEFPVSEKSHAVWEIRVVGWDVSYGAEFVPGAEDGY 521 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IV K+RKIGPADE V+ +FKIGE GK+VLT DNQTS YR K KP Sbjct: 522 TVIVQKNRKIGPADETVITNAFKIGEPGKIVLTIDNQTSKKKKLLYRSKTKP 573 >ref|XP_006591943.1| PREDICTED: patellin-2-like isoform X3 [Glycine max] Length = 587 Score = 145 bits (366), Expect = 1e-32 Identities = 72/112 (64%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E E EFT+A E TIKP KH+VE P++E LVWE+RVVGW+V+YGAEFVPS E GY Sbjct: 473 EAEQEFTSAYPVTEFTIKPATKHSVEFPVSEKSHLVWEIRVVGWDVSYGAEFVPSAEDGY 532 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IVHKSRKI PADE V+ FKIGE GK+VLT DNQTS YR K KP Sbjct: 533 TVIVHKSRKIAPADETVLTNGFKIGEPGKIVLTIDNQTSKKKKLLYRSKTKP 584 >ref|XP_006591942.1| PREDICTED: patellin-2-like isoform X2 [Glycine max] Length = 596 Score = 145 bits (366), Expect = 1e-32 Identities = 72/112 (64%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E E EFT+A E TIKP KH+VE P++E LVWE+RVVGW+V+YGAEFVPS E GY Sbjct: 482 EAEQEFTSAYPVTEFTIKPATKHSVEFPVSEKSHLVWEIRVVGWDVSYGAEFVPSAEDGY 541 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IVHKSRKI PADE V+ FKIGE GK+VLT DNQTS YR K KP Sbjct: 542 TVIVHKSRKIAPADETVLTNGFKIGEPGKIVLTIDNQTSKKKKLLYRSKTKP 593 >ref|XP_004250845.1| PREDICTED: patellin-5-like [Solanum lycopersicum] Length = 589 Score = 145 bits (366), Expect = 1e-32 Identities = 72/114 (63%), Positives = 81/114 (71%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFT ADSA E+T+KP KH VE P+TE LVWE RVVGW+V YGAEFVPS EGGY Sbjct: 476 EGEQEFTIADSATEDTVKPASKHTVEFPVTEKSNLVWEARVVGWDVCYGAEFVPSAEGGY 535 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKPSE 344 T IV KSRKI A+E V+ ++ E GKVVLTFDNQTS YR K K S+ Sbjct: 536 TIIVEKSRKIAAANETVITNNYTAPEAGKVVLTFDNQTSKRKKLVYRSKTKSSD 589 >ref|XP_003540055.1| PREDICTED: patellin-2-like isoform X1 [Glycine max] Length = 606 Score = 145 bits (366), Expect = 1e-32 Identities = 72/112 (64%), Positives = 81/112 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E E EFT+A E TIKP KH+VE P++E LVWE+RVVGW+V+YGAEFVPS E GY Sbjct: 492 EAEQEFTSAYPVTEFTIKPATKHSVEFPVSEKSHLVWEIRVVGWDVSYGAEFVPSAEDGY 551 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 T IVHKSRKI PADE V+ FKIGE GK+VLT DNQTS YR K KP Sbjct: 552 TVIVHKSRKIAPADETVLTNGFKIGEPGKIVLTIDNQTSKKKKLLYRSKTKP 603 >ref|XP_007131297.1| hypothetical protein PHAVU_011G002100g [Phaseolus vulgaris] gi|561004297|gb|ESW03291.1| hypothetical protein PHAVU_011G002100g [Phaseolus vulgaris] Length = 612 Score = 144 bits (364), Expect = 2e-32 Identities = 72/111 (64%), Positives = 80/111 (72%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 EGE EFTTAD+ E TIKP KHAVE P++E LVWE+RVVGW+V+Y AEFVPS E GY Sbjct: 498 EGEQEFTTADAVTEVTIKPATKHAVEFPVSEKSHLVWEIRVVGWDVSYVAEFVPSTEDGY 557 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIK 335 T IV K+RKI PADE V+ FK GE GKVVLT DNQTS YR K K Sbjct: 558 TVIVQKNRKIAPADETVISNGFKSGEAGKVVLTIDNQTSKKKKLLYRSKTK 608 >ref|XP_006859149.1| hypothetical protein AMTR_s00070p00121170 [Amborella trichopoda] gi|548863262|gb|ERN20616.1| hypothetical protein AMTR_s00070p00121170 [Amborella trichopoda] Length = 626 Score = 143 bits (360), Expect = 6e-32 Identities = 68/111 (61%), Positives = 84/111 (75%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEACTLVWEVRVVGWEVAYGAEFVPSVEGGY 182 E +TEF+TAD+A E TIKP K+ VE+P+TEAC LVWE+R++GW+V+YGAEFVPS E GY Sbjct: 509 ENDTEFSTADAATEFTIKPASKNTVEIPVTEACILVWELRILGWDVSYGAEFVPSAEDGY 568 Query: 183 TWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIK 335 T IV K+RKI DE V+ SFKIGE GK+VLT DN +S YR+K K Sbjct: 569 TVIVQKARKIAITDEPVIRNSFKIGEPGKIVLTVDNTSSKKKKLIYRYKTK 619 >ref|XP_007020115.1| SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein isoform 2 [Theobroma cacao] gi|508725443|gb|EOY17340.1| SEC14 cytosolic factor family protein / phosphoglyceride transfer family protein isoform 2 [Theobroma cacao] Length = 626 Score = 143 bits (360), Expect = 6e-32 Identities = 73/113 (64%), Positives = 81/113 (71%), Gaps = 1/113 (0%) Frame = +3 Query: 3 EGETEFTTADSAIEETIKPTCKHAVELPITEA-CTLVWEVRVVGWEVAYGAEFVPSVEGG 179 EGE EFT AD+ E TIKP+ KH VE PITE C LVWE+RVVGW+V YGAEFVPS E G Sbjct: 511 EGEQEFTVADAVTEVTIKPSTKHTVEFPITEQKCNLVWELRVVGWDVNYGAEFVPSAEDG 570 Query: 180 YTWIVHKSRKIGPADEQVVGCSFKIGETGKVVLTFDNQTSXXXXXXYRFKIKP 338 YT IV K+RK+ ADE V+ SFK G+ GKVVLT DNQTS YR K KP Sbjct: 571 YTVIVSKTRKVTTADEAVISDSFKTGDPGKVVLTVDNQTSKKKKLLYRSKTKP 623