BLASTX nr result
ID: Mentha24_contig00024855
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha24_contig00024855 (335 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU36097.1| hypothetical protein MIMGU_mgv1a020159mg [Mimulus... 151 1e-34 ref|XP_004237182.1| PREDICTED: pentatricopeptide repeat-containi... 130 2e-28 gb|EPS59551.1| hypothetical protein M569_15254, partial [Genlise... 118 7e-25 ref|XP_002314741.2| hypothetical protein POPTR_0010s10900g [Popu... 116 3e-24 ref|XP_006470889.1| PREDICTED: pentatricopeptide repeat-containi... 114 1e-23 ref|XP_006420692.1| hypothetical protein CICLE_v10004581mg [Citr... 114 1e-23 ref|XP_003555735.1| PREDICTED: pentatricopeptide repeat-containi... 112 5e-23 ref|XP_002312498.2| hypothetical protein POPTR_0008s14240g [Popu... 111 1e-22 ref|XP_002521865.1| pentatricopeptide repeat-containing protein,... 110 2e-22 gb|AFK45134.1| unknown [Lotus japonicus] 108 8e-22 ref|XP_007015128.1| Pentatricopeptide repeat superfamily protein... 107 2e-21 ref|XP_002269531.1| PREDICTED: pentatricopeptide repeat-containi... 107 2e-21 ref|XP_002273494.2| PREDICTED: pentatricopeptide repeat-containi... 105 5e-21 ref|XP_007153172.1| hypothetical protein PHAVU_003G013000g [Phas... 105 7e-21 ref|XP_003529243.2| PREDICTED: pentatricopeptide repeat-containi... 104 1e-20 ref|XP_004298632.1| PREDICTED: pentatricopeptide repeat-containi... 103 2e-20 ref|XP_004151188.1| PREDICTED: pentatricopeptide repeat-containi... 100 4e-19 ref|XP_004172063.1| PREDICTED: pentatricopeptide repeat-containi... 98 1e-18 ref|XP_006853735.1| hypothetical protein AMTR_s00056p00173630 [A... 97 3e-18 ref|XP_004498165.1| PREDICTED: pentatricopeptide repeat-containi... 94 2e-17 >gb|EYU36097.1| hypothetical protein MIMGU_mgv1a020159mg [Mimulus guttatus] Length = 573 Score = 151 bits (381), Expect = 1e-34 Identities = 69/111 (62%), Positives = 91/111 (81%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN+LIC+LFKD QTE+LA++YY+KAKS P+F+P++YT+KL IRYL R+ +W L SFCQD Sbjct: 43 LNALICTLFKDPQTENLAHDYYQKAKSDPDFRPERYTLKLLIRYLTRSNNWTSLFSFCQD 102 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 L FRVLPD+ T CRL+ C+KARK K++++LL F+ ND TAV+AFDSA Sbjct: 103 LAQFRVLPDRPTCCRLITTCMKARKLKLVDNLLDCFLSNDVITAVMAFDSA 153 >ref|XP_004237182.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Solanum lycopersicum] Length = 625 Score = 130 bits (327), Expect = 2e-28 Identities = 60/111 (54%), Positives = 81/111 (72%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +ICSL KD+QT+ + Y+YY KAK + +F+P+K T+KL IRYL + WG + S +D Sbjct: 103 LNGVICSLLKDTQTQEIGYDYYEKAKGEKDFRPEKSTLKLLIRYLVNSSKWGSVFSLSKD 162 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 L+ +VLPD ST CRL+ C+K+RKFKI+N L LF+ D E +VLAFDSA Sbjct: 163 LRTLKVLPDSSTCCRLISSCMKSRKFKIVNSFLELFIVVDQEISVLAFDSA 213 >gb|EPS59551.1| hypothetical protein M569_15254, partial [Genlisea aurea] Length = 412 Score = 118 bits (296), Expect = 7e-25 Identities = 55/111 (49%), Positives = 72/111 (64%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN LIC LF+DS TE+L YE YR+ K NF PQ T+KL +R+L R K W LL+SFC D Sbjct: 8 LNGLICGLFEDSATENLGYECYRRCKGNLNFTPQNRTLKLLVRHLIRNKDWSLLLSFCDD 67 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 L+ F +LP KS +L+ CV+ RK + L +F++ D E A+ FDSA Sbjct: 68 LRSFGILPGKSLCVKLISSCVRGRKLNLAYSFLDVFLEIDEEIAIAGFDSA 118 >ref|XP_002314741.2| hypothetical protein POPTR_0010s10900g [Populus trichocarpa] gi|550329541|gb|EEF00912.2| hypothetical protein POPTR_0010s10900g [Populus trichocarpa] Length = 582 Score = 116 bits (291), Expect = 3e-24 Identities = 55/111 (49%), Positives = 77/111 (69%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C + +D ++E LAYEYY+KAK KP F+PQ+ +KL IRYL ++ WGL++S D Sbjct: 60 LNDFLCGVLQDPKSEELAYEYYKKAKEKPEFRPQRPVLKLLIRYLIQSDKWGLVLSLADD 119 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + V PD T LV C++AR+FKI+ +LL F +D++ AVLAFDSA Sbjct: 120 FKKYNVFPDSFTFSTLVSSCIRARRFKIVENLLENF-KSDSKIAVLAFDSA 169 >ref|XP_006470889.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Citrus sinensis] Length = 603 Score = 114 bits (285), Expect = 1e-23 Identities = 56/110 (50%), Positives = 76/110 (69%) Frame = -2 Query: 331 NSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQDL 152 +S + + KD QT+ LAY+YY +AK P F+P+K T+KL IRYL ++K W ++S +D Sbjct: 84 DSFLHGMLKDPQTQELAYDYYNEAKKLPEFRPEKSTLKLLIRYLVQSKKWDSIVSLSEDF 143 Query: 151 KDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD T RLV CV+ARKFKI N LL +F+ D E A+LAF+SA Sbjct: 144 KIYNVLPDAHTCSRLVASCVRARKFKIANTLLQVFI-TDGEIALLAFNSA 192 >ref|XP_006420692.1| hypothetical protein CICLE_v10004581mg [Citrus clementina] gi|557522565|gb|ESR33932.1| hypothetical protein CICLE_v10004581mg [Citrus clementina] Length = 603 Score = 114 bits (285), Expect = 1e-23 Identities = 56/110 (50%), Positives = 76/110 (69%) Frame = -2 Query: 331 NSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQDL 152 +S + + KD QT+ LAY+YY +AK P F+P+K T+KL IRYL ++K W ++S +D Sbjct: 84 DSFLHGMLKDPQTQELAYDYYNEAKKLPEFRPEKSTLKLLIRYLVQSKKWDSIVSLSEDF 143 Query: 151 KDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD T RLV CV+ARKFKI N LL +F+ D E A+LAF+SA Sbjct: 144 KIYNVLPDAHTCSRLVASCVRARKFKIANTLLQVFI-TDGEIALLAFNSA 192 >ref|XP_003555735.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Glycine max] Length = 609 Score = 112 bits (280), Expect = 5e-23 Identities = 52/111 (46%), Positives = 77/111 (69%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D +T+ LA++YY++ K +P F+P+K T+K IRYL KSWG ++S +D Sbjct: 90 LNEFLCGLFEDPKTKELAFDYYQRLKERPEFRPEKPTLKHVIRYLVSLKSWGSILSVSED 149 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD++T RLV C++ RKF++ LL +F D D++ A LAF SA Sbjct: 150 FKVYHVLPDRATCSRLVKFCIEHRKFRVAESLLYVFKD-DSKVAFLAFSSA 199 >ref|XP_002312498.2| hypothetical protein POPTR_0008s14240g [Populus trichocarpa] gi|550333052|gb|EEE89865.2| hypothetical protein POPTR_0008s14240g [Populus trichocarpa] Length = 603 Score = 111 bits (277), Expect = 1e-22 Identities = 53/111 (47%), Positives = 77/111 (69%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C + +D ++E LAYEYY+KAK K F+P++ +KL IRYL +++ WGL++ D Sbjct: 90 LNDFLCGVLRDPKSEELAYEYYKKAKEKQEFRPKRPMLKLLIRYLIQSEKWGLVLPVADD 149 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD T LV C++ARKFKI+ LL + + +D++ AVLAFDSA Sbjct: 150 FKKYSVLPDSYTFSTLVSSCIRARKFKIVEGLLEISI-SDSKIAVLAFDSA 199 >ref|XP_002521865.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538903|gb|EEF40501.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 449 Score = 110 bits (276), Expect = 2e-22 Identities = 53/111 (47%), Positives = 72/111 (64%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C + +D +TEHLAYEYY+KA + F+P K +KL IRYL ++K+W L++ D Sbjct: 89 LNKFLCGILRDPRTEHLAYEYYKKATERQEFRPDKPMLKLLIRYLMQSKNWDLILPVADD 148 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 + VLPD +T RLV C+K RKF+I+ LL F +E VLAFDSA Sbjct: 149 FNKYNVLPDSNTCSRLVYSCIKTRKFRIVESLLECFKCY-SEIPVLAFDSA 198 >gb|AFK45134.1| unknown [Lotus japonicus] Length = 208 Score = 108 bits (270), Expect = 8e-22 Identities = 52/111 (46%), Positives = 74/111 (66%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D +TE LA++YY++ K +P F+P+K T+K IRYL R K W ++S +D Sbjct: 90 LNEFLCGLFQDPKTEELAFDYYQRLKDRPVFRPEKSTLKHVIRYLMRFKKWDFILSVSED 149 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD +T +L+ C++ RKFKI LL F +D+E AV AF SA Sbjct: 150 FKIYHVLPDGATCSKLIEFCIRQRKFKIAETLLNAFR-SDSEVAVFAFGSA 199 >ref|XP_007015128.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] gi|508785491|gb|EOY32747.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 608 Score = 107 bits (267), Expect = 2e-21 Identities = 52/111 (46%), Positives = 76/111 (68%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LNS + L +D+Q E LAY+YY KAK +P F P+K ++L IRYL ++K W L++S +D Sbjct: 88 LNSFLRGLLQDTQNERLAYDYYEKAKRRPGFIPEKPMLQLLIRYLVQSKKWDLVMSLSED 147 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD T RL+ C++ARKFK++ LL +F +D A++AF+SA Sbjct: 148 FKHYHVLPDSYTCSRLINACIRARKFKVVGTLLQVF-KSDKVVALIAFNSA 197 >ref|XP_002269531.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Vitis vinifera] Length = 603 Score = 107 bits (267), Expect = 2e-21 Identities = 50/111 (45%), Positives = 72/111 (64%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D ++E LA++YY+K K +P F+P K T++ IRYL R+K WGL + +D Sbjct: 83 LNDFLCGLFRDPRSEELAFDYYQKVKERPEFRPDKETLERIIRYLIRSKKWGLSLLVFED 142 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K F D CRL+ C++ARKF+I LL +F ND + A+L F+SA Sbjct: 143 FKSFDAQLDGDICCRLISSCIRARKFRITESLLEVFSYND-DVALLVFNSA 192 >ref|XP_002273494.2| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Vitis vinifera] Length = 609 Score = 105 bits (263), Expect = 5e-21 Identities = 50/111 (45%), Positives = 72/111 (64%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D ++E LA++YY+K K +P F+P K T++ IRYL R+K WGL + +D Sbjct: 84 LNDFLCGLFRDPRSEELAFDYYQKVKERPEFRPDKETLERIIRYLIRSKKWGLSLLVFED 143 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K F V D CRL+ C++ARKF+I LL +F ND + A+L +SA Sbjct: 144 FKSFDVQLDGDICCRLISSCIRARKFRITESLLEVFSYND-DVALLVCNSA 193 >ref|XP_007153172.1| hypothetical protein PHAVU_003G013000g [Phaseolus vulgaris] gi|561026526|gb|ESW25166.1| hypothetical protein PHAVU_003G013000g [Phaseolus vulgaris] Length = 609 Score = 105 bits (262), Expect = 7e-21 Identities = 48/111 (43%), Positives = 76/111 (68%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+DS+T LA++YY++ K +P F+P+K T++ IRYL K W L++S +D Sbjct: 89 LNEFLCGLFEDSKTRELAFDYYQRLKERPEFRPEKSTLRHVIRYLMSLKQWDLILSVSED 148 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD++T R++ C+ RKF++ + LL +FM +D++ A LA SA Sbjct: 149 FKVYHVLPDRATCSRVIKFCIDRRKFRVADVLLDVFM-SDSKVAFLACSSA 198 >ref|XP_003529243.2| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Glycine max] Length = 631 Score = 104 bits (260), Expect = 1e-20 Identities = 49/111 (44%), Positives = 75/111 (67%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D +T+ LA++YY++ K +P F+P+K T+K IRYL KSW ++S D Sbjct: 112 LNEFLCGLFEDPKTKELAFDYYQRLKERPEFRPEKPTLKHVIRYLVSLKSWDSILSVSDD 171 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + VLPD++T RLV C++ RKF++ LL +F +D++ A +AF SA Sbjct: 172 FKVYHVLPDRATCSRLVKFCIEHRKFRVAEALLDVF-KSDSKVAFMAFSSA 221 >ref|XP_004298632.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Fragaria vesca subsp. vesca] Length = 604 Score = 103 bits (257), Expect = 2e-20 Identities = 49/111 (44%), Positives = 70/111 (63%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D + E + YE Y KAK F+P K T++ RYL R+K W ++S C D Sbjct: 86 LNEFLCELFQDPEKEAMGYEQYEKAKKVAEFRPNKSTLEHVTRYLVRSKKWDSILSVCTD 145 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + +LPD+ T RLV C++ARKFK++ LL +F +D + A+ A DSA Sbjct: 146 FKTYDLLPDRYTCSRLVTSCIRARKFKVVRTLLQVF-KSDGDVALPALDSA 195 >ref|XP_004151188.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Cucumis sativus] Length = 608 Score = 99.8 bits (247), Expect = 4e-19 Identities = 49/111 (44%), Positives = 70/111 (63%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C L ++ TE L Y+YY KAK F+PQK T++ IRYL R K W L++ +D Sbjct: 84 LNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLILLVSRD 143 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 DF V PD+ T +LV CV+ RKFK++ LL +F + D+ A+ AF++A Sbjct: 144 FVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVF-ERDSGVAMTAFEAA 193 >ref|XP_004172063.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Cucumis sativus] Length = 608 Score = 97.8 bits (242), Expect = 1e-18 Identities = 48/111 (43%), Positives = 70/111 (63%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C L ++ TE L Y+YY KAK F+PQK T++ IRYL R K W L++ +D Sbjct: 84 LNDFLCGLSQNPGTEDLIYDYYVKAKETAGFRPQKSTLRHLIRYLVRLKKWDLILLVSRD 143 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 DF V PD+ T +LV CV+ RKFK++ LL +F + ++ A+ AF++A Sbjct: 144 FVDFGVCPDRDTCSKLVSSCVRGRKFKVVKSLLEVF-ERNSGVAMTAFEAA 193 >ref|XP_006853735.1| hypothetical protein AMTR_s00056p00173630 [Amborella trichopoda] gi|548857396|gb|ERN15202.1| hypothetical protein AMTR_s00056p00173630 [Amborella trichopoda] Length = 616 Score = 96.7 bits (239), Expect = 3e-18 Identities = 50/112 (44%), Positives = 70/112 (62%), Gaps = 1/112 (0%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISF-CQ 158 LN I LF++ QTE LA++YY+KAK +P F P ++T+ ++ RTK W L F Sbjct: 88 LNKFIQGLFRNRQTETLAFDYYQKAKDQPEFLPDQFTVNALAGFILRTKQWSLFEEFLVV 147 Query: 157 DLKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 D+K F V PD CR++ C+KARKFKI + LL +F ND + V A++SA Sbjct: 148 DIKRFSVFPDDQICCRVLRTCIKARKFKITDLLLDVF-GNDKKLGVQAYESA 198 >ref|XP_004498165.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Cicer arietinum] gi|502184228|ref|XP_004517308.1| PREDICTED: pentatricopeptide repeat-containing protein At5g13770, chloroplastic-like [Cicer arietinum] Length = 632 Score = 94.4 bits (233), Expect = 2e-17 Identities = 48/111 (43%), Positives = 65/111 (58%) Frame = -2 Query: 334 LNSLICSLFKDSQTEHLAYEYYRKAKSKPNFKPQKYTMKLAIRYLFRTKSWGLLISFCQD 155 LN +C LF+D + + LA++YY++ K + F P+K T+K IRYL + K W S D Sbjct: 112 LNDFLCGLFEDQKKDELAFDYYQRLKERSEFIPKKSTLKYVIRYLMKFKKWEFFSSLSHD 171 Query: 154 LKDFRVLPDKSTSCRLVLDCVKARKFKILNHLLLLFMDNDAETAVLAFDSA 2 K + V PD +T RL+ C+K RKFKI LL F N +E V AF SA Sbjct: 172 FKVYHVFPDVATCSRLISFCIKNRKFKISETLLDAFSLN-SEVGVFAFGSA 221