BLASTX nr result
ID: Coptis25_contig00002119
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Coptis25_contig00002119 (2570 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002266660.1| PREDICTED: protein WEAK CHLOROPLAST MOVEMENT... 539 e-150 emb|CAN64061.1| hypothetical protein VITISV_000013 [Vitis vinifera] 536 e-149 ref|XP_002522782.1| Paramyosin, putative [Ricinus communis] gi|2... 508 e-141 ref|XP_002298876.1| predicted protein [Populus trichocarpa] gi|2... 502 e-139 ref|XP_003551426.1| PREDICTED: protein PLASTID MOVEMENT IMPAIRED... 494 e-137 >ref|XP_002266660.1| PREDICTED: protein WEAK CHLOROPLAST MOVEMENT UNDER BLUE LIGHT 1 [Vitis vinifera] Length = 650 Score = 539 bits (1388), Expect = e-150 Identities = 321/654 (49%), Positives = 401/654 (61%), Gaps = 5/654 (0%) Frame = +3 Query: 264 MGVKVPQSSTDSPKAEVGEIDTRAPFESVKAAVSLFGEAAFSGSGEKPVIKKSKAFFTER 443 MG K Q++TD+ K EVGEIDT APF+SVK AVSLFGE SGEKP I+K+K ER Sbjct: 1 MGTKDRQNATDNSKVEVGEIDTSAPFQSVKDAVSLFGEF----SGEKPSIRKAKPHSAER 56 Query: 444 VLAKETQLHLAQKDLNNIKEKLKSAEITKAEALTELEKAKRTVEDLSNKLRALNESKESA 623 VLAKETQLHLAQK+LN +KE+LK+AE TKA+AL EL+KAKRTVEDL+ KL ++ESKESA Sbjct: 57 VLAKETQLHLAQKELNKLKEQLKNAETTKAQALVELDKAKRTVEDLNQKLTTVSESKESA 116 Query: 624 MKETEAAKNQAEQLKEANLGGSSESGGSWKQELDTTKEEYTAAASELDAAKQELRKMRQD 803 +K TEAAKNQA+QL EAN G +E+ G+WKQ+++T K++YT ELDA KQE+ K RQD Sbjct: 117 VKATEAAKNQAKQLVEANTGNPAETDGAWKQDMETGKQQYTTIIVELDAVKQEVIKTRQD 176 Query: 804 FEASMEAKDTAFRKAAEAEHLANANKERIGEIKKEIVAADVSIMDVKLASVQAQQEHTKI 983 +AS+E K AF++A EAE +A AN ER E+ KEI A SI VKLAS QAQQE K+ Sbjct: 177 CDASLEGKAAAFKQADEAEQVAKANMERASELSKEISAVQESIGQVKLASEQAQQEQAKL 236 Query: 984 LSEKEAQRQSYRASXXXXXXXXXXXXXXFNPKLKRDLEAKLAETTSEAEVLQKEMKDSMT 1163 +EK+ QRQSYRA+ F+P+L R+LEA+LAET SE ++KEM+++ Sbjct: 237 YAEKDVQRQSYRATLEESAKKLFALKEAFDPELTRNLEAQLAETVSEIGAVKKEMENARA 296 Query: 1164 SDLDSVKSVTLDLDDAKGALHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1343 SDLDSVK+VTL+LDDAK +LH Sbjct: 297 SDLDSVKTVTLELDDAKESLHKVADEESSLRNLVESLKRELENVKKEHSEMKEKEAETES 356 Query: 1344 XAGNLHVKLRKSKAELEATVAEESKSRNASGELITTLQQLSLESENARQXXXXXXXXXXX 1523 AGNLHVKLRKSK+ELEA +AEESK+R AS E+I+TL QLSLE+E ARQ Sbjct: 357 IAGNLHVKLRKSKSELEACLAEESKARGASDEMISTLHQLSLETETARQEAEEMMKKAEE 416 Query: 1524 XXXXXXXTRMSLEVAEEKLQGTMXXXXXXXXXXMQAVDRIKILTEXXXXXXXXXXXXGAK 1703 T+ +LE AE+KL+ + +A+D+IKIL E GA Sbjct: 417 LKKEAQATKSALEEAEKKLRVALEEAEEAKVAETKALDQIKILAERTNAARASTSESGAN 476 Query: 1704 ITVSTEEFESLNRKGEEYDILADLRXXXXXXXXXXXXXSENEARKKLETSYKEIEEIKSA 1883 IT+STEEF++L+RK EE D LA+++ SE EA K+LE + KEIEE+K+A Sbjct: 477 ITISTEEFKALSRKVEESDTLAEMKVAAAMAQVEAVKASEQEAIKRLEATQKEIEEMKAA 536 Query: 1884 TEDXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXKSFESSPGHATAQKP 2063 TE L S ESSP QK Sbjct: 537 TEAALKRAETAEAAKRAVEGELRKWRERDQKKAAEAASRILAESEMSSESSPRQYRIQKQ 596 Query: 2064 TLPE---HKARKM--DKTSVSKKALLPSISGIFHRKKNQVEGGSPSYLPGEKPL 2210 P+ RKM +K+SVSKKALLP++SGIFHRKKNQ+EGGSPSYLPGEKP+ Sbjct: 597 NPPQKINEGGRKMEKEKSSVSKKALLPNLSGIFHRKKNQIEGGSPSYLPGEKPI 650 >emb|CAN64061.1| hypothetical protein VITISV_000013 [Vitis vinifera] Length = 650 Score = 536 bits (1380), Expect = e-149 Identities = 319/654 (48%), Positives = 400/654 (61%), Gaps = 5/654 (0%) Frame = +3 Query: 264 MGVKVPQSSTDSPKAEVGEIDTRAPFESVKAAVSLFGEAAFSGSGEKPVIKKSKAFFTER 443 MG K Q++TD+ K EVGEIDT APF+SVK AVSLFGE SGEKP I+K+K ER Sbjct: 1 MGTKDRQNATDNSKVEVGEIDTSAPFQSVKDAVSLFGEF----SGEKPSIRKAKPHSAER 56 Query: 444 VLAKETQLHLAQKDLNNIKEKLKSAEITKAEALTELEKAKRTVEDLSNKLRALNESKESA 623 VLAKETQLHLAQK+LN +KE+LK+AE TKA+AL EL+KAKRTVEDL+ KL ++ESKESA Sbjct: 57 VLAKETQLHLAQKELNKLKEQLKNAETTKAQALVELDKAKRTVEDLNQKLTTVSESKESA 116 Query: 624 MKETEAAKNQAEQLKEANLGGSSESGGSWKQELDTTKEEYTAAASELDAAKQELRKMRQD 803 +K TEAAKNQA+QL EAN G +E+ G WKQ+++T K++YT ELDA KQE+ K RQD Sbjct: 117 VKATEAAKNQAKQLVEANTGNPAETDGVWKQDMETGKQQYTTIIVELDAVKQEVIKTRQD 176 Query: 804 FEASMEAKDTAFRKAAEAEHLANANKERIGEIKKEIVAADVSIMDVKLASVQAQQEHTKI 983 +AS+E K AF++A EAE +A AN ER E+ KEI A SI VKLAS Q+QQE K+ Sbjct: 177 CDASLEGKAAAFKQADEAEQVAKANMERASELSKEISAVQESIGQVKLASEQSQQEQAKL 236 Query: 984 LSEKEAQRQSYRASXXXXXXXXXXXXXXFNPKLKRDLEAKLAETTSEAEVLQKEMKDSMT 1163 +EK+ QRQ+Y+A+ F+P+L R+LEA+LAET SE ++KEM+++ Sbjct: 237 YAEKDVQRQAYKATLEESAKKLFALKEAFDPELTRNLEAQLAETVSEIGAVKKEMENARA 296 Query: 1164 SDLDSVKSVTLDLDDAKGALHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1343 SDLDSVK+VTL+LDDAK +LH Sbjct: 297 SDLDSVKTVTLELDDAKESLHKVAEEESSLRNLVESLKRELENVKKEHSEMKEKEAETES 356 Query: 1344 XAGNLHVKLRKSKAELEATVAEESKSRNASGELITTLQQLSLESENARQXXXXXXXXXXX 1523 AGNLHVKLRKSK+ELEA +AEESK+R AS E+I+TL QLSLE+E ARQ Sbjct: 357 IAGNLHVKLRKSKSELEACLAEESKARGASDEMISTLHQLSLETETARQEAEEMMKKAEE 416 Query: 1524 XXXXXXXTRMSLEVAEEKLQGTMXXXXXXXXXXMQAVDRIKILTEXXXXXXXXXXXXGAK 1703 T+ +LE AE+KL+ + +A+D+IKIL E GA Sbjct: 417 LKQEAQATKSALEEAEKKLRVALEEAEEAKVAEAKALDQIKILAERTNAARASTSESGAN 476 Query: 1704 ITVSTEEFESLNRKGEEYDILADLRXXXXXXXXXXXXXSENEARKKLETSYKEIEEIKSA 1883 IT+STEEFE+L+RK EE D LA+++ SE EA K+LE + KEIEE+K+A Sbjct: 477 ITISTEEFEALSRKVEESDTLAEMKVAAAMAQVEAVKASEQEAVKRLEATQKEIEEMKAA 536 Query: 1884 TEDXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXKSFESSPGHATAQKP 2063 TE L S ESSP QK Sbjct: 537 TEAALKRAETAEAAKRAVEGELRKWRERDQKKAAEAASRILAESEMSSESSPRQYRIQKQ 596 Query: 2064 TLPE---HKARKM--DKTSVSKKALLPSISGIFHRKKNQVEGGSPSYLPGEKPL 2210 P+ RKM +K+SVSKKALLP++SGIFHRKKNQ+EGGSPSYLPGEKP+ Sbjct: 597 NPPQKINEGGRKMEKEKSSVSKKALLPNLSGIFHRKKNQIEGGSPSYLPGEKPI 650 >ref|XP_002522782.1| Paramyosin, putative [Ricinus communis] gi|223538020|gb|EEF39633.1| Paramyosin, putative [Ricinus communis] Length = 652 Score = 508 bits (1309), Expect = e-141 Identities = 304/654 (46%), Positives = 397/654 (60%), Gaps = 5/654 (0%) Frame = +3 Query: 264 MGVKVPQSSTDSPKAEVGEIDTRAPFESVKAAVSLFGEAAFSGSGEKPVIKKSKAFFTER 443 MG K Q++TDSPK EVGEIDT APF+SVK AV+LFGE AFSG EKP IKK++ ER Sbjct: 1 MGAKERQNATDSPKVEVGEIDTSAPFQSVKDAVTLFGEGAFSG--EKPAIKKTRPHSAER 58 Query: 444 VLAKETQLHLAQKDLNNIKEKLKSAEITKAEALTELEKAKRTVEDLSNKLRALNESKESA 623 VLAKETQLHLAQK+L+ +K+++K+AE TK +AL ELEKAKRTVEDLS KLR + E K++A Sbjct: 59 VLAKETQLHLAQKELSKLKDQVKNAETTKGQALVELEKAKRTVEDLSAKLRTVTELKDTA 118 Query: 624 MKETEAAKNQAEQLKEANLGGSSESGGSWKQELDTTKEEYTAAASELDAAKQELRKMRQD 803 ++ TEAAK+QA+Q++E G +S S G+ KQ+L++ +E+Y +ELDAAKQEL K+RQD Sbjct: 119 IRATEAAKSQAKQIEETKSGDASGSSGARKQDLESAREQYITVFTELDAAKQELWKIRQD 178 Query: 804 FEASMEAKDTAFRKAAEAEHLANANKERIGEIKKEIVAADVSIMDVKLASVQAQQEHTKI 983 EAS+EAK AF +AAEAEH A AN E++ E+ KEI A SI VKLAS+QAQQE KI Sbjct: 179 CEASLEAKLAAFNQAAEAEHAAKANVEKVSELSKEISALQESIGQVKLASLQAQQEQAKI 238 Query: 984 LSEKEAQRQSYRASXXXXXXXXXXXXXXFNPKLKRDLEAKLAETTSEAEVLQKEMKDSMT 1163 +EK Q+QSY+A+ F+P+L +LE +LAET +E + LQK+M+++ Sbjct: 239 FAEKGVQKQSYKATLEASANKLLALKNEFDPELVFNLEKQLAETITEIDALQKQMENAKA 298 Query: 1164 SDLDSVKSVTLDLDDAKGALHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1343 SDLDSV++VT +LD AK +L Sbjct: 299 SDLDSVRTVTSELDGAKESLQKVAEEESSLRSLVESLKLELENVKKEHSELREKEAETES 358 Query: 1344 XAGNLHVKLRKSKAELEATVAEESKSRNASGELITTLQQLSLESENARQXXXXXXXXXXX 1523 AGNLHVKLRKSKAELEA AEESK+R AS E+I+TL QLS E+ENA+Q Sbjct: 359 AAGNLHVKLRKSKAELEAAAAEESKTRGASEEMISTLHQLSSEAENAQQEAEEMKNKAEE 418 Query: 1524 XXXXXXXTRMSLEVAEEKLQGTMXXXXXXXXXXMQAVDRIKILTEXXXXXXXXXXXXGAK 1703 TR++LE AE+KL+ + +A+D+IK L+E GA Sbjct: 419 LKSEAEATRIALEEAEKKLRVALEEAEEAKLAETRALDQIKTLSERTNAARASTSESGAN 478 Query: 1704 ITVSTEEFESLNRKGEEYDILADLRXXXXXXXXXXXXXSENEARKKLETSYKEIEEIKSA 1883 IT+S EE+E+L+RK E + LA+++ SENEA + E KEI+++K+A Sbjct: 479 ITISREEYEALSRKVGESESLAEMKVAAAMAQVEAVKASENEALNRFEAIQKEIDDMKAA 538 Query: 1884 TEDXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXKSFESSPGHATAQKP 2063 T++ L S ESSP H QK Sbjct: 539 TQEAVKRAEMAEAAKKAVEGELRRWREREQKKAAETASRILAETEMSIESSPHHYRIQKQ 598 Query: 2064 T-LPEH--KARKMD--KTSVSKKALLPSISGIFHRKKNQVEGGSPSYLPGEKPL 2210 P++ + RK+D K S SKKALLP++SGIF RKKNQ+EGGSPSYLPGEKP+ Sbjct: 599 NPAPKNVIEVRKLDKEKNSASKKALLPNLSGIFQRKKNQIEGGSPSYLPGEKPV 652 >ref|XP_002298876.1| predicted protein [Populus trichocarpa] gi|222846134|gb|EEE83681.1| predicted protein [Populus trichocarpa] Length = 652 Score = 502 bits (1293), Expect = e-139 Identities = 296/654 (45%), Positives = 396/654 (60%), Gaps = 5/654 (0%) Frame = +3 Query: 264 MGVKVPQSSTDSPKAEVGEIDTRAPFESVKAAVSLFGEAAFSGSGEKPVIKKSKAFFTER 443 MG K Q++T SPK EVGEIDTRAPF+SVK AV+LFGE AFSG EKP I+K+K ER Sbjct: 1 MGAKECQNTTGSPKVEVGEIDTRAPFQSVKDAVTLFGEGAFSG--EKPAIRKAKPHSAER 58 Query: 444 VLAKETQLHLAQKDLNNIKEKLKSAEITKAEALTELEKAKRTVEDLSNKLRALNESKESA 623 VLAKETQLHLAQK++N +K+++++AE TKA+AL ELEKAKRTVEDL++KL+ + ESKESA Sbjct: 59 VLAKETQLHLAQKEMNKLKDQVRNAETTKAQALVELEKAKRTVEDLTDKLKTVTESKESA 118 Query: 624 MKETEAAKNQAEQLKEANLGGSSESGGSWKQELDTTKEEYTAAASELDAAKQELRKMRQD 803 ++ETEAAKNQA+Q++E + S G+ KQ+L++T+E+Y +ELDA KQELRK+RQ+ Sbjct: 119 IRETEAAKNQAKQIEETSNIDLPGSDGARKQDLESTREQYMTVFTELDATKQELRKIRQE 178 Query: 804 FEASMEAKDTAFRKAAEAEHLANANKERIGEIKKEIVAADVSIMDVKLASVQAQQEHTKI 983 ++ S+EAK AF +AA AEH A AN E++ E+ KEI A SI KL +++A QE KI Sbjct: 179 YDTSLEAKLAAFNQAAAAEHAAKANVEKVSELSKEISALQESIGQAKLVALEAHQEQAKI 238 Query: 984 LSEKEAQRQSYRASXXXXXXXXXXXXXXFNPKLKRDLEAKLAETTSEAEVLQKEMKDSMT 1163 +EK+ RQSY+A+ F+P+L R+LE +LAET +E LQK+M+++ Sbjct: 239 FAEKDVLRQSYKATLEASANKLLVLKNEFDPELARNLEKQLAETMNEIGALQKQMENAKA 298 Query: 1164 SDLDSVKSVTLDLDDAKGALHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1343 SDLDSVK+VT +LD AK L Sbjct: 299 SDLDSVKTVTSELDGAKEFLQKVSEEENSLRSLLESLKLELENVKKEHSQLKEKEAETES 358 Query: 1344 XAGNLHVKLRKSKAELEATVAEESKSRNASGELITTLQQLSLESENARQXXXXXXXXXXX 1523 AGNLHVKLRKSK ELE + EESK++ AS E+I+TL QLS E+E+AR+ Sbjct: 359 IAGNLHVKLRKSKTELEQALVEESKAKGASEEMISTLHQLSSEAESARKEAEEMKSKAEE 418 Query: 1524 XXXXXXXTRMSLEVAEEKLQGTMXXXXXXXXXXMQAVDRIKILTEXXXXXXXXXXXXGAK 1703 TR++LE AE+KL+ + +A+D+IK L+E GAK Sbjct: 419 LKNIAEATRIALEEAEKKLRVALEEVEEAKTAETRALDQIKALSERTNAARASTSESGAK 478 Query: 1704 ITVSTEEFESLNRKGEEYDILADLRXXXXXXXXXXXXXSENEARKKLETSYKEIEEIKSA 1883 IT+S EE E+L+RK EE D LA+++ SENEA K+LE + K+IE++++A Sbjct: 479 ITISREECEALSRKVEESDTLAEMKVAAAVAQIEAVKASENEALKRLEAAQKDIEDMRAA 538 Query: 1884 TEDXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXKSFESSPGHATAQK- 2060 TE+ L + ESSP H QK Sbjct: 539 TEEASKRAEMAEAAKRAVEGELRRWREREQKKAADTASRILAETQMASESSPHHYRNQKQ 598 Query: 2061 ----PTLPEHKARKMDKTSVSKKALLPSISGIFHRKKNQVEGGSPSYLPGEKPL 2210 T+ E + +K S+SKKALLP++SGIF+RKKNQ+EGGSPSYLPGEKP+ Sbjct: 599 NPAIQTVIEVRKLDKEKFSLSKKALLPNLSGIFYRKKNQIEGGSPSYLPGEKPV 652 >ref|XP_003551426.1| PREDICTED: protein PLASTID MOVEMENT IMPAIRED 2-like [Glycine max] Length = 653 Score = 494 bits (1272), Expect = e-137 Identities = 298/655 (45%), Positives = 386/655 (58%), Gaps = 6/655 (0%) Frame = +3 Query: 264 MGVKVPQSSTDSP--KAEVGEIDTRAPFESVKAAVSLFGEAAFSGSGEKPVIKKSKAFFT 437 M K+ QS+T+SP K EVGEIDT PF+SVK AVSLFGE AFSG EKP+ KK+K + Sbjct: 1 MVAKIRQSATESPNPKPEVGEIDTSPPFQSVKDAVSLFGEGAFSG--EKPIFKKAKPYSA 58 Query: 438 ERVLAKETQLHLAQKDLNNIKEKLKSAEITKAEALTELEKAKRTVEDLSNKLRALNESKE 617 ERVLAKETQLH+AQK+LN ++E++K+AE TKA+AL ELE+AKRTVEDL+ K++ +++S+E Sbjct: 59 ERVLAKETQLHVAQKELNKLREQVKNAETTKAQALVELERAKRTVEDLTQKIKVISDSRE 118 Query: 618 SAMKETEAAKNQAEQLKEANLGGSSESGGSWKQELDTTKEEYTAAASELDAAKQELRKMR 797 A++ TEAAK+QA+QL E G + +WK+EL+ + Y + +ELDAAKQ L K R Sbjct: 119 LAIEATEAAKSQAKQLTEEKYGVPDGTNVAWKEELEAAVKRYASVMTELDAAKQALSKTR 178 Query: 798 QDFEASMEAKDTAFRKAAEAEHLANANKERIGEIKKEIVAADVSIMDVKLASVQAQQEHT 977 Q++++S++AK +AF+ AAEA + N ER E+ KEI A SI KLAS+ AQQ+ T Sbjct: 179 QEYDSSLDAKKSAFKLAAEAGDASKENTERASELSKEISAVKESIEQAKLASIVAQQQQT 238 Query: 978 KILSEKEAQRQSYRASXXXXXXXXXXXXXXFNPKLKRDLEAKLAETTSEAEVLQKEMKDS 1157 IL+EK+ RQSY+A+ F+P+L ++LE +LAET SE LQKEM++ Sbjct: 239 MILAEKDVLRQSYKATLEQSEKKLLALKKEFSPELAKNLEMQLAETMSEIGTLQKEMENK 298 Query: 1158 MTSDLDSVKSVTLDLDDAKGALHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1337 TSDLDSVKSVTL+LDDAK +L Sbjct: 299 RTSDLDSVKSVTLELDDAKESLQKVADEESSLRSLVESLKVELENVKREHSELQDKESET 358 Query: 1338 XXXAGNLHVKLRKSKAELEATVAEESKSRNASGELITTLQQLSLESENARQXXXXXXXXX 1517 GNLHVKLRK K+ELEA +AEESK R AS E+I TL QL+ E+ENAR+ Sbjct: 359 ESIVGNLHVKLRKCKSELEACMAEESKVRGASEEMILTLSQLTSETENARREAEDMKNRT 418 Query: 1518 XXXXXXXXXTRMSLEVAEEKLQGTMXXXXXXXXXXMQAVDRIKILTEXXXXXXXXXXXXG 1697 T ++LE AE+ L+ + A+D+I ILTE G Sbjct: 419 AELKKEVEVTMLALEEAEKNLKVALEEAEAAKAAEKSALDQITILTERTTAARASTSESG 478 Query: 1698 AKITVSTEEFESLNRKGEEYDILADLRXXXXXXXXXXXXXSENEARKKLETSYKEIEEIK 1877 A IT+S EEF+SL+ K EE D LAD++ SENEA K+LET+ KEIE+IK Sbjct: 479 AVITISKEEFDSLSHKVEESDKLADMKVAAAKAQVEAVKASENEALKRLETTQKEIEDIK 538 Query: 1878 SATEDXXXXXXXXXXXXXXXXXXLXXXXXXXXXXXXXXXXXXXXXXXKSFESSPGHATAQ 2057 +AT++ L S ESSP H Q Sbjct: 539 TATQEALKKAEMAEAAKRAVESELRRWREREQKRAAEAASRILAETQVSTESSPQHYRIQ 598 Query: 2058 KP----TLPEHKARKMDKTSVSKKALLPSISGIFHRKKNQVEGGSPSYLPGEKPL 2210 K T+ E K + +K SVSKK LLP+ISGIF RKKNQVEGGSPSYLPGE P+ Sbjct: 599 KQNPPRTMVEVKKFEKEKVSVSKKTLLPNISGIFQRKKNQVEGGSPSYLPGENPV 653