BLASTX nr result
ID: Glycyrrhiza24_contig00014264
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza24_contig00014264 (2778 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi... 1015 0.0 ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi... 778 0.0 ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|2... 770 0.0 ref|XP_002526948.1| pentatricopeptide repeat-containing protein,... 750 0.0 ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar... 709 0.0 >ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610-like [Glycine max] Length = 808 Score = 1015 bits (2624), Expect = 0.0 Identities = 559/830 (67%), Positives = 627/830 (75%), Gaps = 17/830 (2%) Frame = +2 Query: 38 LSTLPSRGG------FVLGSSCVVTDLNRKRRRMKLGFVYSISHSTSFGVFQCLTXXXXX 199 +ST PS+ F +G S V TD NR RRR+KLGF +S+SHS VFQ Sbjct: 2 ISTWPSKVNHLVVPRFEIGPSGV-TDQNR-RRRVKLGFAFSVSHSEKVSVFQ-------- 51 Query: 200 XXXXNSTVVFSGYPNPKFDLRCGFLLGYPGLKYDYTLLKPNKSHVGDLALPPLGWALEEE 379 TVVFSG+ K DLRCGFLLG K +LKP+KSHVGDLA PPLGWALEE+ Sbjct: 52 FSRGYGTVVFSGHA--KLDLRCGFLLGCSRPKLGI-ILKPHKSHVGDLA-PPLGWALEED 107 Query: 380 --------ENTNSTDHTSVNGESEGINPSNSDPVRDGYYCEQEMRGDDNSEHEEAVGDEL 535 E +S D SVN ESEG+ N D V+D + E ++RG D+ + +E+ G+EL Sbjct: 108 GVGSELVDEQIDSND-ASVNRESEGVKSLNLDQVQDSDF-EGQIRGYDD-DSKESGGNEL 164 Query: 536 VEEKTNSTDHTSVNGESEGINSLNSDPVRDGYYCEQEMRXXXXXXXXXXXXXXXXXXXRV 715 VEE+T+S D VNG+ EG+ SLN D V+D CE +M +V Sbjct: 165 VEEQTDSND-ALVNGDLEGVKSLNLDQVKDSD-CEGKM----CGDDNSKEGGEEESDGKV 218 Query: 716 DVRELANSLQTAKTVEDVEEILKDKGDLPLQVYSTIISWFGKQKRMDSALILFDWMKKRN 895 DVR LA SLQT KTVEDV ILKDKGDLPLQV+STIIS FGK+KRMDSALILF+WMKKR Sbjct: 219 DVRALALSLQTVKTVEDVGGILKDKGDLPLQVFSTIISGFGKEKRMDSALILFNWMKKRK 278 Query: 896 VETNGSFAMNIFIYNSLLGVVKQCEQFEEMEAILSEMARDGMAYNVVTYNTLMAIYIAKG 1075 +ETNGSF N+FIYN LLGVVKQ QF EME IL+EMA DG+AYNVVTYNTLMAIYI KG Sbjct: 279 IETNGSFGPNLFIYNGLLGVVKQSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKG 338 Query: 1076 EGEKALGMLEEIHRNGLTPSPVSYSQAMLAYRRMEDGNGALNFFVEFREKYRAXXXXXXX 1255 E +KAL MLEEI RNGLTPSPVSYSQA+LAYRRMEDG GALNFFVEFREKYR Sbjct: 339 ECDKALNMLEEIRRNGLTPSPVSYSQALLAYRRMEDGYGALNFFVEFREKYRQGEIGKDD 398 Query: 1256 XXXXXXXXCRSLERFTIRVCYQIMRRWXXXXXXXXXXXXRFLVSMDNAGISLSRADLERL 1435 C LE+FTIRVCYQ+MR W +FLV MDN GI L RADLERL Sbjct: 399 DGEDWEKECLKLEKFTIRVCYQVMRCWLVSRDNLSKNVLKFLVDMDNVGIPLPRADLERL 458 Query: 1436 VWACTREDHYRVVKELYLRIRERYDKISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGP 1615 WACTREDHY VVKELY RIRERYDKISLSVCNH IWLMGKAKKWWAALEIYEDLL+KGP Sbjct: 459 AWACTREDHYIVVKELYNRIRERYDKISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGP 518 Query: 1616 KPNNLSYELVMSHFRFLLSAAKKKGIWKWGVRLLNKMEEKGLKPGSNEWNAVLVACSKAS 1795 KPNNLSYEL++SHF FLLSAAK+KGIW+WGV+LLNKME+KGLKPG EWNAVLVACSKAS Sbjct: 519 KPNNLSYELIVSHFNFLLSAAKRKGIWRWGVKLLNKMEDKGLKPGCREWNAVLVACSKAS 578 Query: 1796 ETTAAVQIFRRMVENGEKPTIISYGALLSALEKGKLYDEAFRVWNHMLKVGVEPNAYTYT 1975 ETTAAVQIF+RMVENGEKPTIISYGALLSALEKGKLYD+A RVWNHM+KVGVEPNAY YT Sbjct: 579 ETTAAVQIFKRMVENGEKPTIISYGALLSALEKGKLYDDALRVWNHMIKVGVEPNAYAYT 638 Query: 1976 IMASIYTAQGNFSRVDAIIREMVTIGVEVTVVTYNAIISGSARNGFSSAAYEWFHRMKVQ 2155 IMASI+TAQGNF+RVDAII+EMVT+G+EVTVVTYNAII+G A NG SS AYEWFHRMKVQ Sbjct: 639 IMASIHTAQGNFNRVDAIIQEMVTLGIEVTVVTYNAIITGCAHNGMSSVAYEWFHRMKVQ 698 Query: 2156 NISPNETTYEMLIEALANDGKPRLAYELYLRAQNEGLFLSSKAYDAILQSSLAYGATIDV 2335 NISPNE TYEMLI ALANDGKPRLAY+LY RA+NEGL LSSKAYDA++QSS A ATI++ Sbjct: 699 NISPNEITYEMLIVALANDGKPRLAYQLYTRAKNEGLTLSSKAYDAVVQSSQANNATIEL 758 Query: 2336 GGLGPRPEDKMKKVQIGK--NH*LNSSRRRSKTNLIERKIITH-RHEKSP 2476 G LGPRP DK KKVQI K N N + ++ +R I H + E+SP Sbjct: 759 GLLGPRPVDKKKKVQIRKTLNEFYNLAGVPKRSQPFDRNEIYHSQTEESP 808 >ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610 [Vitis vinifera] Length = 763 Score = 778 bits (2010), Expect = 0.0 Identities = 411/712 (57%), Positives = 492/712 (69%) Frame = +2 Query: 242 NPKFDLRCGFLLGYPGLKYDYTLLKPNKSHVGDLALPPLGWALEEEENTNSTDHTSVNGE 421 +PKFD CG L GY LK + L + + G A L WALE++ Sbjct: 71 SPKFDFGCGLLSGYSKLKI-FLLCERKRGSFG--ASFALAWALEQQ-------------- 113 Query: 422 SEGINPSNSDPVRDGYYCEQEMRGDDNSEHEEAVGDELVEEKTNSTDHTSVNGESEGINS 601 A+G+E V+E +NS + N E+ I+ Sbjct: 114 --------------------------------AIGNEFVKEDSNSIHSLAGNTETVDIDC 141 Query: 602 LNSDPVRDGYYCEQEMRXXXXXXXXXXXXXXXXXXXRVDVRELANSLQTAKTVEDVEEIL 781 L D RDG + E VDVR LA+ L+ A T +DVEE+L Sbjct: 142 LKVDGARDGDENDNEEEKEAEKNGEVIEEKSR----NVDVRALAHGLEFATTADDVEEVL 197 Query: 782 KDKGDLPLQVYSTIISWFGKQKRMDSALILFDWMKKRNVETNGSFAMNIFIYNSLLGVVK 961 KDK +LPLQVYST+I FG KR+D+A+ L +W+K++ ETNGS N+F+YNSLLG VK Sbjct: 198 KDKVELPLQVYSTMIRGFGTDKRLDAAMALVEWLKRKK-ETNGSKGPNLFVYNSLLGAVK 256 Query: 962 QCEQFEEMEAILSEMARDGMAYNVVTYNTLMAIYIAKGEGEKALGMLEEIHRNGLTPSPV 1141 Q E+F +E ++++MAR+G+ NVVTYNTLM+IY+ +G +AL +LEEI +NGL PSPV Sbjct: 257 QSEKFALVEKVMNDMAREGILPNVVTYNTLMSIYLEQGRSVEALNILEEIQKNGLCPSPV 316 Query: 1142 SYSQAMLAYRRMEDGNGALNFFVEFREKYRAXXXXXXXXXXXXXXXCRSLERFTIRVCYQ 1321 SYS A+L YRRMEDG+GAL FF+E RE Y + L+ FTIR+CYQ Sbjct: 317 SYSTALLVYRRMEDGHGALKFFIELRENYLKGEIGKDADEDWENEFVK-LKNFTIRICYQ 375 Query: 1322 IMRRWXXXXXXXXXXXXRFLVSMDNAGISLSRADLERLVWACTREDHYRVVKELYLRIRE 1501 +MRRW + L MDNAG+ RA+ ERLVWACTRE+HY V KELY RIRE Sbjct: 376 VMRRWLVKEGNQSPILLKLLADMDNAGLQPGRAEYERLVWACTREEHYVVAKELYTRIRE 435 Query: 1502 RYDKISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKGPKPNNLSYELVMSHFRFLLSAAK 1681 R+ +ISLSVCNH+IWLMGKAKKWWAALEIYEDLL+KGPKPNNLSYELV+SHF LL+AA+ Sbjct: 436 RHTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNILLTAAR 495 Query: 1682 KKGIWKWGVRLLNKMEEKGLKPGSNEWNAVLVACSKASETTAAVQIFRRMVENGEKPTII 1861 KKGIW+WGVRLLNKME+KGLKPGS EWNAVLVACSKA+ET+AAV+IFRRMVE GEKPTII Sbjct: 496 KKGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKAAETSAAVEIFRRMVEQGEKPTII 555 Query: 1862 SYGALLSALEKGKLYDEAFRVWNHMLKVGVEPNAYTYTIMASIYTAQGNFSRVDAIIREM 2041 SYGALLSALEKGKLYDEA RVW HM+K+GVEPN Y YTIMASI QG RVD+I+REM Sbjct: 556 SYGALLSALEKGKLYDEASRVWEHMVKMGVEPNLYAYTIMASICVGQGKLQRVDSILREM 615 Query: 2042 VTIGVEVTVVTYNAIISGSARNGFSSAAYEWFHRMKVQNISPNETTYEMLIEALANDGKP 2221 T+G++ TVVTYNAIISG ARNG SSAA+EWFHRMKV I PNE TYEMLIEALA DGKP Sbjct: 616 ETLGIDATVVTYNAIISGCARNGLSSAAFEWFHRMKVGKIQPNEITYEMLIEALAKDGKP 675 Query: 2222 RLAYELYLRAQNEGLFLSSKAYDAILQSSLAYGATIDVGGLGPRPEDKMKKV 2377 RLA+ELY RAQNEGL LS+KAYDA++ SS + ATIDV LGPRP +K KK+ Sbjct: 676 RLAFELYSRAQNEGLNLSTKAYDAVVLSSQVHSATIDVSLLGPRPPEKKKKL 727 >ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1| predicted protein [Populus trichocarpa] Length = 709 Score = 770 bits (1989), Expect = 0.0 Identities = 385/560 (68%), Positives = 442/560 (78%) Frame = +2 Query: 710 RVDVRELANSLQTAKTVEDVEEILKDKGDLPLQVYSTIISWFGKQKRMDSALILFDWMKK 889 ++DV LA SL AKTV+D+EE+LKDKG+LP+QVY ++I FG K+M+ A+ L DW+K Sbjct: 122 KIDVPALAQSLYFAKTVDDIEEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKI 181 Query: 890 RNVETNGSFAMNIFIYNSLLGVVKQCEQFEEMEAILSEMARDGMAYNVVTYNTLMAIYIA 1069 + ET+G+ N+FIYNSLL VKQ EQ+EE E IL M ++G+A NVVTYN LM IY+ Sbjct: 182 KK-ETDGTIVPNLFIYNSLLSAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVK 240 Query: 1070 KGEGEKALGMLEEIHRNGLTPSPVSYSQAMLAYRRMEDGNGALNFFVEFREKYRAXXXXX 1249 +G+ +KAL +LEE+ RNG TPS SYS A+LAYR+MEDG+GAL FFVE ++KY Sbjct: 241 QGQAKKALDVLEEMRRNGFTPSAASYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGK 300 Query: 1250 XXXXXXXXXXCRSLERFTIRVCYQIMRRWXXXXXXXXXXXXRFLVSMDNAGISLSRADLE 1429 + LE FTIRVCYQ+MRRW + L MD A + R+D E Sbjct: 301 DADEDWEREYVK-LENFTIRVCYQVMRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYE 359 Query: 1430 RLVWACTREDHYRVVKELYLRIRERYDKISLSVCNHVIWLMGKAKKWWAALEIYEDLLEK 1609 RLVWACTRE+HY V KELY+RIRER ISLSVCNHVIWLMGKAKKWWAALE+YEDLL+K Sbjct: 360 RLVWACTREEHYVVAKELYIRIRERCSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDK 419 Query: 1610 GPKPNNLSYELVMSHFRFLLSAAKKKGIWKWGVRLLNKMEEKGLKPGSNEWNAVLVACSK 1789 GPKPNNLSYEL++S+F LL+AAKK+GIW+WGVRLLNKMEEKGLKPGS EWNAVLVACSK Sbjct: 420 GPKPNNLSYELIVSYFNVLLTAAKKRGIWRWGVRLLNKMEEKGLKPGSKEWNAVLVACSK 479 Query: 1790 ASETTAAVQIFRRMVENGEKPTIISYGALLSALEKGKLYDEAFRVWNHMLKVGVEPNAYT 1969 ASET AAVQIFRRMVE GEKPT+ISYGALLSALEKG+LYDEA RVW HMLKVGV+PN Y Sbjct: 480 ASETAAAVQIFRRMVEQGEKPTVISYGALLSALEKGRLYDEAVRVWEHMLKVGVKPNVYA 539 Query: 1970 YTIMASIYTAQGNFSRVDAIIREMVTIGVEVTVVTYNAIISGSARNGFSSAAYEWFHRMK 2149 YTIMAS++T QGNF VDAII EMV+ G+E TVVTYNAIISG ARN SSAAYEWFHRMK Sbjct: 540 YTIMASVFTRQGNFRLVDAIINEMVSTGIEPTVVTYNAIISGCARNNLSSAAYEWFHRMK 599 Query: 2150 VQNISPNETTYEMLIEALANDGKPRLAYELYLRAQNEGLFLSSKAYDAILQSSLAYGATI 2329 VQNISPNE TY+MLIEALA GKPRLAYELYLRAQNE L LS KAYDA++ SS AYGATI Sbjct: 600 VQNISPNEITYDMLIEALAKSGKPRLAYELYLRAQNEDLQLSPKAYDAVMHSSEAYGATI 659 Query: 2330 DVGGLGPRPEDKMKKVQIGK 2389 D LGPRP DK KKVQI K Sbjct: 660 DTSVLGPRPPDKKKKVQIRK 679 >ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223533700|gb|EEF35435.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 671 Score = 750 bits (1936), Expect = 0.0 Identities = 368/559 (65%), Positives = 440/559 (78%) Frame = +2 Query: 713 VDVRELANSLQTAKTVEDVEEILKDKGDLPLQVYSTIISWFGKQKRMDSALILFDWMKKR 892 +DVR LA SL +A+T +DVEE+LKDKG+LPLQVYS++I FG +M+SAL L +W+K+R Sbjct: 83 IDVRSLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNKMESALALVEWLKRR 142 Query: 893 NVETNGSFAMNIFIYNSLLGVVKQCEQFEEMEAILSEMARDGMAYNVVTYNTLMAIYIAK 1072 E S N+FIYNSLL VK+ + FEE E IL++M ++G+A NVVTYNTLM IY+ K Sbjct: 143 K-EIGSSIGPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPNVVTYNTLMGIYVEK 201 Query: 1073 GEGEKALGMLEEIHRNGLTPSPVSYSQAMLAYRRMEDGNGALNFFVEFREKYRAXXXXXX 1252 G+ KAL +LE++H G P+ SYS A+LAYR MEDG+GAL FFV+ ++KY Sbjct: 202 GQATKALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFVDIKDKYLKGKIGKN 261 Query: 1253 XXXXXXXXXCRSLERFTIRVCYQIMRRWXXXXXXXXXXXXRFLVSMDNAGISLSRADLER 1432 + LE F IR+CYQ+MRRW + L MD AG+ S+A+ ER Sbjct: 262 SDENWENEFVK-LETFIIRICYQVMRRWLVRHDNFSTDVLKLLTDMDKAGLQPSQAEYER 320 Query: 1433 LVWACTREDHYRVVKELYLRIRERYDKISLSVCNHVIWLMGKAKKWWAALEIYEDLLEKG 1612 LVWACTREDHY V KELY+RIRER+ KISLSVCNH+IWLMGKAKKWWAALEIYEDLL+KG Sbjct: 321 LVWACTREDHYAVGKELYIRIRERHSKISLSVCNHLIWLMGKAKKWWAALEIYEDLLDKG 380 Query: 1613 PKPNNLSYELVMSHFRFLLSAAKKKGIWKWGVRLLNKMEEKGLKPGSNEWNAVLVACSKA 1792 P PNN+SYEL++SHF LL+AA+K+GIW+WGVRLLNKME+KGLKPGS EWNAVLVACSKA Sbjct: 381 PNPNNMSYELIVSHFNILLTAARKRGIWRWGVRLLNKMEDKGLKPGSREWNAVLVACSKA 440 Query: 1793 SETTAAVQIFRRMVENGEKPTIISYGALLSALEKGKLYDEAFRVWNHMLKVGVEPNAYTY 1972 SETTAAVQIFRRM+E GEKPTI+SYGALLSALEKGKLYDEA RVW HMLKV V+PN Y Y Sbjct: 441 SETTAAVQIFRRMIEQGEKPTIVSYGALLSALEKGKLYDEAVRVWEHMLKVDVKPNLYAY 500 Query: 1973 TIMASIYTAQGNFSRVDAIIREMVTIGVEVTVVTYNAIISGSARNGFSSAAYEWFHRMKV 2152 TIMAS++ QG F+ VDAII++MV+ G+E T++TYNAIISG N SSAAYEWFHRMKV Sbjct: 501 TIMASVFAGQGKFTYVDAIIQKMVSSGIEPTIITYNAIISGCTHNNLSSAAYEWFHRMKV 560 Query: 2153 QNISPNETTYEMLIEALANDGKPRLAYELYLRAQNEGLFLSSKAYDAILQSSLAYGATID 2332 QN+ PN+ TYEMLIEALA DGKPRLAYELYLRA+ EGL LS+K YDA+L+SS YGATID Sbjct: 561 QNMPPNKITYEMLIEALAKDGKPRLAYELYLRAKYEGLDLSAKVYDAVLRSSQVYGATID 620 Query: 2333 VGGLGPRPEDKMKKVQIGK 2389 + LGPRP DK K+V+I K Sbjct: 621 INVLGPRPPDKKKRVKIRK 639 >ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 665 Score = 709 bits (1830), Expect = 0.0 Identities = 347/552 (62%), Positives = 424/552 (76%) Frame = +2 Query: 710 RVDVRELANSLQTAKTVEDVEEILKDKGDLPLQVYSTIISWFGKQKRMDSALILFDWMKK 889 RVDVRELA SL+ AKT +DV+ +LKDKG+LPLQV+ +I FGK KR+ A+ + DW+K+ Sbjct: 115 RVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFCAMIKGFGKDKRLKPAVAVVDWLKR 174 Query: 890 RNVETNGSFAMNIFIYNSLLGVVKQCEQFEEMEAILSEMARDGMAYNVVTYNTLMAIYIA 1069 + E+ G N+FIYNSLLG ++ F E E IL +M +G+ N+VTYNTLM IY+ Sbjct: 175 KKSESGGVIGPNLFIYNSLLGAMRG---FGEAEKILKDMEEEGIVPNIVTYNTLMVIYME 231 Query: 1070 KGEGEKALGMLEEIHRNGLTPSPVSYSQAMLAYRRMEDGNGALNFFVEFREKYRAXXXXX 1249 +GE KALG+L+ G P+P++YS A+L YRRMEDG GAL FFVE REKY Sbjct: 232 EGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYAKREIGN 291 Query: 1250 XXXXXXXXXXCRSLERFTIRVCYQIMRRWXXXXXXXXXXXXRFLVSMDNAGISLSRADLE 1429 + LE F R+CYQ+MRRW + L +MD+AG+ SR + E Sbjct: 292 DVGYDWEFEFVK-LENFIGRICYQVMRRWLVKDDNWTTRVLKLLNAMDSAGVRPSREEHE 350 Query: 1430 RLVWACTREDHYRVVKELYLRIRERYDKISLSVCNHVIWLMGKAKKWWAALEIYEDLLEK 1609 RL+WACTRE+HY V KELY RIRER+ +ISLSVCNH+IWLMGKAKKWWAALEIYEDLL++ Sbjct: 351 RLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDE 410 Query: 1610 GPKPNNLSYELVMSHFRFLLSAAKKKGIWKWGVRLLNKMEEKGLKPGSNEWNAVLVACSK 1789 GP+PNNLSYELV+SHF LLSAA K+GIW+WGVRLLNKME+KGLKP WNAVLVACSK Sbjct: 411 GPEPNNLSYELVVSHFNILLSAASKRGIWRWGVRLLNKMEDKGLKPQRRHWNAVLVACSK 470 Query: 1790 ASETTAAVQIFRRMVENGEKPTIISYGALLSALEKGKLYDEAFRVWNHMLKVGVEPNAYT 1969 ASETTAA+QIF+ MV+NGEKPT+ISYGALLSALEKGKLYDEAFRVWNHM+KVG+EPN Y Sbjct: 471 ASETTAAIQIFKAMVDNGEKPTVISYGALLSALEKGKLYDEAFRVWNHMIKVGIEPNLYA 530 Query: 1970 YTIMASIYTAQGNFSRVDAIIREMVTIGVEVTVVTYNAIISGSARNGFSSAAYEWFHRMK 2149 YT MAS+ T Q F+ +D +++EM + G+E +VVT+NA+ISG ARNG S AYEWFHRMK Sbjct: 531 YTTMASVLTGQQKFNLLDTLLKEMASKGIEPSVVTFNAVISGCARNGLSGVAYEWFHRMK 590 Query: 2150 VQNISPNETTYEMLIEALANDGKPRLAYELYLRAQNEGLFLSSKAYDAILQSSLAYGATI 2329 +N+ PNE TYEMLIEALAND KPRLAYEL+++AQNEGL LSSK YDA+++S+ YGATI Sbjct: 591 SENVEPNEITYEMLIEALANDAKPRLAYELHVKAQNEGLKLSSKPYDAVVKSAETYGATI 650 Query: 2330 DVGGLGPRPEDK 2365 D+ LGPRP+ K Sbjct: 651 DLNLLGPRPDKK 662