BLASTX nr result
ID: Akebia25_contig00013012
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00013012 (1827 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_007203878.1| hypothetical protein PRUPE_ppa005090mg [Prun... 430 e-117 ref|XP_002272586.2| PREDICTED: uncharacterized protein LOC100248... 427 e-116 ref|XP_006338254.1| PREDICTED: histone-binding protein N1/N2-lik... 426 e-116 ref|XP_004232075.1| PREDICTED: uncharacterized protein LOC101257... 425 e-116 ref|XP_006338255.1| PREDICTED: histone-binding protein N1/N2-lik... 422 e-115 ref|XP_002310182.2| hypothetical protein POPTR_0007s12020g [Popu... 421 e-115 ref|XP_006466683.1| PREDICTED: histone-binding protein N1/N2-lik... 415 e-113 ref|XP_007047098.1| Tetratricopeptide repeat-like superfamily pr... 413 e-112 ref|XP_004300573.1| PREDICTED: uncharacterized protein LOC101312... 412 e-112 ref|XP_006425786.1| hypothetical protein CICLE_v10025507mg [Citr... 411 e-112 ref|XP_002522382.1| conserved hypothetical protein [Ricinus comm... 405 e-110 ref|XP_004511989.1| PREDICTED: nuclear autoantigenic sperm prote... 395 e-107 ref|XP_002307236.1| hypothetical protein POPTR_0005s13910g [Popu... 392 e-106 gb|EYU39386.1| hypothetical protein MIMGU_mgv1a006172mg [Mimulus... 388 e-105 ref|XP_007156860.1| hypothetical protein PHAVU_002G023700g [Phas... 379 e-102 gb|EXB96705.1| Nuclear autoantigenic sperm protein [Morus notabi... 373 e-100 ref|XP_004511990.1| PREDICTED: nuclear autoantigenic sperm prote... 369 3e-99 gb|AFK47874.1| unknown [Medicago truncatula] 366 2e-98 ref|NP_568019.1| tetratricopeptide repeat domain-containing prot... 366 2e-98 ref|XP_006282585.1| hypothetical protein CARUB_v10004537mg, part... 361 5e-97 >ref|XP_007203878.1| hypothetical protein PRUPE_ppa005090mg [Prunus persica] gi|462399409|gb|EMJ05077.1| hypothetical protein PRUPE_ppa005090mg [Prunus persica] Length = 477 Score = 430 bits (1105), Expect = e-117 Identities = 251/468 (53%), Positives = 300/468 (64%), Gaps = 20/468 (4%) Frame = +3 Query: 165 KTLEIEVEETQASTEEPRIQDDQIGGESSTTN---------AEEDSGKSLEYSSELMDKG 317 +TL ETQ S E + Q G ES+ N ++ D KSLE++ ELM+KG Sbjct: 11 ETLPQNALETQRSNEATITEGAQGGAESTCNNDNAEASAVTSDGDREKSLEFADELMEKG 70 Query: 318 SEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVP 497 S+A+K DF EA ECFSR++EIRV HYGELAP+C AYYKYGCALLYKAQEE DPLG+VP Sbjct: 71 SKAIKDSDFGEATECFSRSLEIRVAHYGELAPQCVNAYYKYGCALLYKAQEETDPLGAVP 130 Query: 498 KKENSEKDGP--------VPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXXXXXXX 653 KKE G V GES+T S +DA+ D +H EG ++ Sbjct: 131 KKEGESHQGSAKVGSVKNVLNGESSTASASSDAEQDESLNHEEGAADEGASGEKDQEEEH 190 Query: 654 XXXXXXXXXXXXXXXXXXX-AWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFE 830 AWKMLDVARAI EK DTMEKVDILSAL EVALERED E Sbjct: 191 DDSDVEDLAEADEDETDLDLAWKMLDVARAIVEKHSGDTMEKVDILSALAEVALEREDIE 250 Query: 831 TSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQ 1010 TSLSDY ALSILERLVEPD+R IAELNFRICL LE+GSKP+EAI YCQKAIS+CKSR++ Sbjct: 251 TSLSDYQKALSILERLVEPDSRRIAELNFRICLCLEIGSKPEEAILYCQKAISICKSRVR 310 Query: 1011 RLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLE 1190 RL E ++ + S + Q V + ++SDS+V+DK+AEIETLTGLS DLEKKLE Sbjct: 311 RLMLESRSFSESTTSSAASVLEQGVTLSSTVTESDSTVTDKQAEIETLTGLSGDLEKKLE 370 Query: 1191 DLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTA 1370 DLQQL NP SI +E++ + +AK+ G+ + S SSS+MGA GGFDSPTVSTA Sbjct: 371 DLQQLASNP-KSILAEILGLASAKAKGTEKSESSAGQSSSRMGA-ADNIGGFDSPTVSTA 428 Query: 1371 HTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPS--DLVEKGEGS 1508 HTNG SGVTHL LM SG+ ESS KKP+ +KGEG+ Sbjct: 429 HTNGTSGVTHLGVVGRGVKRVLMHSGTAESSSAKKPALDSSEDKGEGN 476 >ref|XP_002272586.2| PREDICTED: uncharacterized protein LOC100248980 [Vitis vinifera] Length = 1123 Score = 427 bits (1097), Expect = e-116 Identities = 256/490 (52%), Positives = 318/490 (64%), Gaps = 29/490 (5%) Frame = +3 Query: 132 SIDSETLEKESKTLEIEVEETQA-------STEEPRIQDDQIGGESSTTN---------A 263 S+ E K+++ +VEE A S+ E I+ + G S+ N + Sbjct: 638 SMVEEVTVKQAENAATKVEEALAPTGQVAQSSNEATIESNAQGDTESSCNNNADTSARPS 697 Query: 264 EEDSGKSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYG 443 + D KSLEY+ ELM+KGS+A+K DF+EA +CFSRA+EIRV H+GELA EC YYKYG Sbjct: 698 DADREKSLEYAEELMEKGSKAVKESDFSEATDCFSRALEIRVAHHGELAFECVNTYYKYG 757 Query: 444 CALLYKAQEEADPLGSVPKKE-----NSEKDGPVPEG---ESATTSVINDAKHDGDSSHN 599 CALLYKAQEEADPL ++P KE NS KDG + ES+T SV +A+ DG S+ Sbjct: 758 CALLYKAQEEADPLATMPNKEAESHENSNKDGSMKNAVNDESSTASV--NAEQDGSSNDQ 815 Query: 600 EGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEK-KPDDTMEK 776 + ++ AWKMLDVARAI EK DTMEK Sbjct: 816 KVAADDDTNGKEQEEEDEESDDEDLAEADEDESDLDLAWKMLDVARAIVEKHSAADTMEK 875 Query: 777 VDILSALGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPD 956 VDILSAL EVALERED ETSLSDY ALSILERLVEPD+R IAELNFRICL LE+GSK Sbjct: 876 VDILSALAEVALEREDIETSLSDYQKALSILERLVEPDSRHIAELNFRICLCLEIGSKAQ 935 Query: 957 EAIPYCQKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKE 1136 EAIPYCQ+AIS+CKSR+QRL+ E+K+L+ S A P+ +Q Q+ N SQ+ +S+SDKE Sbjct: 936 EAIPYCQRAISICKSRVQRLSNEIKSLSESPAISPTPELDQSAQQSSNVSQAGNSISDKE 995 Query: 1137 AEIETLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQM 1316 +EIETL GL+S+LEKKLEDLQQLV NPT SI SE++ M++AK+ G+ + + SSQ+ Sbjct: 996 SEIETLNGLASELEKKLEDLQQLVSNPT-SILSEILGMMSAKARGADKGASPSVMGSSQI 1054 Query: 1317 GANTSGGGGFDSPTVSTA-HTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKP---SD 1484 G+ S GGFDSPTVSTA HTNG +GVTHL M+SG+ ESSPMKKP S Sbjct: 1055 GSANS-HGGFDSPTVSTASHTNGAAGVTHLGVVGRGVKRVSMNSGTAESSPMKKPPLDSS 1113 Query: 1485 LVEKGEGSGS 1514 L + +GS S Sbjct: 1114 LDKGDDGSAS 1123 Score = 137 bits (345), Expect = 2e-29 Identities = 83/174 (47%), Positives = 108/174 (62%), Gaps = 24/174 (13%) Frame = +3 Query: 144 ETLEKESKTLEIEVEETQA-------STEEPRIQDDQIGGESSTTN---------AEEDS 275 E K+++ +VEE A S+ E I+ + G S+ N ++ D Sbjct: 4 EVTVKQAENAATKVEEALAPTGQVAQSSNEATIESNAQGDTESSCNNNADTSARPSDADR 63 Query: 276 GKSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALL 455 KSLEY+ ELM+KGS+A+K DF+EA +CFSRA+EIRV H+GELA EC YYKYGCALL Sbjct: 64 EKSLEYAEELMEKGSKAVKEGDFSEATDCFSRALEIRVAHHGELAFECVNTYYKYGCALL 123 Query: 456 YKAQEEADPLGSVPKK-----ENSEKDGPVPEG---ESATTSVINDAKHDGDSS 593 YKAQEEADPL ++PKK ENS KDG + ES+T SV +A+ DG S+ Sbjct: 124 YKAQEEADPLATMPKKEAESHENSNKDGSMKNAVNDESSTASV--NAEQDGSSN 175 >ref|XP_006338254.1| PREDICTED: histone-binding protein N1/N2-like isoform X1 [Solanum tuberosum] Length = 471 Score = 426 bits (1096), Expect = e-116 Identities = 248/467 (53%), Positives = 305/467 (65%), Gaps = 6/467 (1%) Frame = +3 Query: 132 SIDSETLEKESKTLEIEVEE-TQASTEEPRIQDDQIGGESSTTNAEEDSGKSLEYSSELM 308 S+ S T E+ ++ +E Q TE ++ ESS ++ + KSLEY+ EL Sbjct: 8 SVTSPTAEQNQNSVNATIESGVQGGTESTCNNNNT--AESSVVTSDGNREKSLEYADELT 65 Query: 309 DKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLG 488 KGS+A K D+AEA EC+SRA+EIRV H+GELAPEC AYYKYGCALLYKAQ+EADPL Sbjct: 66 VKGSKASKDGDYAEAVECYSRALEIRVAHFGELAPECINAYYKYGCALLYKAQDEADPLV 125 Query: 489 SVPKKEN-----SEKDGPVPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXXXXXXX 653 S+PKK++ S +DG V S +S+ + A+ G S+ E + Sbjct: 126 SLPKKDSGSQQDSNRDGSVKSVVSCESSISSTAEPGGSSNGKEKVEDDAAEENEDEGDEE 185 Query: 654 XXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFET 833 AWK+LDVARAIAEK DTMEKVD+LSAL EVALERED ET Sbjct: 186 ESDDEDLAEGDEDETDLDLAWKLLDVARAIAEKHAGDTMEKVDVLSALAEVALEREDVET 245 Query: 834 SLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQR 1013 SLSDYL ALSILE LVEPD+R IAELNFRICL LE+GSK EAIPYCQKAIS CKSRLQR Sbjct: 246 SLSDYLKALSILEHLVEPDSRHIAELNFRICLCLEIGSKHQEAIPYCQKAISTCKSRLQR 305 Query: 1014 LTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLED 1193 LT E+K L+ S + + +Q Q+ + SQSD SVS KEAE+ETLT LS++LEKKLED Sbjct: 306 LTEEIKLLSESTERLATTNVDQIARQSSSTSQSD-SVSAKEAEVETLTDLSAELEKKLED 364 Query: 1194 LQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTAH 1373 LQQ + NP SSI S+++ MV+AK+ NA S A++SSQMGA TSG G FDSPTVSTAH Sbjct: 365 LQQCMSNP-SSILSDILGMVSAKARSMENADASVAVNSSQMGAGTSGSGSFDSPTVSTAH 423 Query: 1374 TNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGS 1514 TNG +G+THL +++ +TESSP KKP+ G G+ Sbjct: 424 TNGAAGITHLGVVGRGVKRVHLNT-TTESSPAKKPASEQPSDNGDGA 469 >ref|XP_004232075.1| PREDICTED: uncharacterized protein LOC101257719 [Solanum lycopersicum] Length = 472 Score = 425 bits (1093), Expect = e-116 Identities = 244/466 (52%), Positives = 301/466 (64%), Gaps = 5/466 (1%) Frame = +3 Query: 132 SIDSETLEKESKTLEIEVEETQASTEEPRIQDDQIGGESSTTNAEEDSGKSLEYSSELMD 311 S+ S T E+ ++ +E E ++ ESS ++ + KSLEY+ EL+ Sbjct: 8 SVTSPTAEQNQNSVNATIESGVQGGTESTCNNNNNNAESSAVTSDVNREKSLEYADELVV 67 Query: 312 KGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGS 491 KGS+A + D+ EA ECFSRA+EIRV H+GELAPEC AYYKYGCALLYKAQ+EADPL S Sbjct: 68 KGSKASEDGDYGEAVECFSRALEIRVAHFGELAPECINAYYKYGCALLYKAQDEADPLVS 127 Query: 492 VPKKEN-----SEKDGPVPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXXXXXXXX 656 +PKK++ S +DG V S +S+ + A+ G S+ E + Sbjct: 128 LPKKDSGSQQDSNRDGSVKSVVSCESSISSTAEPGGSSNGKEKVEDDAAEENEDEGDEEE 187 Query: 657 XXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFETS 836 AWK+LDVARAIAEK DTMEKVD+LSAL EVALERED ETS Sbjct: 188 SDDEDLAEGDEDETDLDLAWKLLDVARAIAEKHAGDTMEKVDVLSALAEVALEREDIETS 247 Query: 837 LSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQRL 1016 LSDYL ALSILERLVEPD+R IA LNFRICL LE+GSK EAIPYCQKAI CKSRLQRL Sbjct: 248 LSDYLKALSILERLVEPDSRHIAALNFRICLCLEIGSKHQEAIPYCQKAILTCKSRLQRL 307 Query: 1017 TGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLEDL 1196 T E+K L+ S + D +Q Q+ + SQSD SVS KEAE+ETLT LS++LEKKLEDL Sbjct: 308 TEEIKLLSESTERLATTDVDQIARQSSSISQSD-SVSAKEAEVETLTELSAELEKKLEDL 366 Query: 1197 QQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTAHT 1376 QQ + NP SSI S+++ MV+AK+ NA S A++SSQMG TSG G FDSPTVSTAHT Sbjct: 367 QQCMSNP-SSILSDILGMVSAKARSLENADASVAVNSSQMGVGTSGSGSFDSPTVSTAHT 425 Query: 1377 NGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGS 1514 NG +G+THL +++ +TESSP KKP+ G G+ Sbjct: 426 NGAAGITHLGVVGRGVKRVHLNT-TTESSPAKKPASEQPSDNGDGT 470 >ref|XP_006338255.1| PREDICTED: histone-binding protein N1/N2-like isoform X2 [Solanum tuberosum] Length = 469 Score = 422 bits (1086), Expect = e-115 Identities = 249/467 (53%), Positives = 304/467 (65%), Gaps = 6/467 (1%) Frame = +3 Query: 132 SIDSETLEKESKTLEIEVEE-TQASTEEPRIQDDQIGGESSTTNAEEDSGKSLEYSSELM 308 S+ S T E+ ++ +E Q TE ++ ESS ++ + KSLEY+ EL Sbjct: 8 SVTSPTAEQNQNSVNATIESGVQGGTESTCNNNNT--AESSVVTSDGNREKSLEYADELT 65 Query: 309 DKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLG 488 KGS+A K D+AEA EC+SRA+EIRV H+GELAPEC AYYKYGCALLYKAQ+EADPL Sbjct: 66 VKGSKASKDGDYAEAVECYSRALEIRVAHFGELAPECINAYYKYGCALLYKAQDEADPLV 125 Query: 489 SVPKKEN-----SEKDGPVPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXXXXXXX 653 S+PKK++ S +DG V S +S+ + A+ G S N E Sbjct: 126 SLPKKDSGSQQDSNRDGSVKSVVSCESSISSTAEPGGSS--NGKEKVEDDEENEDEGDEE 183 Query: 654 XXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFET 833 AWK+LDVARAIAEK DTMEKVD+LSAL EVALERED ET Sbjct: 184 ESDDEDLAEGDEDETDLDLAWKLLDVARAIAEKHAGDTMEKVDVLSALAEVALEREDVET 243 Query: 834 SLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQR 1013 SLSDYL ALSILE LVEPD+R IAELNFRICL LE+GSK EAIPYCQKAIS CKSRLQR Sbjct: 244 SLSDYLKALSILEHLVEPDSRHIAELNFRICLCLEIGSKHQEAIPYCQKAISTCKSRLQR 303 Query: 1014 LTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLED 1193 LT E+K L+ S + + +Q Q+ + SQSD SVS KEAE+ETLT LS++LEKKLED Sbjct: 304 LTEEIKLLSESTERLATTNVDQIARQSSSTSQSD-SVSAKEAEVETLTDLSAELEKKLED 362 Query: 1194 LQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTAH 1373 LQQ + NP SSI S+++ MV+AK+ NA S A++SSQMGA TSG G FDSPTVSTAH Sbjct: 363 LQQCMSNP-SSILSDILGMVSAKARSMENADASVAVNSSQMGAGTSGSGSFDSPTVSTAH 421 Query: 1374 TNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGS 1514 TNG +G+THL +++ +TESSP KKP+ G G+ Sbjct: 422 TNGAAGITHLGVVGRGVKRVHLNT-TTESSPAKKPASEQPSDNGDGA 467 >ref|XP_002310182.2| hypothetical protein POPTR_0007s12020g [Populus trichocarpa] gi|550334704|gb|EEE90632.2| hypothetical protein POPTR_0007s12020g [Populus trichocarpa] Length = 478 Score = 421 bits (1082), Expect = e-115 Identities = 255/471 (54%), Positives = 297/471 (63%), Gaps = 30/471 (6%) Frame = +3 Query: 189 ETQASTEEPRIQDDQIGGESSTTNAEE--DSGKSLEYSSELMDKGSEAMKARDFAEAAEC 362 ETQAS E D + N E D KSL+++ EL++KGS A+K DF+EA EC Sbjct: 12 ETQASVEVTTTSHDVTADSTCNDNNGETSDPEKSLDFAVELLEKGSTALKENDFSEAVEC 71 Query: 363 FSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVPKKE-----NSEKDGP 527 FSRA+EIRV H+GELA EC AYY YG ALLYKAQEEADPLG VPKK+ N +KD Sbjct: 72 FSRALEIRVLHHGELALECVNAYYHYGRALLYKAQEEADPLGMVPKKDSESKQNDDKDAA 131 Query: 528 ---VPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXX 698 V GES+TTS ++ DG +H E Sbjct: 132 CKNVLNGESSTTSASSNVGEDGGFNHPEASDGKDEEEDDEGSDDDDDLADADEEESDLDL 191 Query: 699 XXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFETSLSDYLNALSILERL 878 AWKMLDVARAIAEK P DTM+KVDILSAL EVALERED ETSLSDY +LSILERL Sbjct: 192 ----AWKMLDVARAIAEKHPGDTMDKVDILSALAEVALEREDIETSLSDYQKSLSILERL 247 Query: 879 VEPDNRCIAEL--------------------NFRICLVLEVGSKPDEAIPYCQKAISVCK 998 VEPD+R +AEL NFRICL LE+GSK EAIPYCQKAISVCK Sbjct: 248 VEPDSRHLAELYPFQGLHTPLSWDALTVQFRNFRICLCLEIGSKSQEAIPYCQKAISVCK 307 Query: 999 SRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLE 1178 +RLQRL E+K+ S T + + ++ V Q L+ Q+D SV+DKEAEIETLTGLS +LE Sbjct: 308 ARLQRLINELKSSGESATTPAISELDEGVQQ-LSNMQADKSVTDKEAEIETLTGLSGELE 366 Query: 1179 KKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPT 1358 KKLEDLQQLVLNP SI SE++ MVAAK G +V TA++SSQMG TS GGFDSPT Sbjct: 367 KKLEDLQQLVLNP-KSILSEILGMVAAKGKGGEKSVFPTAMNSSQMGTATS-SGGFDSPT 424 Query: 1359 VSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSG 1511 +STAHTNG SGVT L L S+GST SSP+KKP+ +G G Sbjct: 425 ISTAHTNGASGVTDLGVVGRGVKRVLTSTGSTGSSPVKKPTPDPSSDKGDG 475 >ref|XP_006466683.1| PREDICTED: histone-binding protein N1/N2-like [Citrus sinensis] Length = 479 Score = 415 bits (1066), Expect = e-113 Identities = 249/475 (52%), Positives = 303/475 (63%), Gaps = 16/475 (3%) Frame = +3 Query: 141 SETLEKESKTLEIEVEETQASTEEPRIQDDQIGGESSTTNAEEDSG--------KSLEYS 296 S+T+ +++ V TQAS E G ES+ N E SG K++E++ Sbjct: 7 SQTVAEQTAQPTETVGTTQASVEATMESVTVSGTESTCNNNCETSGAIADGEREKTVEFA 66 Query: 297 SELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEA 476 ELM+KG+ A+K D+ EAAECFSRA+EIRV HYGELA EC AYY+YG ALLYKAQEEA Sbjct: 67 DELMEKGTNALKESDYGEAAECFSRALEIRVSHYGELALECVNAYYQYGRALLYKAQEEA 126 Query: 477 DPLGSVPKKEN-----SEKDGPVPE---GESATTSVINDAKHDGDSSHNEGESNXXXXXX 632 DPL SVPKKE S+KD V GES+T SV + A+ G SS+N+ E+ Sbjct: 127 DPLVSVPKKEGDSQQGSDKDDSVKNAVNGESSTASVSSSAEQHG-SSNNQDEAADDAVPG 185 Query: 633 XXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVAL 812 AWKMLDVARAIAEK D+MEKVDILSAL EVAL Sbjct: 186 DNEEDEEGNDGENVAEADEDESDLDLAWKMLDVARAIAEKHWGDSMEKVDILSALAEVAL 245 Query: 813 EREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISV 992 ERED ETSLSDY AL+ILER+VEPD+R IAELNFRICL LE+GSKP EAIPYC KAISV Sbjct: 246 EREDIETSLSDYQKALTILERMVEPDSRHIAELNFRICLCLEIGSKPQEAIPYCHKAISV 305 Query: 993 CKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSD 1172 CKSR+QRL EVK+L S + + + + Q+ + Q+D ++DKEAEIETL+GL D Sbjct: 306 CKSRVQRLLNEVKSLGESATSSAPAELDDGIQQSSSELQNDKLLTDKEAEIETLSGLCGD 365 Query: 1173 LEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDS 1352 LEKKLEDLQQ+ LNP SI SE++ + +AK+ G + S LSSS+MG S G FDS Sbjct: 366 LEKKLEDLQQVALNP-KSILSEILGIASAKAKGDEKSSTSAVLSSSRMGTAHS-DGDFDS 423 Query: 1353 PTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGSL 1517 PTVSTAHT+G +GVTHL MS+GS ES P KK + +G GS+ Sbjct: 424 PTVSTAHTSGAAGVTHLGVVGRGVKRVSMSTGSAESRPSKKSTSDPSSDKGDGSV 478 >ref|XP_007047098.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao] gi|508699359|gb|EOX91255.1| Tetratricopeptide repeat-like superfamily protein [Theobroma cacao] Length = 491 Score = 413 bits (1062), Expect = e-112 Identities = 251/492 (51%), Positives = 310/492 (63%), Gaps = 21/492 (4%) Frame = +3 Query: 102 AMAASDGSLKSIDSETLEKESKTLEI-EVEETQASTEEPRIQDDQIGGESSTTN------ 260 A AAS+ S+ T+ +++ L++ E TQ S E I+ GG ST N Sbjct: 8 APAASEASV------TMTEQTPDLKVGETLGTQGSIEGT-IESAVQGGTESTCNNNDNAE 60 Query: 261 -----AEEDSGKSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSF 425 ++ D K+LE++ EL +KGS+A K DFAEAA+CFSRA+EIRV H+GELA EC Sbjct: 61 SLGLASDVDREKTLEFADELAEKGSKAFKENDFAEAADCFSRALEIRVAHHGELAIECLK 120 Query: 426 AYYKYGCALLYKAQEEADPLGSVPKKENSEKDGPVPE--------GESATTSVINDAKHD 581 AYY YG ALLYKAQEE DPL SVPKKE + G E GES+ SV +DAK D Sbjct: 121 AYYLYGRALLYKAQEETDPLVSVPKKEGETQQGSNKEESVKSAVNGESSVASVSSDAKQD 180 Query: 582 GDSSHNEGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKP- 758 S+ +EG + AWKMLDVARAIA+K+ Sbjct: 181 ESSTQHEGATKDGGRDEEEEDEDSDTDDVAEADEDDSDLDL--AWKMLDVARAIADKQQL 238 Query: 759 DDTMEKVDILSALGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLE 938 DTMEKVDILSAL EVALERED E+SLSDY ALSIL++LVEPD+R IAELNFRIC+ LE Sbjct: 239 GDTMEKVDILSALAEVALEREDIESSLSDYQKALSILQQLVEPDHRQIAELNFRICMCLE 298 Query: 939 VGSKPDEAIPYCQKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDS 1118 +GSKP EAIPYCQKAISVC+SRL+RLT EVK+ GS + + + V Q+ +GSQ+ Sbjct: 299 IGSKPQEAIPYCQKAISVCRSRLERLTNEVKSSAGSALSSAASELDDGVQQSSDGSQTVK 358 Query: 1119 SVSDKEAEIETLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTA 1298 S++DKEAEI TL GL+ DLEKKLEDLQQL NP SI +E++ MV+A+ + TA Sbjct: 359 SITDKEAEITTLAGLAEDLEKKLEDLQQLASNP-KSIIAELLGMVSARGRDGEKSAAPTA 417 Query: 1299 LSSSQMGANTSGGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKP 1478 +SSS++ A + G FDSPTVSTAH+NG +GVTHL LMS+GS ESS KKP Sbjct: 418 VSSSRI-ATANSNGNFDSPTVSTAHSNGTAGVTHLGVIGRGVKRVLMSTGSVESSSAKKP 476 Query: 1479 SDLVEKGEGSGS 1514 + G GS Sbjct: 477 AIEPSSDNGDGS 488 >ref|XP_004300573.1| PREDICTED: uncharacterized protein LOC101312834 [Fragaria vesca subsp. vesca] Length = 482 Score = 412 bits (1059), Expect = e-112 Identities = 237/460 (51%), Positives = 304/460 (66%), Gaps = 12/460 (2%) Frame = +3 Query: 168 TLEIEVEE--TQASTEEPRIQDDQIGGESSTTNAEEDSGKSLEYSSELMDKGSEAMKARD 341 T+EI EE T+++ +D+ G + T++++ D K+LE+++ELM+KG++AMK D Sbjct: 26 TIEIATEEGGTESTCNNTTTINDKPEGSAVTSSSDGDREKTLEFANELMEKGNKAMKEDD 85 Query: 342 FAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVPKKENSEKD 521 F EA+ECFSR++EIRV H+GELAP+C YYKYGCALLYKAQEEADPLG+VPKKE + Sbjct: 86 FGEASECFSRSLEIRVAHFGELAPQCVKTYYKYGCALLYKAQEEADPLGNVPKKEGESQQ 145 Query: 522 GP--------VPEGESATTSVINDAKHDGDSSHNEGE-SNXXXXXXXXXXXXXXXXXXXX 674 V GES+T SV ++A+ EG + Sbjct: 146 ESANAGAAKNVLNGESSTASVSSNAEQVASPILQEGALDDGVSSGKDQEEDNEDSDVEEL 205 Query: 675 XXXXXXXXXXXXAWKMLDVARAIAEKKPD-DTMEKVDILSALGEVALEREDFETSLSDYL 851 AWKMLDVARAI EK+ DTMEKV++LSAL EVALEREDFETSLSDY Sbjct: 206 AEGDEDETDLDLAWKMLDVARAIIEKQNSGDTMEKVEVLSALAEVALEREDFETSLSDYQ 265 Query: 852 NALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQRLTGEVK 1031 NALSILERLVEPD+R IAELNFRICL LE+GSKPDEAI YCQKAISVCKSR++RL E K Sbjct: 266 NALSILERLVEPDSRHIAELNFRICLCLEIGSKPDEAISYCQKAISVCKSRVRRLMIESK 325 Query: 1032 NLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLEDLQQLVL 1211 + +GS A+ + +Q V Q+ + ++SD++++DK+AEI+ LT L DLEKKLEDLQQLV Sbjct: 326 SFSGSTASSSASE-DQGVQQSSDATKSDNALTDKQAEIQNLTELCGDLEKKLEDLQQLVT 384 Query: 1212 NPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTAHTNGNSG 1391 P S +++M MV+AK+ + S + SSQMG GGFDSPT+STAHTNG+SG Sbjct: 385 IPRS--IADLMGMVSAKAKAVEKSASSAVVGSSQMG-TADNNGGFDSPTISTAHTNGSSG 441 Query: 1392 VTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSG 1511 VTHL L +SG+ E++P KKP+ + G G Sbjct: 442 VTHLGVVGRGVKRTLTNSGTAETNPGKKPAIDPSESNGEG 481 >ref|XP_006425786.1| hypothetical protein CICLE_v10025507mg [Citrus clementina] gi|557527776|gb|ESR39026.1| hypothetical protein CICLE_v10025507mg [Citrus clementina] Length = 478 Score = 411 bits (1056), Expect = e-112 Identities = 249/475 (52%), Positives = 303/475 (63%), Gaps = 16/475 (3%) Frame = +3 Query: 141 SETLEKESKTLEIEVEETQASTEEPRIQDDQIGGESSTTNAEEDSG--------KSLEYS 296 S+T+ +++ V TQAS E G ES+ N E SG K++E++ Sbjct: 7 SQTVAEQTAQPTETVGTTQASVEATMESVTVSGTESTCNNNCETSGAIADGEREKTVEFA 66 Query: 297 SELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEA 476 ELM+KG+ A+K D+ EAAECFSRA+EIRV HYGELA EC AYY+YG ALLYKAQEEA Sbjct: 67 DELMEKGTNALKESDYGEAAECFSRALEIRVSHYGELALECVNAYYQYGRALLYKAQEEA 126 Query: 477 DPLGSVPKKEN-----SEKDGPVPE---GESATTSVINDAKHDGDSSHNEGESNXXXXXX 632 DPL SVPKKE S+KD V GES+T SV + A+ G SS+N+ E+ Sbjct: 127 DPLVSVPKKEGDSQQGSDKDDSVKNAVNGESSTASVSSSAEQHG-SSNNQDEA-ADDVPG 184 Query: 633 XXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVAL 812 AWKMLDVARAIAEK D+MEKVDILSAL EVAL Sbjct: 185 DNEEDEEGNDGENVAEADEDESDLDLAWKMLDVARAIAEKHWGDSMEKVDILSALAEVAL 244 Query: 813 EREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISV 992 ERED ETSLSDY AL+ILER+VEPD+R IAELNFRICL LE+GSKP EAIPYC KAISV Sbjct: 245 EREDIETSLSDYQKALTILERMVEPDSRHIAELNFRICLCLEIGSKPQEAIPYCHKAISV 304 Query: 993 CKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSD 1172 CKSR+QRL EVK+L S + + + + Q+ + Q+D ++DKEAEIETL+GL D Sbjct: 305 CKSRVQRLLNEVKSLGESATSSAPAELDDGIQQSSSELQNDKLLTDKEAEIETLSGLCGD 364 Query: 1173 LEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDS 1352 LEKKLEDLQQ+ LNP SI SE++ + +AK+ G + S LSSS+MG S G FDS Sbjct: 365 LEKKLEDLQQVALNP-KSILSEILGIASAKAKGDEKSSTSAVLSSSRMGTAHS-DGDFDS 422 Query: 1353 PTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGSL 1517 PTVSTAHT+G +GVTHL MS+GS ES P KK + +G GS+ Sbjct: 423 PTVSTAHTSGAAGVTHLGVVGRGVKRVSMSTGSAESRPSKKSTSDPSSDKGDGSV 477 >ref|XP_002522382.1| conserved hypothetical protein [Ricinus communis] gi|223538460|gb|EEF40066.1| conserved hypothetical protein [Ricinus communis] Length = 459 Score = 405 bits (1042), Expect = e-110 Identities = 238/448 (53%), Positives = 292/448 (65%), Gaps = 17/448 (3%) Frame = +3 Query: 189 ETQASTEEPRIQDDQIGGESSTTNAEED------SGKSLEYSSELMDKGSEAMKARDFAE 350 ET+ S E + Q GG ST N S +SLE + E +G++A+ D+ E Sbjct: 2 ETETSNEATIESNAQGGGTESTCNNNNGEPSTLTSAESLELAVEFTQRGTKALNDNDYTE 61 Query: 351 AAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVPK-----KENSE 515 AA+CFSRA+EIRV +YGELA EC AYY+YG ALLYKAQEEADPL +VPK K+ S+ Sbjct: 62 AADCFSRALEIRVSYYGELALECLSAYYQYGRALLYKAQEEADPLATVPKRDAESKQESD 121 Query: 516 KDGPVP---EGESATTSVIN-DAKHDG--DSSHNEGESNXXXXXXXXXXXXXXXXXXXXX 677 +DG V + ES+T S ++ + + DG DSS+ +G ++ Sbjct: 122 QDGSVKSAMKAESSTASAVSSNTEEDGNLDSSNQQGVTDDASGRKDQEEDGEVSDDEDLA 181 Query: 678 XXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFETSLSDYLNA 857 AWKMLDVARAIAEK DTM+KVD+LSAL EVALERED ETSLSDY A Sbjct: 182 EADEDESDLDLAWKMLDVARAIAEKHSGDTMDKVDVLSALAEVALEREDIETSLSDYEKA 241 Query: 858 LSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQRLTGEVKNL 1037 L ILERLVEPD+R +AELNFRICL LE+GSKP EAIPYCQ+AIS+CKSRLQRL EVK+ Sbjct: 242 LLILERLVEPDSRHLAELNFRICLCLEIGSKPQEAIPYCQRAISICKSRLQRLMNEVKDS 301 Query: 1038 TGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLEDLQQLVLNP 1217 + S + + + V Q+ NGSQ D SV+DKEAEIETLTGLS DLEKKLEDLQQL +NP Sbjct: 302 SESAIASAVSELDDGVQQSSNGSQIDVSVTDKEAEIETLTGLSGDLEKKLEDLQQLAVNP 361 Query: 1218 TSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTAHTNGNSGVT 1397 SI SE++ MV+AK+ G+ + + SSQ+ A G FDSPTVSTAHTNG + VT Sbjct: 362 -KSILSEILGMVSAKAKGAEKSASPAEVKSSQI-AIAGSSGAFDSPTVSTAHTNG-AAVT 418 Query: 1398 HLXXXXXXXXXXLMSSGSTESSPMKKPS 1481 HL +MS+ ST SSP KKP+ Sbjct: 419 HLGVVGRGVKRVVMSTSSTGSSPAKKPA 446 >ref|XP_004511989.1| PREDICTED: nuclear autoantigenic sperm protein-like isoform X1 [Cicer arietinum] Length = 476 Score = 395 bits (1014), Expect = e-107 Identities = 238/481 (49%), Positives = 307/481 (63%), Gaps = 11/481 (2%) Frame = +3 Query: 102 AMAASDGSLKSID-SETLEKESKTLEIEVEETQASTEEPRIQDDQIGGESSTTNAEEDSG 278 A AAS+ S+ +ET+ TL + + T E + D E+S + D Sbjct: 5 AHAASETSVTMPPPTETVTVVDGTLNRDEHDKNGLTTESAMVSD---AENSGLASGGDDR 61 Query: 279 KSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLY 458 K+L+ + ELMDK ++AMK D+ EAA+ +SRA+EIRVGHYGELAPEC YYKYGCALLY Sbjct: 62 KALDLADELMDKENKAMKDNDYGEAADNYSRALEIRVGHYGELAPECVHTYYKYGCALLY 121 Query: 459 KAQEEADPLGSVPKKEN-----SEKDGPVPEG---ESATTSVINDAKHDGDSSHNEGESN 614 KAQEEADPLG VPKK++ SEKDG V ES+T S N+A+ D S++ E E Sbjct: 122 KAQEEADPLGVVPKKQDVPQHGSEKDGSVKSAMNAESSTASFPNNAEQDVTSNNRESEVV 181 Query: 615 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSA 794 AWKMLD+ARAI EK+ +TME+VDILS Sbjct: 182 NATSGKNDQEDGGDSDAEDLAEGDEDESDLDLAWKMLDIARAIVEKQSVNTMEQVDILST 241 Query: 795 LGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYC 974 LG+VALEREDFETSLSDY ALSILE+LVEPD+R IA+LNFRICL LEVGS+P+EA+ YC Sbjct: 242 LGDVALEREDFETSLSDYQKALSILEQLVEPDDRKIADLNFRICLCLEVGSRPEEAVAYC 301 Query: 975 QKAISVCKSRLQRLTGEVKNLTG--SEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIE 1148 +KA SVCK+RL RLT EVK+ + S A+ + D +Q GS+S++S+ DK+AEIE Sbjct: 302 EKATSVCKARLHRLTNEVKSFSDLTSSASEIKRD-----EQTYPGSESNNSIVDKQAEIE 356 Query: 1149 TLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANT 1328 TLTGLSS+LEKKL+DLQQL+ NP SI +E++ + AAK+ + + G S+ AN+ Sbjct: 357 TLTGLSSELEKKLDDLQQLISNP-KSILAEILGIAAAKAGNGKESSGGRVSSTQLATANS 415 Query: 1329 SGGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGS 1508 S GGFDSPT+STAHTNG++ VTHL ++ E+S KKP+ +G+G Sbjct: 416 S--GGFDSPTISTAHTNGSAAVTHL-GVVGRGIKRASNASVEEASIAKKPALETTEGKGD 472 Query: 1509 G 1511 G Sbjct: 473 G 473 >ref|XP_002307236.1| hypothetical protein POPTR_0005s13910g [Populus trichocarpa] gi|566171231|ref|XP_006383290.1| hypothetical protein POPTR_0005s13910g [Populus trichocarpa] gi|222856685|gb|EEE94232.1| hypothetical protein POPTR_0005s13910g [Populus trichocarpa] gi|550338881|gb|ERP61087.1| hypothetical protein POPTR_0005s13910g [Populus trichocarpa] Length = 470 Score = 392 bits (1008), Expect = e-106 Identities = 239/472 (50%), Positives = 299/472 (63%), Gaps = 15/472 (3%) Frame = +3 Query: 141 SETLEKESKTLEIEVEETQASTEEPRIQDDQIGGESSTTNAEEDSG------KSLEYSSE 302 SE+ E KT E + + +T + GG + +T ++++G KSL+++ E Sbjct: 3 SESSVTEPKTAPKENQTSVEATIKGSTTTSSQGGAADSTCNDDNNGETSDPRKSLDFAVE 62 Query: 303 LMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADP 482 L +KG+ A+K DF+EA ECFSRA+EIRV H+GELA EC AYY YG ALLYKAQEEADP Sbjct: 63 LSEKGTNALKENDFSEAVECFSRALEIRVLHHGELALECVNAYYLYGRALLYKAQEEADP 122 Query: 483 LGSVPKKENSEK-----DGP---VPEGESATTSVINDAKHDGDSSHNEGESNXXXXXXXX 638 L VPKK++ K DG GE ++ SV ++ + S+H EG + Sbjct: 123 LAMVPKKDSESKQDDNKDGASRNFVNGEFSSASVSSNVEEGRGSNHPEGAAGGEEEEDDD 182 Query: 639 XXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALER 818 AWKMLDVARAIAEK DDTM+KVDILSAL EVALER Sbjct: 183 EGSDDEDLAEADEEESDLDL----AWKMLDVARAIAEKHLDDTMDKVDILSALAEVALER 238 Query: 819 EDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCK 998 ED ETSLSDY ALSILERLVEPD+R +AELNFRICL LE+GSKP EAIPYCQ+AISVCK Sbjct: 239 EDIETSLSDYQKALSILERLVEPDSRHLAELNFRICLCLEIGSKPQEAIPYCQEAISVCK 298 Query: 999 SRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLE 1178 +RLQRL EVK+ T S + + + ++ V Q+ N Q+D SV+DKEAEIETL+GLS++LE Sbjct: 299 ARLQRLIKEVKSSTESATSSAVSELDEGVQQSSN-VQADKSVTDKEAEIETLSGLSAELE 357 Query: 1179 KKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPT 1358 KKLEDLQQLVLNP SI +E++ MV+ K+ G + SSSQ+ S G FDSPT Sbjct: 358 KKLEDLQQLVLNP-KSILAEILGMVSDKAKGGEKSASPNLTSSSQLVVANS-SGSFDSPT 415 Query: 1359 VSTAHTNGNSGVTHLXXXXXXXXXXLMSSGST-ESSPMKKPSDLVEKGEGSG 1511 +S+AHTNG GVT L L S+GS SS +KKP+ +G G Sbjct: 416 ISSAHTNGVLGVTDLGVAGRGVKRVLTSTGSVGSSSAVKKPTPDPSSDKGDG 467 >gb|EYU39386.1| hypothetical protein MIMGU_mgv1a006172mg [Mimulus guttatus] Length = 454 Score = 388 bits (996), Expect = e-105 Identities = 227/430 (52%), Positives = 273/430 (63%), Gaps = 7/430 (1%) Frame = +3 Query: 246 SSTTNAEEDSGKSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSF 425 SS ++ KSLE + ELM +GS+A K RD+AEA +C+SRA+EIRV H+GELAPEC Sbjct: 50 SSEAIQNDEVEKSLEQADELMARGSKAAKERDYAEATDCYSRALEIRVAHFGELAPECVN 109 Query: 426 AYYKYGCALLYKAQEEADPLGSVPKK-----ENSEKDGPVPEGESATTSV--INDAKHDG 584 AYYK+GCALLYKAQEE DP G++PKK E S + G V E+ +SV ++D Sbjct: 110 AYYKFGCALLYKAQEETDPFGAMPKKDAVSQEGSTRGGSVKNTETGESSVASVSDNAEKC 169 Query: 585 DSSHNEGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDD 764 +++ + AWKMLDVARAI EK DD Sbjct: 170 EATTCSDNTPDETVEGKGEEGDEESDAEELADGEEDESDLDLAWKMLDVARAIVEKLSDD 229 Query: 765 TMEKVDILSALGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVG 944 TMEKVDILSAL EVALERED ETSLSDYL ALSIL RLVEPD+R IAELNFRICL LE+G Sbjct: 230 TMEKVDILSALAEVALEREDVETSLSDYLKALSILARLVEPDSRLIAELNFRICLCLEIG 289 Query: 945 SKPDEAIPYCQKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSV 1124 SKP+EAIPYC+ AI+VCKSR+QRLT EVKNL+G + Sbjct: 290 SKPEEAIPYCENAITVCKSRVQRLTDEVKNLSG------------------------EPL 325 Query: 1125 SDKEAEIETLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALS 1304 ++KEAEIETLTGLS +LEKKLEDLQQLVLNP SI ++++ +++AK+ + A S +S Sbjct: 326 AEKEAEIETLTGLSGELEKKLEDLQQLVLNP-KSILADILGIMSAKAKANEKATESVGMS 384 Query: 1305 SSQMGANTSGGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSD 1484 SSQMG G DSPTVSTAHTNG S VTHL +MSS +T+SSP KKPS Sbjct: 385 SSQMGI-AGTAGSTDSPTVSTAHTNGASAVTHLGVVGRGVKRVVMSS-TTQSSPSKKPSV 442 Query: 1485 LVEKGEGSGS 1514 G GS Sbjct: 443 DPSSNHGDGS 452 >ref|XP_007156860.1| hypothetical protein PHAVU_002G023700g [Phaseolus vulgaris] gi|561030275|gb|ESW28854.1| hypothetical protein PHAVU_002G023700g [Phaseolus vulgaris] Length = 476 Score = 379 bits (972), Expect = e-102 Identities = 230/438 (52%), Positives = 283/438 (64%), Gaps = 13/438 (2%) Frame = +3 Query: 237 GGESSTTNAE---EDSGKSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGEL 407 G E S +NAE D KSLE ++ELM+ G++A+K DF EAA+ FSRA+EIRV HYGEL Sbjct: 46 GVEESVSNAEASVSDPQKSLELANELMEIGNQAIKENDFGEAADNFSRALEIRVSHYGEL 105 Query: 408 APECSFAYYKYGCALLYKAQEEADPLGSVPKKEN-----SEKDGPVPEG---ESATTSVI 563 APEC YYKYGCALLYKAQEEADPL VPKKE+ S K+GPV ES+T S Sbjct: 106 APECVHTYYKYGCALLYKAQEEADPLADVPKKEDGSQLGSTKEGPVKSSVNAESSTASFS 165 Query: 564 NDAKHDGDSSHNEGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAI 743 ++A D S+ + AWKMLD+ARAI Sbjct: 166 SNAGQDVTSTDQGEAVDDGSTKNDQEEGDEDSEAEDLAEADEDETDLDLAWKMLDIARAI 225 Query: 744 AEKKPDDTMEKVDILSALGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRI 923 EK+ +T+E+VDILS L +VALEREDFETSLSDY ALSILE+LVEPD+R IA+LNFRI Sbjct: 226 VEKQSVNTIEQVDILSTLADVALEREDFETSLSDYQKALSILEQLVEPDDRKIADLNFRI 285 Query: 924 CLVLEVGSKPDEAIPYCQKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNG 1103 CL LEVGSKP EAI YCQKA SVCK+RLQRLT EVK+ + + D Q V Sbjct: 286 CLCLEVGSKPQEAIAYCQKATSVCKARLQRLTKEVKSCSDLTSA---SDLTQDV-STCPS 341 Query: 1104 SQSDSSVSDKEAEIETLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKS-TGSRN 1280 S S++S DK++EIETL GLSS+LEKKLEDLQQLV NP SI +E++ + AAK+ G + Sbjct: 342 SDSNNSSMDKQSEIETLKGLSSELEKKLEDLQQLVSNP-KSILAEILGIAAAKAGNGKES 400 Query: 1281 AVGSTALSSSQMGANTSGGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTES 1460 ++G +SSSQ+ A GGFDSP++STAHTNG+ GVTHL +S + S Sbjct: 401 SLG--MVSSSQL-ATVKNNGGFDSPSISTAHTNGSGGVTHLGVVGRGVKRASNASPAEGS 457 Query: 1461 SPMKKPSDLVE-KGEGSG 1511 +P K + E KG+G+G Sbjct: 458 TPKKPALESTEDKGDGNG 475 >gb|EXB96705.1| Nuclear autoantigenic sperm protein [Morus notabilis] Length = 485 Score = 373 bits (958), Expect = e-100 Identities = 229/470 (48%), Positives = 282/470 (60%), Gaps = 39/470 (8%) Frame = +3 Query: 189 ETQASTEEPRIQDDQIGGESSTT---------NAEEDSGKSLEYSSELMDKGSEAMKARD 341 ETQAS E ++ +GG +T ++ + KSLE + +LM+KGS+A+K D Sbjct: 27 ETQASNEAT-MESGVLGGTMESTCNNDNVSAATSDAEGEKSLELADQLMEKGSQALKDSD 85 Query: 342 FAEAAECFSRAV-------------------EIRVGHYGELAPECSFAYYKYGCALLYKA 464 + EAAE FSRA+ +IRV YGELAPEC +YYKYGCALLYKA Sbjct: 86 YGEAAEFFSRALVFFLKTFCVKEYLLVVFDWKIRVACYGELAPECVNSYYKYGCALLYKA 145 Query: 465 QEEADPLGSVPKKEN-----SEKDGPVPEGESATTSVINDAKHDGDS-----SHNEGESN 614 QEEADPLG+VPKKE S K GPV + +S + +K GD ++ E +N Sbjct: 146 QEEADPLGNVPKKEGESQHESAKYGPVKSVTNGESSTASASKKVGDEETVTVNNPEAVAN 205 Query: 615 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSA 794 AWKMLDVAR I EK+ DTMEKV ILSA Sbjct: 206 DAFNGNDQEEGDDNSDDEDLAEGDEDESDLDLAWKMLDVARVIVEKQVSDTMEKVGILSA 265 Query: 795 LGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYC 974 L EVA+ERED ETSLSDY ALSILERLVEPD+R IAELNFRICL LE+GSKP+EAIPYC Sbjct: 266 LAEVAMEREDIETSLSDYQKALSILERLVEPDSRHIAELNFRICLCLELGSKPEEAIPYC 325 Query: 975 QKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETL 1154 QKAISVC++R +RL E K+L S S + +S +DK+ EIETL Sbjct: 326 QKAISVCQARTKRLINEAKSLAES-----------------TSSSTSASDTDKQEEIETL 368 Query: 1155 TGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTS- 1331 TGL+SDLEKKLEDLQQLV NP SI +E++++ AAKS G S+AL +T+ Sbjct: 369 TGLASDLEKKLEDLQQLVSNP-KSILAEILEIAAAKSKGGEIGASSSALPHKSSPMDTAK 427 Query: 1332 GGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPS 1481 GGFD+PT STA T+G+SGVTHL +M+SG+ ESSP KKP+ Sbjct: 428 SNGGFDTPTASTAQTSGSSGVTHLGVVGRGVKRVVMNSGTAESSPPKKPA 477 >ref|XP_004511990.1| PREDICTED: nuclear autoantigenic sperm protein-like isoform X2 [Cicer arietinum] Length = 404 Score = 369 bits (947), Expect = 3e-99 Identities = 216/408 (52%), Positives = 272/408 (66%), Gaps = 12/408 (2%) Frame = +3 Query: 327 MKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVPKKE 506 MK D+ EAA+ +SRA+EIRVGHYGELAPEC YYKYGCALLYKAQEEADPLG VPKK+ Sbjct: 1 MKDNDYGEAADNYSRALEIRVGHYGELAPECVHTYYKYGCALLYKAQEEADPLGVVPKKQ 60 Query: 507 N-----SEKDGPVPEG---ESATTSVINDAKHDGDSSHNEGE--SNXXXXXXXXXXXXXX 656 + SEKDG V ES+T S N+A+ D S++ E E + Sbjct: 61 DVPQHGSEKDGSVKSAMNAESSTASFPNNAEQDVTSNNRESEVVNGEATSGKNDQEDGGD 120 Query: 657 XXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFETS 836 AWKMLD+ARAI EK+ +TME+VDILS LG+VALEREDFETS Sbjct: 121 SDAEDLAEGDEDESDLDLAWKMLDIARAIVEKQSVNTMEQVDILSTLGDVALEREDFETS 180 Query: 837 LSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQRL 1016 LSDY ALSILE+LVEPD+R IA+LNFRICL LEVGS+P+EA+ YC+KA SVCK+RL RL Sbjct: 181 LSDYQKALSILEQLVEPDDRKIADLNFRICLCLEVGSRPEEAVAYCEKATSVCKARLHRL 240 Query: 1017 TGEVKNLTG--SEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLE 1190 T EVK+ + S A+ + D +Q GS+S++S+ DK+AEIETLTGLSS+LEKKL+ Sbjct: 241 TNEVKSFSDLTSSASEIKRD-----EQTYPGSESNNSIVDKQAEIETLTGLSSELEKKLD 295 Query: 1191 DLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVGSTALSSSQMGANTSGGGGFDSPTVSTA 1370 DLQQL+ NP SI +E++ + AAK+ + + G S+ AN+S GGFDSPT+STA Sbjct: 296 DLQQLISNP-KSILAEILGIAAAKAGNGKESSGGRVSSTQLATANSS--GGFDSPTISTA 352 Query: 1371 HTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPSDLVEKGEGSGS 1514 HTNG++ VTHL ++ E+S KKP+ +E EG G+ Sbjct: 353 HTNGSAAVTHL-GVVGRGIKRASNASVEEASIAKKPA--LETTEGKGN 397 >gb|AFK47874.1| unknown [Medicago truncatula] Length = 455 Score = 366 bits (940), Expect = 2e-98 Identities = 221/437 (50%), Positives = 282/437 (64%), Gaps = 13/437 (2%) Frame = +3 Query: 243 ESSTTNAEEDSG---KSLEYSSELMDKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAP 413 ES+ T+A S KSL+ ++ELM+KG++AMK DF EAA+ +SRA+EIRV HYGELAP Sbjct: 43 ESAATSAPASSTEGQKSLDLANELMEKGNKAMKENDFGEAADNYSRALEIRVAHYGELAP 102 Query: 414 ECSFAYYKYGCALLYKAQEEADPLGSVPKKEN-----SEKDGPVP---EGESATTSVIND 569 EC YYKYGCALLYKAQEEADPLG+VPKK+ S+KD PV ES+T S ++ Sbjct: 103 ECVHTYYKYGCALLYKAQEEADPLGAVPKKQEGSPHGSDKDEPVKGAVNAESSTASFASN 162 Query: 570 AKHDGDSSHNEGESNXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAE 749 + D S++ E E + AWKMLDVARAI E Sbjct: 163 VEQDVTSNNQESEVDNVSGKNDQEDDEDSDTEELAEGDEDESDLDL-AWKMLDVARAIVE 221 Query: 750 KKPDDTMEKVDILSALGEVALEREDFETSLSDYLNALSILERLVEPDNRCIAELNFRICL 929 K+ TME+VDILS L +VALEREDFETSLSDY ALSILE+LVEPD+R IA++NFRICL Sbjct: 222 KQSVHTMEQVDILSTLADVALEREDFETSLSDYQKALSILEQLVEPDDRNIADINFRICL 281 Query: 930 VLEVGSKPDEAIPYCQKAISVCKSRLQRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQ 1109 LEV SKP+EA+ Y +KA SVCK+R+ RLT EVK+ SE+T S+ Sbjct: 282 CLEVSSKPEEAVAYLEKATSVCKARIDRLTNEVKSF--SEST---------------SSE 324 Query: 1110 SDSSVSDKEAEIETLTGLSSDLEKKLEDLQQLVLNPTSSIFSEVMKMVAAKSTGSRNAVG 1289 +++S++DK+AEIE L GLSS+LEKKLEDLQQL+ NP SI +E++ A+ GS Sbjct: 325 TNNSIADKQAEIEILAGLSSELEKKLEDLQQLIANP-KSILAEIL---ASAKAGSGKEPS 380 Query: 1290 STALSSSQMGANTSGGGGFDSPTVSTAHTNGNSGVTHLXXXXXXXXXXLMSSGSTESSPM 1469 +SSSQ+ A + G FDSPT+STAHTNG++GVTHL ++ +TE+S Sbjct: 381 LARVSSSQL-ATENSSGSFDSPTISTAHTNGSAGVTHLGVVGRGVKRS-SNTSTTEASIS 438 Query: 1470 KKPS--DLVEKGEGSGS 1514 KKP+ EKG+G + Sbjct: 439 KKPALETTEEKGDGGNA 455 >ref|NP_568019.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] gi|13877853|gb|AAK44004.1|AF370189_1 unknown protein [Arabidopsis thaliana] gi|17065596|gb|AAL33778.1| unknown protein [Arabidopsis thaliana] gi|332661368|gb|AEE86768.1| tetratricopeptide repeat domain-containing protein [Arabidopsis thaliana] Length = 492 Score = 366 bits (940), Expect = 2e-98 Identities = 221/471 (46%), Positives = 286/471 (60%), Gaps = 13/471 (2%) Frame = +3 Query: 141 SETLEKESKTLEIEVEET-QASTEEPRIQD---DQIGGESSTTNAEEDSGKSLEYSSELM 308 ++TLE ++E VE Q TE D + ++T +E+ K+LE++ EL Sbjct: 25 AQTLEPNLASIEATVESVVQGGTESTCNNDANNNNAADSAATEVCDEEREKTLEFAEELT 84 Query: 309 DKGSEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLG 488 +KGS +K DFAEA +CFSRA+EIRV HYGEL EC AYY+YG ALL KAQ EADPLG Sbjct: 85 EKGSVFLKENDFAEAVDCFSRALEIRVAHYGELDAECINAYYRYGLALLAKAQAEADPLG 144 Query: 489 SVPKKENSEKDGPVPEGESATTSVIN-DAKHDGDSSHNEGESNXXXXXXXXXXXXXXXXX 665 ++PKKE E GES SV++ D + G SS EG S Sbjct: 145 NMPKKEG-EVQQESSNGESLAPSVVSGDPERQGSSSGQEG-SGGKDQGEDGEDCQDDDLS 202 Query: 666 XXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDFETSLSD 845 AWKMLD+AR I +K+ +TMEKVDIL +L EV+LERED E+SLSD Sbjct: 203 DADGDADEDESDLDMAWKMLDIARVITDKQSTETMEKVDILCSLAEVSLEREDIESSLSD 262 Query: 846 YLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRLQRLTGE 1025 Y NALSILERLVEPD+R AELNFRIC+ LE G +P EAIPYCQKA+ +CK+R++RL+ E Sbjct: 263 YKNALSILERLVEPDSRRTAELNFRICICLETGCQPKEAIPYCQKALLICKARMERLSNE 322 Query: 1026 VKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKLEDLQQL 1205 +K +GS + + + ++ + Q+ N D S SDKE EI L GL+ DLEKKLEDL+Q Sbjct: 323 IKGASGSATSSTVSEIDEGIQQSSNVPYIDKSASDKEVEIGDLAGLAEDLEKKLEDLKQQ 382 Query: 1206 VLNPTSSIFSEVMKMVAAKSTGSRNAVGSTA-LSSSQMG-ANTSGGGGFDSPTVSTAHT- 1376 NP + +E+M MV+AK S V + A +SSS+MG NT+ G +SPTVSTAHT Sbjct: 383 AENP-KQVLAELMGMVSAKPNASDKVVPAAAEMSSSRMGTVNTNFGKDLESPTVSTAHTG 441 Query: 1377 ----NGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKPS-DLVEKGEGSGS 1514 SGVTHL LM++ S ESS KKP+ + +K +G+ S Sbjct: 442 AAGGGAASGVTHLGVVGRGVKRVLMNTTSIESSASKKPALEFSDKADGNSS 492 >ref|XP_006282585.1| hypothetical protein CARUB_v10004537mg, partial [Capsella rubella] gi|482551290|gb|EOA15483.1| hypothetical protein CARUB_v10004537mg, partial [Capsella rubella] Length = 541 Score = 361 bits (927), Expect = 5e-97 Identities = 222/480 (46%), Positives = 294/480 (61%), Gaps = 23/480 (4%) Frame = +3 Query: 144 ETLEKESKTLEIEVEET-QASTEEPRIQDDQIGGESSTTNA-EEDSGKSLEYSSELMDKG 317 +TLE ++E VE Q TE ++ +S+ T+ + + K++E++ EL +KG Sbjct: 66 QTLEPNQASIEATVESAVQGGTESTCNNNNNNAADSAATDVCDVEREKTIEFADELTEKG 125 Query: 318 SEAMKARDFAEAAECFSRAVEIRVGHYGELAPECSFAYYKYGCALLYKAQEEADPLGSVP 497 S +K +DF EA +CFSRA+EIRV HYGEL EC AYY+YG ALL KAQ EADPLG+VP Sbjct: 126 SVFLKEQDFGEAVDCFSRALEIRVEHYGELDAECVNAYYRYGSALLEKAQAEADPLGNVP 185 Query: 498 KKE------NSEKDGPVPE---GESATTSVIN-DAKHDGDSSHNEGESNXXXXXXXXXXX 647 KKE +S KD V GES SV++ + + G SS EG Sbjct: 186 KKEGEVQQESSSKDDSVKNTVNGESLAASVVSSNPERQGSSSGQEGAGGIEQGEDGENCH 245 Query: 648 XXXXXXXXXXXXXXXXXXXXXAWKMLDVARAIAEKKPDDTMEKVDILSALGEVALEREDF 827 AWKMLD+ARAI +K+ DTM KVDIL AL E++LERED Sbjct: 246 DDDLSDADGDEDDSDLDM---AWKMLDIARAITDKQSADTMVKVDILCALAEISLEREDI 302 Query: 828 ETSLSDYLNALSILERLVEPDNRCIAELNFRICLVLEVGSKPDEAIPYCQKAISVCKSRL 1007 E+SLSDY ALSILERLVEPD+R AELNFRIC+ LE G +P EA+PYCQKA+ +CK+R+ Sbjct: 303 ESSLSDYKKALSILERLVEPDSRHTAELNFRICICLETGCQPKEAMPYCQKAMLICKARM 362 Query: 1008 QRLTGEVKNLTGSEATMVLPDTNQKVDQALNGSQSDSSVSDKEAEIETLTGLSSDLEKKL 1187 +RL+ E+K +GS + + + ++ + Q+ N D S SDKEAEIE + GL+ DLEKKL Sbjct: 363 ERLSNEIKGASGSATSSTVSEIDEGIQQSSNVPYIDKSTSDKEAEIEVMAGLAEDLEKKL 422 Query: 1188 EDLQQLVLNPTSSIFSEVMKMVAAKST--GSRNAVGSTA-LSSSQMG-ANTSGGGGFDSP 1355 EDL+Q NP + +E+M MV+AK++ S AV + A +SSS+MG ANT+ G +SP Sbjct: 423 EDLKQQAENP-KQVLAELMGMVSAKASEKASEEAVPAAAEMSSSRMGTANTNFGKDLESP 481 Query: 1356 TVSTAHT------NGNSGVTHLXXXXXXXXXXLMSSGSTESSPMKKP-SDLVEKGEGSGS 1514 TVSTAHT +SGVTHL LM++ S ESS KKP + +K +G+ S Sbjct: 482 TVSTAHTGAAAGGGASSGVTHLGVVGRGVKRVLMNATSIESSASKKPATGPSDKADGNSS 541