BLASTX nr result
ID: Akebia26_contig00013614
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia26_contig00013614 (3018 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CBI22504.3| unnamed protein product [Vitis vinifera] 432 e-118 ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit... 432 e-118 ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isof... 429 e-117 ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citr... 429 e-117 ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prun... 428 e-117 gb|EXB76647.1| Homeobox protein [Morus notabilis] 419 e-114 ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu... 409 e-111 ref|XP_002300247.2| homeobox family protein [Populus trichocarpa... 408 e-111 ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-... 404 e-109 ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296... 389 e-105 ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c... 387 e-104 emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera] 379 e-102 ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204... 374 e-100 ref|XP_006829269.1| hypothetical protein AMTR_s00001p00272780 [A... 368 8e-99 ref|XP_007143079.1| hypothetical protein PHAVU_007G041800g [Phas... 363 3e-97 ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isof... 361 1e-96 ref|XP_006605989.1| PREDICTED: homeobox protein HAT3.1-like isof... 359 5e-96 ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cuc... 356 4e-95 ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain ... 352 8e-94 ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain ... 337 1e-89 >emb|CBI22504.3| unnamed protein product [Vitis vinifera] Length = 977 Score = 432 bits (1110), Expect = e-118 Identities = 298/725 (41%), Positives = 374/725 (51%), Gaps = 42/725 (5%) Frame = -1 Query: 2226 SSARENCQTIESSSLKQNRLQGKHELG-------------SEHAQSEPMETKIVESSVLE 2086 S A+E+ +T +SSS KQN L+ +L SE+ Q EP E S VL Sbjct: 4 SPAQESNRTRKSSSPKQNILEEARKLSESVCSESSEQKRPSENGQHEPAEI----SPVLS 59 Query: 2085 NCQKIESSSPKQNTTQEKQNG----QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVG 1918 NC E S + G T++S + + ++ + + + +V Sbjct: 60 NCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVT 119 Query: 1917 SDSVDIKQLGT---PPKDAAK------STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVL 1765 S+ I++LG PP++ A+ S + N+ +++ +KRKY LRSS+S VL Sbjct: 120 KSSI-IEKLGQSEPPPENVARYSGLDQSGSAPKDLANKRTAKLVKRKYKLRSSVSGSRVL 178 Query: 1764 QSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEH 1588 +S S K PS + N KT DEF+R RK LRYLLNRM YE Sbjct: 179 RSRSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQ 238 Query: 1587 NLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLF 1408 NLIDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL+IRDLFQHLDSLCAEGR SLF Sbjct: 239 NLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLF 298 Query: 1407 DSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWL 1228 DSEGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH CL PPL K++IPP+DEGWL Sbjct: 299 DSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWL 358 Query: 1227 CPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXX 1048 CP CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA +G+N D+N GF Sbjct: 359 CPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDY 416 Query: 1047 XXDGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXX 904 D PE+DE+ Q D+ SP+ EQ LGL Sbjct: 417 DPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSE 476 Query: 903 XXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGS 724 DEQ+ + A D + S + DG Sbjct: 477 DDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL------ 523 Query: 723 SNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXX 544 + + + GRKKK ++ ELLS+LE Q NA P+ +KR+ ERLDYKKLHDEAYGNV Sbjct: 524 -DEQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDEAYGNVSSD 581 Query: 543 XXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRR 364 PRKRKN GN + V P G +L+ TP RR Sbjct: 582 SSDDEDWTENVIPRKRKNLS--GNVASVSPNGNTSITENGTNTKDIKHDLEAAGCTPKRR 639 Query: 363 TRQKLDSDGAYNIAAETHKDSLGP-DYAVKVATASASRKNTGSTQRNLK--ETKNTPKRR 193 TRQKL+ + N AE+HKDS P K +S + T+R K + P R Sbjct: 640 TRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQENQYPDRA 699 Query: 192 TREKL 178 +EKL Sbjct: 700 MKEKL 704 >ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera] Length = 968 Score = 432 bits (1110), Expect = e-118 Identities = 298/725 (41%), Positives = 374/725 (51%), Gaps = 42/725 (5%) Frame = -1 Query: 2226 SSARENCQTIESSSLKQNRLQGKHELG-------------SEHAQSEPMETKIVESSVLE 2086 S A+E+ +T +SSS KQN L+ +L SE+ Q EP E S VL Sbjct: 4 SPAQESNRTRKSSSPKQNILEEARKLSESVCSESSEQKRPSENGQHEPAEI----SPVLS 59 Query: 2085 NCQKIESSSPKQNTTQEKQNG----QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVG 1918 NC E S + G T++S + + ++ + + + +V Sbjct: 60 NCIVTEQSELPPEDVGDTILGLPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVT 119 Query: 1917 SDSVDIKQLGT---PPKDAAK------STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVL 1765 S+ I++LG PP++ A+ S + N+ +++ +KRKY LRSS+S VL Sbjct: 120 KSSI-IEKLGQSEPPPENVARYSGLDQSGSAPKDLANKRTAKLVKRKYKLRSSVSGSRVL 178 Query: 1764 QSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEH 1588 +S S K PS + N KT DEF+R RK LRYLLNRM YE Sbjct: 179 RSRSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQ 238 Query: 1587 NLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLF 1408 NLIDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL+IRDLFQHLDSLCAEGR SLF Sbjct: 239 NLIDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLF 298 Query: 1407 DSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWL 1228 DSEGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH CL PPL K++IPP+DEGWL Sbjct: 299 DSEGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWL 358 Query: 1227 CPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXX 1048 CP CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA +G+N D+N GF Sbjct: 359 CPACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDY 416 Query: 1047 XXDGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXX 904 D PE+DE+ Q D+ SP+ EQ LGL Sbjct: 417 DPDCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSE 476 Query: 903 XXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGS 724 DEQ+ + A D + S + DG Sbjct: 477 DDDFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL------ 523 Query: 723 SNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXX 544 + + + GRKKK ++ ELLS+LE Q NA P+ +KR+ ERLDYKKLHDEAYGNV Sbjct: 524 -DEQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDEAYGNVSSD 581 Query: 543 XXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRR 364 PRKRKN GN + V P G +L+ TP RR Sbjct: 582 SSDDEDWTENVIPRKRKNLS--GNVASVSPNGNTSITENGTNTKDIKHDLEAAGCTPKRR 639 Query: 363 TRQKLDSDGAYNIAAETHKDSLGP-DYAVKVATASASRKNTGSTQRNLK--ETKNTPKRR 193 TRQKL+ + N AE+HKDS P K +S + T+R K + P R Sbjct: 640 TRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQENQYPDRA 699 Query: 192 TREKL 178 +EKL Sbjct: 700 MKEKL 704 >ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Citrus sinensis] gi|568867273|ref|XP_006486964.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Citrus sinensis] Length = 1063 Score = 429 bits (1102), Expect = e-117 Identities = 338/978 (34%), Positives = 450/978 (46%), Gaps = 45/978 (4%) Frame = -1 Query: 2799 MQEKNELGSEHAQSGPMETKIVESNA-EENCQTIESSS----------PKQNTLQGNHAL 2653 ++ K+ LGSE ++ P ET I S + C + SS + + Q N A Sbjct: 45 LESKSLLGSEAVENEPRETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKAS 104 Query: 2652 CSEHAQSEPVETKILGSNAQEN-----CQTIESSSPKQNILQGKLELGSEHAQSEPMETK 2488 CS+ SE L N Q C I P+ L ++A +P+ Sbjct: 105 CSQQNTSEQKHGTELMHNEQSEQKHQLCYQIVFDKPQATSLV-------DNATLQPVSKD 157 Query: 2487 IVESNAQESYKTIESSS-----------LKQNTLQGKHELGSEHAQSEPME--MKIVEPS 2347 + +S+ + + ++ S + L KH+LGSE Q+EP ++ Sbjct: 158 VSKSSQTGTRQALDFLSGNRCNELDVDCVHSEPLDQKHQLGSEIIQNEPAVNIARLPSDG 217 Query: 2346 AQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167 +EN QTI K E +QS P + A ++CQ E S L+Q+ Sbjct: 218 VEENLQTISEDLTKV--------CPVEPSQSPPRD-------ANKSCQAGEISYLQQSSS 262 Query: 2166 QGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK-- 1993 + E + EP S LE + E+S+ + + E + E + Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSIEQLKQPEVP 322 Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVD----IKQLGTPPKDAAKSTNRSSRKKNQVS 1825 I +H+QS SD ++ ++Q TPP A ++ RK + + Sbjct: 323 ITIPSTKTSATKHLQSS--------SDLMEKKSCLEQSETPPNYVANNSACLGRKGKR-A 373 Query: 1824 SRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS---ANVXXXXXXXXXXXXXXXKTL 1654 ++ LK Y +RS I S VL+S S PP+ SI+ N K + Sbjct: 374 TKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKKIV 433 Query: 1653 NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLK 1474 DE+SR R LRYLLNR+ YE NLIDAYS EGWKGLS+EK+KPE+ELQRATSEILR KLK Sbjct: 434 ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493 Query: 1473 IRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGF 1294 IRDLFQ LDSLCA G SLFDSEGQI SEDI+CAKCGSKDLS DNDIILCDG CDRGF Sbjct: 494 IRDLFQRLDSLCAGG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552 Query: 1293 HLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAAT 1114 H CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT L I D WEKVFPEAAA Sbjct: 553 HQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA- 611 Query: 1113 GSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXD-----LGAS 949 G N D NFG DG DE+ + + A Sbjct: 612 --GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAP 669 Query: 948 PHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS--DGKGSSGT 775 ++ YLG D+++ + + + SSG Sbjct: 670 ADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729 Query: 774 NVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRER 595 + +GA G SN + +S+N ELLSI++PG Q A+PV KR+ ER Sbjct: 730 D-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPG--QDGAAPVYGKRSSER 780 Query: 594 LDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXX 415 LDYKKL+DE YGNVP +DGG Sbjct: 781 LDYKKLYDETYGNVPYDSSDDESW---------SDDGG---------------------- 809 Query: 414 XXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGST 235 P +RT+ + A PD V RK+T + Sbjct: 810 -------------PRKRTKSTKEGSSA------------SPDGKTPVIRR---RKSTKAA 841 Query: 234 QRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQG 55 + L ET+NTPKRR R KL+ E ++ A++ + P S G+ T R +G+ VTQ Sbjct: 842 KEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR-RHRTSYRKIGEEVTQK 900 Query: 54 LLESFRENEYPTRDKKEN 1 L SF+EN+YP R KE+ Sbjct: 901 LYNSFKENQYPNRTTKES 918 Score = 411 bits (1056), Expect = e-111 Identities = 325/939 (34%), Positives = 439/939 (46%), Gaps = 88/939 (9%) Frame = -1 Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETK-ILGSNAQENCQTIESSSPKQ 2554 S ++ T + S KQ T + C+E SEP+E+K +LGS A EN + E+S P Sbjct: 10 SPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVEN-EPRETSIPNS 68 Query: 2553 NILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEP 2374 LQ G S + ++S +T ++S +QNT + KH G+E +E Sbjct: 69 EKLQA--FCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQKH--GTELMHNEQ 124 Query: 2373 MEMKIVEPSAQENCQTI-----ESSSLKQNTSQGKHELGSEHAQSEPMETK-IVESSARE 2212 E K + C I +++SL N + + + ++S T+ ++ + Sbjct: 125 SEQK------HQLCYQIVFDKPQATSLVDNAT--LQPVSKDVSKSSQTGTRQALDFLSGN 176 Query: 2211 NCQTIESSSLKQNRLQGKHELGSEHAQSEPME--TKIVESSVLENCQKIE---------- 2068 C ++ + L KH+LGSE Q+EP ++ V EN Q I Sbjct: 177 RCNELDVDCVHSEPLDQKHQLGSEIIQNEPAVNIARLPSDGVEENLQTISEDLTKVCPVE 236 Query: 2067 -SSSPKQNTTQEKQNGQTT---ESSSPQKIMFG-------------------EKNELGLE 1957 S SP ++ + Q G+ + +SSS Q F E+ ELG Sbjct: 237 PSQSPPRDANKSCQAGEISYLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGET 296 Query: 1956 HVQSETLEIRIVGSDSVD----------------------------------IKQLGTPP 1879 + +V S++ ++Q TPP Sbjct: 297 SAGELGASLELVVKSSIEQLKQPEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPP 356 Query: 1878 KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS---ANV 1708 A ++ RK + +++ LK Y +RS I S VL+S S PP+ SI+ N Sbjct: 357 NYVANNSACLGRKGKR-ATKSLKNNYTVRSLIGSDRVLRSRSGERPIPPESSINLADVNS 415 Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528 K + DE+SR R LRYLLNR+ YE NLIDAYS EGWKGLS+EK+K Sbjct: 416 IGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLK 475 Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348 PE+ELQRATSEILR KLKIRDLFQ LDSLCA G SLFDSEGQI SEDI+CAKCGSKD Sbjct: 476 PEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKD 534 Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168 LS DNDIILCDG CDRGFH CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT Sbjct: 535 LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594 Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-----XXX 1003 L I D WEKVFPEAAA G N D NFG DG DE+ + Sbjct: 595 RLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG 651 Query: 1002 XXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI--KKXXXXXXXX 829 ++ A ++ YLG D+++ + Sbjct: 652 SSSDDSDFTSTSDEVEAPADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFT 711 Query: 828 XXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEP 649 A+ + SSG + +GA G SN + +S+N ELLSI++P Sbjct: 712 SDSEDLAAVLEDNRSSGND-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKP 764 Query: 648 GPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNT 469 G Q A+PV KR+ ERLDYKKL+DE YGNVP PRKR G++ Sbjct: 765 G--QDGAAPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSS 822 Query: 468 SVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPD 289 + P G L + E+TP RR R KL+++ + A++H+ P Sbjct: 823 A--SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPG 880 Query: 288 YAVKVATASASRKNTGSTQR--NLKETKNTPKRRTREKL 178 + S + TQ+ N + P R T+E L Sbjct: 881 SRGRRHRTSYRKIGEEVTQKLYNSFKENQYPNRTTKESL 919 >ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] gi|557524813|gb|ESR36119.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] Length = 1063 Score = 429 bits (1102), Expect = e-117 Identities = 341/978 (34%), Positives = 448/978 (45%), Gaps = 45/978 (4%) Frame = -1 Query: 2799 MQEKNELGSEHAQSGPMETKIVESNA-EENCQTIESSS----------PKQNTLQGNHAL 2653 ++ K+ LGSE ++ P ET I S + C + SS + + Q N A Sbjct: 45 LESKSLLGSEAVENEPRETSIPNSEKLQAFCGDVPDSSFTDHLAPPSEDMRKSTQTNKAS 104 Query: 2652 CSEHAQSEPVETKILGSNAQEN-----CQTIESSSPKQNILQGKLELGSEHAQSEPMETK 2488 CS+ SE L N Q C I P+ L ++A +P+ Sbjct: 105 CSQQNTSEQKHGTELMHNEQSEQKHQLCYQIVFDKPQATSLV-------DNATLQPVSKD 157 Query: 2487 IVESNAQESYKTIESSS-----------LKQNTLQGKHELGSEHAQSEPMEMKIVEPS-- 2347 + +S+ + + ++ S + L KH+LGSE Q+EP PS Sbjct: 158 VSKSSQTGNRQALDFLSGNRCNELDVDCVHSEPLNQKHQLGSEIIQNEPAVNVARLPSDG 217 Query: 2346 AQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167 +EN QTI K E +QS P + A ++CQ E S L+Q+ Sbjct: 218 VEENLQTISEDLTKV--------CPVEPSQSPPRD-------ANKSCQAGEISCLQQSSS 262 Query: 2166 QGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK-- 1993 + E + EP S LE + E+S+ + + E + E + Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGELGASLELVVKSSIEQLKQLEVP 322 Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVD----IKQLGTPPKDAAKSTNRSSRKKNQVS 1825 I +H+QS SD ++ ++Q TPP A ++ RK + + Sbjct: 323 ITIPSTKTSATKHLQSS--------SDLMEKKSCLEQSETPPNYVANNSACLGRKGKR-A 373 Query: 1824 SRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANVXXXXXXXXXXXXXXXKTL 1654 ++ LK Y +RS I S VL+S S PP+ S N K + Sbjct: 374 TKSLKNNYTVRSLIGSDRVLRSRSGERPLPPESSNNLADVNSIGERKQKKRNKIRRKKIV 433 Query: 1653 NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLK 1474 DE+SR R LRYLLNR+ YE NLIDAYS EGWKGLS+EK+KPE+ELQRATSEILR KLK Sbjct: 434 ADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRKLK 493 Query: 1473 IRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGF 1294 IRDLFQ LDSLCA G SLFDSEGQI SEDI+CAKCGSKDLS DNDIILCDG CDRGF Sbjct: 494 IRDLFQRLDSLCAGG-FPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDRGF 552 Query: 1293 HLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAAT 1114 H CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT L I D WEKVFPEAAA Sbjct: 553 HQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPEAAA- 611 Query: 1113 GSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXD-----LGAS 949 G N D NFG DG DE+ + + A Sbjct: 612 --GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEVEAP 669 Query: 948 PHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS--DGKGSSGT 775 ++ YLGL D+++ + + + SSG Sbjct: 670 ADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRSSGN 729 Query: 774 NVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRER 595 + +GA G SN + +S+N ELLSI++PG Q A PV KR+ ER Sbjct: 730 D-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKPG--QDGAVPVYGKRSSER 780 Query: 594 LDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXX 415 LDYKKL+DE YGNVP +DGG Sbjct: 781 LDYKKLYDETYGNVPYDSSDDESW---------SDDGG---------------------- 809 Query: 414 XXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGST 235 P +RT+ + A PD V RK+T + Sbjct: 810 -------------PRKRTKSTKEGSSA------------SPDGKTPVIRR---RKSTKAA 841 Query: 234 QRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQG 55 + L ET+NTPKRR R KL+ E ++ A++ + P S G+ T R LG+ VTQ Sbjct: 842 KEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGR-RHRTSYRKLGEEVTQK 900 Query: 54 LLESFRENEYPTRDKKEN 1 L SF+EN+YP R KE+ Sbjct: 901 LYNSFKENQYPNRTTKES 918 Score = 412 bits (1058), Expect = e-112 Identities = 334/939 (35%), Positives = 445/939 (47%), Gaps = 88/939 (9%) Frame = -1 Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETK-ILGSNAQENCQTIESSSPKQ 2554 S ++ T + S KQ T + C+E SEP+E+K +LGS A EN + E+S P Sbjct: 10 SPSQVTSSTRDHSCLKQTTPEQMCKFCAECIHSEPLESKSLLGSEAVEN-EPRETSIPNS 68 Query: 2553 NILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEP 2374 LQ G S + ++S +T ++S +QNT + KH G+E +E Sbjct: 69 EKLQA--FCGDVPDSSFTDHLAPPSEDMRKSTQTNKASCSQQNTSEQKH--GTELMHNEQ 124 Query: 2373 MEMKIVEPSAQENCQTI-----ESSSLKQN-TSQGKHELGSEHAQSEPMETKIVESSARE 2212 E K + C I +++SL N T Q + S+ +Q+ + ++ + Sbjct: 125 SEQK------HQLCYQIVFDKPQATSLVDNATLQPVSKDVSKSSQTG--NRQALDFLSGN 176 Query: 2211 NCQTIESSSLKQNRLQGKHELGSEHAQSEPM--ETKIVESSVLENCQKIE---------- 2068 C ++ + L KH+LGSE Q+EP ++ V EN Q I Sbjct: 177 RCNELDVDCVHSEPLNQKHQLGSEIIQNEPAVNVARLPSDGVEENLQTISEDLTKVCPVE 236 Query: 2067 -SSSPKQNTTQEKQNGQTT---ESSSPQKIMFG-------------------EKNELG-- 1963 S SP ++ + Q G+ + +SSS Q F E+ ELG Sbjct: 237 PSQSPPRDANKSCQAGEISCLQQSSSEQTPEFTPGISSHEPSVVNYKLGSQLEQTELGET 296 Query: 1962 --------LEHV------QSETLEIRIV--------------GSDSVD----IKQLGTPP 1879 LE V Q + LE+ I SD ++ ++Q TPP Sbjct: 297 SAGELGASLELVVKSSIEQLKQLEVPITIPSTKTSATKHLQSSSDLMEKKSCLEQSETPP 356 Query: 1878 KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANV 1708 A ++ RK + +++ LK Y +RS I S VL+S S PP+ S N Sbjct: 357 NYVANNSACLGRKGKR-ATKSLKNNYTVRSLIGSDRVLRSRSGERPLPPESSNNLADVNS 415 Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528 K + DE+SR R LRYLLNR+ YE NLIDAYS EGWKGLS+EK+K Sbjct: 416 IGERKQKKRNKIRRKKIVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLK 475 Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348 PE+ELQRATSEILR KLKIRDLFQ LDSLCA G SLFDSEGQI SEDI+CAKCGSKD Sbjct: 476 PEKELQRATSEILRRKLKIRDLFQRLDSLCA-GGFPKSLFDSEGQIDSEDIYCAKCGSKD 534 Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168 LS DNDIILCDG CDRGFH CL PPL K+ IPP+DEGWLCPGCDCKVDCI L+N++QGT Sbjct: 535 LSADNDIILCDGACDRGFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGT 594 Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-----XXX 1003 L I D WEKVFPEAAA G N D NFG DG DE+ + Sbjct: 595 RLFITDNWEKVFPEAAA---GHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDG 651 Query: 1002 XXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI--KKXXXXXXXX 829 ++ A ++ YLGL D+++ + Sbjct: 652 SSSDDSDFTSTSDEVEAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFT 711 Query: 828 XXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEP 649 A+ + SSG + +GA G SN + +S+N ELLSI++P Sbjct: 712 SDSEDLAAVLEDNRSSGND-------EGAASPLGHSNGQRYKDGGNNESLNNELLSIIKP 764 Query: 648 GPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNT 469 G Q A PV KR+ ERLDYKKL+DE YGNVP PRKR G++ Sbjct: 765 G--QDGAVPVYGKRSSERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSS 822 Query: 468 SVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPD 289 + P G L + E+TP RR R KL+++ + A++H+ P Sbjct: 823 A--SPDGKTPVIRRRKSTKAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPG 880 Query: 288 YAVKVATASASRKNTGSTQR--NLKETKNTPKRRTREKL 178 + S + TQ+ N + P R T+E L Sbjct: 881 SRGRRHRTSYRKLGEEVTQKLYNSFKENQYPNRTTKESL 919 >ref|XP_007200058.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica] gi|462395458|gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica] Length = 1058 Score = 428 bits (1100), Expect = e-117 Identities = 319/905 (35%), Positives = 429/905 (47%), Gaps = 54/905 (5%) Frame = -1 Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEPVETKILGSNAQENCQTIESSSPKQN 2551 S+ CQ E S+ Q G+ + CSE + E + K++ + E C+ + + Sbjct: 15 SSQTAKCQ--EESTSGQIHEIGSESQCSEKTK-ENIGCKVVQNELLEICKASNNPDEQSQ 71 Query: 2550 ILQGKLELGSEHAQSEPMETKIVESNAQESYKTIESSSLK-------------------- 2431 L S H ++ + + V+ ++Q + + +SL Sbjct: 72 SFSENLTENS-HVENLGLPAEDVDKSSQNGAQNVTKNSLTEQLEMPREDPDVNNQSDKTS 130 Query: 2430 ---QNTLQGKHELGSEHAQSEPMEMKIVEPS---AQENCQTIESSSLKQNTSQGKHELGS 2269 Q +L+ ++ G + SEP E + S E QTI + + Q + + Sbjct: 131 CSGQMSLEQTNDSGFGTSSSEPAEERHPSGSFCVQNELLQTIMPLPICGGSEQVQPISEN 190 Query: 2268 EHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVL 2089 + S + + + CQT + S Q +E GS SEP + K SV Sbjct: 191 VNMASLNDQAGLPPEDVSKTCQTQKISCPHQITSHQINEFGSGSVPSEPAKQKDQLDSVP 250 Query: 2088 ENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNEL-----GLEHVQSETLEIRI 1924 + ++S ++T +Q G + E+ + + + L L + E L + Sbjct: 251 AQNDEAKTSKAVSSSTVFEQPGPSIEAMTEDSPIGHSEPPLEDLSKSLSDKEMEPLPEDV 310 Query: 1923 VGSDSVDIKQLGTPPKDAAK-STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHG 1747 + S ++QL T K+A K S+ + K SR KRKYM RS + S VL+S + Sbjct: 311 TQNSS--LQQLETASKNALKISSCLGPKDKKNPKSR--KRKYMSRSFVRSDRVLRSKTGE 366 Query: 1746 TSKPPDPSISANVXXXXXXXXXXXXXXXK--------------TLNDEFSRTRKRLRYLL 1609 KP D +S NV + + DEFSR R LRYLL Sbjct: 367 KEKPKDLKLSNNVATLESSNSIANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLL 426 Query: 1608 NRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEG 1429 NR+GYE +LIDAYSGEGWKG S+EK+KPE+ELQRATSEILR KLKIRDLFQ L+SLCAEG Sbjct: 427 NRIGYEKSLIDAYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEG 486 Query: 1428 RLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIP 1249 SLFDSEGQI SEDIFC KCGSKD+S DNDIILCDG CDRGFH CL PPL + IP Sbjct: 487 MFPESLFDSEGQIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIP 546 Query: 1248 PEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXX 1069 P+DEGWLCPGCDCKVDCI LLND QGT+LS+ D WEKVFPEAAA S DN G Sbjct: 547 PDDEGWLCPGCDCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAAASAGENQDNHGLPSD 606 Query: 1068 XXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGA-SPHEEQYLGLXXXXXXXXXX 892 DGPE D +VQ L ++EQYLGL Sbjct: 607 DSDDNDYDPDGPETDNKVQGEESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDY 666 Query: 891 XXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNV--LVHSSVDGAKLLTGSSN 718 +E +K+ A D S +V +S+D +K GS Sbjct: 667 NPYAPDVNEDVKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHRGSGE 726 Query: 717 RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXX 538 + S G +KK S+ EL+S+LE GP QG ++P+ KR+ ERLDYK+LHDEAYGNVP Sbjct: 727 QSSISG-QKKHSLKDELISLLESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSS 785 Query: 537 XXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTR 358 RKRK G G + P G ++ + E+TP R Sbjct: 786 DDEDWNDIATQRKRKK--GTGQVANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPH 843 Query: 357 QKLDSDGAYNIAAETHKDSLGPDYAVKVATASA---SRKNTGSTQRNLKETKNT--PKRR 193 +K + + N++ ++ K S A +S SR +TQR K K P R Sbjct: 844 RKSNVEDTSNLSNKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRS 903 Query: 192 TREKL 178 +E L Sbjct: 904 MKESL 908 >gb|EXB76647.1| Homeobox protein [Morus notabilis] Length = 1031 Score = 419 bits (1077), Expect = e-114 Identities = 333/952 (34%), Positives = 450/952 (47%), Gaps = 87/952 (9%) Frame = -1 Query: 2595 QENCQTIESSSPKQNILQGKLELG-SEH---AQSEPMETKIVESNAQESYKTIE-----S 2443 Q N QT + + P+QN+L+ E SEH ++++ +E ES+ ++ E S Sbjct: 13 QVNGQTAKCTHPEQNMLEQVKETEPSEHIHISEAQTVENLRQESSIAATHVFCEQLQPVS 72 Query: 2442 SSLKQNTL-----QGKHELG----------SEHAQSEPM-----------EMKIVEPSAQ 2341 S+ +N+L Q L ++H +S P E+++ + A Sbjct: 73 DSVAKNSLPEAPGQQNEPLEPIAAVGMCSTNKHPESSPRNLTNDSLCNSKELELPQKDAG 132 Query: 2340 ENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKI-----VESSARENCQTIESSSLKQ 2176 N QT + S K+ ++ KH GS + +E TK +++ R+ T+ S Q Sbjct: 133 ANGQTEKISFPKEAAAEVKHGCGSGNLCAEQAGTKNDVDSNLQNEIRKTDITVSSFVFTQ 192 Query: 2175 NR--LQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQN-------G 2023 + K L S + P E V+ +CQ SS P+Q+T + ++ G Sbjct: 193 KLEIVSEKRSLISGGNLAVPSE------DVVRHCQTENSSCPQQSTLGQIKDFDCGCLLG 246 Query: 2022 QTTESSSPQKIMFGEKNELGLEHVQSETLEIRIV----------------GSDSVDIKQL 1891 +T + +++ LG E VQ+ +E RI GSDS KQ+ Sbjct: 247 ETPK----------QEDHLGTELVQNVLVETRIAASNGIVSEHLEPPVGDGSDSYIDKQV 296 Query: 1890 GTPPKDAAKST-------------NRSSR--KKNQVSSRPLKRKYMLRSSISSPTVLQSS 1756 P +D +KS+ N+ S+ +K++ +S+ K++YMLRS + S VL+S Sbjct: 297 EQPSEDVSKSSSLEQLETSSKSLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSR 356 Query: 1755 SHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKT--LNDEFSRTRKRLRYLLNRMGYEHN 1585 + K + S + +N+ T + DEFSR RKRL+Y NR+ YE N Sbjct: 357 TQEKLKSHELSNTLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQN 416 Query: 1584 LIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFD 1405 LIDAYS EGWKG S+EK+KPE+ELQRA SEI R KLKIRDLFQ LDSLCAEGR SLFD Sbjct: 417 LIDAYSSEGWKGTSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFD 476 Query: 1404 SEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLC 1225 SEGQI SEDIFCAKCGSKD+S +NDIILCDG CDRGFH CL PPL + IPP+DEGWLC Sbjct: 477 SEGQIDSEDIFCAKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLC 536 Query: 1224 PGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATG-SGDNPDDNFGFXXXXXXXXXX 1048 PGCDCKVDC LLND GTNLS+ D WEKVFPEAAA G + D N F Sbjct: 537 PGCDCKVDCFDLLNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDY 596 Query: 1047 XXDGPELDERVQXXXXXXXXXXXXXXXXDL--GASPHEEQYLGLXXXXXXXXXXXXXXXX 874 GPE+ E+V+ +L A P +EQY GL Sbjct: 597 DPYGPEIVEKVEGDESSSDESEYTSACDELEGEAPPKDEQYFGLSSDDSEDNDFDPDDQD 656 Query: 873 XDEQIK-KXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGR 697 DE K + L +G+ + V SS+D + L + + S+ G Sbjct: 657 VDENAKQESSSSDFTSDSEDLAFTLDEGQIAEKDEV---SSLDPTRSLGNAVMQSSKRG- 712 Query: 696 KKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXX 517 K SI ELL ILE G Q + P+ KR+ ERLDYK+LHDE YG++P Sbjct: 713 GNKSSIKDELLDILESGTGQDGSPPISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTD 772 Query: 516 XEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDG 337 APRKRK G S V P +L+D E P RR+RQ Sbjct: 773 YAAPRKRKRT--TGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVPRRRSRQ------ 824 Query: 336 AYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADN 157 N+ T N N P + Sbjct: 825 -----------------------------NSVVTDEN-----NIPNK------------- 837 Query: 156 LAAETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYPTRDKKEN 1 L +PK S+G+ + +R LG+AVTQ L +SF+EN+Y R KE+ Sbjct: 838 LLQGSPKSG----STGRRRELSTNRRLGEAVTQRLYQSFKENQYLDRATKES 885 >ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa] gi|550331388|gb|EEE87841.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa] Length = 934 Score = 409 bits (1051), Expect = e-111 Identities = 311/933 (33%), Positives = 436/933 (46%), Gaps = 25/933 (2%) Frame = -1 Query: 2730 SNAEENCQTIESSSPKQNTLQGNHALCSEH------AQSEPVETKILGSNAQENCQTIES 2569 S ++ + QT S P Q L+ H +E+ + +E++I+ + A ++ + Sbjct: 10 SPSQVSSQTKSYSCPSQTKLENTHGFTAEYNCGGYSEEKHKLESEIIQTEAGDSGTAVLQ 69 Query: 2568 SSPKQNILQGKLEL-GSEHAQSEPMETKIVESNAQESYKTIESSSLKQNTLQGKHELGSE 2392 S + + ++ + +P + ES + I +++ Q G + + Sbjct: 70 SGAGETVEPSTEDVTNNSFTDLDPPPEDARGATFDESSRPI-LTAIDQKLEPGARSVNTA 128 Query: 2391 --HAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVESSA 2218 H +S + N SS + TSQ E + ++ +E + ++S Sbjct: 129 CTHGESSKATDSGILQDEPGNTNAASSSCIANETSQASLENLANNSCTEDVGPPYGDASK 188 Query: 2217 RENCQTIESSSLKQNRLQGKH-ELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTT 2041 I+ SS Q + G EL S+ A E E + S L S + Sbjct: 189 GNQ---IDKSSYPQQTISGHALELLSDRACCERSEERQKPGSEL-------SENESTGID 238 Query: 2040 QEKQNGQTTESSSPQKIMFGEKNELGLEHV------------QSETLEIRIVGSDSVDIK 1897 E +G E+S P + +++ + +HV +T D + Sbjct: 239 TELYSGIAIENSEPLTQLVTKRSPI--KHVGLLPGDSIIIPANEQTRPTHDDEDKGPDHE 296 Query: 1896 QLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS 1717 L TP + A T R R + + +SR ++ YMLRS SS VL+S S K P+ S + Sbjct: 297 HLETPSRVAIGITRRG-RPRGKSASRLSRKIYMLRSLRSSDRVLRSRSQEKPKAPESSNN 355 Query: 1716 A---NVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGL 1546 + N + DE+S+ R LRYLLNRM YE +LI AYSGEGWKGL Sbjct: 356 SGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGL 415 Query: 1545 SMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCA 1366 S+EK+KPE+ELQRATSEI R K+KIRDLFQH+DSLC+EGR SLFDSEGQI SEDIFCA Sbjct: 416 SLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLFDSEGQIDSEDIFCA 475 Query: 1365 KCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLL 1186 KCGSKDL+ DNDIILCDG CDRGFH CL PPL ++ IPP+DEGWLCPGCDCKVDCI LL Sbjct: 476 KCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWLCPGCDCKVDCIGLL 535 Query: 1185 NDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXX 1006 ND QGTN+SI D WEKVFPEAAAT SG D NFG DGP++D++ Q Sbjct: 536 NDSQGTNISISDSWEKVFPEAAATASGQKLDHNFGPSSDDSDDNDYEPDGPDIDKKSQEE 595 Query: 1005 XXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXX 826 + A P ++YLGL +E++K+ Sbjct: 596 ESSSDESDFTSASDEFKAPPDGKEYLGLSSDDSEDDDYDPDAPVLEEKLKQESSSSDFTS 655 Query: 825 XXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPG 646 A +G G S + H ++ + S+ R+S+ KK QS+N+ELLS+LEP Sbjct: 656 DSEDLAATINGDGLSLEDE-CHMPIEPRGV---SNGRKSKFDGKKMQSLNSELLSMLEPD 711 Query: 645 PSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTS 466 Q ++ V KRN +RLDYKKL+DE YGN+ PRKR+ + G+ + Sbjct: 712 LCQDESATVSGKRNVDRLDYKKLYDETYGNI--STSSDDDYTDTVGPRKRRKN--TGDVA 767 Query: 465 VVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDY 286 V G LK+ + P R T Q Sbjct: 768 TVTANGDASVTENGMNSKNMNQELKENKRNPERGTCQ----------------------- 804 Query: 285 AVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGK 106 + +ET +P + + GA +L+ + K ++ P + K Sbjct: 805 -----------------NSSFQETNVSPAK------SYVGA-SLSGSSGK-SVRPSAYKK 839 Query: 105 IATATPSRTLGKAVTQGLLESFRENEYPTRDKK 7 + G+AVTQ L FREN+YP R K Sbjct: 840 L---------GEAVTQRLYSYFRENQYPDRAAK 863 >ref|XP_002300247.2| homeobox family protein [Populus trichocarpa] gi|550348560|gb|EEE85052.2| homeobox family protein [Populus trichocarpa] Length = 930 Score = 408 bits (1049), Expect = e-111 Identities = 318/965 (32%), Positives = 434/965 (44%), Gaps = 16/965 (1%) Frame = -1 Query: 2853 SDAKNCETMESSSPKHNTMQEKNELGSEHAQSG------PMETKIVESNAEENCQTIESS 2692 S +K T S P T++ +E +E+ G +E +I+++ A +N + S Sbjct: 10 SPSKVSHTKSYSCPAQTTLENTHEPSAEYKFGGYPEERHKLECEIIQTEAGDNRAAVLQS 69 Query: 2691 SPKQNTLQGNHALCSEHAQSEPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHA 2512 + L S +N S++ Q + G + + Sbjct: 70 CSGEVVQPSTDDLTKSPLIDLDPPPDDARSALFDNSPRPISTAMDQKLEPGATSVNTACV 129 Query: 2511 QSEPMETKIVESNA----QESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSA 2344 SE +K ++S+ + T SS + T Q E + +++E + +VE S Sbjct: 130 HSE--SSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAEDAGLSLVEAS- 186 Query: 2343 QENCQTIESSSLKQNTSQGK-HELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRL 2167 N I+ SS Q T+ G+ E S+ A +P+E + Sbjct: 187 --NSDLIDESSYSQQTTSGQTREFHSDRACCKPLEER----------------------- 221 Query: 2166 QGKHELGSEHAQSEPMETKIVESS--VLENCQKIESSSPKQNTTQEKQNGQTTESSSPQK 1993 + GSE A++E ME I S +EN + + K + + S P Sbjct: 222 ---QKPGSELAENESMEIGIGLPSGIAIENLEPLTELVTKSCPIKHIGLPPGDDISIPAN 278 Query: 1992 IMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPL 1813 ++ ++ E LE + + I G P + K T++ S KK SSR Sbjct: 279 EQIRPTHDKESKYPDCEHLEK--LSGIVIGITSQGVP---SVKRTSKLSGKKYTSSSR-- 331 Query: 1812 KRKYMLRSSISSPTVLQSSSHGTSKPPDPS---ISANVXXXXXXXXXXXXXXXKTLNDEF 1642 S VL+S+S K P+PS + N + DE+ Sbjct: 332 ----------KSDRVLRSNSQEKPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEY 381 Query: 1641 SRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDL 1462 SR R RLRYLLNRM YE +LI AYSGEGWKGLS+EK+KPE+ELQRATSEI+R K+KIRDL Sbjct: 382 SRIRARLRYLLNRMSYEQSLITAYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDL 441 Query: 1461 FQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMC 1282 FQH+DSLC EGR SLFDSEGQI SEDIFCAKCGSKDL+ DNDIILCDG CDRGFH C Sbjct: 442 FQHIDSLCGEGRFPASLFDSEGQIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFC 501 Query: 1281 LNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGD 1102 L PPL ++ IPP DEGWLCPGCDCKVDCI LLND QGTN+SI D+W+ VFPEAAA SG Sbjct: 502 LVPPLLREDIPPGDEGWLCPGCDCKVDCIDLLNDSQGTNISISDRWDNVFPEAAAVASGQ 561 Query: 1101 NPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGL 922 D NFG DGP++DE+ Q + A P ++QYLGL Sbjct: 562 KLDYNFGLSSDDSDDNDYDPDGPDIDEKSQ-EESSSDESDFSSASDEFEAPPDDKQYLGL 620 Query: 921 XXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGA 742 +E++K+ A +G G S H ++ Sbjct: 621 PSDDSEDDDYDPDAPVLEEKLKQESSSSDFTSDSEDLDATLNGDGLS-LGDEYHMPIEPH 679 Query: 741 KLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAY 562 + S+ RRS+ G KK S+N++LLS+LEP Q ++PV KRN ERLDYKKL+DE Y Sbjct: 680 E---DSNGRRSRFGGKKNHSLNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETY 736 Query: 561 GNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIE 382 GN+ APRKR+ + G+ ++ G LK E Sbjct: 737 GNI--STSSDDDYTDTVAPRKRRKN--TGDVAMGIANGDASVTENGLNSKNMNQELKKNE 792 Query: 381 STPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLKETKNTP 202 T R TH++S D V A +GS+ + ++ Sbjct: 793 HTSGR-----------------THQNSSFQDTNVSPAKTHVGESLSGSSSKRVR------ 829 Query: 201 KRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYP 22 + + LG+AVTQ L F+EN YP Sbjct: 830 -----------------------------------PSAYKKLGEAVTQKLYSFFKENRYP 854 Query: 21 TRDKK 7 + K Sbjct: 855 DQAAK 859 >ref|XP_007042568.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] gi|590687101|ref|XP_007042569.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] gi|508706503|gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] gi|508706504|gb|EOX98400.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] Length = 950 Score = 404 bits (1039), Expect = e-109 Identities = 315/916 (34%), Positives = 429/916 (46%), Gaps = 37/916 (4%) Frame = -1 Query: 2814 PKHNTMQEKNELGSEHAQSGPMETK------IVESNAEENCQTIESSSPKQNTLQGNHAL 2653 P+ +T ++ +E GSE+ + E K ++ + EN + SS G H Sbjct: 24 PEESTSEQAHEFGSEYLLTELSENKNQCGYAATQNESAENATGVSSS--------GVHER 75 Query: 2652 CSEHA--QSEPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETK--- 2488 E+ S P + +L + T +S ++ + E E+ ++E E K Sbjct: 76 SPEYVAKNSSPERSGLLPKGVMGHNHTDKSFYAQETVSGKTHEYDCEYVRTETSEEKHQP 135 Query: 2487 ---IVESNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQE------N 2335 IV++ +E+ + L LQ E SE+A +E + + + E S++ + Sbjct: 136 GSEIVQNELEEACSLV--CDLPAKNLQTFSEGLSENAITESLGL-LPEDSSKHTKTDKLS 192 Query: 2334 CQTIESSSLKQNTSQGK--HELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQG 2161 C + SS N G ELG Q + ++++ + + E+ + SS++ LQ Sbjct: 193 CPQLVSSEPTVNFGSGNVCKELGESPEQRQQLDSESLPNGIEESTIAV-SSNVSNQALQL 251 Query: 2160 KHE-LGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMF 1984 K E +G H SP + T Q+ S SP Sbjct: 252 KPEDMGKSHCGGH-------------------LHSPPEGVTNVIQS-----SKSPLVEPL 287 Query: 1983 GEKNEL--GLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLK 1810 G E G Q L + +S ++Q T PK+ ++ S R++N +S+ +K Sbjct: 288 GLPQEFAQGNPSTQQSGLPCEDMAQNS-GVEQHETKPKNLLEN---SGRRRNGKTSKTIK 343 Query: 1809 RKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKT--LNDEFS 1639 +KYMLRS SS VL+S K + S + A+V + DEFS Sbjct: 344 KKYMLRSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFS 403 Query: 1638 RTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLF 1459 R R LRYLLNR+ YE +LI AYS EGWKGLS+EK+KPE+ELQRATSEILR KLKIRDLF Sbjct: 404 RIRTHLRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLF 463 Query: 1458 QHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCL 1279 QH+DSLCAEG+L SLFDSEGQI SEDIFCAKCGSKDLS +NDIILCDG CDRGFH CL Sbjct: 464 QHIDSLCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCL 523 Query: 1278 NPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDN 1099 PPL K+ IPP+DEGWLCPGCDCKVDCI L+N+ QGT+ SI D WEKVFPEAA +G N Sbjct: 524 QPPLLKEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFPEAAVAAAGQN 583 Query: 1098 PDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLX 919 D NFG DG E DE+ +L +QYLGL Sbjct: 584 QDPNFGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYLGLP 643 Query: 918 XXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAK 739 DE +K S + ++ + K Sbjct: 644 SDDSEDDDYDPDGPNHDEVVKPESSSSDF------------SSDSEDLDAMLEEDITSQK 691 Query: 738 ---LLTGSSNRRSQMGRKK---KQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKL 577 + S+ R S+ + K K+S+N ELLSI+EP S+ + S + KR+ ERLDYK+L Sbjct: 692 DEGPMANSAPRDSKRRKPKLGEKESMNDELLSIMEPA-SEQDGSAISKKRSIERLDYKRL 750 Query: 576 HDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXN 397 +DE YGNVP APRKR + P G N Sbjct: 751 YDETYGNVPSSSSDDEDWSDITAPRKRNK--CTAEVASAPENGNVSVSRTVSVSDGLKQN 808 Query: 396 LKDIESTPTRRTRQKLDSDGAYNIAAETH-KDSLGPDYAVKVATASASRKNTGSTQRNLK 220 ++ E P R+TRQ + AE S+ K +++ R QR K Sbjct: 809 PEETEHKPRRKTRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYK 868 Query: 219 ETKNT--PKRRTREKL 178 K P R T++ L Sbjct: 869 SFKENQYPDRATKQSL 884 >ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296723 [Fragaria vesca subsp. vesca] Length = 1227 Score = 389 bits (999), Expect = e-105 Identities = 317/993 (31%), Positives = 433/993 (43%), Gaps = 60/993 (6%) Frame = -1 Query: 2802 TMQEKNELGSEHAQSGPMETKI-VESNAEENCQTIESSSPKQNTLQGNHALCSEHAQSEP 2626 T +EK +GS+ Q+ ++T I V S E Q I T + N +E A+ P Sbjct: 196 TREEKQHIGSDSVQNELLQTSIPVSSCGSEQLQLI--------TERVNVTFPNEQARVPP 247 Query: 2625 VETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETKIVESNAQESYKTIE 2446 + S E + + + K GS+ Q+E ++++ + + + Sbjct: 248 KDV----SKTLEPTNNFGFGTSSSVLAEDKQHCGSDVPQNEALQSQYWTRTIRLEVPSCD 303 Query: 2445 SSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSE 2266 + + + + P E ++ + ++ +S Q T + ++ LGS Sbjct: 304 GNEPLLSFCENVDICSLDEKAGLPCEG--LKKTLKQINDVSSGTSYSQPTEENQN-LGSS 360 Query: 2265 HAQSEPMET--KIVESSAREN---------------------------CQTIESSSLKQN 2173 Q EP++T +V S E CQT + S Sbjct: 361 FVQDEPLQTIIPVVSSGGNEQLRVVNENVSVPSLGEQAGLLPEAVSKTCQTDKLSRSLHT 420 Query: 2172 RLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSS--- 2002 +E GS Q EP E + S+ ++++S+ ++ +Q+G + + + Sbjct: 421 ASDQINESGSGSVQCEPQEQRDQLGSLPSQNDQVKNSTAVSSSIGFEQSGPSVDEMNNSV 480 Query: 2001 -------PQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSR 1843 P+ EL H T + S+ T K+A+K++ + Sbjct: 481 IGHLEPPPEDASKDHNKELIKPHTNDATQNSCLEPSE--------TASKNASKNSTQFGC 532 Query: 1842 KKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSISANVXXXXXXXXXXXXXXX 1663 K + SS K RS +SS VL+S + + KP P +S NV Sbjct: 533 KDKRNSSSRRKS----RSLVSSDRVLRSRT--SEKPEAPELSNNVATLDTSNSVANVSNE 586 Query: 1662 K--------------TLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKP 1525 K DEFSR R LRY LNR+ YE +LIDAYS EGWKG S+EK+KP Sbjct: 587 KEGKRKKRKKKHRERVAADEFSRIRSHLRYFLNRINYEKSLIDAYSSEGWKGNSLEKLKP 646 Query: 1524 EQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDL 1345 E+ELQRATSEILR K KIRDLFQ LDSLCAEG SLFD EGQI SEDIFCAKCGS D+ Sbjct: 647 EKELQRATSEILRRKSKIRDLFQRLDSLCAEGMFPESLFDEEGQIDSEDIFCAKCGSLDV 706 Query: 1344 SDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTN 1165 DNDIILCDG CDRGFH CL PPL ++IPP+DEGWLCPGCDCKVDCI LLND QGT+ Sbjct: 707 YADNDIILCDGACDRGFHQHCLEPPLLSEEIPPDDEGWLCPGCDCKVDCIDLLNDSQGTD 766 Query: 1164 LSIDDKWEKVFPEAA-ATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988 LSI D WEKVFPEAA A +G + ++N G DGPE DE VQ Sbjct: 767 LSITDSWEKVFPEAAVAASAGQHQENNQGLPSEDSDDDDYDPDGPETDEEVQEGESSSDE 826 Query: 987 XXXXXXXXDLGA-SPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXX 811 L ++EQYLG+ E +K+ Sbjct: 827 SEYASASDGLETPKTNDEQYLGIPSDDSEDDDFNPDAPDPTEDVKQGSSSSDFTSDSEDL 886 Query: 810 XAL--SDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQ 637 A+ D K S ++ + LL GS + S+ G +K+ I EL S++E P Q Sbjct: 887 AAVLDEDRKSFENGEGPQSSVLEASTLLRGSGGKGSKRG-QKRHFIKDELSSLIESDPGQ 945 Query: 636 GNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVP 457 ++PV KR+ ERLDYKKLHDE YG++P PRKRK G G S Sbjct: 946 DGSTPVSGKRHVERLDYKKLHDEEYGDIP--TSDDEEYIETAVPRKRKK--GAGQVSPGS 1001 Query: 456 PKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETHKDSLGPDYAVK 277 KG + E TP R R+K + + +S P+ ++K Sbjct: 1002 LKGKPSTIKKGKTTKDIKDDPDKNEHTPRRTPRRK----------SSANDNSSSPNESLK 1051 Query: 276 VA--TASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKI 103 + + S S + GST R L Sbjct: 1052 SSPKSGSTSGRAKGSTYRRL---------------------------------------- 1071 Query: 102 ATATPSRTLGKAVTQGLLESFRENEYPTRDKKE 4 G+AVTQ L SF+EN+YP R KE Sbjct: 1072 ---------GEAVTQRLYTSFKENQYPDRSMKE 1095 >ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis] gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1, putative [Ricinus communis] Length = 896 Score = 387 bits (995), Expect = e-104 Identities = 254/588 (43%), Positives = 311/588 (52%), Gaps = 21/588 (3%) Frame = -1 Query: 1878 KDAAKSTNRSSRK-KNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPD-----PSIS 1717 KDA +++R R+ K SR +KYMLR S V+Q S K P+ P++S Sbjct: 187 KDAVSNSSRLGRRVKTTAKSR---KKYMLRCLRRSDRVMQYRSQEKPKAPESSTNLPNVS 243 Query: 1716 ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSME 1537 +NV + DE+S RK LRYLLNR+GYE +LI AYS EGWKGLS+E Sbjct: 244 SNVEKTRKKKKKRERKSVEA--DEYSIIRKNLRYLLNRIGYEQSLITAYSAEGWKGLSLE 301 Query: 1536 KIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCG 1357 K+KPE+ELQRATSEILR K KIRDLFQ +DSLC EGR SLFDS+GQI SEDIFCAKCG Sbjct: 302 KLKPEKELQRATSEILRRKSKIRDLFQRIDSLCGEGRFPESLFDSDGQISSEDIFCAKCG 361 Query: 1356 SKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDV 1177 SKDL+ DNDIILCDG CDRGFH CL PPL K+ IPP+D+GWLCPGCDCKVDCI LLN+ Sbjct: 362 SKDLTADNDIILCDGACDRGFHQYCLVPPLLKEDIPPDDQGWLCPGCDCKVDCIDLLNES 421 Query: 1176 QGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQ-XXXX 1000 QGTN+SI D WEKVFPEAAA G NPD NFG D PE+DE+ Q Sbjct: 422 QGTNISISDSWEKVFPEAAA--PGQNPDQNFGPPSDDSDDNDYDPDIPEIDEKSQGDESS 479 Query: 999 XXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXX 820 +L A P ++Q LGL D+ +K+ Sbjct: 480 SDDSDDSDFTSDELEAPPGDKQQLGLSSEDSGDDDYDPDAPDLDDIVKEESSSSDFTSDS 539 Query: 819 XXXXALSDGKGSSGTNVLVHSSVDGAKLLTG----SSNRRSQMGRKKKQSINAELLSILE 652 A D SG D ++ G S+ S+ GRKKKQS+ +ELLSI E Sbjct: 540 EDLAATLDNNELSGE--------DERRISVGTRGDSTKEGSKRGRKKKQSLQSELLSIEE 591 Query: 651 PGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKND----- 487 P PSQ ++P+ KRN ERLDYKKL+DE YGNV KR+ Sbjct: 592 PNPSQDGSAPISGKRNVERLDYKKLYDETYGNVSSDSSDDEDFTDDVGAVKRRKSTQAAL 651 Query: 486 -GGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDSDGAYNIAAETH 310 GN SV +LK+ E P +R+RQ+L S+ + H Sbjct: 652 GSANGNASVT---------------DTGKQDLKETEYVP-KRSRQRLISENTSITPTKAH 695 Query: 309 KDSLGPDYAVKVATASASRK----NTGSTQRNLKETKNTPKRRTREKL 178 + + K S R+ T R+ KE + P R +E L Sbjct: 696 EGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQ-YPDRDRKEHL 742 >emb|CAN68079.1| hypothetical protein VITISV_006312 [Vitis vinifera] Length = 611 Score = 379 bits (974), Expect = e-102 Identities = 274/656 (41%), Positives = 335/656 (51%), Gaps = 13/656 (1%) Frame = -1 Query: 2478 SNAQESYKTIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQN 2299 S AQES +T +SSS KQN L+ +L SE SE E K Sbjct: 4 SPAQESNRTRKSSSPKQNILEEARKL-SESVCSESSEQK--------------------- 41 Query: 2298 TSQGKHELGSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPM 2119 SE+ Q EP E V S NC E S L + G LG A Sbjct: 42 -------RXSENGQHEPAEISPVLS----NCIVTEQSELPPEDV-GDTILGLPPAD---- 85 Query: 2118 ETKIVESSVLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSET 1939 + ++S+ E+ + K + T+ Q G E + I+ +LG QSE Sbjct: 86 ---VTKNSLXEHLGLPPEDAIKNDGTE--QLGXFPEVVTKSSII----EKLG----QSEP 132 Query: 1938 LEIRIVGSDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQS 1759 + +D Q G+ PKD A N+ +++ +KRKY LRSS+S VL+S Sbjct: 133 PPENVARYSGLD--QSGSAPKDLA----------NKRTAKLVKRKYKLRSSVSGSRVLRS 180 Query: 1758 SSHGTSKPPDPSIS-ANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNL 1582 S K PS + N KT DEF+R RK LRYLLNRM YE NL Sbjct: 181 RSQEKPKASQPSDNFVNASASRERKGRKKKRMNKTTADEFARIRKHLRYLLNRMSYEQNL 240 Query: 1581 IDAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDS 1402 IDAYS EGWKG S+EK+KPE+ELQRA+SEI R KL IRDLFQHLDSLCAEGR SLFDS Sbjct: 241 IDAYSAEGWKGQSVEKLKPEKELQRASSEISRRKLXIRDLFQHLDSLCAEGRFPESLFDS 300 Query: 1401 EGQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCP 1222 EGQI SEDIFCAKC SKD+S DNDIILCDG CDRGFH CL PPL K++IPP+DEGWLCP Sbjct: 301 EGQIDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCP 360 Query: 1221 GCDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXX 1042 CDCKVDC+ LLND QGT LS+ D WEKVFPEAAA +G+N D+N GF Sbjct: 361 ACDCKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAA--AGNNQDNNSGFSSDDSEDNDYDP 418 Query: 1041 DGPELDERVQ------------XXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXX 898 D PE+DE+ Q D+ SP+ EQ LGL Sbjct: 419 DCPEVDEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDD 478 Query: 897 XXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSN 718 DEQ+ + A D + S + DG + Sbjct: 479 DFDPDAPEIDEQVNQGSSSSDFTSDSEDFTATLDRRNFS-------DNEDGL-------D 524 Query: 717 RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVP 550 + + GRKKK ++ ELLS+LE Q NA P+ +KR+ ERLDYKKLHD + + P Sbjct: 525 EQRRFGRKKKDTLKDELLSVLESNSGQDNA-PLSAKRHVERLDYKKLHDVSAHDSP 579 >ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus] Length = 1061 Score = 374 bits (959), Expect = e-100 Identities = 300/912 (32%), Positives = 420/912 (46%), Gaps = 35/912 (3%) Frame = -1 Query: 2631 EPVETKILGSNAQENCQTIESSSPKQNILQGKLELGSEHAQSEPMETKIVESNAQESYKT 2452 E V+ + N +E ++ + S N + E + E E + ES + + Sbjct: 19 EAVQEAMASDNMEERDESTGTESRPNNNAEAVQEAKASDNMKERDENTVTESRPNNNAEA 78 Query: 2451 IESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELG 2272 + N E E+ +E KI E + QE ++E L +++ K+ G Sbjct: 79 AQEGKASDNM-----EERDENTDTESRPNKIAE-AVQEAKASVEVEVLTCLSNEAKYS-G 131 Query: 2271 SEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSV 2092 + + P E S++ + E + ++QN ELGS + SE E S Sbjct: 132 YQELGTTP------EFSSKIDGPDEEKAGVQQNM-----ELGSGYLLSELSEKDNQTISN 180 Query: 2091 LENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRI--VG 1918 + ++E+ + N K + E + + E +EL LE V +E +G Sbjct: 181 HADNDRVEAGNLLSNDKDTKNLKLSIEDEATT--LLNECSELPLEDVTKNYIEKMNPPIG 238 Query: 1917 --SDSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGT 1744 + I+ L T P S ++ S +K+++ + K+ Y LRS +SS VL+S + Sbjct: 239 DLTQITSIQSLETIP-----SNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEK 293 Query: 1743 SKPPDPS-----ISANVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLI 1579 +K P+ S +A DE+S R LRYLLNR+ YE +LI Sbjct: 294 AKAPERSNDLNNFTAEEDGKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLI 353 Query: 1578 DAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSE 1399 +AYS EGWKG S +K+KPE+ELQRA++EI+R KLKIRDLFQ +D+LCAEGRL SLFDSE Sbjct: 354 EAYSSEGWKGFSSDKLKPEKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSE 413 Query: 1398 GQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPG 1219 GQI SEDIFCAKCGSK+LS +NDIILCDGICDRGFH CL PPL IPP+DEGWLCPG Sbjct: 414 GQIDSEDIFCAKCGSKELSLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPG 473 Query: 1218 CDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXD 1039 CDCK DC+ LLN+ QG+NLSI D WEKV+PEAAA +G N D G D Sbjct: 474 CDCKDDCLDLLNEFQGSNLSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPD 533 Query: 1038 GPEL-------------DERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXX 898 P+ ++ L S +++QYLGL Sbjct: 534 VPDTIDQDNELSSDESSSDQSNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDN 593 Query: 897 XXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSN 718 DE +++ AL D SS LV S + + N Sbjct: 594 DYDPSVPELDEGVRQESSSSDFTSDSEDLAAL-DNNCSSKDGDLVSSLNNTLPV----KN 648 Query: 717 RRSQMGRKKKQSINAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXX 538 Q K +++ EL S+L+ GP + PV +R ERLDYKKLHDE YGNVP Sbjct: 649 SNGQSSGPNKSALHNELSSLLDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSS 708 Query: 537 XXXXXXXXEAPRKRKNDGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPT--RR 364 ++ R D G T PK +L ++++ + RR Sbjct: 709 DDTYGSTLDSSDDRGWDSG---TRKRGPK----TLVLALSNNGSNDDLTNVKTKRSYKRR 761 Query: 363 TRQKLDSDGAYNIAAETHKDSLGPDYAVKVATASASRKNTGSTQRNLK---ETKNTPKRR 193 TRQK + N ET D+ +VK +T+S++R+ + L + PKR Sbjct: 762 TRQKPGAINVNNSVTETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRA 821 Query: 192 TREKLDFEGADNLA------AETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFREN 31 T++ L E L T P SSGK A ++ ++ + G L S E Sbjct: 822 TKQSLAQELGLGLKQVSKWFENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGEL-SKNEP 880 Query: 30 EYPT--RDKKEN 1 E T RD N Sbjct: 881 ESATCFRDTDSN 892 >ref|XP_006829269.1| hypothetical protein AMTR_s00001p00272780 [Amborella trichopoda] gi|548834248|gb|ERM96685.1| hypothetical protein AMTR_s00001p00272780 [Amborella trichopoda] Length = 800 Score = 368 bits (945), Expect = 8e-99 Identities = 258/694 (37%), Positives = 339/694 (48%), Gaps = 25/694 (3%) Frame = -1 Query: 2013 ESSSPQKIMF-GEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD------------ 1873 E + P+ ++ E+ E E E+ +G S++I++L P D Sbjct: 8 EQNCPRSLILDSERCSTSFEQTTKE--EVPSIGVHSLEIERLTPAPIDPGYAGPNSGIIG 65 Query: 1872 ---AAK-STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSK--PPDPSISAN 1711 A+K +++R K +V+S+ R Y LRSS + VL+ S GTSK P S S+ Sbjct: 66 RNTASKGNSSRQEWKGKKVASQVGSRSYFLRSSSNGVRVLRPRSIGTSKTSPAASSKSSP 125 Query: 1710 VXXXXXXXXXXXXXXXKTL-NDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEK 1534 + + L NDE+SRTRK +RYLL R+ +E LIDAYSGEGWKG S EK Sbjct: 126 IMPERRKSRREKRKLKEVLSNDEYSRTRKSVRYLLARINFEQGLIDAYSGEGWKGQSQEK 185 Query: 1533 IKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGS 1354 +KPE+EL+RA EI+R KL+IRDLFQHL +LC EGR+ SLFDSEG+I SEDIFCAKCGS Sbjct: 186 VKPEKELKRAEDEIVRRKLRIRDLFQHLQTLCEEGRIHESLFDSEGKIYSEDIFCAKCGS 245 Query: 1353 KDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQ 1174 KD+ DNDIILCDGIC+RGFH MCL PPL K++IPP DEGWLCPGC+CK C+ L+ND Sbjct: 246 KDVPPDNDIILCDGICNRGFHQMCLVPPLLKEQIPPGDEGWLCPGCECKAFCVDLVNDYL 305 Query: 1173 GTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXX 994 GT+L I+D WEKVF EAAA SGD D+ G DGP++D+ Q Sbjct: 306 GTDLLIEDGWEKVFAEAAALASGDKQYDDLGLPSDDSEDNDYNPDGPDIDDEAQNSSSSS 365 Query: 993 XXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXX 814 D +S +++ L Sbjct: 366 EESDMTSGSSDSESSSSDDEASSL------------------------------------ 389 Query: 813 XXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQG 634 +G GSS + + + L GS R +Q KK +N+ELLSILEP + Sbjct: 390 ----DEGSGSSLPGPFLSADLS----LNGSEGRSNQ----KKPRMNSELLSILEPESNGK 437 Query: 633 NASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGIGNTSVVPP 454 SP+ KRNRERLDYKKLHDE YGNV + KRK GG+G + +P Sbjct: 438 VVSPLPGKRNRERLDYKKLHDEDYGNVSSDSSDDEDWVAMDT-SKRKKSGGVGRGTRLPT 496 Query: 453 KGXXXXXXXXXXXXXXXXNLKDIESTP----TRRTRQKLDSDGAYNIAAETHKDSLGPDY 286 K +LK ES P T+ QK +S+ ++ TH Sbjct: 497 K----------HCTLSPGSLKIYESIPSLPETQILLQKPNSETIQVGSSLTHNIPGNSQI 546 Query: 285 AVKVATASASRKNTGSTQRNLKETKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGK 106 V +AS + + G G S + Sbjct: 547 QVHGVSASGVKSHVGG-------------------------------------GEHISSR 569 Query: 105 IATATP-SRTLGKAVTQGLLESFRENEYPTRDKK 7 TP S+ G+ VTQ L SF+EN YPT++ + Sbjct: 570 NGPVTPLSKRFGRLVTQSLHNSFKENMYPTKETR 603 >ref|XP_007143079.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris] gi|561016269|gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris] Length = 826 Score = 363 bits (931), Expect = 3e-97 Identities = 277/824 (33%), Positives = 382/824 (46%), Gaps = 34/824 (4%) Frame = -1 Query: 2454 TIESSSLKQNTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHEL 2275 T S +Q+ L K +LG+EH E E V S+ +S+ + N ++ L Sbjct: 15 TAGGMSTEQSGLSEKTQLGTEHLLKEQRESGTVLTSS---VMDEKSNQVSANATENSVIL 71 Query: 2274 GSEHAQSEPMETKIVESSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESS 2095 Q + +NCQT+E S L+Q+ E+ + + EP Sbjct: 72 LPAPPQHD----------LEKNCQTVEGSHLQQST---DKEVSLQLSNDEPENPS----- 113 Query: 2094 VLENCQKIESSSPKQNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGS 1915 Q + + P ++ +GQ + SSP N + + Sbjct: 114 -----QPLSENEPVESAPAFAGDGQ--KQSSPALANTSYVNNM-----------LDPPSG 155 Query: 1914 DSVDIKQLGTPPKDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKP 1735 D+V + K + N R+K + +S+ LK+ YMLRS SS L+S + K Sbjct: 156 DAV----INCSEKVSNSPANSQLRRKGKKNSKFLKKTYMLRSVGSSDRALRSKTKENPKT 211 Query: 1734 PDPSISA--------NVXXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLI 1579 P+P+ + N + D+FSR + LRYLLNR+GYE NLI Sbjct: 212 PEPNSNLVDCNNNNNNDGVKKKSFKKKRKSGEVGITDQFSRIKSHLRYLLNRIGYEKNLI 271 Query: 1578 DAYSGEGWKGLSMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSE 1399 DAYS EGWKG SMEK+KPE+ELQRA SEI+R KL IR+LF++LDSLC EG+L SLFDSE Sbjct: 272 DAYSAEGWKGYSMEKLKPEKELQRAKSEIIRRKLNIRELFRNLDSLCTEGKLPESLFDSE 331 Query: 1398 GQICSEDIFCAKCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPG 1219 G+I SEDIFCAKC SK+LS +NDIILCDG+CDRGFH +CL+PPL + IPP DEGWLCPG Sbjct: 332 GEIDSEDIFCAKCHSKELSSNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPG 391 Query: 1218 CDCKVDCIYLLNDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXD 1039 CDCK DC+ L+ND GT+LSI D WE+VFPEAAA +G+ D+N G + Sbjct: 392 CDCKDDCMDLINDSFGTSLSISDTWERVFPEAAA-AAGNKTDNNSGLPSDDSDDDDYNPN 450 Query: 1038 GPELDERVQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI 859 GPE D +V+ +L S H +QYLGL D ++ Sbjct: 451 GPE-DVKVEGDESSSDESDYASASENLEGS-HGDQYLGLPSDDSDDGDYDPAAPDADSKV 508 Query: 858 KKXXXXXXXXXXXXXXXALSDGKGSSGTNVLVHS-SVDGAKLLTGSSNRRSQMGRKKKQS 682 A S G + + S S+D K L R+ + G KK S Sbjct: 509 NVESSSSDFTSDSDDLPAAIVENTSPGQDGEIRSASLDDVKCLNSYGKRKGKAG--KKLS 566 Query: 681 INAELLSILEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPR 502 + EL S+LEP Q ++PV +RN ERLDYKKL+DEAY + R Sbjct: 567 MADELSSLLEPDSGQEGSTPVSGRRNLERLDYKKLYDEAYHS--DTSEDEDWTATVTPSR 624 Query: 501 KRKN-------DGGIGNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPTRRTRQKLDS 343 K+K DG N S+ PK +L D + +R+ + K Sbjct: 625 KKKGNATPVSPDGNASNNSMHTPKRNGHQKKFENTKNSPAKSLDDHVKSDSRKQKSK--- 681 Query: 342 DGAYNIAAE------------------THKDSLGPDYAVKVATASASRKNTGSTQRNLKE 217 AY E T K+SL + + + NT + R+ + Sbjct: 682 SSAYKRLGEAVVERLHISFKENQYPDRTTKESLAQELGLTCQQVAKWFDNTRWSFRHSSQ 741 Query: 216 TKNTPKRRTREKLDFEGADNLAAETPKDNLGPDSSGKIATATPS 85 + R T ++ A+N E + P+ SG+I+ A S Sbjct: 742 METNSGRNTLQQSTDSRAENEGGEC--ELRSPEVSGEISKAPSS 783 >ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max] Length = 820 Score = 361 bits (926), Expect = 1e-96 Identities = 275/809 (33%), Positives = 387/809 (47%), Gaps = 51/809 (6%) Frame = -1 Query: 2355 EPSAQENCQTIE------SSSLKQNTSQGKHELGS-------EHAQSEPMETKIVESSA- 2218 EP A E C+ E S L++ + EL S ++ E +++ A Sbjct: 17 EPMATEQCELSEKTPQIGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAP 76 Query: 2217 -----RENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPK 2053 +NCQT+E S L+Q+ + E S ++ E K S EN Q S P Sbjct: 77 PQHDFEKNCQTVEGSCLEQSTV----EQVSVDLSNDKSENKCKPLS--ENVQ----SEPV 126 Query: 2052 QNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD 1873 ++ +GQ SSP + NEL L+ + +V + + +++ P Sbjct: 127 ESIPAFVVDGQM--QSSPAQANMSSVNEL-LDQPSGD-----VVNNITNCSEKMSNSP-- 176 Query: 1872 AAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-----ANV 1708 ++ SR+K + +S+ LK+KYMLRS SS L+S + K P+P+ + +N Sbjct: 177 ----SHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGNSND 232 Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528 + + D+FSR R LRYLLNR+ YE++LIDAYSGEGWKG SMEK+K Sbjct: 233 GVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLK 292 Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348 PE+ELQRA SEILR KLKIRDLF++LDSLCAEG+ SLFDS G+I SEDIFCAKC SK+ Sbjct: 293 PEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKE 352 Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168 LS +NDIILCDG+CDRGFH +CL+PPL + IPP DEGWLCPGCDCK DC+ L+ND GT Sbjct: 353 LSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGT 412 Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988 +LSI D WE+VFPEAA+ +G+N D+N G +G + D +++ Sbjct: 413 SLSISDTWERVFPEAASF-AGNNMDNNLGLPSDDSDDDDYNPNGSD-DVKIEGDESSSDE 470 Query: 987 XXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXX 808 L HE+QYLGL D ++ + Sbjct: 471 SEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLA 530 Query: 807 ALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNA 628 A + S G + G N + G+ K S+ EL S+LEP QG Sbjct: 531 AAFEDNTSPGQD--------------GGINSSKKKGKVGKLSMADELSSLLEPDSGQGGP 576 Query: 627 SPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAP-RKRKNDGGIGNTSVVPPK 451 +PV KR+ ERLDYKKL++E Y + AP RK+K G + T V P Sbjct: 577 TPVSGKRHVERLDYKKLYEETYHS---DTSDDEDWNDAAAPSRKKKLTGNV--TPVSPNA 631 Query: 450 GXXXXXXXXXXXXXXXXNLKDIESTPTRR--------TRQKLDSDGAYNIAAE------- 316 +++ S+PT+ +R K A+ E Sbjct: 632 NASNNSIHTLKRNAHQNKVENTNSSPTKSLDGRSKSGSRDKRSGSSAHKRLGEAVVQRLH 691 Query: 315 -THKDSLGPDYAVKVATASASR----------KNTGSTQRNLKETKNTPKRRTREKLDFE 169 + K++ PD + K + A NT + R+ + + R + Sbjct: 692 KSFKENQYPDRSTKESLAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDG 751 Query: 168 GADNLAAETPKDNLGPDSSGKIATATPSR 82 A+N E +++ P+ SGK + T SR Sbjct: 752 RAEN-EGEKQCESMSPEVSGKNSKTTSSR 779 >ref|XP_006605989.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Glycine max] Length = 751 Score = 359 bits (921), Expect = 5e-96 Identities = 240/622 (38%), Positives = 326/622 (52%), Gaps = 24/622 (3%) Frame = -1 Query: 2355 EPSAQENCQTIE------SSSLKQNTSQGKHELGS-------EHAQSEPMETKIVESSA- 2218 EP A E C+ E S L++ + EL S ++ E +++ A Sbjct: 17 EPMATEQCELSEKTPQIGSEGLEREQKELLTELTSFVIDEKSNQVSADVTENSVIQLPAP 76 Query: 2217 -----RENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPK 2053 +NCQT+E S L+Q+ + E S ++ E K S EN Q S P Sbjct: 77 PQHDFEKNCQTVEGSCLEQSTV----EQVSVDLSNDKSENKCKPLS--ENVQ----SEPV 126 Query: 2052 QNTTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKD 1873 ++ +GQ SSP + NEL L+ + +V + + +++ P Sbjct: 127 ESIPAFVVDGQM--QSSPAQANMSSVNEL-LDQPSGD-----VVNNITNCSEKMSNSP-- 176 Query: 1872 AAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSIS-----ANV 1708 ++ SR+K + +S+ LK+KYMLRS SS L+S + K P+P+ + +N Sbjct: 177 ----SHSQSRRKGKRNSKLLKKKYMLRSLGSSGRALRSRTKEKPKEPEPTSNLVDGNSND 232 Query: 1707 XXXXXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIK 1528 + + D+FSR R LRYLLNR+ YE++LIDAYSGEGWKG SMEK+K Sbjct: 233 GVKRKSGRKKKKRREEGITDQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSMEKLK 292 Query: 1527 PEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKD 1348 PE+ELQRA SEILR KLKIRDLF++LDSLCAEG+ SLFDS G+I SEDIFCAKC SK+ Sbjct: 293 PEKELQRAKSEILRRKLKIRDLFRNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKE 352 Query: 1347 LSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGT 1168 LS +NDIILCDG+CDRGFH +CL+PPL + IPP DEGWLCPGCDCK DC+ L+ND GT Sbjct: 353 LSTNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGT 412 Query: 1167 NLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXX 988 +LSI D WE+VFPEAA+ +G+N D+N G +G + D +++ Sbjct: 413 SLSISDTWERVFPEAASF-AGNNMDNNLGLPSDDSDDDDYNPNGSD-DVKIEGDESSSDE 470 Query: 987 XXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXX 808 L HE+QYLGL D ++ + Sbjct: 471 SEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVDCKVNEESSSSDFTSDSEDLA 530 Query: 807 ALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNA 628 A + S G + G N + G+ K S+ EL S+LEP QG Sbjct: 531 AAFEDNTSPGQD--------------GGINSSKKKGKVGKLSMADELSSLLEPDSGQGGP 576 Query: 627 SPVLSKRNRERLDYKKLHDEAY 562 +PV KR+ ERLDYKKL++E Y Sbjct: 577 TPVSGKRHVERLDYKKLYEETY 598 >ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus] Length = 749 Score = 356 bits (913), Expect = 4e-95 Identities = 246/652 (37%), Positives = 323/652 (49%), Gaps = 31/652 (4%) Frame = -1 Query: 1863 STNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPS-----ISANVXXX 1699 S ++ S +K+++ + K+ Y LRS +SS VL+S + +K P+ S +A Sbjct: 22 SNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGK 81 Query: 1698 XXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQ 1519 DE+S R LRYLLNR+ YE +LI+AYS EGWKG S +K+KPE+ Sbjct: 82 RKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEK 141 Query: 1518 ELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSD 1339 ELQRA++EI+R KLKIRDLFQ +D+LCAEGRL SLFDSEGQI SEDIFCAKCGSK+LS Sbjct: 142 ELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSL 201 Query: 1338 DNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLS 1159 +NDIILCDGICDRGFH CL PPL IPP+DEGWLCPGCDCK DC+ LLN+ QG+NLS Sbjct: 202 ENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLS 261 Query: 1158 IDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPEL-------------DER 1018 I D WEKV+PEAAA +G N D G D P+ ++ Sbjct: 262 ITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQ 321 Query: 1017 VQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXX 838 L S +++QYLGL DE +++ Sbjct: 322 SNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSS 381 Query: 837 XXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSI 658 AL D SS LV S + + N Q K +++ EL S+ Sbjct: 382 DFTSDSEDLAAL-DNNCSSKDGDLVSSLNNTLPV----KNSNGQSSGPNKSALHNELSSL 436 Query: 657 LEPGPSQGNASPVLSKRNRERLDYKKLHDEAYGNVPXXXXXXXXXXXXEAPRKRKNDGGI 478 L+ GP + PV +R ERLDYKKLHDE YGNVP ++ R D G Sbjct: 437 LDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSG- 495 Query: 477 GNTSVVPPKGXXXXXXXXXXXXXXXXNLKDIESTPT--RRTRQKLDSDGAYNIAAETHKD 304 T PK +L ++++ + RRTRQK + N ET D Sbjct: 496 --TRKRGPK----TLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVD 549 Query: 303 SLGPDYAVKVATASASRKNTGSTQRNLK---ETKNTPKRRTREKLDFEGADNLA------ 151 + +VK +T+S++R+ + L + PKR T++ L E L Sbjct: 550 TAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSKWF 609 Query: 150 AETPKDNLGPDSSGKIATATPSRTLGKAVTQGLLESFRENEYPT--RDKKEN 1 T P SSGK A ++ ++ + G L S E E T RD N Sbjct: 610 ENTRWSTRHPSSSGKKAKSSSRMSIYLSQASGEL-SKNEPESATCFRDTDSN 660 >ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Glycine max] Length = 820 Score = 352 bits (902), Expect = 8e-94 Identities = 239/619 (38%), Positives = 331/619 (53%), Gaps = 4/619 (0%) Frame = -1 Query: 2406 ELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTSQGKHELGSEHAQSEPMETKIVE 2227 ++GSE ++E E+ T +SS+ + S + +E+A + E ++ Sbjct: 32 QIGSEGLENEQKELG-----------TELTSSVIEEKSNQVSAIVTENAVIQLPEP--LQ 78 Query: 2226 SSARENCQTIESSSLKQNRLQGKHELGSEHAQSEPMETKIVESSVLENCQKIESSSPKQN 2047 ++NCQT+E S L+Q+ ++ ++ + + +P E K S EN Q S P ++ Sbjct: 79 HDLQKNCQTVEGSCLEQSTVE---QVTVDLSNDKP-ENKCKPLS--ENVQ----SEPVES 128 Query: 2046 TTQEKQNGQTTESSSPQKIMFGEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPPKDAA 1867 GQ S+P + NEL + D+V+ K + Sbjct: 129 IPAVVVEGQM--QSNPSQANMSSVNEL-----------LDQPSGDAVNNISSNCSEKMSN 175 Query: 1866 KSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDPSISA----NVXXX 1699 T+ SR+K + +S+ LK KYMLRS SS L+S + K P+P+ + N Sbjct: 176 SPTHSQSRRKGKKNSKLLK-KYMLRSLGSSDRALRSRTKEKPKEPEPTSNLVDGNNNGVK 234 Query: 1698 XXXXXXXXXXXXKTLNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGLSMEKIKPEQ 1519 + + ++FSR R LRYLLNR+ YE++LIDAYSGEGWKG S+EK+KPE+ Sbjct: 235 RKSGRKKKKRKEEGITNQFSRIRSHLRYLLNRISYENSLIDAYSGEGWKGYSIEKLKPEK 294 Query: 1518 ELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCAKCGSKDLSD 1339 ELQRA SEILR KLKIRDLFQ+LDSLCAEG+ SLFDS G+I SEDIFCAKC SK+LS Sbjct: 295 ELQRAKSEILRRKLKIRDLFQNLDSLCAEGKFPESLFDSAGEIDSEDIFCAKCQSKELST 354 Query: 1338 DNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLLNDVQGTNLS 1159 +NDIILCDG+CDRGFH +CL+PP+ + IPP DEGWLCPGCDCK DC+ L+ND GT+LS Sbjct: 355 NNDIILCDGVCDRGFHQLCLDPPMLTEDIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLS 414 Query: 1158 IDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPELDERVQXXXXXXXXXXX 979 I D WE+VFPEAA+ +G+N D+N G +GP+ D +V+ Sbjct: 415 ISDTWERVFPEAASF-AGNNMDNNSGVPSDDSDDDDYNPNGPD-DVKVEGDESSSDESEY 472 Query: 978 XXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQIKKXXXXXXXXXXXXXXXALS 799 L HE+QYLGL + ++ + A Sbjct: 473 ASASEKLEGGSHEDQYLGLPSEDSDDGDYDPDAPDVECKVNEESSSSDFTSDSEDLAAAI 532 Query: 798 DGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLSILEPGPSQGNASPV 619 + S G DG SS ++ ++G KK S+ EL S+LEP Q +PV Sbjct: 533 EDNTSPGQ--------DGG---ISSSKKKGKVG--KKLSLPDELSSLLEPDSGQEAPTPV 579 Query: 618 LSKRNRERLDYKKLHDEAY 562 KR+ ERLDYKKL++E Y Sbjct: 580 SGKRHVERLDYKKLYEETY 598 >ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1 [Cicer arietinum] Length = 995 Score = 337 bits (865), Expect = 1e-89 Identities = 240/753 (31%), Positives = 358/753 (47%), Gaps = 72/753 (9%) Frame = -1 Query: 2604 SNAQENCQTIESSSPKQNILQGKLELGSEHAQSEP------METKIVESNAQESYKTIES 2443 S +Q +C T + + ++ ++GSE +E + +++ + + Y + Sbjct: 8 SISQYDCTTEKMGTEHCDLSVKSAQIGSECLDNEQSVVDTVLTNSVIDEKSNQDYVNMTE 67 Query: 2442 SSLKQ----------NTLQGKHELGSEHAQSEPMEMKIVEPSAQENCQTIESSSLKQNTS 2293 +S+ Q NT Q + + SE + + + + + C + + ++ Sbjct: 68 NSVVQLPDPTHHYIENTCQTLEGSCLQQSTSEQVPVWLCDGKLENKCPQNVQNEAGEMSN 127 Query: 2292 QGKHELGSEHAQSEPMETKIV--------------ESSARENCQTIESSSLKQNRLQGKH 2155 + E QS P + +V ++ E + + + +++ Sbjct: 128 AVAALVVEEQTQSVPAQVNVVLDPPSGDVAKNVSFQNEPGEMSEAVAALVVEEQTQSVPA 187 Query: 2154 ELG-------SEHAQSEPMETKIVESS--VLENCQKIESSSPKQNTTQEKQNGQTTESSS 2002 ++ + A+S + ++ E S V++ + S P Q T S Sbjct: 188 QVNVVLDPPSGDVAESVSFQNELAEMSDAVIDVVEDQTQSGPAQVNTDSVNEPLDPPSGE 247 Query: 2001 PQKIMF-----GEKNELGLEHVQSETLEIRIVGSDSVDIKQLGTPP-------------- 1879 KI+ GE ++ + V+ +T I V+ + P Sbjct: 248 VAKIVNLQNEPGEMSDAVIGIVEYQTQSIPXXXXXPVNTYSVNDPSDPPSEDVVKNISSD 307 Query: 1878 ---KDAAKSTNRSSRKKNQVSSRPLKRKYMLRSSISSPTVLQSSSHGTSKPPDP-----S 1723 + + S + SR K + +S+ L +KY+LRS SS L+S + K P+P Sbjct: 308 CSERKSKSSAHLRSRHKGKSNSK-LSKKYILRSLGSSDRALRSRTRDKPKDPEPINNVVD 366 Query: 1722 ISANVXXXXXXXXXXXXXXXKT-LNDEFSRTRKRLRYLLNRMGYEHNLIDAYSGEGWKGL 1546 +S + K +ND++S+ R LRYLLNR+ YE NLIDAYSGEGWKG Sbjct: 367 VSNDAMKTKRGKKKKKKRPRKEGINDQYSKIRAHLRYLLNRISYEQNLIDAYSGEGWKGY 426 Query: 1545 SMEKIKPEQELQRATSEILRCKLKIRDLFQHLDSLCAEGRLQPSLFDSEGQICSEDIFCA 1366 S+EK+KPE+E+QRA SEILR KLKIRDLFQ+LDSLCAEGRL SLFDS+G+I SEDIFCA Sbjct: 427 SLEKLKPEKEIQRAKSEILRRKLKIRDLFQNLDSLCAEGRLPESLFDSKGEIDSEDIFCA 486 Query: 1365 KCGSKDLSDDNDIILCDGICDRGFHLMCLNPPLSKDKIPPEDEGWLCPGCDCKVDCIYLL 1186 KC +K L DNDIILCDG CDRGFH +CL+PPL + IPP DEGWLCPGCDCK DCI L+ Sbjct: 487 KCQTKVLGTDNDIILCDGACDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCIELV 546 Query: 1185 NDVQGTNLSIDDKWEKVFPEAAATGSGDNPDDNFGFXXXXXXXXXXXXDGPE----LDER 1018 ND+ GTNLS+ + WE+VFPE AAT +G D N G +GPE D Sbjct: 547 NDLLGTNLSLTNTWERVFPE-AATAAGSILDHNSGLPSDDSEDDDYNPNGPEDVEVEDAE 605 Query: 1017 VQXXXXXXXXXXXXXXXXDLGASPHEEQYLGLXXXXXXXXXXXXXXXXXDEQI-KKXXXX 841 V+ L S HE+QYLGL ++ ++ Sbjct: 606 VEGDESSSDESEYASASEKLEDSRHEDQYLGLPSEDSEDDDFDPDAPDLGGKVTEESSSS 665 Query: 840 XXXXXXXXXXXALSDGKGSSGTNVLVHSSVDGAKLLTGSSNRRSQMGRKKKQSINAELLS 661 + D + + +D K L G S + ++ +KK S+ EL S Sbjct: 666 DFTSDSEDLAATIKDNMSTGQDGDITSPLLDDVKNLKGFSRQNHKV--RKKPSMADELSS 723 Query: 660 ILEPGPSQGNASPVLSKRNRERLDYKKLHDEAY 562 +L+ Q + +P+ +KRN ERLDY+KL++E Y Sbjct: 724 LLKSDLGQEDITPITAKRNVERLDYQKLYEETY 756