BLASTX nr result
ID: Catharanthus23_contig00009008
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00009008
         (3267 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain ...   505   e-140
ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain ...   501   e-139
ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu...   484   e-134
ref|XP_002300247.2| homeobox family protein [Populus trichocarpa...   483   e-133
ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296...   476   e-131
gb|EXB76647.1| Homeobox protein [Morus notabilis]                     472   e-130
emb|CBI22504.3| unnamed protein product [Vitis vinifera]              465   e-128
ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit...   465   e-128
ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c...   464   e-128
gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus pe...   461   e-127
ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isof...   458   e-126
ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citr...   457   e-125
gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type ...   455   e-125
ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain ...   451   e-124
ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isof...   447   e-122
ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain ...   445   e-122
gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus...   433   e-118
sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodo...   429   e-117
ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cuc...
415 e-113 ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204... 415 e-113 >ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain protein-like [Solanum lycopersicum] Length = 796 Score = 505 bits (1301), Expect = e-140 Identities = 310/740 (41%), Positives = 411/740 (55%), Gaps = 15/740 (2%) Frame = -1 Query: 2712 MGSATCAPENMGSATCDAH--------ENHLDLQHSEPAEKDATNVASESVPHEGTSLPS 2557 +G+ + +PE +A H EN Q E E N+ + P Sbjct: 4 LGNTSVSPEKARTAGGGHHTASAGNMSENLGADQSRESCENTVQNLNQSEYREKSPGQPR 63 Query: 2556 RKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNEF 2380 +++ S P S R+LRS+S+EK ASE+K V + A E +K K+RK + K I NEF Sbjct: 64 KRKSISGSPISSTRLLRSKSKEKSGASEAKNTVVTHDATEEKKRKRRKKKHSKHIAANEF 123 Query: 2379 SRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDL 2200 +RIR HLRYLL R+KYE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRDL Sbjct: 124 TRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRDL 183 Query: 2199 FQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFC 2020 FQ+LD L EGR P SLFD+EG+IDSEDIFCAKCGS DL +NDIILCDGACERGFHQ C Sbjct: 184 FQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQLC 243 Query: 2019 LDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASG 1840 ++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAASG Sbjct: 244 VEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAASG 303 Query: 1839 KTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSD--GSDYFSASDDIV-PPLDNKQ 1669 + +D+ + S ES+SD SD++SAS+D+ P + + Sbjct: 304 EKLDDISGLPSDDSEDDDYNPEAPDVGKNDSEDESSSDESESDFYSASEDLAEAPTKDDE 363 Query: 1668 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPS 1489 I D I++ ++G S Sbjct: 364 ILGLSSEDSEDDDYNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDTNRLRGDEQGVSSS 423 Query: 1488 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 1309 V ++S PN+V+ EK K GK KG SL DELSYL +S++ VS KR ERLDYKKLHDE Y Sbjct: 424 V-DNSMPNSVSLKEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 482 Query: 1308 RNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVE 
1129 N +SDSSDEDY + K RK K + + D + K + H+ + Sbjct: 483 GNGSSDSSDEDYDDGPLPKVRKLRNAKGAMAAPS-----STPADIKYQSGKQKGSGHASD 537 Query: 1128 GKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQ 949 +KLKV ++ + K GE +RL ESFK+NQYP + Sbjct: 538 SGISEKLKV--------------GGTGTSESPSSGKRKTYGEVSTKRLYESFKDNQYPDR 583 Query: 948 DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 769 D K+ L KELGL QV KWFENAR RHS + K+++ + S ++++ L Sbjct: 584 DAKEKLGKELGLTAHQVSKWFENARHCHRHSPNWK-KIMSHKVSEESPSKSQIIGEPLGT 642 Query: 768 KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVDN 598 + + ++ S +G+E L + E+ D E + + + SG+KS T+K N Sbjct: 643 ESNSII--ASCNGVEKLEQPKQCLNGEKGHAIDKSEEELLIQDTSGKKSSEPTKKVHTTN 700 Query: 597 QGSSAGNCMKQDQHDDTPKS 538 +GS +DTP+S Sbjct: 701 EGS-----------EDTPRS 709 >ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1 [Solanum tuberosum] gi|565359059|ref|XP_006346340.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X2 [Solanum tuberosum] gi|565359061|ref|XP_006346341.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X3 [Solanum tuberosum] gi|565359063|ref|XP_006346342.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X4 [Solanum tuberosum] Length = 798 Score = 501 bits (1291), Expect = e-139 Identities = 314/741 (42%), Positives = 414/741 (55%), Gaps = 16/741 (2%) Frame = -1 Query: 2712 MGSATCAPENMGSATCDAHEN--------HLDLQHSEPAEKDATNVASESVPHEGTSLPS 2557 +G+ + +PE + H +L + S A ++A ++S E T Sbjct: 4 LGNTSVSPEKVARTAGGGHRTASVGNMSENLGVDQSGEACENAVQNLNQSEYREKTPGQP 63 Query: 2556 RKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNE 2383 RK+ S P+S+ R+LRS+S+EK ASE+ V + A E +K K+RK + K I VNE Sbjct: 64 RKRKSISGSPISSTRLLRSKSKEKSGASEANNTVVTHDATEEKKRKRRKKKHSKHIAVNE 123 Query: 2382 FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRD 2203 F+RIR HLRYLL R+ YE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRD Sbjct: 124 FTRIRGHLRYLLQRITYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRD 183 
Query: 2202 LFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQF 2023 LFQ+LD L EGR P SLFD+EG+IDSEDIFCAKCGS DL +NDIILCDGACERGFHQ Sbjct: 184 LFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQL 243 Query: 2022 CLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAAS 1843 C++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAAS Sbjct: 244 CVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAAS 303 Query: 1842 GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDI--VPPLDNKQ 1669 G+ +D+ + S ES+SD SD++SAS+D+ PP D+ + Sbjct: 304 GEKLDDISGLPSDDSEDDDYNPETPDVGKNDSEDESSSDESDFYSASEDLAEAPPKDD-E 362 Query: 1668 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPS 1489 I D I++ ++G S Sbjct: 363 ILGISSEDSEDDDFNPDDPDKDEPVKTESSSSDFTSDSEDFNLIVDTNRLQGDEQGVSSS 422 Query: 1488 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 1309 V ++S PN+ + EK K GK KG SL DELSYL +S++ VS KR ERLDYKKLHDE Y Sbjct: 423 V-DNSMPNSASQEEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 481 Query: 1308 RNATSDSSDEDYTETAGAKRRK-SNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSV 1132 N +S+SSDEDY + K RK NA A+ ++ D ++ G+ ++ S Sbjct: 482 GNGSSESSDEDYDDGPLPKVRKLRNAKGAMTSPSSTPADIKHQSGKQKGSGRASDSGIS- 540 Query: 1131 EGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPK 952 +KLKV ++ + K GE +RL ESFK+NQYP Sbjct: 541 -----EKLKV--------------GGAGTSESPSSGKRKTHGEVATKRLYESFKDNQYPD 581 Query: 951 QDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLA 772 +D K L KELGL QV KWFENAR RHSS + M S S + ++ L Sbjct: 582 RDAKGKLGKELGLTAYQVSKWFENARHCHRHSSHWNTIMSQKVSKESPS-KLQIIGEPLG 640 Query: 771 AKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVD 601 + + ++ +G+ L + R E+ D E +ASG+KS T+K Sbjct: 641 TESNSII--AFCNGVGKLEQPKQRLNGEKGHAIDKSEEDLFIQDASGKKSSEPTKKVYTT 698 Query: 600 NQGSSAGNCMKQDQHDDTPKS 538 NQGS +DTP++ Sbjct: 699 NQGS-----------EDTPRN 708 >ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa] gi|550331388|gb|EEE87841.2| hypothetical protein POPTR_0009s09600g 
[Populus trichocarpa] Length = 934 Score = 484 bits (1247), Expect = e-134 Identities = 298/684 (43%), Positives = 373/684 (54%), Gaps = 17/684 (2%) Frame = -1 Query: 2808 IVISNKDPISNSIPGDFRLPHEN---GAAICAPENMGSATCAPENMGSATCDAHENHLDL 2638 I I N +P++ + + H G +I P N T D + D Sbjct: 245 IAIENSEPLTQLVTKRSPIKHVGLLPGDSIIIPAN---------EQTRPTHDDEDKGPDH 295 Query: 2637 QHSEPAEKDATNVASESVPH-EGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAV 2461 +H E + A + P + S SRK S+RVLRSRSQEKPKA ES Sbjct: 296 EHLETPSRVAIGITRRGRPRGKSASRLSRKIYMLRSLRSSDRVLRSRSQEKPKAPESSNN 355 Query: 2460 EVENSANEGRKSKQRKGRM-KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 2284 ++ +K K+RK R K I +E+S+IR HLRYLL+R+ YE++LI AYS EGWKG Sbjct: 356 SGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGL 415 Query: 2283 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 2104 SLEKLKPEKELQRA S+I R K+KIRDLFQ +D +EGRFP SLFDSEGQIDSEDIFCA Sbjct: 416 SLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLFDSEGQIDSEDIFCA 475 Query: 2103 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1924 KCGSKDL +NDIILCDGAC+RGFHQFCL PPLL+EDIPPDDEGWLCPGCDCK+DC+G+L Sbjct: 476 KCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWLCPGCDCKVDCIGLL 535 Query: 1923 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSR 1744 ND Q + +S++D+WEKVFP EAAA ASG+ +D D + Sbjct: 536 NDSQGTNISISDSWEKVFP-EAAATASGQKLDHNFGPSSDDSDDNDYEPDGPDIDKKSQE 594 Query: 1743 GESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1564 ES+SD SD+ SASD+ P D K+ Sbjct: 595 EESSSDESDFTSASDEFKAPPDGKEYLGLSSDDSEDDDYDPDAPVLEEKLKQESSSSDFT 654 Query: 1563 XXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTE 1384 DL A + +DE H P +P V++G K K +K +SLN EL + E Sbjct: 655 SDSEDLAATINGDGLSLEDECHMP-----IEPRGVSNGRKSKFDGKKMQSLNSELLSMLE 709 Query: 1383 -----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIR 1219 + VSGKR +RLDYKKL+DE Y N S SSD+DYT+T G ++R+ N G Sbjct: 710 PDLCQDESATVSGKRNVDRLDYKKLYDETYGN-ISTSSDDDYTDTVGPRKRRKNTGDVAT 768 Query: 1218 
ICTNKIQDRNDRTDTMDG------NQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXX 1057 + N D + T +G NQ KEN + E + + Sbjct: 769 VTAN-----GDASVTENGMNSKNMNQELKENKRNPERGTCQNSSFQETNVSPAKSYVGAS 823 Query: 1056 XXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFEN 880 GK YK+LGEAV QRL F+ENQYP + K SLA+ELG+ +QV KWF N Sbjct: 824 LSGSSGKSVRPSAYKKLGEAVTQRLYSYFRENQYPDRAAKASLAEELGITFEQVNKWFVN 883 Query: 879 ARWSFRHSSRMESKMVAAASTNGS 808 ARWSF HSS + +AS GS Sbjct: 884 ARWSFNHSSSTGTSKAESASGKGS 907 >ref|XP_002300247.2| homeobox family protein [Populus trichocarpa] gi|550348560|gb|EEE85052.2| homeobox family protein [Populus trichocarpa] Length = 930 Score = 483 bits (1243), Expect = e-133 Identities = 315/809 (38%), Positives = 416/809 (51%), Gaps = 12/809 (1%) Frame = -1 Query: 3198 LHNCLLSGVSSSPTKEPPLKHGDEFVSGGGEPVVQKSVTTKSQQISLSEASVRECDCDSV 3019 +H+ + SS + P E S Q S+ + +A + + + Sbjct: 129 VHSESSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAEDAGLSLVEASNS 188 Query: 3018 DNLKILDGSSMNSSFESLTAHDLGSDNI--EPLEQKQDVAQDIGRKSPSETGVVASSELP 2845 D ++D SS + S + SD +PLE++Q ++ E G+ S Sbjct: 189 D---LIDESSYSQQTTSGQTREFHSDRACCKPLEERQKPGSELAENESMEIGIGLPSG-- 243 Query: 2844 GPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATC 2665 I I N +P++ + + H I P + A E + T Sbjct: 244 ------------IAIENLEPLTELVTKSCPIKH-----IGLPPGDDISIPANEQI-RPTH 285 Query: 2664 DAHENHLDLQHSEPAEKDATNVASESVPH--EGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491 D + D +H E + S+ VP + L +K SS S+RVLRS SQE Sbjct: 286 DKESKYPDCEHLEKLSGIVIGITSQGVPSVKRTSKLSGKKYTSSSRK--SDRVLRSNSQE 343 Query: 2490 KPKASESKAVEVE-NSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLID 2314 KPKA E NS E + +++K R K I +E+SRIR LRYLL+R+ YE++LI Sbjct: 344 KPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSLIT 403 Query: 2313 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 2134 AYS EGWKG SLEKLKPEKELQRA S+I R K+KIRDLFQ +D EGRFP SLFDSEG Sbjct: 404 AYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDSEG 463 Query: 2133 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1954 
QIDSEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQFCL PPLL+EDIPP DEGWLCPGC Sbjct: 464 QIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCPGC 523 Query: 1953 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1774 DCK+DC+ +LND Q + +S++D W+ VFP EAAA ASG+ +D Sbjct: 524 DCKVDCIDLLNDSQGTNISISDRWDNVFP-EAAAVASGQKLDY-NFGLSSDDSDDNDYDP 581 Query: 1773 XXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXX 1594 E S+ ES+SD SD+ SASD+ P D+KQ Sbjct: 582 DGPDIDEKSQEESSSDESDFSSASDEFEAPPDDKQYLGLPSDDSEDDDYDPDAPVLEEKL 641 Query: 1593 XXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRS 1414 DL A L DE H P +P+ ++G + + G +K S Sbjct: 642 KQESSSSDFTSDSEDLDATLNGDGLSLGDEYHMP-----IEPHEDSNGRRSRFGGKKNHS 696 Query: 1413 LNDELSYLTESNTE-----AVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKR 1249 LN +L + E ++ VSGKR ERLDYKKL+DE Y N S SSD+DYT+T ++ Sbjct: 697 LNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGN-ISTSSDDDYTDTVAPRK 755 Query: 1248 RKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072 R+ N G A+ I + ++ + NQ K+N+H+ G++H+ Sbjct: 756 RRKNTGDVAMGIANGDASVTENGLNSKNMNQELKKNEHT-SGRTHQNSSFQDTNVSPAKT 814 Query: 1071 XXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVG 895 K YK+LGEAV Q+L FKEN+YP Q K SLA+ELG+ +QV Sbjct: 815 HVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASLAEELGITFEQVN 874 Query: 894 KWFENARWSFRHSSRMESKMVAAASTNGS 808 KWF NARWSF HSS + +AS GS Sbjct: 875 KWFMNARWSFNHSSPEGTSKAESASGKGS 903 >ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296723 [Fragaria vesca subsp. 
vesca] Length = 1227 Score = 476 bits (1224), Expect = e-131 Identities = 328/893 (36%), Positives = 460/893 (51%), Gaps = 27/893 (3%) Frame = -1 Query: 3174 VSSSPTKEPPLKHGDEFVSGGGEP---VVQKSVTTKS---QQISLSEASVRECDCDSVDN 3013 + SS ++ PL+ VS GG VV ++V+ S Q L EA + C D + Sbjct: 357 LGSSFVQDEPLQTIIPVVSSGGNEQLRVVNENVSVPSLGEQAGLLPEAVSKTCQTDKLSR 416 Query: 3012 LKILDGSSMNSSFESLTAHDLGSDNIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEY 2833 S++++ + + GS EP EQ+ + PS+ V +S Sbjct: 417 -------SLHTASDQINESGSGSVQCEPQEQRDQLGS-----LPSQNDQVKNSTAVSSSI 464 Query: 2832 LEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHE 2653 +G + D ++NS+ G P E+ A ++ P T DA + Sbjct: 465 GFEQSGPSV-----DEMNNSVIGHLEPPPED-----ASKDHNKELIKPH-----TNDATQ 509 Query: 2652 NHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE 2473 N L+ SE A K+A+ +++ + + SR++ SL+ S+RVLRSR+ EKP+A E Sbjct: 510 NSC-LEPSETASKNASKNSTQFGCKDKRNSSSRRKSRSLVS--SDRVLRSRTSEKPEAPE 566 Query: 2472 ----------SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERN 2323 S +V ++ EG++ K++K +++ +EFSRIR+HLRY L+R+ YE++ Sbjct: 567 LSNNVATLDTSNSVANVSNEKEGKRKKRKKKHRERVAADEFSRIRSHLRYFLNRINYEKS 626 Query: 2322 LIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFD 2143 LIDAYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ+LD EG FPESLFD Sbjct: 627 LIDAYSSEGWKGNSLEKLKPEKELQRATSEILRRKSKIRDLFQRLDSLCAEGMFPESLFD 686 Query: 2142 SEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLC 1963 EGQIDSEDIFCAKCGS D+ +NDIILCDGAC+RGFHQ CL+PPLL E+IPPDDEGWLC Sbjct: 687 EEGQIDSEDIFCAKCGSLDVYADNDIILCDGACDRGFHQHCLEPPLLSEEIPPDDEGWLC 746 Query: 1962 PGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXX 1783 PGCDCK+DC+ +LND Q + LS+TD+WEKVFPE A AA++G+ + Sbjct: 747 PGCDCKVDCIDLLNDSQGTDLSITDSWEKVFPEAAVAASAGQHQENNQGLPSEDSDDDDY 806 Query: 1782 XXXXXXXDHEVSRGESTSDGSDYFSASDDI-VPPLDNKQIFXXXXXXXXXXXXXXXXXXX 1606 D EV GES+SD S+Y SASD + P +++Q Sbjct: 807 DPDGPETDEEVQEGESSSDESEYASASDGLETPKTNDEQYLGIPSDDSEDDDFNPDAPDP 866 Query: 1605 XXXXXXXXXXXXXXXXXXDLGAIL-EDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGK 
1429 DL A+L ED +S EG SV E S + G+ K G+ Sbjct: 867 TEDVKQGSSSSDFTSDSEDLAAVLDEDRKSFENGEGPQSSVLEASTLLRGSGGKGSKRGQ 926 Query: 1428 RKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET 1264 ++ + DELS L ES+ + VSGKR ERLDYKKLHDE Y + + S DE+Y ET Sbjct: 927 KR-HFIKDELSSLIESDPGQDGSTPVSGKRHVERLDYKKLHDEEYGDIPT-SDDEEYIET 984 Query: 1263 AGAKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXX 1087 A ++RK AG+ + K T D + +N+H+ +K Sbjct: 985 AVPRKRKKGAGQVSPGSLKGKPSTIKKGKTTKDIKDDPDKNEHTPRRTPRRKSSANDNSS 1044 Query: 1086 XXXXXXXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 910 A Y+RLGEAV QRL SFKENQYP + +K+ LA+ELG+ Sbjct: 1045 SPNESLKSSPKSGSTSGRAKGSTYRRLGEAVTQRLYTSFKENQYPDRSMKERLAQELGVM 1104 Query: 909 VQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSG 730 +QV KWFENAR + + M + +S++ H + + + + S Sbjct: 1105 AKQVSKWFENARHCVKAGLALPQAMRTQPNQAETSIKDAHHDGAQKNESPGTADAVAGSC 1164 Query: 729 MENLISSQVRPGNEECQITDAGEG--KSVESEASGEKSTRKRKVDNQGSSAGN 577 +++ +++ T A +G + +S+ G K K + SS G+ Sbjct: 1165 SQDVKDNKLATPKSSRAKTSAPKGRKRKSKSDPGGSDLDEKFKTPPETSSRGD 1217 >gb|EXB76647.1| Homeobox protein [Morus notabilis] Length = 1031 Score = 472 bits (1214), Expect = e-130 Identities = 298/694 (42%), Positives = 381/694 (54%), Gaps = 20/694 (2%) Frame = -1 Query: 2640 LQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASE-SK 2467 L+ E + K N S+ + + SRK+ L V S+RVLRSR+QEK K+ E S Sbjct: 309 LEQLETSSKSLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSN 368 Query: 2466 AVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKG 2287 + + E R +++K R ++ +EFSRIR L+Y +R+ YE+NLIDAYS EGWKG Sbjct: 369 TLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQNLIDAYSSEGWKG 428 Query: 2286 QSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFC 2107 SLEKLKPEKELQRA+S+IFR KLKIRDLFQQLD EGRFP+SLFDSEGQIDSEDIFC Sbjct: 429 TSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFDSEGQIDSEDIFC 488 Query: 2106 AKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGM 1927 AKCGSKD++ NDIILCDGAC+RGFHQFCL+PPLL 
EDIPPDDEGWLCPGCDCK+DC + Sbjct: 489 AKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDL 548 Query: 1926 LNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVS 1747 LND + LSVTD+WEKVFPE AAAA GK D +V Sbjct: 549 LNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDYDPYGPEIVEKVE 608 Query: 1746 RGESTSDGSDYFSASDDI---VPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1576 ES+SD S+Y SA D++ PP D +Q F Sbjct: 609 GDESSSDESEYTSACDELEGEAPPKD-EQYFGLSSDDSEDNDFDPDDQDVDENAKQESSS 667 Query: 1575 XXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGR--SLNDE 1402 DL L++G+ KDE P +++ KR G S+ DE Sbjct: 668 SDFTSDSEDLAFTLDEGQIAEKDE------VSSLDPTRSLGNAVMQSSKRGGNKSSIKDE 721 Query: 1401 LSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAGAKRRKS 1240 L + ES T +SGKR ERLDYK+LHDE Y + SDSS DED+T+ A ++RK Sbjct: 722 LLDILESGTGQDGSPPISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTDYAAPRKRKR 781 Query: 1239 NAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKV--XXXXXXXXXXXX 1066 G+ + N+ T D N E++ V + ++ V Sbjct: 782 TTGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVPRRRSRQNSVVTDENNIPNKLLQG 841 Query: 1065 XXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWF 886 + +RLGEAV QRL +SFKENQY + K+SLA+ELGL QV KWF Sbjct: 842 SPKSGSTGRRRELSTNRRLGEAVTQRLYQSFKENQYLDRATKESLAQELGLTSYQVSKWF 901 Query: 885 ENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQ 706 ENARWS+RHSS + + AS S+L + + + + + + + +G N + Sbjct: 902 ENARWSYRHSSSKKPGISEHASKE-STLSPQTNKKLFETELNTSITNSTCNGALN--NEL 958 Query: 705 VRPGN---EECQITDAGEGK--SVESEASGEKST 619 R GN E C D G+GK E+SG+ ST Sbjct: 959 PRTGNAMPESCS-GDVGDGKVEMPTKESSGQTST 991 >emb|CBI22504.3| unnamed protein product [Vitis vinifera] Length = 977 Score = 465 bits (1196), Expect = e-128 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%) Frame = -1 Query: 2850 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 2671 LP + ++S E + + +D I N E E +G + PEN+ Sbjct: 81 LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138 Query: 2670 
TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491 + L S A KD N + + L R +S +RVLRSRSQE Sbjct: 139 ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184 Query: 2490 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 2311 KPKAS+ V SA+ RK +++K RM K +EF+RIR HLRYLL+R+ YE+NLIDA Sbjct: 185 KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243 Query: 2310 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 2131 YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD EGRFPESLFDSEGQ Sbjct: 244 YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303 Query: 2130 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1951 IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD Sbjct: 304 IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363 Query: 1950 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1786 CK+DC+ +LND Q + LSV D+WEKVFPE AAA SG + D+ Sbjct: 364 CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423 Query: 1785 XXXXXXXXDHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618 ES SD SD+ SASDD+V +N+Q Sbjct: 424 DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483 Query: 1617 XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSED---------SKPN 1465 +++ + F S SED N Sbjct: 484 APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519 Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 1297 E+ + G++K +L DEL + ESN+ +S KR ERLDYKKLHDEAY N + Sbjct: 520 EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579 Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 1126 SDSS DED+TE ++RK+ +G + N T + N K+ H +E Sbjct: 580 SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 1125 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKEN 964 ++ +KL K YK+LGEAV +RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 963 
QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 787 QYP + +K+ LA+ELG+ +QV KWFENARWSFRH E S +A + S+ +T Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750 Query: 786 VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 613 +Q VL S +G+ S + + + +A GKS V+ +AS ++ +K Sbjct: 751 --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806 >ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera] Length = 968 Score = 465 bits (1196), Expect = e-128 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%) Frame = -1 Query: 2850 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 2671 LP + ++S E + + +D I N E E +G + PEN+ Sbjct: 81 LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138 Query: 2670 TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 2491 + L S A KD N + + L R +S +RVLRSRSQE Sbjct: 139 ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184 Query: 2490 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 2311 KPKAS+ V SA+ RK +++K RM K +EF+RIR HLRYLL+R+ YE+NLIDA Sbjct: 185 KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243 Query: 2310 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 2131 YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD EGRFPESLFDSEGQ Sbjct: 244 YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303 Query: 2130 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1951 IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD Sbjct: 304 IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363 Query: 1950 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1786 CK+DC+ +LND Q + LSV D+WEKVFPE AAA SG + D+ Sbjct: 364 CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423 Query: 1785 XXXXXXXXDHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618 ES SD SD+ SASDD+V +N+Q Sbjct: 424 DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483 Query: 1617 
XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSED---------SKPN 1465 +++ + F S SED N Sbjct: 484 APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519 Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 1297 E+ + G++K +L DEL + ESN+ +S KR ERLDYKKLHDEAY N + Sbjct: 520 EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579 Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 1126 SDSS DED+TE ++RK+ +G + N T + N K+ H +E Sbjct: 580 SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 1125 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKEN 964 ++ +KL K YK+LGEAV +RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 963 QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 787 QYP + +K+ LA+ELG+ +QV KWFENARWSFRH E S +A + S+ +T Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750 Query: 786 VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 613 +Q VL S +G+ S + + + +A GKS V+ +AS ++ +K Sbjct: 751 --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806 >ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis] gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1, putative [Ricinus communis] Length = 896 Score = 464 bits (1195), Expect = e-128 Identities = 292/696 (41%), Positives = 386/696 (55%), Gaps = 9/696 (1%) Frame = -1 Query: 2721 PENMGSATCAPENMGSATCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQIS 2542 P N A E +G DA + H + SE KDA + +S T+ SRK+ Sbjct: 153 PPNNEMKVPASEKLGPPH-DAEDKHWNGTQSEILSKDAVSNSSRLGRRVKTTAKSRKKYM 211 Query: 2541 SLLPPVSNRVLRSRSQEKPKASESKAVEVENSAN-EGRKSKQRKGRMKKIPVNEFSRIRT 2365 S+RV++ RSQEKPKA ES S+N E + K++K K + +E+S IR Sbjct: 212 LRCLRRSDRVMQYRSQEKPKAPESSTNLPNVSSNVEKTRKKKKKRERKSVEADEYSIIRK 271 Query: 2364 HLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLD 2185 +LRYLL+R+ YE++LI AYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ++D Sbjct: 272 
NLRYLLNRIGYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKSKIRDLFQRID 331 Query: 2184 RSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPL 2005 EGRFPESLFDS+GQI SEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQ+CL PPL Sbjct: 332 SLCGEGRFPESLFDSDGQISSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQYCLVPPL 391 Query: 2004 LKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDE 1825 LKEDIPPDD+GWLCPGCDCK+DC+ +LN+ Q + +S++D+WEKVFPE AAA G+ D+ Sbjct: 392 LKEDIPPDDQGWLCPGCDCKVDCIDLLNESQGTNISISDSWEKVFPE---AAAPGQNPDQ 448 Query: 1824 GTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFS-ASDDIVPPLDNKQIFXXXXX 1648 D + ES+SD SD SD++ P +KQ Sbjct: 449 NFGPPSDDSDDNDYDPDIPEIDEKSQGDESSSDDSDDSDFTSDELEAPPGDKQQLGLSSE 508 Query: 1647 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKP 1468 DL A L++ E +DE +S ++ Sbjct: 509 DSGDDDYDPDAPDLDDIVKEESSSSDFTSDSEDLAATLDNNELSGEDERR---ISVGTRG 565 Query: 1467 NAVASGEKLKAGKRKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRN 1303 ++ G K G++K +SL EL + E N + +SGKR ERLDYKKL+DE Y N Sbjct: 566 DSTKEGS--KRGRKKKQSLQSELLSIEEPNPSQDGSAPISGKRNVERLDYKKLYDETYGN 623 Query: 1302 ATSDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG 1126 +SDSS DED+T+ GA +R+ K+ + TDT G Q+ KE ++ V Sbjct: 624 VSSDSSDDEDFTDDVGAVKRR----KSTQAALGSANGNASVTDT--GKQDLKETEY-VPK 676 Query: 1125 KSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQ-YKRLGEAVVQRLVESFKENQYPKQ 949 +S ++L GK Y+RLGE V + L SFKENQYP + Sbjct: 677 RSRQRLISENTSITPTKAHEGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQYPDR 736 Query: 948 DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 769 D K+ LA+ELG+ QQV KWFENARWSF HSS M++ + N S + ++ +A Sbjct: 737 DRKEHLAEELGITYQQVTKWFENARWSFNHSSSMDANRIGKTPENNSPVSKTTTILLESA 796 Query: 768 KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGE 661 ++ V + DS + S ++ E + DA E Sbjct: 797 PET-VSGAAIDSAAQREESPKIGDAMVEIYVEDARE 831 >gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica] Length = 1058 Score = 461 bits (1187), Expect = e-127 Identities = 311/741 (41%), Positives = 398/741 (53%), Gaps = 43/741 (5%) Frame = -1 Query: 2967 
LTAHDL-----GSDNIEPLEQKQDVAQDIGRKSPSETGVVASS----ELPGPEYLEHSNG 2815 +T+H + GS EP +QK + + ++T SS E PGP Sbjct: 222 ITSHQINEFGSGSVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTVFEQPGPSIE----- 276 Query: 2814 EQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQ 2635 ++ PI +S P P S + + + M D +N LQ Sbjct: 277 ---AMTEDSPIGHSEP---------------PLEDLSKSLSDKEMEPLPEDVTQNS-SLQ 317 Query: 2634 HSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASESK--- 2467 E A K+A ++S P + + SRK+ V S+RVLRS++ EK K + K Sbjct: 318 QLETASKNALKISSCLGPKDKKNPKSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSN 377 Query: 2466 ---AVEVENSA-----NEGRKSKQRKGRMKKIPV-NEFSRIRTHLRYLLHRVKYERNLID 2314 +E NS E +K K+RK R + +EFSRIRTHLRYLL+R+ YE++LID Sbjct: 378 NVATLESSNSIANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLLNRIGYEKSLID 437 Query: 2313 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 2134 AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ+L+ EG FPESLFDSEG Sbjct: 438 AYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEGMFPESLFDSEG 497 Query: 2133 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1954 QIDSEDIFC KCGSKD++++NDIILCDGAC+RGFHQFCL+PPLL EDIPPDDEGWLCPGC Sbjct: 498 QIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGC 557 Query: 1953 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1774 DCK+DC+ +LND Q + LSVTD+WEKVFPE AAAA++G+ D Sbjct: 558 DCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAAASAGENQD-NHGLPSDDSDDNDYDPD 616 Query: 1773 XXXXDHEVSRGESTSDGSDYFSASDDIVPPLDN-KQIFXXXXXXXXXXXXXXXXXXXXXX 1597 D++V ES+SD S+Y SASD + P N +Q Sbjct: 617 GPETDNKVQGEESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDYNPYAPDVNED 676 Query: 1596 XXXXXXXXXXXXXXXDLGAILEDGESPNKD-EGHFPSVSEDSKPNAVASGEKLKAGKRKG 1420 DLGA L+D ++D EG + +DSKP+ SGE+ +K Sbjct: 677 VKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHR-GSGEQSSISGQKK 735 Query: 1419 RSLNDELSYLTES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAG 1258 SL DEL L ES + +SGKR ERLDYK+LHDEAY N +DSS DED+ + A Sbjct: 736 HSLKDELISLLESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIAT 795 Query: 1257 
AKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXX 1081 ++RK G+ A R K + + T D + EN+++ H+K V Sbjct: 796 QRKRKKGTGQVANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPHRKSNVEDTSNLS 855 Query: 1080 XXXXXXXXXXXXXGKDAAKQ---YKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 910 A Y RLGEA QRL +SFKEN YP + +K+SLA+ELGL Sbjct: 856 NKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRSMKESLARELGLM 915 Query: 909 VQQ---------VGKWFENAR 874 +Q V KWFENAR Sbjct: 916 AKQVIPSFILASVSKWFENAR 936 >ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Citrus sinensis] gi|568867273|ref|XP_006486964.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Citrus sinensis] Length = 1063 Score = 458 bits (1178), Expect = e-126 Identities = 313/796 (39%), Positives = 411/796 (51%), Gaps = 16/796 (2%) Frame = -1 Query: 2925 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 2746 EQ + I PS S+L E E S GE + ++ + + S + P Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE--LGASLELVVKSSIEQLKQPE 320 Query: 2745 ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 2575 I P SAT +++ S++ D E L+ SE P A N A Sbjct: 321 ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373 Query: 2574 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 2398 SL + + SL+ S+RVLRSRS E+P ES + NS E ++ K+ K R KK Sbjct: 374 TKSLKNNYTVRSLIG--SDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKK 431 Query: 2397 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 2218 I +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K Sbjct: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491 Query: 2217 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 2038 LKIRDLFQ+LD SL G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R Sbjct: 492 LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550 Query: 2037 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1858 GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE Sbjct: 551 
GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608 Query: 1857 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDG-----SDYFSASDDI 1693 AA+G D D + ES+SDG SD+ S SD++ Sbjct: 609 --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666 Query: 1692 VPPLDNKQIF--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGES 1519 P D+K DL A+LED S Sbjct: 667 EAPADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726 Query: 1518 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 1348 DEG + P ++G++ K G SLN+EL + + + V GKR Sbjct: 727 SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAAPVYGKRSS 778 Query: 1347 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 1174 ERLDYKKL+DE Y N DSSD++ G R+++ + K + K R T Sbjct: 779 ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838 Query: 1173 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVV 994 + E +++ + + KL G+ Y+++GE V Sbjct: 839 KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKIGEEVT 898 Query: 993 QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 814 Q+L SFKENQYP + K+SLAKELGL QV KWFEN RWSF H S +K+ A S Sbjct: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKL--ANSEK 956 Query: 813 GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 634 G+ + + ++ V + +G EN+ SS+ + C D + + E + Sbjct: 957 GT--------CTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007 Query: 633 GEKSTRKRKVDNQGSS 586 S RK D G S Sbjct: 1008 KPTSQTSRKRDRDGKS 1023 >ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] gi|557524813|gb|ESR36119.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] Length = 1063 Score = 457 bits (1175), Expect = e-125 Identities = 314/796 (39%), Positives = 408/796 (51%), Gaps = 16/796 (2%) Frame = -1 Query: 2925 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 2746 EQ + I PS S+L E E S GE + S + + +SI +L Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE-LGASLELVVKSSIEQLKQLE- 320 Query: 2745 
ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 2575 I P SAT +++ S++ D E L+ SE P A N A Sbjct: 321 ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373 Query: 2574 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 2398 SL + + SL+ S+RVLRSRS E+P ES + NS E ++ K+ K R KK Sbjct: 374 TKSLKNNYTVRSLIG--SDRVLRSRSGERPLPPESSNNLADVNSIGERKQKKRNKIRRKK 431 Query: 2397 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 2218 I +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K Sbjct: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491 Query: 2217 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 2038 LKIRDLFQ+LD SL G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R Sbjct: 492 LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550 Query: 2037 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1858 GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE Sbjct: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608 Query: 1857 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDG-----SDYFSASDDI 1693 AA+G D D + ES+SDG SD+ S SD++ Sbjct: 609 --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666 Query: 1692 VPPLDNKQI--FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGES 1519 P D+K DL A+LED S Sbjct: 667 EAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726 Query: 1518 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 1348 DEG + P ++G++ K G SLN+EL + + + V GKR Sbjct: 727 SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAVPVYGKRSS 778 Query: 1347 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 1174 ERLDYKKL+DE Y N DSSD++ G R+++ + K + K R T Sbjct: 779 ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838 Query: 1173 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVV 994 + E +++ + + KL G+ Y++LGE V Sbjct: 839 KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVT 898 Query: 993 
QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 814 Q+L SFKENQYP + K+SLAKELGL QV KWFEN RWSF H S S N Sbjct: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPS----------SKN 948 Query: 813 GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 634 + + + ++ V + +G EN+ SS+ + C D + + E + Sbjct: 949 AELANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007 Query: 633 GEKSTRKRKVDNQGSS 586 S RK D G S Sbjct: 1008 KPTSQTSRKRDRDGKS 1023 >gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] gi|508706504|gb|EOX98400.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] Length = 950 Score = 455 bits (1171), Expect = e-125 Identities = 308/762 (40%), Positives = 395/762 (51%), Gaps = 35/762 (4%) Frame = -1 Query: 3000 DGSSMNSSFESLTAHDLGSDNIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHS 2821 + SS ++ + L+ L S +V +++G +SP + + S LP Sbjct: 180 EDSSKHTKTDKLSCPQLVSSEPTVNFGSGNVCKELG-ESPEQRQQLDSESLP-------- 230 Query: 2820 NG-EQIVISNKDPISNSI----PGDFRLPHENGAAICAPENM-----GSATCAPENMGSA 2671 NG E+ I+ +SN P D H G PE + S + E +G Sbjct: 231 NGIEESTIAVSSNVSNQALQLKPEDMGKSHCGGHLHSPPEGVTNVIQSSKSPLVEPLGLP 290 Query: 2670 TCDAHENHLDLQHSEPAEKDATNVASESVPHEG----------------TSLPSRKQISS 2539 A N Q P E A N E HE TS +K+ Sbjct: 291 QEFAQGNPSTQQSGLPCEDMAQNSGVEQ--HETKPKNLLENSGRRRNGKTSKTIKKKYML 348 Query: 2538 LLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPV-NEFSRIRTH 2362 S+RVLRS+ QEKPKA+ES + ++E +K ++R+ R V +EFSRIRTH Sbjct: 349 RSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFSRIRTH 408 Query: 2361 LRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDR 2182 LRYLL+R+ YER+LI AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ +D Sbjct: 409 LRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLFQHIDS 468 Query: 2181 SLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLL 2002 EG+ PESLFDSEGQIDSEDIFCAKCGSKDL+ NDIILCDGAC+RGFHQ+CL PPLL Sbjct: 469 
LCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCLQPPLL 528 Query: 2001 KEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG 1822 KEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q ++ S+TD+WEKVFP EAA AA+G+ D Sbjct: 529 KEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFP-EAAVAAAGQNQDPN 587 Query: 1821 TXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXX 1642 D + ES+S+ S++ S S+++ P Q Sbjct: 588 FGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYLGLPSDDS 647 Query: 1641 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFP-SVSEDSKPN 1465 DL A+LE+ + KDEG S DSK Sbjct: 648 EDDDYDPDGPNHDEVVKPESSSSDFSSDSEDLDAMLEEDITSQKDEGPMANSAPRDSKRR 707 Query: 1464 AVASGEKLKAGKRKGRSLNDELSYLTESNTE----AVSGKRRGERLDYKKLHDEAYRNAT 1297 GEK S+NDEL + E +E A+S KR ERLDYK+L+DE Y N Sbjct: 708 KPKLGEK--------ESMNDELLSIMEPASEQDGSAISKKRSIERLDYKRLYDETYGNVP 759 Query: 1296 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDG-NQNHKENDHSVEGK 1123 S SS DED+++ ++R + N + DG QN +E +H K Sbjct: 760 SSSSDDEDWSDITAPRKRNKCTAEVASAPENGNVSVSRTVSVSDGLKQNPEETEHKPRRK 819 Query: 1122 SHKKLKVXXXXXXXXXXXXXXXXXXXXGKDA-AKQYKRLGEAVVQRLVESFKENQYPKQD 946 + + + GK A + YKRLGEAV QRL +SFKENQYP + Sbjct: 820 TRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYKSFKENQYPDRA 879 Query: 945 VKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 820 KQSLAKEL + QQV KWF+NARWSF +S + AS Sbjct: 880 TKQSLAKELDMTFQQVSKWFDNARWSFNNSPSSHETIANNAS 921 >ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Glycine max] Length = 820 Score = 451 bits (1161), Expect = e-124 Identities = 298/693 (43%), Positives = 375/693 (54%), Gaps = 22/693 (3%) Frame = -1 Query: 2832 LEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPE--NMGSATC 2665 LE S EQ+ + SN P + P + E +I A G P NM S Sbjct: 93 LEQSTVEQVTVDLSNDKPENKCKPLSENVQSEPVESIPAVVVEGQMQSNPSQANMSSV-- 150 Query: 2664 DAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSR---KQISSLLPPV-------SNR 2515 N L Q S A + ++ SE + + T SR K+ S LL S+R Sbjct: 151 ----NELLDQPSGDAVNNISSNCSEKMSNSPTHSQSRRKGKKNSKLLKKYMLRSLGSSDR 206 Query: 2514 
VLRSRSQEKPKASE--SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHR 2341 LRSR++EKPK E S V+ N+ + + +++K R ++ N+FSRIR+HLRYLL+R Sbjct: 207 ALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHLRYLLNR 266 Query: 2340 VKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRF 2161 + YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLFQ LD EG+F Sbjct: 267 ISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAEGKF 326 Query: 2160 PESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPD 1981 PESLFDS G+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPP+L EDIPP Sbjct: 327 PESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDIPPG 386 Query: 1980 DEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXX 1801 DEGWLCPGCDCK DC+ ++ND ++LS++D WE+VFPE AA+ +G MD + Sbjct: 387 DEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNSGVPSDD 444 Query: 1800 XXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXX 1621 +V ES+SD S+Y SAS+ + Q Sbjct: 445 SDDDDYNPNGPDDV-KVEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDP 503 Query: 1620 XXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKL 1441 DL A +ED SP +D G +S K V G+KL Sbjct: 504 DAPDVECKVNEESSSSDFTSDSEDLAAAIEDNTSPGQDGG----ISSSKKKGKV--GKKL 557 Query: 1440 KAGKRKGRSLNDELSYLTE--SNTEA---VSGKRRGERLDYKKLHDEAYRNATSDSSDED 1276 SL DELS L E S EA VSGKR ERLDYKKL++E Y + TSD DED Sbjct: 558 --------SLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTSD--DED 607 Query: 1275 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 1099 + +TA +K G + N N T + +QN+ EN ++ KS Sbjct: 608 WNDTAAPSGKKKLTGNVTPVSPNGNASNNSIHTPKRNAHQNNVENTNNSPTKS------- 660 Query: 1098 XXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 919 K + +KRLGEAVVQRL +SFKENQYP + K+SLA+EL Sbjct: 661 --------LEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQEL 712 Query: 918 GLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 820 GL QQV KWF N RWSFRHSS+ME+ AS Sbjct: 713 GLTYQQVAKWFGNTRWSFRHSSQMETNSGINAS 745 >ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max] Length = 820 Score = 
447 bits (1149), Expect = e-122 Identities = 304/773 (39%), Positives = 395/773 (51%), Gaps = 31/773 (4%) Frame = -1 Query: 2832 LEHSNGEQIVIS-------NK-DPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMG 2677 LE S EQ+ + NK P+S ++ + P E+ A M S+ A NM Sbjct: 93 LEQSTVEQVSVDLSNDKSENKCKPLSENVQSE---PVESIPAFVVDGQMQSSP-AQANMS 148 Query: 2676 SATCDAHENHLDLQHSEPAEKDATNVASE--SVPHEGTSLPSRKQISSLLPPV------- 2524 S N L Q S + TN + + + P S K+ S LL Sbjct: 149 SV------NELLDQPSGDVVNNITNCSEKMSNSPSHSQSRRKGKRNSKLLKKKYMLRSLG 202 Query: 2523 -SNRVLRSRSQEKPKASESKAVEVENSANEGRKSK---QRKGRMKKIPVNEFSRIRTHLR 2356 S R LRSR++EKPK E + V+ ++N+G K K ++K R ++ ++FSRIR+HLR Sbjct: 203 SSGRALRSRTKEKPKEPEPTSNLVDGNSNDGVKRKSGRKKKKRREEGITDQFSRIRSHLR 262 Query: 2355 YLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSL 2176 YLL+R+ YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLF+ LD Sbjct: 263 YLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRRKLKIRDLFRNLDSLC 322 Query: 2175 TEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKE 1996 EG+FPESLFDS G+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPPLL E Sbjct: 323 AEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPLLTE 382 Query: 1995 DIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTX 1816 DIPP DEGWLCPGCDCK DC+ ++ND ++LS++D WE+VFPE AA+ +G MD Sbjct: 383 DIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNLG 440 Query: 1815 XXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXX 1636 ++ ES+SD S+Y SAS+ + Q Sbjct: 441 LPSDDSDDDDYNPNGSDDV-KIEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDD 499 Query: 1635 XXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVA 1456 DL A ED SP +D G + Sbjct: 500 GDYDPDAPDVDCKVNEESSSSDFTSDSEDLAAAFEDNTSPGQDGG------------INS 547 Query: 1455 SGEKLKAGKRKGRSLNDELSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSD 1291 S +K K GK S+ DELS L E ++ VSGKR ERLDYKKL++E Y + TSD Sbjct: 548 SKKKGKVGK---LSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKKLYEETYHSDTSD 604 Query: 1290 SSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHK 1114 DED+ + A R+K G + N N T + +QN EN 
+S KS Sbjct: 605 --DEDWNDAAAPSRKKKLTGNVTPVSPNANASNNSIHTLKRNAHQNKVENTNSSPTKS-- 660 Query: 1113 KLKVXXXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQS 934 + + +KRLGEAVVQRL +SFKENQYP + K+S Sbjct: 661 -------------LDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTKES 707 Query: 933 LAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPV 754 LA+ELGL QQV KWF+N RWSFRHSS+ME+ AS + R SP Sbjct: 708 LAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDGRAENEGEKQCESMSPE 767 Query: 753 LPSPSDSGMENLISSQVRPGNEECQITDAGEGKSV----ESEASGEKSTRKRK 607 + + + + E Q+ G S +++ + TRKRK Sbjct: 768 VSGKNSKTTSSRKRKHLSEPLSEAQLDINGLATSSPNVHQTQVGNKMKTRKRK 820 >ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1 [Cicer arietinum] Length = 995 Score = 445 bits (1145), Expect = e-122 Identities = 287/695 (41%), Positives = 372/695 (53%), Gaps = 11/695 (1%) Frame = -1 Query: 2631 SEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE--SKAVE 2458 SE K + ++ S + L + + SL S+R LRSR+++KPK E + V+ Sbjct: 309 SERKSKSSAHLRSRHKGKSNSKLSKKYILRSL--GSSDRALRSRTRDKPKDPEPINNVVD 366 Query: 2457 VENSANEGRKSKQRKG-RMKKIPVNE-FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 2284 V N A + ++ K++K R +K +N+ +S+IR HLRYLL+R+ YE+NLIDAYS EGWKG Sbjct: 367 VSNDAMKTKRGKKKKKKRPRKEGINDQYSKIRAHLRYLLNRISYEQNLIDAYSGEGWKGY 426 Query: 2283 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 2104 SLEKLKPEKE+QRA+S+I R KLKIRDLFQ LD EGR PESLFDS+G+IDSEDIFCA Sbjct: 427 SLEKLKPEKEIQRAKSEILRRKLKIRDLFQNLDSLCAEGRLPESLFDSKGEIDSEDIFCA 486 Query: 2103 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1924 KC +K L +NDIILCDGAC+RGFHQ CLDPPLL EDIPP DEGWLCPGCDCK DC+ ++ Sbjct: 487 KCQTKVLGTDNDIILCDGACDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCIELV 546 Query: 1923 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG--TXXXXXXXXXXXXXXXXXXXDHEV 1750 ND + LS+T+ WE+VFPE A AA S + G + D EV Sbjct: 547 NDLLGTNLSLTNTWERVFPEAATAAGSILDHNSGLPSDDSEDDDYNPNGPEDVEVEDAEV 606 Query: 1749 SRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1570 ES+SD S+Y SAS+ + Q 
Sbjct: 607 EGDESSSDESEYASASEKLEDSRHEDQYLGLPSEDSEDDDFDPDAPDLGGKVTEESSSSD 666 Query: 1569 XXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYL 1390 DL A ++D S +D + +D K S + K RK S+ DELS L Sbjct: 667 FTSDSEDLAATIKDNMSTGQDGDITSPLLDDVKNLKGFSRQNHKV--RKKPSMADELSSL 724 Query: 1389 TES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKA 1225 +S + ++ KR ERLDY+KL++E Y++ TSD DED+ +A R+K AGK Sbjct: 725 LKSDLGQEDITPITAKRNVERLDYQKLYEETYQSDTSD--DEDWDASATPSRKKKLAGKM 782 Query: 1224 IRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXX 1045 + N N R Q HK VE ++ K Sbjct: 783 TPVSPNGNASNNSRHTASRNTQQHK-----VENTNNSPTKT----------LEGCTKSGS 827 Query: 1044 XGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSF 865 K YKRLGEAVVQRL +SFKENQYP++ K+SLA+ELGL QQV KWF N RWSF Sbjct: 828 RDKRRGLTYKRLGEAVVQRLYKSFKENQYPERTTKESLAQELGLTFQQVDKWFGNTRWSF 887 Query: 864 RHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEE 685 RHSS E A+ +N S T + + + + G+EN G E Sbjct: 888 RHSSHTE----ASPGSNASQQATDSGAENKEERGNASQQATDSPGVEN-------KGEGE 936 Query: 684 CQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAG 580 C++ G + S +K RKR + Q S AG Sbjct: 937 CELVSQGTSREKSRTQSSKK--RKRLSEPQVSEAG 969 >gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris] Length = 826 Score = 433 bits (1113), Expect = e-118 Identities = 288/686 (41%), Positives = 375/686 (54%), Gaps = 21/686 (3%) Frame = -1 Query: 2835 YLEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCD 2662 +L+ S +++ + SN +P + S P P E+ A S+ A Sbjct: 91 HLQQSTDKEVSLQLSNDEPENPSQPLSENEPVESAPAFAGDGQKQSSPAL------ANTS 144 Query: 2661 AHENHLDLQHSEP----AEKDATNVASESVPHEGTSLPSRKQISSLLPPV--SNRVLRSR 2500 N LD + +EK + + A+ + +G + + +L V S+R LRS+ Sbjct: 145 YVNNMLDPPSGDAVINCSEKVSNSPANSQLRRKGKKNSKFLKKTYMLRSVGSSDRALRSK 204 Query: 2499 SQEKPKASESKAVEVE---NSANEGRKSKQRKGRMKKIPV---NEFSRIRTHLRYLLHRV 2338 ++E PK E + V+ N+ N+G K K K + K V ++FSRI++HLRYLL+R+ Sbjct: 205 TKENPKTPEPNSNLVDCNNNNNNDGVKKKSFKKKRKSGEVGITDQFSRIKSHLRYLLNRI 264 Query: 2337 
KYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFP 2158 YE+NLIDAYS EGWKG S+EKLKPEKELQRA+S+I R KL IR+LF+ LD TEG+ P Sbjct: 265 GYEKNLIDAYSAEGWKGYSMEKLKPEKELQRAKSEIIRRKLNIRELFRNLDSLCTEGKLP 324 Query: 2157 ESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDD 1978 ESLFDSEG+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPPLL EDIPP D Sbjct: 325 ESLFDSEGEIDSEDIFCAKCHSKELSSNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGD 384 Query: 1977 EGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXX 1798 EGWLCPGCDCK DC+ ++ND ++LS++D WE+VFP EAAAAA KT + Sbjct: 385 EGWLCPGCDCKDDCMDLINDSFGTSLSISDTWERVFP-EAAAAAGNKT--DNNSGLPSDD 441 Query: 1797 XXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 1618 D +V ES+SD SDY SAS+++ Q Sbjct: 442 SDDDDYNPNGPEDVKVEGDESSSDESDYASASENL-EGSHGDQYLGLPSDDSDDGDYDPA 500 Query: 1617 XXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGE-KL 1441 DL A + + SP +D G S S D + G+ K Sbjct: 501 APDADSKVNVESSSSDFTSDSDDLPAAIVENTSPGQD-GEIRSASLDDVKCLNSYGKRKG 559 Query: 1440 KAGKRKGRSLNDELSYLTE-----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDED 1276 KAGK+ S+ DELS L E + VSG+R ERLDYKKL+DEAY + TS+ DED Sbjct: 560 KAGKK--LSMADELSSLLEPDSGQEGSTPVSGRRNLERLDYKKLYDEAYHSDTSE--DED 615 Query: 1275 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 1099 +T T R+K G A + + N T +G+Q EN + KS Sbjct: 616 WTATVTPSRKKK--GNATPVSPDGNASNNSMHTPKRNGHQKKFENTKNSPAKS------- 666 Query: 1098 XXXXXXXXXXXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 919 K + YKRLGEAVV+RL SFKENQYP + K+SLA+EL Sbjct: 667 --------LDDHVKSDSRKQKSKSSAYKRLGEAVVERLHISFKENQYPDRTTKESLAQEL 718 Query: 918 GLRVQQVGKWFENARWSFRHSSRMES 841 GL QQV KWF+N RWSFRHSS+ME+ Sbjct: 719 GLTCQQVAKWFDNTRWSFRHSSQMET 744 >sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHP gi|666128|gb|AAA62237.1| homeodomain protein [Petroselinum crispum] Length = 1088 Score = 429 bits (1103), Expect = e-117 Identities = 265/640 (41%), Positives = 354/640 (55%), Gaps = 26/640 (4%) Frame = -1 Query: 2565 
LPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPVN 2386 +P + + S L S+R LRSRSQEK + + + A+ + K+RK RM++ V+ Sbjct: 429 VPEKGKDSQELSVNSSRSLRSRSQEKSIEPDVNNIVADEGADREKPRKKRKKRMEENRVD 488 Query: 2385 EFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIR 2206 EF RIRTHLRYLLHR+KYE+N +DAYS EGWKGQSL+K+KPEKEL+RA+++IF KLKIR Sbjct: 489 EFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKIR 548 Query: 2205 DLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQ 2026 DLFQ+LD + +EGR PE LFDS G+IDSEDIFCAKCGSKD+T+ NDIILCDGAC+RGFHQ Sbjct: 549 DLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFHQ 608 Query: 2025 FCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAA 1846 FCLDPPLLKE IPPDDEGWLCPGC+CK+DC+ +LND QE+ + + D+WEKVF EEAAAAA Sbjct: 609 FCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAAAA 668 Query: 1845 SGKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGESTSDGSDYFSASDDIVPPLDNKQI 1666 SGK +D+ + D +V +S++D SDY S SDD+ Q+ Sbjct: 669 SGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDESDYQSESDDM-------QV 721 Query: 1665 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDLGAILEDGESPNKDEGHFPSV 1486 + D S ++D F V Sbjct: 722 IRQKNSRGLPSDDSEDDEYDPSGLVTDQMYK---------DSSCSDFTSDSED---FTGV 769 Query: 1485 SEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHD---- 1318 +D K A G L + R+ + + + +T + +R+ E LDYKKL+D Sbjct: 770 FDDYKDTGKAQG-PLASTPDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDIEFS 828 Query: 1317 ----------------------EAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNK 1204 E Y N +SDSSDEDY T+ K+N+ K Sbjct: 829 KMCDILDILSSQLDVIICTGNQEEYGNTSSDSSDEDYMVTSSPD--KNNSDKEA-----T 881 Query: 1203 IQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXGKDAAK 1024 +R + ++ +Q +E+ H+ + KK V K +K Sbjct: 882 AMERGRESGDLELDQKARESTHN--RRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSK 939 Query: 1023 QYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME 844 GE QRL++SFKENQYP++ VK+SLA EL L V+QV WF N RWSFRHSSR+ Sbjct: 940 TLH--GEHATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIG 997 Query: 843 
SKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGME 724 S VA +N + + + + + K VL S + S +E Sbjct: 998 SD-VAKFDSNDTPRQKSIDMSGPSLKS--VLDSATYSEIE 1034 >ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus] Length = 749 Score = 415 bits (1067), Expect = e-113 Identities = 280/718 (38%), Positives = 375/718 (52%), Gaps = 31/718 (4%) Frame = -1 Query: 2598 ASESVPHEGTSLPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSK 2422 + +S + L S+K+ L VS+ RVLRSR+QEK KA E +A E K K Sbjct: 24 SQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRK 83 Query: 2421 QRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKEL 2251 ++K R K V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKPEKEL Sbjct: 84 KKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKEL 143 Query: 2250 QRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIEN 2071 QRA ++I R KLKIRDLFQ++D EGR ESLFDSEGQIDSEDIFCAKCGSK+L++EN Sbjct: 144 QRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEN 203 Query: 2070 DIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVT 1891 DIILCDG C+RGFHQFCL+PPLL DIPPDDEGWLCPGCDCK DC+ +LN+FQ S LS+T Sbjct: 204 DIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSIT 263 Query: 1890 DNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGEST 1732 D WEKV+PE AAAAA G D+ E S +S Sbjct: 264 DGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSN 323 Query: 1731 SDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1567 SD S+ Y SAS+ + ++ Q Sbjct: 324 SDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDF 383 Query: 1566 XXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLT 1387 DL A+ D +KD G S ++ P ++G+ +G K +L++ELS L Sbjct: 384 TSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQS--SGPNKS-ALHNELSSLL 437 Query: 1386 ESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET----------AGAK 1252 +S E VSG+R+ ERLDYKKLHDE Y N +DSSD+ Y T +G + Sbjct: 438 DSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTR 497 Query: 1251 
RRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072 +R G + ND + +++K G + V Sbjct: 498 KR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKS 553 Query: 1071 XXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGK 892 K + +RL + ++RL+ SF+EN+YPK+ KQSLA+ELGL ++QV K Sbjct: 554 SSSVK------KSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSK 607 Query: 891 WFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLIS 712 WFEN RWS RH S K SS R +++ + + S P S + + S Sbjct: 608 WFENTRWSTRHPSSSGKKAK-------SSSRMSIYLSQASGELSKNEPE-SATCFRDTDS 659 Query: 711 SQVRPGNEECQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAGNCMKQDQHDDTPKS 538 + R +++ + ++ S +S +G+K RK SSA K+ D S Sbjct: 660 NGAR--HQDLPMANSVVA-SCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 714 >ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus] Length = 1061 Score = 415 bits (1067), Expect = e-113 Identities = 280/718 (38%), Positives = 375/718 (52%), Gaps = 31/718 (4%) Frame = -1 Query: 2598 ASESVPHEGTSLPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSK 2422 + +S + L S+K+ L VS+ RVLRSR+QEK KA E +A E K K Sbjct: 256 SQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRK 315 Query: 2421 QRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKEL 2251 ++K R K V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKPEKEL Sbjct: 316 KKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKEL 375 Query: 2250 QRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIEN 2071 QRA ++I R KLKIRDLFQ++D EGR ESLFDSEGQIDSEDIFCAKCGSK+L++EN Sbjct: 376 QRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEN 435 Query: 2070 DIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVT 1891 DIILCDG C+RGFHQFCL+PPLL DIPPDDEGWLCPGCDCK DC+ +LN+FQ S LS+T Sbjct: 436 DIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSIT 495 Query: 1890 DNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXDHEVSRGEST 1732 D WEKV+PE AAAAA G D+ E S +S Sbjct: 496 DGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSN 555 Query: 1731 
SDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1567 SD S+ Y SAS+ + ++ Q Sbjct: 556 SDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDF 615 Query: 1566 XXXXXDLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLT 1387 DL A+ D +KD G S ++ P ++G+ +G K +L++ELS L Sbjct: 616 TSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQS--SGPNKS-ALHNELSSLL 669 Query: 1386 ESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET----------AGAK 1252 +S E VSG+R+ ERLDYKKLHDE Y N +DSSD+ Y T +G + Sbjct: 670 DSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTR 729 Query: 1251 RRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 1072 +R G + ND + +++K G + V Sbjct: 730 KR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSVTETPVDTAKS 785 Query: 1071 XXXXXXXXXXGKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGK 892 K + +RL + ++RL+ SF+EN+YPK+ KQSLA+ELGL ++QV K Sbjct: 786 SSSVK------KSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSK 839 Query: 891 WFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLIS 712 WFEN RWS RH S K SS R +++ + + S P S + + S Sbjct: 840 WFENTRWSTRHPSSSGKKAK-------SSSRMSIYLSQASGELSKNEPE-SATCFRDTDS 891 Query: 711 SQVRPGNEECQITDAGEGKSVESEASGEKSTRKRKVDNQGSSAGNCMKQDQHDDTPKS 538 + R +++ + ++ S +S +G+K RK SSA K+ D S Sbjct: 892 NGAR--HQDLPMANSVVA-SCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTAS 946