BLASTX nr result
ID: Catharanthus22_contig00003070
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00003070 (3700 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain ... 505 e-140 ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain ... 501 e-139 ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Popu... 484 e-134 ref|XP_002300247.2| homeobox family protein [Populus trichocarpa... 481 e-133 ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296... 474 e-131 gb|EXB76647.1| Homeobox protein [Morus notabilis] 472 e-130 emb|CBI22504.3| unnamed protein product [Vitis vinifera] 465 e-128 ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vit... 465 e-128 ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus c... 464 e-127 gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus pe... 462 e-127 ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isof... 456 e-125 ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citr... 455 e-125 gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type ... 455 e-125 ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain ... 451 e-124 ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isof... 447 e-122 ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain ... 444 e-121 gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus... 433 e-118 sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodo... 429 e-117 ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204... 416 e-113 ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cuc... 415 e-113 >ref|XP_004230722.1| PREDICTED: pathogenesis-related homeodomain protein-like [Solanum lycopersicum] Length = 796 Score = 505 bits (1301), Expect = e-140 Identities = 309/740 (41%), Positives = 410/740 (55%), Gaps = 15/740 (2%) Frame = +3 Query: 1023 MGSATCAPENMGSATCDAH--------ENHLDLQHSEPAEKDATNVASESVPHEGTSLPS 1178 +G+ + +PE +A H EN Q E E N+ + P Sbjct: 4 LGNTSVSPEKARTAGGGHHTASAGNMSENLGADQSRESCENTVQNLNQSEYREKSPGQPR 63 Query: 1179 RKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNEF 1355 +++ S P S R+LRS+S+EK ASE+K V + A E +K K+RK + K I NEF Sbjct: 64 KRKSISGSPISSTRLLRSKSKEKSGASEAKNTVVTHDATEEKKRKRRKKKHSKHIAANEF 123 Query: 1356 SRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDL 1535 +RIR HLRYLL R+KYE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRDL Sbjct: 124 TRIRGHLRYLLQRIKYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRDL 183 Query: 1536 FQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFC 1715 FQ+LD L EGR P SLFD+EG+IDSEDIFCAKCGS DL +NDIILCDGACERGFHQ C Sbjct: 184 FQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQLC 243 Query: 1716 LDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASG 1895 ++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAASG Sbjct: 244 VEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAASG 303 Query: 1896 KTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSD--GSDYFSASDDIV-PPLDNKQ 2066 + +D+ + S ES+SD SD++SAS+D+ P + + Sbjct: 304 EKLDDISGLPSDDSEDDDYNPEAPDVGKNDSEDESSSDESESDFYSASEDLAEAPTKDDE 363 Query: 2067 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPS 2246 I I++ ++G S Sbjct: 364 ILGLSSEDSEDDDYNPDDPDKDEPVKTESSSSDFTSDSEDFSLIVDTNRLRGDEQGVSSS 423 Query: 2247 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 2426 V ++S PN+V+ EK K GK KG SL DELSYL +S++ VS KR ERLDYKKLHDE Y Sbjct: 424 V-DNSMPNSVSLKEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 482 Query: 2427 RNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVE 2606 N +SDSSDEDY + K RK K + + D + K + H+ + Sbjct: 483 GNGSSDSSDEDYDDGPLPKVRKLRNAKGAMAAPS-----STPADIKYQSGKQKGSGHASD 537 Query: 2607 GKSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQ 2786 +KLKV ++ + K GE +RL ESFK+NQYP + Sbjct: 538 SGISEKLKV--------------GGTGTSESPSSGKRKTYGEVSTKRLYESFKDNQYPDR 583 Query: 2787 DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 2966 D K+ L KELGL QV KWFENAR RHS + K+++ + S ++++ L Sbjct: 584 DAKEKLGKELGLTAHQVSKWFENARHCHRHSPNWK-KIMSHKVSEESPSKSQIIGEPLGT 642 Query: 2967 KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVDN 3137 + + ++ S +G+E L + E+ D E + + + SG+KS T+K N Sbjct: 643 ESNSII--ASCNGVEKLEQPKQCLNGEKGHAIDKSEEELLIQDTSGKKSSEPTKKVHTTN 700 Query: 3138 QGSGAGNCMKQDQHDDTPKS 3197 +GS +DTP+S Sbjct: 701 EGS-----------EDTPRS 709 >ref|XP_006346339.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1 [Solanum tuberosum] gi|565359059|ref|XP_006346340.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X2 [Solanum tuberosum] gi|565359061|ref|XP_006346341.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X3 [Solanum tuberosum] gi|565359063|ref|XP_006346342.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X4 [Solanum tuberosum] Length = 798 Score = 501 bits (1291), Expect = e-139 Identities = 313/741 (42%), Positives = 413/741 (55%), Gaps = 16/741 (2%) Frame = +3 Query: 1023 MGSATCAPENMGSATCDAHEN--------HLDLQHSEPAEKDATNVASESVPHEGTSLPS 1178 +G+ + +PE + H +L + S A ++A ++S E T Sbjct: 4 LGNTSVSPEKVARTAGGGHRTASVGNMSENLGVDQSGEACENAVQNLNQSEYREKTPGQP 63 Query: 1179 RKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKK-IPVNE 1352 RK+ S P+S+ R+LRS+S+EK ASE+ V + A E +K K+RK + K I VNE Sbjct: 64 RKRKSISGSPISSTRLLRSKSKEKSGASEANNTVVTHDATEEKKRKRRKKKHSKHIAVNE 123 Query: 1353 FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRD 1532 F+RIR HLRYLL R+ YE+ LI+AYS EGWKGQSLEK+K EKELQRA++ IFRYKLKIRD Sbjct: 124 FTRIRGHLRYLLQRITYEQTLIEAYSGEGWKGQSLEKIKLEKELQRAKTHIFRYKLKIRD 183 Query: 1533 LFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQF 1712 LFQ+LD L EGR P SLFD+EG+IDSEDIFCAKCGS DL +NDIILCDGACERGFHQ Sbjct: 184 LFQRLDTLLAEGRLPASLFDNEGEIDSEDIFCAKCGSMDLPADNDIILCDGACERGFHQL 243 Query: 1713 CLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAAS 1892 C++PPLLKEDIPPDDEGWLCPGCDCK+DC+ +LND Q + LSVTD+WEKV+P+EAAAAAS Sbjct: 244 CVEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLLNDLQGTDLSVTDSWEKVYPKEAAAAAS 303 Query: 1893 GKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDI--VPPLDNKQ 2066 G+ +D+ + S ES+SD SD++SAS+D+ PP D+ + Sbjct: 304 GEKLDDISGLPSDDSEDDDYNPETPDVGKNDSEDESSSDESDFYSASEDLAEAPPKDD-E 362 Query: 2067 IFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPS 2246 I I++ ++G S Sbjct: 363 ILGISSEDSEDDDFNPDDPDKDEPVKTESSSSDFTSDSEDFNLIVDTNRLQGDEQGVSSS 422 Query: 2247 VSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHDEAY 2426 V ++S PN+ + EK K GK KG SL DELSYL +S++ VS KR ERLDYKKLHDE Y Sbjct: 423 V-DNSMPNSASQEEKAKVGKAKGNSLKDELSYLMQSDSPLVSAKRHIERLDYKKLHDETY 481 Query: 2427 RNATSDSSDEDYTETAGAKRRK-SNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSV 2603 N +S+SSDEDY + K RK NA A+ ++ D ++ G+ ++ S Sbjct: 482 GNGSSESSDEDYDDGPLPKVRKLRNAKGAMTSPSSTPADIKHQSGKQKGSGRASDSGIS- 540 Query: 2604 EGKSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPK 2783 +KLKV ++ + K GE +RL ESFK+NQYP Sbjct: 541 -----EKLKV--------------GGAGTSESPSSGKRKTHGEVATKRLYESFKDNQYPD 581 Query: 2784 QDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLA 2963 +D K L KELGL QV KWFENAR RHSS + M S S + ++ L Sbjct: 582 RDAKGKLGKELGLTAYQVSKWFENARHCHRHSSHWNTIMSQKVSKESPS-KLQIIGEPLG 640 Query: 2964 AKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEASGEKS---TRKRKVD 3134 + + ++ +G+ L + R E+ D E +ASG+KS T+K Sbjct: 641 TESNSII--AFCNGVGKLEQPKQRLNGEKGHAIDKSEEDLFIQDASGKKSSEPTKKVYTT 698 Query: 3135 NQGSGAGNCMKQDQHDDTPKS 3197 NQGS +DTP++ Sbjct: 699 NQGS-----------EDTPRN 708 >ref|XP_002313886.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa] gi|550331388|gb|EEE87841.2| hypothetical protein POPTR_0009s09600g [Populus trichocarpa] Length = 934 Score = 484 bits (1247), Expect = e-134 Identities = 295/684 (43%), Positives = 370/684 (54%), Gaps = 17/684 (2%) Frame = +3 Query: 927 IVISNKDPISNSIPGDFRLPHEN---GAAICAPENMGSATCAPENMGSATCDAHENHLDL 1097 I I N +P++ + + H G +I P N T D + D Sbjct: 245 IAIENSEPLTQLVTKRSPIKHVGLLPGDSIIIPAN---------EQTRPTHDDEDKGPDH 295 Query: 1098 QHSEPAEKDATNVASESVPH-EGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAV 1274 +H E + A + P + S SRK S+RVLRSRSQEKPKA ES Sbjct: 296 EHLETPSRVAIGITRRGRPRGKSASRLSRKIYMLRSLRSSDRVLRSRSQEKPKAPESSNN 355 Query: 1275 EVENSANEGRKSKQRKGRM-KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 1451 ++ +K K+RK R K I +E+S+IR HLRYLL+R+ YE++LI AYS EGWKG Sbjct: 356 SGNVNSTGDKKGKRRKKRRGKNIVADEYSKIRAHLRYLLNRMSYEQSLITAYSGEGWKGL 415 Query: 1452 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 1631 SLEKLKPEKELQRA S+I R K+KIRDLFQ +D +EGRFP SLFDSEGQIDSEDIFCA Sbjct: 416 SLEKLKPEKELQRATSEITRRKVKIRDLFQHIDSLCSEGRFPSSLFDSEGQIDSEDIFCA 475 Query: 1632 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1811 KCGSKDL +NDIILCDGAC+RGFHQFCL PPLL+EDIPPDDEGWLCPGCDCK+DC+G+L Sbjct: 476 KCGSKDLNADNDIILCDGACDRGFHQFCLIPPLLREDIPPDDEGWLCPGCDCKVDCIGLL 535 Query: 1812 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSR 1991 ND Q + +S++D+WEKVFP EAAA ASG+ +D + Sbjct: 536 NDSQGTNISISDSWEKVFP-EAAATASGQKLDHNFGPSSDDSDDNDYEPDGPDIDKKSQE 594 Query: 1992 GESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2171 ES+SD SD+ SASD+ P D K+ Sbjct: 595 EESSSDESDFTSASDEFKAPPDGKEYLGLSSDDSEDDDYDPDAPVLEEKLKQESSSSDFT 654 Query: 2172 XXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTE 2351 L A + +DE H P +P V++G K K +K +SLN EL + E Sbjct: 655 SDSEDLAATINGDGLSLEDECHMP-----IEPRGVSNGRKSKFDGKKMQSLNSELLSMLE 709 Query: 2352 -----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIR 2516 + VSGKR +RLDYKKL+DE Y N S SSD+DYT+T G ++R+ N G Sbjct: 710 PDLCQDESATVSGKRNVDRLDYKKLYDETYGN-ISTSSDDDYTDTVGPRKRRKNTGDVAT 768 Query: 2517 ICTNKIQDRNDRTDTMDG------NQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXX 2678 + N D + T +G NQ KEN + E + + Sbjct: 769 VTAN-----GDASVTENGMNSKNMNQELKENKRNPERGTCQNSSFQETNVSPAKSYVGAS 823 Query: 2679 XXXXXXKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFEN 2855 K YK+LGEAV QRL F+ENQYP + K SLA+ELG+ +QV KWF N Sbjct: 824 LSGSSGKSVRPSAYKKLGEAVTQRLYSYFRENQYPDRAAKASLAEELGITFEQVNKWFVN 883 Query: 2856 ARWSFRHSSRMESKMVAAASTNGS 2927 ARWSF HSS + +AS GS Sbjct: 884 ARWSFNHSSSTGTSKAESASGKGS 907 >ref|XP_002300247.2| homeobox family protein [Populus trichocarpa] gi|550348560|gb|EEE85052.2| homeobox family protein [Populus trichocarpa] Length = 930 Score = 481 bits (1239), Expect = e-133 Identities = 313/809 (38%), Positives = 415/809 (51%), Gaps = 12/809 (1%) Frame = +3 Query: 537 LHNCLLSGVSSSPTKEPPLKHGDEFVSGGGEPVVQKSVTTKSQQISLSEASVRECDCDSV 716 +H+ + SS + P E S Q S+ + +A + + + Sbjct: 129 VHSESSKAIDSSILLDEPRNSNTELSSCIANETSQASLEGLANDSRAEDAGLSLVEASNS 188 Query: 717 DNLKILDGSSMNSSFESLTAHDLGSENI--EPLEQKQDVAQDIGRKSPSETGVVASSELP 890 D ++D SS + S + S+ +PLE++Q ++ E G+ S Sbjct: 189 D---LIDESSYSQQTTSGQTREFHSDRACCKPLEERQKPGSELAENESMEIGIGLPSG-- 243 Query: 891 GPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATC 1070 I I N +P++ + + H I P + A E + T Sbjct: 244 ------------IAIENLEPLTELVTKSCPIKH-----IGLPPGDDISIPANEQI-RPTH 285 Query: 1071 DAHENHLDLQHSEPAEKDATNVASESVPH--EGTSLPSRKQISSLLPPVSNRVLRSRSQE 1244 D + D +H E + S+ VP + L +K SS S+RVLRS SQE Sbjct: 286 DKESKYPDCEHLEKLSGIVIGITSQGVPSVKRTSKLSGKKYTSSSRK--SDRVLRSNSQE 343 Query: 1245 KPKASESKAVEVE-NSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLID 1421 KPKA E NS E + +++K R K I +E+SRIR LRYLL+R+ YE++LI Sbjct: 344 KPKAPEPSNNSTNVNSTGEEKGKRRKKRRGKSIVADEYSRIRARLRYLLNRMSYEQSLIT 403 Query: 1422 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 1601 AYS EGWKG SLEKLKPEKELQRA S+I R K+KIRDLFQ +D EGRFP SLFDSEG Sbjct: 404 AYSGEGWKGLSLEKLKPEKELQRATSEIIRRKVKIRDLFQHIDSLCGEGRFPASLFDSEG 463 Query: 1602 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1781 QIDSEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQFCL PPLL+EDIPP DEGWLCPGC Sbjct: 464 QIDSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQFCLVPPLLREDIPPGDEGWLCPGC 523 Query: 1782 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1961 DCK+DC+ +LND Q + +S++D W+ VFP EAAA ASG+ +D Sbjct: 524 DCKVDCIDLLNDSQGTNISISDRWDNVFP-EAAAVASGQKLDY-NFGLSSDDSDDNDYDP 581 Query: 1962 XXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXX 2141 E S+ ES+SD SD+ SASD+ P D+KQ Sbjct: 582 DGPDIDEKSQEESSSDESDFSSASDEFEAPPDDKQYLGLPSDDSEDDDYDPDAPVLEEKL 641 Query: 2142 XXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRS 2321 L A L DE H P +P+ ++G + + G +K S Sbjct: 642 KQESSSSDFTSDSEDLDATLNGDGLSLGDEYHMP-----IEPHEDSNGRRSRFGGKKNHS 696 Query: 2322 LNDELSYLTESNTE-----AVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKR 2486 LN +L + E ++ VSGKR ERLDYKKL+DE Y N S SSD+DYT+T ++ Sbjct: 697 LNSKLLSMLEPDSHQEKSAPVSGKRNIERLDYKKLYDETYGN-ISTSSDDDYTDTVAPRK 755 Query: 2487 RKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 2663 R+ N G A+ I + ++ + NQ K+N+H+ G++H+ Sbjct: 756 RRKNTGDVAMGIANGDASVTENGLNSKNMNQELKKNEHT-SGRTHQNSSFQDTNVSPAKT 814 Query: 2664 XXXXXXXXXXXKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVG 2840 K YK+LGEAV Q+L FKEN+YP Q K SLA+ELG+ +QV Sbjct: 815 HVGESLSGSSSKRVRPSAYKKLGEAVTQKLYSFFKENRYPDQAAKASLAEELGITFEQVN 874 Query: 2841 KWFENARWSFRHSSRMESKMVAAASTNGS 2927 KWF NARWSF HSS + +AS GS Sbjct: 875 KWFMNARWSFNHSSPEGTSKAESASGKGS 903 >ref|XP_004289744.1| PREDICTED: uncharacterized protein LOC101296723 [Fragaria vesca subsp. vesca] Length = 1227 Score = 474 bits (1221), Expect = e-131 Identities = 327/887 (36%), Positives = 453/887 (51%), Gaps = 25/887 (2%) Frame = +3 Query: 561 VSSSPTKEPPLKHGDEFVSGGGEP---VVQKSVTTKS---QQISLSEASVRECDCDSVDN 722 + SS ++ PL+ VS GG VV ++V+ S Q L EA + C D + Sbjct: 357 LGSSFVQDEPLQTIIPVVSSGGNEQLRVVNENVSVPSLGEQAGLLPEAVSKTCQTDKLSR 416 Query: 723 LKILDGSSMNSSFESLTAHDLGSENIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEY 902 S++++ + + GS EP EQ+ + PS+ V +S Sbjct: 417 -------SLHTASDQINESGSGSVQCEPQEQRDQLGS-----LPSQNDQVKNSTAVSSSI 464 Query: 903 LEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHE 1082 +G + D ++NS+ G P E+ A ++ P T DA + Sbjct: 465 GFEQSGPSV-----DEMNNSVIGHLEPPPED-----ASKDHNKELIKPH-----TNDATQ 509 Query: 1083 NHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE 1262 N L+ SE A K+A+ +++ + + SR++ SL+ S+RVLRSR+ EKP+A E Sbjct: 510 NSC-LEPSETASKNASKNSTQFGCKDKRNSSSRRKSRSLVS--SDRVLRSRTSEKPEAPE 566 Query: 1263 ----------SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERN 1412 S +V ++ EG++ K++K +++ +EFSRIR+HLRY L+R+ YE++ Sbjct: 567 LSNNVATLDTSNSVANVSNEKEGKRKKRKKKHRERVAADEFSRIRSHLRYFLNRINYEKS 626 Query: 1413 LIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFD 1592 LIDAYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ+LD EG FPESLFD Sbjct: 627 LIDAYSSEGWKGNSLEKLKPEKELQRATSEILRRKSKIRDLFQRLDSLCAEGMFPESLFD 686 Query: 1593 SEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLC 1772 EGQIDSEDIFCAKCGS D+ +NDIILCDGAC+RGFHQ CL+PPLL E+IPPDDEGWLC Sbjct: 687 EEGQIDSEDIFCAKCGSLDVYADNDIILCDGACDRGFHQHCLEPPLLSEEIPPDDEGWLC 746 Query: 1773 PGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXX 1952 PGCDCK+DC+ +LND Q + LS+TD+WEKVFPE A AA++G+ + Sbjct: 747 PGCDCKVDCIDLLNDSQGTDLSITDSWEKVFPEAAVAASAGQHQENNQGLPSEDSDDDDY 806 Query: 1953 XXXXXXXXHEVSRGESTSDGSDYFSASDDI-VPPLDNKQIFXXXXXXXXXXXXXXXXXXX 2129 EV GES+SD S+Y SASD + P +++Q Sbjct: 807 DPDGPETDEEVQEGESSSDESEYASASDGLETPKTNDEQYLGIPSDDSEDDDFNPDAPDP 866 Query: 2130 XXXXXXXXXXXXXXXXXXXLGAIL-EDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGK 2306 L A+L ED +S EG SV E S + G+ K G+ Sbjct: 867 TEDVKQGSSSSDFTSDSEDLAAVLDEDRKSFENGEGPQSSVLEASTLLRGSGGKGSKRGQ 926 Query: 2307 RKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET 2471 ++ + DELS L ES+ + VSGKR ERLDYKKLHDE Y + + S DE+Y ET Sbjct: 927 KR-HFIKDELSSLIESDPGQDGSTPVSGKRHVERLDYKKLHDEEYGDIPT-SDDEEYIET 984 Query: 2472 AGAKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXX 2648 A ++RK AG+ + K T D + +N+H+ +K Sbjct: 985 AVPRKRKKGAGQVSPGSLKGKPSTIKKGKTTKDIKDDPDKNEHTPRRTPRRKSSANDNSS 1044 Query: 2649 XXXXXXXXXXXXXXXXKDA-AKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 2825 A Y+RLGEAV QRL SFKENQYP + +K+ LA+ELG+ Sbjct: 1045 SPNESLKSSPKSGSTSGRAKGSTYRRLGEAVTQRLYTSFKENQYPDRSMKERLAQELGVM 1104 Query: 2826 VQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSG 3005 +QV KWFENAR + + M + +S++ H +SP G Sbjct: 1105 AKQVSKWFENARHCVKAGLALPQAMRTQPNQAETSIKD-AHHDGAQKNESP--------G 1155 Query: 3006 MENLISSQVRPGNEECQITDAGEGKSVESEASGEKSTRKRKVDNQGS 3146 + ++ ++ ++ ++ S G K RK K D GS Sbjct: 1156 TADAVAGSCSQDVKDNKLATPKSSRAKTSAPKGRK--RKSKSDPGGS 1200 >gb|EXB76647.1| Homeobox protein [Morus notabilis] Length = 1031 Score = 472 bits (1214), Expect = e-130 Identities = 297/694 (42%), Positives = 380/694 (54%), Gaps = 20/694 (2%) Frame = +3 Query: 1095 LQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASE-SK 1268 L+ E + K N S+ + + SRK+ L V S+RVLRSR+QEK K+ E S Sbjct: 309 LEQLETSSKSLVNKPSQLGRKDKQTSKSRKKQYMLRSLVHSDRVLRSRTQEKLKSHELSN 368 Query: 1269 AVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKG 1448 + + E R +++K R ++ +EFSRIR L+Y +R+ YE+NLIDAYS EGWKG Sbjct: 369 TLSNIGNGVEKRMKERKKRRGTRVIADEFSRIRKRLKYFFNRIHYEQNLIDAYSSEGWKG 428 Query: 1449 QSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFC 1628 SLEKLKPEKELQRA+S+IFR KLKIRDLFQQLD EGRFP+SLFDSEGQIDSEDIFC Sbjct: 429 TSLEKLKPEKELQRAKSEIFRRKLKIRDLFQQLDSLCAEGRFPKSLFDSEGQIDSEDIFC 488 Query: 1629 AKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGM 1808 AKCGSKD++ NDIILCDGAC+RGFHQFCL+PPLL EDIPPDDEGWLCPGCDCK+DC + Sbjct: 489 AKCGSKDMSANNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGCDCKVDCFDL 548 Query: 1809 LNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVS 1988 LND + LSVTD+WEKVFPE AAAA GK D +V Sbjct: 549 LNDSYGTNLSVTDSWEKVFPEAAAAAREGKDQDHNLEFPSDDSEDDDYDPYGPEIVEKVE 608 Query: 1989 RGESTSDGSDYFSASDDI---VPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2159 ES+SD S+Y SA D++ PP D +Q F Sbjct: 609 GDESSSDESEYTSACDELEGEAPPKD-EQYFGLSSDDSEDNDFDPDDQDVDENAKQESSS 667 Query: 2160 XXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGR--SLNDE 2333 L L++G+ KDE P +++ KR G S+ DE Sbjct: 668 SDFTSDSEDLAFTLDEGQIAEKDE------VSSLDPTRSLGNAVMQSSKRGGNKSSIKDE 721 Query: 2334 LSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAGAKRRKS 2495 L + ES T +SGKR ERLDYK+LHDE Y + SDSS DED+T+ A ++RK Sbjct: 722 LLDILESGTGQDGSPPISGKRHVERLDYKRLHDETYGHLPSDSSDDEDWTDYAAPRKRKR 781 Query: 2496 NAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKV--XXXXXXXXXXXX 2669 G+ + N+ T D N E++ V + ++ V Sbjct: 782 TTGQVSSVSPNENASIIKNQTTTDAANNDLEDNEYVPRRRSRQNSVVTDENNIPNKLLQG 841 Query: 2670 XXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWF 2849 + +RLGEAV QRL +SFKENQY + K+SLA+ELGL QV KWF Sbjct: 842 SPKSGSTGRRRELSTNRRLGEAVTQRLYQSFKENQYLDRATKESLAQELGLTSYQVSKWF 901 Query: 2850 ENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQ 3029 ENARWS+RHSS + + AS S+L + + + + + + + +G N + Sbjct: 902 ENARWSYRHSSSKKPGISEHASKE-STLSPQTNKKLFETELNTSITNSTCNGALN--NEL 958 Query: 3030 VRPGN---EECQITDAGEGK--SVESEASGEKST 3116 R GN E C D G+GK E+SG+ ST Sbjct: 959 PRTGNAMPESCS-GDVGDGKVEMPTKESSGQTST 991 >emb|CBI22504.3| unnamed protein product [Vitis vinifera] Length = 977 Score = 465 bits (1196), Expect = e-128 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%) Frame = +3 Query: 885 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 1064 LP + ++S E + + +D I N E E +G + PEN+ Sbjct: 81 LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138 Query: 1065 TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 1244 + L S A KD N + + L R +S +RVLRSRSQE Sbjct: 139 ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184 Query: 1245 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 1424 KPKAS+ V SA+ RK +++K RM K +EF+RIR HLRYLL+R+ YE+NLIDA Sbjct: 185 KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243 Query: 1425 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 1604 YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD EGRFPESLFDSEGQ Sbjct: 244 YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303 Query: 1605 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1784 IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD Sbjct: 304 IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363 Query: 1785 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1949 CK+DC+ +LND Q + LSV D+WEKVFPE AAA SG + D+ Sbjct: 364 CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423 Query: 1950 XXXXXXXXXHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 2117 ES SD SD+ SASDD+V +N+Q Sbjct: 424 DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483 Query: 2118 XXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSED---------SKPN 2270 +++ + F S SED N Sbjct: 484 APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519 Query: 2271 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 2438 E+ + G++K +L DEL + ESN+ +S KR ERLDYKKLHDEAY N + Sbjct: 520 EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579 Query: 2439 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 2609 SDSS DED+TE ++RK+ +G + N T + N K+ H +E Sbjct: 580 SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 2610 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKEN 2771 ++ +KL K YK+LGEAV +RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 2772 QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 2948 QYP + +K+ LA+ELG+ +QV KWFENARWSFRH E S +A + S+ +T Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750 Query: 2949 VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 3122 +Q VL S +G+ S + + + +A GKS V+ +AS ++ +K Sbjct: 751 --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806 >ref|XP_002269077.1| PREDICTED: homeobox protein HAT3.1-like [Vitis vinifera] Length = 968 Score = 465 bits (1196), Expect = e-128 Identities = 311/779 (39%), Positives = 406/779 (52%), Gaps = 33/779 (4%) Frame = +3 Query: 885 LPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSA 1064 LP + ++S E + + +D I N E E +G + PEN+ Sbjct: 81 LPPADVTKNSLTEHLGLPPEDAIKNDGTEQLGFFPEVVTKSSIIEKLGQSEPPPENVA-- 138 Query: 1065 TCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQE 1244 + L S A KD N + + L R +S +RVLRSRSQE Sbjct: 139 ------RYSGLDQSGSAPKDLANKRTAKLVKRKYKL--RSSVSG------SRVLRSRSQE 184 Query: 1245 KPKASESKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHRVKYERNLIDA 1424 KPKAS+ V SA+ RK +++K RM K +EF+RIR HLRYLL+R+ YE+NLIDA Sbjct: 185 KPKASQPSDNFVNASASRERKGRKKK-RMNKTTADEFARIRKHLRYLLNRMSYEQNLIDA 243 Query: 1425 YSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQ 1604 YS EGWKGQS+EKLKPEKELQRA S+I R KL+IRDLFQ LD EGRFPESLFDSEGQ Sbjct: 244 YSAEGWKGQSVEKLKPEKELQRASSEISRRKLQIRDLFQHLDSLCAEGRFPESLFDSEGQ 303 Query: 1605 IDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCD 1784 IDSEDIFCAKC SKD++ +NDIILCDGAC+RGFHQFCL+PPLLKE+IPPDDEGWLCP CD Sbjct: 304 IDSEDIFCAKCESKDMSADNDIILCDGACDRGFHQFCLEPPLLKEEIPPDDEGWLCPACD 363 Query: 1785 CKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAA-----ASGKTMDEGTXXXXXXXXXXX 1949 CK+DC+ +LND Q + LSV D+WEKVFPE AAA SG + D+ Sbjct: 364 CKVDCMDLLNDSQGTKLSVIDSWEKVFPEAAAAGNNQDNNSGFSSDDSEDNDYDPDCPEV 423 Query: 1950 XXXXXXXXXHEVSRGES----TSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 2117 ES SD SD+ SASDD+V +N+Q Sbjct: 424 DEKGQGDKSSSDKFDESDEFDESDESDFTSASDDMVVSPNNEQCLGLPSDDSEDDDFDPD 483 Query: 2118 XXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSED---------SKPN 2270 +++ + F S SED N Sbjct: 484 APE------------------------IDEQVNQGSSSSDFTSDSEDFTATLDRRNFSDN 519 Query: 2271 AVASGEKLKAGKRKGRSLNDELSYLTESNT----EAVSGKRRGERLDYKKLHDEAYRNAT 2438 E+ + G++K +L DEL + ESN+ +S KR ERLDYKKLHDEAY N + Sbjct: 520 EDGLDEQRRFGRKKKDTLKDELLSVLESNSGQDNAPLSAKRHVERLDYKKLHDEAYGNVS 579 Query: 2439 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG-- 2609 SDSS DED+TE ++RK+ +G + N T + N K+ H +E Sbjct: 580 SDSSDDEDWTENVIPRKRKNLSGNVASV------SPNGNTSITENGTNTKDIKHDLEAAG 633 Query: 2610 -----KSHKKLKV-XXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKEN 2771 ++ +KL K YK+LGEAV +RL +SF+EN Sbjct: 634 CTPKRRTRQKLNFESTNNSLAESHKDSRSPGSTGEKSGQSSYKKLGEAVTERLYKSFQEN 693 Query: 2772 QYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME-SKMVAAASTNGSSLRTRVH 2948 QYP + +K+ LA+ELG+ +QV KWFENARWSFRH E S +A + S+ +T Sbjct: 694 QYPDRAMKEKLAEELGITSRQVSKWFENARWSFRHRPPKEASAGKSAVKKDASTSQT--- 750 Query: 2949 VMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKS-VESEASGEKSTRK 3122 +Q VL S +G+ S + + + +A GKS V+ +AS ++ +K Sbjct: 751 --DQKPEQEVVLRESSHNGVGKKESPKAGASKVD-RSKEANAGKSAVKKDASTSQTDQK 806 >ref|XP_002527467.1| Homeobox protein HAT3.1, putative [Ricinus communis] gi|223533107|gb|EEF34865.1| Homeobox protein HAT3.1, putative [Ricinus communis] Length = 896 Score = 464 bits (1195), Expect = e-127 Identities = 289/696 (41%), Positives = 383/696 (55%), Gaps = 9/696 (1%) Frame = +3 Query: 1014 PENMGSATCAPENMGSATCDAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSRKQIS 1193 P N A E +G DA + H + SE KDA + +S T+ SRK+ Sbjct: 153 PPNNEMKVPASEKLGPPH-DAEDKHWNGTQSEILSKDAVSNSSRLGRRVKTTAKSRKKYM 211 Query: 1194 SLLPPVSNRVLRSRSQEKPKASESKAVEVENSAN-EGRKSKQRKGRMKKIPVNEFSRIRT 1370 S+RV++ RSQEKPKA ES S+N E + K++K K + +E+S IR Sbjct: 212 LRCLRRSDRVMQYRSQEKPKAPESSTNLPNVSSNVEKTRKKKKKRERKSVEADEYSIIRK 271 Query: 1371 HLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLD 1550 +LRYLL+R+ YE++LI AYS EGWKG SLEKLKPEKELQRA S+I R K KIRDLFQ++D Sbjct: 272 NLRYLLNRIGYEQSLITAYSAEGWKGLSLEKLKPEKELQRATSEILRRKSKIRDLFQRID 331 Query: 1551 RSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPL 1730 EGRFPESLFDS+GQI SEDIFCAKCGSKDLT +NDIILCDGAC+RGFHQ+CL PPL Sbjct: 332 SLCGEGRFPESLFDSDGQISSEDIFCAKCGSKDLTADNDIILCDGACDRGFHQYCLVPPL 391 Query: 1731 LKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDE 1910 LKEDIPPDD+GWLCPGCDCK+DC+ +LN+ Q + +S++D+WEKVFPE AAA G+ D+ Sbjct: 392 LKEDIPPDDQGWLCPGCDCKVDCIDLLNESQGTNISISDSWEKVFPE---AAAPGQNPDQ 448 Query: 1911 GTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDGSDYFS-ASDDIVPPLDNKQIFXXXXX 2087 + ES+SD SD SD++ P +KQ Sbjct: 449 NFGPPSDDSDDNDYDPDIPEIDEKSQGDESSSDDSDDSDFTSDELEAPPGDKQQLGLSSE 508 Query: 2088 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKP 2267 L A L++ E +DE +S ++ Sbjct: 509 DSGDDDYDPDAPDLDDIVKEESSSSDFTSDSEDLAATLDNNELSGEDERR---ISVGTRG 565 Query: 2268 NAVASGEKLKAGKRKGRSLNDELSYLTESN-----TEAVSGKRRGERLDYKKLHDEAYRN 2432 ++ G K G++K +SL EL + E N + +SGKR ERLDYKKL+DE Y N Sbjct: 566 DSTKEGS--KRGRKKKQSLQSELLSIEEPNPSQDGSAPISGKRNVERLDYKKLYDETYGN 623 Query: 2433 ATSDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEG 2609 +SDSS DED+T+ GA +R+ K+ + TDT G Q+ KE ++ V Sbjct: 624 VSSDSSDDEDFTDDVGAVKRR----KSTQAALGSANGNASVTDT--GKQDLKETEY-VPK 676 Query: 2610 KSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQ-YKRLGEAVVQRLVESFKENQYPKQ 2786 +S ++L K Y+RLGE V + L SFKENQYP + Sbjct: 677 RSRQRLISENTSITPTKAHEGTSPSSSCGKTVRPSGYRRLGETVTKGLYRSFKENQYPDR 736 Query: 2787 DVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAA 2966 D K+ LA+ELG+ QQV KWFENARWSF HSS M++ + N S + ++ +A Sbjct: 737 DRKEHLAEELGITYQQVTKWFENARWSFNHSSSMDANRIGKTPENNSPVSKTTTILLESA 796 Query: 2967 KQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGE 3074 ++ V + DS + S ++ E + DA E Sbjct: 797 PET-VSGAAIDSAAQREESPKIGDAMVEIYVEDARE 831 >gb|EMJ01257.1| hypothetical protein PRUPE_ppa023106mg [Prunus persica] Length = 1058 Score = 462 bits (1188), Expect = e-127 Identities = 309/741 (41%), Positives = 396/741 (53%), Gaps = 43/741 (5%) Frame = +3 Query: 768 LTAHDL-----GSENIEPLEQKQDVAQDIGRKSPSETGVVASS----ELPGPEYLEHSNG 920 +T+H + GS EP +QK + + ++T SS E PGP Sbjct: 222 ITSHQINEFGSGSVPSEPAKQKDQLDSVPAQNDEAKTSKAVSSSTVFEQPGPSIE----- 276 Query: 921 EQIVISNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQ 1100 ++ PI +S P P S + + + M D +N LQ Sbjct: 277 ---AMTEDSPIGHSEP---------------PLEDLSKSLSDKEMEPLPEDVTQNS-SLQ 317 Query: 1101 HSEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPV-SNRVLRSRSQEKPKASESK--- 1268 E A K+A ++S P + + SRK+ V S+RVLRS++ EK K + K Sbjct: 318 QLETASKNALKISSCLGPKDKKNPKSRKRKYMSRSFVRSDRVLRSKTGEKEKPKDLKLSN 377 Query: 1269 ---AVEVENSA-----NEGRKSKQRKGRMKKIPV-NEFSRIRTHLRYLLHRVKYERNLID 1421 +E NS E +K K+RK R + +EFSRIRTHLRYLL+R+ YE++LID Sbjct: 378 NVATLESSNSIANVSNGEEKKRKKRKNRRDNRAIADEFSRIRTHLRYLLNRIGYEKSLID 437 Query: 1422 AYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEG 1601 AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ+L+ EG FPESLFDSEG Sbjct: 438 AYSGEGWKGSSLEKLKPEKELQRATSEILRRKLKIRDLFQRLESLCAEGMFPESLFDSEG 497 Query: 1602 QIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGC 1781 QIDSEDIFC KCGSKD++++NDIILCDGAC+RGFHQFCL+PPLL EDIPPDDEGWLCPGC Sbjct: 498 QIDSEDIFCGKCGSKDVSLDNDIILCDGACDRGFHQFCLEPPLLSEDIPPDDEGWLCPGC 557 Query: 1782 DCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXXXXXXXXXX 1961 DCK+DC+ +LND Q + LSVTD+WEKVFPE AAAA++G+ D Sbjct: 558 DCKVDCIDLLNDSQGTDLSVTDSWEKVFPEAAAAASAGENQD-NHGLPSDDSDDNDYDPD 616 Query: 1962 XXXXXHEVSRGESTSDGSDYFSASDDIVPPLDN-KQIFXXXXXXXXXXXXXXXXXXXXXX 2138 ++V ES+SD S+Y SASD + P N +Q Sbjct: 617 GPETDNKVQGEESSSDESEYASASDGLETPKSNDEQYLGLPSEDSEDDDYNPYAPDVNED 676 Query: 2139 XXXXXXXXXXXXXXXXLGAILEDGESPNKD-EGHFPSVSEDSKPNAVASGEKLKAGKRKG 2315 LGA L+D ++D EG + +DSKP+ SGE+ +K Sbjct: 677 VKQESSSSDFTSDSEDLGAALDDNIMSSEDVEGPKSTSLDDSKPHR-GSGEQSSISGQKK 735 Query: 2316 RSLNDELSYLTES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSS-DEDYTETAG 2477 SL DEL L ES + +SGKR ERLDYK+LHDEAY N +DSS DED+ + A Sbjct: 736 HSLKDELISLLESGPGQGESAPLSGKRHIERLDYKRLHDEAYGNVPTDSSDDEDWNDIAT 795 Query: 2478 AKRRKSNAGK-AIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXX 2654 ++RK G+ A R K + + T D + EN+++ H+K V Sbjct: 796 QRKRKKGTGQVANRSPNGKTSNIKNGVITKDIKPDVDENENTPRRMPHRKSNVEDTSNLS 855 Query: 2655 XXXXXXXXXXXXXXKDAAKQ---YKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLR 2825 A Y RLGEA QRL +SFKEN YP + +K+SLA+ELGL Sbjct: 856 NKSPKGSTKSGSTSGRAGSSRSTYSRLGEAATQRLCKSFKENHYPDRSMKESLARELGLM 915 Query: 2826 VQQ---------VGKWFENAR 2861 +Q V KWFENAR Sbjct: 916 AKQVIPSFILASVSKWFENAR 936 >ref|XP_006486963.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Citrus sinensis] gi|568867273|ref|XP_006486964.1| PREDICTED: homeobox protein HAT3.1-like isoform X2 [Citrus sinensis] Length = 1063 Score = 456 bits (1174), Expect = e-125 Identities = 309/794 (38%), Positives = 407/794 (51%), Gaps = 16/794 (2%) Frame = +3 Query: 810 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 989 EQ + I PS S+L E E S GE + ++ + + S + P Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE--LGASLELVVKSSIEQLKQPE 320 Query: 990 ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 1160 I P SAT +++ S++ D E L+ SE P A N A Sbjct: 321 ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373 Query: 1161 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 1337 SL + + SL+ S+RVLRSRS E+P ES + NS E ++ K+ K R KK Sbjct: 374 TKSLKNNYTVRSLIG--SDRVLRSRSGERPIPPESSINLADVNSIGERKQKKRNKIRRKK 431 Query: 1338 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 1517 I +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K Sbjct: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491 Query: 1518 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 1697 LKIRDLFQ+LD SL G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R Sbjct: 492 LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550 Query: 1698 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1877 GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE Sbjct: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608 Query: 1878 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDG-----SDYFSASDDI 2042 AA+G D + ES+SDG SD+ S SD++ Sbjct: 609 --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666 Query: 2043 VPPLDNKQIF--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGES 2216 P D+K L A+LED S Sbjct: 667 EAPADDKTYLGRSSEDSEDDEYNPDAPDLDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726 Query: 2217 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 2387 DEG + P ++G++ K G SLN+EL + + + V GKR Sbjct: 727 SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAAPVYGKRSS 778 Query: 2388 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 2561 ERLDYKKL+DE Y N DSSD++ G R+++ + K + K R T Sbjct: 779 ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838 Query: 2562 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVV 2741 + E +++ + + KL + Y+++GE V Sbjct: 839 KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKIGEEVT 898 Query: 2742 QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 2921 Q+L SFKENQYP + K+SLAKELGL QV KWFEN RWSF H S +K+ A S Sbjct: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPSSKNAKL--ANSEK 956 Query: 2922 GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 3101 G+ + + ++ V + +G EN+ SS+ + C D + + E + Sbjct: 957 GT--------CTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007 Query: 3102 GEKSTRKRKVDNQG 3143 S RK D G Sbjct: 1008 KPTSQTSRKRDRDG 1021 >ref|XP_006422879.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] gi|557524813|gb|ESR36119.1| hypothetical protein CICLE_v10027725mg [Citrus clementina] Length = 1063 Score = 455 bits (1171), Expect = e-125 Identities = 310/794 (39%), Positives = 404/794 (50%), Gaps = 16/794 (2%) Frame = +3 Query: 810 EQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHSNGEQIVISNKDPISNSIPGDFRLPH 989 EQ + I PS S+L E E S GE + S + + +SI +L Sbjct: 263 EQTPEFTPGISSHEPSVVNYKLGSQLEQTELGETSAGE-LGASLELVVKSSIEQLKQLE- 320 Query: 990 ENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSE-PAEKDATNVA--SESVPHE 1160 I P SAT +++ S++ D E L+ SE P A N A Sbjct: 321 ---VPITIPSTKTSAT---KHLQSSS-DLMEKKSCLEQSETPPNYVANNSACLGRKGKRA 373 Query: 1161 GTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVE-NSANEGRKSKQRKGRMKK 1337 SL + + SL+ S+RVLRSRS E+P ES + NS E ++ K+ K R KK Sbjct: 374 TKSLKNNYTVRSLIG--SDRVLRSRSGERPLPPESSNNLADVNSIGERKQKKRNKIRRKK 431 Query: 1338 IPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYK 1517 I +E+SRIRTHLRYLL+R+ YE+NLIDAYS EGWKG S+EKLKPEKELQRA S+I R K Sbjct: 432 IVADEYSRIRTHLRYLLNRINYEQNLIDAYSSEGWKGLSVEKLKPEKELQRATSEILRRK 491 Query: 1518 LKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACER 1697 LKIRDLFQ+LD SL G FP+SLFDSEGQIDSEDI+CAKCGSKDL+ +NDIILCDGAC+R Sbjct: 492 LKIRDLFQRLD-SLCAGGFPKSLFDSEGQIDSEDIYCAKCGSKDLSADNDIILCDGACDR 550 Query: 1698 GFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEA 1877 GFHQ+CL+PPLLKEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q + L +TDNWEKVFPE Sbjct: 551 GFHQYCLEPPLLKEDIPPDDEGWLCPGCDCKVDCIDLVNELQGTRLFITDNWEKVFPE-- 608 Query: 1878 AAAASGKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDG-----SDYFSASDDI 2042 AA+G D + ES+SDG SD+ S SD++ Sbjct: 609 --AAAGHNQDPNFGLASDDSDDNEYDPDGSATDEQDEGDESSSDGSSSDDSDFTSTSDEV 666 Query: 2043 VPPLDNKQI--FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGES 2216 P D+K L A+LED S Sbjct: 667 EAPADDKTYLGLSSEDSEDDEYNPDAPELDDKVTQESSSSGSDFTSDSEDLAAVLEDNRS 726 Query: 2217 PNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEA---VSGKRRG 2387 DEG + P ++G++ K G SLN+EL + + + V GKR Sbjct: 727 SGNDEG-------AASPLGHSNGQRYKDG-GNNESLNNELLSIIKPGQDGAVPVYGKRSS 778 Query: 2388 ERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICT--NKIQDRNDRTDT 2561 ERLDYKKL+DE Y N DSSD++ G R+++ + K + K R T Sbjct: 779 ERLDYKKLYDETYGNVPYDSSDDESWSDDGGPRKRTKSTKEGSSASPDGKTPVIRRRKST 838 Query: 2562 MDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVV 2741 + E +++ + + KL + Y++LGE V Sbjct: 839 KAAKEKLNETENTPKRRGRPKLNTEDSNISPAKSHEGCSTPGSRGRRHRTSYRKLGEEVT 898 Query: 2742 QRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTN 2921 Q+L SFKENQYP + K+SLAKELGL QV KWFEN RWSF H S S N Sbjct: 899 QKLYNSFKENQYPNRTTKESLAKELGLTFSQVRKWFENTRWSFNHPS----------SKN 948 Query: 2922 GSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEECQITDAGEGKSVESEAS 3101 + + + ++ V + +G EN+ SS+ + C D + + E + Sbjct: 949 AELANSEKGTCTPQSNKNTVGRVSNCNGAENVQSSKTGVDDTGCMTGDV-KNNTQECNSI 1007 Query: 3102 GEKSTRKRKVDNQG 3143 S RK D G Sbjct: 1008 KPTSQTSRKRDRDG 1021 >gb|EOX98399.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] gi|508706504|gb|EOX98400.1| Homeodomain-like protein with RING/FYVE/PHD-type zinc finger domain, putative isoform 1 [Theobroma cacao] Length = 950 Score = 455 bits (1171), Expect = e-125 Identities = 305/762 (40%), Positives = 392/762 (51%), Gaps = 35/762 (4%) Frame = +3 Query: 735 DGSSMNSSFESLTAHDLGSENIEPLEQKQDVAQDIGRKSPSETGVVASSELPGPEYLEHS 914 + SS ++ + L+ L S +V +++G +SP + + S LP Sbjct: 180 EDSSKHTKTDKLSCPQLVSSEPTVNFGSGNVCKELG-ESPEQRQQLDSESLP-------- 230 Query: 915 NG-EQIVISNKDPISNSI----PGDFRLPHENGAAICAPENM-----GSATCAPENMGSA 1064 NG E+ I+ +SN P D H G PE + S + E +G Sbjct: 231 NGIEESTIAVSSNVSNQALQLKPEDMGKSHCGGHLHSPPEGVTNVIQSSKSPLVEPLGLP 290 Query: 1065 TCDAHENHLDLQHSEPAEKDATNVASESVPHEG----------------TSLPSRKQISS 1196 A N Q P E A N E HE TS +K+ Sbjct: 291 QEFAQGNPSTQQSGLPCEDMAQNSGVEQ--HETKPKNLLENSGRRRNGKTSKTIKKKYML 348 Query: 1197 LLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPV-NEFSRIRTH 1373 S+RVLRS+ QEKPKA+ES + ++E +K ++R+ R V +EFSRIRTH Sbjct: 349 RSLRSSDRVLRSKLQEKPKATESSNNLADVGSSEQQKRRKRRRRKANREVADEFSRIRTH 408 Query: 1374 LRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDR 1553 LRYLL+R+ YER+LI AYS EGWKG SLEKLKPEKELQRA S+I R KLKIRDLFQ +D Sbjct: 409 LRYLLNRINYERSLIAAYSTEGWKGLSLEKLKPEKELQRATSEILRRKLKIRDLFQHIDS 468 Query: 1554 SLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLL 1733 EG+ PESLFDSEGQIDSEDIFCAKCGSKDL+ NDIILCDGAC+RGFHQ+CL PPLL Sbjct: 469 LCAEGKLPESLFDSEGQIDSEDIFCAKCGSKDLSANNDIILCDGACDRGFHQYCLQPPLL 528 Query: 1734 KEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG 1913 KEDIPPDDEGWLCPGCDCK+DC+ ++N+ Q ++ S+TD+WEKVFP EAA AA+G+ D Sbjct: 529 KEDIPPDDEGWLCPGCDCKVDCIELVNESQGTSFSITDSWEKVFP-EAAVAAAGQNQDPN 587 Query: 1914 TXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXX 2093 + ES+S+ S++ S S+++ P Q Sbjct: 588 FGLPSDDSDDNDYNPDGSETDEKDHGDESSSEESEFTSTSEELEVPAKVDQYLGLPSDDS 647 Query: 2094 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFP-SVSEDSKPN 2270 L A+LE+ + KDEG S DSK Sbjct: 648 EDDDYDPDGPNHDEVVKPESSSSDFSSDSEDLDAMLEEDITSQKDEGPMANSAPRDSKRR 707 Query: 2271 AVASGEKLKAGKRKGRSLNDELSYLTESNTE----AVSGKRRGERLDYKKLHDEAYRNAT 2438 GEK S+NDEL + E +E A+S KR ERLDYK+L+DE Y N Sbjct: 708 KPKLGEK--------ESMNDELLSIMEPASEQDGSAISKKRSIERLDYKRLYDETYGNVP 759 Query: 2439 SDSS-DEDYTETAGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDG-NQNHKENDHSVEGK 2612 S SS DED+++ ++R + N + DG QN +E +H K Sbjct: 760 SSSSDDEDWSDITAPRKRNKCTAEVASAPENGNVSVSRTVSVSDGLKQNPEETEHKPRRK 819 Query: 2613 SHKKLKVXXXXXXXXXXXXXXXXXXXXXKDA-AKQYKRLGEAVVQRLVESFKENQYPKQD 2789 + + + K A + YKRLGEAV QRL +SFKENQYP + Sbjct: 820 TRQMSRFKDTDSSPAEIQGNTSVSGSSGKKAGSSTYKRLGEAVKQRLYKSFKENQYPDRA 879 Query: 2790 VKQSLAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 2915 KQSLAKEL + QQV KWF+NARWSF +S + AS Sbjct: 880 TKQSLAKELDMTFQQVSKWFDNARWSFNNSPSSHETIANNAS 921 >ref|XP_006589630.1| PREDICTED: pathogenesis-related homeodomain protein isoform X1 [Glycine max] Length = 820 Score = 451 bits (1161), Expect = e-124 Identities = 297/693 (42%), Positives = 374/693 (53%), Gaps = 22/693 (3%) Frame = +3 Query: 903 LEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPE--NMGSATC 1070 LE S EQ+ + SN P + P + E +I A G P NM S Sbjct: 93 LEQSTVEQVTVDLSNDKPENKCKPLSENVQSEPVESIPAVVVEGQMQSNPSQANMSSV-- 150 Query: 1071 DAHENHLDLQHSEPAEKDATNVASESVPHEGTSLPSR---KQISSLLPPV-------SNR 1220 N L Q S A + ++ SE + + T SR K+ S LL S+R Sbjct: 151 ----NELLDQPSGDAVNNISSNCSEKMSNSPTHSQSRRKGKKNSKLLKKYMLRSLGSSDR 206 Query: 1221 VLRSRSQEKPKASE--SKAVEVENSANEGRKSKQRKGRMKKIPVNEFSRIRTHLRYLLHR 1394 LRSR++EKPK E S V+ N+ + + +++K R ++ N+FSRIR+HLRYLL+R Sbjct: 207 ALRSRTKEKPKEPEPTSNLVDGNNNGVKRKSGRKKKKRKEEGITNQFSRIRSHLRYLLNR 266 Query: 1395 VKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRF 1574 + YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLFQ LD EG+F Sbjct: 267 ISYENSLIDAYSGEGWKGYSIEKLKPEKELQRAKSEILRRKLKIRDLFQNLDSLCAEGKF 326 Query: 1575 PESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPD 1754 PESLFDS G+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPP+L EDIPP Sbjct: 327 PESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPMLTEDIPPG 386 Query: 1755 DEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXX 1934 DEGWLCPGCDCK DC+ ++ND ++LS++D WE+VFPE AA+ +G MD + Sbjct: 387 DEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNSGVPSDD 444 Query: 1935 XXXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXX 2114 +V ES+SD S+Y SAS+ + Q Sbjct: 445 SDDDDYNPNGPDDV-KVEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDDGDYDP 503 Query: 2115 XXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKL 2294 L A +ED SP +D G +S K V G+KL Sbjct: 504 DAPDVECKVNEESSSSDFTSDSEDLAAAIEDNTSPGQDGG----ISSSKKKGKV--GKKL 557 Query: 2295 KAGKRKGRSLNDELSYLTE--SNTEA---VSGKRRGERLDYKKLHDEAYRNATSDSSDED 2459 SL DELS L E S EA VSGKR ERLDYKKL++E Y + TSD DED Sbjct: 558 --------SLPDELSSLLEPDSGQEAPTPVSGKRHVERLDYKKLYEETYHSDTSD--DED 607 Query: 2460 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 2636 + +TA +K G + N N T + +QN+ EN ++ KS Sbjct: 608 WNDTAAPSGKKKLTGNVTPVSPNGNASNNSIHTPKRNAHQNNVENTNNSPTKS------- 660 Query: 2637 XXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 2816 K + +KRLGEAVVQRL +SFKENQYP + K+SLA+EL Sbjct: 661 --------LEGCSKSGSRDKKSGSSAHKRLGEAVVQRLHKSFKENQYPDRTTKESLAQEL 712 Query: 2817 GLRVQQVGKWFENARWSFRHSSRMESKMVAAAS 2915 GL QQV KWF N RWSFRHSS+ME+ AS Sbjct: 713 GLTYQQVAKWFGNTRWSFRHSSQMETNSGINAS 745 >ref|XP_003555282.1| PREDICTED: homeobox protein HAT3.1-like isoform X1 [Glycine max] Length = 820 Score = 447 bits (1149), Expect = e-122 Identities = 303/773 (39%), Positives = 394/773 (50%), Gaps = 31/773 (4%) Frame = +3 Query: 903 LEHSNGEQIVIS-------NK-DPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMG 1058 LE S EQ+ + NK P+S ++ + P E+ A M S+ A NM Sbjct: 93 LEQSTVEQVSVDLSNDKSENKCKPLSENVQSE---PVESIPAFVVDGQMQSSP-AQANMS 148 Query: 1059 SATCDAHENHLDLQHSEPAEKDATNVASE--SVPHEGTSLPSRKQISSLLPPV------- 1211 S N L Q S + TN + + + P S K+ S LL Sbjct: 149 SV------NELLDQPSGDVVNNITNCSEKMSNSPSHSQSRRKGKRNSKLLKKKYMLRSLG 202 Query: 1212 -SNRVLRSRSQEKPKASESKAVEVENSANEGRKSK---QRKGRMKKIPVNEFSRIRTHLR 1379 S R LRSR++EKPK E + V+ ++N+G K K ++K R ++ ++FSRIR+HLR Sbjct: 203 SSGRALRSRTKEKPKEPEPTSNLVDGNSNDGVKRKSGRKKKKRREEGITDQFSRIRSHLR 262 Query: 1380 YLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSL 1559 YLL+R+ YE +LIDAYS EGWKG S+EKLKPEKELQRA+S+I R KLKIRDLF+ LD Sbjct: 263 YLLNRISYENSLIDAYSGEGWKGYSMEKLKPEKELQRAKSEILRRKLKIRDLFRNLDSLC 322 Query: 1560 TEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKE 1739 EG+FPESLFDS G+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPPLL E Sbjct: 323 AEGKFPESLFDSAGEIDSEDIFCAKCQSKELSTNNDIILCDGVCDRGFHQLCLDPPLLTE 382 Query: 1740 DIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTX 1919 DIPP DEGWLCPGCDCK DC+ ++ND ++LS++D WE+VFPE AA+ +G MD Sbjct: 383 DIPPGDEGWLCPGCDCKDDCMDLVNDSFGTSLSISDTWERVFPE--AASFAGNNMDNNLG 440 Query: 1920 XXXXXXXXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXX 2099 ++ ES+SD S+Y SAS+ + Q Sbjct: 441 LPSDDSDDDDYNPNGSDDV-KIEGDESSSDESEYASASEKLEGGSHEDQYLGLPSEDSDD 499 Query: 2100 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVA 2279 L A ED SP +D G + Sbjct: 500 GDYDPDAPDVDCKVNEESSSSDFTSDSEDLAAAFEDNTSPGQDGG------------INS 547 Query: 2280 SGEKLKAGKRKGRSLNDELSYLTESNT-----EAVSGKRRGERLDYKKLHDEAYRNATSD 2444 S +K K GK S+ DELS L E ++ VSGKR ERLDYKKL++E Y + TSD Sbjct: 548 SKKKGKVGK---LSMADELSSLLEPDSGQGGPTPVSGKRHVERLDYKKLYEETYHSDTSD 604 Query: 2445 SSDEDYTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHK 2621 DED+ + A R+K G + N N T + +QN EN +S KS Sbjct: 605 --DEDWNDAAAPSRKKKLTGNVTPVSPNANASNNSIHTLKRNAHQNKVENTNSSPTKS-- 660 Query: 2622 KLKVXXXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQS 2801 + + +KRLGEAVVQRL +SFKENQYP + K+S Sbjct: 661 -------------LDGRSKSGSRDKRSGSSAHKRLGEAVVQRLHKSFKENQYPDRSTKES 707 Query: 2802 LAKELGLRVQQVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPV 2981 LA+ELGL QQV KWF+N RWSFRHSS+ME+ AS + R SP Sbjct: 708 LAQELGLTYQQVAKWFDNTRWSFRHSSQMETNSGRNASPEATDGRAENEGEKQCESMSPE 767 Query: 2982 LPSPSDSGMENLISSQVRPGNEECQITDAGEGKSV----ESEASGEKSTRKRK 3128 + + + + E Q+ G S +++ + TRKRK Sbjct: 768 VSGKNSKTTSSRKRKHLSEPLSEAQLDINGLATSSPNVHQTQVGNKMKTRKRK 820 >ref|XP_004496910.1| PREDICTED: pathogenesis-related homeodomain protein-like isoform X1 [Cicer arietinum] Length = 995 Score = 444 bits (1143), Expect = e-121 Identities = 285/695 (41%), Positives = 370/695 (53%), Gaps = 11/695 (1%) Frame = +3 Query: 1104 SEPAEKDATNVASESVPHEGTSLPSRKQISSLLPPVSNRVLRSRSQEKPKASE--SKAVE 1277 SE K + ++ S + L + + SL S+R LRSR+++KPK E + V+ Sbjct: 309 SERKSKSSAHLRSRHKGKSNSKLSKKYILRSL--GSSDRALRSRTRDKPKDPEPINNVVD 366 Query: 1278 VENSANEGRKSKQRKG-RMKKIPVNE-FSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQ 1451 V N A + ++ K++K R +K +N+ +S+IR HLRYLL+R+ YE+NLIDAYS EGWKG Sbjct: 367 VSNDAMKTKRGKKKKKKRPRKEGINDQYSKIRAHLRYLLNRISYEQNLIDAYSGEGWKGY 426 Query: 1452 SLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCA 1631 SLEKLKPEKE+QRA+S+I R KLKIRDLFQ LD EGR PESLFDS+G+IDSEDIFCA Sbjct: 427 SLEKLKPEKEIQRAKSEILRRKLKIRDLFQNLDSLCAEGRLPESLFDSKGEIDSEDIFCA 486 Query: 1632 KCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGML 1811 KC +K L +NDIILCDGAC+RGFHQ CLDPPLL EDIPP DEGWLCPGCDCK DC+ ++ Sbjct: 487 KCQTKVLGTDNDIILCDGACDRGFHQLCLDPPLLTEDIPPGDEGWLCPGCDCKDDCIELV 546 Query: 1812 NDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEG--TXXXXXXXXXXXXXXXXXXXXHEV 1985 ND + LS+T+ WE+VFPE A AA S + G + EV Sbjct: 547 NDLLGTNLSLTNTWERVFPEAATAAGSILDHNSGLPSDDSEDDDYNPNGPEDVEVEDAEV 606 Query: 1986 SRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2165 ES+SD S+Y SAS+ + Q Sbjct: 607 EGDESSSDESEYASASEKLEDSRHEDQYLGLPSEDSEDDDFDPDAPDLGGKVTEESSSSD 666 Query: 2166 XXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYL 2345 L A ++D S +D + +D K S + K RK S+ DELS L Sbjct: 667 FTSDSEDLAATIKDNMSTGQDGDITSPLLDDVKNLKGFSRQNHKV--RKKPSMADELSSL 724 Query: 2346 TES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTETAGAKRRKSNAGKA 2510 +S + ++ KR ERLDY+KL++E Y++ TSD DED+ +A R+K AGK Sbjct: 725 LKSDLGQEDITPITAKRNVERLDYQKLYEETYQSDTSD--DEDWDASATPSRKKKLAGKM 782 Query: 2511 IRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXX 2690 + N N R Q HK VE ++ K Sbjct: 783 TPVSPNGNASNNSRHTASRNTQQHK-----VENTNNSPTKT----------LEGCTKSGS 827 Query: 2691 XXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSF 2870 K YKRLGEAVVQRL +SFKENQYP++ K+SLA+ELGL QQV KWF N RWSF Sbjct: 828 RDKRRGLTYKRLGEAVVQRLYKSFKENQYPERTTKESLAQELGLTFQQVDKWFGNTRWSF 887 Query: 2871 RHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGMENLISSQVRPGNEE 3050 RHSS E A+ +N S T + + + + G+EN G E Sbjct: 888 RHSSHTE----ASPGSNASQQATDSGAENKEERGNASQQATDSPGVEN-------KGEGE 936 Query: 3051 CQITDAGEGKSVESEASGEKSTRKRKVDNQGSGAG 3155 C++ G + S +K RKR + Q S AG Sbjct: 937 CELVSQGTSREKSRTQSSKK--RKRLSEPQVSEAG 969 >gb|ESW15073.1| hypothetical protein PHAVU_007G041800g [Phaseolus vulgaris] Length = 826 Score = 433 bits (1113), Expect = e-118 Identities = 286/686 (41%), Positives = 373/686 (54%), Gaps = 21/686 (3%) Frame = +3 Query: 900 YLEHSNGEQIVI--SNKDPISNSIPGDFRLPHENGAAICAPENMGSATCAPENMGSATCD 1073 +L+ S +++ + SN +P + S P P E+ A S+ A Sbjct: 91 HLQQSTDKEVSLQLSNDEPENPSQPLSENEPVESAPAFAGDGQKQSSPAL------ANTS 144 Query: 1074 AHENHLDLQHSEP----AEKDATNVASESVPHEGTSLPSRKQISSLLPPV--SNRVLRSR 1235 N LD + +EK + + A+ + +G + + +L V S+R LRS+ Sbjct: 145 YVNNMLDPPSGDAVINCSEKVSNSPANSQLRRKGKKNSKFLKKTYMLRSVGSSDRALRSK 204 Query: 1236 SQEKPKASESKAVEVE---NSANEGRKSKQRKGRMKKIPV---NEFSRIRTHLRYLLHRV 1397 ++E PK E + V+ N+ N+G K K K + K V ++FSRI++HLRYLL+R+ Sbjct: 205 TKENPKTPEPNSNLVDCNNNNNNDGVKKKSFKKKRKSGEVGITDQFSRIKSHLRYLLNRI 264 Query: 1398 KYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFP 1577 YE+NLIDAYS EGWKG S+EKLKPEKELQRA+S+I R KL IR+LF+ LD TEG+ P Sbjct: 265 GYEKNLIDAYSAEGWKGYSMEKLKPEKELQRAKSEIIRRKLNIRELFRNLDSLCTEGKLP 324 Query: 1578 ESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQFCLDPPLLKEDIPPDD 1757 ESLFDSEG+IDSEDIFCAKC SK+L+ NDIILCDG C+RGFHQ CLDPPLL EDIPP D Sbjct: 325 ESLFDSEGEIDSEDIFCAKCHSKELSSNNDIILCDGVCDRGFHQLCLDPPLLTEDIPPGD 384 Query: 1758 EGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAASGKTMDEGTXXXXXXX 1937 EGWLCPGCDCK DC+ ++ND ++LS++D WE+VFP EAAAAA KT + Sbjct: 385 EGWLCPGCDCKDDCMDLINDSFGTSLSISDTWERVFP-EAAAAAGNKT--DNNSGLPSDD 441 Query: 1938 XXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXX 2117 +V ES+SD SDY SAS+++ Q Sbjct: 442 SDDDDYNPNGPEDVKVEGDESSSDESDYASASENL-EGSHGDQYLGLPSDDSDDGDYDPA 500 Query: 2118 XXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGE-KL 2294 L A + + SP +D G S S D + G+ K Sbjct: 501 APDADSKVNVESSSSDFTSDSDDLPAAIVENTSPGQD-GEIRSASLDDVKCLNSYGKRKG 559 Query: 2295 KAGKRKGRSLNDELSYLTE-----SNTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDED 2459 KAGK+ S+ DELS L E + VSG+R ERLDYKKL+DEAY + TS+ DED Sbjct: 560 KAGKK--LSMADELSSLLEPDSGQEGSTPVSGRRNLERLDYKKLYDEAYHSDTSE--DED 615 Query: 2460 YTETAGAKRRKSNAGKAIRICTNKIQDRND-RTDTMDGNQNHKENDHSVEGKSHKKLKVX 2636 +T T R+K G A + + N T +G+Q EN + KS Sbjct: 616 WTATVTPSRKKK--GNATPVSPDGNASNNSMHTPKRNGHQKKFENTKNSPAKS------- 666 Query: 2637 XXXXXXXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKEL 2816 K + YKRLGEAVV+RL SFKENQYP + K+SLA+EL Sbjct: 667 --------LDDHVKSDSRKQKSKSSAYKRLGEAVVERLHISFKENQYPDRTTKESLAQEL 718 Query: 2817 GLRVQQVGKWFENARWSFRHSSRMES 2894 GL QQV KWF+N RWSFRHSS+ME+ Sbjct: 719 GLTCQQVAKWFDNTRWSFRHSSQMET 744 >sp|P48786.1|PRH_PETCR RecName: Full=Pathogenesis-related homeodomain protein; Short=PRHP gi|666128|gb|AAA62237.1| homeodomain protein [Petroselinum crispum] Length = 1088 Score = 429 bits (1103), Expect = e-117 Identities = 264/640 (41%), Positives = 353/640 (55%), Gaps = 26/640 (4%) Frame = +3 Query: 1170 LPSRKQISSLLPPVSNRVLRSRSQEKPKASESKAVEVENSANEGRKSKQRKGRMKKIPVN 1349 +P + + S L S+R LRSRSQEK + + + A+ + K+RK RM++ V+ Sbjct: 429 VPEKGKDSQELSVNSSRSLRSRSQEKSIEPDVNNIVADEGADREKPRKKRKKRMEENRVD 488 Query: 1350 EFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKELQRARSQIFRYKLKIR 1529 EF RIRTHLRYLLHR+KYE+N +DAYS EGWKGQSL+K+KPEKEL+RA+++IF KLKIR Sbjct: 489 EFCRIRTHLRYLLHRIKYEKNFLDAYSGEGWKGQSLDKIKPEKELKRAKAEIFGRKLKIR 548 Query: 1530 DLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIENDIILCDGACERGFHQ 1709 DLFQ+LD + +EGR PE LFDS G+IDSEDIFCAKCGSKD+T+ NDIILCDGAC+RGFHQ Sbjct: 549 DLFQRLDLARSEGRLPEILFDSRGEIDSEDIFCAKCGSKDVTLSNDIILCDGACDRGFHQ 608 Query: 1710 FCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVTDNWEKVFPEEAAAAA 1889 FCLDPPLLKE IPPDDEGWLCPGC+CK+DC+ +LND QE+ + + D+WEKVF EEAAAAA Sbjct: 609 FCLDPPLLKEYIPPDDEGWLCPGCECKIDCIKLLNDSQETNILLGDSWEKVFAEEAAAAA 668 Query: 1890 SGKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGESTSDGSDYFSASDDIVPPLDNKQI 2069 SGK +D+ + +V +S++D SDY S SDD+ Q+ Sbjct: 669 SGKNLDDNSGLPSDDSEDDDYDPGGPDLDEKVQGDDSSTDESDYQSESDDM-------QV 721 Query: 2070 FXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGAILEDGESPNKDEGHFPSV 2249 + D S ++D F V Sbjct: 722 IRQKNSRGLPSDDSEDDEYDPSGLVTDQMYK---------DSSCSDFTSDSED---FTGV 769 Query: 2250 SEDSKPNAVASGEKLKAGKRKGRSLNDELSYLTESNTEAVSGKRRGERLDYKKLHD---- 2417 +D K A G L + R+ + + + +T + +R+ E LDYKKL+D Sbjct: 770 FDDYKDTGKAQG-PLASTPDHVRNNEEGCGHPEQGDTAPLYPRRQVESLDYKKLNDIEFS 828 Query: 2418 ----------------------EAYRNATSDSSDEDYTETAGAKRRKSNAGKAIRICTNK 2531 E Y N +SDSSDEDY T+ K+N+ K Sbjct: 829 KMCDILDILSSQLDVIICTGNQEEYGNTSSDSSDEDYMVTSSPD--KNNSDKEA-----T 881 Query: 2532 IQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXXXXXXXXXXXXXKDAAK 2711 +R + ++ +Q +E+ H+ + KK V K +K Sbjct: 882 AMERGRESGDLELDQKARESTHN--RRYIKKFAVEGTDSFLSRSCEDSAAPVAGSKSTSK 939 Query: 2712 QYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGKWFENARWSFRHSSRME 2891 GE QRL++SFKENQYP++ VK+SLA EL L V+QV WF N RWSFRHSSR+ Sbjct: 940 TLH--GEHATQRLLQSFKENQYPQRAVKESLAAELALSVRQVSNWFNNRRWSFRHSSRIG 997 Query: 2892 SKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDSGME 3011 S VA +N + + + + + K VL S + S +E Sbjct: 998 SD-VAKFDSNDTPRQKSIDMSGPSLKS--VLDSATYSEIE 1034 >ref|XP_004140812.1| PREDICTED: uncharacterized protein LOC101204775 [Cucumis sativus] Length = 1061 Score = 416 bits (1068), Expect = e-113 Identities = 314/903 (34%), Positives = 439/903 (48%), Gaps = 53/903 (5%) Frame = +3 Query: 642 KSVTTKSQQISLSEASVRECDCDSVDNLKILDGSSMNSSFESL-TAHDLGSENIEPLEQK 818 ++ T+S+ ++EA V+ L L + S ++ L T + S+ P E+K Sbjct: 93 ENTDTESRPNKIAEAVQEAKASVEVEVLTCLSNEAKYSGYQELGTTPEFSSKIDGPDEEK 152 Query: 819 QDVAQDIGRKSPSETGVVAS--SELPGPEYLEHSNGEQI----VISNKDPISNSIPGDFR 980 V Q++ S G + S SE H++ +++ ++SN N + Sbjct: 153 AGVQQNMELGS----GYLLSELSEKDNQTISNHADNDRVEAGNLLSNDKDTKN-----LK 203 Query: 981 LPHENGAAICAPENMGSATCAPENMGSATCDAHENHLDLQHSEPAEKDATNVAS----ES 1148 L E+ A E C+ + T +N+++ + P D T + S E+ Sbjct: 204 LSIEDEATTLLNE------CSELPLEDVT----KNYIEKMN--PPIGDLTQITSIQSLET 251 Query: 1149 VPHEGTS--------LPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEG 1301 +P L S+K+ L VS+ RVLRSR+QEK KA E +A E Sbjct: 252 IPSNSQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEED 311 Query: 1302 RKSKQRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKP 1472 K K++K R K V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKP Sbjct: 312 GKRKKKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKP 371 Query: 1473 EKELQRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDL 1652 EKELQRA ++I R KLKIRDLFQ++D EGR ESLFDSEGQIDSEDIFCAKCGSK+L Sbjct: 372 EKELQRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKEL 431 Query: 1653 TIENDIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQEST 1832 ++ENDIILCDG C+RGFHQFCL+PPLL DIPPDDEGWLCPGCDCK DC+ +LN+FQ S Sbjct: 432 SLENDIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSN 491 Query: 1833 LSVTDNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSR 1991 LS+TD WEKV+PE AAAAA G D+ E S Sbjct: 492 LSITDGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSS 551 Query: 1992 GESTSDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2156 +S SD S+ Y SAS+ + ++ Q Sbjct: 552 DQSNSDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESS 611 Query: 2157 XXXXXXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDEL 2336 L A+ D +KD G S ++ P ++G+ +G K +L++EL Sbjct: 612 SSDFTSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQ--SSGPNKS-ALHNEL 665 Query: 2337 SYLTES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET---------- 2471 S L +S E VSG+R+ ERLDYKKLHDE Y N +DSSD+ Y T Sbjct: 666 SSLLDSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWD 725 Query: 2472 AGAKRRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXX 2651 +G ++R G + ND + +++K G + V Sbjct: 726 SGTRKR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSV------ 775 Query: 2652 XXXXXXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQ 2831 K + +RL + ++RL+ SF+EN+YPK+ KQSLA+ELGL ++ Sbjct: 776 TETPVDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLK 835 Query: 2832 QVGKWFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDS-GM 3008 QV KWFEN RWS RH S K ++S L +S +S +DS G Sbjct: 836 QVSKWFENTRWSTRHPS-SSGKKAKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGA 894 Query: 3009 ENLISSQVRPGNEECQITDAGEGK--SVESEASGEKSTRKRKVDNQGSGAGNCMKQDQHD 3182 + CQ D G+ K S +++ + +T+ RK + + K + Sbjct: 895 RHQDLPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTASHSKDREGS 954 Query: 3183 DTP 3191 P Sbjct: 955 PRP 957 >ref|XP_004161446.1| PREDICTED: homeobox protein HAT3.1-like [Cucumis sativus] Length = 749 Score = 415 bits (1066), Expect = e-113 Identities = 277/719 (38%), Positives = 370/719 (51%), Gaps = 34/719 (4%) Frame = +3 Query: 1137 ASESVPHEGTSLPSRKQISSLLPPVSN-RVLRSRSQEKPKASESKAVEVENSANEGRKSK 1313 + +S + L S+K+ L VS+ RVLRSR+QEK KA E +A E K K Sbjct: 24 SQQSARKDKIFLKSKKKNYKLRSHVSSDRVLRSRTQEKAKAPERSNDLNNFTAEEDGKRK 83 Query: 1314 QRKGRM---KKIPVNEFSRIRTHLRYLLHRVKYERNLIDAYSCEGWKGQSLEKLKPEKEL 1484 ++K R K V+E+S IR HLRYLL+R++YE++LI+AYS EGWKG S +KLKPEKEL Sbjct: 84 KKKKRNIQGKGARVDEYSSIRNHLRYLLNRIRYEQSLIEAYSSEGWKGFSSDKLKPEKEL 143 Query: 1485 QRARSQIFRYKLKIRDLFQQLDRSLTEGRFPESLFDSEGQIDSEDIFCAKCGSKDLTIEN 1664 QRA ++I R KLKIRDLFQ++D EGR ESLFDSEGQIDSEDIFCAKCGSK+L++EN Sbjct: 144 QRASNEIMRRKLKIRDLFQRIDALCAEGRLSESLFDSEGQIDSEDIFCAKCGSKELSLEN 203 Query: 1665 DIILCDGACERGFHQFCLDPPLLKEDIPPDDEGWLCPGCDCKLDCVGMLNDFQESTLSVT 1844 DIILCDG C+RGFHQFCL+PPLL DIPPDDEGWLCPGCDCK DC+ +LN+FQ S LS+T Sbjct: 204 DIILCDGICDRGFHQFCLEPPLLNTDIPPDDEGWLCPGCDCKDDCLDLLNEFQGSNLSIT 263 Query: 1845 DNWEKVFPEEAAAAAS-------GKTMDEGTXXXXXXXXXXXXXXXXXXXXHEVSRGEST 2003 D WEKV+PE AAAAA G D+ E S +S Sbjct: 264 DGWEKVYPEAAAAAAGRNSDHTLGLPSDDSEDGDYDPDVPDTIDQDNELSSDESSSDQSN 323 Query: 2004 SDGSD-----YFSASDDIVPPLDNKQIFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 2168 SD S+ Y SAS+ + ++ Q Sbjct: 324 SDPSNSDTSGYASASEGLEVSSNDDQYLGLPSDDSEDNDYDPSVPELDEGVRQESSSSDF 383 Query: 2169 XXXXXXLGAILEDGESPNKDEGHFPSVSEDSKPNAVASGEKLKAGKRKGRSLNDELSYLT 2348 L A+ D +KD G S ++ P ++G+ +G K +L++ELS L Sbjct: 384 TSDSEDLAAL--DNNCSSKD-GDLVSSLNNTLPVKNSNGQ--SSGPNKS-ALHNELSSLL 437 Query: 2349 ES-----NTEAVSGKRRGERLDYKKLHDEAYRNATSDSSDEDYTET----------AGAK 2483 +S E VSG+R+ ERLDYKKLHDE Y N +DSSD+ Y T +G + Sbjct: 438 DSGPDKDGLEPVSGRRQVERLDYKKLHDETYGNVPTDSSDDTYGSTLDSSDDRGWDSGTR 497 Query: 2484 RRKSNAGKAIRICTNKIQDRNDRTDTMDGNQNHKENDHSVEGKSHKKLKVXXXXXXXXXX 2663 +R G + ND + +++K G + V Sbjct: 498 KR----GPKTLVLALSNNGSNDDLTNVKTKRSYKRRTRQKPGAINVNNSV------TETP 547 Query: 2664 XXXXXXXXXXXKDAAKQYKRLGEAVVQRLVESFKENQYPKQDVKQSLAKELGLRVQQVGK 2843 K + +RL + ++RL+ SF+EN+YPK+ KQSLA+ELGL ++QV K Sbjct: 548 VDTAKSSSSVKKSTSSSNRRLSQPALERLLASFQENEYPKRATKQSLAQELGLGLKQVSK 607 Query: 2844 WFENARWSFRHSSRMESKMVAAASTNGSSLRTRVHVMSLAAKQSPVLPSPSDS-GMENLI 3020 WFEN RWS RH S K ++S L +S +S +DS G + Sbjct: 608 WFENTRWSTRHPS-SSGKKAKSSSRMSIYLSQASGELSKNEPESATCFRDTDSNGARHQD 666 Query: 3021 SSQVRPGNEECQITDAGEGK--SVESEASGEKSTRKRKVDNQGSGAGNCMKQDQHDDTP 3191 CQ D G+ K S +++ + +T+ RK + + K + P Sbjct: 667 LPMANSVVASCQSGDTGDKKLSSRKTKRADSSATKSRKRKGRSDNTASHSKDREGSPRP 725