BLASTX nr result
ID: Catharanthus22_contig00008367
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00008367 (2309 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004230901.1| PREDICTED: uncharacterized protein LOC101250... 724 0.0 ref|XP_006359608.1| PREDICTED: transcription factor RF2a-like [S... 724 0.0 ref|XP_002269363.1| PREDICTED: uncharacterized protein LOC100255... 693 0.0 ref|XP_006423515.1| hypothetical protein CICLE_v10028062mg [Citr... 669 0.0 ref|XP_006471421.1| PREDICTED: dentin sialophosphoprotein-like [... 668 0.0 gb|EOX97837.1| Basic-leucine zipper transcription factor family ... 667 0.0 ref|XP_002313753.2| hypothetical protein POPTR_0009s12830g [Popu... 641 0.0 ref|XP_002305468.2| hypothetical protein POPTR_0004s17080g [Popu... 638 e-180 ref|XP_006423514.1| hypothetical protein CICLE_v10028062mg [Citr... 632 e-178 ref|XP_004290316.1| PREDICTED: uncharacterized protein LOC101303... 628 e-177 ref|XP_002513009.1| DNA binding protein, putative [Ricinus commu... 609 e-171 ref|XP_006423516.1| hypothetical protein CICLE_v10028062mg [Citr... 588 e-165 gb|EOX97839.1| Basic-leucine zipper transcription factor family ... 581 e-163 gb|EOX97838.1| Basic-leucine zipper transcription factor family ... 581 e-163 ref|XP_004170187.1| PREDICTED: uncharacterized protein LOC101227... 570 e-159 ref|XP_004148549.1| PREDICTED: uncharacterized protein LOC101216... 566 e-158 gb|ESW03736.1| hypothetical protein PHAVU_011G038200g [Phaseolus... 557 e-156 ref|XP_004148547.1| PREDICTED: uncharacterized protein LOC101215... 550 e-154 gb|EXB61817.1| putative transcription factor PosF21 [Morus notab... 541 e-151 ref|XP_006592080.1| PREDICTED: probable transcription factor Pos... 535 e-149 >ref|XP_004230901.1| PREDICTED: uncharacterized protein LOC101250636 [Solanum lycopersicum] Length = 582 Score = 724 bits (1870), Expect = 0.0 Identities = 390/540 (72%), Positives = 430/540 (79%), Gaps = 5/540 (0%) Frame = -3 Query: 2223 MGGDADEASSDMMQRLQSSFGTSSSSVPKQ-PPMSMNQLDLSQLNSSQFRGQMRHFSPNF 2047 M GD DE SDM+QRLQSSFGTSSSS+PKQ P+SMNQLD+ QL +SQFRGQMR FSPNF Sbjct: 1 MAGDNDEGHSDMVQRLQSSFGTSSSSLPKQLQPISMNQLDIPQLTTSQFRGQMRQFSPNF 60 Query: 2046 SSESAKRVGXXXXXXXXXXXXPYSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXX 1867 E++KRVG PYSQIPV+RP NQQMGMQNF + GPSHSR Sbjct: 61 GVENSKRVGIPPSHPQMPPISPYSQIPVTRPGNQQMGMQNFTSA---GPSHSRSLSQPAF 117 Query: 1866 XXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESL 1687 S S+++P+S DVSM DQ+ S SLLPP+PF+R +SSRAGESL Sbjct: 118 FSLDSLPPLSPSPYRESPSTSMSDPISADVSMGDQDGNSHSLLPPTPFSRCNSSRAGESL 177 Query: 1686 PPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAAKPAQLVKRESCW 1516 PPRKAHRRSNSDIPFGFS IMQSS PL+PLRSPGA V +R+N KP QLVKRES W Sbjct: 178 PPRKAHRRSNSDIPFGFSGIMQSSPPLVPLRSPGALERSVPSRDNLGGKPVQLVKRESMW 237 Query: 1515 EKGTETS-AEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSRAS 1339 E+G + + EGMGERKSEGEVVDDLFSAYMNLDNID NSSGTDEK G +NREDLDSRAS Sbjct: 238 ERGNDNNNVEGMGERKSEGEVVDDLFSAYMNLDNIDAFNSSGTDEKLGIENREDLDSRAS 297 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQRSSEKREGIKRNAVGDIAPTSRHYRSVSMDSFMG 1159 GTKTNGGDSSDNEATSSVN+S + S +KREG+KR+AVGDIAPT+RHYRSVSMDSFMG Sbjct: 298 GTKTNGGDSSDNEATSSVNDSSSG---SMQKREGVKRSAVGDIAPTTRHYRSVSMDSFMG 354 Query: 1158 KLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMANEKLAE 979 KLNF ++SPKLPPSPG RPGQLSPTNSLD NSN+FSLEFGNGEFSGAELKKIMANEKLAE Sbjct: 355 KLNFIDDSPKLPPSPGPRPGQLSPTNSLDGNSNSFSLEFGNGEFSGAELKKIMANEKLAE 414 Query: 978 IALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQRDSAG 799 IAL+DPKRAKRILANRQSAARSKERKMRYIAELEHKV QRD+ G Sbjct: 415 IALADPKRAKRILANRQSAARSKERKMRYIAELEHKVQTLQTEATTLSAQLTLLQRDATG 474 Query: 798 LTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKFQQLSMNPQMF 619 LTSQN+ELKFRLQAMEQQAQLRDALNEALTAEVQRLK+AT EL+ DA+KFQQLS+NPQMF Sbjct: 475 LTSQNSELKFRLQAMEQQAQLRDALNEALTAEVQRLKIATAELSADASKFQQLSLNPQMF 534 >ref|XP_006359608.1| PREDICTED: transcription factor RF2a-like [Solanum tuberosum] Length = 577 Score = 724 bits (1869), Expect = 0.0 Identities = 389/540 (72%), Positives = 430/540 (79%), Gaps = 5/540 (0%) Frame = -3 Query: 2223 MGGDADEASSDMMQRLQSSFGTSSSSVPKQ-PPMSMNQLDLSQLNSSQFRGQMRHFSPNF 2047 M GD DE +SDM+QRLQSSFGTSSSS+PKQ P+SMNQLD+ QL +SQFRGQMR FSPNF Sbjct: 1 MAGDNDEGNSDMVQRLQSSFGTSSSSLPKQLQPISMNQLDIPQLTTSQFRGQMRQFSPNF 60 Query: 2046 SSESAKRVGXXXXXXXXXXXXPYSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXX 1867 E++KRVG PYSQIPV+RP NQQMGMQNF + GPSHSR Sbjct: 61 GVENSKRVGIPPSHPQMPPISPYSQIPVTRPGNQQMGMQNFTSA---GPSHSRSLSQPAF 117 Query: 1866 XXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESL 1687 S S+++P+S DVSM DQ+ S SLLPP+PF+R +SSRAGESL Sbjct: 118 FSLDSLPPLSPSPYRESPSTSMSDPISADVSMGDQDGNSHSLLPPTPFSRCNSSRAGESL 177 Query: 1686 PPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAAKPAQLVKRESCW 1516 PPRKAHRRSNSDIPFGFS IMQSS PL+PLRSPGA +R+NS KP QLVKRES W Sbjct: 178 PPRKAHRRSNSDIPFGFSAIMQSSPPLVPLRSPGALERSFPSRDNSGGKPVQLVKRESMW 237 Query: 1515 EKGTE-TSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSRAS 1339 E+G + + EGMGERKSEGEVVDDLFSAYMNLDNID NSSGTDEK G +NREDLDSRAS Sbjct: 238 ERGNDYNNVEGMGERKSEGEVVDDLFSAYMNLDNIDAFNSSGTDEKLGIENREDLDSRAS 297 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQRSSEKREGIKRNAVGDIAPTSRHYRSVSMDSFMG 1159 GTKTNGGDSSDNEATSSVN+S + S +KREG+KR+AV DIAPT+RHYRSVSMDSFMG Sbjct: 298 GTKTNGGDSSDNEATSSVNDSSSG---SMQKREGVKRSAVADIAPTTRHYRSVSMDSFMG 354 Query: 1158 KLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMANEKLAE 979 KLNF ++SPKLPPSPG RPGQLSPTNSLD NSN+FSLEFGNGEFSGAELKKIMANEKLAE Sbjct: 355 KLNFIDDSPKLPPSPGPRPGQLSPTNSLDGNSNSFSLEFGNGEFSGAELKKIMANEKLAE 414 Query: 978 IALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQRDSAG 799 IAL+DPKRAKRILANRQSAARSKERKMRYIAELEHKV QRD+ G Sbjct: 415 IALADPKRAKRILANRQSAARSKERKMRYIAELEHKVQTLQTEATTLSAQLTLLQRDATG 474 Query: 798 LTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKFQQLSMNPQMF 619 LTSQN+ELKFRLQAMEQQAQLRDALNEALTAEVQRLK+AT EL+ DA+KFQQLS+NPQMF Sbjct: 475 LTSQNSELKFRLQAMEQQAQLRDALNEALTAEVQRLKIATAELSADASKFQQLSLNPQMF 534 >ref|XP_002269363.1| PREDICTED: uncharacterized protein LOC100255631 [Vitis vinifera] Length = 589 Score = 693 bits (1788), Expect = 0.0 Identities = 386/552 (69%), Positives = 430/552 (77%), Gaps = 19/552 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLN-SSQFRG-QMRHFSPNFS 2044 GD +EA+ DM+QRLQSSFGTSSSS+ KQP MSMNQLD+ QLN SSQ R MRHFSPNFS Sbjct: 2 GDTEEANIDMIQRLQSSFGTSSSSIQKQP-MSMNQLDIPQLNASSQIRAPMMRHFSPNFS 60 Query: 2043 SESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXX 1867 +S+KR G YSQIPV RP NQQ+ QNF PGPSHSR Sbjct: 61 GDSSKRHGFPPSHPHQIPPISPYSQIPVPRPANQQLVSQNFS----PGPSHSRSLSQPSF 116 Query: 1866 XXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSP--FTRGHSSRAGE 1693 S SI++ VS D+SMED++A S S+LPPSP F+RG+S R GE Sbjct: 117 FSLDSLPPLSPSPYRDSSSTSISDAVSADISMEDRDASSHSVLPPSPSPFSRGNSMRVGE 176 Query: 1692 SLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENS--AAKPAQLVKR 1528 +LPPRKAHRRS+SDIPFGFS+IMQSS PL+PLR GA +S R+N+ AAKP QLVKR Sbjct: 177 NLPPRKAHRRSSSDIPFGFSSIMQSSPPLIPLRGSGALERSMSGRDNNMAAAKPVQLVKR 236 Query: 1527 ESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDS 1348 ES WE+G +++AEGMGERKSEGEVVDDL SAYMNLDNID LNS GT+EK GT+NREDLDS Sbjct: 237 ESSWERGGDSNAEGMGERKSEGEVVDDLLSAYMNLDNIDALNSPGTEEKNGTENREDLDS 296 Query: 1347 RASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHYRS 1183 RASGTKTNGGDSSDNEA SSVNESGNSMQ+ S+EKREG+KR+A GDIAPT+RHYRS Sbjct: 297 RASGTKTNGGDSSDNEAESSVNESGNSMQKLGTSSSAEKREGVKRSAGGDIAPTTRHYRS 356 Query: 1182 VSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKI 1003 VSMDSFMGK+NFG+ESPKL PSPGTRPGQLSP+NS+D NS TFSLEFGNGEFSGAELKKI Sbjct: 357 VSMDSFMGKMNFGDESPKLLPSPGTRPGQLSPSNSMDGNSATFSLEFGNGEFSGAELKKI 416 Query: 1002 MANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXX 823 MANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 417 MANEKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLT 476 Query: 822 XXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKFQ- 646 QRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLAT EL G++ + Sbjct: 477 LLQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATAELGGESQASKC 536 Query: 645 ---QLSMNPQMF 619 QLS+NPQMF Sbjct: 537 LVPQLSVNPQMF 548 >ref|XP_006423515.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] gi|557525449|gb|ESR36755.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] Length = 597 Score = 669 bits (1725), Expect = 0.0 Identities = 377/558 (67%), Positives = 419/558 (75%), Gaps = 23/558 (4%) Frame = -3 Query: 2223 MGGDADEAS---SDMMQRLQSSFGTSSSSVPKQPPM-SMNQLDLSQLNSSQFRGQMRHFS 2056 MGGD+DE + +DMMQR+QSSFGTSSSS+PKQ + S NQLDL QLN +Q R RHFS Sbjct: 1 MGGDSDEGNGGNTDMMQRIQSSFGTSSSSIPKQQTLLSANQLDLPQLNQNQLRA--RHFS 58 Query: 2055 P---NFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSR 1888 NF +S+KRVG YS IPVSRP NQQMG QN+ PGP+HSR Sbjct: 59 QFATNFGGDSSKRVGIPPSHPNQIPPISPYSSIPVSRPGNQQMGSQNYS----PGPTHSR 114 Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGH 1711 + S+++ VSTDVSMED++ S SLLPPSPF RG+ Sbjct: 115 SLSQPSLFFSLDSLPPLSPSPFRDSPSTSMSDQVSTDVSMEDRDGNSHSLLPPSPFNRGN 174 Query: 1710 SSRAGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSAREN-SAAKPA 1543 +SR GESLPPR HRRSNSDIPFGFST+MQSS PL+ R G VS REN AKPA Sbjct: 175 ASRIGESLPPRNKHRRSNSDIPFGFSTVMQSSSPLISPRFAGGLDKAVSGRENPGVAKPA 234 Query: 1542 QLVKRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNR 1363 QLVK+ES WE+G E++ EGMGERKSEGEVVDDLFSAYMNL+NID LNSSGTD+K G +NR Sbjct: 235 QLVKKESSWERGGESNGEGMGERKSEGEVVDDLFSAYMNLENIDALNSSGTDDKNGNENR 294 Query: 1362 EDLDSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTS 1198 EDLDSRASGTKTNGGDSSDNEA SSVNESGNS+QR S+EKREGIKR A GD+A T+ Sbjct: 295 EDLDSRASGTKTNGGDSSDNEAESSVNESGNSLQRAGMNSSAEKREGIKRTAGGDVASTT 354 Query: 1197 RHYRSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGA 1018 RHYRSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGA Sbjct: 355 RHYRSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDANSPAFSLEFGNGEFSGA 414 Query: 1017 ELKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXX 838 ELKKIMANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 415 ELKKIMANEKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTL 474 Query: 837 XXXXXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTEL--NG 664 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEV+RLK+AT E+ Sbjct: 475 SAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVRRLKVATQEMASES 534 Query: 663 DAAK---FQQLSMNPQMF 619 D +K QQL MN QMF Sbjct: 535 DPSKGMANQQLPMNSQMF 552 >ref|XP_006471421.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis] gi|568867906|ref|XP_006487269.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis] gi|568881571|ref|XP_006493636.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis] Length = 597 Score = 668 bits (1724), Expect = 0.0 Identities = 377/558 (67%), Positives = 420/558 (75%), Gaps = 23/558 (4%) Frame = -3 Query: 2223 MGGDADEAS---SDMMQRLQSSFGTSSSSVPKQPPM-SMNQLDLSQLNSSQFRGQMRHFS 2056 MGGD++E + +DMMQR+QSSFGTSSSS+PKQ + S NQLDL QLN +Q R RHFS Sbjct: 1 MGGDSNEGNGGNTDMMQRIQSSFGTSSSSIPKQQTLLSANQLDLPQLNQNQLRA--RHFS 58 Query: 2055 P---NFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSR 1888 NF +S+KRVG YS IPVSRP NQQMG QN+ PGP+HSR Sbjct: 59 QFATNFGGDSSKRVGIPPSHPNQIPPISPYSSIPVSRPGNQQMGSQNYS----PGPTHSR 114 Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGH 1711 + S+++ VSTDVSMED++ S SLLPPSPF RG+ Sbjct: 115 SLSQPSSFFSLDSLPPLSPSPFRDSPSTSMSDQVSTDVSMEDRDGNSHSLLPPSPFNRGN 174 Query: 1710 SSRAGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSA-AKPA 1543 +SR GESLPPR HRRSNSDIPFGFST+MQSS PL+ R G VS RENS AKPA Sbjct: 175 ASRIGESLPPRNKHRRSNSDIPFGFSTVMQSSSPLISPRFAGGLDKAVSGRENSGVAKPA 234 Query: 1542 QLVKRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNR 1363 QLVK+ES WE+G E++ EGMGERKSEGEVVDDLFSAYMNL+NID LNSSGTD+K G +NR Sbjct: 235 QLVKKESSWERGGESNGEGMGERKSEGEVVDDLFSAYMNLENIDALNSSGTDDKNGNENR 294 Query: 1362 EDLDSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTS 1198 EDLDSRASGTKTNGGDSSDNEA SSVNESGNS+QR S+EKREGIKR A GD+A T+ Sbjct: 295 EDLDSRASGTKTNGGDSSDNEAESSVNESGNSLQRAGMNSSAEKREGIKRTAGGDVASTT 354 Query: 1197 RHYRSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGA 1018 RHYRSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGA Sbjct: 355 RHYRSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDANSPAFSLEFGNGEFSGA 414 Query: 1017 ELKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXX 838 ELKKIMANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 415 ELKKIMANEKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTL 474 Query: 837 XXXXXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTEL--NG 664 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEV+RLK+AT E+ Sbjct: 475 SAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVRRLKVATQEMASES 534 Query: 663 DAAK---FQQLSMNPQMF 619 D +K QQL MN QMF Sbjct: 535 DPSKGMANQQLPMNSQMF 552 >gb|EOX97837.1| Basic-leucine zipper transcription factor family protein isoform 1 [Theobroma cacao] Length = 591 Score = 667 bits (1721), Expect = 0.0 Identities = 373/555 (67%), Positives = 421/555 (75%), Gaps = 22/555 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRG--QMRHFSPNFS 2044 GD++E ++D+MQR+QSSFGTSSSS+PKQP +SMNQL++ QLN +Q R HF NF+ Sbjct: 2 GDSEEGNTDVMQRIQSSFGTSSSSIPKQP-LSMNQLEIPQLNPNQIRAPRHFSHFGQNFN 60 Query: 2043 S----ESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXX 1879 + KRVG YSQIPVSR +NQQMG Q+F PGP+HSR Sbjct: 61 GGVGDAANKRVGIPPSHPNQIPPISPYSQIPVSRQMNQQMGSQSFS----PGPTHSRSLS 116 Query: 1878 XXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 S+ ++ + + TDVSMED++A S SLLPPSPF+RG+S R Sbjct: 117 QPSSFFSLDSLPPLSPSPFRDCSSVAVPDQICTDVSMEDRDAASHSLLPPSPFSRGNSPR 176 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA-KPAQLV 1534 GESLPPRK+HRRSNSDIPFGF+TIMQSS PL+PLR G VS +ENS KPAQLV Sbjct: 177 VGESLPPRKSHRRSNSDIPFGFNTIMQSSPPLIPLRGSGGLERSVSGKENSGVPKPAQLV 236 Query: 1533 KRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQ-GTDNRED 1357 K+E+ WE+G + +AEGMGERKSEGEVVDDLFSAYMNLDNID LNSSGTD+K GT+N ED Sbjct: 237 KKETSWERGADGNAEGMGERKSEGEVVDDLFSAYMNLDNIDALNSSGTDDKNNGTENHED 296 Query: 1356 LDSRASGTKTNGGDSSDNEATSSVNESGNSMQR----SSEKREGIKRNAVGDIAPTSRHY 1189 LDSRASGTKTNGGDSSDNEA SSVNESGNS R S++KREGIKR+A GDIAPT RHY Sbjct: 297 LDSRASGTKTNGGDSSDNEAESSVNESGNSALRGGMNSTDKREGIKRSAGGDIAPTGRHY 356 Query: 1188 RSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELK 1009 RSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGAELK Sbjct: 357 RSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDGNSAAFSLEFGNGEFSGAELK 416 Query: 1008 KIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXX 829 KIMANEKLAEIA+SDPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 417 KIMANEKLAEIAMSDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQ 476 Query: 828 XXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDA--- 658 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEV+RLKLAT EL GD+ Sbjct: 477 LTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVRRLKLATQELGGDSDPS 536 Query: 657 --AKFQQLSMNPQMF 619 QQLS+N QMF Sbjct: 537 KGMVSQQLSVNHQMF 551 >ref|XP_002313753.2| hypothetical protein POPTR_0009s12830g [Populus trichocarpa] gi|550331611|gb|EEE87708.2| hypothetical protein POPTR_0009s12830g [Populus trichocarpa] Length = 600 Score = 641 bits (1653), Expect = 0.0 Identities = 367/555 (66%), Positives = 413/555 (74%), Gaps = 22/555 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGT---SSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFS--- 2056 GD ++A+S+M+QRLQSSFGT SS+++ KQP +NQLD+SQLN +Q + + RHF+ Sbjct: 2 GDTEDANSEMIQRLQSSFGTTQSSSATMSKQPFSLINQLDVSQLNLNQTQLRARHFANFY 61 Query: 2055 PNFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXX 1879 NFS +S KRVG YSQIPVSRP NQQM QNF GP+HSR Sbjct: 62 QNFSGDSNKRVGIPPSHPNQIPPISPYSQIPVSRPANQQMSTQNFSM----GPTHSRSLS 117 Query: 1878 XXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 S+ ++++P+STDVSMED++ S SLLPPSPF RG++ R Sbjct: 118 QPSSFFCLDSLPPLSPAPFRDSSSPTVSDPISTDVSMEDKDGSSHSLLPPSPFNRGNAPR 177 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSA-AKPAQLV 1534 GESLPPRKAHRRSNSDIPFG ++Q S PL+PLR G +S REN A AKPAQLV Sbjct: 178 VGESLPPRKAHRRSNSDIPFG--NVLQCSPPLIPLRGSGGLERSLSGRENPAMAKPAQLV 235 Query: 1533 KRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDL 1354 K+E WE+G E+ AEG GERKSEG+V DDLFSAYMNLDNID LNSSGTDEK G +NREDL Sbjct: 236 KKE--WERGGESIAEGTGERKSEGDV-DDLFSAYMNLDNIDALNSSGTDEKNGNENREDL 292 Query: 1353 DSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHY 1189 DSRASGTKTNGGDSSDNEA SSVNESG S+ R S+EKREGIKR+A GDIAPTSRHY Sbjct: 293 DSRASGTKTNGGDSSDNEAESSVNESGGSVPRGGFSSSTEKREGIKRSAGGDIAPTSRHY 352 Query: 1188 RSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELK 1009 RSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSPTNS+D N FSLEFGNGEFSGAELK Sbjct: 353 RSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPTNSMD--GNAFSLEFGNGEFSGAELK 410 Query: 1008 KIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXX 829 KIMANEKLAEIA +DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 411 KIMANEKLAEIASTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQ 470 Query: 828 XXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKF 649 QRDS GLTSQNNELKFRLQAMEQQAQLRDALNEAL EV+RLK+AT E GD+ Sbjct: 471 LTLLQRDSVGLTSQNNELKFRLQAMEQQAQLRDALNEALNGEVRRLKIATAEQGGDSDPS 530 Query: 648 -----QQLSMNPQMF 619 QQLS+NPQMF Sbjct: 531 KGLVQQQLSVNPQMF 545 >ref|XP_002305468.2| hypothetical protein POPTR_0004s17080g [Populus trichocarpa] gi|550341213|gb|EEE85979.2| hypothetical protein POPTR_0004s17080g [Populus trichocarpa] Length = 607 Score = 638 bits (1645), Expect = e-180 Identities = 359/555 (64%), Positives = 415/555 (74%), Gaps = 22/555 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGT---SSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHF---S 2056 GD +EA+S+M+QRLQSSFGT SS+++ KQP +NQ+D+SQL+ + + + RHF S Sbjct: 2 GDTEEANSEMIQRLQSSFGTTQSSSTTMAKQPFSLINQIDVSQLSLNPTQMRARHFTNFS 61 Query: 2055 PNFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXX 1879 NFS +S KRVG YSQIPVSRPVNQQMG Q+F GP+HSR Sbjct: 62 QNFSGDSNKRVGFPPSHPNQIPPISPYSQIPVSRPVNQQMGPQSFSL----GPTHSRSLS 117 Query: 1878 XXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 S+ S+++P+STDV ME+++ GS SLLPPSPF RG++ R Sbjct: 118 QPSSFFSLDSLPPLSPAPFRDSSSPSVSDPISTDVFMEEKDGGSHSLLPPSPFNRGNAPR 177 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSAREN-SAAKPAQLV 1534 ESLPPRKAHRRSNSDIPFG + ++Q S PL+P R +S REN AKPAQ V Sbjct: 178 VVESLPPRKAHRRSNSDIPFGLANVLQCSPPLIPSRGSSGLERSMSGRENLGMAKPAQSV 237 Query: 1533 KRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDL 1354 K+E WE+G +++AEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTD+K G +NREDL Sbjct: 238 KKE--WERGGDSNAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDDKNGNENREDL 295 Query: 1353 DSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHY 1189 DSRASGTKTNGGDSSDNEA SSVNESG ++ R S+EKREGIKR+A DIAPT+RHY Sbjct: 296 DSRASGTKTNGGDSSDNEAESSVNESGGNLPRAGLSSSTEKREGIKRSAGSDIAPTTRHY 355 Query: 1188 RSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELK 1009 RSVSMDSFMGKLNFG ESPKLPPSPGTRPGQLSPT+S+D N FSL+FGNGEFSGAELK Sbjct: 356 RSVSMDSFMGKLNFGNESPKLPPSPGTRPGQLSPTDSID--GNAFSLDFGNGEFSGAELK 413 Query: 1008 KIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXX 829 KIMANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 414 KIMANEKLAEIALADPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQ 473 Query: 828 XXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKF 649 QRDS GLT+QNNELKFR+QAMEQQAQLRDALNEALTAEV+RLK+AT E GD+ Sbjct: 474 LTLLQRDSVGLTNQNNELKFRIQAMEQQAQLRDALNEALTAEVRRLKIATAEQGGDSDPS 533 Query: 648 -----QQLSMNPQMF 619 QQLS+NPQM+ Sbjct: 534 KSMVQQQLSINPQMY 548 >ref|XP_006423514.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] gi|557525448|gb|ESR36754.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] Length = 514 Score = 632 bits (1631), Expect = e-178 Identities = 351/518 (67%), Positives = 390/518 (75%), Gaps = 18/518 (3%) Frame = -3 Query: 2223 MGGDADEAS---SDMMQRLQSSFGTSSSSVPKQPPM-SMNQLDLSQLNSSQFRGQMRHFS 2056 MGGD+DE + +DMMQR+QSSFGTSSSS+PKQ + S NQLDL QLN +Q R RHFS Sbjct: 1 MGGDSDEGNGGNTDMMQRIQSSFGTSSSSIPKQQTLLSANQLDLPQLNQNQLRA--RHFS 58 Query: 2055 P---NFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSR 1888 NF +S+KRVG YS IPVSRP NQQMG QN+ PGP+HSR Sbjct: 59 QFATNFGGDSSKRVGIPPSHPNQIPPISPYSSIPVSRPGNQQMGSQNYS----PGPTHSR 114 Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGH 1711 + S+++ VSTDVSMED++ S SLLPPSPF RG+ Sbjct: 115 SLSQPSLFFSLDSLPPLSPSPFRDSPSTSMSDQVSTDVSMEDRDGNSHSLLPPSPFNRGN 174 Query: 1710 SSRAGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSAREN-SAAKPA 1543 +SR GESLPPR HRRSNSDIPFGFST+MQSS PL+ R G VS REN AKPA Sbjct: 175 ASRIGESLPPRNKHRRSNSDIPFGFSTVMQSSSPLISPRFAGGLDKAVSGRENPGVAKPA 234 Query: 1542 QLVKRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNR 1363 QLVK+ES WE+G E++ EGMGERKSEGEVVDDLFSAYMNL+NID LNSSGTD+K G +NR Sbjct: 235 QLVKKESSWERGGESNGEGMGERKSEGEVVDDLFSAYMNLENIDALNSSGTDDKNGNENR 294 Query: 1362 EDLDSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTS 1198 EDLDSRASGTKTNGGDSSDNEA SSVNESGNS+QR S+EKREGIKR A GD+A T+ Sbjct: 295 EDLDSRASGTKTNGGDSSDNEAESSVNESGNSLQRAGMNSSAEKREGIKRTAGGDVASTT 354 Query: 1197 RHYRSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGA 1018 RHYRSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGA Sbjct: 355 RHYRSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDANSPAFSLEFGNGEFSGA 414 Query: 1017 ELKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXX 838 ELKKIMANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 415 ELKKIMANEKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTL 474 Query: 837 XXXXXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDAL 724 QRDS GLT+QNNELKFRLQAMEQQAQLRD + Sbjct: 475 SAQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDGI 512 >ref|XP_004290316.1| PREDICTED: uncharacterized protein LOC101303379 [Fragaria vesca subsp. vesca] Length = 585 Score = 628 bits (1619), Expect = e-177 Identities = 356/552 (64%), Positives = 409/552 (74%), Gaps = 21/552 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFSPNFSSE 2038 GD +E +SDMMQRLQSSFGTSSSS+ KQP +SM+QL++ Q +SSQ R RHF+ +F+ + Sbjct: 2 GDTEEGNSDMMQRLQSSFGTSSSSILKQP-LSMDQLNIPQFSSSQMRS--RHFAQSFTGD 58 Query: 2037 SAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXXXX 1861 ++KR+G YSQIPV+RP+NQ MG QNF PGPSHSR Sbjct: 59 TSKRMGIPPSHPNHIPPLSPYSQIPVARPINQTMGSQNFS----PGPSHSRSLSQPAFFS 114 Query: 1860 XXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESLPP 1681 S S++E DVSMED++A S SLLPPSPF R + SR GESLPP Sbjct: 115 LDSLPPLSPSPYRDSPSTSMSE---VDVSMEDRDASSHSLLPPSPFGRANFSRVGESLPP 171 Query: 1680 RKAHRRSNSDIPFGFSTIMQSS-QPLMPLRSPGA---PVSARENSA-AKPAQLVKRESCW 1516 RKAHRRSNSDIPFGFST+MQ + P+ P+R G+ +S ENS KPAQLVK+ES W Sbjct: 172 RKAHRRSNSDIPFGFSTMMQQALPPIAPMRGSGSVDLSMSGTENSGMVKPAQLVKKESSW 231 Query: 1515 EKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDS-RAS 1339 E+ + + EG GERKSEGEVVDDLFSAYMNLD+ID LNSSGTD+K G +NRED+DS RAS Sbjct: 232 ERAGDNNVEGTGERKSEGEVVDDLFSAYMNLDSIDALNSSGTDDKNGNENREDMDSSRAS 291 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQRS-----SEKREGIKRNAVGDIAPTSRHYRSVSM 1174 GTKTN DSSDNE SSVNESG MQR + REGIKR+A GDIAPT+RH+RSVSM Sbjct: 292 GTKTNC-DSSDNEVESSVNESGG-MQRPGLNSLTNMREGIKRSAGGDIAPTTRHFRSVSM 349 Query: 1173 DSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMAN 994 DSFMGKL FG+ESPKLPPSPGTRPGQLSP+NS+DTNSN FSLEFGNGEFSGAE+KKIMAN Sbjct: 350 DSFMGKLQFGDESPKLPPSPGTRPGQLSPSNSIDTNSNAFSLEFGNGEFSGAEMKKIMAN 409 Query: 993 EKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQ 814 EKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Q Sbjct: 410 EKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQ 469 Query: 813 RDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGD--------- 661 RDS GL++QNNELKFRLQAMEQQAQLRDALNEALT+EVQRLKLATT+LNG+ Sbjct: 470 RDSVGLSNQNNELKFRLQAMEQQAQLRDALNEALTSEVQRLKLATTDLNGESHPSKSMIN 529 Query: 660 AAKFQQLSMNPQ 625 A +FQ +PQ Sbjct: 530 AQRFQLQQQSPQ 541 >ref|XP_002513009.1| DNA binding protein, putative [Ricinus communis] gi|223548020|gb|EEF49512.1| DNA binding protein, putative [Ricinus communis] Length = 574 Score = 609 bits (1570), Expect = e-171 Identities = 346/549 (63%), Positives = 396/549 (72%), Gaps = 16/549 (2%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQ-MRHFSPNFSS 2041 GD +EA+S+MMQRL SSFGT+ SS +QP SM QL++ LN +Q R + HF+ NFS+ Sbjct: 2 GDTEEANSEMMQRLHSSFGTTQSSSKQQPFSSMTQLEIPHLNQTQNRARHFAHFAQNFST 61 Query: 2040 ESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXXX 1864 +S+KR+G YSQIPVSRP NQQMG QNF PGP+HSR Sbjct: 62 DSSKRIGIPPSHPNQIPPISPYSQIPVSRPGNQQMGSQNFS----PGPTHSRSLSQPSSF 117 Query: 1863 XXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESL 1687 S+ S+ +PVSTDVSME+++A S SLLPPSPF RG++SR ESL Sbjct: 118 FSLDSLPPLSPAPFRDSSSTSVADPVSTDVSMEERDANSHSLLPPSPFNRGNASRVAESL 177 Query: 1686 PPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSA-AKPAQLVKRESC 1519 PPRKAHRRSNSDIPFG S +MQSS PL+PLR G VS +ENS+ AKP QLVK+E Sbjct: 178 PPRKAHRRSNSDIPFGLSYVMQSSPPLIPLRPSGGLERSVSGKENSSVAKPTQLVKKE-- 235 Query: 1518 WEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSRAS 1339 WE+G ++ AEGMGERKSEGEVVDDLFSAYMNLD ID LNSSGTD+K G +NREDLDSRAS Sbjct: 236 WERGNDSIAEGMGERKSEGEVVDDLFSAYMNLDTIDALNSSGTDDKNGNENREDLDSRAS 295 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHYRSVSM 1174 GTKTNGGDSSDNEA SSVNESG+S+ R S+EKREGIKR+A GDIAPT+RHYRSVSM Sbjct: 296 GTKTNGGDSSDNEAESSVNESGSSLLRAGVNSSTEKREGIKRSAGGDIAPTTRHYRSVSM 355 Query: 1173 DSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMAN 994 DSFMGKLNFG+ESPKLPPSPG+RPGQLSP+NS+D N FSL+FGNGEFSGAELKKIMAN Sbjct: 356 DSFMGKLNFGDESPKLPPSPGSRPGQLSPSNSID--GNAFSLDFGNGEFSGAELKKIMAN 413 Query: 993 EKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQ 814 EKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 414 EKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQ----------------- 456 Query: 813 RDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDA----AKFQ 646 T Q Q Q+ LNEALTAEV+RLKLAT EL+GD+ Q Sbjct: 457 ------TLQTEATTLSAQLTLLQSPYLTTLNEALTAEVRRLKLATAELSGDSEPTKGMVQ 510 Query: 645 QLSMNPQMF 619 QLS+NPQMF Sbjct: 511 QLSINPQMF 519 >ref|XP_006423516.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] gi|557525450|gb|ESR36756.1| hypothetical protein CICLE_v10028062mg [Citrus clementina] Length = 483 Score = 588 bits (1517), Expect = e-165 Identities = 324/470 (68%), Positives = 361/470 (76%), Gaps = 18/470 (3%) Frame = -3 Query: 2223 MGGDADEAS---SDMMQRLQSSFGTSSSSVPKQPPM-SMNQLDLSQLNSSQFRGQMRHFS 2056 MGGD+DE + +DMMQR+QSSFGTSSSS+PKQ + S NQLDL QLN +Q R RHFS Sbjct: 1 MGGDSDEGNGGNTDMMQRIQSSFGTSSSSIPKQQTLLSANQLDLPQLNQNQLRA--RHFS 58 Query: 2055 P---NFSSESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSR 1888 NF +S+KRVG YS IPVSRP NQQMG QN+ PGP+HSR Sbjct: 59 QFATNFGGDSSKRVGIPPSHPNQIPPISPYSSIPVSRPGNQQMGSQNYS----PGPTHSR 114 Query: 1887 XXXXXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGH 1711 + S+++ VSTDVSMED++ S SLLPPSPF RG+ Sbjct: 115 SLSQPSLFFSLDSLPPLSPSPFRDSPSTSMSDQVSTDVSMEDRDGNSHSLLPPSPFNRGN 174 Query: 1710 SSRAGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSAREN-SAAKPA 1543 +SR GESLPPR HRRSNSDIPFGFST+MQSS PL+ R G VS REN AKPA Sbjct: 175 ASRIGESLPPRNKHRRSNSDIPFGFSTVMQSSSPLISPRFAGGLDKAVSGRENPGVAKPA 234 Query: 1542 QLVKRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNR 1363 QLVK+ES WE+G E++ EGMGERKSEGEVVDDLFSAYMNL+NID LNSSGTD+K G +NR Sbjct: 235 QLVKKESSWERGGESNGEGMGERKSEGEVVDDLFSAYMNLENIDALNSSGTDDKNGNENR 294 Query: 1362 EDLDSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTS 1198 EDLDSRASGTKTNGGDSSDNEA SSVNESGNS+QR S+EKREGIKR A GD+A T+ Sbjct: 295 EDLDSRASGTKTNGGDSSDNEAESSVNESGNSLQRAGMNSSAEKREGIKRTAGGDVASTT 354 Query: 1197 RHYRSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGA 1018 RHYRSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGA Sbjct: 355 RHYRSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDANSPAFSLEFGNGEFSGA 414 Query: 1017 ELKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKV 868 ELKKIMANEKLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 415 ELKKIMANEKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKV 464 >gb|EOX97839.1| Basic-leucine zipper transcription factor family protein isoform 3 [Theobroma cacao] Length = 516 Score = 581 bits (1497), Expect = e-163 Identities = 318/467 (68%), Positives = 362/467 (77%), Gaps = 17/467 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRG--QMRHFSPNFS 2044 GD++E ++D+MQR+QSSFGTSSSS+PKQP +SMNQL++ QLN +Q R HF NF+ Sbjct: 2 GDSEEGNTDVMQRIQSSFGTSSSSIPKQP-LSMNQLEIPQLNPNQIRAPRHFSHFGQNFN 60 Query: 2043 S----ESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXX 1879 + KRVG YSQIPVSR +NQQMG Q+F PGP+HSR Sbjct: 61 GGVGDAANKRVGIPPSHPNQIPPISPYSQIPVSRQMNQQMGSQSFS----PGPTHSRSLS 116 Query: 1878 XXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 S+ ++ + + TDVSMED++A S SLLPPSPF+RG+S R Sbjct: 117 QPSSFFSLDSLPPLSPSPFRDCSSVAVPDQICTDVSMEDRDAASHSLLPPSPFSRGNSPR 176 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA-KPAQLV 1534 GESLPPRK+HRRSNSDIPFGF+TIMQSS PL+PLR G VS +ENS KPAQLV Sbjct: 177 VGESLPPRKSHRRSNSDIPFGFNTIMQSSPPLIPLRGSGGLERSVSGKENSGVPKPAQLV 236 Query: 1533 KRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQ-GTDNRED 1357 K+E+ WE+G + +AEGMGERKSEGEVVDDLFSAYMNLDNID LNSSGTD+K GT+N ED Sbjct: 237 KKETSWERGADGNAEGMGERKSEGEVVDDLFSAYMNLDNIDALNSSGTDDKNNGTENHED 296 Query: 1356 LDSRASGTKTNGGDSSDNEATSSVNESGNSMQR----SSEKREGIKRNAVGDIAPTSRHY 1189 LDSRASGTKTNGGDSSDNEA SSVNESGNS R S++KREGIKR+A GDIAPT RHY Sbjct: 297 LDSRASGTKTNGGDSSDNEAESSVNESGNSALRGGMNSTDKREGIKRSAGGDIAPTGRHY 356 Query: 1188 RSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELK 1009 RSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGAELK Sbjct: 357 RSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDGNSAAFSLEFGNGEFSGAELK 416 Query: 1008 KIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKV 868 KIMANEKLAEIA+SDPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 417 KIMANEKLAEIAMSDPKRAKRILANRQSAARSKERKMRYISELEHKV 463 >gb|EOX97838.1| Basic-leucine zipper transcription factor family protein isoform 2 [Theobroma cacao] Length = 536 Score = 581 bits (1497), Expect = e-163 Identities = 318/467 (68%), Positives = 362/467 (77%), Gaps = 17/467 (3%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRG--QMRHFSPNFS 2044 GD++E ++D+MQR+QSSFGTSSSS+PKQP +SMNQL++ QLN +Q R HF NF+ Sbjct: 2 GDSEEGNTDVMQRIQSSFGTSSSSIPKQP-LSMNQLEIPQLNPNQIRAPRHFSHFGQNFN 60 Query: 2043 S----ESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXX 1879 + KRVG YSQIPVSR +NQQMG Q+F PGP+HSR Sbjct: 61 GGVGDAANKRVGIPPSHPNQIPPISPYSQIPVSRQMNQQMGSQSFS----PGPTHSRSLS 116 Query: 1878 XXXXXXXXXXXXXXXXXXXXXXSN-SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 S+ ++ + + TDVSMED++A S SLLPPSPF+RG+S R Sbjct: 117 QPSSFFSLDSLPPLSPSPFRDCSSVAVPDQICTDVSMEDRDAASHSLLPPSPFSRGNSPR 176 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA-KPAQLV 1534 GESLPPRK+HRRSNSDIPFGF+TIMQSS PL+PLR G VS +ENS KPAQLV Sbjct: 177 VGESLPPRKSHRRSNSDIPFGFNTIMQSSPPLIPLRGSGGLERSVSGKENSGVPKPAQLV 236 Query: 1533 KRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQ-GTDNRED 1357 K+E+ WE+G + +AEGMGERKSEGEVVDDLFSAYMNLDNID LNSSGTD+K GT+N ED Sbjct: 237 KKETSWERGADGNAEGMGERKSEGEVVDDLFSAYMNLDNIDALNSSGTDDKNNGTENHED 296 Query: 1356 LDSRASGTKTNGGDSSDNEATSSVNESGNSMQR----SSEKREGIKRNAVGDIAPTSRHY 1189 LDSRASGTKTNGGDSSDNEA SSVNESGNS R S++KREGIKR+A GDIAPT RHY Sbjct: 297 LDSRASGTKTNGGDSSDNEAESSVNESGNSALRGGMNSTDKREGIKRSAGGDIAPTGRHY 356 Query: 1188 RSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELK 1009 RSVSMDSFMGKLNFG+ESPKLPPSPGTRPGQLSP+NS+D NS FSLEFGNGEFSGAELK Sbjct: 357 RSVSMDSFMGKLNFGDESPKLPPSPGTRPGQLSPSNSIDGNSAAFSLEFGNGEFSGAELK 416 Query: 1008 KIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKV 868 KIMANEKLAEIA+SDPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 417 KIMANEKLAEIAMSDPKRAKRILANRQSAARSKERKMRYISELEHKV 463 >ref|XP_004170187.1| PREDICTED: uncharacterized protein LOC101227308 [Cucumis sativus] Length = 566 Score = 570 bits (1468), Expect = e-159 Identities = 326/535 (60%), Positives = 380/535 (71%), Gaps = 13/535 (2%) Frame = -3 Query: 2223 MGGDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFSPNFS 2044 MG D + +M LQ S+G SSSS P SM+QL +SQ+NSSQ R Q HF NF Sbjct: 1 MGDTEDVNTENMRNHLQCSYGVSSSSAGNLP-FSMDQLKISQMNSSQIRPQ--HFHSNFL 57 Query: 2043 SESAKRVGXXXXXXXXXXXXP--YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXX 1870 ++++R+G YSQIP+SRP+NQQ N+ P P+HSR Sbjct: 58 GDNSRRIGIPPSPNSPQIPPISPYSQIPISRPMNQQ----NYN----PVPTHSRSLSQPS 109 Query: 1869 XXXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGES 1690 + S ++ VS D SMED++ S SLLPPSP+ R +SS+ G+S Sbjct: 110 FFSLDSLPPLSPSPFRESPTTSNSDQVSADTSMEDRDNSSHSLLPPSPYMRVNSSKMGDS 169 Query: 1689 LPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA-KPA-QLVKRE 1525 LPPRKAHRRSNSDIPFG S+++Q S PL+P S G S++EN+ KP+ Q VKRE Sbjct: 170 LPPRKAHRRSNSDIPFGLSSMIQPS-PLLPFNSSGGLERSTSSKENAGLLKPSSQFVKRE 228 Query: 1524 SCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSR 1345 EK + + EGMGERKS+G+ VDDLFSAYMNLD+ID+ NSSGT++K G +NREDLDSR Sbjct: 229 HSLEKSVDNNLEGMGERKSDGDSVDDLFSAYMNLDHIDLFNSSGTNDKNGHENREDLDSR 288 Query: 1344 ASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHYRSV 1180 SGTKTNGG+SSDNEA SSVNESG+S Q S+EKREGIKR A GDIAPT+RHYRSV Sbjct: 289 GSGTKTNGGESSDNEAESSVNESGDSAQMPGLNSSAEKREGIKRTAGGDIAPTTRHYRSV 348 Query: 1179 SMDSFMGKLNFGEESPKLPPSP-GTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKI 1003 SMDSFMGKL FG+ESPK+PP+P G RPGQLS N +D NS FSLEFGNGEFSGAELKKI Sbjct: 349 SMDSFMGKLQFGDESPKMPPTPPGVRPGQLSSNNLVDGNSAPFSLEFGNGEFSGAELKKI 408 Query: 1002 MANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXX 823 MAN+KLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 409 MANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLT 468 Query: 822 XXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDA 658 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELN + Sbjct: 469 LLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNAQS 523 >ref|XP_004148549.1| PREDICTED: uncharacterized protein LOC101216189 [Cucumis sativus] Length = 571 Score = 566 bits (1459), Expect = e-158 Identities = 325/539 (60%), Positives = 382/539 (70%), Gaps = 19/539 (3%) Frame = -3 Query: 2217 GDADEASSD------MMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFS 2056 GD ++ +++ M LQ S+G SSSS P SM+QL +SQ+NSSQ R Q HF Sbjct: 2 GDTEDVNTENVNTENMRNHLQCSYGVSSSSAGNLP-FSMDQLKISQMNSSQIRPQ--HFH 58 Query: 2055 PNFSSESAKRVGXXXXXXXXXXXXP--YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXX 1882 NF ++++R+G YSQIP+SRP+NQQ N+ P P+HSR Sbjct: 59 SNFLGDNSRRIGIPPSPNSPQIPPISPYSQIPISRPMNQQ----NYN----PVPTHSRSL 110 Query: 1881 XXXXXXXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSR 1702 + S ++ VS D SMED++ S SLLPPSP+ R +SS+ Sbjct: 111 SQPSFFSLDSLPPLSPSPFRESPTTSNSDQVSADTSMEDRDNSSHSLLPPSPYMRVNSSK 170 Query: 1701 AGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA-KPA-QL 1537 G+SLPPRKAHRRSNSDIPFG S+++Q S PL+P S G S++EN+ KP+ Q Sbjct: 171 MGDSLPPRKAHRRSNSDIPFGLSSMIQPS-PLLPFNSSGGLERSTSSKENAGLLKPSSQF 229 Query: 1536 VKRESCWEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNRED 1357 VKRE EK + + EGMGERKS+G+ VDDLFSAYMNLD+ID+ NSSGT++K G +NRED Sbjct: 230 VKREHSLEKSVDNNLEGMGERKSDGDSVDDLFSAYMNLDHIDLFNSSGTNDKNGHENRED 289 Query: 1356 LDSRASGTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRH 1192 LDSR SGTKTNGG+SSDNEA SSVNESG+S Q S+EKREGIKR A GDIAPT+RH Sbjct: 290 LDSRGSGTKTNGGESSDNEAESSVNESGDSAQMPGLNSSAEKREGIKRTAGGDIAPTTRH 349 Query: 1191 YRSVSMDSFMGKLNFGEESPKLPPSP-GTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAE 1015 YRSVSMDSFMGKL FG+ESPK+PP+P G RPGQLS N +D NS FSLEFGNGEFSGAE Sbjct: 350 YRSVSMDSFMGKLQFGDESPKMPPTPPGVRPGQLSSNNLVDGNSAPFSLEFGNGEFSGAE 409 Query: 1014 LKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXX 835 LKKIMAN+KLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 410 LKKIMANDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLS 469 Query: 834 XXXXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDA 658 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELN + Sbjct: 470 AQLTLLQRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNAQS 528 >gb|ESW03736.1| hypothetical protein PHAVU_011G038200g [Phaseolus vulgaris] Length = 561 Score = 557 bits (1435), Expect = e-156 Identities = 328/546 (60%), Positives = 380/546 (69%), Gaps = 13/546 (2%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFSPNF--S 2044 G+ +EA++DMMQRL S S KQP ++M QL + Q N SQ R + +H + Sbjct: 2 GENEEATNDMMQRLHCS------SFLKQP-LNMEQLSIPQFNPSQMRARHQHQHQHQFDG 54 Query: 2043 SESAKRVGXXXXXXXXXXXXP-YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXX 1867 S KR G YSQIPV+R QQMG N P P+H+R Sbjct: 55 GNSNKRAGIPPSHPHQIPPISPYSQIPVTR--QQQMGSHNIS----PTPTHTRSLSQPSF 108 Query: 1866 XXXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTR--GHSSRAGE 1693 S S++E + DVSMED++ S SLLPPSPF R S+ + Sbjct: 109 FSLDSLPPLSPSPFRDSSSTSVSE--AADVSMEDRDVTSHSLLPPSPFARTLNTSTNSNL 166 Query: 1692 SLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGAPVSARENSAAKPAQLVKRESCWE 1513 LPPRKAHRRSNSDIPFGFST++QSS PL+PLR PV AKPAQLVKRE+ W+ Sbjct: 167 PLPPRKAHRRSNSDIPFGFSTVLQSSPPLIPLRGRENPV------LAKPAQLVKRETPWD 220 Query: 1512 KGTETS-AEGMGERKS-EGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSRAS 1339 +G + S EG GE+KS EGEVVDDLFSAYMNLD+ D LNSSGTD+K GT+NR+DLDSRAS Sbjct: 221 RGVDHSNVEGSGEKKSPEGEVVDDLFSAYMNLDSFDALNSSGTDDKNGTENRDDLDSRAS 280 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQRS--SEKREGIKRNAVGDIAPTSRHYRSVSMDSF 1165 GTKTNGGDSSDNEA SSVNESG+ R SEKREG+KR+A G+IAPT+RHYRSVSMDSF Sbjct: 281 GTKTNGGDSSDNEAESSVNESGDGAVRQGGSEKREGMKRSAGGEIAPTTRHYRSVSMDSF 340 Query: 1164 MGKLNFGEESPKLPPSPGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMANEKL 985 + KLNFG+ESPKLPPSPG R G +SP +D NS+ FSLEFGNGEFSG ELKKIMANEKL Sbjct: 341 ISKLNFGDESPKLPPSPGPRTGLISPAGGVDGNSSAFSLEFGNGEFSGPELKKIMANEKL 400 Query: 984 AEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQRDS 805 AEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV QRDS Sbjct: 401 AEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDS 460 Query: 804 AGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGD----AAKFQQLS 637 AGLT+QN+ELKFRLQ+MEQQA+LRDALNEALTAEVQRLK+AT EL+GD + Q S Sbjct: 461 AGLTNQNSELKFRLQSMEQQAKLRDALNEALTAEVQRLKIATAELSGDSHGSSCLIPQHS 520 Query: 636 MNPQMF 619 +NP MF Sbjct: 521 VNPLMF 526 >ref|XP_004148547.1| PREDICTED: uncharacterized protein LOC101215703 [Cucumis sativus] Length = 563 Score = 550 bits (1418), Expect = e-154 Identities = 314/533 (58%), Positives = 374/533 (70%), Gaps = 13/533 (2%) Frame = -3 Query: 2217 GDADEASSDMMQRLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFSPNFSSE 2038 GD ++A +D ++ LQ SFGTSSSS K SM+QL +SQ+ SQ G+ +HF NF + Sbjct: 2 GDTEDARTDNLRNLQCSFGTSSSSALKHH-FSMDQLKISQMTCSQ--GRPQHFQSNFLGD 58 Query: 2037 SAKRVGXXXXXXXXXXXXP--YSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXXX 1864 + +R+G YSQIPVSRP+NQ + P+HSR Sbjct: 59 NNRRIGIPPCPNSPQVPPISPYSQIPVSRPMNQH--------SYNSVPTHSRSLSQPSFF 110 Query: 1863 XXXXXXXXXXXXXXXXXSNSINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESLP 1684 S S ++ VS D SMED++A S SLLPPSP+TR +SS+ ++LP Sbjct: 111 SLDSLPPLSPSPFRDSPSTSNSDQVSADTSMEDRDASSHSLLPPSPYTRANSSKMSDALP 170 Query: 1683 PRKAHRRSNSDIPFGFSTIMQSSQPLMPLRSPGA---PVSARENSAA--KPAQLVKRESC 1519 PRKAHRRSNSDIPFG S+++QS P++P G S++EN+ + +Q VKRE Sbjct: 171 PRKAHRRSNSDIPFGLSSMIQSP-PVLPFSGSGGLERSTSSKENAGIFKQASQFVKREPS 229 Query: 1518 WEKGTETSAEGMGERKSEGEVVDDLFSAYMNLDNIDVLNSSGTDEKQGTDNREDLDSRAS 1339 EK + EGMGE+KSEG+ VDDLFSAYMNLDNID+ NSS T++K G +NREDLDSR S Sbjct: 230 LEKSIDNHMEGMGEKKSEGDTVDDLFSAYMNLDNIDLFNSSVTNDKNGHENREDLDSRGS 289 Query: 1338 GTKTNGGDSSDNEATSSVNESGNSMQR-----SSEKREGIKRNAVGDIAPTSRHYRSVSM 1174 GTKT GG+SSDNEA SSVNESG++ Q S+EKREGIKR A GDIAP +RHYRS+SM Sbjct: 290 GTKT-GGESSDNEAESSVNESGDNSQMPGLNSSAEKREGIKRTAGGDIAPNNRHYRSISM 348 Query: 1173 DSFMGKLNFGEESPKLPPSP-GTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMA 997 DSFMGKL FG+ESPK+PP+P G RPGQLS N +D NS FSLEFGNGEFSGAELKKIMA Sbjct: 349 DSFMGKLQFGDESPKMPPTPPGIRPGQLSSNNLVDGNSTPFSLEFGNGEFSGAELKKIMA 408 Query: 996 NEKLAEIALSDPKRAKRILANRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXX 817 N+KLAEIAL+DPKRAKRILANRQSAARSKERKMRYI+ELEHKV Sbjct: 409 NDKLAEIALTDPKRAKRILANRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLL 468 Query: 816 QRDSAGLTSQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTELNGDA 658 QRDS GLT+QNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATT++N + Sbjct: 469 QRDSVGLTNQNNELKFRLQAMEQQAQLRDALNEALTAEVQRLKLATTDINAQS 521 >gb|EXB61817.1| putative transcription factor PosF21 [Morus notabilis] Length = 529 Score = 541 bits (1394), Expect = e-151 Identities = 322/530 (60%), Positives = 366/530 (69%), Gaps = 9/530 (1%) Frame = -3 Query: 2181 RLQSSFGTSSSSVPKQPPMSMNQLDLSQLNSSQFRGQMRHFSPNFSSESAKRVGXXXXXX 2002 RL SSFGTSSSSV KQP ++M+QL++ QLN SQFR RH P+ + Sbjct: 10 RLHSSFGTSSSSVLKQP-LAMDQLNIPQLNPSQFRP--RHLPPSHPHQ------------ 54 Query: 2001 XXXXXXPYSQIPVSRPVNQQMGMQNFGATPGPGPSHSRXXXXXXXXXXXXXXXXXXXXXX 1822 PYSQIPVSRP N MG Q+F P HSR Sbjct: 55 -IPPISPYSQIPVSRPANPHMGSQSFSLGPSQAHHHSRSLSQPSFFSLDSLPPLSPSPFR 113 Query: 1821 XXXSN---SINEPVSTDVSMEDQNAGSQSLLPPSPFTRGHSSRAGESLPPRKAHRRSNSD 1651 + S+++ ++TD+SMED++ S S+LPPSPF++G G SLPPRKAHRRSNSD Sbjct: 114 GDSPSSTTSVSDQITTDISMEDRDMNSHSMLPPSPFSKG-----GGSLPPRKAHRRSNSD 168 Query: 1650 IPFGFSTIMQSSQPLMPLRSPGAPVSARENSAAKPAQLVKRESCWEKGTETSAEGMGERK 1471 +PFGF+ S P L+ PG + AQLVK+ESC E EGMGERK Sbjct: 169 VPFGFA----SFNPFAGLK-PG----------SGNAQLVKKESCCE-------EGMGERK 206 Query: 1470 SEGEVVDDLFSAYMNLDNIDVLNSSGTDEK--QGTDNREDLDSRASGTKTNGGDSSDNEA 1297 SEGEVVDDLFSAYMNLD I+ LNSSGTDEK G +NREDLDSRASGTKTNG DSSDNEA Sbjct: 207 SEGEVVDDLFSAYMNLDGIEALNSSGTDEKNGNGNENREDLDSRASGTKTNGADSSDNEA 266 Query: 1296 TSSVNESGNSMQRSSEKREGIKRNAVGDIAPTSRHYRSVSMDSFMGKLNFGEESPKLPPS 1117 SS+N SEK+EG+KR+A DIAPT+RHYRSVSMDSFMGKLNFG+ESPK+P S Sbjct: 267 ESSMNS-------LSEKKEGMKRSAGTDIAPTTRHYRSVSMDSFMGKLNFGDESPKVPLS 319 Query: 1116 PGTRPGQLSPTNSLDTNSNTFSLEFGNGEFSGAELKKIMANEKLAEIALSDPKRAKRILA 937 PG RPGQ SP+NS+D N FSLEFGNGEFSGAELKKIMAN+KLAEIAL+DPKRAKRILA Sbjct: 320 PGNRPGQHSPSNSIDGN---FSLEFGNGEFSGAELKKIMANDKLAEIALTDPKRAKRILA 376 Query: 936 NRQSAARSKERKMRYIAELEHKVXXXXXXXXXXXXXXXXXQRDSAGLTSQNNELKFRLQA 757 NRQSAARSKERKMRYI+ELEHKV QRDS GLT+QNNELKFRLQA Sbjct: 377 NRQSAARSKERKMRYISELEHKVQTLQTEATTLSAQLTLLQRDSVGLTNQNNELKFRLQA 436 Query: 756 MEQQAQLRDALNEALTAEVQRLKLATTELNGDAAKFQ----QLSMNPQMF 619 MEQQAQLRDALNEALTAEVQRLK+AT EL+GD+ + QLSMNPQMF Sbjct: 437 MEQQAQLRDALNEALTAEVQRLKIATAELSGDSLSSKCMVPQLSMNPQMF 486 >ref|XP_006592080.1| PREDICTED: probable transcription factor PosF21 [Glycine max] Length = 548 Score = 535 bits (1377), Expect = e-149 Identities = 310/517 (59%), Positives = 356/517 (68%), Gaps = 13/517 (2%) Frame = -3 Query: 2130 PMSMNQLDLSQLNSSQFRGQMRHFSPNFSS-ESAKRVGXXXXXXXXXXXXP-YSQIPVSR 1957 P++M QL + Q N SQ R + H F S KR G YSQIPVSR Sbjct: 17 PLNMEQLSIPQFNPSQMRARQHHHQNQFDGGNSNKRAGIPPSHPHPIPPISPYSQIPVSR 76 Query: 1956 PVNQQMGMQNFGATPGPGPSHSRXXXXXXXXXXXXXXXXXXXXXXXXXSNSINEPVSTDV 1777 N P P+H+R S S++E + DV Sbjct: 77 QHNNI----------SPTPTHTRSLSQPSFFSLDSLPPLSPSPFRDSSSTSVSE--AADV 124 Query: 1776 SMEDQNAGSQSLLPPSPFTRGHSSRAGESLPPRKAHRRSNSDIPFGFSTIMQSSQPLMPL 1597 SMED++ S SLLPPSPF+R ++ + LPPRKAHRRSNSDIPFGFST++QSS PL+PL Sbjct: 125 SMEDRDVSSHSLLPPSPFSRTLNN-SNLPLPPRKAHRRSNSDIPFGFSTVLQSSPPLIPL 183 Query: 1596 RSPGAPVSARENSAAKPAQLVKRESCWEKGTETS----AEGMGERKS-EGEVVDDLFSAY 1432 R+P +AKPAQLVKRE+ W++G E + EG GE+KS EGEVVDDLFSAY Sbjct: 184 RNP---------VSAKPAQLVKRETPWDRGVENNNNNNVEGSGEKKSPEGEVVDDLFSAY 234 Query: 1431 MNLDNIDVLNSSGTDEKQGTDNREDLDSRASGTKTNGGDSSDNEATSSVNESGNS--MQR 1258 MNLD+ D LNSSGTD+K G +NR+DLDSRASGTKTNGGDSSDNEA SSVNESG+ Q Sbjct: 235 MNLDSFDALNSSGTDDKNGGENRDDLDSRASGTKTNGGDSSDNEAESSVNESGDGGVRQG 294 Query: 1257 SSEKREGIKRNAVGDIAPTSRHYRSVSMDSFMGKLNFGEESPKLPPSPGTRPGQLSPTNS 1078 +EKREG+KR+A G+IAPT+RHYRSVSMDSF+GKLNF EESPKLPPSPG R +SP Sbjct: 295 GNEKREGMKRSAGGEIAPTTRHYRSVSMDSFIGKLNFDEESPKLPPSPGQRSALMSPAGG 354 Query: 1077 LDTNSNTFSLEFGNGEFSGAELKKIMANEKLAEIALSDPKRAKRILANRQSAARSKERKM 898 +D NS FSLEFGNGEFSG ELKKIMANEKLAEIAL DPKRAKRILANRQSAARSKERKM Sbjct: 355 IDGNSAAFSLEFGNGEFSGPELKKIMANEKLAEIALIDPKRAKRILANRQSAARSKERKM 414 Query: 897 RYIAELEHKVXXXXXXXXXXXXXXXXXQRDSAGLTSQNNELKFRLQAMEQQAQLRDALNE 718 RYI+ELEHKV QRDSAGLT+QN+ELKFRLQ+MEQQA+LRDALNE Sbjct: 415 RYISELEHKVQTLQTEATTLSAQLTLLQRDSAGLTNQNSELKFRLQSMEQQAKLRDALNE 474 Query: 717 ALTAEVQRLKLATTELNGDA----AKFQQLSMNPQMF 619 ALTAEVQRLKLAT EL+GD+ Q S+NP MF Sbjct: 475 ALTAEVQRLKLATAELSGDSHGSGCLIPQHSVNPLMF 511