BLASTX nr result
ID: Rehmannia23_contig00000392
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia23_contig00000392 (2749 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002271475.2| PREDICTED: uncharacterized protein LOC100249... 501 e-139 ref|XP_006341000.1| PREDICTED: transcription factor EMB1444-like... 471 e-130 ref|NP_001234845.1| Prf interactor 30137 [Solanum lycopersicum] ... 464 e-128 ref|XP_006493563.1| PREDICTED: transcription factor EMB1444-like... 444 e-122 ref|XP_006429166.1| hypothetical protein CICLE_v10011164mg [Citr... 443 e-121 ref|XP_002309084.1| hypothetical protein POPTR_0006s09100g [Popu... 424 e-115 gb|EOY07431.1| Basic helix-loop-helix DNA-binding superfamily pr... 422 e-115 ref|XP_004302716.1| PREDICTED: transcription factor EMB1444-like... 416 e-113 ref|XP_002533696.1| basic helix-loop-helix-containing protein, p... 391 e-106 gb|EOY07437.1| Basic helix-loop-helix DNA-binding superfamily pr... 383 e-103 gb|EOY07432.1| Basic helix-loop-helix DNA-binding superfamily pr... 378 e-102 gb|EXB36735.1| hypothetical protein L484_016987 [Morus notabilis] 373 e-100 ref|XP_006846364.1| hypothetical protein AMTR_s00012p00261730 [A... 318 8e-84 gb|ESW12469.1| hypothetical protein PHAVU_008G115700g [Phaseolus... 308 6e-81 ref|XP_003551499.1| PREDICTED: transcription factor LHW-like [Gl... 293 3e-76 ref|XP_004137928.1| PREDICTED: uncharacterized protein LOC101203... 286 4e-74 gb|EOY07438.1| Basic helix-loop-helix DNA-binding superfamily pr... 266 5e-68 gb|EMJ06475.1| hypothetical protein PRUPE_ppa006504mg [Prunus pe... 264 2e-67 gb|EPS66447.1| prf interactor 30137, partial [Genlisea aurea] 206 6e-50 ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citr... 201 1e-48 >ref|XP_002271475.2| PREDICTED: uncharacterized protein LOC100249509 [Vitis vinifera] gi|297740322|emb|CBI30504.3| unnamed protein product [Vitis vinifera] Length = 720 Score = 501 bits (1289), Expect = e-139 Identities = 303/733 (41%), Positives = 421/733 (57%), Gaps = 15/733 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 M TS LR L+S C NS W YAVFW+LKHQ+ M+L WEDG+CD R+PV S +D+YL Sbjct: 1 METSALRQLLKSFCNNSHWKYAVFWRLKHQNPMLLTWEDGYCDYPNPREPVESISDDIYL 60 Query: 631 GNSDKILSTFSSSVLDGTPGEC--PVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSD 804 N++ I S + +DG G PV LAVA M A TGN W+++D Sbjct: 61 NNANDISSL--NCEIDGFNGSYGYPVELAVANMSCLQYAFGEGVVGEVAKTGNHCWVFTD 118 Query: 805 NIETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVK 984 +I FN+ LVPE PDEWLLQF AGIKT+LL+PVIPHGVLQLGS+E VAEN A+VA +K Sbjct: 119 DIFASRFNSKLVPECPDEWLLQFVAGIKTVLLVPVIPHGVLQLGSLEKVAENVAVVACIK 178 Query: 985 DKFEAHKKPDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRTKDYNLI 1164 D F+ + G+ + F+ N + + + + ED +V+ K+ L+ Sbjct: 179 DSFDTLQNEVGFSV-----------PFISNW---NCLLHKVLYEDSEVVDSVKPKNSKLL 224 Query: 1165 A-NQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYEDNQSFIS 1341 + NQ +P+F VQD A + E+ ++ + S + + + ++Q + Sbjct: 225 STNQAIPLFTVQDAFQAFGEDLPLIHESESKKEISVFSVGLNEVSTLKGQCINNSQWGVI 284 Query: 1342 KNDISNYYH-EETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDSGICE 1518 ++++S + EE S ++N++ + E M + G +P+ +D + Sbjct: 285 ESNLSRFSCLEEELHAVSQYNNYNLEVLEESSEGIMNSYCAGGLIEPSVGDKDANDTGHR 344 Query: 1519 SGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTS---------ISGHDLACHSSGDRETS 1671 S +FFSFP D EL+K LG +++ T Y G+S I D+ + S Sbjct: 345 STDSFFSFPLDCELHKALGLAMQRQTSDYIRGSSEDASSTAKPICNRDIVDVIEPLTQES 404 Query: 1672 YSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKH--DQSKQS 1845 AN S DD SS++S S+ +L ST H +QS+ S Sbjct: 405 SGYFAKGGDAVNLLEDVVANIHSGSDDTSSHRSNSVKSSTTLSGQF-STSSHVGNQSEGS 463 Query: 1846 APVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKL 2025 A V+ + W+ + EF A +S + L+D+++++KGY +P K Sbjct: 464 ALVQDDSLLWSHVKPEFVASRGNAFTNSSISSSSFKSTMTTLADEEQQKKGYGCLQPRKG 523 Query: 2026 SRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLFL 2205 S+LS NK+RA G+NQ+P+PRDRQ+IQDR+KELRELVPN KCSIDGLLD+TI HMLFL Sbjct: 524 SKLSNANKKRASPGNNQRPRPRDRQMIQDRVKELRELVPNGAKCSIDGLLDRTIKHMLFL 583 Query: 2206 SNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNHP 2385 N T +A KL+ +V ++ +K+ R +E HQNGTSWA ELGSE ++CPIVV+DL P Sbjct: 584 RNSTDQAAKLKQRVHQEVASQKSWRSSENKCSHQNGTSWAFELGSELKVCPIVVEDLECP 643 Query: 2386 GHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIFW 2565 GHMLIEMLC +HG FLEIA VI L+LTIL+GVME +DN WA F+VE S FHR+DIFW Sbjct: 644 GHMLIEMLCNEHGLFLEIAQVIRGLELTILKGVMESRSDNMWAHFIVEVSRGFHRMDIFW 703 Query: 2566 PLMQLLQQSRAPI 2604 PLMQLLQQ++ I Sbjct: 704 PLMQLLQQNQNTI 716 >ref|XP_006341000.1| PREDICTED: transcription factor EMB1444-like [Solanum tuberosum] Length = 744 Score = 471 bits (1212), Expect = e-130 Identities = 304/753 (40%), Positives = 412/753 (54%), Gaps = 32/753 (4%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 M + LR FL+SLC SPWNYAVFWKL+HQ +L WEDG+ DI R+P S I + Y Sbjct: 1 MSAASLRHFLESLCFKSPWNYAVFWKLQHQCPTILTWEDGYLDIPGAREPYRSQIGNYYS 60 Query: 631 GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDNI 810 +++ S +G G P+ LA+AEM A +G WI SD++ Sbjct: 61 KYLNELSPNCGSRSHNGYLGAHPIDLAMAEMSSTYHIAGKGVVGEVASSGIPRWISSDSL 120 Query: 811 ETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDK 990 V E PD+W+LQF GIKTILL+P IP+GVLQLGSVETVAEN +V + ++ Sbjct: 121 APAELGFDSVAECPDKWMLQFVTGIKTILLVPCIPYGVLQLGSVETVAENMEIVTNLAEE 180 Query: 991 FEAHKK------PDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRTKD 1152 F+AH K P G ++F L ST E L PS TT KVNED A K+ Sbjct: 181 FDAHYKFVESFLPGGRS-----REFLLQSTLSETLNIPSATTTNKVNEDDVAADIPILKE 235 Query: 1153 YNLIAN-QMMPVFMVQ-------------------DFANASVRCVADTSENVTENDFSRQ 1272 + L A M + VQ + V + + EN + + Q Sbjct: 236 HKLSAAFPMTSLIEVQHPFQLSGQHMQNILEDENESITSKFVEHLPNVLENANGREIAMQ 295 Query: 1273 PLSMIHLDEPFQLSY-EDNQSFISKNDIS-NYYHEETSSPSSYFDNFSRRMYGEFMNESM 1446 + MI+L + Y +DN+S I+++ + H + SY S + G ++ + Sbjct: 296 HVDMINLVKHLAHEYSDDNRSGITESSFGRSTCHTKDIDAFSYS---SCNVGGVGVSNEV 352 Query: 1447 GFHFEDGATDPTFLGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNT--YPYTYGTS 1620 F+F+ DP LG D I + N FS P + EL + G+++ + + S Sbjct: 353 DFYFDGDMLDPRSLGMDCSDTILGNVSNSFSCPTECELYEAFGSTIHNLSGFSANIASKS 412 Query: 1621 ISGHDLACHSSGDRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNK-SGITSSNVSL 1797 I D + S + A+A DD S +K +G+ S N+S Sbjct: 413 IYTEDCMFNIEPSFGQSNGWNLKEDNTENLLEAVVASACCFSDDYSLHKVAGLESLNMSS 472 Query: 1798 REHLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSD 1977 + + S K+ +QS +S V V + LTS + + A ++ Sbjct: 473 GKPVPSRKRQNQSAESDSV-GEAVTRSTLTSASAGVDKYASTNCLHSASSFDCVASAFNE 531 Query: 1978 KQKKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKC 2157 +Q +RK + S K S++S TNKRR +GD+ KP+PRDRQLIQDRLKELR+LVP+ KC Sbjct: 532 EQHQRKVFSSLSCHKESKVSNTNKRRRWSGDSHKPRPRDRQLIQDRLKELRQLVPSGAKC 591 Query: 2158 SIDGLLDKTINHMLFLSNVTKRADKLRDQV-LEKDTDEKTRRPAEVNHGHQNGTSWAVEL 2334 SID LLDKTI HMLFL +VT +ADKL+ Q +E D D+ + P +V +Q GTSWA+EL Sbjct: 592 SIDSLLDKTIKHMLFLRSVTNQADKLKFQSQIEVDPDKSLQSP-QVKSSNQQGTSWALEL 650 Query: 2335 GSEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWA 2514 GS Q+CPI+VKDL +PGHMLIEM+C DHGRFLEI+DVIHRL+LTIL+GVMEK ++++WA Sbjct: 651 GSADQICPIIVKDLEYPGHMLIEMMCDDHGRFLEISDVIHRLELTILKGVMEKRSESTWA 710 Query: 2515 SFVVETSGSFHRLDIFWPLMQLLQQSRAPI*SN 2613 F+VE SGSFHRLDIFWPLMQLLQQ + I N Sbjct: 711 HFIVEASGSFHRLDIFWPLMQLLQQVPSSISRN 743 >ref|NP_001234845.1| Prf interactor 30137 [Solanum lycopersicum] gi|56157408|gb|AAV80420.1| Prf interactor 30137 [Solanum lycopersicum] Length = 740 Score = 464 bits (1195), Expect = e-128 Identities = 298/742 (40%), Positives = 402/742 (54%), Gaps = 29/742 (3%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 M + LR FL+SLC SPWNYAVFWKL+HQ ++L WEDG+ D+ R+P S + Y Sbjct: 1 MSAASLRHFLESLCFKSPWNYAVFWKLQHQCPIILTWEDGYLDVPGAREPYRSQNGNYYS 60 Query: 631 GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDNI 810 N + S +G +GLAVAEM A G WI SD++ Sbjct: 61 KNLSDLSPNCGSRSHNGYLSARSIGLAVAEMSSTYHIAGKGVVGEVASLGIPRWISSDSV 120 Query: 811 ETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDK 990 V E PD+W+LQF AGIKTILL+P IP GVLQLGSVETVAEN +V + ++ Sbjct: 121 APAELGFGSVAECPDKWMLQFVAGIKTILLVPCIPXGVLQLGSVETVAENMEMVTILAEE 180 Query: 991 FEAHKK------PDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRTKD 1152 F+AH K P G +F L ST E L PS TT KVNED A +D Sbjct: 181 FDAHLKFVESFLPGGESC-----EFLLQSTLSETLNIPSATTTNKVNEDDVAADIPIVED 235 Query: 1153 YNLIAN-QMMPVFMVQDFANASVRCVADTSENVTENDFSR-------------------Q 1272 + A M + VQ S + + + EN E+ + Q Sbjct: 236 HKSSAVFPMTSLIDVQHPFQLSGQHMQNVLENENESKIGKFVEHMPNVLENAYKWEIPMQ 295 Query: 1273 PLSMIHLDEPFQLSY-EDNQSFISKNDI-SNYYHEETSSPSSYFDNFSRRMYGEFMNESM 1446 + MI+L + Y +DN+S I++ I + H + SY S + G ++ + Sbjct: 296 HVDMINLVKQLAHGYSDDNRSGITERSIVRSSCHTKDIDAFSYS---SCNVGGVGVSNEV 352 Query: 1447 GFHFEDGATDPTFLGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSIS 1626 FHF+ DP LG D + I + N FS + EL++ G+++ + + +S S Sbjct: 353 DFHFDGDMLDPRSLGMDCHNTILGNVSNSFSCSTERELHEAFGSTIHNLSGFSANPSSKS 412 Query: 1627 GHDLACHSSGDRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNK-SGITSSNVSLRE 1803 + C + + E S A+A DD S NK +G+ S N+S + Sbjct: 413 IYAADC--TFNSEPSDGWHLKEDNAENLLEAVVASAYCFTDDYSLNKMAGLESLNMSSGK 470 Query: 1804 HLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQ 1983 + S K+ +QS +S V V + LTS + + + + Sbjct: 471 PVPSRKRLNQSAESDSV-GDAVTRSTLTSASAGVDKYASTNRPHSASSFDYVVSTFDEGH 529 Query: 1984 KKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSI 2163 + K + S K S++S TNK+R +GD+ KP+PRDRQLIQDRLKELR+LVP+ KCSI Sbjct: 530 HQTKVFSSLDCHKESKISNTNKKRRRSGDSHKPRPRDRQLIQDRLKELRQLVPSGAKCSI 589 Query: 2164 DGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSE 2343 DGLLDKTI HMLFL +VT +ADK++ Q + +K + + HQ GTSWA+ELGS Sbjct: 590 DGLLDKTIKHMLFLRSVTDQADKIKFQAQTEVAPDKNLQSPPIKSNHQQGTSWALELGSV 649 Query: 2344 QQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFV 2523 Q+CPI+VKDL +PGHMLIEM+C DHGRFLEI+DVIHRL+LTIL+GVMEK ++++WA F+ Sbjct: 650 DQICPIIVKDLEYPGHMLIEMMCDDHGRFLEISDVIHRLELTILKGVMEKRSESTWAHFI 709 Query: 2524 VETSGSFHRLDIFWPLMQLLQQ 2589 VE SGSFHRLDIFWPLMQLLQQ Sbjct: 710 VEASGSFHRLDIFWPLMQLLQQ 731 >ref|XP_006493563.1| PREDICTED: transcription factor EMB1444-like [Citrus sinensis] Length = 730 Score = 444 bits (1143), Expect = e-122 Identities = 282/738 (38%), Positives = 404/738 (54%), Gaps = 20/738 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGT+ LR L+S C N PWNYAV WKLK + +M+L WEDG+CD LK R P+ ED+Y Sbjct: 1 MGTTALRQLLKSFCYNLPWNYAVLWKLKLEGQMILSWEDGYCDHLKPRQPLGIMSEDIYH 60 Query: 631 GNSDKILSTFS-SSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 ++++ ST S +S DG +GL +A M A +G W+ D+ Sbjct: 61 NGANELFSTRSETSAGDGGFEGYSIGLVLANMSHLQYALGEGVVGEVANSGTHFWVSYDD 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 + T N+ LVP+ PDEWLLQ A+GIKTILL+PV+PHGV+QLGS++ +AE+ A+VA +KD Sbjct: 121 VSTTKVNSKLVPKCPDEWLLQLASGIKTILLVPVLPHGVVQLGSLQVIAEDVAVVAGIKD 180 Query: 988 KFEAHKKPD------GYDLRYSVQQFSLVSTFLENLEEPSTVTTEKV-NEDQNATHAVRT 1146 +F + + D+R + +L S +++L+EPS T ++ +ED +A +V+ Sbjct: 181 RFIHNAWRNTVLSILNRDIR-TKSSSTLTSGLMDSLDEPSASTISQLKSEDSDAVDSVKP 239 Query: 1147 KDYNLIA-NQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYED 1323 + + ++PV +QD SV+ ++ T + +EN + L + + S Sbjct: 240 NKVLVSTFDPILPVETLQDALRGSVKDLSGTFRSESENKIAVPSLGLSEASKSQGHSLFA 299 Query: 1324 NQ-SFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDF 1500 Q + EE S D ++ + GEF +M + P + + F Sbjct: 300 GQWEMMESKFFGLSCLEEELQAYSQCDKYNLELLGEFSGGAMSCY-------PASMEQPF 352 Query: 1501 DSGICE----SGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSG---D 1659 IC S F +FP D EL+K LG + + +T Y G S D C+SS Sbjct: 353 QHEICNNIDHSSAIFLNFPKDCELHKALGPAFQRHTSDYL-GDSYHLVDNICNSSSLIHK 411 Query: 1660 RETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNK---SGITSSNVSLREHLGSTKKHD 1830 R+ + ++ A +S+ + +G+ SS +SL + + + Sbjct: 412 RDFTDGIEPTSSVKGSDADLLEAVVTSVRRGTYGSPDLYNGVNSSLISLEKFVTLSPPQS 471 Query: 1831 QSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQ 2010 S+ SA +P + + S S G ++ +G D + K ++S Sbjct: 472 HSEDSASAGVDSIPQSKVIST-SLSG--NKNEFSPTSSSFKNAMGTFIDTELFGKEHNSL 528 Query: 2011 KPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTIN 2190 +P K +LS NKRR GDNQKP+PRDRQLIQDR+KELRELVPN KCSID LL +TI Sbjct: 529 QPRKGMKLSNANKRRTKPGDNQKPRPRDRQLIQDRIKELRELVPNGVKCSIDCLLGRTIE 588 Query: 2191 HMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVK 2370 HML+L +VT +A+KL V + K R +E N G QNGT+WA E+G+E CPIVV+ Sbjct: 589 HMLYLRSVTDQAEKLNQWVHREVAARKDLRSSETNDGKQNGTTWAFEVGNELLACPIVVE 648 Query: 2371 DLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHR 2550 DL++PGHMLIEMLC + FLEIA VI L+LTIL+GVME +N+WA F+VETS FHR Sbjct: 649 DLSYPGHMLIEMLCNEQSLFLEIAQVIRSLELTILKGVMENRCNNTWAHFIVETSKGFHR 708 Query: 2551 LDIFWPLMQLLQQSRAPI 2604 +IFWPLM LLQ+ R PI Sbjct: 709 TEIFWPLMHLLQRKRKPI 726 >ref|XP_006429166.1| hypothetical protein CICLE_v10011164mg [Citrus clementina] gi|557531223|gb|ESR42406.1| hypothetical protein CICLE_v10011164mg [Citrus clementina] Length = 730 Score = 443 bits (1140), Expect = e-121 Identities = 282/738 (38%), Positives = 404/738 (54%), Gaps = 20/738 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGT+ LR L+S C N PWNYAV WKLK + +M+L WEDG+CD LK R P+ ED+Y Sbjct: 1 MGTTALRQLLKSFCYNLPWNYAVLWKLKLEGQMILSWEDGYCDHLKPRQPLGIMSEDIYH 60 Query: 631 GNSDKILSTFS-SSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 ++++ ST S +S DG +GL +A M A +G W+ D+ Sbjct: 61 NGANELFSTRSETSAGDGGFEGYSIGLVLANMSHLQYALGEGVVGEVANSGTHFWVSYDD 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 + T N+ LVP+ PDEWLLQ A+GIKTILL+PV+PHGV+QLGS++ +AE+ A+VA +KD Sbjct: 121 VSTTKVNSKLVPKCPDEWLLQLASGIKTILLVPVLPHGVVQLGSLQVIAEDVAVVAGIKD 180 Query: 988 KFEAHKKPD------GYDLRYSVQQFSLVSTFLENLEEPSTVTTEKV-NEDQNATHAVRT 1146 +F + + D+R + +L S +++L+EPS T ++ +ED +A +V+ Sbjct: 181 RFIHNAWRNTVLSILNRDIR-TKSSSTLTSGLMDSLDEPSASTISQLKSEDSDAVDSVKP 239 Query: 1147 KDYNLIA-NQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYED 1323 + + ++PV +QD SV+ ++ T + +EN + L + + S Sbjct: 240 NKVLVSTFDPILPVETLQDALRGSVKDLSGTFRSESENKIAVPSLGLSEASKSQGHSLFA 299 Query: 1324 NQ-SFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDF 1500 Q + EE S D ++ + GEF +M + P + + F Sbjct: 300 GQWEMMESKFFGLSCLEEELQAYSQCDKYNLELLGEFSGGAMSCY-------PASMEQPF 352 Query: 1501 DSGICE----SGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSG---D 1659 IC S F +FP D EL+K LG + + +T Y G S D C+SS Sbjct: 353 QHEICNNIDHSSAIFLNFPKDCELHKALGPAFQRHTSDYL-GDSYHLVDNICNSSSLIHK 411 Query: 1660 RETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNK---SGITSSNVSLREHLGSTKKHD 1830 R+ + ++ A +S+ + +G+ SS +SL + + + Sbjct: 412 RDFTDGIEPTSSVKGSDADLLEAVVTSVRRGTYGSPDLYNGVNSSLISLEKFVTLSPPQS 471 Query: 1831 QSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQ 2010 S+ SA +P + + S S G ++ +G D + K ++S Sbjct: 472 HSEDSASAGVDSIPQSKVIST-SLSG--NKNEFSPTSSSFKNAMGTFIDTELFGKEHNSL 528 Query: 2011 KPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTIN 2190 +P K +LS NKRR GDNQKP+PRDRQLIQDR+KELRELVPN KCSID LL +TI Sbjct: 529 QPRKGMKLSNANKRRTKPGDNQKPRPRDRQLIQDRIKELRELVPNGVKCSIDCLLGRTIE 588 Query: 2191 HMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVK 2370 HML+L +VT +A+KL V + K R +E N G QNGT+WA E+G+E CPIVV+ Sbjct: 589 HMLYLRSVTDQAEKLNQWVHREVAARKDLRSSETNDGKQNGTTWAFEVGNELLACPIVVE 648 Query: 2371 DLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHR 2550 DL++PGHMLIEMLC + FLEIA VI L+LTIL+GVME +N+WA F+VETS FHR Sbjct: 649 DLSYPGHMLIEMLCNEQCLFLEIAQVIRSLELTILKGVMENRCNNTWAHFIVETSKGFHR 708 Query: 2551 LDIFWPLMQLLQQSRAPI 2604 +IFWPLM LLQ+ R PI Sbjct: 709 TEIFWPLMHLLQRKRKPI 726 >ref|XP_002309084.1| hypothetical protein POPTR_0006s09100g [Populus trichocarpa] gi|222855060|gb|EEE92607.1| hypothetical protein POPTR_0006s09100g [Populus trichocarpa] Length = 708 Score = 424 bits (1089), Expect = e-115 Identities = 276/734 (37%), Positives = 400/734 (54%), Gaps = 16/734 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGT+ LR L+SLC NS W YAV WK+++ M+L WEDG+ D K R+P+ + D+Y Sbjct: 1 MGTTDLRQLLESLCNNSDWKYAVLWKMRYGSPMILTWEDGYFDCPKPREPLQTISSDVYC 60 Query: 631 -GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 G +D S +S + G + L VA+M A+TG+ W+ +N Sbjct: 61 NGGNDLASSLRDASASNANFGGHQIELVVADMLHLQYPLGEGVVGEVAYTGDHFWLSFNN 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 I + + LVPE+P+EWLLQFA+GIKTILL+PV+PHGVLQLGS + VAE+ +VAY+K Sbjct: 121 IFSCEMSKNLVPEFPEEWLLQFASGIKTILLVPVLPHGVLQLGSFDEVAEDIQIVAYIKG 180 Query: 988 KF-EAHK-KPDGYDL---RYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRTKD 1152 +F + H + + L R Q +L+S +E L S ++ +V + +++ +++ Sbjct: 181 RFNDLHSTRENAVPLTLKREFKAQSTLISCPVEQLNATSAISISQV-KSEDSNYSIPVNS 239 Query: 1153 YNLIANQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSY--EDN 1326 L ++ VF + N+ AD S +E+ + QP M+ + F+LSY ++ Sbjct: 240 VKLHKDEQPEVFKCESKNNSLSPIFADVSPP-SESLSASQP-GMVE-SKIFELSYLMDEL 296 Query: 1327 QSFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDS 1506 Q++ N+ ++ +GE ++ M + + + G D + Sbjct: 297 QAYSDCNE------------------YNVGWFGEPLDGMMNTYPTADMVEQSSGGMDAND 338 Query: 1507 GICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSG-------DRE 1665 ++ +F SFP EL+K+LG T T+ S+ D +C SS Sbjct: 339 VYHKNRQSFLSFPKGSELHKVLGPPFLSQTNEKTWEPSLLVED-SCKSSNFIFSEDHSAR 397 Query: 1666 TSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQS 1845 S+ N+ S D+ SSN+S S+ L HL +T ++ Q + Sbjct: 398 IEPSLFAREGEVEFLLEPVAGNSYSSSDNASSNRSHSLKSSERLSGHLLATSQN-QFQTR 456 Query: 1846 APVEAYKVPWNFLTSE-FSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGK 2022 V PWN L S S G ++S + + D++++ K + P K Sbjct: 457 TLVGDDLAPWNHLASVCISGSG------NTDTTAALDSMMSTIFDQEQQEKDQSYKHPWK 510 Query: 2023 LSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLF 2202 ++S +RRA G+NQKP+PRDRQLIQDR+KELRELVPN KCSIDGLLD+TI HM + Sbjct: 511 GQKMSNVARRRARPGENQKPRPRDRQLIQDRVKELRELVPNGSKCSIDGLLDQTIKHMQY 570 Query: 2203 LSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNH 2382 L +VT +A+KLR V ++ D K R +E N Q+G SWA E G++ Q+CPIVV+DL + Sbjct: 571 LRSVTDQAEKLRQWVHQEVADRKNCRLSETNVNIQSGKSWAFEFGNDLQICPIVVEDLAY 630 Query: 2383 PGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIF 2562 PGH+LIEMLC D G FLEIA VI L LTIL+GVME N+WA F+VE FHRLDIF Sbjct: 631 PGHLLIEMLCNDRGVFLEIAQVIRSLDLTILKGVMESRLSNTWAHFIVEACKGFHRLDIF 690 Query: 2563 WPLMQLLQQSRAPI 2604 WPLMQLLQ+ R+ I Sbjct: 691 WPLMQLLQRKRSSI 704 >gb|EOY07431.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 1 [Theobroma cacao] gi|508715539|gb|EOY07436.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 1 [Theobroma cacao] Length = 682 Score = 422 bits (1085), Expect = e-115 Identities = 271/722 (37%), Positives = 382/722 (52%), Gaps = 4/722 (0%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGTS LR L+S C NSPW YAV WKL+H+ M L WEDG+C + R+ V S D++ Sbjct: 1 MGTSALRQLLKSFCSNSPWKYAVLWKLRHRSPMSLTWEDGYCVYPRPRESVESISSDVH- 59 Query: 631 GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDNI 810 NS+ I S F +S+ DG G P+GL VA M A+TG W+ D+I Sbjct: 60 SNSEIIPSHFETSIHDGCFGGYPIGLVVANMSHLKYAWGEGVVGKVAYTGKHCWVSYDDI 119 Query: 811 ETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDK 990 T N+ LVPE P+EWLLQFA+GIKTI+L+PV+PHGV QLGS+E V E+ + AY+KD+ Sbjct: 120 FTGKANSKLVPECPEEWLLQFASGIKTIVLVPVLPHGVFQLGSLEMVPEDLSTPAYIKDR 179 Query: 991 FEAHKKPDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVN-EDQNATHAVRTKDYNLIA 1167 F D+ + L S+ LE LEE S+ + +N ED NA ++ Sbjct: 180 FSCK------DIHTQLPSL-LTSSLLEKLEESSSASISPLNSEDSNAVDGIKPLS----- 227 Query: 1168 NQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYEDNQSFISKN 1347 + F V + + + E+ EN S P+S+ + P S Q + ++ Sbjct: 228 --IQNAFQVPEID------LPEVLESEGENKISVPPVSLSEVSSPLSQSINSYQLAMGES 279 Query: 1348 D---ISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDSGICE 1518 + +S E ++P ++ ++ GE ++ + +P F D + + Sbjct: 280 EMFGLSCIKEELWANPE--YNGYTVGECGEILDGVTYPYPASDLLEPPF----GDFSVYD 333 Query: 1519 SGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSGDRETSYSMDXXXXX 1698 +G F SFP D EL+K LG + E + Y + +S D+ D E S++ Sbjct: 334 AG--FLSFPKDCELHKALGPAFEKQSNEYFWESSFLTEDVFRDLFDDIEPSFAKGGDAEY 391 Query: 1699 XXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQSAPVEAYKVPWN 1878 + S + + S++ + ST + S + V +P + Sbjct: 392 LLQAVVGHVYDGSVDIANRSNH-------------FMTSTGQLPVSIRPQSVMGDSIPVS 438 Query: 1879 FLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKLSRLSTTNKRRA 2058 +TS + G +S + L+D + K + K + S+ KRRA Sbjct: 439 RVTS--ALVGEAKNNSSSKTSASFKSTVSTLTDDKNLGKDCYYMQSRKGQKQSSVTKRRA 496 Query: 2059 HTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLR 2238 GDN +P+PRDRQ+IQDRLKELRELVPN +K SID LLD T+ HM +LS+VT +A+KL+ Sbjct: 497 RLGDNPRPRPRDRQMIQDRLKELRELVPNGDKHSIDALLDHTVKHMRYLSSVTNQAEKLK 556 Query: 2239 DQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNHPGHMLIEMLCTD 2418 V + T K R +E +Q G SWA E+G E + CPIVV+DL +PGH LIEMLC + Sbjct: 557 QWVHREVTVRKNMRSSESKDCYQMGASWAFEIGDELKACPIVVEDLAYPGHFLIEMLCNE 616 Query: 2419 HGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIFWPLMQLLQQSRA 2598 H FLEIA VI LTIL+GVME ++N+WA F+VE S FHRLDIFWPLMQLLQ+ R Sbjct: 617 HCLFLEIAQVIRSFNLTILKGVMESCSNNTWAHFIVEASRGFHRLDIFWPLMQLLQRQRN 676 Query: 2599 PI 2604 PI Sbjct: 677 PI 678 >ref|XP_004302716.1| PREDICTED: transcription factor EMB1444-like [Fragaria vesca subsp. vesca] Length = 715 Score = 416 bits (1068), Expect = e-113 Identities = 278/734 (37%), Positives = 393/734 (53%), Gaps = 16/734 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGT+ LR L+SLC NS WNY VFWKLKHQ +++L WEDG+C K R + ++++ Sbjct: 1 MGTTALRQLLKSLCGNSLWNYGVFWKLKHQTDLILRWEDGYCHQPKPRGTMDHATDNIFF 60 Query: 631 GNSDKI-LSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 G ++I +S+ +G +GLAVA+M A TGN SW+ D Sbjct: 61 GEVNEISFKKCGTSIHEGGSAGYSIGLAVADMSHLQYTFGKGVVGGVASTGNHSWVLLDG 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 + T ++ LV + PDEWLLQFA G+KTILL+PV+PHGVLQ GS+ETVAE+ A+VA++KD Sbjct: 121 LLTSESDSNLVSDCPDEWLLQFALGVKTILLVPVLPHGVLQFGSMETVAEDLAVVAFMKD 180 Query: 988 KFEA-HK---KPDGYDLRYSVQ---QFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRT 1146 +F A H K ++ S+Q +S S +EN E STV N R+ Sbjct: 181 RFNAIHNVMGKAVSSNIVRSIQAPYSWSQSSGLMENTYESSTVGI-------NPLKVERS 233 Query: 1147 KDYNLIANQMMPVFMVQDFANASV----RCVADTSENVTENDFSRQPLSMIHLDEPFQLS 1314 +D+ I Q + ++ F S D S +F +++ EP + Sbjct: 234 EDFGDI-RQNNTLSTLEQFVQLSTIESPLFGIDPSVLKNSGEFEVGGMAVWSTGEPKTAN 292 Query: 1315 YEDNQSFIS--KNDISNY-YHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTF 1485 + S + +N I EE S ++S ++GE + GF+ ++ Sbjct: 293 QSSDTSLLDMLENQIFGLSCQEEEHVALSQNGSYSFGVFGESFD---GFNSYIAGSEAEQ 349 Query: 1486 LGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSGDRE 1665 L + + + NFF FP EL+K LGTS + T + SIS D C SSG ++ Sbjct: 350 LFKFNNDTGHNNINNFFEFPETSELHKALGTSFQRQTDEQLWDLSISIDD-TCSSSGVQK 408 Query: 1666 TSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKS-GITSSNVSLREHLGSTKKHDQSKQ 1842 S AS DD SS+ S GI S S R++ S+ K +S++ Sbjct: 409 NLVSRTNPPWFSNGCDAENLLEASLAKDDTSSSISDGIKSCTTSTRQY--SSYKQLKSEE 466 Query: 1843 SAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGK 2022 A +E V W+ + + PG + + D Q++ K + +P K Sbjct: 467 GALMECEPVIWSHTS---ALPG------RCNTSSSFTGMMNTVVDNQQEDKRCNPTQPKK 517 Query: 2023 LSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLF 2202 +LS+TN RR ++ K +PRDRQLIQDR+KELRELVPN KCSIDGLLD+TI HM++ Sbjct: 518 EQKLSSTNPRRPKPSNSPKLRPRDRQLIQDRVKELRELVPNGAKCSIDGLLDRTIKHMMY 577 Query: 2203 LSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNH 2382 L ++T +A+KL+ + + G NGTS A ELGSE Q PIVV+DL H Sbjct: 578 LRSMTDQAEKLKSYAHKDQERPHCNNTNKTLSGSSNGTSRAFELGSELQTSPIVVEDLEH 637 Query: 2383 PGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIF 2562 PGHMLIEMLC +HG FLEIA I RL+LT+L+GV+E ++N WA FVVE FHR+D+F Sbjct: 638 PGHMLIEMLCDEHGLFLEIAQAIRRLELTVLKGVLETRSNNLWAHFVVEVPRGFHRMDVF 697 Query: 2563 WPLMQLLQQSRAPI 2604 WPL+ LLQ+ ++ + Sbjct: 698 WPLLHLLQRRKSSL 711 >ref|XP_002533696.1| basic helix-loop-helix-containing protein, putative [Ricinus communis] gi|223526407|gb|EEF28691.1| basic helix-loop-helix-containing protein, putative [Ricinus communis] Length = 740 Score = 391 bits (1005), Expect = e-106 Identities = 276/751 (36%), Positives = 383/751 (50%), Gaps = 33/751 (4%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLY- 627 MG + LR L+SLC NS WNYAV WKL+H M+L WEDG+ + K R+ V + +D+Y Sbjct: 1 MGATALRQLLKSLCSNSTWNYAVLWKLRHGSPMILTWEDGYFNYSKSRELVGTISDDVYG 60 Query: 628 LGNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 G SD I ++ G E PVGL VA+M A + W+ + Sbjct: 61 KGASDLISPQVETNTSRGISEEYPVGLVVADMSHLQYIFGEGVVGKVAALRDHCWVSFHH 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 I T + L+PE P+EWLLQFA+GIKTILL+PV+P+GVLQLGS+E VAE+ ++VAY+K Sbjct: 121 IFTG--KSELIPECPEEWLLQFASGIKTILLVPVLPYGVLQLGSLEEVAEDVSIVAYIKY 178 Query: 988 KFEAHKK------PDGYDLRYSVQ-QFSLVSTFLENLEEPST-----VTTEKVNEDQNAT 1131 +F + P Q SL+S+ + L P T V TE V + + Sbjct: 179 RFNCLQSVGENTGPCSLKKESQAQLSSSLISSSNKCLNVPLTNILTSVKTEDVYQSIASN 238 Query: 1132 HAVRTKDYNLIANQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQL 1311 D A+ + + QD + + E + N ++ + ++ + P + Sbjct: 239 IVELGNDNLATASYVQRLVTFQDVFTPTGEGLP---EAIIFNRDNKINVPLVEVSNP-SV 294 Query: 1312 SYEDNQSFISKNDISNYY--------HEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDG 1467 S D+Q + ++ + + H E S ++ ++ + E NE M H Sbjct: 295 SINDSQLEMMESKLFDLSCLMEEIQAHSEELQRYSDYNGYNMGLLEESFNEIMNIHPAGS 354 Query: 1468 ATDPTFLGR---DFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDL 1638 T + D D+ I S F FP D EL+K L + T + +S + Sbjct: 355 MTGEPCGDKYAIDLDNKIVSS---FLRFPKDSELHKALEPASSKQTSEQFWDSSFMVENT 411 Query: 1639 ACHSS---------GDRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNV 1791 SS DR T S ANA DD + S+ Sbjct: 412 CGTSSLPPSKDPNTSDR-TEPSWFARGGDAGYLLEAVVANACHSSDDTICYEFKSLESST 470 Query: 1792 SLREHLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGAL 1971 S R + K+ Q K S + +P N LTS I + S + + Sbjct: 471 SPRGSASPSPKN-QYKGSDLAKDSSIPRNHLTSAC----ITEDRNADSTSDTLMSMMNTI 525 Query: 1972 SDKQKKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSE 2151 ++ K G + + K R ++KRRA DNQ+ +PRDRQLIQ+R+KELRELVPN Sbjct: 526 LSQEHKGGGTGNTQLRKERRTLNSSKRRARPSDNQRQRPRDRQLIQERVKELRELVPNGA 585 Query: 2152 KCSIDGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVE 2331 KCSIDGLLD+TI HM++L +VT +A+KLR + ++ K RP+E +QNGTSWA E Sbjct: 586 KCSIDGLLDRTIKHMMYLRSVTDQAEKLRHCLHQELAGCKNWRPSETEENYQNGTSWAFE 645 Query: 2332 LGSEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSW 2511 LG+E Q+CPI V+DL +PGHMLIEMLC +HG FLEIA VI L LTIL+GV++ + N+W Sbjct: 646 LGNEFQVCPIAVEDLAYPGHMLIEMLCDEHGLFLEIAQVIRGLGLTILKGVLKSRSSNTW 705 Query: 2512 ASFVVETSGSFHRLDIFWPLMQLLQQSRAPI 2604 A FVVE S FHRLDIFWPLMQLLQ+ R I Sbjct: 706 ARFVVEASKGFHRLDIFWPLMQLLQRKRKSI 736 >gb|EOY07437.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 7, partial [Theobroma cacao] Length = 713 Score = 383 bits (984), Expect = e-103 Identities = 251/697 (36%), Positives = 361/697 (51%), Gaps = 4/697 (0%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 MGTS LR L+S C NSPW YAV WKL+H+ M L WEDG+C + R+ V S D++ Sbjct: 1 MGTSALRQLLKSFCSNSPWKYAVLWKLRHRSPMSLTWEDGYCVYPRPRESVESISSDVH- 59 Query: 631 GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDNI 810 NS+ I S F +S+ DG G P+GL VA M A+TG W+ D+I Sbjct: 60 SNSEIIPSHFETSIHDGCFGGYPIGLVVANMSHLKYAWGEGVVGKVAYTGKHCWVSYDDI 119 Query: 811 ETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDK 990 T N+ LVPE P+EWLLQFA+GIKTI+L+PV+PHGV QLGS+E V E+ + AY+KD+ Sbjct: 120 FTGKANSKLVPECPEEWLLQFASGIKTIVLVPVLPHGVFQLGSLEMVPEDLSTPAYIKDR 179 Query: 991 FEAHKKPDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVN-EDQNATHAVRTKDYNLIA 1167 F D+ + L S+ LE LEE S+ + +N ED NA ++ Sbjct: 180 FSCK------DIHTQLPSL-LTSSLLEKLEESSSASISPLNSEDSNAVDGIKP------- 225 Query: 1168 NQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYEDNQSFISKN 1347 +Q+ + + E+ EN S P+S+ + P S Q + ++ Sbjct: 226 ------LSIQNAFQVPEIDLPEVLESEGENKISVPPVSLSEVSSPLSQSINSYQLAMGES 279 Query: 1348 D---ISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDSGICE 1518 + +S E ++P ++ ++ GE ++ + +P F D + + Sbjct: 280 EMFGLSCIKEELWANPE--YNGYTVGECGEILDGVTYPYPASDLLEPPF----GDFSVYD 333 Query: 1519 SGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSGDRETSYSMDXXXXX 1698 +G F SFP D EL+K LG + E + Y + +S D+ D E S++ Sbjct: 334 AG--FLSFPKDCELHKALGPAFEKQSNEYFWESSFLTEDVFRDLFDDIEPSFAKGGDAEY 391 Query: 1699 XXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQSAPVEAYKVPWN 1878 + S + + S++ + ST + S + V +P + Sbjct: 392 LLQAVVGHVYDGSVDIANRSNH-------------FMTSTGQLPVSIRPQSVMGDSIPVS 438 Query: 1879 FLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKLSRLSTTNKRRA 2058 +TS + G +S + L+D + K + K + S+ KRRA Sbjct: 439 RVTS--ALVGEAKNNSSSKTSASFKSTVSTLTDDKNLGKDCYYMQSRKGQKQSSVTKRRA 496 Query: 2059 HTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLR 2238 GDN +P+PRDRQ+IQDRLKELRELVPN +K SID LLD T+ HM +LS+VT +A+KL+ Sbjct: 497 RLGDNPRPRPRDRQMIQDRLKELRELVPNGDKHSIDALLDHTVKHMRYLSSVTNQAEKLK 556 Query: 2239 DQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNHPGHMLIEMLCTD 2418 V + T K R +E +Q G SWA E+G E + CPIVV+DL +PGH LIEMLC + Sbjct: 557 QWVHREVTVRKNMRSSESKDCYQMGASWAFEIGDELKACPIVVEDLAYPGHFLIEMLCNE 616 Query: 2419 HGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVE 2529 H FLEIA VI LTIL+GVME ++N+WA F+VE Sbjct: 617 HCLFLEIAQVIRSFNLTILKGVMESCSNNTWAHFIVE 653 >gb|EOY07432.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 2 [Theobroma cacao] gi|508715536|gb|EOY07433.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 2 [Theobroma cacao] gi|508715537|gb|EOY07434.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 2 [Theobroma cacao] gi|508715538|gb|EOY07435.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 2 [Theobroma cacao] Length = 650 Score = 378 bits (970), Expect = e-102 Identities = 251/690 (36%), Positives = 359/690 (52%), Gaps = 4/690 (0%) Frame = +1 Query: 547 MVLVWEDGFCDILKLRDPVVSPIEDLYLGNSDKILSTFSSSVLDGTPGECPVGLAVAEMX 726 M L WEDG+C + R+ V S D++ NS+ I S F +S+ DG G P+GL VA M Sbjct: 1 MSLTWEDGYCVYPRPRESVESISSDVH-SNSEIIPSHFETSIHDGCFGGYPIGLVVANMS 59 Query: 727 XXXXXXXXXXXXXAAFTGNTSWIYSDNIETDVFNTVLVPEYPDEWLLQFAAGIKTILLLP 906 A+TG W+ D+I T N+ LVPE P+EWLLQFA+GIKTI+L+P Sbjct: 60 HLKYAWGEGVVGKVAYTGKHCWVSYDDIFTGKANSKLVPECPEEWLLQFASGIKTIVLVP 119 Query: 907 VIPHGVLQLGSVETVAENAALVAYVKDKFEAHKKPDGYDLRYSVQQFSLVSTFLENLEEP 1086 V+PHGV QLGS+E V E+ + AY+KD+F D+ + L S+ LE LEE Sbjct: 120 VLPHGVFQLGSLEMVPEDLSTPAYIKDRFSCK------DIHTQLPSL-LTSSLLEKLEES 172 Query: 1087 STVTTEKVN-EDQNATHAVRTKDYNLIANQMMPVFMVQDFANASVRCVADTSENVTENDF 1263 S+ + +N ED NA ++ + F V + + + E+ EN Sbjct: 173 SSASISPLNSEDSNAVDGIKPLS-------IQNAFQVPEID------LPEVLESEGENKI 219 Query: 1264 SRQPLSMIHLDEPFQLSYEDNQSFISKND---ISNYYHEETSSPSSYFDNFSRRMYGEFM 1434 S P+S+ + P S Q + +++ +S E ++P ++ ++ GE + Sbjct: 220 SVPPVSLSEVSSPLSQSINSYQLAMGESEMFGLSCIKEELWANPE--YNGYTVGECGEIL 277 Query: 1435 NESMGFHFEDGATDPTFLGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYG 1614 + + +P F D + ++G F SFP D EL+K LG + E + Y + Sbjct: 278 DGVTYPYPASDLLEPPF----GDFSVYDAG--FLSFPKDCELHKALGPAFEKQSNEYFWE 331 Query: 1615 TSISGHDLACHSSGDRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNVS 1794 +S D+ D E S++ + S + + S++ Sbjct: 332 SSFLTEDVFRDLFDDIEPSFAKGGDAEYLLQAVVGHVYDGSVDIANRSNH---------- 381 Query: 1795 LREHLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALS 1974 + ST + S + V +P + +TS + G +S + L+ Sbjct: 382 ---FMTSTGQLPVSIRPQSVMGDSIPVSRVTS--ALVGEAKNNSSSKTSASFKSTVSTLT 436 Query: 1975 DKQKKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEK 2154 D + K + K + S+ KRRA GDN +P+PRDRQ+IQDRLKELRELVPN +K Sbjct: 437 DDKNLGKDCYYMQSRKGQKQSSVTKRRARLGDNPRPRPRDRQMIQDRLKELRELVPNGDK 496 Query: 2155 CSIDGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVEL 2334 SID LLD T+ HM +LS+VT +A+KL+ V + T K R +E +Q G SWA E+ Sbjct: 497 HSIDALLDHTVKHMRYLSSVTNQAEKLKQWVHREVTVRKNMRSSESKDCYQMGASWAFEI 556 Query: 2335 GSEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWA 2514 G E + CPIVV+DL +PGH LIEMLC +H FLEIA VI LTIL+GVME ++N+WA Sbjct: 557 GDELKACPIVVEDLAYPGHFLIEMLCNEHCLFLEIAQVIRSFNLTILKGVMESCSNNTWA 616 Query: 2515 SFVVETSGSFHRLDIFWPLMQLLQQSRAPI 2604 F+VE S FHRLDIFWPLMQLLQ+ R PI Sbjct: 617 HFIVEASRGFHRLDIFWPLMQLLQRQRNPI 646 >gb|EXB36735.1| hypothetical protein L484_016987 [Morus notabilis] Length = 749 Score = 373 bits (958), Expect = e-100 Identities = 256/764 (33%), Positives = 380/764 (49%), Gaps = 49/764 (6%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 M +S LR FL SLC N+ W YAVFWKL+HQ +L WED +CD K + + S +D ++ Sbjct: 1 MESSPLRQFLISLCNNTHWKYAVFWKLQHQTPPILTWEDAYCDNAKPAEDLGSASDDSHV 60 Query: 631 GNSDKI-LSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 S I + +S+ D C + L VA M A TG +W++ +N Sbjct: 61 NRSKPISFQSRETSMQDIGSEGCQIELLVANMSCVQYALGDGLVGDVACTGKHTWVFFNN 120 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVET------------- 948 T F++ LVP++ DEWLLQ A GIKTILL+P++P GVLQLGS+E Sbjct: 121 FFTREFDSNLVPDWTDEWLLQIAMGIKTILLVPLLPDGVLQLGSLEMAVLLERNRFERCE 180 Query: 949 ---------VAENAALVAYVKDKFEAHKKPDGYDLRYSVQQ-------FSLVSTFLENLE 1080 VAE+ ++V ++K++F+A+ + +++ S +S+ +E+L Sbjct: 181 EECGVIWDRVAEDLSVVGFIKERFDAYHSMMSSTIPFTIMMNPVDHSSLSPLSSTVESLN 240 Query: 1081 EPSTVTTEKVNEDQNATHAVRTKDYNLIAN--QMMPVFMVQDFANASVRCVADTSENVTE 1254 EP+ + T +V ++ T + ++ Q +PV VQD D ++ ++ Sbjct: 241 EPTRLITSRVKSEKLEDFDCNTLNERRLSTSKQSIPVQTVQDMLVVPKNDAVDVFKSTSK 300 Query: 1255 NDFSRQPLSMIHLDEPFQLSYEDNQSFISKNDISNYYH-EETSSPSSYFDNFSRRMYGEF 1431 N+ S I LS++ N +++ ++ + EE S ++ Sbjct: 301 NEIGFPEESAIP-----SLSFDVNSLDMAEAEMFGFSCLEEELLAYSLSSGQDVELFENS 355 Query: 1432 MNESMGFHFEDGATDPTFLGRDF-DSGICESGGNFFSFPGDYELNKILGTSVED-NTYPY 1605 +N + A G D+ ++G C+S +F FP D EL++ LG S ++ NTY + Sbjct: 356 LNGVTPCTAGEMAAQ--LFGDDYINNGYCKSMTSFSRFPEDSELHRALGPSFQERNTYEH 413 Query: 1606 TYGTSISGHDLACHSSG---DRET-----------SYSMDXXXXXXXXXXXXXXANASSI 1743 + +S D + +RE S D + S Sbjct: 414 FWDSSFLIEDARTNRPSAFCNRELLDVIEPSWFGGSGDKDYLLEAVVTDLCCSSDDVLSS 473 Query: 1744 LDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXX 1923 L DN S +TSS S K Q+ +++ +FL S + Sbjct: 474 LSDNVP--SYVTSSRQSTFSQPQVQSKAGPRMQNCSIQSNLAKPSFLPRVDSLTSL---- 527 Query: 1924 XXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQL 2103 + L+++ ++ K + K R T RR G QK +PRDRQL Sbjct: 528 ---------DGMTSTLTNEGRQVKVQGPVQSSKQKRPPNTKTRRTRNGSTQKSRPRDRQL 578 Query: 2104 IQDRLKELRELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRP 2283 IQDR+KELRELVPN KCSIDGLLD+TI HML+L +V +A KL+ +L + + RR Sbjct: 579 IQDRVKELRELVPNGAKCSIDGLLDQTIKHMLYLESVAGQAKKLKGHLLREAASGRNRRS 638 Query: 2284 AEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLK 2463 + QNGTSWA E GS QQ CPIVV+DL + GHMLIE+LC DHG FL+IA +I RL Sbjct: 639 TATCNTLQNGTSWAFEFGSVQQACPIVVEDLGNTGHMLIEVLCDDHGLFLDIAQLIRRLD 698 Query: 2464 LTILRGVMEKSADNSWASFVVETSGSFHRLDIFWPLMQLLQQSR 2595 LT+L+GVME + N+WA FVVE + FHR++IFWPL+ LLQ+ + Sbjct: 699 LTVLKGVMENRSSNTWAHFVVEATKGFHRMEIFWPLLHLLQRKK 742 >ref|XP_006846364.1| hypothetical protein AMTR_s00012p00261730 [Amborella trichopoda] gi|548849134|gb|ERN08039.1| hypothetical protein AMTR_s00012p00261730 [Amborella trichopoda] Length = 717 Score = 318 bits (815), Expect = 8e-84 Identities = 244/741 (32%), Positives = 357/741 (48%), Gaps = 34/741 (4%) Frame = +1 Query: 466 LRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLY---LGN 636 LR L+ C +S W YAVFWKLKH+ M+L WEDG+ + K + + + +G Sbjct: 5 LRQLLKGFCHDSEWQYAVFWKLKHRSRMLLTWEDGYYNFPKPPCNIQDTTTNAFFNSIGG 64 Query: 637 SDKILSTFSSSVLDGTPGEC---PVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 +D +SS +DG P+G AVA M AF+G W +++ Sbjct: 65 AD-----YSSDAIDGRVRHSVRDPIGAAVANMSYLVYALGEGIIGQVAFSGRHYWAFAEK 119 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 + N+ VPEYP EW QFAAGIKTI+L+PV+PHGV+QLGS++ + E+ LV +VK Sbjct: 120 VFNGEGNSQFVPEYPSEWQFQFAAGIKTIVLIPVVPHGVVQLGSLKLLMEDLKLVDHVKS 179 Query: 988 KFEAHKKPDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATHAVRT------K 1149 F + G V S +N +P + + + ++++ A+ A+ + Sbjct: 180 SFNMLQNKAGAFFPDPVHCSSN-----KNNPDPVSSSFDSISQNSFASSAIYPSISRGIQ 234 Query: 1150 DYNLIANQMMPVFM--VQDFANASVRCVADTSE--NVTENDFSRQPLS--MIHL------ 1293 NL+ N P+ F N V+ + + + NDF L M HL Sbjct: 235 AENLVENSAAPLVSNSFTYFLNQVVKSELTSFQIHHKPLNDFQDLILGEEMGHLAMRQKP 294 Query: 1294 --DEPFQLSYEDNQ-SFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFED 1464 + P Q YED+ +F ++D + SS + D S + +SM Sbjct: 295 VEELPDQNIYEDSLFNFCGQSDSNIMQGSSLSSLTQVVDQDS------LLKQSMR---SA 345 Query: 1465 GATDPTFLGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLAC 1644 D G D+ + SFP + EL+K+L + S + Sbjct: 346 SCKDQEQNGEDYLWAL--------SFPAESELHKVLKP---------VFSNMGSTDAAST 388 Query: 1645 HSSGDRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSN--KSGITSSNVSLREHLGS- 1815 SS T + ++ +LD +N +G S N S GS Sbjct: 389 DSSTQTATMSELIEPLVGEFDAWLRSEGSSEHLLDAVVANALSTGAQSCNSSSTLLGGSC 448 Query: 1816 -TKKHDQSKQSAPVEAYKVPWN-FLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKK 1989 T+ + S ++ PW+ +L + G + SK + + ++ Sbjct: 449 LTESNGGGSGSIADDSISDPWSGYLGFVQGSRGTSVRSPSG-----LSSKAMSTMVEGER 503 Query: 1990 RKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDG 2169 ++ + KL S KRRA G++ +P+PRDRQ IQDR+KELRE+VPN KCSID Sbjct: 504 KEVFSCSHSKKLIEPSKLTKRRAKPGESCRPRPRDRQQIQDRVKELREIVPNGAKCSIDA 563 Query: 2170 LLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGH--QNGTSWAVELGSE 2343 LL++TI HM+FL NVT ADKL+ + K D K +RP V + Q G SWA++LGS+ Sbjct: 564 LLERTIKHMIFLRNVTSHADKLK--LCSKVADNK-QRPLLVGRSNSDQRGASWALDLGSQ 620 Query: 2344 QQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFV 2523 +CP+VV++L+HPGHML+EMLC + G FLEIA VI L LTI++G+ME AD WA FV Sbjct: 621 TGVCPVVVENLDHPGHMLVEMLCEEDGLFLEIAQVIRNLGLTIIKGLMEARADKFWAHFV 680 Query: 2524 VETSGSFHRLDIFWPLMQLLQ 2586 VE R+D+ W LMQLLQ Sbjct: 681 VEGPRGIQRMDVLWQLMQLLQ 701 >gb|ESW12469.1| hypothetical protein PHAVU_008G115700g [Phaseolus vulgaris] Length = 679 Score = 308 bits (790), Expect = 6e-81 Identities = 242/737 (32%), Positives = 350/737 (47%), Gaps = 19/737 (2%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIEDLYL 630 M + + L+ C ++ W YAVFWKL H M L WE+G+ ++++ S +D Sbjct: 1 MEATSITSLLKGFCDHTQWKYAVFWKLNHHFPMNLTWENGYQKGNEVKE---SMWDDFNF 57 Query: 631 GNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDNI 810 + ++ S+ S G+ V L + EM A + W+ ++I Sbjct: 58 KSPHELYSSRGEST--DYSGDYSVRLLMIEMSHRKYNFGEGVVGKVALARDHCWVSCEDI 115 Query: 811 ETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDK 990 T F+T L+PE DEWLLQ A GIKTI+L+PV+P GVLQ GS E VAE+ V VKDK Sbjct: 116 LTGKFDTDLIPECHDEWLLQIACGIKTIVLVPVLPLGVLQFGSFEEVAEDLEFVTNVKDK 175 Query: 991 F------EAHKKPDGYDLRYSVQQFS-LVSTFLENLEEPSTVTTEKVNEDQNATHAVRTK 1149 EA+ P Y FS L+ +++L+E S+VT + + + + A+ + Sbjct: 176 VQSIDCTEANINPFNMRTDYQDWSFSDLMHNLMDSLDESSSVTKTILKSEVSTSTALHNE 235 Query: 1150 DYNLIANQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLSYEDNQ 1329 + + N M F +QD S + + + + N+ L M + Sbjct: 236 NGSRRLNPTMLSF-IQDDCCVSRQDLLKSMKRENVNEIGSSSLDMSTVSR---------- 284 Query: 1330 SFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDSG 1509 I K + + EE S F+ S + +N G F G T+ G D Sbjct: 285 -HIGKMETKPNHMEEEMWSWSVFEEMSNGLDSFSVNNMTGKQF--GGTES---GYDDAKN 338 Query: 1510 ICESGGNFFSFPGDYELNKILGT---SVEDNTYPYTYGTSISGHDLACHSSGDRETSYSM 1680 I N F+FP + EL+K LG+ SV D TY TS C + +E + Sbjct: 339 I-----NDFNFPSESELHKALGSVAYSVGD-----TYHTS-------CLITNKKENDHIK 381 Query: 1681 DXXXXXXXXXXXXXXA---NASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQS---KQ 1842 A N S DD SS + I S E GS + + S K Sbjct: 382 GFELPEDLDPENLLDAVFGNLCSSADDTSSISNSIRSLTTMPTEISGSIQPKNNSDVKKD 441 Query: 1843 SAPVEAYKVPWNF---LTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQK 2013 K + F TS F G L D+ ++ K D Sbjct: 442 LVAAVTAKRKYEFSNPFTSSFDGNG------------------SLLIDEVQQEKEDDHML 483 Query: 2014 PGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINH 2193 P +LS+T+K+R +NQK +PRDRQLI DR+KELRELVP+ +CSID LL++TI H Sbjct: 484 PISGPKLSSTHKKRTRVANNQKARPRDRQLIMDRMKELRELVPDGGRCSIDNLLERTIKH 543 Query: 2194 MLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKD 2373 ML+L +T +A+KL+ + + E R+ ++N H G S A + SE PIV++D Sbjct: 544 MLYLRKITSQAEKLK-RFANRTVAESKRQ--KINGSHP-GRSCAFDFESELAW-PIVIED 598 Query: 2374 LNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRL 2553 L GHMLIEM+C +HG FLEIA VI +L++TIL+G++E + +SWA F+VE FHR+ Sbjct: 599 LECTGHMLIEMICNEHGLFLEIAQVIRKLEVTILKGILENRSSDSWACFIVEVPRGFHRM 658 Query: 2554 DIFWPLMQLLQQSRAPI 2604 D+ PL+ LLQ R PI Sbjct: 659 DVLCPLLHLLQLKRNPI 675 >ref|XP_003551499.1| PREDICTED: transcription factor LHW-like [Glycine max] Length = 698 Score = 293 bits (750), Expect = 3e-76 Identities = 229/749 (30%), Positives = 335/749 (44%), Gaps = 31/749 (4%) Frame = +1 Query: 451 MGTSFLRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPI-EDLY 627 M + + L+ C ++ W YA FWKL M L WE+G+ + RD V + DL Sbjct: 1 MDATSIMHLLKGFCDHTQWKYAGFWKLDQHFPMTLTWENGY----QKRDEVKESMWGDLS 56 Query: 628 LGNSDKILSTFSSSVLDGTPGECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNTSWIYSDN 807 + D++ S+ G + L + EM A + W+ ++ Sbjct: 57 FKSPDELYSS------SGENSDYSARLLLIEMSHRKYSLGEGVVGKIALARDHCWVSYED 110 Query: 808 IETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKD 987 I T F+T L+ E PDEWLLQFA GIKTI+L+PV+P GVLQ GS E VAE+ V +K+ Sbjct: 111 ILTSKFDTDLITECPDEWLLQFACGIKTIVLVPVLPQGVLQFGSFEAVAEDKEFVTNIKE 170 Query: 988 KF------EAHKKP-----DGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVNEDQNATH 1134 KF EA P D D+ +S L+ + +L+E S+ T+ + + + +T Sbjct: 171 KFYSTHYLEADITPLNLGTDCQDVSFS----DLMHNLMGSLDESSSSVTKSILKSEVSTS 226 Query: 1135 AVRTKDYNLIANQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPFQLS 1314 N M F +QD S + ++ + EN+ M Sbjct: 227 PAALNSNGSRLNPTMLSF-IQDDCFFSRENLLESLKRENENEIGSSSTEM---------- 275 Query: 1315 YEDNQSFISKNDISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGR 1494 I K + + EE S S +N G F S G Sbjct: 276 ----PRHIGKVETKPNHMEEIWSWSHLLNNV-----GVFREMSNGLDSSSVINTTQKQLG 326 Query: 1495 DFDSGICESGGNFFSFPGDYELNKILGT--------------SVEDNTYPYTYGTSISGH 1632 ++G N F+FP + E K LG+ SVE+ T + H Sbjct: 327 GIETGHDAKNVNDFAFPSESEFRKALGSVSYGETGKFMSKCISVEETYSNSTLVINKKEH 386 Query: 1633 DLACHSSG-----DRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNVSL 1797 D H G D + Y +D N D SS + + S Sbjct: 387 D---HIKGLEFPKDVDLEYLLDAVV-----------GNFCGAAADTSSISNSVRSLTTMP 432 Query: 1798 REHLGSTKKHDQSKQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSD 1977 E S + + S++S + N L G + L D Sbjct: 433 TEFTSSIQPENYSEESTLIVDSSDVKNDLMPAIMVKG--KDEFSNHFTSSFDGNASLLID 490 Query: 1978 KQKKRKGYDSQKPGKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKC 2157 + ++ K +P +LS+++K+R G+NQK +PRDRQLI DR+KELRELVP +C Sbjct: 491 EAQQEKANSHMQPIGGPKLSSSSKKRTRVGNNQKSRPRDRQLIMDRMKELRELVPEGGRC 550 Query: 2158 SIDGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELG 2337 SID LL++TI HML+L +T +A+KL+ ++ + E R+ +H G S A + Sbjct: 551 SIDNLLERTIKHMLYLRKITSQAEKLK-RIANRAVPECKRQKVNASHP---GRSCAFDFE 606 Query: 2338 SEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWAS 2517 SE PIV++DL GHMLIEM+C +HG FLEIA VI +L +TIL+G++E + NSWA Sbjct: 607 SEVSW-PIVIEDLECSGHMLIEMICNEHGLFLEIAQVIRKLDVTILKGILENCSSNSWAC 665 Query: 2518 FVVETSGSFHRLDIFWPLMQLLQQSRAPI 2604 F+VE FHR+D+ PL+ LLQ R P+ Sbjct: 666 FIVEVPRGFHRMDVLCPLLHLLQLRRNPV 694 >ref|XP_004137928.1| PREDICTED: uncharacterized protein LOC101203710 [Cucumis sativus] gi|449524685|ref|XP_004169352.1| PREDICTED: uncharacterized LOC101203710 [Cucumis sativus] Length = 565 Score = 286 bits (731), Expect = 4e-74 Identities = 212/604 (35%), Positives = 300/604 (49%), Gaps = 19/604 (3%) Frame = +1 Query: 850 PDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAALVAYVKDKFEAHKKPDGYDLR 1029 P EW++Q+A+GIKTILL+P++P GVLQLGS++ V EN ++VAY+KD+F DG Sbjct: 5 PTEWIIQYASGIKTILLVPLLPFGVLQLGSLQMVTENLSVVAYIKDRFNDINFVDGDACA 64 Query: 1030 YSVQQFSLVSTFLENLEEPSTVTTEKVN-EDQNATHAVRTKDYNLIANQMMPVFMVQDFA 1206 S+V E+L+E + TT + E+ A H ++ NQ + + QD Sbjct: 65 ------SVVPRPFESLDEQTNFTTYMLEAENHGAIHDIKPPVSTF--NQCVTI---QDVL 113 Query: 1207 NASVRCVADT--SENVTENDFSRQPLSMIHLDEPFQLSYEDNQ----SFISKNDI----S 1356 S R +T E ++D R +M L P S + FIS + S Sbjct: 114 TVSRRIRPETLHCEKGHKSDIHRT--NMEELFAPLYQSVSTGEVEFSDFISLESLLPLGS 171 Query: 1357 NYYHEET----SSPSSYFDNFSRRMYGEFMNESMGFHFEDGATDPTFLGRDFDSGICESG 1524 + ET S+P + + G+ ++ E G D Sbjct: 172 QLRNHETGLFESNPHIFHSYSLDNVVGQQSGHNLATKKEYGIAD---------------- 215 Query: 1525 GNFFSFPGDYELNKILGTSV--EDNTYPYTYGTSISGHDLACHSSGDRETSYSMDXXXXX 1698 NFFSFP D EL K LG + + +T ++Y S + D R+ Sbjct: 216 -NFFSFPDDCELQKALGPVLLAQKHTNEFSYDPSSTVKDNTSSMLCSRDLKEG------- 267 Query: 1699 XXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQSAPVEAYKVPWN 1878 +A I DD SN + + + ST + QS+ S V WN Sbjct: 268 DIEHLLEAMISAEDISDDTFSNNTINARIADLVAKPCLSTNTY-QSESSTIVVNDPALWN 326 Query: 1879 FLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKLSRLSTTNKRRA 2058 S +A G S +L +++ + D + K + S ++ R+ Sbjct: 327 IPESTTTATGRKNLTSL--------STSNSLVVNEREERDRDMAQHRKGMKRSNSS-RQI 377 Query: 2059 HTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLR 2238 N + +PRDRQLIQDR+KELR++VPN KCSIDGLL+KTI HML+L VT RA+KL+ Sbjct: 378 KVTSNTRQRPRDRQLIQDRIKELRQIVPNGGKCSIDGLLEKTIKHMLYLQRVTDRAEKLK 437 Query: 2239 DQVLEKDTDEKTRRPAEVNHGHQNGTSW--AVELGSEQQLCPIVVKDLNHPGHMLIEMLC 2412 ++D D + E NGTSW A ++GSE Q+CPIVV+DL + GHMLI+MLC Sbjct: 438 QLAQQEDFDSENCTDLENEGVQPNGTSWTWAFDIGSELQVCPIVVEDLEYQGHMLIKMLC 497 Query: 2413 TDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIFWPLMQLLQQS 2592 D G FLEI +I L LTIL+GV+E+ ++NSWA F+VE FHR+D+FWPLM LLQ+ Sbjct: 498 NDMGLFLEITQIIRNLDLTILKGVIERHSNNSWAYFIVEAPRGFHRMDVFWPLMHLLQRK 557 Query: 2593 RAPI 2604 R PI Sbjct: 558 RNPI 561 >gb|EOY07438.1| Basic helix-loop-helix DNA-binding superfamily protein, putative isoform 8 [Theobroma cacao] Length = 525 Score = 266 bits (679), Expect = 5e-68 Identities = 188/556 (33%), Positives = 277/556 (49%), Gaps = 4/556 (0%) Frame = +1 Query: 949 VAENAALVAYVKDKFEAHKKPDGYDLRYSVQQFSLVSTFLENLEEPSTVTTEKVN-EDQN 1125 V E+ + AY+KD+F D+ + L S+ LE LEE S+ + +N ED N Sbjct: 9 VPEDLSTPAYIKDRFSCK------DIHTQLPSL-LTSSLLEKLEESSSASISPLNSEDSN 61 Query: 1126 ATHAVRTKDYNLIANQMMPVFMVQDFANASVRCVADTSENVTENDFSRQPLSMIHLDEPF 1305 A ++ + F V + + + E+ EN S P+S+ + P Sbjct: 62 AVDGIKPLS-------IQNAFQVPEID------LPEVLESEGENKISVPPVSLSEVSSPL 108 Query: 1306 QLSYEDNQSFISKND---ISNYYHEETSSPSSYFDNFSRRMYGEFMNESMGFHFEDGATD 1476 S Q + +++ +S E ++P ++ ++ GE ++ + + Sbjct: 109 SQSINSYQLAMGESEMFGLSCIKEELWANPE--YNGYTVGECGEILDGVTYPYPASDLLE 166 Query: 1477 PTFLGRDFDSGICESGGNFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSG 1656 P F D + ++G F SFP D EL+K LG + E + Y + +S D+ Sbjct: 167 PPF----GDFSVYDAG--FLSFPKDCELHKALGPAFEKQSNEYFWESSFLTEDVFRDLFD 220 Query: 1657 DRETSYSMDXXXXXXXXXXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQS 1836 D E S++ + S + + S++ + ST + S Sbjct: 221 DIEPSFAKGGDAEYLLQAVVGHVYDGSVDIANRSNH-------------FMTSTGQLPVS 267 Query: 1837 KQSAPVEAYKVPWNFLTSEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKP 2016 + V +P + +TS + G +S + L+D + K + Sbjct: 268 IRPQSVMGDSIPVSRVTS--ALVGEAKNNSSSKTSASFKSTVSTLTDDKNLGKDCYYMQS 325 Query: 2017 GKLSRLSTTNKRRAHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHM 2196 K + S+ KRRA GDN +P+PRDRQ+IQDRLKELRELVPN +K SID LLD T+ HM Sbjct: 326 RKGQKQSSVTKRRARLGDNPRPRPRDRQMIQDRLKELRELVPNGDKHSIDALLDHTVKHM 385 Query: 2197 LFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDL 2376 +LS+VT +A+KL+ V + T K R +E +Q G SWA E+G E + CPIVV+DL Sbjct: 386 RYLSSVTNQAEKLKQWVHREVTVRKNMRSSESKDCYQMGASWAFEIGDELKACPIVVEDL 445 Query: 2377 NHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLD 2556 +PGH LIEMLC +H FLEIA VI LTIL+GVME ++N+WA F+VE S FHRLD Sbjct: 446 AYPGHFLIEMLCNEHCLFLEIAQVIRSFNLTILKGVMESCSNNTWAHFIVEASRGFHRLD 505 Query: 2557 IFWPLMQLLQQSRAPI 2604 IFWPLMQLLQ+ R PI Sbjct: 506 IFWPLMQLLQRQRNPI 521 >gb|EMJ06475.1| hypothetical protein PRUPE_ppa006504mg [Prunus persica] Length = 409 Score = 264 bits (674), Expect = 2e-67 Identities = 149/356 (41%), Positives = 207/356 (58%) Frame = +1 Query: 1528 NFFSFPGDYELNKILGTSVEDNTYPYTYGTSISGHDLACHSSGDRETSYSMDXXXXXXXX 1707 +FFSFP + EL+K LGT+ + T + + +SIS D C SSG ++ Sbjct: 68 SFFSFPENCELHKALGTTFQRQTDEHLWNSSISIDD-TCSSSGLQKDFIRSIEPSRLSKG 126 Query: 1708 XXXXXXANASSILDDNSSNKSGITSSNVSLREHLGSTKKHDQSKQSAPVEAYKVPWNFLT 1887 + DD SS++S S ++ ++ + + + SAP E+ + WN + Sbjct: 127 SDAENLFESMVARDDTSSSRSDNIKSCMTTSSQFPASCEQLKFEASAPTESDSMTWNHAS 186 Query: 1888 SEFSAPGIXXXXXXXXXXXXVESKIGALSDKQKKRKGYDSQKPGKLSRLSTTNKRRAHTG 2067 + F + + L DK++ KGY S KP K + S + RR Sbjct: 187 ASF------------------KGTMSTLLDKEQLGKGYTSTKPKKEQKSSGASARRTRLS 228 Query: 2068 DNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLRDQV 2247 ++ K +PRDRQLIQDR+KELRELVPN KCSIDGLLD+TI HM++L +T +A+KL Sbjct: 229 NSPKLRPRDRQLIQDRVKELRELVPNGAKCSIDGLLDRTIKHMMYLRTMTDQAEKLGCYA 288 Query: 2248 LEKDTDEKTRRPAEVNHGHQNGTSWAVELGSEQQLCPIVVKDLNHPGHMLIEMLCTDHGR 2427 ++ ++ +E G QNGTS E+GSE Q+CPIVV+DL HPGHMLIEMLC +HG Sbjct: 289 HQEVP--RSNNMSEAKIGGQNGTSRGFEIGSELQICPIVVEDLQHPGHMLIEMLCDEHGL 346 Query: 2428 FLEIADVIHRLKLTILRGVMEKSADNSWASFVVETSGSFHRLDIFWPLMQLLQQSR 2595 FL+IA I RL+LTIL+GVME + N WA F+VE FHR+D+FWPL+ LLQ+ R Sbjct: 347 FLDIAQAIRRLELTILKGVMETRSSNMWAHFIVEAPRGFHRMDVFWPLLHLLQRRR 402 >gb|EPS66447.1| prf interactor 30137, partial [Genlisea aurea] Length = 170 Score = 206 bits (523), Expect = 6e-50 Identities = 111/178 (62%), Positives = 130/178 (73%), Gaps = 10/178 (5%) Frame = +1 Query: 2026 SRLSTTNKRR----AHTGDNQKPKPRDRQLIQDRLKELRELVPNSEKCSIDGLLDKTINH 2193 SRLS +NKR+ + +NQ+P+PRDRQLIQDR+KELRELVPNSEKCSIDGLLDKTI H Sbjct: 1 SRLSRSNKRKDCSSSSDNNNQRPRPRDRQLIQDRIKELRELVPNSEKCSIDGLLDKTIKH 60 Query: 2194 MLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHGHQNGTSWAVELG-----SEQQLCP 2358 MLFL NV+ RADKLR ++ D+ ++G SWAVELG S+ Q CP Sbjct: 61 MLFLRNVSDRADKLRKHCCFQEADKMM--------VEESGASWAVELGSSSSSSDAQTCP 112 Query: 2359 IVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGVMEKSAD-NSWASFVVE 2529 IVVKDL+ PG MLIEMLC DHGRFLEI + IHRL+LT+L GVMEKS SWA F+VE Sbjct: 113 IVVKDLDQPGQMLIEMLCCDHGRFLEITNAIHRLQLTVLHGVMEKSPPRESWARFIVE 170 >ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] gi|568851769|ref|XP_006479559.1| PREDICTED: transcription factor EMB1444-like [Citrus sinensis] gi|557546129|gb|ESR57107.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] Length = 714 Score = 201 bits (512), Expect = 1e-48 Identities = 108/216 (50%), Positives = 138/216 (63%), Gaps = 11/216 (5%) Frame = +1 Query: 1972 SDKQKKRKGYDSQKPGKLSRL-------STTNKRRAHTGDNQKPKPRDRQLIQDRLKELR 2130 S Q KG+ S P S + NK+RA TG+N +P+PRDRQLIQDR+KELR Sbjct: 498 SSSQMSSKGFSSTCPSTCSEQLDMSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKELR 557 Query: 2131 ELVPNSEKCSIDGLLDKTINHMLFLSNVTKRADKLRDQVLEKDTDEKTRRPAEVNHG--H 2304 ELVPN KCSID LL++TI HMLFL ++TK ADK L K + K + HG + Sbjct: 558 ELVPNGSKCSIDSLLERTIKHMLFLQSITKHADK-----LSKCAESKMHQKGNGIHGSNY 612 Query: 2305 QNGTSWAVELGSEQQLCPIVVKDLNHPGHMLIEMLCTDHGRFLEIADVIHRLKLTILRGV 2484 + G+SWAVE+GS ++C IVV++LN G ML+EMLC + FLEIA+ I L LTIL+GV Sbjct: 613 EQGSSWAVEMGSHLKVCSIVVENLNKNGQMLVEMLCEECSHFLEIAEAIRSLGLTILKGV 672 Query: 2485 MEKSADNSWASFVVETSGS--FHRLDIFWPLMQLLQ 2586 E D +W FVVE + HR+D+ W L+QLLQ Sbjct: 673 TEAHGDKTWICFVVEGQDNRIMHRMDVLWSLVQLLQ 708 Score = 124 bits (312), Expect = 2e-25 Identities = 74/191 (38%), Positives = 102/191 (53%), Gaps = 8/191 (4%) Frame = +1 Query: 451 MGTSF----LRPFLQSLCCNSPWNYAVFWKLKHQHEMVLVWEDGFCDILKLRDPVVSPIE 618 MGTS L L+SLC N+ W YAVFWKLKH+ MVL WEDG+ D Sbjct: 1 MGTSSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYD------------- 47 Query: 619 DLYLGNSDKILSTFSSSVLDGTPG----ECPVGLAVAEMXXXXXXXXXXXXXXAAFTGNT 786 G D + + SS L+ G P+GLAVA+M A TG Sbjct: 48 --NCGQQDSLENKCSSESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKH 105 Query: 787 SWIYSDNIETDVFNTVLVPEYPDEWLLQFAAGIKTILLLPVIPHGVLQLGSVETVAENAA 966 WI+SD + T+ ++ E+ D W QF+AGI+TI ++ V+PHGV+QLGS++ V E+ Sbjct: 106 QWIFSDQLVTNSCSSF---EFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMK 162 Query: 967 LVAYVKDKFEA 999 +V +++D F A Sbjct: 163 VVTHIRDVFAA 173