BLASTX nr result
ID: Akebia23_contig00005288
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00005288
         (2600 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CBI37092.3| unnamed protein product [Vitis vinifera]             438   e-126
ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-...  438   e-120
ref|XP_007050336.1| Basic helix-loop-helix-containing protein, p...  408   e-111
ref|XP_007050337.1| Basic helix-loop-helix DNA-binding superfami...  405   e-110
ref|XP_002532375.1| basic helix-loop-helix-containing protein, p...  374   e-100
ref|XP_007050338.1| Basic helix-loop-helix DNA-binding superfami...  365   6e-98
gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis]    365   7e-98
ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citr...  361   8e-97
ref|XP_007200308.1| hypothetical protein PRUPE_ppa001930mg [Prun...  343   2e-91
ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like...  327   3e-91
ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like...  314   1e-87
emb|CCX35476.1| hypothetical protein [Malus domestica]               327   2e-86
ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like...  323   3e-85
ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like...  316   3e-83
ref|XP_007144919.1| hypothetical protein PHAVU_007G194600g [Phas...  314   1e-82
ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citr...  313   3e-82
ref|XP_006383698.1| basic helix-loop-helix family protein [Popul...  312   4e-82
ref|XP_006351644.1| PREDICTED: transcription factor bHLH155-like...  305   7e-80
ref|XP_003553489.1| PREDICTED: transcription factor EMB1444-like... 
296 3e-77 ref|XP_006576937.1| PREDICTED: transcription factor EMB1444-like... 289 5e-75 >emb|CBI37092.3| unnamed protein product [Vitis vinifera] Length = 774 Score = 438 bits (1126), Expect(2) = e-126 Identities = 267/638 (41%), Positives = 361/638 (56%), Gaps = 78/638 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T LQ+TLR+LCFNTEWKYA FWKL+H M+LT EDAYY+N + D CF + + Sbjct: 28 TDLQQTLRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPLEDKCFSKTPDTL 87 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 H+GHY D LGLAVAKMSY VYSLGEGI+GQVA+TGKHQWIF DK S SS E+CD W Sbjct: 88 HDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDKHTTNSSSSFEYCDGW 147 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q QFS+GIKT QLGSL V+ED+K+V+ IKDVF ++++SS+ +IP+ +Q Sbjct: 148 QAQFSAGIKTIVVVAVVPHGVVQLGSLQQVVEDLKLVSRIKDVFFALQDSSVAYIPHPIQ 207 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 +++S+L S IS + S +I D + ++ I E+ N+ S + ND S + Sbjct: 208 CSMKSSLAMSDISTRGSASDIVPDSLFNLDKGIHKERPNVWSPMFPIFGKHND-SSFIFQ 266 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 LP++HQ +A + NKD G+E ++S+ DES LQ +SE ++ Q L N Sbjct: 267 LPAIHQNRAVNMFNKDGGLELSSSQSDEST--KFLQPRSENFVLEGQKQVQMKLISNTKR 324 Query: 1801 EENRGWMEMAAGSKHSDANTPYNFPLENVD--NFILPAYTSGIGYPFCPLGVLDSTVCDR 1974 EE GW + S+H+D + PYN +EN++ + L A S + + P G DS C+R Sbjct: 325 EEASGWRDADVSSEHNDTSYPYNSFMENINSCSTALAADKSQVDFACFPFGFFDSVDCNR 384 Query: 1975 VEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLE--PEVSCIDSVNTSLKFSSGCELHEAL 2148 ++ V + L +P +MQL K LE+ LE E+S +D+ TSL+FS+G ELHEAL Sbjct: 385 IKLHGVNCHENGVLHLPDPSDMQLQKNLEKKLEFPSELSHVDTSYTSLRFSAGSELHEAL 444 Query: 2149 GSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHL--------------V 2286 G F K+ + C W ++E E+E + PE M SQLT++ G ++L V Sbjct: 445 GPAFLKQSNYCDW-ETEKAETETTIELPEGMSSSQLTSDSGSENLLEAVVAKVCQSGSDV 503 Query: 2287 RSSKELSSNTQSMCSNQLEKHGEPTKIN-------------------------------- 2370 +S K + QS+ + EK 
EP+ Sbjct: 504 KSEKSFCQSMQSLLTT--EKIPEPSSHTIHTVTSAGYSIDQSSLVEETQNCFKSSEVCGV 561 Query: 2371 ------------------RKRARPSE--SCRARP--------RDRQLIEDRVKELRELVP 2466 + A PS+ RARP RDRQLI+DR+KELRELVP Sbjct: 562 TSQQGISSICPSSCSEQLERSAEPSKVNKKRARPGESCRPRPRDRQLIQDRIKELRELVP 621 Query: 2467 NGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 NGSK SIDSLLERTIKHMLFL+ +T+HA K+ KCAESK Sbjct: 622 NGSKCSIDSLLERTIKHMLFLQSITRHADKLNKCAESK 659 Score = 44.3 bits (103), Expect(2) = e-126 Identities = 21/24 (87%), Positives = 22/24 (91%) Frame = +3 Query: 726 MDQLLLPTSGPPIKRRAGLRRKQA 797 MD+LLLPT GPPIKRRAGLR KQA Sbjct: 1 MDRLLLPTVGPPIKRRAGLRIKQA 24 >ref|XP_003632423.1| PREDICTED: uncharacterized basic helix-loop-helix protein At1g06150-like [Vitis vinifera] Length = 749 Score = 438 bits (1126), Expect = e-120 Identities = 267/638 (41%), Positives = 361/638 (56%), Gaps = 78/638 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T LQ+TLR+LCFNTEWKYA FWKL+H M+LT EDAYY+N + D CF + + Sbjct: 3 TDLQQTLRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPLEDKCFSKTPDTL 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 H+GHY D LGLAVAKMSY VYSLGEGI+GQVA+TGKHQWIF DK S SS E+CD W Sbjct: 63 HDGHYSHDALGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDKHTTNSSSSFEYCDGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q QFS+GIKT QLGSL V+ED+K+V+ IKDVF ++++SS+ +IP+ +Q Sbjct: 123 QAQFSAGIKTIVVVAVVPHGVVQLGSLQQVVEDLKLVSRIKDVFFALQDSSVAYIPHPIQ 182 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 +++S+L S IS + S +I D + ++ I E+ N+ S + ND S + Sbjct: 183 CSMKSSLAMSDISTRGSASDIVPDSLFNLDKGIHKERPNVWSPMFPIFGKHND-SSFIFQ 241 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 LP++HQ +A + NKD G+E ++S+ DES LQ +SE ++ Q L N Sbjct: 242 LPAIHQNRAVNMFNKDGGLELSSSQSDEST--KFLQPRSENFVLEGQKQVQMKLISNTKR 299 Query: 1801 EENRGWMEMAAGSKHSDANTPYNFPLENVD--NFILPAYTSGIGYPFCPLGVLDSTVCDR 1974 EE GW + S+H+D + PYN +EN++ + L A S + + 
P G DS C+R Sbjct: 300 EEASGWRDADVSSEHNDTSYPYNSFMENINSCSTALAADKSQVDFACFPFGFFDSVDCNR 359 Query: 1975 VEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLE--PEVSCIDSVNTSLKFSSGCELHEAL 2148 ++ V + L +P +MQL K LE+ LE E+S +D+ TSL+FS+G ELHEAL Sbjct: 360 IKLHGVNCHENGVLHLPDPSDMQLQKNLEKKLEFPSELSHVDTSYTSLRFSAGSELHEAL 419 Query: 2149 GSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHL--------------V 2286 G F K+ + C W ++E E+E + PE M SQLT++ G ++L V Sbjct: 420 GPAFLKQSNYCDW-ETEKAETETTIELPEGMSSSQLTSDSGSENLLEAVVAKVCQSGSDV 478 Query: 2287 RSSKELSSNTQSMCSNQLEKHGEPTKIN-------------------------------- 2370 +S K + QS+ + EK EP+ Sbjct: 479 KSEKSFCQSMQSLLTT--EKIPEPSSHTIHTVTSAGYSIDQSSLVEETQNCFKSSEVCGV 536 Query: 2371 ------------------RKRARPSE--SCRARP--------RDRQLIEDRVKELRELVP 2466 + A PS+ RARP RDRQLI+DR+KELRELVP Sbjct: 537 TSQQGISSICPSSCSEQLERSAEPSKVNKKRARPGESCRPRPRDRQLIQDRIKELRELVP 596 Query: 2467 NGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 NGSK SIDSLLERTIKHMLFL+ +T+HA K+ KCAESK Sbjct: 597 NGSKCSIDSLLERTIKHMLFLQSITRHADKLNKCAESK 634 >ref|XP_007050336.1| Basic helix-loop-helix-containing protein, putative isoform 1 [Theobroma cacao] gi|508702597|gb|EOX94493.1| Basic helix-loop-helix-containing protein, putative isoform 1 [Theobroma cacao] Length = 708 Score = 408 bits (1049), Expect = e-111 Identities = 253/616 (41%), Positives = 350/616 (56%), Gaps = 49/616 (7%) Frame = +1 Query: 880 MGEEMDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCF 1059 M + L + LR+LC NTEWKYA FWKL+H M+LT EDAYY+N + D S + CF Sbjct: 1 MASSTSSSGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCF 60 Query: 1060 HVSRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSS 1239 H + N+ G+ DPLGLAVAKMSY VYSLGEGI+GQVA++GKHQWIF DK S S Sbjct: 61 HHTLDNLQSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSL 120 Query: 1240 SEFCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSME 1419 EFCD WQ+QF++GI+T QLGSLN V ED+K+V+HI+DVF ++++SS+ Sbjct: 121 FEFCDGWQSQFAAGIRTIVVVAVVQHGVVQLGSLNKVFEDVKLVSHIRDVFFALQDSSVG 180 Query: 1420 
FIPYRLQHTVQSTLHPSKISAK---SSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRI 1590 I ++ +++S+L + K S G+ + E +A+ E ++ R + S R+ Sbjct: 181 HIASPIECSMKSSLFQLDLPTKLLDSDGIPLDKTVDEQGPDALLPEFSHPRKY---SDRL 237 Query: 1591 FNDLSCDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQE 1770 F VLPL + H + A V NK G+E +++R+DESA LL +S + N++ NQ Sbjct: 238 F------VLPLSNNHPKGAVEVENKHEGLELSSARNDESA--KLLTPRSNVSNLEHQNQL 289 Query: 1771 VKLLYKNNCGE-ENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLG 1947 ++L N + EN GW + ++ AN P + G+ + + Sbjct: 290 GRILINNGVWKGENSGWKNSSLVPENVYANNP-----------VGGRERYGVDHAYFSSN 338 Query: 1948 VLDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSS 2124 L+S D V+ S+ + L E +M+ K L++ + E+S +D +NTSLKFS Sbjct: 339 FLNSAHSDTVKSSSLSSYPNEVLDIPESSDMKFQKDLKKLGNQNEISHLDPMNTSLKFSV 398 Query: 2125 GCELHEALGSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHLVR----- 2289 GCEL+EALG F ++ W ++E+ E+ + + PE M SQLT E G ++L+ Sbjct: 399 GCELYEALGPAFIRKSIYADW-QAENMEAGGNIEMPEGMSSSQLTFESGSENLLEAVVAN 457 Query: 2290 ---------------------------------------SSKELSSNTQSMCSNQLEKHG 2352 SSK SS S CS Q E+ Sbjct: 458 VCHSGSDIKAERSSCRSAPSLLTTGNTPEPSSQKLCGAMSSKGFSSTCPSNCSEQFERSS 517 Query: 2353 EPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSIDSLLERTIKHMLFLK 2532 EP K N+KRARP E+ R RPRDRQLI+DR+KELRELVPNG+K SIDSLLERTIKHM+FL+ Sbjct: 518 EPAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVFLQ 577 Query: 2533 IVTKHAKKMKKCAESK 2580 +TKHA K+ KCAESK Sbjct: 578 GITKHADKLSKCAESK 593 >ref|XP_007050337.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 2 [Theobroma cacao] gi|508702598|gb|EOX94494.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 2 [Theobroma cacao] Length = 709 Score = 405 bits (1040), Expect = e-110 Identities = 253/617 (41%), Positives = 351/617 (56%), Gaps = 50/617 (8%) Frame = +1 Query: 880 MGEEMDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCF 1059 M + L + LR+LC NTEWKYA FWKL+H M+LT EDAYY+N + D S + CF Sbjct: 1 MASSTSSSGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCF 
60 Query: 1060 HVSRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSS 1239 H + N+ G+ DPLGLAVAKMSY VYSLGEGI+GQVA++GKHQWIF DK S S Sbjct: 61 HHTLDNLQSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSL 120 Query: 1240 SEFCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVI-EDMKVVTHIKDVFHSIENSSM 1416 EFCD WQ+QF++GI+T QLGSLN V+ ED+K+V+HI+DVF ++++SS+ Sbjct: 121 FEFCDGWQSQFAAGIRTIVVVAVVQHGVVQLGSLNKVVFEDVKLVSHIRDVFFALQDSSV 180 Query: 1417 EFIPYRLQHTVQSTLHPSKISAK---SSGLEIFDDFPESSNEAISNEKANIRSHLLQSLR 1587 I ++ +++S+L + K S G+ + E +A+ E ++ R + S R Sbjct: 181 GHIASPIECSMKSSLFQLDLPTKLLDSDGIPLDKTVDEQGPDALLPEFSHPRKY---SDR 237 Query: 1588 IFNDLSCDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQ 1767 +F VLPL + H + A V NK G+E +++R+DESA LL +S + N++ NQ Sbjct: 238 LF------VLPLSNNHPKGAVEVENKHEGLELSSARNDESA--KLLTPRSNVSNLEHQNQ 289 Query: 1768 EVKLLYKNNCGE-ENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPL 1944 ++L N + EN GW + ++ AN P + G+ + + Sbjct: 290 LGRILINNGVWKGENSGWKNSSLVPENVYANNP-----------VGGRERYGVDHAYFSS 338 Query: 1945 GVLDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFS 2121 L+S D V+ S+ + L E +M+ K L++ + E+S +D +NTSLKFS Sbjct: 339 NFLNSAHSDTVKSSSLSSYPNEVLDIPESSDMKFQKDLKKLGNQNEISHLDPMNTSLKFS 398 Query: 2122 SGCELHEALGSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHLVR---- 2289 GCEL+EALG F ++ W ++E+ E+ + + PE M SQLT E G ++L+ Sbjct: 399 VGCELYEALGPAFIRKSIYADW-QAENMEAGGNIEMPEGMSSSQLTFESGSENLLEAVVA 457 Query: 2290 ----------------------------------------SSKELSSNTQSMCSNQLEKH 2349 SSK SS S CS Q E+ Sbjct: 458 NVCHSGSDIKAERSSCRSAPSLLTTGNTPEPSSQKLCGAMSSKGFSSTCPSNCSEQFERS 517 Query: 2350 GEPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSIDSLLERTIKHMLFL 2529 EP K N+KRARP E+ R RPRDRQLI+DR+KELRELVPNG+K SIDSLLERTIKHM+FL Sbjct: 518 SEPAKNNKKRARPGENPRPRPRDRQLIQDRIKELRELVPNGAKCSIDSLLERTIKHMVFL 577 Query: 2530 KIVTKHAKKMKKCAESK 2580 + +TKHA K+ KCAESK Sbjct: 578 QGITKHADKLSKCAESK 594 >ref|XP_002532375.1| basic helix-loop-helix-containing protein, putative [Ricinus communis] gi|223527931|gb|EEF30018.1| basic 
helix-loop-helix-containing protein, putative [Ricinus communis] Length = 749 Score = 374 bits (959), Expect = e-100 Identities = 261/648 (40%), Positives = 351/648 (54%), Gaps = 88/648 (13%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T L TLR+LCFNT+WKYA FWKL+H T M+LT EDAYYNN E D + CF + +N+ Sbjct: 3 TDLHNTLRSLCFNTDWKYAVFWKLKHRTRMVLTWEDAYYNNCEQHDLLENKCFGETFENL 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 G Y DP+GLAVAKMSY VYSLGEGI+GQVA+TGKH+WI DK S SS EF D W Sbjct: 63 CGGRYSNDPVGLAVAKMSYHVYSLGEGIVGQVAVTGKHRWIVADKHVTNSISSFEFSDGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QFS+GI+T QLGSLN V EDMK+V HIKDVF S+++SS+E I LQ Sbjct: 123 QSQFSAGIRTIIVVAVVPHGVVQLGSLNKVAEDMKLVNHIKDVFSSLQDSSVEQISIPLQ 182 Query: 1441 HTVQSTLHPSKISAKSSGLE--IFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDV 1614 ++++++L+ + +S E + D + ++A N +S + L+ +D S Sbjct: 183 YSMKTSLYLPDVPTQSLDSESVVIPDNLCNLDKAADKGPYN-QSTMFPYLQKQSDDSY-F 240 Query: 1615 LPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQ-EVKLLYKN 1791 LP +HQ+ A +VNK G S + + LLQ +S I ++Q NQ + L+ + Sbjct: 241 YSLPGIHQKTAVELVNKYGG--GGLSLPVNISSVKLLQPRSNISYLEQHNQVGINLVVDH 298 Query: 1792 NCGEENRGWMEMAAGSK-----HSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLD 1956 CG + W + GS+ H D + N N+ + ILP G P+ +LD Sbjct: 299 TCGGKTSVWKDPGRGSELNVTPHLDNSVKDNI---NLCDVILPDQKFGADPANFPMDLLD 355 Query: 1957 STVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLEPEV--SCIDSVNTSLKFSSGC 2130 STVCDR + D + + L E + L K LE+ LE + S ++S +T LKFS+GC Sbjct: 356 STVCDRHKSDE-IDILNGALDMPESSSIDLKKHLEKKLEYQAGSSHLESSSTFLKFSAGC 414 Query: 2131 ELHEALGSTFKKEKDICAWSKSED--TESEIHSKSPERMVDSQLTAECGLDHL------- 2283 ELHEALG F K C + E+ TES + PE + SQ+T + G ++L Sbjct: 415 ELHEALGPAFSKG---CLYFDCEEGKTESADIIEVPEGISTSQMTFDTGSENLLDAVVGN 471 Query: 2284 --------VRSSKELSSNTQSMCSNQLEKHGEPT------------KINR---------- 2373 V+ K + + QS+ + EK EP+ INR Sbjct: 472 VCYSGSTDVKREKSVCKSAQSLLTT--EKMPEPSFQAKHITHSAGYSINRQSVVQNDTHN 529 Query: 2374 
-----------------------------KRARPS----------ESCRARPRDRQLIED 2436 +R+ P+ E+CR RPRDRQLI+D Sbjct: 530 CSSSTGVRGATSSNGYSSNCPSTCSEQLDRRSEPAEKNKKRARPGENCRPRPRDRQLIQD 589 Query: 2437 RVKELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 R+KELRELVPNG+K SIDSLLERTIKHMLFL+ +TKHA K+ KCAESK Sbjct: 590 RIKELRELVPNGAKCSIDSLLERTIKHMLFLESITKHADKLNKCAESK 637 >ref|XP_007050338.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 3 [Theobroma cacao] gi|508702599|gb|EOX94495.1| Basic helix-loop-helix DNA-binding superfamily protein isoform 3 [Theobroma cacao] Length = 737 Score = 365 bits (937), Expect = 6e-98 Identities = 246/646 (38%), Positives = 344/646 (53%), Gaps = 79/646 (12%) Frame = +1 Query: 880 MGEEMDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCF 1059 M + L + LR+LC NTEWKYA FWKL+H M+LT EDAYY+N + D S + CF Sbjct: 1 MASSTSSSGLHQILRSLCLNTEWKYAVFWKLKHRARMVLTWEDAYYDNHDQHDPSENNCF 60 Query: 1060 HVSRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSS 1239 H + N+ G+ DPLGLAVAKMSY VYSLGEGI+GQVA++GKHQWIF DK S S Sbjct: 61 HHTLDNLQSGYCSHDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVNSSCSL 120 Query: 1240 SEFCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSME 1419 EFCD WQ+QF++GI+T QLGSLN V ED+K+V+HI+DVF ++++SS+ Sbjct: 121 FEFCDGWQSQFAAGIRTIVVVAVVQHGVVQLGSLNKVFEDVKLVSHIRDVFFALQDSSVG 180 Query: 1420 FIPYRLQHTVQSTLHPSKISAK---SSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRI 1590 I ++ +++S+L + K S G+ + E +A+ E ++ R + S R+ Sbjct: 181 HIASPIECSMKSSLFQLDLPTKLLDSDGIPLDKTVDEQGPDALLPEFSHPRKY---SDRL 237 Query: 1591 FNDLSCDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQE 1770 F VLPL + H + A V NK G+E +++R+DESA LL +S + N++ NQ Sbjct: 238 F------VLPLSNNHPKGAVEVENKHEGLELSSARNDESA--KLLTPRSNVSNLEHQNQL 289 Query: 1771 VKLLYKNNCGE-ENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLG 1947 ++L N + EN GW + ++ AN P + G+ + + Sbjct: 290 GRILINNGVWKGENSGWKNSSLVPENVYANNP-----------VGGRERYGVDHAYFSSN 338 Query: 1948 VLDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSS 2124 L+S D V+ S+ + L E +M+ K 
L++ + E+S +D +NTSLKFS Sbjct: 339 FLNSAHSDTVKSSSLSSYPNEVLDIPESSDMKFQKDLKKLGNQNEISHLDPMNTSLKFSV 398 Query: 2125 GCELHEALGSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHL------- 2283 GCEL+EALG F ++ W ++E+ E+ + + PE M SQLT E G ++L Sbjct: 399 GCELYEALGPAFIRKSIYADW-QAENMEAGGNIEMPEGMSSSQLTFESGSENLLEAVVAN 457 Query: 2284 --------------------------------------------VRSSKELSSNTQSMCS 2331 + S + NTQ C Sbjct: 458 VCHSGSDIKAERSSCRSAPSLLTTGNTPEPSSQSKHTINSAGYSINQSSLVEDNTQH-CL 516 Query: 2332 NQLEKHGE----------PTKINRKRARPSESC-----RARP--------RDRQLIEDRV 2442 N E G P+ + + R SE RARP RDRQLI+DR+ Sbjct: 517 NSSELCGAMSSKGFSSTCPSNCSEQFERSSEPAKNNKKRARPGENPRPRPRDRQLIQDRI 576 Query: 2443 KELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 KELRELVPNG+K SIDSLLERTIKHM+FL+ +TKHA K+ KCAESK Sbjct: 577 KELRELVPNGAKCSIDSLLERTIKHMVFLQGITKHADKLSKCAESK 622 >gb|EXC31934.1| hypothetical protein L484_009784 [Morus notabilis] Length = 750 Score = 365 bits (936), Expect = 7e-98 Identities = 251/652 (38%), Positives = 341/652 (52%), Gaps = 90/652 (13%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T LQ+ LR+LCFNTEWKYA FWKL+H M+LT EDAYY+ E D + + CF + Sbjct: 3 TDLQQILRSLCFNTEWKYAVFWKLKHRARMVLTWEDAYYDKSEQHDPAENKCFSKKLEKS 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSE-FCDV 1257 H+G Y DPLGLAVAK+SY VYSLGEGI+GQVA++GKHQWIF DK ++SS E + D Sbjct: 63 HDGLYSHDPLGLAVAKLSYHVYSLGEGIVGQVAVSGKHQWIFADKHKLSTYSSFEHYSDG 122 Query: 1258 WQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRL 1437 WQ QFS+GIKT QLGS N V+EDM++V HI+DVF S+++S + +P + Sbjct: 123 WQNQFSAGIKTIAVVAVVPHGVVQLGSFNEVLEDMELVNHIRDVFMSLQDSLVGHVPVPI 182 Query: 1438 QHTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVL 1617 Q +V S+++ I +KS E D + ++ ++ E +I + + D S VL Sbjct: 183 QSSVNSSVNLQDIPSKSFTSETVPDCLHNLDKTLNGEGPDIWFSIFPYVGKDGD-SPYVL 241 Query: 1618 PLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEIL---NVKQLNQEVKLLYK 1788 LP+ +Q KA VVNK G+E +T+ DESA LLQ ++ IL N K + ++ +K Sbjct: 242 
SLPNNYQEKAVDVVNKHGGLEFSTNGTDESA--KLLQSRTNILEHENHKVIGMNLRDNWK 299 Query: 1789 -----NNC-----GEENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFC 1938 ++C G N G GS D N P + +LPA + Sbjct: 300 CAGEIDSCKDAAVGPVNNG-NPFLCGSVMGDVNLP---------SIVLPAEKVEVDSAHF 349 Query: 1939 PLGVLDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLK 2115 G++ S VCDRV DSV Y Q L S P + K + + E+S ID+ +TSLK Sbjct: 350 SSGLVGSAVCDRVRLDSVDYYQNGVLHVSGPSNTKFQKDPDNLEFQTELSHIDTSSTSLK 409 Query: 2116 FSSGCELHEALGSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHL---- 2283 F +G ELHEALG F K W +E + + + PE+M QL A+ +HL Sbjct: 410 FPAGYELHEALGPAFLKNSKYFDWEATETEGTAL--EMPEQMSSRQLAADSHPEHLLEAV 467 Query: 2284 ----------VRSSKELSSNTQSMCSNQLEKHGEPT--------KINRKRARPS------ 2391 V+S K + QS+ S EK+ +P+ N +PS Sbjct: 468 IANVCQSHSDVKSEKSFCKSVQSLLST--EKYPKPSSHTTLITDSSNHSIGQPSVKGEDK 525 Query: 2392 ESC---------------------------------------RARP--------RDRQLI 2430 + C RARP RDRQLI Sbjct: 526 QHCLSSSGICGVMSPKGFSSTCPSASSEQLERSSVHNKNNKKRARPGENCRPRPRDRQLI 585 Query: 2431 EDRVKELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESKFC 2586 +DR+KELREL+PNG+K SIDSLLERTIKHML+L+ + KHA K+ K A++K C Sbjct: 586 QDRIKELRELIPNGAKCSIDSLLERTIKHMLYLQSIAKHADKLNKYADTKLC 637 >ref|XP_006443867.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] gi|568851769|ref|XP_006479559.1| PREDICTED: transcription factor EMB1444-like [Citrus sinensis] gi|557546129|gb|ESR57107.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] Length = 714 Score = 361 bits (927), Expect = 8e-97 Identities = 235/610 (38%), Positives = 327/610 (53%), Gaps = 43/610 (7%) Frame = +1 Query: 880 MGEEMDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCF 1059 MG L L++LCFNT WKYA FWKL+H T M+LT ED YY+N DS + C Sbjct: 1 MGTSSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCS 60 Query: 1060 HVSRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSS 1239 S +N H G Y DPLGLAVAKMSY VYSLGEGI+GQVA+TGKHQWIF D+ S SS Sbjct: 61 SESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSS 120 
Query: 1240 SEFCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSME 1419 EF D WQ+QFS+GI+T QLGSL+ V EDMKVVTHI+DVF ++ + S+ Sbjct: 121 FEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVG 180 Query: 1420 FIPYRLQHTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFND 1599 + +Q +V++TL + KS + + +E ++ +++ + + ND Sbjct: 181 HVSSTIQSSVKNTLSLPDLPTKS-----IPNRWHNLDEVVNRGGPDVQFPMFPYVEKHND 235 Query: 1600 LSCDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQ-EVK 1776 S + + DGVVN++ G+ +++ SA +L KS ++N+ NQ + Sbjct: 236 GS---YAFSGMQPKIGDGVVNRNEGILLSSAGGVGSA--KILHPKSNVINLDYQNQMGIH 290 Query: 1777 LLYKNNCGEENRGWMEMAAGSKHSDANTPYNFPLENVD--NFILPAYTSGIGYPFCPLGV 1950 + E+ GW ++ S+ + N +++++ + L A + Sbjct: 291 FISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNP 350 Query: 1951 LDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSSG 2127 L++ + ++V+ + Q L E +++ K LE+ + E++ +D SLKFS+ Sbjct: 351 LEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAV 410 Query: 2128 CELHEALGSTFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHLV------- 2286 ELHEALG F + KDI + E+T PE S L + G ++L+ Sbjct: 411 SELHEALGPAFLR-KDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASV 469 Query: 2287 --------------------------------RSSKELSSNTQSMCSNQLEKHGEPTKIN 2370 SSK SS S CS QL+ EP K N Sbjct: 470 CNSGSDVKSERTVCRSMQSLLTTEKKPESSSQMSSKGFSSTCPSTCSEQLDMSSEPAKNN 529 Query: 2371 RKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHA 2550 +KRAR E+ R RPRDRQLI+DR+KELRELVPNGSK SIDSLLERTIKHMLFL+ +TKHA Sbjct: 530 KKRARTGENGRPRPRDRQLIQDRIKELRELVPNGSKCSIDSLLERTIKHMLFLQSITKHA 589 Query: 2551 KKMKKCAESK 2580 K+ KCAESK Sbjct: 590 DKLSKCAESK 599 >ref|XP_007200308.1| hypothetical protein PRUPE_ppa001930mg [Prunus persica] gi|462395708|gb|EMJ01507.1| hypothetical protein PRUPE_ppa001930mg [Prunus persica] Length = 739 Score = 343 bits (881), Expect = 2e-91 Identities = 246/642 (38%), Positives = 335/642 (52%), Gaps = 79/642 (12%) Frame = +1 Query: 892 MDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSR 1071 M + 
L LR+LCFNTEW YA FWKL++ M+LT EDAYY+N E DSS + CF+ + Sbjct: 1 MGTSDLHHVLRSLCFNTEWNYAIFWKLKYRARMVLTWEDAYYDNCEQHDSSENRCFNKTL 60 Query: 1072 KNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFC 1251 +H+ HY DPLGLAVAKMSY VY+LGEGI+GQVA+T KHQWIF D + S ++C Sbjct: 61 DRLHDSHYSHDPLGLAVAKMSYHVYTLGEGIVGQVAVTRKHQWIFADNLFKNNCSPFQYC 120 Query: 1252 DVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPY 1431 D WQ+QFS+GI+T QLGSLN VIE++K+V+ I+DVF ++++S +E I Sbjct: 121 DGWQSQFSAGIRTIVVVAVPHGVV-QLGSLNKVIENVKLVSEIRDVFSTLQDSPVEQIRN 179 Query: 1432 RLQHTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKA-NIRSHLLQSLRIFNDLSC 1608 LQ + S+ + IS K + D + ++A + E++ ++ S + + +D S Sbjct: 180 PLQSGINSSACLTSISPKGLASGVITDCLHNLDKAANREESPDVWSSIFPHIGKDSDSSY 239 Query: 1609 DVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQE-VKLLY 1785 V PLP +KA + NK G+ES+ ESA L Q KS ILN + V+LL Sbjct: 240 -VFPLPENCLKKAVELANKHGGLESSNLGCLESAKLH--QSKSSILNSEHCKLVGVELLD 296 Query: 1786 KNNCGEENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTV 1965 + C E+ G + S + ENV N A S L+S Sbjct: 297 RTKCKGESSGCKDTRMASMIYSNPLSHGSVQENV-NLCDSADLSAT--------FLNSAA 347 Query: 1966 CDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSSG----- 2127 RV D V + Q + L+ SEP +++ K LE + + E +D+ +TS+ F +G Sbjct: 348 HGRVNVDRVDFYQNEVLQVSEPSDVKFQKDLENLDFQTESGHMDTSSTSMAFPAGCELHE 407 Query: 2128 -----------------------------------------CELH--EAL-------GST 2157 C+ H EA+ G+ Sbjct: 408 ALGPAFLNKGNYFDWEAEKNGDGITIEMPEGMKTGQLTSDSCQEHLLEAVVANVCHSGTD 467 Query: 2158 FKKEKDICAWSKSEDTESEIHSKSPE--RMVDSQ--------LTAE-----------CGL 2274 K EK C +S T + S +DS+ L AE CG Sbjct: 468 VKSEKSFCKSMQSLLTTEKYPEPSSHTTHTIDSENYSIDQPSLIAEDTQQCLSSSGVCG- 526 Query: 2275 DHLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELR 2454 V S K SS S CS QLE+ P+K N+KRARP E+ R RPRDRQLI+DR+KELR Sbjct: 527 ---VISPKWFSSPCPSACSEQLERSSGPSKNNKKRARPGENSRPRPRDRQLIQDRIKELR 583 Query: 2455 ELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 EL+PNG+K SIDSLLERTIKHMLFL+ +TKHA K+ KCA++K Sbjct: 584 
ELIPNGAKCSIDSLLERTIKHMLFLQSITKHADKLNKCADAK 625 >ref|XP_004292200.1| PREDICTED: transcription factor bHLH155-like [Fragaria vesca subsp. vesca] Length = 756 Score = 327 bits (839), Expect(2) = 3e-91 Identities = 233/639 (36%), Positives = 325/639 (50%), Gaps = 77/639 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T L R LR+LCFNTEW YA FWKL+H M+LT EDAYY+N E D+SG+ F + + + Sbjct: 37 TDLHRVLRSLCFNTEWNYAIFWKLKHRARMVLTWEDAYYDNCEQYDNSGNRSFIKTLEAL 96 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 H H + D LGLA+AKMSY VY+LGEGI+GQVAITGKHQWIF D + S SE+CD W Sbjct: 97 HGNHNMHDSLGLAMAKMSYHVYTLGEGIVGQVAITGKHQWIFADNIVKDNCSPSEYCDGW 156 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QF +GI+T QLGSL + E++++++HIKD F + IP+ LQ Sbjct: 157 QSQFLAGIRTIVVVAVVPHGVVQLGSLKKITENVELISHIKDAFIGSK------IPH-LQ 209 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 H S + KI A + F D ++ ++AI+ EK+++ D S + P Sbjct: 210 HIQSSIVISPKILASGA----FPDCLQNLDKAINREKSDVWLSAFPHSGKDGD-SSYIFP 264 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLN-QEVKLLYKNNC 1797 L + + A VVNK +ES+ DES L Q KS I N++ V+LL C Sbjct: 265 LTG-NFKNAVEVVNKHGELESSNIGGDESP--KLHQSKSSIFNLENSKLVGVELLDSRKC 321 Query: 1798 GEENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTVCDRV 1977 E+ G +M S +S PL + ++ + T ++S V DRV Sbjct: 322 TGESSGCKDMGISSTNSAD------PLSHANDCADLSST-----------FVNSDVNDRV 364 Query: 1978 EFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSSGCELHEALGS 2154 DS+ + + L SEP +++ L+ + E+ D+ ++SL F +GCELHEALG Sbjct: 365 NLDSIDLYRNEVLHVSEPSDVKFQSNLDNLKFQTELGQADTSSSSLMFPAGCELHEALGP 424 Query: 2155 TFKKEKDICAWSKSEDTESEIHSKSPERMVDSQLTAECGLDHL--------------VRS 2292 F + + W ++E ++ PE M SQLT++ +HL V+S Sbjct: 425 AFMHKSNFFDW-EAEKIGDRTTAEMPEGMNSSQLTSDSCPEHLLEAVVAKVCHSGSHVKS 483 Query: 2293 SKELSSNTQSMCSNQLEKHGEPTKIN--------------RKRARPSESC---------- 2400 K + QS+ + EK+ EP+ R ++ C Sbjct: 484 
EKSFCKSMQSLLTT--EKYPEPSSHTTHTLDSENYSIDQPSMRGEDTQQCLSSSGICGVI 541 Query: 2401 -----------------------------RARP--------RDRQLIEDRVKELRELVPN 2469 RARP RDRQLI+DR+KELREL PN Sbjct: 542 SPKWFSSPCPSACSEQQERSSGPARNNKKRARPGETSRPRPRDRQLIQDRIKELRELTPN 601 Query: 2470 GSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESKFC 2586 G+K SIDSLLERTIKHMLFL+ +TKHA K+ KCA++K C Sbjct: 602 GAKCSIDSLLERTIKHMLFLQSITKHADKLNKCADAKLC 640 Score = 37.7 bits (86), Expect(2) = 3e-91 Identities = 18/23 (78%), Positives = 19/23 (82%) Frame = +3 Query: 729 DQLLLPTSGPPIKRRAGLRRKQA 797 D+L L GPPIKRRAGLRRKQA Sbjct: 4 DRLPLAAVGPPIKRRAGLRRKQA 26 >ref|XP_006351645.1| PREDICTED: transcription factor bHLH155-like isoform X3 [Solanum tuberosum] Length = 752 Score = 314 bits (804), Expect(2) = 1e-87 Identities = 217/629 (34%), Positives = 305/629 (48%), Gaps = 70/629 (11%) Frame = +1 Query: 916 TLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNMHEGHY 1095 TLR+LC NT WKYA FWKL H M+LT EDAYY+N G + N+++GHY Sbjct: 36 TLRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDND---GFPGKKSPGSTAGNLYDGHY 92 Query: 1096 LQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVWQTQFS 1275 + LG+AVAKMSY VYSLGEGI+GQVAITGKH W+ DK + + E CD WQ QFS Sbjct: 93 SNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGWQAQFS 152 Query: 1276 SGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQHTVQS 1455 +GIKT QLGSL+++ ED++ + HI+DVF ++ + +Q+++++ Sbjct: 153 AGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQYSMEN 212 Query: 1456 TLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLPLPSLH 1635 + S+IS ++SG E+F D + ++ + N+ S L S+ D SC Sbjct: 213 SC-LSEISTRTSGSEVFQDCVNNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQPGGFP 271 Query: 1636 QRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCGEENRG 1815 + + V N+ L S DD +L + S I + EE + Sbjct: 272 NKILEAVHNQGLHRTSVQGSDDSENLLPASCESSIIKH----------------QEEGQM 315 Query: 1816 WMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTVCDRVEFDSVV 1995 W E + +N +VD P + S T C + +++ Sbjct: 316 WEETDPKFEGQTSNLRV-LGKGSVDK-CEPTFRSDASIGSVSYDAGQVTECPQPNRNNLA 373 
Query: 1996 YVQKKD----LRSSEPLEMQLGKGLEQNLEPEVSCIDSVNTSLKFSSGCELHEALGSTFK 2163 D L S+ K E NL E C D+++T +F +G EL+EALG F+ Sbjct: 374 SEADNDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTPFRFCAGYELYEALGPVFQ 433 Query: 2164 KEKDICAWSKSEDTESEI-----------------------------------------H 2220 K W + E + Sbjct: 434 KGNSSKDWEAGKREEMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRYDNDCSSVKSF 493 Query: 2221 SKSPERMVDSQLTAE-CGLD------------------------HLVRSSKELSSNTQSM 2325 KS + ++ +++TAE C D +RSS+ LSS + S Sbjct: 494 CKSVDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNSSGTCSIRSSRGLSSTSCSR 553 Query: 2326 CSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSIDSLLER 2505 S +E+ EP K+++KRARP ESCR RPRDRQLI+DR+KELR+LVPNGSK SIDSLLER Sbjct: 554 GSGHVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSIDSLLER 613 Query: 2506 TIKHMLFLKIVTKHAKKMKKCAESKFCDE 2592 TIKHMLF++ VTKHA K+ KC+ SK D+ Sbjct: 614 TIKHMLFMQSVTKHADKLSKCSASKLVDK 642 Score = 39.7 bits (91), Expect(2) = 1e-87 Identities = 19/21 (90%), Positives = 19/21 (90%) Frame = +3 Query: 735 LLLPTSGPPIKRRAGLRRKQA 797 LLL T GPPIKRRAGLRRKQA Sbjct: 8 LLLSTVGPPIKRRAGLRRKQA 28 >emb|CCX35476.1| hypothetical protein [Malus domestica] Length = 741 Score = 327 bits (838), Expect = 2e-86 Identities = 233/643 (36%), Positives = 323/643 (50%), Gaps = 81/643 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T L LR+LCFNTEW YA WKL+H M+LT EDAY++N E SS + CF + + Sbjct: 3 TDLHNILRSLCFNTEWNYAVSWKLKHRARMVLTCEDAYFDNCEQQHSSENRCFSKTMDKL 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 H+ HY DPLGLAVAKMS VY+LGEGI+GQVA+TG+HQWI+ D + S ++CD W Sbjct: 63 HDSHYSHDPLGLAVAKMSCHVYNLGEGIVGQVAVTGEHQWIYADDLVKNNCSPFQYCDGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+Q+S+GI+T QLGSLN V E++K+++ I D F ++++ +E I Q Sbjct: 123 QSQYSAGIRTIVVVAVVPHRVIQLGSLNKVAENVKLISQITDAFKTLQDFPIEHILNPKQ 182 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 ++ S++ + IS + + D + + A + E ++I + + L ND S V 
Sbjct: 183 SSINSSVCSTNISLEGLASGVLPDCVNNLDTATNRESSDIWASIFPHLVKDNDSSY-VSS 241 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQE-VKLLYKNNC 1797 L ++ + NK G+ES+ E L Q KS L+++ V+LL C Sbjct: 242 LTENCLKEEVELANKHGGLESSNFGSVEIGKLP--QSKSSALSMEHHRLVGVELLDSRKC 299 Query: 1798 GEENRGWMEMAAGS---KHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTVC 1968 E+ G + S H ++ P N + N+ +F P LDST Sbjct: 300 KGESSGCKDTGMASVIYAHPLSHDPVN--IVNLCDFA-----------DLPTTFLDSTAH 346 Query: 1969 DRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSSG------ 2127 +R+ D V Q + L SEP ++ KGLE + E +D+ +TS+ F +G Sbjct: 347 ERINADRVDLHQNEVLHVSEPSVVKFQKGLENLEFQTESGHMDTSSTSMTFPAGCELHEA 406 Query: 2128 ----------------------------------------CELH--EAL-------GSTF 2160 C+ H EA+ GS Sbjct: 407 LGPAFLNQGNYFDWVAGKNGDRITPEIPEGMNTSQLTSASCQEHLLEAVVANVCQSGSLV 466 Query: 2161 KKEKDICAWSKSEDTESEIHSKSPE--RMVDSQ--------LTAE-----------CGLD 2277 K EK C +S T + S +DS+ LT E CG Sbjct: 467 KSEKSFCKSMQSLLTTEKCPEPSSRITHTIDSENYSIDQPSLTGEDMQQCLSSSGVCG-- 524 Query: 2278 HLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELRE 2457 V S K SS S CS QLE+ P+K ++KRARP ES R RPRDRQLI+DR+KELRE Sbjct: 525 --VISPKWFSSPCPSACSEQLERSSGPSKNSKKRARPGESSRPRPRDRQLIQDRIKELRE 582 Query: 2458 LVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESKFC 2586 L+P G+K SIDSLLERTIKHMLFL+ VTKHA K+ KCA++K C Sbjct: 583 LIPTGAKCSIDSLLERTIKHMLFLQSVTKHADKLNKCADAKLC 625 >ref|XP_004247231.1| PREDICTED: transcription factor EMB1444-like [Solanum lycopersicum] Length = 724 Score = 323 bits (827), Expect = 3e-85 Identities = 223/650 (34%), Positives = 317/650 (48%), Gaps = 86/650 (13%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNN-----QESLDSSGHMCFHV 1065 + LQ+ LR+LC NT WKYA FWKL H M+LT EDAYY+N ++S DS+ Sbjct: 3 SQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDNDGFPGKKSPDSTAG----- 57 Query: 1066 SRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSE 1245 N+++GHY + LG+AVAKMSY VYSLGEGI+GQVAITGKH W+ +K + + E Sbjct: 58 ---NLYDGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSANKVAAITNLAPE 
114 Query: 1246 FCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFI 1425 CD WQ QFS+GIKT QLGSL+++ ED++ + HI+DVF ++ + Sbjct: 115 HCDGWQAQFSAGIKTIVVAAVAPHGVVQLGSLDSIPEDLRAIKHIRDVFSELQELMTSCL 174 Query: 1426 PYRLQHTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLS 1605 +QH+++++ S+IS ++SG EIF D + ++ ++ N+ S L S D S Sbjct: 175 RSSMQHSMENSC-LSEISTRTSGSEIFQDCVNNLGRSVCEDRRNMWSPLYTSFEKSVDHS 233 Query: 1606 CDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQ------ 1767 C L P + K VVN S+ D+S L +S I+ ++ Q Sbjct: 234 CIFLQ-PGGYPNKILEVVNNQRLHRSSVQGSDDSTNLLPASCESSIIKHQEEGQMWEETD 292 Query: 1768 --------EVKLLYKNNCGEENRGW-MEMAAGSKHSDANTPYNFPLENVDNFILPAYTSG 1920 +++L K + + + + + GS DA P N +N AY Sbjct: 293 PKFEGQTSNLRVLGKGSVDKSEPNFKSDTSIGSVSYDAGQVTECPQRNRNNLASEAYND- 351 Query: 1921 IGYPFCPLGVLDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLEPEVSCIDSV 2100 + + L S+ K E NL C D++ Sbjct: 352 ---------------------------RNRMLGLSDLPNAYADKCAETNLGFGTECNDTM 384 Query: 2101 NTSLKFSSGCELHEALGSTFKKEKDICAWSKSEDTESEI--------------------- 2217 +T +F +G EL+EALG F+K W + E + Sbjct: 385 HTPFRFCAGYELYEALGPVFQKGNSSKDWEAGKREEMAVDMLEGIGTSSLVMSNTGNEHL 444 Query: 2218 --------------------HSKSPERMVDSQLTAE-CGLD------------------- 2277 KS + ++ +++TAE C D Sbjct: 445 LEAVIANVNRHDNDCSSVKSFCKSVDSLLTTEITAEPCSSDIGTISSTGYSFDRETLNSF 504 Query: 2278 -----HLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRV 2442 +RSS+ LSS + S S +E+ EP K+++KRARP ESCR RPRDRQLI+DR+ Sbjct: 505 NSSGTCSIRSSRGLSSTSCSRGSGHVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRI 564 Query: 2443 KELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESKFCDE 2592 KELR+LVPNGSK SIDSLLERTIKHMLF++ VTKHA K+ KC+ SK D+ Sbjct: 565 KELRDLVPNGSKCSIDSLLERTIKHMLFMQSVTKHADKLSKCSASKLADK 614 >ref|XP_006351643.1| PREDICTED: transcription factor bHLH155-like isoform X1 [Solanum tuberosum] Length = 722 Score = 316 bits (810), Expect = 3e-83 Identities = 218/634 (34%), Positives = 308/634 (48%), Gaps = 70/634 (11%) Frame = +1 Query: 901 
THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 + LQ+ LR+LC NT WKYA FWKL H M+LT EDAYY+N G + N+ Sbjct: 3 SQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDND---GFPGKKSPGSTAGNL 59 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 ++GHY + LG+AVAKMSY VYSLGEGI+GQVAITGKH W+ DK + + E CD W Sbjct: 60 YDGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGW 119 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q QFS+GIKT QLGSL+++ ED++ + HI+DVF ++ + +Q Sbjct: 120 QAQFSAGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQ 179 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 ++++++ S+IS ++SG E+F D + ++ + N+ S L S+ D SC Sbjct: 180 YSMENSC-LSEISTRTSGSEVFQDCVNNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQ 238 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 + + V N+ L S DD +L + S I + Sbjct: 239 PGGFPNKILEAVHNQGLHRTSVQGSDDSENLLPASCESSIIKH----------------Q 282 Query: 1801 EENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTVCDRVE 1980 EE + W E + +N +VD P + S T C + Sbjct: 283 EEGQMWEETDPKFEGQTSNLRV-LGKGSVDK-CEPTFRSDASIGSVSYDAGQVTECPQPN 340 Query: 1981 FDSVVYVQKKD----LRSSEPLEMQLGKGLEQNLEPEVSCIDSVNTSLKFSSGCELHEAL 2148 +++ D L S+ K E NL E C D+++T +F +G EL+EAL Sbjct: 341 RNNLASEADNDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTPFRFCAGYELYEAL 400 Query: 2149 GSTFKKEKDICAWSKSEDTESEI------------------------------------- 2217 G F+K W + E + Sbjct: 401 GPVFQKGNSSKDWEAGKREEMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRYDNDCS 460 Query: 2218 ----HSKSPERMVDSQLTAE-CGLD------------------------HLVRSSKELSS 2310 KS + ++ +++TAE C D +RSS+ LSS Sbjct: 461 SVKSFCKSVDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNSSGTCSIRSSRGLSS 520 Query: 2311 NTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSID 2490 + S S +E+ EP K+++KRARP ESCR RPRDRQLI+DR+KELR+LVPNGSK SID Sbjct: 521 TSCSRGSGHVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSID 580 Query: 2491 SLLERTIKHMLFLKIVTKHAKKMKKCAESKFCDE 2592 SLLERTIKHMLF++ VTKHA K+ KC+ SK D+ 
Sbjct: 581 SLLERTIKHMLFMQSVTKHADKLSKCSASKLVDK 614 >ref|XP_007144919.1| hypothetical protein PHAVU_007G194600g [Phaseolus vulgaris] gi|561018109|gb|ESW16913.1| hypothetical protein PHAVU_007G194600g [Phaseolus vulgaris] Length = 741 Score = 314 bits (804), Expect = 1e-82 Identities = 229/638 (35%), Positives = 314/638 (49%), Gaps = 78/638 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 ++L R LR+ C T+WKYA FWKL+H MILT EDAYY+N DSS + + + + Sbjct: 3 SNLHRLLRSFCLGTDWKYAIFWKLKHRARMILTWEDAYYDNPNICDSSENKSCQNTWERI 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 + DPLGLAVAKMSY VYSLGEGI+GQVA+TGKH+WI D S S EF D W Sbjct: 63 GSADFSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHRWICVDNQVTSSVPSFEFADGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QFS+GI+T QLGSLN V EDM V+T I+ +F S ++ ++ P +LQ Sbjct: 123 QSQFSAGIRTIVVIAVVPLGVVQLGSLNKVAEDMGVITCIRSLFLSNQDYTICHAPNQLQ 182 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 + S K+S + + ES + + ++ LL ++ F + P Sbjct: 183 N-----------SLKNSSSVMDSETSESVPAYLQTTEKTMKHELLDNIMPFQCPGNNDSP 231 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 + D V K G E + D S++ LLQ S ++NV+Q KLL Sbjct: 232 HAVYEKTTVD--VAKHEGPELNS---DGSSI--LLQSMSNMMNVEQ----QKLLGMRPVN 280 Query: 1801 E-----ENRGWMEMAAGSKHSDANTPYNFPLEN--VDNFILPAYTSGIGYPFCPLGVLDS 1959 E + G + + S ++ +N +N V++ + P+ +G+ LD+ Sbjct: 281 ERKFEGNSSGREDTSVESGKKLSSFLHNLVTDNNGVNDLVCPSENAGVNSVSFSSDFLDT 340 Query: 1960 TVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLEPEVSCIDSVNTSLKFSSGCELH 2139 VC+ +F V QK S PL+ K + C +LKF +G ELH Sbjct: 341 VVCESEKFHYVDINQKGVKNWSRPLDAYSQKDTGMSKFQTEPCSKDTTYTLKFPAGYELH 400 Query: 2140 EALGSTFKKEKDICAWS-------KSEDTESEIH----SKSPER---------------M 2241 EALG +F KE W+ K+ + EI + P+R Sbjct: 401 EALGPSFLKESKYFDWAVKANQDVKATEISDEISCSQLTSEPQREHLLEAMVANIGHNNN 460 Query: 2242 VDSQLTA---------------------------ECGLD--HLVRSSKELSSNTQSMC-- 2328 V+S+L+ C +D HL 
R K S ++ +C Sbjct: 461 VNSKLSVSATMQAAIASGGNPEGSIHTVHTINSESCSIDQPHLGREEKHYSLSSSGICGI 520 Query: 2329 --------------SNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVP 2466 S Q E+ EPTK ++KRARP ESCR RPRDRQLI+DR+KELRELVP Sbjct: 521 MSPKGFSSTCPSSCSEQFERSSEPTKNSKKRARPGESCRPRPRDRQLIQDRIKELRELVP 580 Query: 2467 NGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 NG+K SIDSLLE TIKHMLFLK VTKHA K+ K ++K Sbjct: 581 NGAKCSIDSLLECTIKHMLFLKNVTKHADKLNKFGDTK 618 >ref|XP_006443866.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] gi|557546128|gb|ESR57106.1| hypothetical protein CICLE_v10018993mg [Citrus clementina] Length = 748 Score = 313 bits (802), Expect = 3e-82 Identities = 224/644 (34%), Positives = 324/644 (50%), Gaps = 77/644 (11%) Frame = +1 Query: 880 MGEEMDLTHLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCF 1059 MG L L++LCFNT WKYA FWKL+H T M+LT ED YY+N DS + C Sbjct: 1 MGTSSTTFDLHGILKSLCFNTAWKYAVFWKLKHRTRMVLTWEDGYYDNCGQQDSLENKCS 60 Query: 1060 HVSRKNMHEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSS 1239 S +N H G Y DPLGLAVAKMSY VYSLGEGI+GQVA+TGKHQWIF D+ S SS Sbjct: 61 SESLENFHGGRYSHDPLGLAVAKMSYHVYSLGEGIVGQVAVTGKHQWIFSDQLVTNSCSS 120 Query: 1240 SEFCDVWQTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSME 1419 EF D WQ+QFS+GI+T QLGSL+ V EDMKVVTHI+DVF ++ + S+ Sbjct: 121 FEFSDGWQSQFSAGIRTIAVVAVVPHGVVQLGSLDEVTEDMKVVTHIRDVFAALNDISVG 180 Query: 1420 FIPYRLQHTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFND 1599 + +Q +V++TL + KS + + +E ++ +++ + + ND Sbjct: 181 HVSSTIQSSVKNTLSLPDLPTKS-----IPNRWHNLDEVVNRGGPDVQFPMFPYVEKHND 235 Query: 1600 LSCDVLPLPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQ-EVK 1776 S + + DGVVN++ G+ +++ SA +L KS ++N+ NQ + Sbjct: 236 GS---YAFSGMQPKIGDGVVNRNEGILLSSAGGVGSA--KILHPKSNVINLDYQNQMGIH 290 Query: 1777 LLYKNNCGEENRGWMEMAAGSKHSDANTPYNFPLENVD--NFILPAYTSGIGYPFCPLGV 1950 + E+ GW ++ S+ + N +++++ + L A + Sbjct: 291 FISDGMSRVESSGWKDLGVISEQNGTPFSINSVIDSINLCSVALQAEKFVADRTYLASNP 350 Query: 1951 LDSTVCDRVEFDSVVYVQKKDLRSSEPLEMQLGKGLEQ-NLEPEVSCIDSVNTSLKFSSG 2127 
L++ + ++V+ + Q L E +++ K LE+ + E++ +D SLKFS+ Sbjct: 351 LEAVLGEQVKLECTDSCQNGMLHIPEISDIKFEKDLEKLQNQTELNHLDPSGMSLKFSAV 410 Query: 2128 CELHEALGSTFKKEKDICAWSKSEDTES-------EIHSKS-------PERMVDSQLTAE 2265 ELHEALG F + KDI + E+T E+ S S E ++D+ + + Sbjct: 411 SELHEALGPAFLR-KDIYNDREPENTVDGETVGMPELTSSSHLMFDSGSENLLDAVVASV 469 Query: 2266 CGLDHLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRA-------------------RP 2388 C V+S + + + QS+ + + + N + Sbjct: 470 CNSGSDVKSERTVCRSMQSLLTTEKKPESSSQSKNTNNSVSYSISQSSLVEEDAKHFLNS 529 Query: 2389 SESC--------------------------------RAR------PRDR--QLIEDRVKE 2448 SE C RAR PR R QLI+DR+KE Sbjct: 530 SEVCGAVSSKGFSSTCPSTCSEQLDMSSEPAKNNKKRARTGENGRPRPRDRQLIQDRIKE 589 Query: 2449 LRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 LRELVPNGSK SIDSLLERTIKHMLFL+ +TKHA K+ KCAESK Sbjct: 590 LRELVPNGSKCSIDSLLERTIKHMLFLQSITKHADKLSKCAESK 633 >ref|XP_006383698.1| basic helix-loop-helix family protein [Populus trichocarpa] gi|550339661|gb|ERP61495.1| basic helix-loop-helix family protein [Populus trichocarpa] Length = 694 Score = 312 bits (800), Expect = 4e-82 Identities = 231/639 (36%), Positives = 316/639 (49%), Gaps = 79/639 (12%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T L TLR+LCFNT+W YA FWKL+H M+LT ED YY+N E D+ + CF +++N+ Sbjct: 5 TDLHDTLRSLCFNTDWNYAVFWKLKHRARMVLTWEDGYYDNCEQHDALENKCFRQTQENL 64 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 GHY +DPLGLAVAKMSY VYSLGEGI+GQVA++GKHQWIF DK S+SS EF D W Sbjct: 65 RGGHYPRDPLGLAVAKMSYHVYSLGEGIVGQVAVSGKHQWIFADKHVTNSFSSYEFSDGW 124 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QFS+GI+T QLGSLN V ED+ +VTHIKDVF ++++S++ + Q Sbjct: 125 QSQFSAGIRTIVVVAVVPYGVVQLGSLNKVSEDVNLVTHIKDVFFALQDSTVSHVTSPSQ 184 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 H +++ L K +A+ + + P + ND S D+L Sbjct: 185 HGMKNAL-CLKTAAELKNKQEVLEIPTPT----------------------NDESIDLLN 221 Query: 1621 
LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 L +S S D + L + I++ + E + G Sbjct: 222 L------------------KSNASYLDHRSQLGM-----NIISDRMFGGETSVWKDLGRG 258 Query: 1801 EENRGWMEMAAGSKHSDANTPYNFPLENV--DNFILPAYTSGIGYPFCPLGVLDSTVCDR 1974 E+ M HS+ +F ENV + +LP G P + DST+CDR Sbjct: 259 SEHNTTM-------HSN-----SFMRENVSLSDLVLPNEKLGADLAGFPADLFDSTICDR 306 Query: 1975 VEFDSVVYVQKKDLRSSEPLEMQLGKGLEQNLE--PEVSCIDSVNTSLKFSSGCELHEAL 2148 + DS+ L + E ++ + LE+ L+ E + +S +T KFS+GCEL EAL Sbjct: 307 DKSDSINLRPNVVLNAPESSDITFKRDLEKKLDHPAESTHFNSSDTFFKFSAGCELLEAL 366 Query: 2149 GSTFKKEKDICAWSKSEDTESEIHS--KSPERMVDSQLTAECGLDHL------------- 2283 G +F C + +SE + + PE M SQ+T + G ++L Sbjct: 367 GPSFLNR---CMPFDYQTGKSEAGNIFEMPEGMSSSQMTFDFGSENLLEAVVGNVCHSGS 423 Query: 2284 -VRSSKELSSNTQSM------------------------------------CSNQLEKHG 2352 V+S K + QS+ SN E G Sbjct: 424 DVKSEKSGCKSVQSLVTAEKLPEPSIQTKHIMNSAGYSINQSSVVEEDVHNLSNSTEVCG 483 Query: 2353 E----------PTKINRKRARPSESC-----RARP--------RDRQLIEDRVKELRELV 2463 P+ + + + SES RA+P RDRQLI+DR+KELRELV Sbjct: 484 GMSSKGFSSTCPSTYSEQLDKRSESAKNSKKRAKPGENCRPRPRDRQLIQDRIKELRELV 543 Query: 2464 PNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 PNGSK SIDSLLERTIKHMLFL+ +TKHA K+ KCAE K Sbjct: 544 PNGSKCSIDSLLERTIKHMLFLENITKHADKLNKCAEPK 582 >ref|XP_006351644.1| PREDICTED: transcription factor bHLH155-like isoform X2 [Solanum tuberosum] Length = 605 Score = 305 bits (781), Expect = 7e-80 Identities = 213/623 (34%), Positives = 301/623 (48%), Gaps = 70/623 (11%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 + LQ+ LR+LC NT WKYA FWKL H M+LT EDAYY+N G + N+ Sbjct: 3 SQLQQALRSLCCNTPWKYAVFWKLTHRARMMLTWEDAYYDND---GFPGKKSPGSTAGNL 59 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 ++GHY + LG+AVAKMSY VYSLGEGI+GQVAITGKH W+ DK + + E CD W Sbjct: 60 YDGHYSNNHLGVAVAKMSYHVYSLGEGIVGQVAITGKHLWLSADKVAAITSLAPEHCDGW 119 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q QFS+GIKT 
QLGSL+++ ED++ + HI+DVF ++ + +Q Sbjct: 120 QAQFSAGIKTIVVAAVAPHGVIQLGSLDSIPEDLRAIKHIRDVFSELQELMASCLRSSMQ 179 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 ++++++ S+IS ++SG E+F D + ++ + N+ S L S+ D SC Sbjct: 180 YSMENSC-LSEISTRTSGSEVFQDCVNNLGRSVCEDGRNMWSPLYTSVEKSVDHSCIFSQ 238 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLNQEVKLLYKNNCG 1800 + + V N+ L S DD +L + S I + Sbjct: 239 PGGFPNKILEAVHNQGLHRTSVQGSDDSENLLPASCESSIIKH----------------Q 282 Query: 1801 EENRGWMEMAAGSKHSDANTPYNFPLENVDNFILPAYTSGIGYPFCPLGVLDSTVCDRVE 1980 EE + W E + +N +VD P + S T C + Sbjct: 283 EEGQMWEETDPKFEGQTSNLRV-LGKGSVDK-CEPTFRSDASIGSVSYDAGQVTECPQPN 340 Query: 1981 FDSVVYVQKKD----LRSSEPLEMQLGKGLEQNLEPEVSCIDSVNTSLKFSSGCELHEAL 2148 +++ D L S+ K E NL E C D+++T +F +G EL+EAL Sbjct: 341 RNNLASEADNDRNRKLGLSDLPNAYADKCAETNLGFETQCNDTMHTPFRFCAGYELYEAL 400 Query: 2149 GSTFKKEKDICAWSKSEDTESEI------------------------------------- 2217 G F+K W + E + Sbjct: 401 GPVFQKGNSSKDWEAGKREEMAVDMLEGIGTSSLVMSNTGNEHLLEAVIANVNRYDNDCS 460 Query: 2218 ----HSKSPERMVDSQLTAE-CGLD------------------------HLVRSSKELSS 2310 KS + ++ +++TAE C D +RSS+ LSS Sbjct: 461 SVKSFCKSVDSLLTTEITAEPCSSDIGAISSIGYSFDRETLNSFNSSGTCSIRSSRGLSS 520 Query: 2311 NTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQLIEDRVKELRELVPNGSKGSID 2490 + S S +E+ EP K+++KRARP ESCR RPRDRQLI+DR+KELR+LVPNGSK SID Sbjct: 521 TSCSRGSGHVERPLEPVKMHKKRARPGESCRPRPRDRQLIQDRIKELRDLVPNGSKCSID 580 Query: 2491 SLLERTIKHMLFLKIVTKHAKKM 2559 SLLERTIKHMLF++ VTKHA K+ Sbjct: 581 SLLERTIKHMLFMQSVTKHADKL 603 >ref|XP_003553489.1| PREDICTED: transcription factor EMB1444-like [Glycine max] Length = 756 Score = 296 bits (758), Expect = 3e-77 Identities = 227/652 (34%), Positives = 324/652 (49%), Gaps = 92/652 (14%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T+L + L +LC NT W YA FWKL+H MILT EDAYYNN + DSS + + + + Sbjct: 3 TNLHQVLGSLCLNTHWNYAIFWKLKHRARMILTWEDAYYNNPDDFDSSENKHCQKTLEQI 62 Query: 1081 
HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 G + LGLAVAKMSY YSLGEGI+GQVA+TGKH+WI D S S EF D W Sbjct: 63 GCGKFSHSALGLAVAKMSYHAYSLGEGIVGQVAVTGKHRWICADNQVASSGLSFEFADGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QFS+GI+T QLGSLN VIEDM VTHI+++F S +N S++ P ++Q Sbjct: 123 QSQFSAGIRTIAVVAVVPLGVVQLGSLNKVIEDMGFVTHIRNLFLSTQNYSIQ-CPSQIQ 181 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 +++S+ K S ++ +I + +++ +E A++ L S + P Sbjct: 182 GSLKSSSQLDK-SKENFSSDIMRTCFYDTQKSMKSETADVLMPLQCS-----GTGRNCTP 235 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVK-QLNQEVKLLYKNNC 1797 PS ++ +D V + E +DES++L LQ S ++NV Q +E+K LY Sbjct: 236 -PSACEKMSDNVAKQ----EGPELYNDESSIL--LQSISNMMNVDCQEFEEMKPLYGTKY 288 Query: 1798 GEENRGWMEMAAGSKHSDANTPYNFPLENV--DNFILPAYTSGIGYPFCPLGVLDSTVCD 1971 + G +M S+ + ++ +F +N ++ I P+ + P LD+ VC+ Sbjct: 289 EGGSSGCKDMRLESEKNVSSFLNDFVTDNASFNDVICPSEKVRVDSACFPSVFLDTVVCE 348 Query: 1972 R-------------------VEFDSVVYVQKKDLRS-----------SEP--------LE 2037 E +S +++K + +EP L+ Sbjct: 349 SDKLHYADINQKGAVNFAQPSEANSQQHIEKSKFHTEPCYKDIPDFQTEPCYKDASHILK 408 Query: 2038 MQLGKGLEQNLEP--------------------EVSCIDSVNTSLKFSSGCELH--EALG 2151 G L + L P V D ++TS S C H EA+ Sbjct: 409 FPAGCELHEALGPAFLKGGKCLDWPAQINQEMKSVEMSDEISTSQLTSESCPEHLLEAML 468 Query: 2152 STFK---------------KEKDICAWSKSEDTESEIHSKSPERMVDSQLTAE------- 2265 + F K+ I + E + +H+ + E QL+ Sbjct: 469 ANFSHSNNDVNSELSFCKSKQSAIVSAKNHEASIHNVHTINSEGYSIDQLSLVREDKHHS 528 Query: 2266 -------CGLDHLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDRQ 2424 CG V SSK +SS S S QLE+ EP+K ++KRARP ESCR RPRDRQ Sbjct: 529 LSSSSGICG----VMSSKGISSTFHSSNSGQLERSSEPSKNSKKRARPGESCRPRPRDRQ 584 Query: 2425 LIEDRVKELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 LI+DR+KELRELVPNG+K SIDSLLERTIKHMLFL+ +TKHA K+ +++K Sbjct: 585 LIQDRIKELRELVPNGAKCSIDSLLERTIKHMLFLQSITKHADKLTDFSDTK 636 >ref|XP_006576937.1| PREDICTED: transcription factor EMB1444-like isoform X3 [Glycine max] Length 
= 679 Score = 289 bits (739), Expect = 5e-75 Identities = 227/653 (34%), Positives = 325/653 (49%), Gaps = 93/653 (14%) Frame = +1 Query: 901 THLQRTLRNLCFNTEWKYAAFWKLQHGTPMILTLEDAYYNNQESLDSSGHMCFHVSRKNM 1080 T+L + LR+LC NT W YA FWKL+H MILT EDAYY+N + DSS + + + + Sbjct: 3 TNLHQVLRSLCLNTHWNYAIFWKLKHRARMILTWEDAYYSNPDDYDSSENKHCQKTLEQI 62 Query: 1081 HEGHYLQDPLGLAVAKMSYFVYSLGEGIIGQVAITGKHQWIFEDKPGPCSWSSSEFCDVW 1260 G + L LAVAKMSY YSLGEGIIGQVA+TGKH+WI D S S EF D W Sbjct: 63 GCGKFSHSALELAVAKMSYHAYSLGEGIIGQVAVTGKHRWICADNQVAGSGLSFEFADGW 122 Query: 1261 QTQFSSGIKTXXXXXXXXXXXXQLGSLNTVIEDMKVVTHIKDVFHSIENSSMEFIPYRLQ 1440 Q+QFS+GI+T QLGSLN VIEDM+ VTHI+++F S +N S+ P ++Q Sbjct: 123 QSQFSAGIRTIAVVAVVPLGVVQLGSLNKVIEDMEFVTHIRNLFLSTQNYSI-LRPSQIQ 181 Query: 1441 HTVQSTLHPSKISAKSSGLEIFDDFPESSNEAISNEKANIRSHLLQSLRIFNDLSCDVLP 1620 +++S+ + S +I + +++ +E A++ L S N Sbjct: 182 GSLKSSSELDTLKENLSS-DIMPTCFYDTQKSMKSETADVLMPLQCSGTGRNYT------ 234 Query: 1621 LPSLHQRKADGVVNKDLGVESTTSRDDESAVLALLQQKSEILNVKQLN-QEVKLLYKNNC 1797 PS H++ +D V + E +DES++L LQ S ++NV +E+K LY Sbjct: 235 -PSAHEKMSDNVAKQ----EGPELYNDESSIL--LQSISNMMNVDCKEFEEMKPLYGMKY 287 Query: 1798 -GEENRGWMEMAAGSKHSDANTPYNFPLENV--DNFILPAYTSGIGYPFCPLGVLDSTVC 1968 G + +M S+ + ++ +F +N ++ I P+ + P LD+ VC Sbjct: 288 EGGSSGDCKDMRLESEKNVSSYLNDFVTDNASFNDLICPSEKVRVDSACFPSVFLDTVVC 347 Query: 1969 DR-------------------VEFDSVVYVQK---------KDLR----------SSEPL 2034 + E +S +++K KD+ +S+ L Sbjct: 348 ESDKLHYADINQKGALNFAQPSEANSQQHIEKSKFHTEPCYKDISDFQTEPCYKDASQML 407 Query: 2035 EMQLGKGLEQNLEPEVSCI--------------------DSVNTSLKFSSGCELH--EAL 2148 G L + L P S + D ++TS S C H EA+ Sbjct: 408 NFPAGCELHEALGPAFSKVGKCFDWPTQVNQEMKPVEMSDEISTSQLTSESCPEHLLEAM 467 Query: 2149 -------GSTFKKEKDIC-----AWSKSEDTESEIHS----KSPERMVD----------- 2247 + E C A + +++ E+ IH+ S ++D Sbjct: 468 LVNINHSNNDVNSELSFCTSKQSAMASAKNHEASIHNVHTINSEGYLMDQLSLVREDKHH 527 Query: 2248 --SQLTAECGLDHLVRSSKELSSNTQSMCSNQLEKHGEPTKINRKRARPSESCRARPRDR 2421 S + CG V SSK +SS S S QLE+ EP+K ++KRARP ESCR RPRDR Sbjct: 528 
SLSSSSGICG----VMSSKGVSSTFHSSNSGQLERSSEPSKNSKKRARPGESCRPRPRDR 583 Query: 2422 QLIEDRVKELRELVPNGSKGSIDSLLERTIKHMLFLKIVTKHAKKMKKCAESK 2580 QLI+DR+KELRELVPNG+K SIDSLLER IKH+LFL+ +TKHA K+ A++K Sbjct: 584 QLIQDRIKELRELVPNGAKCSIDSLLERAIKHLLFLQSITKHADKLTDFADTK 636