BLASTX nr result
ID: Glycyrrhiza36_contig00017974
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00017974 (1170 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

XP_017422950.1 PREDICTED: myb family transcription factor EFM [V...   369   e-122
XP_014507138.1 PREDICTED: uncharacterized protein LOC106766876 [...   369   e-122
XP_007154940.1 hypothetical protein PHAVU_003G159800g [Phaseolus...   368   e-121
XP_003550834.1 PREDICTED: uncharacterized protein LOC100797015 [...   366   e-120
KOM33086.1 hypothetical protein LR48_Vigan01g264200 [Vigna angul...   361   e-118
KHN27391.1 Two-component response regulator ARR18 [Glycine soja]      358   e-118
XP_003525655.1 PREDICTED: uncharacterized protein LOC100807925 [...   358   e-117
XP_004508555.1 PREDICTED: uncharacterized protein LOC101500047 [...   337   e-109
XP_015898154.1 PREDICTED: uncharacterized protein LOC107431689 [...   335   e-108
XP_018836930.1 PREDICTED: myb family transcription factor EFM-li...   322   e-103
XP_003528276.1 PREDICTED: uncharacterized protein LOC100809196 [...   317   e-101
XP_018836998.1 PREDICTED: myb family transcription factor EFM-li...   316   e-101
KHN39723.1 Two-component response regulator ARR18 [Glycine soja]      315   e-100
XP_019416025.1 PREDICTED: myb family transcription factor EFM [L...   314   e-100
XP_003609208.1 myb-like transcription factor family protein [Med...   313   e-100
XP_003524007.1 PREDICTED: transcription factor LUX-like [Glycine...   312   1e-99
XP_017610040.1 PREDICTED: myb family transcription factor EFM [G...   312   2e-99
XP_016690013.1 PREDICTED: uncharacterized protein LOC107907243 [...   312   2e-99
KHG04305.1 Two-component response regulator ARR18 -like protein ...   312   2e-99
XP_007013309.1 PREDICTED: myb family transcription factor EFM [T...   311   4e-99

>XP_017422950.1 PREDICTED: myb family transcription factor EFM [Vigna angularis] BAT76413.1 hypothetical protein VIGAN_01440800 [Vigna angularis var. angularis] Length = 450 Score = 369 bits (948), Expect = e-122 Identities = 220/307 (71%), Positives = 238/307 (77%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTDQTYKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPQSYSLLLKSFG-DQTDQTYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKATN +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKATN-MSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ SEG KQQ + +TSPKEADIGFS+ SPK LDNK RN GGAFLPFSKERN C Sbjct: 116 QLWSQASEG-AKQQPT--LTSPKEADIGFSI-SPKLALDNK-PRNGGGAFLPFSKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL+L S+EKE MEDKK +EAEK G+SCQ + DGA+ DQ Sbjct: 171 -QGSTLR-PLPELALASSEKE-MEDKKR--AEAEK-GLSCQNK----KENSGSDGAVVDQ 220 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 221 GKSGSPV-------ASSLAQTTTTTSQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 273 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 274 QIRELMK 280 >XP_014507138.1 PREDICTED: uncharacterized protein LOC106766876 [Vigna radiata var.
radiata] Length = 450 Score = 369 bits (948), Expect = e-122 Identities = 219/307 (71%), Positives = 236/307 (76%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTDQTYKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELTL-DCKPQSYSLLLKSFG-DQTDQTYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKATN +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKATN-MSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ SEG Q +TSPKEADIGFS+ SPK LDNK QRN GGAFLPFSKERN C Sbjct: 116 QLWSQASEGAKHQ---PTLTSPKEADIGFSI-SPKLALDNK-QRNGGGAFLPFSKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL+L S+EKE MEDKK +EAEK G+SCQ + DGA+ DQ Sbjct: 171 -QGSTLR-PLPELALASSEKE-MEDKKR--AEAEK-GLSCQNK----KENSGSDGAVVDQ 220 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 221 GKSGSPV-------ASSLAQTTTTTSQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 273 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 274 QIRELMK 280 >XP_007154940.1 hypothetical protein PHAVU_003G159800g [Phaseolus vulgaris] ESW26934.1 hypothetical protein PHAVU_003G159800g [Phaseolus vulgaris] Length = 450 Score = 368 bits (945), Expect = e-121 Identities = 219/307 (71%), Positives = 238/307 (77%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTDQTYKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPQSYSLLLKSFG-DQTDQTYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKA N +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKAIN-MSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ SEG TKQQ + +TSPKEADIGFS+ SPK 
LD+K QRN GGAFLPFSKERN C Sbjct: 116 QLWSQASEG-TKQQPT--LTSPKEADIGFSI-SPKLALDSK-QRNGGGAFLPFSKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL+L S+EKE MEDKK +E EK G+SCQ + DGA+ DQ Sbjct: 171 -QGSTLR-PLPELALASSEKE-MEDKKR--AEVEK-GVSCQNK----KENSGSDGAVVDQ 220 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 221 GKSGSPV-------ASSLAQTTTTTAQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 273 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 274 QIRELMK 280 >XP_003550834.1 PREDICTED: uncharacterized protein LOC100797015 [Glycine max] KRH03776.1 hypothetical protein GLYMA_17G119600 [Glycine max] Length = 452 Score = 366 bits (939), Expect = e-120 Identities = 218/307 (71%), Positives = 234/307 (76%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 M SPSEL L DCKP SYS+LLKSFG DQTDQTYKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MESPSELSL-DCKPQSYSLLLKSFG-DQTDQTYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQAY+VN GT+PVLEEF+P+KH L S +S EKATN +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAYKVNHGTKPVLEEFIPMKH-LASDQSSEKATN-MSDKANWMTSA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ SEG TKQQ + +T+PKE+DIGFS+ SPK LDNK QRN GGAFLPFSKERN C Sbjct: 117 QLWSQASEG-TKQQPT--ITTPKESDIGFSI-SPKLALDNK-QRNGGGAFLPFSKERNSC 171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL+L EKE MEDKK E E G+SCQ + DG + DQ Sbjct: 172 -QGSTLR-PLPELALAYAEKE-MEDKKLR-PEVEIKGVSCQSK----KENSGSDGTVVDQ 223 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 224 GKGGSPV-----ASSHATTTTTTSSAQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 278 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 279 QIRELMK 285 >KOM33086.1 hypothetical protein LR48_Vigan01g264200 [Vigna angularis] Length = 461 Score = 361 bits (926), Expect 
= e-118 Identities = 220/318 (69%), Positives = 238/318 (74%), Gaps = 11/318 (3%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTDQTYKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPQSYSLLLKSFG-DQTDQTYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTN-----------AMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNI 574 QLLTN AMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKATN Sbjct: 59 QLLTNVITHAILFIEIAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKATN- 115 Query: 575 VSDKANWMTSAQLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGA 754 +SDKANWMTSAQLWSQ SEG KQQ + +TSPKEADIGFS+ SPK LDNK RN GGA Sbjct: 116 MSDKANWMTSAQLWSQASEG-AKQQPT--LTSPKEADIGFSI-SPKLALDNK-PRNGGGA 170 Query: 755 FLPFSKERNPCSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXX 934 FLPFSKERN C QGSTLR PLPEL+L S+EKE MEDKK +EAEK G+SCQ + Sbjct: 171 FLPFSKERNSC-QGSTLR-PLPELALASSEKE-MEDKKR--AEAEK-GLSCQNK----KE 220 Query: 935 XXXXDGAIFDQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQ 1114 DGA+ DQG GSPV RKARRCWSPDLHRRFVNALQ Sbjct: 221 NSGSDGAVVDQGKSGSPV-------ASSLAQTTTTTSQTHRKARRCWSPDLHRRFVNALQ 273 Query: 1115 MLGGSQVATPKQIRELMK 1168 MLGGSQVATPKQIRELMK Sbjct: 274 MLGGSQVATPKQIRELMK 291 >KHN27391.1 Two-component response regulator ARR18 [Glycine soja] Length = 428 Score = 358 bits (919), Expect = e-118 Identities = 213/307 (69%), Positives = 231/307 (75%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTD +YKLEEFL+ LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPQSYSLLLKSFG-DQTDHSYKLEEFLNRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKATN +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKATN-MSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ S TKQQ +T+ KE+DIGFS+ SPK LDNK QRN GGAFLPFSKERN C Sbjct: 116 QLWSQASSEGTKQQPP--ITTLKESDIGFSI-SPKLALDNK-QRNGGGAFLPFSKERNSC 
171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL L S EKE MEDKK +E E G+SCQ R DGA+ DQ Sbjct: 172 -QGSTLR-PLPELVLASAEKE-MEDKKR--AEVEIKGVSCQSR----KENSGSDGAVVDQ 222 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 223 GKGGSPV-----ASSHAQTTTTTTSAQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 277 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 278 QIRELMK 284 >XP_003525655.1 PREDICTED: uncharacterized protein LOC100807925 [Glycine max] KRH56661.1 hypothetical protein GLYMA_05G011500 [Glycine max] Length = 454 Score = 358 bits (919), Expect = e-117 Identities = 213/307 (69%), Positives = 231/307 (75%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKP SYS+LLKSFG DQTD +YKLEEFL+ LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPQSYSLLLKSFG-DQTDHSYKLEEFLNRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNAMEASRQQLQA++VN G +PVLEEF+P+KH ++ES EKATN +SDKANWMTSA Sbjct: 59 QLLTNAMEASRQQLQAFKVNHGAKPVLEEFIPMKH--LASESSEKATN-MSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ S TKQQ +T+ KE+DIGFS+ SPK LDNK QRN GGAFLPFSKERN C Sbjct: 116 QLWSQASSEGTKQQPP--ITTLKESDIGFSI-SPKLALDNK-QRNGGGAFLPFSKERNSC 171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 QGSTLR PLPEL L S EKE MEDKK +E E G+SCQ R DGA+ DQ Sbjct: 172 -QGSTLR-PLPELVLASAEKE-MEDKKR--AEVEIKGVSCQSR----KENSGSDGAVVDQ 222 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G GSPV RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 223 GKGGSPV-----ASSHAQTTTTTTSAQTHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 277 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 278 QIRELMK 284 >XP_004508555.1 PREDICTED: uncharacterized protein LOC101500047 [Cicer arietinum] Length = 449 Score = 337 bits (865), Expect = e-109 Identities = 198/310 (63%), Positives = 228/310 (73%), Gaps = 3/310 (0%) Frame = 
+2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSELGL DCKPHSYSMLLKS G +Q+DQ+YKLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELGL-DCKPHSYSMLLKSVG-EQSDQSYKLEEFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNI--VSDKANWMT 601 QLLTNAMEAS+QQLQA++VNQGTRP+LEEF+PVK ++E+ EK TN VSDKANWMT Sbjct: 59 QLLTNAMEASKQQLQAFKVNQGTRPILEEFIPVKQ--VTSETSEKTTNNNNVSDKANWMT 116 Query: 602 SAQLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERN 781 SAQLWSQTSE TKQQQS+ + +IGF+++SPK +++N + NGG FLPFSKERN Sbjct: 117 SAQLWSQTSELGTKQQQSS--NDETDINIGFNINSPKAVIENNNKHRNGGGFLPFSKERN 174 Query: 782 -PCSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAI 958 CSQG LPEL+L S +KE++EDKKH V+E EK +KR+ + Sbjct: 175 SSCSQG---HNTLPELALASNQKEVVEDKKH-VAEGEK----GEKRE----NNNISGNEV 222 Query: 959 FDQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVA 1138 D +KGSPV RKARRCWSPDLHRRFVNALQMLGGSQVA Sbjct: 223 VD--HKGSPV------ASSHTHTQTTTNTNTHRKARRCWSPDLHRRFVNALQMLGGSQVA 274 Query: 1139 TPKQIRELMK 1168 TPKQIRELMK Sbjct: 275 TPKQIRELMK 284 >XP_015898154.1 PREDICTED: uncharacterized protein LOC107431689 [Ziziphus jujuba] Length = 462 Score = 335 bits (859), Expect = e-108 Identities = 195/308 (63%), Positives = 221/308 (71%), Gaps = 1/308 (0%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L +CKPHSYSMLLKS G DQT KLEE LS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-ECKPHSYSMLLKSIGDHPADQTRKLEETLSRLEEERLKIDAFKRELPLCM 59 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLT A+E SRQQLQAYR NQG RPVLEEF+P+KH +++E EK+TNI SDKANWMTSA Sbjct: 60 QLLTQAVENSRQQLQAYRANQGPRPVLEEFIPLKH--SNSEGSEKSTNI-SDKANWMTSA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNN-GGAFLPFSKERNP 784 QLWSQ S+ +TK Q + +TS KEADIGF++ SPK DNKLQRNN GGAFLPFSK+R+ Sbjct: 117 QLWSQASD-VTKPQST--ITSTKEADIGFNV-SPKLAFDNKLQRNNGGGAFLPFSKDRSS 172 Query: 785 CSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFD 964 S TLR 
PLPEL+LVST+K++ ++KK EAE G+SC +R + + Sbjct: 173 SSSSPTLR-PLPELALVSTDKDLEDNKK--CLEAEN-GVSCSRRSDNSGKIGNGGCGVIE 228 Query: 965 QGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATP 1144 G G RKARRCWSPDLHRRFVNALQMLGGSQVATP Sbjct: 229 PGKGGGSGSDGQQTNNNNNGSNTTTSSQTHRKARRCWSPDLHRRFVNALQMLGGSQVATP 288 Query: 1145 KQIRELMK 1168 KQIRELMK Sbjct: 289 KQIRELMK 296 >XP_018836930.1 PREDICTED: myb family transcription factor EFM-like isoform X1 [Juglans regia] Length = 467 Score = 322 bits (825), Expect = e-103 Identities = 195/307 (63%), Positives = 218/307 (71%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKPHSYSMLLKSFG DQ DQT K+E+FLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPHSYSMLLKSFG-DQIDQTQKIEDFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EA RQQLQAYR NQG RPVLEEF+P+K +++ES EK++N SDKANWMTSA Sbjct: 59 QLLTNAVEAHRQQLQAYRGNQGPRPVLEEFIPLKQ--STSESSEKSSN-NSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ S+G + QS I SPKE DIGFS+ SPK D K + NGGAF PFSKERN C Sbjct: 116 QLWSQASDG--TKSQSAINISPKEPDIGFSV-SPKLAFDGK--QRNGGAFHPFSKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 TLR LP+L+L S +++ +EDKK E E GISC +RD G + +Q Sbjct: 171 -PSPTLR-DLPDLALASADQKELEDKK--CLETEN-GISCPRRDQNIIGKGGNGGVVVEQ 225 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 226 G-KGP--GGNSSDGQTTVTTTAPNTSQSHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 282 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 283 QIRELMK 289 >XP_003528276.1 PREDICTED: uncharacterized protein LOC100809196 [Glycine max] KHN23993.1 Two-component response regulator-like APRR2 [Glycine soja] KRH54846.1 hypothetical protein GLYMA_06G213400 [Glycine max] Length = 467 Score = 317 bits (812), Expect = e-101 Identities = 198/309 (64%), Positives = 217/309 (70%), Gaps = 2/309 (0%) Frame = +2 Query: 248 
MASPSELGLIDCKPHSYSMLLKSF-GGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLC 424 M S +EL ++D KP+SYS LLKSF +TDQTYKLEEFLS LEEER+KIDAFKRELPLC Sbjct: 1 MPSQAELSMMDYKPYSYSTLLKSFLDQTETDQTYKLEEFLSRLEEERVKIDAFKRELPLC 60 Query: 425 MQLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTS 604 MQLLTNA+EASRQQLQA+R NQGTRPVLEEFMP+ + S ES EK +NI SDKANWMTS Sbjct: 61 MQLLTNAVEASRQQLQAFRSNQGTRPVLEEFMPILKHPNSQESAEKTSNI-SDKANWMTS 119 Query: 605 AQLWSQTSEGITKQQQSTIMTSPKE-ADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERN 781 AQLWSQ SEG + QSTI + PKE ADIGFS+ SPK LDNK NGGAFLPFSKERN Sbjct: 120 AQLWSQASEG--TKPQSTITSLPKEGADIGFSV-SPKLALDNK--HRNGGAFLPFSKERN 174 Query: 782 PCSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIF 961 C QG LR LPEL+L S EKE+ E+K EAEK C KR+ +G Sbjct: 175 SC-QG--LR-GLPELALASPEKEIEENKCE--LEAEK----CSKRE-NPGKGGSCEGVNV 223 Query: 962 DQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVAT 1141 DQG S RKARRCWSPDLHRRFVNALQMLGGSQVAT Sbjct: 224 DQGKSAS--VASEAQTANTTTTTTNTSGQTHRKARRCWSPDLHRRFVNALQMLGGSQVAT 281 Query: 1142 PKQIRELMK 1168 PKQIRELMK Sbjct: 282 PKQIRELMK 290 >XP_018836998.1 PREDICTED: myb family transcription factor EFM-like isoform X2 [Juglans regia] Length = 466 Score = 316 bits (809), Expect = e-101 Identities = 194/307 (63%), Positives = 217/307 (70%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKPHSYSMLLKSFG DQ DQT K+E+FLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELSL-DCKPHSYSMLLKSFG-DQIDQTQKIEDFLSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EA RQQLQAYR NQG RPVLEEF+P+K +++ES EK++N SDKANWMTSA Sbjct: 59 QLLTNAVEAHRQQLQAYRGNQGPRPVLEEFIPLKQ--STSESSEKSSN-NSDKANWMTSA 115 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ S+G + QS I SPKE DIGFS+ SPK D K + NGGAF PFSKERN C Sbjct: 116 QLWSQASDG--TKSQSAINISPKEPDIGFSV-SPKLAFDGK--QRNGGAFHPFSKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 TLR LP+L+L S +++ +EDKK E E 
GISC +RD G + +Q Sbjct: 171 -PSPTLR-DLPDLALASADQKELEDKK--CLETEN-GISCPRRDQNIIGKGGNGGVVVEQ 225 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG RKARRCWSPDLHRRFVNALQMLGGSQ ATPK Sbjct: 226 G-KGP--GGNSSDGQTTVTTTAPNTSQSHRKARRCWSPDLHRRFVNALQMLGGSQ-ATPK 281 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 282 QIRELMK 288 >KHN39723.1 Two-component response regulator ARR18 [Glycine soja] Length = 462 Score = 315 bits (806), Expect = e-100 Identities = 196/309 (63%), Positives = 219/309 (70%), Gaps = 2/309 (0%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGD-QTDQTYKLEEFLSHLEEERLKIDAFKRELPLC 424 M+S EL + D KP+SYS LLKS+ + +TDQT+KLEEFLS LEEER+KIDAFKRELPLC Sbjct: 1 MSSQVELSM-DYKPYSYSTLLKSYADETETDQTHKLEEFLSRLEEERVKIDAFKRELPLC 59 Query: 425 MQLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTS 604 MQLLTNA+EASRQQLQA+R NQGTRPVLEEFMP+KH S ES EK +NI SDKANWMTS Sbjct: 60 MQLLTNAVEASRQQLQAFRSNQGTRPVLEEFMPIKHP-NSQESTEKTSNI-SDKANWMTS 117 Query: 605 AQLWSQTSEGITKQQQSTIMTSPKE-ADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERN 781 AQLWSQ SEG + QSTI TSPK AD+GFS+ SP P LDNK NGGAFLPFSKERN Sbjct: 118 AQLWSQASEG--TKPQSTI-TSPKNGADMGFSV-SPNPALDNK--HRNGGAFLPFSKERN 171 Query: 782 PCSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIF 961 C QG LR LPE++L S+EKEM +K E+EK C KR+ +G + Sbjct: 172 SC-QG--LR-DLPEVALASSEKEM---EKKCELESEK----CSKRENSGKGSGSCEGVV- 219 Query: 962 DQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVAT 1141 DQG S RKARRCWSPDLHRRFVNALQMLGGSQVAT Sbjct: 220 DQGKSASVASEAQTTNTTITTTTNNTTGQTHRKARRCWSPDLHRRFVNALQMLGGSQVAT 279 Query: 1142 PKQIRELMK 1168 PKQIRELMK Sbjct: 280 PKQIRELMK 288 >XP_019416025.1 PREDICTED: myb family transcription factor EFM [Lupinus angustifolius] OIV97194.1 hypothetical protein TanjilG_28945 [Lupinus angustifolius] Length = 457 Score = 314 bits (805), Expect = e-100 Identities = 183/307 (59%), Positives = 214/307 (69%), Gaps = 1/307 (0%) Frame = +2 Query: 251 ASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCMQ 430 +SP+EL L DCKP 
SYSMLLKSFG Q+DQ KLEEFLS LEEERLKIDAFKRELPLCMQ Sbjct: 4 SSPAELSL-DCKPQSYSMLLKSFGDHQSDQCNKLEEFLSRLEEERLKIDAFKRELPLCMQ 62 Query: 431 LLTNAMEASRQQLQAYRVN-QGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 LLTNA+EAS+QQL R N Q TRP++E+F+P+K + S E+ +KA+N+ +KANWMTSA Sbjct: 63 LLTNAVEASKQQLHTIRTNHQVTRPIMEDFIPIKQS-NSEENTDKASNMFDNKANWMTSA 121 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQTSEGITK QSTI + + DIGF +PK L NK NGGAF+PFSKE NPC Sbjct: 122 QLWSQTSEGITK-PQSTITPTKESYDIGFMSMNPKLALHNK--HRNGGAFIPFSKELNPC 178 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 GS+ +PEL+LVST ++ +E+K E E +C KR+ DG + DQ Sbjct: 179 PLGSSALRDVPELALVSTAEKDLEEK--NCEEVE----TCYKRE--NLEKGGNDGGVIDQ 230 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG+ V RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 231 G-KGAEV----------ACEGQTTNVATNRKARRCWSPDLHRRFVNALQMLGGSQVATPK 279 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 280 QIRELMK 286 >XP_003609208.1 myb-like transcription factor family protein [Medicago truncatula] AES91405.1 myb-like transcription factor family protein [Medicago truncatula] Length = 434 Score = 313 bits (801), Expect = e-100 Identities = 197/310 (63%), Positives = 214/310 (69%), Gaps = 3/310 (0%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MAS SELGL DCKPHSYSMLLKSFG +Q+DQ+YKLEEF+S LEEERLKIDAFKRELPLCM Sbjct: 1 MASSSELGL-DCKPHSYSMLLKSFG-EQSDQSYKLEEFVSRLEEERLKIDAFKRELPLCM 58 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNI-VSDKANWMTS 604 QLLTNAMEAS+QQLQA+R NQG +P+LEEF+PVK LTS+E+ EK TN V D ANWMTS Sbjct: 59 QLLTNAMEASKQQLQAFRSNQGAKPILEEFIPVKQ-LTSSETLEKTTNNNVCDMANWMTS 117 Query: 605 AQLWSQTSEGITKQQQ-STIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERN 781 AQLWSQTSE TKQQQ ST + +IGF++ SPK NGGAFLPFSKERN Sbjct: 118 AQLWSQTSELGTKQQQNSTKENNDNNNNIGFNI-SPK--------HRNGGAFLPFSKERN 168 Query: 782 PCS-QGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAI 958 S QG LPEL+L ST+KE EDKKH V EAEK G 
Sbjct: 169 NSSCQGQ----GLPELALASTQKE--EDKKH-VGEAEK--------------GKTNSGNE 207 Query: 959 FDQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVA 1138 D KGSPV RKARRCWSPDLHRRFVNALQMLGGSQVA Sbjct: 208 VDNQGKGSPV------ASSQTQTTSNNSNQTHRKARRCWSPDLHRRFVNALQMLGGSQVA 261 Query: 1139 TPKQIRELMK 1168 TPKQIRELMK Sbjct: 262 TPKQIRELMK 271 >XP_003524007.1 PREDICTED: transcription factor LUX-like [Glycine max] KRH63037.1 hypothetical protein GLYMA_04G151000 [Glycine max] Length = 462 Score = 312 bits (800), Expect = 1e-99 Identities = 195/309 (63%), Positives = 218/309 (70%), Gaps = 2/309 (0%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGD-QTDQTYKLEEFLSHLEEERLKIDAFKRELPLC 424 M+S EL + D KP+SYS LLKS+ + +TDQT+KLEEFLS LEEER+KIDAFKRELPLC Sbjct: 1 MSSQVELSM-DYKPYSYSTLLKSYADETETDQTHKLEEFLSRLEEERVKIDAFKRELPLC 59 Query: 425 MQLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTS 604 MQLLTNA+EASRQQLQA+R NQGTRPV EEFMP+KH S ES EK +NI SDKANWMTS Sbjct: 60 MQLLTNAVEASRQQLQAFRSNQGTRPVREEFMPIKHP-NSQESTEKTSNI-SDKANWMTS 117 Query: 605 AQLWSQTSEGITKQQQSTIMTSPKE-ADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERN 781 AQLWSQ SEG + QSTI TSPK AD+GFS+ SP P LDNK NGGAFLPFSKERN Sbjct: 118 AQLWSQASEG--TKPQSTI-TSPKNGADMGFSV-SPNPALDNK--HRNGGAFLPFSKERN 171 Query: 782 PCSQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIF 961 C QG LR LPE++L S+EKEM +K E+EK C KR+ +G + Sbjct: 172 SC-QG--LR-DLPEVALASSEKEM---EKKCELESEK----CSKRENSGKGSGSCEGVV- 219 Query: 962 DQGNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVAT 1141 DQG S RKARRCWSPDLHRRFVNALQMLGGSQVAT Sbjct: 220 DQGKSASVASEAQTTNTTITTTTNNTTGQTHRKARRCWSPDLHRRFVNALQMLGGSQVAT 279 Query: 1142 PKQIRELMK 1168 PKQIRELMK Sbjct: 280 PKQIRELMK 288 >XP_017610040.1 PREDICTED: myb family transcription factor EFM [Gossypium arboreum] Length = 471 Score = 312 bits (799), Expect = 2e-99 Identities = 192/307 (62%), Positives = 221/307 (71%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASP+ L L DCKPHSYSMLLKSFG Q DQT KLE+FL 
LEEERLKIDAFKRELPLCM Sbjct: 1 MASPTGLTL-DCKPHSYSMLLKSFGDQQIDQTQKLEDFLGRLEEERLKIDAFKRELPLCM 59 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EASRQQL A R NQG+RPVLEEF+P+K+ +S+E+ +K+ NI SDKANWMTSA Sbjct: 60 QLLTNAVEASRQQLHACRANQGSRPVLEEFIPLKN--SSSENSDKSQNI-SDKANWMTSA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ TK QS++ SPKEADIGF++ SPK LD K + NGGAFLPFSK+RN Sbjct: 117 QLWSQAGNE-TKPPQSSV-ASPKEADIGFNV-SPKLGLDTK--QRNGGAFLPFSKDRNNP 171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 GSTL+ PLP+L+L S ++ M+DKK SEAE CQ+R+ GA+ +Q Sbjct: 172 CPGSTLQ-PLPDLALASMNED-MDDKK--CSEAEN---GCQRRE--NSGKTGNGGALVEQ 222 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG+ RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 223 G-KGT--SCNAAEGQTTNGNTNTNTGQPHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 279 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 280 QIRELMK 286 >XP_016690013.1 PREDICTED: uncharacterized protein LOC107907243 [Gossypium hirsutum] Length = 471 Score = 312 bits (799), Expect = 2e-99 Identities = 192/307 (62%), Positives = 221/307 (71%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASP+ L L DCKPHSYSMLLKSFG Q DQT KLE+FL LEEERLKIDAFKRELPLCM Sbjct: 1 MASPTGLTL-DCKPHSYSMLLKSFGDQQIDQTQKLEDFLGRLEEERLKIDAFKRELPLCM 59 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EASRQQL A R NQG+RPVLEEF+P+K+ +S+E+ +K+ NI SDKANWMTSA Sbjct: 60 QLLTNAVEASRQQLHACRANQGSRPVLEEFIPLKN--SSSENSDKSQNI-SDKANWMTSA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ TK QS++ SPKEADIGF++ SPK LD K + NGGAFLPFSK+RN Sbjct: 117 QLWSQAGNE-TKPPQSSV-ASPKEADIGFNV-SPKLGLDTK--QRNGGAFLPFSKDRNNP 171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 GSTL+ PLP+L+L S ++ M+DKK SEAE CQ+R+ GA+ +Q Sbjct: 172 CPGSTLQ-PLPDLALASMNED-MDDKK--CSEAEN---GCQRRE--NSGKTSNGGALVEQ 222 Query: 968 
GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG+ RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 223 G-KGT--SCNAAEGQTTNGNTNTNTGQPHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 279 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 280 QIRELMK 286 >KHG04305.1 Two-component response regulator ARR18 -like protein [Gossypium arboreum] Length = 471 Score = 312 bits (799), Expect = 2e-99 Identities = 192/307 (62%), Positives = 221/307 (71%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASP+ L L DCKPHSYSMLLKSFG Q DQT KLE+FL LEEERLKIDAFKRELPLCM Sbjct: 1 MASPTGLTL-DCKPHSYSMLLKSFGDQQIDQTQKLEDFLGRLEEERLKIDAFKRELPLCM 59 Query: 428 QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EASRQQL A R NQG+RPVLEEF+P+K+ +S+E+ +K+ NI SDKANWMTSA Sbjct: 60 QLLTNAVEASRQQLHACRANQGSRPVLEEFIPLKN--SSSENSDKSQNI-SDKANWMTSA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ TK QS++ SPKEADIGF++ SPK LD K + NGGAFLPFSK+RN Sbjct: 117 QLWSQAGNE-TKPPQSSV-ASPKEADIGFNV-SPKLGLDTK--QRNGGAFLPFSKDRNNP 171 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 GSTL+ PLP+L+L S ++ M+DKK SEAE CQ+R+ GA+ +Q Sbjct: 172 CPGSTLQ-PLPDLALASMNED-MDDKK--CSEAEN---GCQRRE--NSGKTGNGGALVEQ 222 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 G KG+ RKARRCWSPDLHRRFVNALQMLGGSQVATPK Sbjct: 223 G-KGT--SCNAAEGQTTNGNTNTNTGQPHRKARRCWSPDLHRRFVNALQMLGGSQVATPK 279 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 280 QIRELMK 286 >XP_007013309.1 PREDICTED: myb family transcription factor EFM [Theobroma cacao] EOY30928.1 Homeodomain-like superfamily protein [Theobroma cacao] Length = 467 Score = 311 bits (797), Expect = 4e-99 Identities = 190/307 (61%), Positives = 219/307 (71%) Frame = +2 Query: 248 MASPSELGLIDCKPHSYSMLLKSFGGDQTDQTYKLEEFLSHLEEERLKIDAFKRELPLCM 427 MASPSEL L DCKPHSYSMLLKSFG Q DQT KLEEFLS LEEERLKIDAFKRELPLCM Sbjct: 1 MASPSELTL-DCKPHSYSMLLKSFGDQQIDQTQKLEEFLSRLEEERLKIDAFKRELPLCM 59 Query: 428 
QLLTNAMEASRQQLQAYRVNQGTRPVLEEFMPVKHNLTSAESFEKATNIVSDKANWMTSA 607 QLLTNA+EASRQQL A R N G+RPVLEEFMP+K+ +S+E+ EK+ NI SDKANWMT+A Sbjct: 60 QLLTNAVEASRQQLLACRANHGSRPVLEEFMPLKN--SSSENSEKSQNI-SDKANWMTTA 116 Query: 608 QLWSQTSEGITKQQQSTIMTSPKEADIGFSMSSPKPILDNKLQRNNGGAFLPFSKERNPC 787 QLWSQ TK Q S +TSPKE +IGF++ SPK LD K NGGAFLPF+KERN C Sbjct: 117 QLWSQAGNE-TKPQSS--ITSPKETEIGFNV-SPKLALDTK--PRNGGAFLPFTKERNSC 170 Query: 788 SQGSTLRTPLPELSLVSTEKEMMEDKKHGVSEAEKLGISCQKRDXXXXXXXXXDGAIFDQ 967 GS L+ LP+L+L S K+ MEDK+ S+ E G+SCQ+R+ +G + + Sbjct: 171 -PGSALQA-LPDLALASANKD-MEDKR--CSDTEN-GMSCQRRE---NSGKVSNGVVVIE 221 Query: 968 GNKGSPVXXXXXXXXXXXXXXXXXXXXXXRKARRCWSPDLHRRFVNALQMLGGSQVATPK 1147 +G+ RKARRCWSPDLHRRFVNALQ+LGGSQVATPK Sbjct: 222 QGRGT---ANTIDGQTANTNPSANTTQPHRKARRCWSPDLHRRFVNALQLLGGSQVATPK 278 Query: 1148 QIRELMK 1168 QIRELMK Sbjct: 279 QIRELMK 285