BLASTX nr result
ID: Phellodendron21_contig00003765
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Phellodendron21_contig00003765
         (2027 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

EOX96336.1 Basic helix-loop-helix DNA-binding superfamily protei...  473   e-158
XP_017969485.1 PREDICTED: transcription factor UNE10 isoform X1 ...  470   e-157
XP_006445332.1 hypothetical protein CICLE_v10020053mg [Citrus cl...  456   e-151
XP_012083633.1 PREDICTED: transcription factor UNE10 [Jatropha c...  444   e-146
XP_016695355.1 PREDICTED: transcription factor UNE10-like [Gossy...  438   e-144
XP_017613142.1 PREDICTED: transcription factor UNE10 isoform X2 ...  435   e-143
XP_012489734.1 PREDICTED: transcription factor UNE10 [Gossypium ...  434   e-143
OAY62458.1 hypothetical protein MANES_01G269700 [Manihot esculenta]  431   e-142
KJB41054.1 hypothetical protein B456_007G088300 [Gossypium raimo...  430   e-141
OMO69522.1 hypothetical protein CCACVL1_19455 [Corchorus capsula...  428   e-141
XP_017613140.1 PREDICTED: transcription factor UNE10 isoform X1 ...  429   e-141
XP_016694956.1 PREDICTED: transcription factor UNE10-like [Gossy...  428   e-141
KDO85687.1 hypothetical protein CISIN_1g012387mg [Citrus sinensis]   425   e-140
EOX96338.1 Basic helix-loop-helix DNA-binding superfamily protei...  418   e-137
KJB41053.1 hypothetical protein B456_007G088300 [Gossypium raimo...  420   e-137
XP_007052181.2 PREDICTED: transcription factor UNE10 isoform X2 ...  416   e-136
EOX96337.1 Basic helix-loop-helix DNA-binding superfamily protei...
416 e-136 OMP00672.1 hypothetical protein COLO4_12473 [Corchorus olitorius] 414 e-135 XP_015582839.1 PREDICTED: transcription factor UNE10 [Ricinus co... 415 e-135 XP_003516808.1 PREDICTED: transcription factor UNE10-like [Glyci... 413 e-135 >EOX96336.1 Basic helix-loop-helix DNA-binding superfamily protein isoform 1 [Theobroma cacao] Length = 470 Score = 473 bits (1217), Expect = e-158 Identities = 281/472 (59%), Positives = 307/472 (65%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+NP R SLRS SNSTAPDVPMLDYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDDNPAIARHSLRSNSNSTAPDVPMLDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATT------GLPQLKPQLDPWFDQQRAAA- 1409 AKP N+TS +KYTW+KPRA GTLESIVNQAT+ L + +L PWFD RAA Sbjct: 61 AKPL-NSTSPSKYTWDKPRAGGTLESIVNQATSFPYRNVSLDGGRDELVPWFDHHRAAVA 119 Query: 1408 -------------DALVPCSNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVAT 1289 DALVPCSNRS D RTT V+ GTC V CS RVGSCSGP T Sbjct: 120 AAAVASSSATMTMDALVPCSNRSED-RTTHVMESIRGLGGTCVVGCSTRVGSCSGPT-GT 177 Query: 1288 KDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTS 1109 +D+ VL PE SSK+ + S SATFG DSQHVT D+Y+ D GVGFTSTS Sbjct: 178 QDDGVLLTGKRAREARVSVAPEWSSKDQNASASATFGTDSQHVTVDSYEKDFGVGFTSTS 237 Query: 1108 MGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIH 932 +GSPENTSS + TKATTADDHDSV HSRP R+A SS+STKRSRAAAIH Sbjct: 238 LGSPENTSSPRPCTKATTADDHDSVCHSRPQRKAGEEDKRKETGK-SSVSTKRSRAAAIH 296 Query: 931 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXX 752 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 297 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV---HMMSRMNIP 353 Query: 751 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMA 572 M++M RPNITG M Sbjct: 354 PMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGVMDMSTMGRPNITGISPVLPNPFVTMT 413 Query: 571 SWDGSGDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 WDGSGDRLQ ++ M DPL+ FLACQSQP+TMDAYSRMAA+YQQMQ PPA Sbjct: 414 
PWDGSGDRLQAASAAVMPDPLSAFLACQSQPITMDAYSRMAAMYQQMQHPPA 465 >XP_017969485.1 PREDICTED: transcription factor UNE10 isoform X1 [Theobroma cacao] Length = 470 Score = 470 bits (1210), Expect = e-157 Identities = 280/472 (59%), Positives = 305/472 (64%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+NP R SLRS SNSTAPDVPMLDYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDDNPAIARHSLRSNSNSTAPDVPMLDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATT------GLPQLKPQLDPWFDQQRAAA- 1409 AKP N+TS +KYTW+KPRA GTLESIVNQAT+ L + +L PWFD RAA Sbjct: 61 AKPL-NSTSPSKYTWDKPRAGGTLESIVNQATSFPYRNVSLDGGRDELVPWFDHHRAAVA 119 Query: 1408 -------------DALVPCSNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVAT 1289 DALVPCSNRS D RTT V+ GTC V CS VGSCSGP T Sbjct: 120 AAAVASSSATMTMDALVPCSNRSED-RTTHVMESIRGLGGTCVVGCSTMVGSCSGPT-GT 177 Query: 1288 KDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTS 1109 +D+ VL PE SSK+ + S SATFG DSQHVT D+Y+ D GVGFTSTS Sbjct: 178 QDDGVLLTGKRAREARVSVAPEWSSKDQNASASATFGTDSQHVTVDSYEKDFGVGFTSTS 237 Query: 1108 MGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIH 932 +GSPENTSS + TKATTADDHDSV HSRP R+A SS+STKRSRAAAIH Sbjct: 238 LGSPENTSSPRPCTKATTADDHDSVCHSRPQRKAGEEDKRKETGK-SSVSTKRSRAAAIH 296 Query: 931 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXX 752 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 297 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV---HMMSRMNIP 353 Query: 751 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMA 572 M +M RPNITG M Sbjct: 354 PMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGVMDMRTMGRPNITGISPVLPNPFVTMT 413 Query: 571 SWDGSGDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 WDGSGDRLQ ++ M DPL+ FLACQSQP+TMDAYSRMAA+YQQMQ PPA Sbjct: 414 PWDGSGDRLQAASAAVMPDPLSAFLACQSQPITMDAYSRMAAMYQQMQHPPA 465 >XP_006445332.1 hypothetical protein CICLE_v10020053mg [Citrus clementina] XP_006490847.1 PREDICTED: transcription factor UNE10 [Citrus sinensis] ESR58572.1 
hypothetical protein CICLE_v10020053mg [Citrus clementina] KDO85686.1 hypothetical protein CISIN_1g012387mg [Citrus sinensis] Length = 464 Score = 456 bits (1172), Expect = e-151 Identities = 289/470 (61%), Positives = 309/470 (65%), Gaps = 30/470 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDEN-PNHTRPSLRSRSNSTAPDVPML--DYEVAELTWENGQLAMHGLGQP 1577 MSQCVPSWDLDEN PN+ R SLRSRSNSTAPDVPML DYEVAELTWENGQLAMHGLG P Sbjct: 1 MSQCVPSWDLDENYPNNCRASLRSRSNSTAPDVPMLELDYEVAELTWENGQLAMHGLGPP 60 Query: 1576 RVPAKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLK-----PQLDPW------F 1430 RVPAK AANN S TK T SGTLESIVNQAT+ LPQ + P LD + F Sbjct: 61 RVPAKAAANNPSPTKNT-----CSGTLESIVNQATS-LPQAQRNGKPPLLDEFATAPCCF 114 Query: 1429 DQQRAAA---DALVPCSNRSSDGRTTPVIGTC---AVDCSARVGSCSGPVV-----ATKD 1283 QQR + DALVPCSNR S+ RTT V+ S RVGSCSGPV +TKD Sbjct: 115 HQQRPSMTTMDALVPCSNRRSEERTTQVMDPAPRVGGTRSIRVGSCSGPVPLPIPDSTKD 174 Query: 1282 EDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHV--THDTYDMDT--GVGFTS 1115 +DVLNGK E SS++ S SGSATFGR+SQ V THDTYDMD GVGFT Sbjct: 175 DDVLNGKRARVARVPVAP-EWSSRDQSFSGSATFGRESQRVSVTHDTYDMDMDMGVGFTG 233 Query: 1114 TSMGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAA 938 TSMGSPENTSSAKQ KATTADDHDSV HSRP REA KS+ISTKRSRAAA Sbjct: 234 TSMGSPENTSSAKQGNKATTADDHDSVCHSRPLREAGDEEYKKKGNGKSTISTKRSRAAA 293 Query: 937 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXX 758 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 294 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV-----QVMSR 348 Query: 757 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXX 578 MNSM+RPNIT Sbjct: 349 MNMPPMMLPMAMQQQLQMSMLSSMGMGMGMGMGMGVMDMNSMSRPNITS-MPPLLHPFLP 407 Query: 577 MASWDGSGDRLQVSAMTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPP 428 +ASWDG GDRLQ S MTDPL+TFLACQ Q +MDAY+RMAA+YQQMQQ P Sbjct: 408 LASWDGLGDRLQASPMTDPLSTFLACQPQAASMDAYNRMAAMYQQMQQQP 457 >XP_012083633.1 PREDICTED: transcription factor UNE10 [Jatropha curcas] KDP28805.1 hypothetical protein JCGZ_14576 [Jatropha curcas] Length = 474 Score = 444 bits (1141), 
Expect = e-146 Identities = 271/474 (57%), Positives = 298/474 (62%), Gaps = 34/474 (7%) Frame = -2 Query: 1747 MSQCVPSWDLDE-NPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRV 1571 MSQCVPSW+LD+ NP + SLRS SNSTAPDVPMLDYEVAELTWENGQLAMHGLG PR Sbjct: 1 MSQCVPSWNLDDSNPAPAKLSLRSHSNSTAPDVPMLDYEVAELTWENGQLAMHGLGPPRA 60 Query: 1570 PAKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD--------PWFDQQRA 1415 PAKP A+ S +KY W+KPRASGTLESIVNQAT LPQ K LD PWF+ RA Sbjct: 61 PAKPLAS-ASPSKYAWDKPRASGTLESIVNQATR-LPQRKLGLDACGSDELVPWFENNRA 118 Query: 1414 AA----------DALVPCSNRSSDGR------TTPVIGTCAVDCSARVGSCSGPVVATKD 1283 AA DALVPCSNR++D R + P +G C V S RVGSCSGP AT+D Sbjct: 119 AAVAASSATTTMDALVPCSNRTTDDRKKRAMESVPALGNCVVGSSTRVGSCSGPT-ATQD 177 Query: 1282 EDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTSMG 1103 ED L PE SS++ SVS SATFGRDSQHVT +T + D G+ FTSTS G Sbjct: 178 EDALLTAKRARVARVPVAPEWSSRDQSVSCSATFGRDSQHVTLETCEPDLGMDFTSTSFG 237 Query: 1102 SPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 S ENTS K TK T D++DSV HSRP RE KSS STKRSRAAAIHNQ Sbjct: 238 SQENTSCGKPGTKTATVDENDSVCHSRPQREEADEEDKKKGNVKSSASTKRSRAAAIHNQ 297 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 298 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV-----QMMSRMNMQ 352 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITG----XXXXXXXXXXX 578 MNS++RPNI Sbjct: 353 PMMLPMAMQQQLQMSMLAPMNMGIGIGMGMGVVDMNSISRPNIAAGISPALHPSAFMPVM 412 Query: 577 MASWDGSGDRLQVSA----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPP 428 ASWDGS +RLQ +A M DPL+ FLACQSQPMTMDAYSRMAA+YQQ+QQ P Sbjct: 413 AASWDGSAERLQAAASTTVMPDPLSAFLACQSQPMTMDAYSRMAAMYQQLQQQP 466 >XP_016695355.1 PREDICTED: transcription factor UNE10-like [Gossypium hirsutum] Length = 471 Score = 438 bits (1126), Expect = e-144 Identities = 271/472 (57%), Positives = 301/472 (63%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD++ R SLRS SNSTAPDV M 
DYEVAELTWENGQLAMHGLG RVP Sbjct: 1 MSQCVPSWDLDDHHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPARVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDGDRDELVPCLNQHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GR + P +G TC V S RVGSCSGP DE + Sbjct: 119 ASSAAMAMDALVPCSKRT-EGRPSHAMESIPGLGRTCLVGGSTRVGSCSGPAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFG--RDSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFG RDS++VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKSTRAARAPLMP-EWSSKEQSASASATFGKDRDSRYVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 PEN SS K TKATT ADDHDSV HSRP R+ SS+S KRSRAAAIHNQ Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQRKEFEEDKKETGK--SSVSNKRSRAAAIHNQ 294 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 295 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQVQMMNRMNIPQMMLP 354 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASW 566 +N+M RPNITG M SW Sbjct: 355 MAMQQQLQMSMMAPAMGMGMGMGMGMGMGMGVMDINTMGRPNITGISPVMPNPFMAMTSW 414 Query: 565 DGSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 DGSG+RLQ +A M DPL+TFLACQ QPMTMDAYSR+AA+YQQMQQPPA Sbjct: 415 DGSGERLQQAASAAAMMPDPLSTFLACQPQPMTMDAYSRLAAMYQQMQQPPA 466 >XP_017613142.1 PREDICTED: transcription factor UNE10 isoform X2 [Gossypium arboreum] Length = 469 Score = 435 bits (1118), Expect = e-143 Identities = 272/472 (57%), Positives = 300/472 (63%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG VP Sbjct: 1 MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPASVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK 
LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDGDRDELVPCLNQHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GR + P +G TC V S RVGSCSGP DE + Sbjct: 119 ASSATMAMDALVPCSKRT-EGRPSHAMESIPGLGRTCLVGGSTRVGSCSGPAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFG--RDSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFG RDS++VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKSTPAARAPEMP-EWSSKEQSASASATFGKDRDSRYVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 PEN SS K TKATT ADDHDSV HSRP RE SS+S KRSRAAAIHNQ Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQREEFEEDKKETGK--SSVSNKRSRAAAIHNQ 294 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 295 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV--QMMNRMNIPQMM 352 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASW 566 +N+M RPNITG M SW Sbjct: 353 LPMAMQQQLQMSMMAPAMGMGMGMGMGMGMGVMDINTMGRPNITGISPVMPNPFMAMTSW 412 Query: 565 DGSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 DGSG+RLQ +A M DPL+TFLACQ QPMTMDAYSR+AA+YQQMQQPPA Sbjct: 413 DGSGERLQQAASAAAMMPDPLSTFLACQPQPMTMDAYSRLAAMYQQMQQPPA 464 >XP_012489734.1 PREDICTED: transcription factor UNE10 [Gossypium raimondii] KJB41052.1 hypothetical protein B456_007G088300 [Gossypium raimondii] Length = 467 Score = 434 bits (1116), Expect = e-143 Identities = 273/472 (57%), Positives = 300/472 (63%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG RVP Sbjct: 1 MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPARVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDDGRDELVPCLNQHREAA 118 Query: 1408 
--------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GRT P +G TC V S RVGSCSG DE + Sbjct: 119 ASSATIAMDALVPCSKRT-EGRTAHAMESIPGLGRTCLVGGSTRVGSCSGRAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGR--DSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFGR DS+ VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKRTRAARAPLMP-EWSSKEQSASASATFGRERDSRCVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 PEN SS K TKATT ADDHDSV HSRP RE SS+S KRSRAAAIHNQ Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQREEFEEDKKETGK--SSVSNKRSRAAAIHNQ 294 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 295 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV----QMMNRMNIPQ 350 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASW 566 +N++ RPNITG M SW Sbjct: 351 MMLPMAMQQPLQMSMLAPAMGMGMGMGMGMGVMDINTIGRPNITGISPVMPNPFMAMTSW 410 Query: 565 DGSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 DGSG+RLQ +A M DPL+TFLACQSQPMTMDAYSR+AA+YQQMQQPPA Sbjct: 411 DGSGERLQQAASAAAMMPDPLSTFLACQSQPMTMDAYSRLAAMYQQMQQPPA 462 >OAY62458.1 hypothetical protein MANES_01G269700 [Manihot esculenta] Length = 454 Score = 431 bits (1109), Expect = e-142 Identities = 266/466 (57%), Positives = 299/466 (64%), Gaps = 26/466 (5%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+NP+ +LRS+SNS APDVPM YEVAELTWENGQLAMHGLG PRVP Sbjct: 1 MSQCVPSWDLDDNPSPANQTLRSQSNSVAPDVPMFQYEVAELTWENGQLAMHGLGPPRVP 60 Query: 1567 AKPAANNTSHTKYT-WEKPRASGTLESIVNQATTGLPQLKP----------QLDPWFDQQ 1421 AKP A +TS +KYT W+KPRA+GTLESIVNQAT+ LP KP ++ PWF+ Sbjct: 61 AKPMA-STSPSKYTSWDKPRANGTLESIVNQATS-LPHRKPGLKNSGCGSEEIVPWFEHN 118 Query: 1420 RAAA----------DALVPCSNRSSDGRTTPVIGTCAVDCSARVGSCSGPVVATKDEDVL 1271 RAA DA+VPCSNR+++ R+ V+G C V S RVGSCSGP V +E L Sbjct: 119 RAAVVPAASATMTMDAMVPCSNRTNE-RSAHVMGNCVVGSSTRVGSCSGPAVTQDEETPL 177 Query: 1270 
NGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTSMGSPEN 1091 NGK PE SS++ SVSGSAT GRDSQ D GVGFTSTS GS EN Sbjct: 178 NGK-RQRVARVPVAPEWSSRQ-SVSGSATVGRDSQR--------DLGVGFTSTSFGSQEN 227 Query: 1090 TSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQSERK 914 SS+K TK T AD++DSV +SRP REA KSS+STKRSRAAAIHNQSERK Sbjct: 228 NSSSKPGTKTTAADENDSVCYSRPQREAGDEEEEKKGNGKSSVSTKRSRAAAIHNQSERK 287 Query: 913 RRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXXXXXX 734 RRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 288 RRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV------QMMNRMNMQPLI 341 Query: 733 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITG-XXXXXXXXXXXMASWDGS 557 MNS+ RPNI G MASWDGS Sbjct: 342 LPMAMQQQLQMSMLNMGMGVGMGMGVNVMDMNSVARPNIGGLSPVLHPTPFIPMASWDGS 401 Query: 556 GDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPP 428 GDRLQ S+ M DPL+ FLACQSQP+ MDAYSRMAA+YQQ+QQ P Sbjct: 402 GDRLQSSSNTVMPDPLSAFLACQSQPIPMDAYSRMAAIYQQLQQQP 447 >KJB41054.1 hypothetical protein B456_007G088300 [Gossypium raimondii] Length = 468 Score = 430 bits (1106), Expect = e-141 Identities = 269/471 (57%), Positives = 297/471 (63%), Gaps = 30/471 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG RVP Sbjct: 1 MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPARVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDDGRDELVPCLNQHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GRT P +G TC V S RVGSCSG DE + Sbjct: 119 ASSATIAMDALVPCSKRT-EGRTAHAMESIPGLGRTCLVGGSTRVGSCSGRAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGR--DSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFGR DS+ VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKRTRAARAPLMP-EWSSKEQSASASATFGRERDSRCVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 
PENTSSAKQRTKATT-ADDHDSVHSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQS 923 PEN SS K TKATT ADDHDSV P+ KSS+S KRSRAAAIHNQS Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQAKEEFEEDKKETGKSSVSNKRSRAAAIHNQS 296 Query: 922 ERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXXX 743 ERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 297 ERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV----QMMNRMNIPQM 352 Query: 742 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASWD 563 +N++ RPNITG M SWD Sbjct: 353 MLPMAMQQPLQMSMLAPAMGMGMGMGMGMGVMDINTIGRPNITGISPVMPNPFMAMTSWD 412 Query: 562 GSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 GSG+RLQ +A M DPL+TFLACQSQPMTMDAYSR+AA+YQQMQQPPA Sbjct: 413 GSGERLQQAASAAAMMPDPLSTFLACQSQPMTMDAYSRLAAMYQQMQQPPA 463 >OMO69522.1 hypothetical protein CCACVL1_19455 [Corchorus capsularis] Length = 428 Score = 428 bits (1100), Expect = e-141 Identities = 267/457 (58%), Positives = 286/457 (62%), Gaps = 16/457 (3%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLDENP R SLRS SNSTAPDVPM DYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDENPVTARHSLRSNSNSTAPDVPMSDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKP-RASGTLESIVNQATTGLPQLKPQLDPWFDQQRAAADALVPC 1391 KP+ N+T+ TKY WEKP RASGTLESIVNQAT F DALVPC Sbjct: 61 TKPSLNSTAPTKYAWEKPARASGTLESIVNQATQ------------FPYPTMTMDALVPC 108 Query: 1390 SNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVATKDEDVLNGKXXXXXXXXXX 1232 SNRS D RTT V+ GTC V CS RVGSCSGP ++E +L GK Sbjct: 109 SNRSED-RTTHVMESIPGLGGTCVVGCSTRVGSCSGPAGNQEEEVLLTGK-RAKEARVPV 166 Query: 1231 XPEGSSKEHS--VSGSATFGRDSQHVTHDTYDMDTGVGFTSTSMGSPENTSSAKQRTKAT 1058 PE SSK+ S S SATFGRDSQHVT DTY+ D GVGFTST SP+NTS T Sbjct: 167 APEWSSKDQSACASASATFGRDSQHVTVDTYEKDLGVGFTST---SPDNTS--------T 215 Query: 1057 TADDHDSVHSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQSERKRRDKINQRMKTL 878 ADDHDS REA KSS+STKRSRAAAIHNQSERKRRDKINQRMKTL Sbjct: 216 KADDHDS------REA-GEEDKQKETGKSSVSTKRSRAAAIHNQSERKRRDKINQRMKTL 268 Query: 877 
QKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 698 QKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 269 QKLVPNSSKTDKASMLDEVIEYLKQLQAQV--NMMSRMNMPPMMLPMTMQQQLQMSMMAP 326 Query: 697 XXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASWDGSGDRLQVSA----- 533 MNSM RPN++G M SWDGSGDRLQ +A Sbjct: 327 MGMGMGMGMGMAGMGVMDMNSMGRPNMSGISPVMPNPFMTMTSWDGSGDRLQAAAAASAA 386 Query: 532 -MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 + DPL+ FLACQSQPMTM+AYSRMAA+YQQMQQPPA Sbjct: 387 VIPDPLSAFLACQSQPMTMEAYSRMAAMYQQMQQPPA 423 >XP_017613140.1 PREDICTED: transcription factor UNE10 isoform X1 [Gossypium arboreum] Length = 477 Score = 429 bits (1102), Expect = e-141 Identities = 271/478 (56%), Positives = 300/478 (62%), Gaps = 37/478 (7%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG VP Sbjct: 1 MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPASVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDGDRDELVPCLNQHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GR + P +G TC V S RVGSCSGP DE + Sbjct: 119 ASSATMAMDALVPCSKRT-EGRPSHAMESIPGLGRTCLVGGSTRVGSCSGPAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFG--RDSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFG RDS++VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKSTPAARAPEMP-EWSSKEQSASASATFGKDRDSRYVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPRE------AXXXXXXXXXXXKSSISTKRSRA 944 PEN SS K TKATT ADDHDSV HSRP + KSS+S KRSRA Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQAKFFPFNYREEFEEDKKETGKSSVSNKRSRA 296 Query: 943 AAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXX 764 AAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 297 AAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV--QMMNRM 354 Query: 763 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXX 
584 +N+M RPNITG Sbjct: 355 NIPQMMLPMAMQQQLQMSMMAPAMGMGMGMGMGMGMGVMDINTMGRPNITGISPVMPNPF 414 Query: 583 XXMASWDGSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 M SWDGSG+RLQ +A M DPL+TFLACQ QPMTMDAYSR+AA+YQQMQQPPA Sbjct: 415 MAMTSWDGSGERLQQAASAAAMMPDPLSTFLACQPQPMTMDAYSRLAAMYQQMQQPPA 472 >XP_016694956.1 PREDICTED: transcription factor UNE10-like [Gossypium hirsutum] Length = 469 Score = 428 bits (1101), Expect = e-141 Identities = 270/472 (57%), Positives = 298/472 (63%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG RVP Sbjct: 1 MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPARVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P ++ R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDDGRDELVPCLNKHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GRT P +G TC V S RVGSCSG DE + Sbjct: 119 ASSATIAMDALVPCSKRT-EGRTAHAMESIPGLGRTCLVGGSTRVGSCSGRAGTNDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGR--DSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFGR DS+ VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKRTRAARAPLMP-EWSSKEQSASASATFGRERDSRCVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 PEN SS K TKATT ADDHDSV HSRP RE SS+S KRSRAAAIHNQ Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQREEFEEDKKETGK--SSVSNKRSRAAAIHNQ 294 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMK QKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 295 SERKRRDKINQRMKPPQKLVPNSSKTDKASMLDEVIEYLKQLQAQV--QMMNRMNIPQMM 352 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASW 566 +N++ RPNITG M SW Sbjct: 353 LPMAMQQPLQMSMLAPAMGMGMGMGMGMGMGVMDINTIGRPNITGISPVMPNPFMAMTSW 412 Query: 565 DGSGDRLQVSA-----MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 DGSG+RLQ +A M 
DPL+TFLACQSQPMTMDAYSR+AA+YQQMQQPPA Sbjct: 413 DGSGERLQQAASAAAMMPDPLSTFLACQSQPMTMDAYSRLAAMYQQMQQPPA 464 >KDO85687.1 hypothetical protein CISIN_1g012387mg [Citrus sinensis] Length = 438 Score = 425 bits (1092), Expect = e-140 Identities = 274/449 (61%), Positives = 291/449 (64%), Gaps = 30/449 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDEN-PNHTRPSLRSRSNSTAPDVPML--DYEVAELTWENGQLAMHGLGQP 1577 MSQCVPSWDLDEN PN+ R SLRSRSNSTAPDVPML DYEVAELTWENGQLAMHGLG P Sbjct: 1 MSQCVPSWDLDENYPNNCRASLRSRSNSTAPDVPMLELDYEVAELTWENGQLAMHGLGPP 60 Query: 1576 RVPAKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLK-----PQLDPW------F 1430 RVPAK AANN S TK T SGTLESIVNQAT+ LPQ + P LD + F Sbjct: 61 RVPAKAAANNPSPTKNT-----CSGTLESIVNQATS-LPQAQRNGKPPLLDEFATAPCCF 114 Query: 1429 DQQRAAA---DALVPCSNRSSDGRTTPVIGTC---AVDCSARVGSCSGPVV-----ATKD 1283 QQR + DALVPCSNR S+ RTT V+ S RVGSCSGPV +TKD Sbjct: 115 HQQRPSMTTMDALVPCSNRRSEERTTQVMDPAPRVGGTRSIRVGSCSGPVPLPIPDSTKD 174 Query: 1282 EDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHV--THDTYDMDT--GVGFTS 1115 +DVLNGK E SS++ S SGSATFGR+SQ V THDTYDMD GVGFT Sbjct: 175 DDVLNGKRARVARVPVAP-EWSSRDQSFSGSATFGRESQRVSVTHDTYDMDMDMGVGFTG 233 Query: 1114 TSMGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAA 938 TSMGSPENTSSAKQ KATTADDHDSV HSRP REA KS+ISTKRSRAAA Sbjct: 234 TSMGSPENTSSAKQGNKATTADDHDSVCHSRPLREAGDEEYKKKGNGKSTISTKRSRAAA 293 Query: 937 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXX 758 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 294 IHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV-----QVMSR 348 Query: 757 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXX 578 MNSM+RPNIT Sbjct: 349 MNMPPMMLPMAMQQQLQMSMLSSMGMGMGMGMGMGVMDMNSMSRPNITS-MPPLLHPFLP 407 Query: 577 MASWDGSGDRLQVSAMTDPLATFLACQSQ 491 +ASWDG GDRLQ S MTDPL+TFLACQ Q Sbjct: 408 LASWDGLGDRLQASPMTDPLSTFLACQPQ 436 >EOX96338.1 Basic helix-loop-helix DNA-binding superfamily protein isoform 3 [Theobroma cacao] Length = 448 Score = 418 bits (1075), Expect = e-137 Identities = 259/472 (54%), Positives = 
285/472 (60%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+NP R SLRS SNSTAPDVPMLDYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDDNPAIARHSLRSNSNSTAPDVPMLDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATT------GLPQLKPQLDPWFDQQRAAA- 1409 AKP N+TS +KYTW+KPRA GTLESIVNQAT+ L + +L PWFD RAA Sbjct: 61 AKPL-NSTSPSKYTWDKPRAGGTLESIVNQATSFPYRNVSLDGGRDELVPWFDHHRAAVA 119 Query: 1408 -------------DALVPCSNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVAT 1289 DALVPCSNRS D RTT V+ GTC V CS RVGSCSGP T Sbjct: 120 AAAVASSSATMTMDALVPCSNRSED-RTTHVMESIRGLGGTCVVGCSTRVGSCSGPT-GT 177 Query: 1288 KDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTS 1109 +D+ VL PE SSK+ + S SATFG DSQHVT D+Y+ D GVGFTSTS Sbjct: 178 QDDGVLLTGKRAREARVSVAPEWSSKDQNASASATFGTDSQHVTVDSYEKDFGVGFTSTS 237 Query: 1108 MGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIH 932 +GSPENTSS + TKATTADDHDSV HSRP R+A SS+STKRSRAAAIH Sbjct: 238 LGSPENTSSPRPCTKATTADDHDSVCHSRPQRKAGEEDKRKETGK-SSVSTKRSRAAAIH 296 Query: 931 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXX 752 NQSER TDKASMLDEVI+YLKQLQAQV Sbjct: 297 NQSER----------------------TDKASMLDEVIEYLKQLQAQV---HMMSRMNIP 331 Query: 751 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMA 572 M++M RPNITG M Sbjct: 332 PMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGVMDMSTMGRPNITGISPVLPNPFVTMT 391 Query: 571 SWDGSGDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 WDGSGDRLQ ++ M DPL+ FLACQSQP+TMDAYSRMAA+YQQMQ PPA Sbjct: 392 PWDGSGDRLQAASAAVMPDPLSAFLACQSQPITMDAYSRMAAMYQQMQHPPA 443 >KJB41053.1 hypothetical protein B456_007G088300 [Gossypium raimondii] Length = 493 Score = 420 bits (1079), Expect = e-137 Identities = 273/498 (54%), Positives = 300/498 (60%), Gaps = 57/498 (11%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+N R SLRS SNSTAPDV M DYEVAELTWENGQLAMHGLG RVP Sbjct: 1 
MSQCVPSWDLDDNHVTARHSLRSNSNSTAPDVHMSDYEVAELTWENGQLAMHGLGPARVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQLD-------PWFDQQRAAA 1409 AKP +N +KYTW+KPRA+GTLESIVNQAT +P LK LD P +Q R AA Sbjct: 61 AKPLVSNPP-SKYTWDKPRANGTLESIVNQATR-VPYLKVSLDDGRDELVPCLNQHREAA 118 Query: 1408 --------DALVPCSNRSSDGRTT------PVIG-TCAVDCSARVGSCSGPVVATKDEDV 1274 DALVPCS R+ +GRT P +G TC V S RVGSCSG DE + Sbjct: 119 ASSATIAMDALVPCSKRT-EGRTAHAMESIPGLGRTCLVGGSTRVGSCSGRAGTHDDEVL 177 Query: 1273 LNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGR--DSQHVTHDTYDMDTGVGFTSTSMGS 1100 ++GK E SSKE S S SATFGR DS+ VT DTY+ D G+GFTSTS+GS Sbjct: 178 VSGKRTRAARAPLMP-EWSSKEQSASASATFGRERDSRCVTLDTYEKDFGMGFTSTSLGS 236 Query: 1099 PENTSSAKQRTKATT-ADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQ 926 PEN SS K TKATT ADDHDSV HSRP RE SS+S KRSRAAAIHNQ Sbjct: 237 PENASSTKPCTKATTTADDHDSVCHSRPQREEFEEDKKETGK--SSVSNKRSRAAAIHNQ 294 Query: 925 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXX 746 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 295 SERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV----QMMNRMNIPQ 350 Query: 745 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASW 566 +N++ RPNITG M SW Sbjct: 351 MMLPMAMQQPLQMSMLAPAMGMGMGMGMGMGVMDINTIGRPNITGISPVMPNPFMAMTSW 410 Query: 565 DGSGDRLQVSA-----MTDPLATFLACQS--------------------------QPMTM 479 DGSG+RLQ +A M DPL+TFLACQS QPMTM Sbjct: 411 DGSGERLQQAASAAAMMPDPLSTFLACQSQVTFVSHHVCVYRLSILLSKINRTLLQPMTM 470 Query: 478 DAYSRMAALYQQMQQPPA 425 DAYSR+AA+YQQMQQPPA Sbjct: 471 DAYSRLAAMYQQMQQPPA 488 >XP_007052181.2 PREDICTED: transcription factor UNE10 isoform X2 [Theobroma cacao] Length = 448 Score = 416 bits (1068), Expect = e-136 Identities = 258/472 (54%), Positives = 283/472 (59%), Gaps = 31/472 (6%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLD+NP R SLRS SNSTAPDVPMLDYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDDNPAIARHSLRSNSNSTAPDVPMLDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 
AKPAANNTSHTKYTWEKPRASGTLESIVNQATT------GLPQLKPQLDPWFDQQRAAA- 1409 AKP N+TS +KYTW+KPRA GTLESIVNQAT+ L + +L PWFD RAA Sbjct: 61 AKPL-NSTSPSKYTWDKPRAGGTLESIVNQATSFPYRNVSLDGGRDELVPWFDHHRAAVA 119 Query: 1408 -------------DALVPCSNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVAT 1289 DALVPCSNRS D RTT V+ GTC V CS VGSCSGP T Sbjct: 120 AAAVASSSATMTMDALVPCSNRSED-RTTHVMESIRGLGGTCVVGCSTMVGSCSGPT-GT 177 Query: 1288 KDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTS 1109 +D+ VL PE SSK+ + S SATFG DSQHVT D+Y+ D GVGFTSTS Sbjct: 178 QDDGVLLTGKRAREARVSVAPEWSSKDQNASASATFGTDSQHVTVDSYEKDFGVGFTSTS 237 Query: 1108 MGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIH 932 +GSPENTSS + TKATTADDHDSV HSRP R+A SS+STKRSRAAAIH Sbjct: 238 LGSPENTSSPRPCTKATTADDHDSVCHSRPQRKAGEEDKRKETGK-SSVSTKRSRAAAIH 296 Query: 931 NQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXX 752 NQSER TDKASMLDEVI+YLKQLQAQV Sbjct: 297 NQSER----------------------TDKASMLDEVIEYLKQLQAQV---HMMSRMNIP 331 Query: 751 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMA 572 M +M RPNITG M Sbjct: 332 PMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGVMDMRTMGRPNITGISPVLPNPFVTMT 391 Query: 571 SWDGSGDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 WDGSGDRLQ ++ M DPL+ FLACQSQP+TMDAYSRMAA+YQQMQ PPA Sbjct: 392 PWDGSGDRLQAASAAVMPDPLSAFLACQSQPITMDAYSRMAAMYQQMQHPPA 443 >EOX96337.1 Basic helix-loop-helix DNA-binding superfamily protein isoform 2 [Theobroma cacao] Length = 478 Score = 416 bits (1070), Expect = e-136 Identities = 253/438 (57%), Positives = 278/438 (63%), Gaps = 31/438 (7%) Frame = -2 Query: 1645 LDYEVAELTWENGQLAMHGLGQPRVPAKPAANNTSHTKYTWEKPRASGTLESIVNQATT- 1469 LDYEVAELTWENGQLAMH LG PRVPAKP N+TS +KYTW+KPRA GTLESIVNQAT+ Sbjct: 43 LDYEVAELTWENGQLAMHSLGPPRVPAKPL-NSTSPSKYTWDKPRAGGTLESIVNQATSF 101 Query: 1468 -----GLPQLKPQLDPWFDQQRAAA--------------DALVPCSNRSSDGRTTPVI-- 1352 L + +L PWFD RAA DALVPCSNRS D RTT V+ Sbjct: 102 PYRNVSLDGGRDELVPWFDHHRAAVAAAAVASSSATMTMDALVPCSNRSED-RTTHVMES 160 Query: 1351 
-----GTCAVDCSARVGSCSGPVVATKDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSA 1187 GTC V CS RVGSCSGP T+D+ VL PE SSK+ + S SA Sbjct: 161 IRGLGGTCVVGCSTRVGSCSGPT-GTQDDGVLLTGKRAREARVSVAPEWSSKDQNASASA 219 Query: 1186 TFGRDSQHVTHDTYDMDTGVGFTSTSMGSPENTSSAKQRTKATTADDHDSV-HSRPPREA 1010 TFG DSQHVT D+Y+ D GVGFTSTS+GSPENTSS + TKATTADDHDSV HSRP R+A Sbjct: 220 TFGTDSQHVTVDSYEKDFGVGFTSTSLGSPENTSSPRPCTKATTADDHDSVCHSRPQRKA 279 Query: 1009 XXXXXXXXXXXKSSISTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASML 830 SS+STKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASML Sbjct: 280 GEEDKRKETGK-SSVSTKRSRAAAIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASML 338 Query: 829 DEVIDYLKQLQAQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 650 DEVI+YLKQLQAQV Sbjct: 339 DEVIEYLKQLQAQV---HMMSRMNIPPMMFPMTMQQQLQMSMMAPMGMGMGMGMGIGMGV 395 Query: 649 XXMNSMTRPNITGXXXXXXXXXXXMASWDGSGDRLQVSA---MTDPLATFLACQSQPMTM 479 M++M RPNITG M WDGSGDRLQ ++ M DPL+ FLACQSQP+TM Sbjct: 396 MDMSTMGRPNITGISPVLPNPFVTMTPWDGSGDRLQAASAAVMPDPLSAFLACQSQPITM 455 Query: 478 DAYSRMAALYQQMQQPPA 425 DAYSRMAA+YQQMQ PPA Sbjct: 456 DAYSRMAAMYQQMQHPPA 473 >OMP00672.1 hypothetical protein COLO4_12473 [Corchorus olitorius] Length = 431 Score = 414 bits (1064), Expect = e-135 Identities = 261/457 (57%), Positives = 282/457 (61%), Gaps = 16/457 (3%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWDLDENP R SLRS SNSTAPDVPM DYEVAELTWENGQLAMH LG PRVP Sbjct: 1 MSQCVPSWDLDENPVTARHSLRSNSNSTAPDVPMSDYEVAELTWENGQLAMHSLGPPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKP-RASGTLESIVNQATTGLPQLKPQLDPWFDQQRAAADALVPC 1391 KP N+T+ TKY WEKP RASGTLESIVNQAT + P LD + LVP Sbjct: 61 TKPL-NSTAPTKYAWEKPARASGTLESIVNQATQFPYRKIPTLDG------GGGEELVPW 113 Query: 1390 SNRSSDGRTTPVI-------GTCAVDCSARVGSCSGPVVATKDEDVLNGKXXXXXXXXXX 1232 S + RTT V+ GTC V CS RVGSCSGP ++E +L GK Sbjct: 114 S----EDRTTHVMESIPGLGGTCVVGCSTRVGSCSGPAGTQEEEVLLTGKRAKEARVPVA 169 Query: 1231 XPEGSSKEHSV--SGSATFGRDSQHVTHDTYDMDTGVGFTSTSMGSPENTSSAKQRTKAT 1058 E SSK+ S S SATFGRDSQHVT DTY+ D GVGFTSTS P+NTS T Sbjct: 170 
P-EWSSKDQSACASASATFGRDSQHVTVDTYEKDLGVGFTSTS---PDNTS--------T 217 Query: 1057 TADDHDSVHSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQSERKRRDKINQRMKTL 878 ADDHDS REA KSS+STKRSRAAAIHNQSERKRRDKINQRMKTL Sbjct: 218 KADDHDS------REAGEEEDKQKETGKSSVSTKRSRAAAIHNQSERKRRDKINQRMKTL 271 Query: 877 QKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 698 QKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 272 QKLVPNSSKTDKASMLDEVIEYLKQLQAQV--NMMSRMNMPPMMLPMTMQQQLQMSMMAP 329 Query: 697 XXXXXXXXXXXXXXXXXXMNSMTRPNITGXXXXXXXXXXXMASWDGSGDRLQVSA----- 533 MNSM RPN++G M SWDGSGDRLQ +A Sbjct: 330 MGMGMGMGMGMAGMGVMDMNSMGRPNMSGISPVMPNPFMTMTSWDGSGDRLQAAAAASAA 389 Query: 532 -MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPPA 425 + DPL+ FLACQSQPMTM+AYSRMAA+YQQMQQPPA Sbjct: 390 VIPDPLSAFLACQSQPMTMEAYSRMAAMYQQMQQPPA 426 >XP_015582839.1 PREDICTED: transcription factor UNE10 [Ricinus communis] Length = 472 Score = 415 bits (1066), Expect = e-135 Identities = 259/477 (54%), Positives = 294/477 (61%), Gaps = 37/477 (7%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHT-RPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRV 1571 M+QCVPSWDL++NP+ + S RS SNS+APDVPMLDYEVAELTWENGQL+MHGLG PR+ Sbjct: 1 MTQCVPSWDLEDNPSPAAKHSFRSNSNSSAPDVPMLDYEVAELTWENGQLSMHGLGPPRL 60 Query: 1570 PAKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKP----------QLDPWFDQQ 1421 P K ++ S +KYTWEKPRA GTLESIVNQAT LPQ + ++ PW Sbjct: 61 PVKTIPSS-SPSKYTWEKPRAGGTLESIVNQATR-LPQQRKTDNITGYGSNEVVPWLGHH 118 Query: 1420 ----RAAA-------DALVPCSNRSSDGRTTPVI--------GTCAVDCSARVGSCSGPV 1298 RAA DALVPC+ +S D R+ VI G C V S RVGSCS P Sbjct: 119 HHHHRAATSSPTMTMDALVPCTKQSDDHRSAHVIDSVPAGIGGNCVVGSSTRVGSCSAPT 178 Query: 1297 VATKDEDVLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFT 1118 AT+DE+ L PE SS++ SVSGSATFGRDS HVT DT +MD GVGFT Sbjct: 179 TATQDEEALLAAKRARVARVPVAPEWSSRDQSVSGSATFGRDSHHVTLDTCEMDLGVGFT 238 Query: 1117 STSMGSPENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAA 941 STS GS ENT +A T D++DSV HSR REA KSS+STKRSRAA Sbjct: 239 STSFGSQENTKTA------TAVDENDSVCHSRHQREAGDDDDKQKANGKSSVSTKRSRAA 292 Query: 940 
AIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXX 761 AIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVI+YLKQLQAQV Sbjct: 293 AIHNQSERKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIEYLKQLQAQV----QMMSR 348 Query: 760 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITG-XXXXXXXXX 584 MN+++RPNI G Sbjct: 349 MNIQPVMLPMTMQQQLQMSMLAPMNMGMGLAGIGMNVMDMNTISRPNIAGISPVLHPTAF 408 Query: 583 XXMASWDGS--GDRLQVSA---MTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPP 428 M SWDGS GDRLQ ++ M DPLA FLACQ+QPMTMDAYSRMAA+YQQ+QQ P Sbjct: 409 MPMTSWDGSSGGDRLQTASPTVMHDPLAAFLACQTQPMTMDAYSRMAAIYQQLQQQP 465 >XP_003516808.1 PREDICTED: transcription factor UNE10-like [Glycine max] KRH75301.1 hypothetical protein GLYMA_01G076900 [Glycine max] Length = 458 Score = 413 bits (1062), Expect = e-135 Identities = 256/474 (54%), Positives = 283/474 (59%), Gaps = 34/474 (7%) Frame = -2 Query: 1747 MSQCVPSWDLDENPNHTRPSLRSRSNSTAPDVPMLDYEVAELTWENGQLAMHGLGQPRVP 1568 MSQCVPSWD+++NP +R SLRS SNSTAPDVPMLDYEVAELTWENGQL+MHGLG PRVP Sbjct: 1 MSQCVPSWDVEDNPPPSRVSLRSNSNSTAPDVPMLDYEVAELTWENGQLSMHGLGLPRVP 60 Query: 1567 AKPAANNTSHTKYTWEKPRASGTLESIVNQATTGLPQLKPQ--------------LDPWF 1430 KP T+ KYTWEKPRASGTLESIVNQ T+ + KP PWF Sbjct: 61 VKPPTAVTN--KYTWEKPRASGTLESIVNQVTSFPHRGKPTPLNGGGGGGVYGNFRVPWF 118 Query: 1429 DQQRAAA-------DALVPCSNR--SSDGRTTPVIGTCAVDCSARVGSCSGPVVATKDED 1277 D A DALVPCSNR S G + GTC V CS RVGSC G Sbjct: 119 DPHATATTTNTVTMDALVPCSNREQSKQGMESVPGGTCMVGCSTRVGSCCG--------- 169 Query: 1276 VLNGKXXXXXXXXXXXPEGSSKEHSVSGSATFGRDSQHVTHDTYDMDTGVGFTSTSMGSP 1097 GK E + ++ SVSGSATFGRDS+HVT DT D + GVGFTSTS+ S Sbjct: 170 ---GKGAKGH-------EATGRDQSVSGSATFGRDSKHVTLDTCDREFGVGFTSTSINSL 219 Query: 1096 ENTSSAKQRTKATTADDHDSV-HSRPPREAXXXXXXXXXXXKSSISTKRSRAAAIHNQSE 920 ENTSSAK TK TT DDHDSV HS+P E KSS+STKRSRAAAIHNQSE Sbjct: 220 ENTSSAKHCTKTTTVDDHDSVSHSKPVGEDQDEGKKKRANGKSSVSTKRSRAAAIHNQSE 279 Query: 919 RKRRDKINQRMKTLQKLVPNSSKTDKASMLDEVIDYLKQLQAQVXXXXXXXXXXXXXXXX 740 RKRRDKINQRMKTLQKLVPNSSK+DKASMLDEVI+YLKQLQAQ+ Sbjct: 280 
RKRRDKINQRMKTLQKLVPNSSKSDKASMLDEVIEYLKQLQAQL--QMINRINMSSMMLP 337 Query: 739 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMNSMTRPNITG--XXXXXXXXXXXMASW 566 MNSM R +I G ASW Sbjct: 338 LTMQQQLQMSMMSPMGMGLGMGMGMGMGMGMDMNSMNRAHIPGIPPVLHPSAFMPMAASW 397 Query: 565 D-----GSGDRLQ---VSAMTDPLATFLACQSQPMTMDAYSRMAALYQQMQQPP 428 D G GDRLQ + M DPL+TF CQSQPMT+DAYSR+AA+YQQ+ QPP Sbjct: 398 DAAAAAGGGDRLQGTPANVMPDPLSTFFGCQSQPMTIDAYSRLAAMYQQLHQPP 451