BLASTX nr result
ID: Sinomenium21_contig00007455
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00007455 (853 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

emb|CAN79228.1| hypothetical protein VITISV_011041 [Vitis vinifera]   196   1e-47
ref|XP_002273471.2| PREDICTED: uncharacterized protein LOC100266...   192   1e-46
ref|XP_006375135.1| hypothetical protein POPTR_0014s04670g [Popu...   157   4e-36
ref|XP_002511585.1| conserved hypothetical protein [Ricinus comm...   154   4e-35
ref|XP_006290620.1| hypothetical protein CARUB_v10016712mg [Caps...   145   2e-32
gb|EYU31892.1| hypothetical protein MIMGU_mgv1a0019612mg, partia...   143   9e-32
gb|EXB37647.1| hypothetical protein L484_021854 [Morus notabilis]     140   6e-31
ref|XP_002878328.1| DNA binding protein [Arabidopsis lyrata subs...   140   6e-31
ref|XP_006464424.1| PREDICTED: uncharacterized protein LOC102627...   139   1e-30
ref|XP_006445430.1| hypothetical protein CICLE_v10018930mg [Citr...   139   1e-30
ref|XP_006347962.1| PREDICTED: uncharacterized protein LOC102598...   139   1e-30
ref|XP_004164995.1| PREDICTED: uncharacterized protein LOC101224...   138   3e-30
ref|XP_004140629.1| PREDICTED: uncharacterized protein LOC101219...   138   3e-30
ref|NP_191591.2| uncharacterized protein [Arabidopsis thaliana] ...   137   7e-30
ref|XP_006397664.1| hypothetical protein EUTSA_v10001315mg [Eutr...   134   3e-29
ref|XP_002301229.1| hypothetical protein POPTR_0002s13850g [Popu...   134   3e-29
dbj|BAD93791.1| bZIP like protein [Arabidopsis thaliana]              134   3e-29
ref|XP_006402594.1| hypothetical protein EUTSA_v10005793mg [Eutr...   134   4e-29
ref|XP_006402593.1| hypothetical protein EUTSA_v10005793mg [Eutr...
134 4e-29 ref|XP_007052313.1| BZIP domain class transcription factor [Theo... 134 4e-29 >emb|CAN79228.1| hypothetical protein VITISV_011041 [Vitis vinifera] Length = 924 Score = 196 bits (497), Expect = 1e-47 Identities = 103/175 (58%), Positives = 121/175 (69%), Gaps = 4/175 (2%) Frame = -3 Query: 515 PPGSDFYRRSKEDDNDEE---QHRHLDHDDDGETTEREEVHCREWGDHYXXXXXXXXXXX 345 PP S+F+RR D E QH+H DDDGE TEREEV C EWGDHY Sbjct: 265 PPDSEFFRRPTNGDKHSERLHQHQHHHLDDDGEETEREEVQCSEWGDHYSTTSSSDEGDV 324 Query: 344 XXXXXXXXXEVGTRSNFGSSSYNDASP-KSNLRPIGKSEKSDEAGSSVSWNAGNGEISDM 168 +G RSNFGSS +N+ + KS P KS+KSD+AGSSVS+ AG GEISD+ Sbjct: 325 ESRSE-----IGNRSNFGSSVHNEPTTVKSKFPPASKSDKSDDAGSSVSYRAGTGEISDL 379 Query: 167 KMVVRHRDLAEITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 K+VVRHRDL+EI A++K+YFDKAA +GE VSE+LE GRAQLDRSFRQLKKTVYHS Sbjct: 380 KIVVRHRDLSEIVASLKEYFDKAASSGERVSEMLEIGRAQLDRSFRQLKKTVYHS 434 >ref|XP_002273471.2| PREDICTED: uncharacterized protein LOC100266818 isoform 1 [Vitis vinifera] Length = 845 Score = 192 bits (489), Expect = 1e-46 Identities = 102/175 (58%), Positives = 120/175 (68%), Gaps = 4/175 (2%) Frame = -3 Query: 515 PPGSDFYRRSKEDDNDEE---QHRHLDHDDDGETTEREEVHCREWGDHYXXXXXXXXXXX 345 PP S+F+RR D E QH+H DDDGE TEREEV C EWGDHY Sbjct: 283 PPDSEFFRRPTNGDKHSERLHQHQHHHLDDDGEETEREEVQCSEWGDHYSTTSSSDEGDV 342 Query: 344 XXXXXXXXXEVGTRSNFGSSSYNDASP-KSNLRPIGKSEKSDEAGSSVSWNAGNGEISDM 168 +G RSNFGSS +N+ + KS P KS K D+AGSSVS++AG GEISD+ Sbjct: 343 ESRSE-----IGNRSNFGSSVHNEPTTVKSKFPPASKSNKFDDAGSSVSYSAGTGEISDL 397 Query: 167 KMVVRHRDLAEITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 K+VVRHRDL+EI A++K+YFD+AA AGE VSE+LE GRAQLDRSFRQLKKTVYHS Sbjct: 398 KIVVRHRDLSEIVASLKEYFDQAASAGERVSEMLEIGRAQLDRSFRQLKKTVYHS 452 >ref|XP_006375135.1| hypothetical protein POPTR_0014s04670g [Populus trichocarpa] gi|550323452|gb|ERP52932.1| hypothetical protein POPTR_0014s04670g [Populus trichocarpa] Length = 786 Score = 157 bits (398), Expect = 4e-36 Identities = 101/242 (41%), Positives = 121/242 (50%), Gaps = 38/242 (15%) 
Frame = -3 Query: 614 YNFPAYFPXXXXXXXXXXXXXXXXXWENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHDD 435 Y++P F WENFYPPSPP S+F+ R K + N ++QH HLD DD Sbjct: 156 YDYPTAFQNHSTYSTTPSQASSVWNWENFYPPSPPDSEFFAR-KANHNQQQQHPHLDTDD 214 Query: 434 DGET-----------------------------------TEREEVHCREWGDHYXXXXXX 360 + TE+EEV C EWGDH Sbjct: 215 GSSSDADEDVATERFSEYDFFNEKQYTQHKKQQQQNYSETEQEEVQCSEWGDH-DHLSNS 273 Query: 359 XXXXXXXXXXXXXXEVGTRSNFGS---SSYNDASPKSNLRPIGKSEKSDEAGSSVSWNAG 189 E+GTRSNFG S P+ GK + EAGSS + + Sbjct: 274 TTSSDEDNDTESRSEIGTRSNFGPVKHPSQQQPHPQQYDNAFGKLDNKSEAGSSTT-SYR 332 Query: 188 NGEISDMKMVVRHRDLAEITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVY 9 GE+S+MKMVVRH+DL EI AIK+ FDKAA AG+ VSE+LE GRAQLDRSFRQLKKTVY Sbjct: 333 TGEVSNMKMVVRHKDLNEIVGAIKENFDKAAAAGDQVSEMLEIGRAQLDRSFRQLKKTVY 392 Query: 8 HS 3 HS Sbjct: 393 HS 394 >ref|XP_002511585.1| conserved hypothetical protein [Ricinus communis] gi|223548765|gb|EEF50254.1| conserved hypothetical protein [Ricinus communis] Length = 809 Score = 154 bits (389), Expect = 4e-35 Identities = 99/231 (42%), Positives = 121/231 (52%), Gaps = 53/231 (22%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKE------DDNDEEQ--------------------------HR 453 ENFYPPSPP S+F+ R + DD D+++ H Sbjct: 191 ENFYPPSPPDSEFFNRKSQNHHLDTDDVDDDEPETETETETEKSEYDFFQLQHKKHNFHN 250 Query: 452 HLDHDDDG-------------------ETTEREEVHCREWGDH--YXXXXXXXXXXXXXX 336 +++DD E TEREEV C EWGDH Y Sbjct: 251 MTNNNDDSINISTNTNSKQQQHNSTADEETEREEVQCSEWGDHDHYSTTSSSEEGEEDDE 310 Query: 335 XXXXXXEVGTRSNFGSSSYNDASPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVV 156 E+GTRSNFGSS ++ + + G + KSDEAGSS S+ G E+SDMKMVV Sbjct: 311 DRESRSEIGTRSNFGSSVRAESVKQPPV--YGNATKSDEAGSSASYRTG--EVSDMKMVV 366 Query: 155 RHRDLAEITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 RH+DL EI AIK+ FDKAA AG+ VS++LE RAQLDRSFRQLKKTVYHS Sbjct: 367 RHKDLKEIVEAIKENFDKAAAAGDQVSDMLEVSRAQLDRSFRQLKKTVYHS 417 >ref|XP_006290620.1| hypothetical protein CARUB_v10016712mg [Capsella rubella] gi|482559327|gb|EOA23518.1| hypothetical protein CARUB_v10016712mg [Capsella rubella] Length = 
784 Score = 145 bits (366), Expect = 2e-32 Identities = 89/217 (41%), Positives = 111/217 (51%), Gaps = 39/217 (17%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHDDDGET---------------------- 423 ENFYPPSPP S+F+ R ++ HR+ D + E Sbjct: 176 ENFYPPSPPDSEFFNRKAQEKKQNSDHRYNHEDTETERSEYDFFDTSKQKQKQFESMSNA 235 Query: 422 ------TEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYNDASPK 261 TEREEVHC EW DH E+GTRS+ GS+ ++ Sbjct: 236 VEEETETEREEVHCSEWEDH-DHYSTTSSEEEEDDDRESISEIGTRSDLGSTVRTNSMRP 294 Query: 260 SNLRPI-----------GKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKD 114 + +P GK +K+D+A S G GEI+DMKMVVRHRDL EI AIK+ Sbjct: 295 HHQQPSPMPREYGGAEQGKYDKADDATMSPGSYRGGGEITDMKMVVRHRDLKEIVDAIKE 354 Query: 113 YFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 FDKAA AG+ VS++LE GRA+LDRSF QLKKTV HS Sbjct: 355 NFDKAASAGDQVSQMLELGRAELDRSFSQLKKTVIHS 391 >gb|EYU31892.1| hypothetical protein MIMGU_mgv1a0019612mg, partial [Mimulus guttatus] Length = 641 Score = 143 bits (360), Expect = 9e-32 Identities = 96/212 (45%), Positives = 115/212 (54%), Gaps = 38/212 (17%) Frame = -3 Query: 524 PP--SPPGSDF---------YRRSKEDDNDEEQH---------------RHLDHDDDGET 423 PP +P G DF Y R+ +++N+ H D ++DGET Sbjct: 34 PPHRNPLGKDFDDRASNYSSYSRNSDNNNNNNNHPKNHKKGQNLKRWESEDEDEEEDGET 93 Query: 422 TEREEVHCREWGDH-YXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYNDASPKS---- 258 EREEV C EWGDH + G RSNFG S N+A+ + Sbjct: 94 -EREEVQCSEWGDHDRYSSSTSSSDEGETEDLKSRSDFGPRSNFGGSMKNEANAAAANAN 152 Query: 257 -NLRPIGKSEK---SDEAGSSVSWNAGNG---EISDMKMVVRHRDLAEITAAIKDYFDKA 99 N KSEK D+ SSVSW G E SD +MVVRHRDLAEI AAIK+YFDKA Sbjct: 153 VNRDFSSKSEKLSSEDDGKSSVSWGDGGSTQKENSDRRMVVRHRDLAEIVAAIKEYFDKA 212 Query: 98 AEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 A AGE VSE+LETGRAQ+DRSF+QL+KTVYHS Sbjct: 213 ASAGEQVSEILETGRAQVDRSFKQLRKTVYHS 244 >gb|EXB37647.1| hypothetical protein L484_021854 [Morus notabilis] Length = 851 Score = 140 bits (353), Expect = 6e-31 Identities = 97/222 (43%), Positives = 115/222 (51%), Gaps = 44/222 (19%) Frame = -3 Query: 536 
ENFYPPSPPGSDFYRR-----SKE---------DDNDEE--------------------- 462 ENFYPPSPP S+F+ +KE D N +E Sbjct: 217 ENFYPPSPPDSEFFNNRAAAAAKEMRSSNHGGGDINSDEEEGTETETERSEYDFFNAKAG 276 Query: 461 QHRHLDHDDDG-------ETTEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTR 303 Q HL H+ + ET EREEV C EWGDHY +G R Sbjct: 277 QDHHLRHEKNSNHHDFASETMEREEVQCSEWGDHYSTTSSSADEDDRDSRSD----LGAR 332 Query: 302 SNFGSSSYND--ASPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEIT 129 SNFGSS+ + A+P + KS+E SS GEISDMKMVVRH+DL EI Sbjct: 333 SNFGSSARAESVAAPPPPAAAAAAT-KSEEYSSSY------GEISDMKMVVRHKDLKEIV 385 Query: 128 AAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AIK+ F+KAA AG+ VSE+LE GRAQLDRSF+QLKKTVYHS Sbjct: 386 EAIKENFEKAAAAGDQVSEMLEIGRAQLDRSFKQLKKTVYHS 427 >ref|XP_002878328.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata] gi|297324166|gb|EFH54587.1| DNA binding protein [Arabidopsis lyrata subsp. lyrata] Length = 794 Score = 140 bits (353), Expect = 6e-31 Identities = 89/222 (40%), Positives = 111/222 (50%), Gaps = 44/222 (19%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHDDDGET---------------------- 423 ENFYPPSPP S+F+ R ++ R D D + E Sbjct: 180 ENFYPPSPPDSEFFNRKAQEKKQNSDSRFNDEDTETERSEYDFFDTRKQKKKQFESMSNA 239 Query: 422 ------TEREEVHCREWGDH-----YXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYN 276 TEREEV C EW DH E+GTRS FGS+ + Sbjct: 240 VEEETETEREEVQCSEWEDHDHYSTTSSSDAAEEEEEDDDDRESISEIGTRSEFGSTVRS 299 Query: 275 DASPKSNLRPI-----------GKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEIT 129 +++ + + +P GK +K D+A S G GEI+DMKMVVRHRDL EI Sbjct: 300 NSTRRHHQQPSPMPQVYGGAEQGKYDKVDDATISSGSYRGGGEIADMKMVVRHRDLKEIV 359 Query: 128 AAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AIK+ FDKAA +GE VS++LE GRA+LDRSF QLKKTV HS Sbjct: 360 DAIKENFDKAAASGEQVSQMLELGRAELDRSFSQLKKTVIHS 401 >ref|XP_006464424.1| PREDICTED: uncharacterized protein LOC102627291 [Citrus sinensis] Length = 769 Score = 139 bits (351), Expect = 1e-30 Identities = 91/213 (42%), Positives = 110/213 (51%), Gaps = 35/213 (16%) Frame = -3 Query: 536 
ENFYPPSPPGSDFY----RRSKEDDNDEEQ----------------HRHLDHDDDG---- 429 ENFYPPSPP S+F+ R+ E D + E H+ + + Sbjct: 177 ENFYPPSPPDSEFFNQKSRQRPEPDPESESEPETEAERSEYDFFQPHQQQQKERESRPFP 236 Query: 428 -----------ETTEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSS 282 E TEREEV C EWGDHY +G RS+FGSS Sbjct: 237 NNFNYNGSVADEETEREEVQCSEWGDHYSTTTSSDDEEEEMDKESRSE-MGARSDFGSSG 295 Query: 281 YNDASPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKDYFDK 102 PI K + D A SS S+ N EISDMK+V+RH+DL EI A+KDYFDK Sbjct: 296 KQQ-------EPIKKFD--DAASSSASFR--NREISDMKLVIRHKDLKEIVEALKDYFDK 344 Query: 101 AAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AA AG+ + E+LE GRAQLDRSF+QLKKTVYHS Sbjct: 345 AASAGDQLFEILEIGRAQLDRSFKQLKKTVYHS 377 >ref|XP_006445430.1| hypothetical protein CICLE_v10018930mg [Citrus clementina] gi|557547692|gb|ESR58670.1| hypothetical protein CICLE_v10018930mg [Citrus clementina] Length = 785 Score = 139 bits (351), Expect = 1e-30 Identities = 91/213 (42%), Positives = 110/213 (51%), Gaps = 35/213 (16%) Frame = -3 Query: 536 ENFYPPSPPGSDFY----RRSKEDDNDEEQ----------------HRHLDHDDDG---- 429 ENFYPPSPP S+F+ R+ E D + E H+ + + Sbjct: 193 ENFYPPSPPDSEFFNQKSRQRPEPDPESESEPETEAERSEYDFFQPHQQQQKERESRPFP 252 Query: 428 -----------ETTEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSS 282 E TEREEV C EWGDHY +G RS+FGSS Sbjct: 253 NNFNYNGSVADEETEREEVQCSEWGDHYSTTTSSDDEEEEMDKESRSE-MGARSDFGSSG 311 Query: 281 YNDASPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKDYFDK 102 PI K + D A SS S+ N EISDMK+V+RH+DL EI A+KDYFDK Sbjct: 312 KQQ-------EPIKKFD--DAASSSASFR--NREISDMKLVIRHKDLKEIVEALKDYFDK 360 Query: 101 AAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AA AG+ + E+LE GRAQLDRSF+QLKKTVYHS Sbjct: 361 AASAGDQLFEILEIGRAQLDRSFKQLKKTVYHS 393 >ref|XP_006347962.1| PREDICTED: uncharacterized protein LOC102598946 [Solanum tuberosum] Length = 852 Score = 139 bits (350), Expect = 1e-30 Identities = 103/272 (37%), Positives = 123/272 (45%), Gaps = 94/272 (34%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRR-----------SKEDDNDEE---------------QHRHLDHDD 435 ENFYPPSPP S+++ 
R EDD D++ QH+H H Sbjct: 190 ENFYPPSPPSSEYFERVHNKNNFSREADLEDDEDDKASNYTSHSQYSQHHHQHQHQHHPS 249 Query: 434 D-----------------------------------GET---------------TEREEV 405 D GE EREEV Sbjct: 250 DQSLHGKSNKQFDFFDTDSVNDEKLANGSVRNQKKVGENKQNYAAHHLNNWESEAEREEV 309 Query: 404 HCREWGDH--YXXXXXXXXXXXXXXXXXXXXEVGT-----RSNFGSSSYNDASPKSNLR- 249 C EWGDH Y E+ + RSNFGS++ SN+ Sbjct: 310 QCSEWGDHDHYSTTTSSDDDDDEEEEVEEKDEIRSGYGPSRSNFGSTASVKNEEGSNINV 369 Query: 248 -----PIGKSEKSDEAG--SSVSWNAGNGEI---SDMKMVVRHRDLAEITAAIKDYFDKA 99 + KS+K E G SS+SW GNG++ SD MVVRH+DLAEI AAIK+YFDK Sbjct: 370 NSKNFSVKKSDKMSEDGGSSSMSWGNGNGKVEMVSDRSMVVRHKDLAEIVAAIKEYFDKT 429 Query: 98 AEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 A AGE VSE+LETGRAQLDRSF+QLKKTVYHS Sbjct: 430 ASAGEQVSEMLETGRAQLDRSFKQLKKTVYHS 461 >ref|XP_004164995.1| PREDICTED: uncharacterized protein LOC101224117 [Cucumis sativus] Length = 555 Score = 138 bits (347), Expect = 3e-30 Identities = 85/218 (38%), Positives = 112/218 (51%), Gaps = 40/218 (18%) Frame = -3 Query: 536 ENFYPPSPPGSDFY----------------------------------RRSKEDDNDEEQ 459 E+FYPPSPP S+F+ R+S+ +D Q Sbjct: 208 ESFYPPSPPSSEFFQSRSQTQIQPKPHPNNDYHDYDDETEQSEYTFFHRKSESKKDDGHQ 267 Query: 458 HRHLDHDDDGETTEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSY 279 + H D TEREEV C +WGDHY E TRSNF SS Sbjct: 268 FQQQKHHLDDTETEREEVQCSDWGDHYSTTSSSDIDEIDGTDADLRSEADTRSNFESSIR 327 Query: 278 NDA------SPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIK 117 ++ +P + + EK D+AGSS + GEISD++MVVRH+DL EI A+K Sbjct: 328 TESVAPEPVTPPPPAKYATQMEKFDDAGSSAG-SFRTGEISDLRMVVRHKDLKEIVDALK 386 Query: 116 DYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 + F+KAA AG+ VS++LE G+A+LD+SFR LKKTVYHS Sbjct: 387 ENFEKAAVAGDSVSKMLEIGKAELDKSFRHLKKTVYHS 424 >ref|XP_004140629.1| PREDICTED: uncharacterized protein LOC101219960 [Cucumis sativus] Length = 765 Score = 138 bits (347), Expect = 3e-30 Identities = 85/218 (38%), Positives = 112/218 (51%), Gaps = 40/218 (18%) Frame = -3 Query: 536 ENFYPPSPPGSDFY----------------------------------RRSKEDDNDEEQ 459 E+FYPPSPP 
S+F+ R+S+ +D Q Sbjct: 208 ESFYPPSPPSSEFFQSRSQTQIQPKPHPNNDYHDYDDETEQSEYTFFHRKSESKKDDGHQ 267 Query: 458 HRHLDHDDDGETTEREEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSY 279 + H D TEREEV C +WGDHY E TRSNF SS Sbjct: 268 FQQQKHHLDDTETEREEVQCSDWGDHYSTTSSSDIDEIDGTDADLRSEADTRSNFESSIR 327 Query: 278 NDA------SPKSNLRPIGKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIK 117 ++ +P + + EK D+AGSS + GEISD++MVVRH+DL EI A+K Sbjct: 328 TESVAPEPVTPPPPAKYATQMEKFDDAGSSAG-SFRTGEISDLRMVVRHKDLKEIVDALK 386 Query: 116 DYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 + F+KAA AG+ VS++LE G+A+LD+SFR LKKTVYHS Sbjct: 387 ENFEKAAVAGDSVSKMLEIGKAELDKSFRHLKKTVYHS 424 >ref|NP_191591.2| uncharacterized protein [Arabidopsis thaliana] gi|16604629|gb|AAL24107.1| putative bZIP protein [Arabidopsis thaliana] gi|332646523|gb|AEE80044.1| uncharacterized protein AT3G60320 [Arabidopsis thaliana] Length = 796 Score = 137 bits (344), Expect = 7e-30 Identities = 87/221 (39%), Positives = 111/221 (50%), Gaps = 43/221 (19%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHD--------------------------- 438 ENFYPPSPP S+F+ R ++ +R D D Sbjct: 183 ENFYPPSPPDSEFFNRKAQEKKHNSDNRFNDEDTETVRSEYDFFDTRKQKQKQFESMRNQ 242 Query: 437 -DDGETTEREEVHCREWGDH----YXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYND 273 ++ TEREEV C EW DH EVGTRS FGS+ ++ Sbjct: 243 VEEETETEREEVQCSEWEDHDHYSTTSSSDAAEEEEEDDDRESISEVGTRSEFGSTVRSN 302 Query: 272 ASPKSNLRPI-----------GKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITA 126 + + + +P K +K+D+A S G G+I+DMKMVVRHRDL EI Sbjct: 303 SMRRHHQQPSPMPQVYGGAEQSKYDKADDATISSGSYRGGGDIADMKMVVRHRDLKEIID 362 Query: 125 AIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AIK+ FDKAA +GE VS++LE GRA+LDRSF QLKKTV HS Sbjct: 363 AIKENFDKAAASGEQVSQMLELGRAELDRSFSQLKKTVIHS 403 >ref|XP_006397664.1| hypothetical protein EUTSA_v10001315mg [Eutrema salsugineum] gi|557098737|gb|ESQ39117.1| hypothetical protein EUTSA_v10001315mg [Eutrema salsugineum] Length = 797 Score = 134 bits (338), Expect = 3e-29 Identities = 88/224 (39%), Positives = 112/224 (50%), Gaps = 46/224 (20%) Frame = -3 
Query: 536 ENFYPPSPPGSDFYRRSKED----------DNDEEQHRHLDHD----------------- 438 ENFYPPSPP S+F+ R ++ D D+ + +HD Sbjct: 182 ENFYPPSPPDSEFFNRKSQERQQNRFGDLADGDDTETERSEHDFFHSRKQKQFESRNSAV 241 Query: 437 DDGETTEREEVHCREWGDH------YXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSS--- 285 ++ + TEREEV C EW DH E+GTRS+FGSS Sbjct: 242 EEEDETEREEVQCSEWEDHDHYSTTSSSDAAQEEEEEEEEDRESVSEIGTRSDFGSSVRT 301 Query: 284 -----SYNDASPKSNLRPIG-----KSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAE 135 ++ P + G K K+D+A +S G GE++DMKMVVRHRDL E Sbjct: 302 SSMRRDHHHHQPPPMPQEYGGTAQEKYGKADDATTSSGSYRGGGEMADMKMVVRHRDLKE 361 Query: 134 ITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 I AIK+ FDKAA AG+ VS++L GRAQLDRSF LKKTV HS Sbjct: 362 IVDAIKENFDKAASAGDQVSQMLHLGRAQLDRSFSHLKKTVIHS 405 >ref|XP_002301229.1| hypothetical protein POPTR_0002s13850g [Populus trichocarpa] gi|222842955|gb|EEE80502.1| hypothetical protein POPTR_0002s13850g [Populus trichocarpa] Length = 767 Score = 134 bits (338), Expect = 3e-29 Identities = 88/197 (44%), Positives = 103/197 (52%), Gaps = 19/197 (9%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHDDDG-------------------ETTER 414 ENFYPPSPP S+F+ R K + N QH+H DDG TE+ Sbjct: 187 ENFYPPSPPDSEFFAR-KANQNHYNQHQHHLDTDDGLSEYDFFKKKQYPQQQQIYSETEQ 245 Query: 413 EEVHCREWGDHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYNDASPKSNLRPIGKS 234 EEV C EWGDH E+ TRSNFGS + GKS Sbjct: 246 EEVQCSEWGDHDNYSKTTTSSDEEDNDTDFKSEMETRSNFGSKQQPQPQSQQADNGFGKS 305 Query: 233 EKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKDYFDKAAEAGEHVSELLETGR 54 + EAGSS + + E S MKMV RH+DL EI AIK+ FDKAA AG+ VSE+LE Sbjct: 306 DNKSEAGSSTT-SYRTRETSTMKMV-RHKDLKEIVDAIKENFDKAAAAGDQVSEMLE--- 360 Query: 53 AQLDRSFRQLKKTVYHS 3 LDR+FRQLKKTVYHS Sbjct: 361 --LDRNFRQLKKTVYHS 375 >dbj|BAD93791.1| bZIP like protein [Arabidopsis thaliana] Length = 534 Score = 134 bits (338), Expect = 3e-29 Identities = 86/221 (38%), Positives = 110/221 (49%), Gaps = 43/221 (19%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHD--------------------------- 438 ENFYPPSPP S+F+ R ++ +R D D Sbjct: 183 
ENFYPPSPPDSEFFNRKAQEKKHNSDNRFNDEDTETVRSEYDFFDTRKQKQKQFESMRNQ 242 Query: 437 -DDGETTEREEVHCREWGDH----YXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYND 273 ++ TEREEV C EW DH EVGTRS FGS+ ++ Sbjct: 243 VEEETETEREEVQCSEWEDHDHYSTTSSSDAAEEEEEDDDRESISEVGTRSEFGSTVRSN 302 Query: 272 ASPKSNLRPI-----------GKSEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITA 126 + + + +P K +K+D+A S G G+I+DMKMVVRHRDL EI Sbjct: 303 SMRRHHQQPSPMPQVYGGAEQSKYDKADDATISSGSYRGGGDIADMKMVVRHRDLKEIID 362 Query: 125 AIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 AIK+ FDK A +GE VS++LE GRA+LDRSF QLKKTV HS Sbjct: 363 AIKENFDKDAASGEQVSQMLELGRAELDRSFSQLKKTVIHS 403 >ref|XP_006402594.1| hypothetical protein EUTSA_v10005793mg [Eutrema salsugineum] gi|557103693|gb|ESQ44047.1| hypothetical protein EUTSA_v10005793mg [Eutrema salsugineum] Length = 786 Score = 134 bits (337), Expect = 4e-29 Identities = 89/217 (41%), Positives = 111/217 (51%), Gaps = 39/217 (17%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKED----------DNDEEQHRHLDHDDDGET------------ 423 ENFYPPSPP S+F+ R ++ D+ E Q D + Sbjct: 184 ENFYPPSPPDSEFFDRKAQEKKQKPDNPFSDDTETQRSEYDFFHSSKQKQFESVSSAVEV 243 Query: 422 ---TEREEVHCREWG--DHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYNDASPKS 258 TEREEV C EW DHY E+GTRS+FGS+ + + Sbjct: 244 ETETEREEVQCSEWDVHDHYSTTTSSDATEEEDDDRESISEIGTRSDFGSTGRTFSMGRQ 303 Query: 257 NLRP-------IGK-----SEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKD 114 + +P +G +EK+D+A S G EI+DMKMVVRHRDL EI AI++ Sbjct: 304 HQQPSPMPEEYVGGEHGRYNEKADDATISSGSYRGGREIADMKMVVRHRDLREIADAIQE 363 Query: 113 YFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 FDKAA AG VS++LE GRAQLDRSF QLKKTV HS Sbjct: 364 NFDKAAAAGNQVSQMLELGRAQLDRSFSQLKKTVIHS 400 >ref|XP_006402593.1| hypothetical protein EUTSA_v10005793mg [Eutrema salsugineum] gi|557103692|gb|ESQ44046.1| hypothetical protein EUTSA_v10005793mg [Eutrema salsugineum] Length = 791 Score = 134 bits (337), Expect = 4e-29 Identities = 89/217 (41%), Positives = 111/217 (51%), Gaps = 39/217 (17%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKED----------DNDEEQHRHLDHDDDGET------------ 423 
ENFYPPSPP S+F+ R ++ D+ E Q D + Sbjct: 184 ENFYPPSPPDSEFFDRKAQEKKQKPDNPFSDDTETQRSEYDFFHSSKQKQFESVSSAVEV 243 Query: 422 ---TEREEVHCREWG--DHYXXXXXXXXXXXXXXXXXXXXEVGTRSNFGSSSYNDASPKS 258 TEREEV C EW DHY E+GTRS+FGS+ + + Sbjct: 244 ETETEREEVQCSEWDVHDHYSTTTSSDATEEEDDDRESISEIGTRSDFGSTGRTFSMGRQ 303 Query: 257 NLRP-------IGK-----SEKSDEAGSSVSWNAGNGEISDMKMVVRHRDLAEITAAIKD 114 + +P +G +EK+D+A S G EI+DMKMVVRHRDL EI AI++ Sbjct: 304 HQQPSPMPEEYVGGEHGRYNEKADDATISSGSYRGGREIADMKMVVRHRDLREIADAIQE 363 Query: 113 YFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 FDKAA AG VS++LE GRAQLDRSF QLKKTV HS Sbjct: 364 NFDKAAAAGNQVSQMLELGRAQLDRSFSQLKKTVIHS 400 >ref|XP_007052313.1| BZIP domain class transcription factor [Theobroma cacao] gi|508704574|gb|EOX96470.1| BZIP domain class transcription factor [Theobroma cacao] Length = 823 Score = 134 bits (337), Expect = 4e-29 Identities = 92/238 (38%), Positives = 112/238 (47%), Gaps = 60/238 (25%) Frame = -3 Query: 536 ENFYPPSPPGSDFYRRSKEDDNDEEQHRHLDHDDDG------------------------ 429 ENFYPPSPP S+F+ + + + RH D + Sbjct: 195 ENFYPPSPPDSEFFDQKLQQQKQQLPRRHHQLDSNNPEDTEDTETEKSEYDFFRPQKLNH 254 Query: 428 --------------ETTEREEVHCREWGDH-----YXXXXXXXXXXXXXXXXXXXXEVGT 306 E TEREEV C EWGDH E+G+ Sbjct: 255 RYNINSNNAKSNFDEETEREEVQCSEWGDHDHDRYTTTSSSDVEEQDEDDDVASRSEIGS 314 Query: 305 RSNFGSSSYNDASPKSNLR----PIGKSE-------------KSDEAGSSVSWNAGNGEI 177 RSNFGSS ++ +LR P+ + KS +AGSS + G + Sbjct: 315 RSNFGSSVRGESEKLHHLRNHTPPVQPQQPMYGATAGNKMDNKSGDAGSSAG-SYRTGAM 373 Query: 176 SDMKMVVRHRDLAEITAAIKDYFDKAAEAGEHVSELLETGRAQLDRSFRQLKKTVYHS 3 DMKMVVRHRDL EI AIK+ FDKAA AG+ VSE+LE GRAQLD+SFRQLKKTVYHS Sbjct: 374 MDMKMVVRHRDLKEIVDAIKENFDKAAAAGDQVSEMLEIGRAQLDKSFRQLKKTVYHS 431