BLASTX nr result
ID: Cocculus22_contig00001054
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00001054 (2637 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

Sequences producing significant alignments: Score (bits) E Value

ref|XP_003631709.1| PREDICTED: transcription factor GTE2-like [V... 416 e-113
emb|CBI35445.3| unnamed protein product [Vitis vinifera] 407 e-110
ref|XP_006856632.1| hypothetical protein AMTR_s00046p00231370 [A... 401 e-109
gb|EXC03905.1| Transcription factor GTE7 [Morus notabilis] 395 e-107
ref|XP_007148327.1| hypothetical protein PHAVU_006G199400g [Phas... 378 e-102
gb|ABI14812.1| chloroplast bromodomain-containing protein [Pachy... 373 e-100
ref|XP_007042774.1| Global transcription factor group E2, putati... 372 e-100
ref|XP_007042770.1| Global transcription factor group E2, putati... 372 e-100
ref|XP_006379136.1| hypothetical protein POPTR_0009s08350g [Popu... 370 2e-99
ref|XP_002313212.2| hypothetical protein POPTR_0009s08350g [Popu... 370 2e-99
ref|XP_006379135.1| hypothetical protein POPTR_0009s08350g [Popu... 370 2e-99
ref|XP_004147512.1| PREDICTED: transcription factor GTE7-like [C... 367 2e-98
ref|XP_002518322.1| bromodomain-containing protein, putative [Ri... 363 2e-97
ref|NP_001234374.1| PSTVd RNA-binding protein Virp1d [Solanum ly... 363 2e-97
ref|XP_002298808.2| hypothetical protein POPTR_0001s29240g [Popu... 361 8e-97
emb|CAD43284.1| bromodomain-containing RNA-binding protein 1 [Ni... 361 1e-96
ref|NP_001275266.1| bromodomain-containing RNA-binding protein 1... 358 5e-96
ref|XP_007148328.1| hypothetical protein PHAVU_006G199500g [Phas...
358 7e-96 gb|EXC33022.1| Transcription factor GTE4 [Morus notabilis] 358 9e-96 ref|XP_007042773.1| Global transcription factor group, putative ... 357 1e-95 >ref|XP_003631709.1| PREDICTED: transcription factor GTE2-like [Vitis vinifera] Length = 561 Score = 416 bits (1069), Expect = e-113 Identities = 263/591 (44%), Positives = 330/591 (55%), Gaps = 13/591 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV---------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQI 2267 MASA+LASRNE +W + + +M K +SNP NPN N K K +P I Sbjct: 1 MASAVLASRNESNWAQSRGGGGGGGGGGFMGKFHSSNP-----NPN-NSKRKTHAPAGDI 54 Query: 2266 YDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087 D S A+ + SDD+SS N++ + E N G YV+FNI +YSRK+L Sbjct: 55 NDLS----------PAVTQSASDDASSFNQRSIV-----EFNRGRYVTFNIGSYSRKDLV 99 Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907 LK RL++ELE+++NLS+RIES + Q RSG G R RP Sbjct: 100 QLKNRLVSELEKIQNLSNRIESGDLQLRSG----------GDRTANKQQRPNNK------ 143 Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHK 1727 K++G+KRP DSGR KR A++ A+ +MK CGQ LTKLMKHK Sbjct: 144 ------------KIAGNKRPPPFDSGRGPKRSAAENAS------LMKLCGQTLTKLMKHK 185 Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547 H W+FN PVDVVGMGLHDY+QII+ PMDLGTVKSK+ NLY SP DFA+DVRLTF+NAL Sbjct: 186 HSWVFNSPVDVVGMGLHDYHQIIKRPMDLGTVKSKIAKNLYDSPLDFAADVRLTFDNALL 245 Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHS----MFGGANELRQRSAAMAEEME 1379 YNPKGHDVHV AEQLLARFE++F+P + K E + + GG A + E Sbjct: 246 YNPKGHDVHVMAEQLLARFEDLFKPVYNKLEEDERDQERIIVGGGRGGVSAIAGTSGGEE 305 Query: 1378 PRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXX 1199 + SSWNH TP+ K+ PV + +S Sbjct: 306 LQGSSWNH-IPTPERLKKPSPKPVAKKPERMQVPIPATGSSNPPSVQS---VPTPSPMRA 361 Query: 1198 XXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRD 1019 K L R +GKQ PKPKAKDPNKREM+ EEK KL L LQ+LPQEKMDQV+QIISK++ Sbjct: 362 PPVKPLATRPSSGKQ-PKPKAKDPNKREMSLEEKHKLGLGLQSLPQEKMDQVVQIISKKN 420 Query: 1018 
SKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCT 839 L Q GDEIE+DIE +D ETLWELDR V +KKM+SK+KRQA+ N+++ E Sbjct: 421 GHLTQDGDEIELDIEAVDTETLWELDRLVTNWKKMVSKIKRQALMVNNNTSSMNE----- 475 Query: 838 TVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAA 686 E ++ E+P + FPPVEIEKD A Sbjct: 476 ------RTEPSLAPAMAKKPKKGEAGEEDVDIGDEIPTATFPPVEIEKDDA 520 >emb|CBI35445.3| unnamed protein product [Vitis vinifera] Length = 564 Score = 407 bits (1046), Expect = e-110 Identities = 251/546 (45%), Positives = 316/546 (57%), Gaps = 1/546 (0%) Frame = -3 Query: 2320 SNPNPN-PKEKCSSPRKQIYDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKES 2144 SNPNPN K K +P I D S A+ + SDD+SS N++ + E Sbjct: 48 SNPNPNNSKRKTHAPAGDINDLS----------PAVTQSASDDASSFNQRSIV-----EF 92 Query: 2143 NHGGYVSFNISAYSRKELKGLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHG 1964 N G YV+FNI +YSRK+L LK RL++ELE+++NLS+RIES + Q RSG G Sbjct: 93 NRGRYVTFNIGSYSRKDLVQLKNRLVSELEKIQNLSNRIESGDLQLRSG----------G 142 Query: 1963 GREVTSSTRPPTHSPEASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL 1784 R RP K++G+KRP DSGR KR A++ A+ Sbjct: 143 DRTANKQQRPNNK------------------KIAGNKRPPPFDSGRGPKRSAAENAS--- 181 Query: 1783 LSGMMKKCGQLLTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLY 1604 +MK CGQ LTKLMKHKH W+FN PVDVVGMGLHDY+QII+ PMDLGTVKSK+ NLY Sbjct: 182 ---LMKLCGQTLTKLMKHKHSWVFNSPVDVVGMGLHDYHQIIKRPMDLGTVKSKIAKNLY 238 Query: 1603 TSPHDFASDVRLTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGA 1424 SP DFA+DVRLTF+NAL YNPKGHDVHV AEQLLARFE++F+P + K E + Sbjct: 239 DSPLDFAADVRLTFDNALLYNPKGHDVHVMAEQLLARFEDLFKPVYNKLEEDE------- 291 Query: 1423 NELRQRSAAMAEEMEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLT 1244 R + + E++ SSWNH TP+ K+ PV + Sbjct: 292 ---RDQERIIVGELQ--GSSWNH-IPTPERLKKPSPKPVAKKPERMQVPIPATGSSNPPS 345 Query: 1243 ERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLP 1064 +S K L R +GKQ PKPKAKDPNKREM+ EEK KL L LQ+LP Sbjct: 346 VQS---VPTPSPMRAPPVKPLATRPSSGKQ-PKPKAKDPNKREMSLEEKHKLGLGLQSLP 401 Query: 1063 QEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA 884 
QEKMDQV+QIISK++ L Q GDEIE+DIE +D ETLWELDR V +KKM+SK+KRQA+ Sbjct: 402 QEKMDQVVQIISKKNGHLTQDGDEIELDIEAVDTETLWELDRLVTNWKKMVSKIKRQALM 461 Query: 883 AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVE 704 N+ ATE N+ ++++E E ++ E+P + FPPVE Sbjct: 462 VNNNTAATEVNR--SSMNE--RTEPSLAPAMAKKPKKGEAGEEDVDIGDEIPTATFPPVE 517 Query: 703 IEKDAA 686 IEKD A Sbjct: 518 IEKDDA 523 >ref|XP_006856632.1| hypothetical protein AMTR_s00046p00231370 [Amborella trichopoda] gi|548860513|gb|ERN18099.1| hypothetical protein AMTR_s00046p00231370 [Amborella trichopoda] Length = 602 Score = 401 bits (1031), Expect = e-109 Identities = 259/612 (42%), Positives = 336/612 (54%), Gaps = 36/612 (5%) Frame = -3 Query: 2419 MASALLASRNEP-HWGEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSGHGR 2243 MASALLAS+NE +WG+ KVYMRK PN T S P + HG Sbjct: 1 MASALLASQNESTYWGDRKVYMRKAPNPKQTLTLD----------SHPHHHNLELPPHGN 50 Query: 2242 QMDESAAALVLAKSDDSSSLNRKFVSLNNR--------KESNHGGYVSFNISAYSRKELK 2087 + LA S DSSSLNRK +SLN + K S+H Y+++N+ +YS+++L+ Sbjct: 51 E----PVLTTLAASSDSSSLNRKSISLNRKEPQLQASAKFSDH--YITYNVGSYSKQDLR 104 Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRS----------GYSATQFSGGHGGREVTSSTR 1937 L+KRL+ ELEQVR L++RIESR A + T Sbjct: 105 DLRKRLVLELEQVRTLANRIESRSLWEPETPGSGRPPPLNLQALDLQADGPKEKRTPKAN 164 Query: 1936 PPTHSPEASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDT-KRLA--SDPANAKLLSGMMK 1766 + E GK + GSKRPN + ++ KR+A DP KL+S MK Sbjct: 165 QYYRASEFVMGKEKMPAQENKKVFGGSKRPNPVTKVSESGKRMAISPDPVTGKLVSDFMK 224 Query: 1765 KCGQLLTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDF 1586 +CGQ+LTKLMKHKHGW+FNVPVDVVGMGLHDY +I+ PMDLGTVK++L + Y++P +F Sbjct: 225 RCGQILTKLMKHKHGWVFNVPVDVVGMGLHDYYTLIKNPMDLGTVKTRLNQSFYSTPLEF 284 Query: 1585 ASDVRLTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESE--------RHSMFG 1430 A+DVRLTF+NAL YNPKGHDV++ AEQLL FEEM+ P +YE E R + F Sbjct: 285 AADVRLTFHNALTYNPKGHDVNIMAEQLLGFFEEMWNPAFNRYEEERRRAVEEARRNSFS 344 Query: 1429 GANELRQRSAAMAEEMEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXX 1250 G +R +SAA+ E P + R P S+K+ + + Sbjct: 345 
GEFPVR-KSAALPEIAMP------VEKRQPQSSKKLDPW--------------------- 376 Query: 1249 LTERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQN 1070 +G+R + PKPKA+DPNKREM+FEEKQKLS +LQN Sbjct: 377 --------MPPTSVKTKGGGNLVGIRPVQLGKQPKPKARDPNKREMSFEEKQKLSTSLQN 428 Query: 1069 LPQEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKR-- 896 LPQEKMDQV+QII +R+S LAQ GDEIEVDI+ +D ETLWELDRFV KKM+SK+KR Sbjct: 429 LPQEKMDQVVQIIRRRNSNLAQDGDEIEVDIDVVDTETLWELDRFVSNCKKMMSKVKRKA 488 Query: 895 ----QAIAAGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMP 728 QA+ Q + T + + + V + PEA A +MP Sbjct: 489 GLEDQAMLVQQVNQGTSVDVEKSPVHDTPEAAAA------KKSKKGDQAEEDVDIGDDMP 542 Query: 727 ASNFPPVEIEKD 692 ++NFPPVEIEKD Sbjct: 543 STNFPPVEIEKD 554 >gb|EXC03905.1| Transcription factor GTE7 [Morus notabilis] Length = 605 Score = 395 bits (1016), Expect = e-107 Identities = 263/593 (44%), Positives = 328/593 (55%), Gaps = 17/593 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKVYMRK---NPNSNPTNWRSNPNPNPKEKCSSPR----KQIYD 2261 MASALLASRNEP WGE KVYMRK N + NP + NPNPNP S+P +Q YD Sbjct: 1 MASALLASRNEPSWGENKVYMRKFTTNASKNPL-LKPNPNPNPNP-ISNPNTGSVRQPYD 58 Query: 2260 RSG--HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087 SG RQ+D ++ +SS NRK +SL + S+ G YV+FN+ ++SRKELK Sbjct: 59 ASGGFRFRQIDNHTSSAPAP----TSSPNRKAMSLIEPRVSSQG-YVTFNVGSFSRKELK 113 Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907 LK RL +ELEQVR L SRIES S G + +S + S E Sbjct: 114 ELKMRLRSELEQVRALMSRIESGSCHS--------LPKGAEKKNPKASLKNYQES-ELVA 164 Query: 1906 GKGSTKQKHMTMKV-----SGSKRPNQLDSGR-DTKRLASDPANAKLLSGMMKKCGQLLT 1745 GKG K+K M V G+KR N D KR A DP + KL+ M+K+CGQ+LT Sbjct: 165 GKGKKKKKKDVMSVVAVDSKGTKRSNPFGGVMADPKRPAIDPISEKLVGSMLKRCGQILT 224 Query: 1744 KLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLT 1565 KLMKHK GW+FN PVDV G+ LHDY+QI++ PMDLGTVKS L +LY SP DFASDVRL Sbjct: 225 KLMKHKFGWVFNAPVDVDGLKLHDYHQIVKNPMDLGTVKSNLERDLYPSPLDFASDVRLA 284 Query: 1564 FNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEE 1385 
FNNAL YNPKG DV+ AEQLL +F +MF P ++K E ER + G + +A E Sbjct: 285 FNNALLYNPKGSDVNFMAEQLLIQFNQMFNPAYKKLEDERRRVLGFGD------PNVAPE 338 Query: 1384 MEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXX 1205 R + PD + FP LT+ S Sbjct: 339 NARREIDTMVAVKKPDLVRSKPTFP-------DPTPPPPVHYTQALTKPSVPAPAPSPVS 391 Query: 1204 XXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISK 1025 V+ PKPKAKDPNKR+MT+EEK KL NLQNLP EKM Q+L I+ K Sbjct: 392 KPPLVNSRPVKL------PKPKAKDPNKRQMTYEEKAKLGANLQNLPTEKMVQLLHILKK 445 Query: 1024 RDSK--LAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEEN 851 R+ + L+Q G+EIE+DIE +D ETLWELDRFVG YKKM+SKMKRQA+ QN + Sbjct: 446 RNDQCHLSQDGEEIELDIEAVDTETLWELDRFVGNYKKMVSKMKRQALMQAQNPAPQNTD 505 Query: 850 KQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 + + + +A A + ++P +FPPV IEKD Sbjct: 506 RNTSLESDPADAAAVV-----KSKKVDAAAEEDVDIGEDIPMGDFPPVVIEKD 553 >ref|XP_007148327.1| hypothetical protein PHAVU_006G199400g [Phaseolus vulgaris] gi|561021550|gb|ESW20321.1| hypothetical protein PHAVU_006G199400g [Phaseolus vulgaris] Length = 527 Score = 378 bits (970), Expect = e-102 Identities = 250/581 (43%), Positives = 306/581 (52%), Gaps = 5/581 (0%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV----YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 MASA+LA+RNEP+W + +V +M K P +NP NPN NPK +S R Q Sbjct: 1 MASAVLANRNEPNWPQHRVGGAGFMGKAPFANP-----NPNSNPK-LANSKRNQ------ 48 Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072 + SDD+SS+NR+ + + H YVSFNI + S+KEL +K R Sbjct: 49 --------------SASDDASSINRR-----SNEVVTHSQYVSFNIGSLSKKELGDIKNR 89 Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892 L++ELEQV+ +RIES E Q F+GGH P+ S K Sbjct: 90 LVSELEQVQKFRTRIESGELQP-----GQSFNGGH---------------PKKSSSK--- 126 Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHGWIF 1712 KVSG+KRP L+S +D KR + N +MK C Q+L KLMKHKHGWIF Sbjct: 127 -------KVSGNKRPLPLNSAKDFKRSLPEVGN------LMKGCSQVLQKLMKHKHGWIF 173 Query: 1711 
NVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKG 1532 NVPVD VGMGLHDY II+ PMDLGTVKS L N Y++P DFASDVRLTF NAL YNPKG Sbjct: 174 NVPVDAVGMGLHDYYDIIKQPMDLGTVKSNLSKNKYSAPSDFASDVRLTFKNALTYNPKG 233 Query: 1531 HDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQ 1352 HDV+ AEQLL RFEE++ P H K+E + G + E E + SSW+H Sbjct: 234 HDVYTMAEQLLTRFEELYRPMHEKFE----DLVGHDRDF---------EEELQASSWSHV 280 Query: 1351 SRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQLGVR 1172 P+ K+ E L + KQ Sbjct: 281 E--PERVKKKENLIPHAKFQQEPPQPPASSSNPPLLQFPVRTPSPMRAPPVKPLKQ---- 334 Query: 1171 SGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDE 992 PKPKAKDPNKREM+ EEK KL L LQ+LP EKM+QV+QII +R+ L Q GDE Sbjct: 335 -------PKPKAKDPNKREMSLEEKHKLGLGLQSLPAEKMEQVVQIIRRRNGHLKQDGDE 387 Query: 991 IEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTTVDEVPEAE 812 IE+DIE +D ETLWELDR V YKKM+SK+KRQA+ N + N E+P E Sbjct: 388 IELDIEAVDTETLWELDRLVTNYKKMVSKIKRQALMGNNNVAPHKANM------ELPAGE 441 Query: 811 -ATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 A EMP S FPPVEIEKD Sbjct: 442 KADGMPTELKKPKKVEAGDEDVDIGDEMPMSMFPPVEIEKD 482 >gb|ABI14812.1| chloroplast bromodomain-containing protein [Pachysandra terminalis] Length = 428 Score = 373 bits (957), Expect = e-100 Identities = 217/396 (54%), Positives = 255/396 (64%), Gaps = 2/396 (0%) Frame = -3 Query: 1870 KVSGSKRPNQLDSGRDTKRLASDPA-NAKLLSGMMKKCGQLLTKLMKHKHGWIFNVPVDV 1694 KVSGSKRP SGRD+KR AS+PA K+LS MMK+CGQ+LTKLM+HKHGWIFNVPVDV Sbjct: 3 KVSGSKRPLPFTSGRDSKRPASEPAPTGKMLSSMMKQCGQILTKLMRHKHGWIFNVPVDV 62 Query: 1693 VGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKGHDVHVA 1514 VGMGLHDYNQII+ PMDLGTVK +G NLY+SP DFASDVRLTFNNAL YNPKGHDV+ Sbjct: 63 VGMGLHDYNQIIKHPMDLGTVKLNIGKNLYSSPLDFASDVRLTFNNALSYNPKGHDVYAM 122 Query: 1513 AEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQSRTPDS 1334 AEQLL RFEEMFEP ++K+E + R+ SA E RRSSW+HQ P+S Sbjct: 123 AEQLLVRFEEMFEPAYKKFEDAQQ---------RKISAG-----EIRRSSWSHQIPMPES 168 Query: 1333 
AKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ-LGVRSGTGK 1157 R P+ ++ K + +RS T K Sbjct: 169 IP--NRDPLSSSAATRPGGFAHPMPLSTPQPQAFPQALASTSAPAPAPKPFMAMRSATVK 226 Query: 1156 QPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDEIEVDI 977 Q PKPKAKDPNKREM+FEEK KL L+LQ+LPQEKM+QV+QII KR+ LAQ GDEIE+DI Sbjct: 227 Q-PKPKAKDPNKREMSFEEKHKLGLSLQSLPQEKMEQVVQIIRKRNGHLAQDGDEIELDI 285 Query: 976 EYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTTVDEVPEAEATMXX 797 E +D ETLWELDRFV KK++SK+KRQA+ + +TA E NK + D AEA Sbjct: 286 EVVDTETLWELDRFVYNCKKLMSKIKRQALVSNNQNTAEEGNKSPVS-DSHEAAEAA--- 341 Query: 796 XXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDA 689 E+P SNFPPVEIEKDA Sbjct: 342 -SAKKIKKGEIGEEDVDIGEEIPTSNFPPVEIEKDA 376 >ref|XP_007042774.1| Global transcription factor group E2, putative isoform 5, partial [Theobroma cacao] gi|508706709|gb|EOX98605.1| Global transcription factor group E2, putative isoform 5, partial [Theobroma cacao] Length = 547 Score = 372 bits (956), Expect = e-100 Identities = 249/590 (42%), Positives = 315/590 (53%), Gaps = 14/590 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264 MASA+LA+R+E +W +PK + K P + PNPNPK + ++Q++ Sbjct: 1 MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56 Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084 D +GH +D+S A A SDD+SS+NRK + + G YVSF+IS+YSRKEL Sbjct: 57 DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108 Query: 2083 LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904 LK RL+AELEQ+R L +RIES +F RS Sbjct: 109 LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136 Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724 STK+ +SG+KRP + ++ KRL + +MK C Q+L KLMK K+ Sbjct: 137 -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195 Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544 G+IFN PVDVVGMGLHDY II+ PMDLGTVKS++ N Y SP DFA+DVRLTFNNA+ Y Sbjct: 196 
GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255 Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364 NPKGH+V++ AEQLLARFEE F P K E + E Q EE++ SS Sbjct: 256 NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304 Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184 W+H KE ER Sbjct: 305 WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362 Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022 V S P KP KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR Sbjct: 363 ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422 Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842 + L Q GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ A N + + N++ Sbjct: 423 NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNREE 481 Query: 841 TTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 TV+++ A EMP S+FPPVEIEKD Sbjct: 482 VTVEKIEVA------MEMKKPKKGDAGEEDVDIGDEMPMSSFPPVEIEKD 525 >ref|XP_007042770.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] gi|590687812|ref|XP_007042771.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] gi|590687818|ref|XP_007042772.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] gi|508706705|gb|EOX98601.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] gi|508706706|gb|EOX98602.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] gi|508706707|gb|EOX98603.1| Global transcription factor group E2, putative isoform 1 [Theobroma cacao] Length = 566 Score = 372 bits (956), Expect = e-100 Identities = 249/590 (42%), Positives = 315/590 (53%), Gaps = 14/590 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264 MASA+LA+R+E +W +PK + K P + PNPNPK + ++Q++ Sbjct: 1 MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56 Query: 2263 
DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084 D +GH +D+S A A SDD+SS+NRK + + G YVSF+IS+YSRKEL Sbjct: 57 DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108 Query: 2083 LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904 LK RL+AELEQ+R L +RIES +F RS Sbjct: 109 LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136 Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724 STK+ +SG+KRP + ++ KRL + +MK C Q+L KLMK K+ Sbjct: 137 -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195 Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544 G+IFN PVDVVGMGLHDY II+ PMDLGTVKS++ N Y SP DFA+DVRLTFNNA+ Y Sbjct: 196 GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255 Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364 NPKGH+V++ AEQLLARFEE F P K E + E Q EE++ SS Sbjct: 256 NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304 Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184 W+H KE ER Sbjct: 305 WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362 Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022 V S P KP KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR Sbjct: 363 ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422 Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842 + L Q GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ A N + + N++ Sbjct: 423 NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNREE 481 Query: 841 TTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 TV+++ A EMP S+FPPVEIEKD Sbjct: 482 VTVEKIEVA------MEMKKPKKGDAGEEDVDIGDEMPMSSFPPVEIEKD 525 >ref|XP_006379136.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] gi|566186955|ref|XP_006379137.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] gi|550331302|gb|ERP56933.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] 
gi|550331303|gb|ERP56934.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] Length = 547 Score = 370 bits (949), Expect = 2e-99 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261 MASA+LA+RNEP+W +P+ +M K P SNP NP + K + P+ QI D Sbjct: 1 MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55 Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081 +DES +A SDD+SS+NR+ NN + N GGYVSFN+S+ S+KEL L Sbjct: 56 -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102 Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901 K RL+ ELE++R L +RIES +F H + S Sbjct: 103 KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134 Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721 S KQ KVSG+KRP S + + +S P NA+L MK C Q+L+KLMK K G Sbjct: 135 SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189 Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541 +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L NLY SP DFA+DVRLTFNNA++YN Sbjct: 190 YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249 Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361 PKGH+V++ AEQ L RF++++ P K ++ + + +E++ SSW Sbjct: 250 PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297 Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181 +H R P+ + + + S KQ Sbjct: 298 DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356 Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001 PKPKAKDPNKREM EEK KL + LQ+LPQEKM+QV+QII KR+ L Q Sbjct: 357 ----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406 Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836 GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ AG + + NK Sbjct: 407 GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462 
Query: 835 VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 +VP + EMP S+FPPVEIEKD Sbjct: 463 --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508 >ref|XP_002313212.2| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] gi|550331301|gb|EEE87167.2| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] Length = 546 Score = 370 bits (949), Expect = 2e-99 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261 MASA+LA+RNEP+W +P+ +M K P SNP NP + K + P+ QI D Sbjct: 1 MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55 Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081 +DES +A SDD+SS+NR+ NN + N GGYVSFN+S+ S+KEL L Sbjct: 56 -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102 Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901 K RL+ ELE++R L +RIES +F H + S Sbjct: 103 KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134 Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721 S KQ KVSG+KRP S + + +S P NA+L MK C Q+L+KLMK K G Sbjct: 135 SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189 Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541 +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L NLY SP DFA+DVRLTFNNA++YN Sbjct: 190 YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249 Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361 PKGH+V++ AEQ L RF++++ P K ++ + + +E++ SSW Sbjct: 250 PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297 Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181 +H R P+ + + + S KQ Sbjct: 298 DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356 Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001 PKPKAKDPNKREM EEK KL + LQ+LPQEKM+QV+QII KR+ L Q Sbjct: 357 
----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406 Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836 GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ AG + + NK Sbjct: 407 GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462 Query: 835 VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 +VP + EMP S+FPPVEIEKD Sbjct: 463 --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508 >ref|XP_006379135.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] gi|550331300|gb|ERP56932.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa] Length = 541 Score = 370 bits (949), Expect = 2e-99 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261 MASA+LA+RNEP+W +P+ +M K P SNP NP + K + P+ QI D Sbjct: 1 MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55 Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081 +DES +A SDD+SS+NR+ NN + N GGYVSFN+S+ S+KEL L Sbjct: 56 -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102 Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901 K RL+ ELE++R L +RIES +F H + S Sbjct: 103 KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134 Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721 S KQ KVSG+KRP S + + +S P NA+L MK C Q+L+KLMK K G Sbjct: 135 SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189 Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541 +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L NLY SP DFA+DVRLTFNNA++YN Sbjct: 190 YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249 Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361 PKGH+V++ AEQ L RF++++ P K ++ + + +E++ SSW Sbjct: 250 PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297 Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181 +H R 
P+ + + + S KQ Sbjct: 298 DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356 Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001 PKPKAKDPNKREM EEK KL + LQ+LPQEKM+QV+QII KR+ L Q Sbjct: 357 ----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406 Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836 GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ AG + + NK Sbjct: 407 GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462 Query: 835 VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 +VP + EMP S+FPPVEIEKD Sbjct: 463 --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508 >ref|XP_004147512.1| PREDICTED: transcription factor GTE7-like [Cucumis sativus] gi|449511376|ref|XP_004163939.1| PREDICTED: transcription factor GTE7-like [Cucumis sativus] Length = 533 Score = 367 bits (942), Expect = 2e-98 Identities = 251/591 (42%), Positives = 305/591 (51%), Gaps = 12/591 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV--------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY 2264 MASA+LA+RNE +W +P+ +M K P SNP NP N K+ + Sbjct: 1 MASAVLANRNEANWPQPRGNGRGTEEGFMGKVPFSNP-----NPKFNKKQ---------F 46 Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESN---HGGYVSFNISAYSRKE 2093 +G QMD+S A A SDD+SS+N ++R+ SN YVSFN+S+ SRKE Sbjct: 47 HGEMNGFQMDDSPAVTQSA-SDDASSIN------HHRRLSNGVDFSQYVSFNVSSCSRKE 99 Query: 2092 LKGLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEA 1913 L LK RLI+ELEQ+R L SRI S E SR P H + Sbjct: 100 LIELKTRLISELEQIRQLKSRINSGELHSR-----------------------PKHQKKF 136 Query: 1912 SGGKGSTKQKHMTMKVSGSKRPNQLDS-GRDTKRLASDPANAKLLSGMMKKCGQLLTKLM 1736 S K G+KRP S G + KR SD N ++K C Q+LTKLM Sbjct: 137 S-------------KTLGTKRPLPTSSNGMELKRSNSDNGN------LLKACSQILTKLM 177 Query: 1735 KHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNN 1556 KHKHGWIFN PVDVVGMGLHDY I++ PMDLG+VK KLG + Y SP+DFASDVRLTF N Sbjct: 178 KHKHGWIFNKPVDVVGMGLHDYYDIVKRPMDLGSVKVKLGKDAYESPYDFASDVRLTFKN 237 Query: 1555 
ALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEP 1376 A+ YNPKGHDVH AEQLL RFEE+F P E E G EL Sbjct: 238 AMTYNPKGHDVHAMAEQLLVRFEELFRPVAEALEEEDRRFCGYQEEL------------- 284 Query: 1375 RRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXX 1196 SSWNH K++ + V S Sbjct: 285 PASSWNHSEAERTVKKDNIQKQVVKKTEPMKAP-------------SSSSNPPMMQSPVK 331 Query: 1195 XAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDS 1016 L + PKP+AKDPNKREMT EEK KL + LQ+LP EKM+QV+QII KR+ Sbjct: 332 TPSPLRAPPVKPLKQPKPRAKDPNKREMTLEEKHKLGIGLQSLPPEKMEQVVQIIKKRNG 391 Query: 1015 KLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTT 836 L Q GDEIE+DIE +D ETLWELDR V +KKM+SK+KRQA+ + + + N T Sbjct: 392 HLKQDGDEIELDIEAVDTETLWELDRLVTNWKKMMSKIKRQALI---TAASMKPNGVMPT 448 Query: 835 VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAAG 683 +++ T EMPASNFPPVEIEKDA G Sbjct: 449 PEKIEVGSET------KKQRKGEAGEEDVDIGDEMPASNFPPVEIEKDAGG 493 >ref|XP_002518322.1| bromodomain-containing protein, putative [Ricinus communis] gi|223542542|gb|EEF44082.1| bromodomain-containing protein, putative [Ricinus communis] Length = 634 Score = 363 bits (933), Expect = 2e-97 Identities = 252/615 (40%), Positives = 326/615 (53%), Gaps = 38/615 (6%) Frame = -3 Query: 2422 FMASALLASRNEPHWGEPK---VYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 +MASA+LA+RNE +W +P+ +M K P SNP NPNP K S R Sbjct: 54 YMASAVLANRNEANWTQPRGGAKFMGKVPFSNP-------NPNPNSKFSKKR-------- 98 Query: 2251 HGRQMDESAAALV--------LAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRK 2096 Q +AAA A SDD+SS+NR+ + S+ YV+FNI +YS+K Sbjct: 99 ---QFQSAAAAPPPVNFDIHPAASSDDASSINRRPAAT----ASDFNSYVTFNIGSYSKK 151 Query: 2095 ELKGLKKRLIAELEQVRNLSSRIESRE-FQSRSGYSATQFSGGHGGREVTSSTRPPTHSP 1919 EL LK RL+AELEQ+R L +RI+S + FQ RS F+G ++VT + RP P Sbjct: 152 ELLELKSRLVAELEQIRQLKNRIDSSQSFQIRS---TPNFNGKKQNKKVTGNKRP---FP 205 Query: 1918 EASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDTKR---LASDPANAKLLSGMMKKCGQLL 1748 A+ G +D KR S P N +L MKKCGQ+L Sbjct: 206 SATTNYGFV--------------------AKDVKRSDLYNSHPENVQL----MKKCGQML 241 Query: 
1747 TKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRL 1568 TKLMKHK G+IFN PVDV M LHDY +II+ PMDLGTVK KLG+N Y SP DFA+DVRL Sbjct: 242 TKLMKHKFGYIFNEPVDVERMNLHDYFEIIKKPMDLGTVKKKLGSNEYESPIDFAADVRL 301 Query: 1567 TFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRK-----YESERHSMFGGANELRQRS 1403 TFNNA++YNPKGH+V+ AEQ L+RFEE+F P K + ++ + E+ Sbjct: 302 TFNNAMKYNPKGHEVYTFAEQFLSRFEELFRPIREKLGDFVLDDDQDQIVHHDREIEHEQ 361 Query: 1402 AAMAEEM-EPRRSSWNHQSRTP-------DSAKEHERFPVXXXXXXXXXXXXXXXXXXXL 1247 E++ E + SSW+H S + K+ + + Sbjct: 362 EHEHEQVHEVQASSWDHHSLNRRGGSGDIERVKKDQENVLQITSKSDHPIGKSVPPSVLS 421 Query: 1246 TERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPP--------KPKAKDPNKREMTFEEKQK 1091 +S QL VR+ + + P KPKAKDPNKREM+ EEK K Sbjct: 422 NPQS--------------TSQLPVRTPSPMRAPPVKPVKLPKPKAKDPNKREMSLEEKHK 467 Query: 1090 LSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMI 911 L + LQ+LPQEKM+QV+QII KR+ L Q GDEIE+DIE +D ETLWELDRFV YKKM+ Sbjct: 468 LGVGLQSLPQEKMEQVVQIIRKRNGHLRQDGDEIELDIEAVDTETLWELDRFVTNYKKMV 527 Query: 910 SKMKRQAI--AAGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXX 737 SK+KRQA+ A + +E NK + + + EA Sbjct: 528 SKIKRQALMGIAPTGNAVSEGNKDVSVNERIDITEA-------KKPKKGDAGDEDVDIGD 580 Query: 736 EMPASNFPPVEIEKD 692 EMP S+FPPVEIEKD Sbjct: 581 EMPMSSFPPVEIEKD 595 >ref|NP_001234374.1| PSTVd RNA-binding protein Virp1d [Solanum lycopersicum] gi|10179602|gb|AAG13810.1|AF190891_1 PSTVd RNA-binding protein Virp1a [Solanum lycopersicum] gi|10179604|gb|AAG13811.1|AF190892_1 PSTVd RNA-binding protein Virp1b [Solanum lycopersicum] gi|10179606|gb|AAG13812.1|AF190893_1 PSTVd RNA-binding protein Virp1c [Solanum lycopersicum] gi|10179608|gb|AAG13813.1|AF190894_1 PSTVd RNA-binding protein Virp1d [Solanum lycopersicum] gi|13186132|emb|CAC33448.1| PSTVd RNA-biding protein, Virp1 [Solanum lycopersicum] gi|13186134|emb|CAC33449.1| PSTVd RNA-biding protein, Virp1 [Solanum lycopersicum] gi|13186136|emb|CAC33450.1| PSTVd RNA-biding protein, Virp1 [Solanum lycopersicum] gi|13186138|emb|CAC33451.1| PSTVd RNA-biding protein, 
Virp1 [Solanum lycopersicum] Length = 602 Score = 363 bits (932), Expect = 2e-97 Identities = 242/607 (39%), Positives = 313/607 (51%), Gaps = 28/607 (4%) Frame = -3 Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 MASA+LASRNE W G +M K P S+ T N NPNPK+K +KQ + S Sbjct: 1 MASAVLASRNESSWAQSGGAGGGFMGKTPYSH-TQLNPNHNPNPKKK----QKQFHHTS- 54 Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072 +GR MD+S A A D S R S N N GGY++FN+ +Y++ E+ L+ R Sbjct: 55 NGRHMDDSPAVTQTASDDAYSFNQRPIESTTNVDGLNFGGYLTFNVVSYNKAEVNELRSR 114 Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892 L+AE+EQ+RNL RIES + S+T P + +G + Sbjct: 115 LMAEVEQIRNLKDRIESGQL---------------------STTNPRS--------QGKS 145 Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-------------LSGMMKKCGQL 1751 K K SG+KRP S +D K+L + N MMK+C Q+ Sbjct: 146 K------KQSGNKRPTPSGSSKDLKKLPNGVENRNFGNPGGVDGVKAIGTESMMKECRQI 199 Query: 1750 LTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVR 1571 L KLMKHK+GWIFN+PVD +GLHDY+QII+ PMDLGTVKS L N Y SP +FA+DVR Sbjct: 200 LAKLMKHKNGWIFNIPVDAEALGLHDYHQIIKRPMDLGTVKSNLAKNFYPSPFEFAADVR 259 Query: 1570 LTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMA 1391 LTFNNAL YNPK V+ AEQLL RFE+MF P + + + + GG +R Sbjct: 260 LTFNNALLYNPKTDQVNAFAEQLLGRFEDMFRP----LQDKMNKLEGG-----RRDYHPV 310 Query: 1390 EEMEPRRSSWNHQSRTPDSAKEHERFPV---------XXXXXXXXXXXXXXXXXXXLTER 1238 +E++ SSWNH TP+ K+ + PV + Sbjct: 311 DELQ--GSSWNH-IPTPERVKKPKPTPVPNISKKQERMQNHSSASTPSLPVPPPNPPARQ 367 Query: 1237 SXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQE 1058 Q + T + PKP+AKDPNKREM EEK KL + LQ+LPQE Sbjct: 368 QSPLSTPSPVRAPAAKPQSAAKVPTMGKQPKPRAKDPNKREMNMEEKHKLGVGLQSLPQE 427 Query: 1057 KMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-- 884 KM Q++QII KR+ LAQ GDEIE+DIE +D ETLWELDRFV +KKM+SK KRQA+ Sbjct: 428 KMPQLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALMNN 487 Query: 883 AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVE 
704 G S + + T+V E + + PA++FPPVE Sbjct: 488 LGPPSASAAASAATTSVAEADGPTTSEKNDSFKKAKKGDVGEEDVEIEDDEPATHFPPVE 547 Query: 703 IEKDAAG 683 IEKD G Sbjct: 548 IEKDEGG 554 >ref|XP_002298808.2| hypothetical protein POPTR_0001s29240g [Populus trichocarpa] gi|550348456|gb|EEE83613.2| hypothetical protein POPTR_0001s29240g [Populus trichocarpa] Length = 474 Score = 361 bits (927), Expect = 8e-97 Identities = 233/541 (43%), Positives = 302/541 (55%), Gaps = 15/541 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWGEPKV--------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QI 2267 MASA+LA+RNEP W +P+ +M K P SNP NP + K + P++ QI Sbjct: 1 MASAVLANRNEPSWTQPQPQQRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQQPQI 55 Query: 2266 YDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087 D +DES +A SDD+SS+NR+ NN ++ N GG+V+FN+ +YS+KEL Sbjct: 56 LD-------VDESPSAA----SDDASSINRR--PQNNHQDFNTGGFVTFNVGSYSKKELI 102 Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907 LK RL+ ELE++R+L +RIES E Q R S Sbjct: 103 ELKNRLVHELEKIRDLKNRIESSESQIRQ-----------------------------SS 133 Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHK 1727 KQ KVSG+KRP S + + S+P NA+L MK C Q+L+KLMKHK Sbjct: 134 NFSYKKQTSTNKKVSGNKRPFPAPSNFNNLK-RSNPENAQL----MKNCSQILSKLMKHK 188 Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547 G+IFN PVDVVGM LHDY+ II+ PMDLGTVKSKL NLY SP DFA+DVRLTFNNA++ Sbjct: 189 LGYIFNSPVDVVGMQLHDYHDIIKSPMDLGTVKSKLTKNLYESPRDFAADVRLTFNNAMK 248 Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRS 1367 YNPKGH+V++ AEQ L RFE+ + P K ++ + +E++ S Sbjct: 249 YNPKGHEVYMLAEQFLTRFEDFYRPIKEKV----------GDDFDEEENDQVQEVQ--AS 296 Query: 1366 SWNHQSRTPDSAKE-HERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXA 1190 SW+H R P+ + + F T + Sbjct: 297 SWDHIRREPERVNQIDDDFMQVTAKSDPIGHQMHQQPLQQPTGLNQNPNLVRTPSPMRMP 356 Query: 1189 KQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKL 1010 + V+ PKPKAKDPNKREM+ EEK KL + LQ+LPQEKM+QV+QII KR+ L Sbjct: 357 
QVKPVKQ------PKPKAKDPNKREMSLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHL 410 Query: 1009 AQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQ 845 Q GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ G ST+ NK Sbjct: 411 RQEGDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINNNVGAISTSEGNNKV 470 Query: 844 C 842 C Sbjct: 471 C 471 >emb|CAD43284.1| bromodomain-containing RNA-binding protein 1 [Nicotiana benthamiana] Length = 615 Score = 361 bits (926), Expect = 1e-96 Identities = 245/608 (40%), Positives = 320/608 (52%), Gaps = 29/608 (4%) Frame = -3 Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 MASA+LASRNE W G +M K P S+ T+ N NP PK+K +KQ + S Sbjct: 1 MASAVLASRNESSWAQSGGAGGGFMGKTPYSH-THLNPNSNPKPKKK----QKQFHHAS- 54 Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072 +GRQ DES A A D S R S N N GGY+++N+++Y++ EL L+ R Sbjct: 55 NGRQNDESPAVTQTASDDAYSFNQRPIESSTNVDGLNLGGYMTYNVASYNKTELHELRSR 114 Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892 L+AELEQ+R+L RIES G+ TS+ R S + SG K +T Sbjct: 115 LVAELEQIRSLKDRIES-------------------GQLSTSNPRSHGKSKKLSGNKRAT 155 Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-----LSGMMKKCGQLLTKLMKHK 1727 SK P +L +G D + + P + + MMK+C Q+L KLMKHK Sbjct: 156 PS-------GSSKDPKKLPNGVDNRNFGN-PGGVGVKGIIGMENMMKECRQVLGKLMKHK 207 Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547 GWIFN PVD +GLHDY+QII+ PMDLGTVKS L N Y +P +FA+DVRLTFNNAL Sbjct: 208 SGWIFNTPVDAEALGLHDYHQIIKRPMDLGTVKSNLSNCFYPTPFEFAADVRLTFNNALL 267 Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRS 1367 YNPK VH AEQLLARFE+MF P + + + + GG++ +R +E++ Sbjct: 268 YNPKTDQVHGFAEQLLARFEDMFRP----IQDKLNKLDGGSD---RRDFHPTDELQ--GI 318 Query: 1366 SWNH-----------QSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXX 1220 SWNH + P +K+ ER + ++S Sbjct: 319 SWNHIPTPERVKKPKPTPAPHISKKQERMMQNHSSALTLPVQQPPDNTPVVRQQSLLSTP 378 Query: 1219 XXXXXXXXXAKQLGVRSGT---GKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMD 1049 Q V + GKQ PKP+AKDPNKREM+ 
EEK KL + LQ+LPQEKM Sbjct: 379 SPVRAPPAPKPQSSVAAKVPPMGKQ-PKPRAKDPNKREMSMEEKHKLGVGLQSLPQEKMP 437 Query: 1048 QVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA--AGQ 875 Q++QII KR+ LAQ GDEIE+DIE +D ETLWELDRFV +KKM+SK KRQA+ GQ Sbjct: 438 QLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALINNLGQ 497 Query: 874 NSTATEENKQCTTVD----EVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPV 707 +A+ TT + P + PA++FPPV Sbjct: 498 PPSASAAASAATTTSVAEADAPSTSEKNDSFKKPKKGGDAGDEDDVEIEDDEPATHFPPV 557 Query: 706 EIEKDAAG 683 EI+KD G Sbjct: 558 EIDKDEGG 565 >ref|NP_001275266.1| bromodomain-containing RNA-binding protein 1 [Solanum tuberosum] gi|57282314|emb|CAD43283.1| bromodomain-containing RNA-binding protein 1 [Solanum tuberosum] Length = 602 Score = 358 bits (920), Expect = 5e-96 Identities = 242/609 (39%), Positives = 315/609 (51%), Gaps = 30/609 (4%) Frame = -3 Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 MASA+LASRNE W G +M K P S+ T N NPNPK+K +KQ + S Sbjct: 1 MASAVLASRNESSWAQSGGAGGGFMGKTPFSH-TQLNPNHNPNPKKK----QKQFHHTS- 54 Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072 +GR +DES A A D S R S N N GGY++FN+ +Y++ E+ L+ R Sbjct: 55 NGRHIDESPAVTQTASDDAYSFNQRPIESTTNVDGLNFGGYLTFNVVSYNKGEVNELRSR 114 Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892 L+AE+EQ+RNL RIES + S+T P + +G + Sbjct: 115 LLAEVEQIRNLKDRIESGQL---------------------STTNPRS--------QGKS 145 Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-------------LSGMMKKCGQL 1751 K K+SG+KRP S +D K+L + N MMK+C Q+ Sbjct: 146 K------KLSGNKRPTPSGSSKDPKKLPNGVENRNFGNPVGGGGVKAIGTESMMKECRQI 199 Query: 1750 LTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVR 1571 L KLMKHK+GWIFN+PVD +GLHDY+QII+ P+DLGTVKS L N Y SP +FA+DVR Sbjct: 200 LAKLMKHKNGWIFNIPVDAEALGLHDYHQIIKRPIDLGTVKSNLAKNFYPSPFEFAADVR 259 Query: 1570 LTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMA 1391 LTFNNAL YNPK V+ AEQLL RFE+MF P + + + + GG +R Sbjct: 260 
LTFNNALLYNPKTDQVNGFAEQLLGRFEDMFRP----LQDKMNKLEGG-----RRDYHPV 310 Query: 1390 EEMEPRRSSWNHQSRTPDSAKEHERFPV----------XXXXXXXXXXXXXXXXXXXLTE 1241 +E++ SSWNH TP+ K+ + PV + Sbjct: 311 DELQ--GSSWNH-IPTPERVKKPKATPVPHISKKQERMQNHSSASTPSLPVPPPNPPARQ 367 Query: 1240 RSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQ 1061 +S + GKQ PKP+AKDPNKR M EEK KL + LQ+LPQ Sbjct: 368 QSPLSTPSPVRAPPSKPESAAKVPAMGKQ-PKPRAKDPNKRVMNMEEKHKLGVGLQSLPQ 426 Query: 1060 EKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA- 884 EKM Q++QII KR+ LAQ GDEIE+DIE +D ETLWELDRFV +KKM+SK KRQA+ Sbjct: 427 EKMPQLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALMI 486 Query: 883 --AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPP 710 G S + + T+V E + + PA++FPP Sbjct: 487 NNLGPPSASAAASAATTSVAEADGPTTSEKNDSFKKPKKGDVGEEDVEIEDDEPATHFPP 546 Query: 709 VEIEKDAAG 683 VEIEKD G Sbjct: 547 VEIEKDEGG 555 >ref|XP_007148328.1| hypothetical protein PHAVU_006G199500g [Phaseolus vulgaris] gi|561021551|gb|ESW20322.1| hypothetical protein PHAVU_006G199500g [Phaseolus vulgaris] Length = 531 Score = 358 bits (919), Expect = 7e-96 Identities = 242/583 (41%), Positives = 306/583 (52%), Gaps = 7/583 (1%) Frame = -3 Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252 MASA+LA+RNEP+W G +M K P SNP NPN NPK +S R Q Sbjct: 1 MASAVLANRNEPNWPQHRGGGAGFMGKVPFSNP-----NPNSNPK-LANSKRTQ------ 48 Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072 + SDD+SS+NR+ +N +H YV F+IS+ ++KEL +K R Sbjct: 49 --------------SASDDASSINRR----SNDAGVSHSQYVCFSISSCTKKELNDIKNR 90 Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892 L++ELEQVR +RIES + Q Q S GH Sbjct: 91 LVSELEQVRKCRNRIESGKLQPG------QSSNGH------------------------- 119 Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHGWIF 1712 +K + KVSG+KRP L+S ++ KR S+ N MK C Q+L KLMKHKHGWIF Sbjct: 120 MKKPSSKKVSGNKRPLPLNSVKEMKRSHSEVGNT------MKSCSQILQKLMKHKHGWIF 173 Query: 1711 
NVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKG 1532 NVPVDVVGMGLHDY II+ PMDLGTVKS L ++Y++P DFA+DVRLTF NAL YNPKG Sbjct: 174 NVPVDVVGMGLHDYYDIIKQPMDLGTVKSNLSKSVYSTPSDFAADVRLTFKNALTYNPKG 233 Query: 1531 HDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQ 1352 HDV+ AEQLL RFEE++ P K + +RQ + E + SSW+H Sbjct: 234 HDVYTMAEQLLMRFEELYRPMRDKSD----------GWIRQ---DQDYDEELQASSWSH- 279 Query: 1351 SRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQLGVR 1172 P+ K+ E L + KQ Sbjct: 280 VEPPERVKKKENPIPPAKLQQEPPQPPASSSNPPLLQSPVRTPSPMRAPPVKPLKQ---- 335 Query: 1171 SGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDE 992 PKPKAKDPNKREM+ EEK KL L LQ+LP EKM+QV+QII +R+ L Q GDE Sbjct: 336 -------PKPKAKDPNKREMSLEEKHKLGLGLQSLPAEKMEQVVQIIRRRNGHLKQDGDE 388 Query: 991 IEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNST---ATEENKQCTTVDEVP 821 IE+DIE +D ETLWELDR V YKKM+SK+KRQA+ N+ ++ N + +++ Sbjct: 389 IELDIEAVDTETLWELDRLVTNYKKMVSKIKRQALMGNMNNNNEQSSRGNGELAASEKID 448 Query: 820 EAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692 + M EMP S FPPVEIEKD Sbjct: 449 GSAVEM-----KKSKEVEAGEEDIDIGDEMPMSMFPPVEIEKD 486 >gb|EXC33022.1| Transcription factor GTE4 [Morus notabilis] Length = 559 Score = 358 bits (918), Expect = 9e-96 Identities = 245/592 (41%), Positives = 309/592 (52%), Gaps = 13/592 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWG-EPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYD 2261 MASA+LA+RN+ W +P+ +M K P +NP NP + K+ S Q + Sbjct: 1 MASAVLANRNDTDWPPQPRGGASGAGFMGKVPFANP-----NPKNSSKK---SQFHQFHA 52 Query: 2260 RSG--HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087 SG +G Q+DES A A SDD+SS+N + S N H YVSFNI +YSRKEL Sbjct: 53 PSGDFNGCQIDESPAVTQTA-SDDASSINHRRSSEFNL---GHSQYVSFNIGSYSRKELS 108 Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907 LK RL++EL+++R L SRIE+ + +P P Sbjct: 109 ELKNRLLSELDRIRQLQSRIEASDLDHHH-------------------LQPKKPIP---- 145 Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDS--GRDTKRLA-SDPANAKLLSGMMKKCGQLLTKLM 1736 + K+SGSKRP +S G+D+ L S P NA L MK C 
QL+TKLM Sbjct: 146 ----------SKKLSGSKRPFPTNSNHGKDSSHLKRSHPDNANL----MKNCSQLMTKLM 191 Query: 1735 KHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNN 1556 K KH WIFN PVDV+GMGLHDY II+ PMDLGTVK L NLY+SP DFA+DVRLTF N Sbjct: 192 KQKHAWIFNKPVDVIGMGLHDYFDIIKRPMDLGTVKLNLSKNLYSSPSDFAADVRLTFQN 251 Query: 1555 ALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEP 1376 A+ YNPKGHDVH AEQLL +FEE+F P K ER +F + + Sbjct: 252 AVTYNPKGHDVHAIAEQLLVKFEELFRPVSEKLGDER--LF---------------DDDL 294 Query: 1375 RRSSWNH-QSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXX 1199 + SSW+H + P +E + P + + Sbjct: 295 QASSWDHVEPERPKKREEKKPEPPVRAPPVPASSSNPIPNSPPVVQLQSPVRTPSPPMRA 354 Query: 1198 XXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRD 1019 K L + PKPKAKDPNKREM+ EEKQKL + LQ+LPQEKMDQV+QII KR+ Sbjct: 355 PPVKPL--------KQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDQVVQIIRKRN 406 Query: 1018 SKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCT 839 L Q GDEIE+DIE +D ETLWELDR V +KKM+SK+KRQA+ N+ + Sbjct: 407 GHLKQDGDEIELDIEAVDIETLWELDRLVTNWKKMVSKIKRQALMNNANNNNSNVAPNKG 466 Query: 838 TVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAAG 683 + + EMP +NFPPVEIEKD G Sbjct: 467 NAELAGSEKIDSVVAEPKRVKKGEGGDEDVDIGDEMPLNNFPPVEIEKDIGG 518 >ref|XP_007042773.1| Global transcription factor group, putative isoform 4 [Theobroma cacao] gi|508706708|gb|EOX98604.1| Global transcription factor group, putative isoform 4 [Theobroma cacao] Length = 483 Score = 357 bits (917), Expect = 1e-95 Identities = 234/541 (43%), Positives = 295/541 (54%), Gaps = 14/541 (2%) Frame = -3 Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264 MASA+LA+R+E +W +PK + K P + PNPNPK + ++Q++ Sbjct: 1 MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56 Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084 D +GH +D+S A A SDD+SS+NRK + + G YVSF+IS+YSRKEL Sbjct: 57 DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108 Query: 2083 
LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904 LK RL+AELEQ+R L +RIES +F RS Sbjct: 109 LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136 Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724 STK+ +SG+KRP + ++ KRL + +MK C Q+L KLMK K+ Sbjct: 137 -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195 Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544 G+IFN PVDVVGMGLHDY II+ PMDLGTVKS++ N Y SP DFA+DVRLTFNNA+ Y Sbjct: 196 GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255 Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364 NPKGH+V++ AEQLLARFEE F P K E + E Q EE++ SS Sbjct: 256 NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304 Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184 W+H KE ER Sbjct: 305 WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362 Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022 V S P KP KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR Sbjct: 363 ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422 Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842 + L Q GDEIE+DIE +D ETLWELDRFV YKKM+SK+KRQA+ A N + + N+ Sbjct: 423 NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNRVS 481 Query: 841 T 839 T Sbjct: 482 T 482