BLASTX nr result

ID: Cocculus22_contig00001054 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00001054
         (2637 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003631709.1| PREDICTED: transcription factor GTE2-like [V...   416   e-113
emb|CBI35445.3| unnamed protein product [Vitis vinifera]              407   e-110
ref|XP_006856632.1| hypothetical protein AMTR_s00046p00231370 [A...   401   e-109
gb|EXC03905.1| Transcription factor GTE7 [Morus notabilis]            395   e-107
ref|XP_007148327.1| hypothetical protein PHAVU_006G199400g [Phas...   378   e-102
gb|ABI14812.1| chloroplast bromodomain-containing protein [Pachy...   373   e-100
ref|XP_007042774.1| Global transcription factor group E2, putati...   372   e-100
ref|XP_007042770.1| Global transcription factor group E2, putati...   372   e-100
ref|XP_006379136.1| hypothetical protein POPTR_0009s08350g [Popu...   370   2e-99
ref|XP_002313212.2| hypothetical protein POPTR_0009s08350g [Popu...   370   2e-99
ref|XP_006379135.1| hypothetical protein POPTR_0009s08350g [Popu...   370   2e-99
ref|XP_004147512.1| PREDICTED: transcription factor GTE7-like [C...   367   2e-98
ref|XP_002518322.1| bromodomain-containing protein, putative [Ri...   363   2e-97
ref|NP_001234374.1| PSTVd RNA-binding protein Virp1d [Solanum ly...   363   2e-97
ref|XP_002298808.2| hypothetical protein POPTR_0001s29240g [Popu...   361   8e-97
emb|CAD43284.1| bromodomain-containing RNA-binding protein 1 [Ni...   361   1e-96
ref|NP_001275266.1| bromodomain-containing RNA-binding protein 1...   358   5e-96
ref|XP_007148328.1| hypothetical protein PHAVU_006G199500g [Phas...   358   7e-96
gb|EXC33022.1| Transcription factor GTE4 [Morus notabilis]            358   9e-96
ref|XP_007042773.1| Global transcription factor group, putative ...   357   1e-95

>ref|XP_003631709.1| PREDICTED: transcription factor GTE2-like [Vitis vinifera]
          Length = 561

 Score =  416 bits (1069), Expect = e-113
 Identities = 263/591 (44%), Positives = 330/591 (55%), Gaps = 13/591 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV---------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQI 2267
            MASA+LASRNE +W + +          +M K  +SNP     NPN N K K  +P   I
Sbjct: 1    MASAVLASRNESNWAQSRGGGGGGGGGGFMGKFHSSNP-----NPN-NSKRKTHAPAGDI 54

Query: 2266 YDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087
             D S           A+  + SDD+SS N++ +      E N G YV+FNI +YSRK+L 
Sbjct: 55   NDLS----------PAVTQSASDDASSFNQRSIV-----EFNRGRYVTFNIGSYSRKDLV 99

Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907
             LK RL++ELE+++NLS+RIES + Q RSG          G R      RP         
Sbjct: 100  QLKNRLVSELEKIQNLSNRIESGDLQLRSG----------GDRTANKQQRPNNK------ 143

Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHK 1727
                        K++G+KRP   DSGR  KR A++ A+      +MK CGQ LTKLMKHK
Sbjct: 144  ------------KIAGNKRPPPFDSGRGPKRSAAENAS------LMKLCGQTLTKLMKHK 185

Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547
            H W+FN PVDVVGMGLHDY+QII+ PMDLGTVKSK+  NLY SP DFA+DVRLTF+NAL 
Sbjct: 186  HSWVFNSPVDVVGMGLHDYHQIIKRPMDLGTVKSKIAKNLYDSPLDFAADVRLTFDNALL 245

Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHS----MFGGANELRQRSAAMAEEME 1379
            YNPKGHDVHV AEQLLARFE++F+P + K E +       + GG        A  +   E
Sbjct: 246  YNPKGHDVHVMAEQLLARFEDLFKPVYNKLEEDERDQERIIVGGGRGGVSAIAGTSGGEE 305

Query: 1378 PRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXX 1199
             + SSWNH   TP+  K+    PV                    + +S            
Sbjct: 306  LQGSSWNH-IPTPERLKKPSPKPVAKKPERMQVPIPATGSSNPPSVQS---VPTPSPMRA 361

Query: 1198 XXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRD 1019
               K L  R  +GKQ PKPKAKDPNKREM+ EEK KL L LQ+LPQEKMDQV+QIISK++
Sbjct: 362  PPVKPLATRPSSGKQ-PKPKAKDPNKREMSLEEKHKLGLGLQSLPQEKMDQVVQIISKKN 420

Query: 1018 SKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCT 839
              L Q GDEIE+DIE +D ETLWELDR V  +KKM+SK+KRQA+    N+++  E     
Sbjct: 421  GHLTQDGDEIELDIEAVDTETLWELDRLVTNWKKMVSKIKRQALMVNNNTSSMNE----- 475

Query: 838  TVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAA 686
                    E ++                      E+P + FPPVEIEKD A
Sbjct: 476  ------RTEPSLAPAMAKKPKKGEAGEEDVDIGDEIPTATFPPVEIEKDDA 520


>emb|CBI35445.3| unnamed protein product [Vitis vinifera]
          Length = 564

 Score =  407 bits (1046), Expect = e-110
 Identities = 251/546 (45%), Positives = 316/546 (57%), Gaps = 1/546 (0%)
 Frame = -3

Query: 2320 SNPNPN-PKEKCSSPRKQIYDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKES 2144
            SNPNPN  K K  +P   I D S           A+  + SDD+SS N++ +      E 
Sbjct: 48   SNPNPNNSKRKTHAPAGDINDLS----------PAVTQSASDDASSFNQRSIV-----EF 92

Query: 2143 NHGGYVSFNISAYSRKELKGLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHG 1964
            N G YV+FNI +YSRK+L  LK RL++ELE+++NLS+RIES + Q RSG          G
Sbjct: 93   NRGRYVTFNIGSYSRKDLVQLKNRLVSELEKIQNLSNRIESGDLQLRSG----------G 142

Query: 1963 GREVTSSTRPPTHSPEASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL 1784
             R      RP                     K++G+KRP   DSGR  KR A++ A+   
Sbjct: 143  DRTANKQQRPNNK------------------KIAGNKRPPPFDSGRGPKRSAAENAS--- 181

Query: 1783 LSGMMKKCGQLLTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLY 1604
               +MK CGQ LTKLMKHKH W+FN PVDVVGMGLHDY+QII+ PMDLGTVKSK+  NLY
Sbjct: 182  ---LMKLCGQTLTKLMKHKHSWVFNSPVDVVGMGLHDYHQIIKRPMDLGTVKSKIAKNLY 238

Query: 1603 TSPHDFASDVRLTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGA 1424
             SP DFA+DVRLTF+NAL YNPKGHDVHV AEQLLARFE++F+P + K E +        
Sbjct: 239  DSPLDFAADVRLTFDNALLYNPKGHDVHVMAEQLLARFEDLFKPVYNKLEEDE------- 291

Query: 1423 NELRQRSAAMAEEMEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLT 1244
               R +   +  E++   SSWNH   TP+  K+    PV                    +
Sbjct: 292  ---RDQERIIVGELQ--GSSWNH-IPTPERLKKPSPKPVAKKPERMQVPIPATGSSNPPS 345

Query: 1243 ERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLP 1064
             +S               K L  R  +GKQ PKPKAKDPNKREM+ EEK KL L LQ+LP
Sbjct: 346  VQS---VPTPSPMRAPPVKPLATRPSSGKQ-PKPKAKDPNKREMSLEEKHKLGLGLQSLP 401

Query: 1063 QEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA 884
            QEKMDQV+QIISK++  L Q GDEIE+DIE +D ETLWELDR V  +KKM+SK+KRQA+ 
Sbjct: 402  QEKMDQVVQIISKKNGHLTQDGDEIELDIEAVDTETLWELDRLVTNWKKMVSKIKRQALM 461

Query: 883  AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVE 704
               N+ ATE N+  ++++E    E ++                      E+P + FPPVE
Sbjct: 462  VNNNTAATEVNR--SSMNE--RTEPSLAPAMAKKPKKGEAGEEDVDIGDEIPTATFPPVE 517

Query: 703  IEKDAA 686
            IEKD A
Sbjct: 518  IEKDDA 523


>ref|XP_006856632.1| hypothetical protein AMTR_s00046p00231370 [Amborella trichopoda]
            gi|548860513|gb|ERN18099.1| hypothetical protein
            AMTR_s00046p00231370 [Amborella trichopoda]
          Length = 602

 Score =  401 bits (1031), Expect = e-109
 Identities = 259/612 (42%), Positives = 336/612 (54%), Gaps = 36/612 (5%)
 Frame = -3

Query: 2419 MASALLASRNEP-HWGEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSGHGR 2243
            MASALLAS+NE  +WG+ KVYMRK PN   T              S P     +   HG 
Sbjct: 1    MASALLASQNESTYWGDRKVYMRKAPNPKQTLTLD----------SHPHHHNLELPPHGN 50

Query: 2242 QMDESAAALVLAKSDDSSSLNRKFVSLNNR--------KESNHGGYVSFNISAYSRKELK 2087
            +         LA S DSSSLNRK +SLN +        K S+H  Y+++N+ +YS+++L+
Sbjct: 51   E----PVLTTLAASSDSSSLNRKSISLNRKEPQLQASAKFSDH--YITYNVGSYSKQDLR 104

Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRS----------GYSATQFSGGHGGREVTSSTR 1937
             L+KRL+ ELEQVR L++RIESR                   A          + T    
Sbjct: 105  DLRKRLVLELEQVRTLANRIESRSLWEPETPGSGRPPPLNLQALDLQADGPKEKRTPKAN 164

Query: 1936 PPTHSPEASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDT-KRLA--SDPANAKLLSGMMK 1766
                + E   GK     +       GSKRPN +    ++ KR+A   DP   KL+S  MK
Sbjct: 165  QYYRASEFVMGKEKMPAQENKKVFGGSKRPNPVTKVSESGKRMAISPDPVTGKLVSDFMK 224

Query: 1765 KCGQLLTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDF 1586
            +CGQ+LTKLMKHKHGW+FNVPVDVVGMGLHDY  +I+ PMDLGTVK++L  + Y++P +F
Sbjct: 225  RCGQILTKLMKHKHGWVFNVPVDVVGMGLHDYYTLIKNPMDLGTVKTRLNQSFYSTPLEF 284

Query: 1585 ASDVRLTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESE--------RHSMFG 1430
            A+DVRLTF+NAL YNPKGHDV++ AEQLL  FEEM+ P   +YE E        R + F 
Sbjct: 285  AADVRLTFHNALTYNPKGHDVNIMAEQLLGFFEEMWNPAFNRYEEERRRAVEEARRNSFS 344

Query: 1429 GANELRQRSAAMAEEMEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXX 1250
            G   +R +SAA+ E   P       + R P S+K+ + +                     
Sbjct: 345  GEFPVR-KSAALPEIAMP------VEKRQPQSSKKLDPW--------------------- 376

Query: 1249 LTERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQN 1070
                                  +G+R     + PKPKA+DPNKREM+FEEKQKLS +LQN
Sbjct: 377  --------MPPTSVKTKGGGNLVGIRPVQLGKQPKPKARDPNKREMSFEEKQKLSTSLQN 428

Query: 1069 LPQEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKR-- 896
            LPQEKMDQV+QII +R+S LAQ GDEIEVDI+ +D ETLWELDRFV   KKM+SK+KR  
Sbjct: 429  LPQEKMDQVVQIIRRRNSNLAQDGDEIEVDIDVVDTETLWELDRFVSNCKKMMSKVKRKA 488

Query: 895  ----QAIAAGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMP 728
                QA+   Q +  T  + + + V + PEA A                        +MP
Sbjct: 489  GLEDQAMLVQQVNQGTSVDVEKSPVHDTPEAAAA------KKSKKGDQAEEDVDIGDDMP 542

Query: 727  ASNFPPVEIEKD 692
            ++NFPPVEIEKD
Sbjct: 543  STNFPPVEIEKD 554


>gb|EXC03905.1| Transcription factor GTE7 [Morus notabilis]
          Length = 605

 Score =  395 bits (1016), Expect = e-107
 Identities = 263/593 (44%), Positives = 328/593 (55%), Gaps = 17/593 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKVYMRK---NPNSNPTNWRSNPNPNPKEKCSSPR----KQIYD 2261
            MASALLASRNEP WGE KVYMRK   N + NP   + NPNPNP    S+P     +Q YD
Sbjct: 1    MASALLASRNEPSWGENKVYMRKFTTNASKNPL-LKPNPNPNPNP-ISNPNTGSVRQPYD 58

Query: 2260 RSG--HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087
             SG    RQ+D   ++        +SS NRK +SL   + S+ G YV+FN+ ++SRKELK
Sbjct: 59   ASGGFRFRQIDNHTSSAPAP----TSSPNRKAMSLIEPRVSSQG-YVTFNVGSFSRKELK 113

Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907
             LK RL +ELEQVR L SRIES    S           G   +   +S +    S E   
Sbjct: 114  ELKMRLRSELEQVRALMSRIESGSCHS--------LPKGAEKKNPKASLKNYQES-ELVA 164

Query: 1906 GKGSTKQKHMTMKV-----SGSKRPNQLDSGR-DTKRLASDPANAKLLSGMMKKCGQLLT 1745
            GKG  K+K   M V      G+KR N       D KR A DP + KL+  M+K+CGQ+LT
Sbjct: 165  GKGKKKKKKDVMSVVAVDSKGTKRSNPFGGVMADPKRPAIDPISEKLVGSMLKRCGQILT 224

Query: 1744 KLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLT 1565
            KLMKHK GW+FN PVDV G+ LHDY+QI++ PMDLGTVKS L  +LY SP DFASDVRL 
Sbjct: 225  KLMKHKFGWVFNAPVDVDGLKLHDYHQIVKNPMDLGTVKSNLERDLYPSPLDFASDVRLA 284

Query: 1564 FNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEE 1385
            FNNAL YNPKG DV+  AEQLL +F +MF P ++K E ER  + G  +        +A E
Sbjct: 285  FNNALLYNPKGSDVNFMAEQLLIQFNQMFNPAYKKLEDERRRVLGFGD------PNVAPE 338

Query: 1384 MEPRRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXX 1205
               R        + PD  +    FP                    LT+ S          
Sbjct: 339  NARREIDTMVAVKKPDLVRSKPTFP-------DPTPPPPVHYTQALTKPSVPAPAPSPVS 391

Query: 1204 XXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISK 1025
                     V+       PKPKAKDPNKR+MT+EEK KL  NLQNLP EKM Q+L I+ K
Sbjct: 392  KPPLVNSRPVKL------PKPKAKDPNKRQMTYEEKAKLGANLQNLPTEKMVQLLHILKK 445

Query: 1024 RDSK--LAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEEN 851
            R+ +  L+Q G+EIE+DIE +D ETLWELDRFVG YKKM+SKMKRQA+   QN      +
Sbjct: 446  RNDQCHLSQDGEEIELDIEAVDTETLWELDRFVGNYKKMVSKMKRQALMQAQNPAPQNTD 505

Query: 850  KQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
            +  +   +  +A A +                      ++P  +FPPV IEKD
Sbjct: 506  RNTSLESDPADAAAVV-----KSKKVDAAAEEDVDIGEDIPMGDFPPVVIEKD 553


>ref|XP_007148327.1| hypothetical protein PHAVU_006G199400g [Phaseolus vulgaris]
            gi|561021550|gb|ESW20321.1| hypothetical protein
            PHAVU_006G199400g [Phaseolus vulgaris]
          Length = 527

 Score =  378 bits (970), Expect = e-102
 Identities = 250/581 (43%), Positives = 306/581 (52%), Gaps = 5/581 (0%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV----YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            MASA+LA+RNEP+W + +V    +M K P +NP     NPN NPK   +S R Q      
Sbjct: 1    MASAVLANRNEPNWPQHRVGGAGFMGKAPFANP-----NPNSNPK-LANSKRNQ------ 48

Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072
                          + SDD+SS+NR+     + +   H  YVSFNI + S+KEL  +K R
Sbjct: 49   --------------SASDDASSINRR-----SNEVVTHSQYVSFNIGSLSKKELGDIKNR 89

Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892
            L++ELEQV+   +RIES E Q         F+GGH               P+ S  K   
Sbjct: 90   LVSELEQVQKFRTRIESGELQP-----GQSFNGGH---------------PKKSSSK--- 126

Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHGWIF 1712
                   KVSG+KRP  L+S +D KR   +  N      +MK C Q+L KLMKHKHGWIF
Sbjct: 127  -------KVSGNKRPLPLNSAKDFKRSLPEVGN------LMKGCSQVLQKLMKHKHGWIF 173

Query: 1711 NVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKG 1532
            NVPVD VGMGLHDY  II+ PMDLGTVKS L  N Y++P DFASDVRLTF NAL YNPKG
Sbjct: 174  NVPVDAVGMGLHDYYDIIKQPMDLGTVKSNLSKNKYSAPSDFASDVRLTFKNALTYNPKG 233

Query: 1531 HDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQ 1352
            HDV+  AEQLL RFEE++ P H K+E     + G   +          E E + SSW+H 
Sbjct: 234  HDVYTMAEQLLTRFEELYRPMHEKFE----DLVGHDRDF---------EEELQASSWSHV 280

Query: 1351 SRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQLGVR 1172
               P+  K+ E                       L +                 KQ    
Sbjct: 281  E--PERVKKKENLIPHAKFQQEPPQPPASSSNPPLLQFPVRTPSPMRAPPVKPLKQ---- 334

Query: 1171 SGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDE 992
                   PKPKAKDPNKREM+ EEK KL L LQ+LP EKM+QV+QII +R+  L Q GDE
Sbjct: 335  -------PKPKAKDPNKREMSLEEKHKLGLGLQSLPAEKMEQVVQIIRRRNGHLKQDGDE 387

Query: 991  IEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTTVDEVPEAE 812
            IE+DIE +D ETLWELDR V  YKKM+SK+KRQA+    N    + N       E+P  E
Sbjct: 388  IELDIEAVDTETLWELDRLVTNYKKMVSKIKRQALMGNNNVAPHKANM------ELPAGE 441

Query: 811  -ATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
             A                        EMP S FPPVEIEKD
Sbjct: 442  KADGMPTELKKPKKVEAGDEDVDIGDEMPMSMFPPVEIEKD 482


>gb|ABI14812.1| chloroplast bromodomain-containing protein [Pachysandra terminalis]
          Length = 428

 Score =  373 bits (957), Expect = e-100
 Identities = 217/396 (54%), Positives = 255/396 (64%), Gaps = 2/396 (0%)
 Frame = -3

Query: 1870 KVSGSKRPNQLDSGRDTKRLASDPA-NAKLLSGMMKKCGQLLTKLMKHKHGWIFNVPVDV 1694
            KVSGSKRP    SGRD+KR AS+PA   K+LS MMK+CGQ+LTKLM+HKHGWIFNVPVDV
Sbjct: 3    KVSGSKRPLPFTSGRDSKRPASEPAPTGKMLSSMMKQCGQILTKLMRHKHGWIFNVPVDV 62

Query: 1693 VGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKGHDVHVA 1514
            VGMGLHDYNQII+ PMDLGTVK  +G NLY+SP DFASDVRLTFNNAL YNPKGHDV+  
Sbjct: 63   VGMGLHDYNQIIKHPMDLGTVKLNIGKNLYSSPLDFASDVRLTFNNALSYNPKGHDVYAM 122

Query: 1513 AEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQSRTPDS 1334
            AEQLL RFEEMFEP ++K+E  +          R+ SA      E RRSSW+HQ   P+S
Sbjct: 123  AEQLLVRFEEMFEPAYKKFEDAQQ---------RKISAG-----EIRRSSWSHQIPMPES 168

Query: 1333 AKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ-LGVRSGTGK 1157
                 R P+                      ++               K  + +RS T K
Sbjct: 169  IP--NRDPLSSSAATRPGGFAHPMPLSTPQPQAFPQALASTSAPAPAPKPFMAMRSATVK 226

Query: 1156 QPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDEIEVDI 977
            Q PKPKAKDPNKREM+FEEK KL L+LQ+LPQEKM+QV+QII KR+  LAQ GDEIE+DI
Sbjct: 227  Q-PKPKAKDPNKREMSFEEKHKLGLSLQSLPQEKMEQVVQIIRKRNGHLAQDGDEIELDI 285

Query: 976  EYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTTVDEVPEAEATMXX 797
            E +D ETLWELDRFV   KK++SK+KRQA+ +   +TA E NK   + D    AEA    
Sbjct: 286  EVVDTETLWELDRFVYNCKKLMSKIKRQALVSNNQNTAEEGNKSPVS-DSHEAAEAA--- 341

Query: 796  XXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDA 689
                                E+P SNFPPVEIEKDA
Sbjct: 342  -SAKKIKKGEIGEEDVDIGEEIPTSNFPPVEIEKDA 376


>ref|XP_007042774.1| Global transcription factor group E2, putative isoform 5, partial
            [Theobroma cacao] gi|508706709|gb|EOX98605.1| Global
            transcription factor group E2, putative isoform 5,
            partial [Theobroma cacao]
          Length = 547

 Score =  372 bits (956), Expect = e-100
 Identities = 249/590 (42%), Positives = 315/590 (53%), Gaps = 14/590 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264
            MASA+LA+R+E +W  +PK  + K     P    + PNPNPK    + ++Q++       
Sbjct: 1    MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56

Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084
            D +GH   +D+S A    A SDD+SS+NRK        + + G YVSF+IS+YSRKEL  
Sbjct: 57   DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108

Query: 2083 LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904
            LK RL+AELEQ+R L +RIES +F  RS                                
Sbjct: 109  LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136

Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724
              STK+      +SG+KRP   +  ++ KRL          + +MK C Q+L KLMK K+
Sbjct: 137  -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195

Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544
            G+IFN PVDVVGMGLHDY  II+ PMDLGTVKS++  N Y SP DFA+DVRLTFNNA+ Y
Sbjct: 196  GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255

Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364
            NPKGH+V++ AEQLLARFEE F P   K E +         E  Q      EE++   SS
Sbjct: 256  NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304

Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184
            W+H        KE ER                                            
Sbjct: 305  WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362

Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022
              V S     P KP      KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR
Sbjct: 363  ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422

Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842
            +  L Q GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+ A  N  + + N++ 
Sbjct: 423  NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNREE 481

Query: 841  TTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
             TV+++  A                          EMP S+FPPVEIEKD
Sbjct: 482  VTVEKIEVA------MEMKKPKKGDAGEEDVDIGDEMPMSSFPPVEIEKD 525


>ref|XP_007042770.1| Global transcription factor group E2, putative isoform 1 [Theobroma
            cacao] gi|590687812|ref|XP_007042771.1| Global
            transcription factor group E2, putative isoform 1
            [Theobroma cacao] gi|590687818|ref|XP_007042772.1| Global
            transcription factor group E2, putative isoform 1
            [Theobroma cacao] gi|508706705|gb|EOX98601.1| Global
            transcription factor group E2, putative isoform 1
            [Theobroma cacao] gi|508706706|gb|EOX98602.1| Global
            transcription factor group E2, putative isoform 1
            [Theobroma cacao] gi|508706707|gb|EOX98603.1| Global
            transcription factor group E2, putative isoform 1
            [Theobroma cacao]
          Length = 566

 Score =  372 bits (956), Expect = e-100
 Identities = 249/590 (42%), Positives = 315/590 (53%), Gaps = 14/590 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264
            MASA+LA+R+E +W  +PK  + K     P    + PNPNPK    + ++Q++       
Sbjct: 1    MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56

Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084
            D +GH   +D+S A    A SDD+SS+NRK        + + G YVSF+IS+YSRKEL  
Sbjct: 57   DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108

Query: 2083 LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904
            LK RL+AELEQ+R L +RIES +F  RS                                
Sbjct: 109  LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136

Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724
              STK+      +SG+KRP   +  ++ KRL          + +MK C Q+L KLMK K+
Sbjct: 137  -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195

Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544
            G+IFN PVDVVGMGLHDY  II+ PMDLGTVKS++  N Y SP DFA+DVRLTFNNA+ Y
Sbjct: 196  GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255

Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364
            NPKGH+V++ AEQLLARFEE F P   K E +         E  Q      EE++   SS
Sbjct: 256  NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304

Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184
            W+H        KE ER                                            
Sbjct: 305  WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362

Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022
              V S     P KP      KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR
Sbjct: 363  ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422

Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842
            +  L Q GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+ A  N  + + N++ 
Sbjct: 423  NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNREE 481

Query: 841  TTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
             TV+++  A                          EMP S+FPPVEIEKD
Sbjct: 482  VTVEKIEVA------MEMKKPKKGDAGEEDVDIGDEMPMSSFPPVEIEKD 525


>ref|XP_006379136.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa]
            gi|566186955|ref|XP_006379137.1| hypothetical protein
            POPTR_0009s08350g [Populus trichocarpa]
            gi|550331302|gb|ERP56933.1| hypothetical protein
            POPTR_0009s08350g [Populus trichocarpa]
            gi|550331303|gb|ERP56934.1| hypothetical protein
            POPTR_0009s08350g [Populus trichocarpa]
          Length = 547

 Score =  370 bits (949), Expect = 2e-99
 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261
            MASA+LA+RNEP+W +P+       +M K P SNP     NP  + K +   P+  QI D
Sbjct: 1    MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55

Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081
                   +DES +A     SDD+SS+NR+    NN  + N GGYVSFN+S+ S+KEL  L
Sbjct: 56   -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102

Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901
            K RL+ ELE++R L +RIES +F                            H  + S   
Sbjct: 103  KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134

Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721
             S KQ     KVSG+KRP    S  +  + +S P NA+L    MK C Q+L+KLMK K G
Sbjct: 135  SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189

Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541
            +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L  NLY SP DFA+DVRLTFNNA++YN
Sbjct: 190  YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249

Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361
            PKGH+V++ AEQ L RF++++ P   K             ++ +    + +E++   SSW
Sbjct: 250  PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297

Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181
            +H  R P+   + +   +                       S               KQ 
Sbjct: 298  DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356

Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001
                      PKPKAKDPNKREM  EEK KL + LQ+LPQEKM+QV+QII KR+  L Q 
Sbjct: 357  ----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406

Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836
            GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+      AG  + +   NK    
Sbjct: 407  GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462

Query: 835  VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
              +VP  +                         EMP S+FPPVEIEKD
Sbjct: 463  --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508


>ref|XP_002313212.2| hypothetical protein POPTR_0009s08350g [Populus trichocarpa]
            gi|550331301|gb|EEE87167.2| hypothetical protein
            POPTR_0009s08350g [Populus trichocarpa]
          Length = 546

 Score =  370 bits (949), Expect = 2e-99
 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261
            MASA+LA+RNEP+W +P+       +M K P SNP     NP  + K +   P+  QI D
Sbjct: 1    MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55

Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081
                   +DES +A     SDD+SS+NR+    NN  + N GGYVSFN+S+ S+KEL  L
Sbjct: 56   -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102

Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901
            K RL+ ELE++R L +RIES +F                            H  + S   
Sbjct: 103  KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134

Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721
             S KQ     KVSG+KRP    S  +  + +S P NA+L    MK C Q+L+KLMK K G
Sbjct: 135  SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189

Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541
            +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L  NLY SP DFA+DVRLTFNNA++YN
Sbjct: 190  YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249

Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361
            PKGH+V++ AEQ L RF++++ P   K             ++ +    + +E++   SSW
Sbjct: 250  PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297

Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181
            +H  R P+   + +   +                       S               KQ 
Sbjct: 298  DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356

Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001
                      PKPKAKDPNKREM  EEK KL + LQ+LPQEKM+QV+QII KR+  L Q 
Sbjct: 357  ----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406

Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836
            GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+      AG  + +   NK    
Sbjct: 407  GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462

Query: 835  VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
              +VP  +                         EMP S+FPPVEIEKD
Sbjct: 463  --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508


>ref|XP_006379135.1| hypothetical protein POPTR_0009s08350g [Populus trichocarpa]
            gi|550331300|gb|ERP56932.1| hypothetical protein
            POPTR_0009s08350g [Populus trichocarpa]
          Length = 541

 Score =  370 bits (949), Expect = 2e-99
 Identities = 243/588 (41%), Positives = 316/588 (53%), Gaps = 12/588 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QIYD 2261
            MASA+LA+RNEP+W +P+       +M K P SNP     NP  + K +   P+  QI D
Sbjct: 1    MASAVLANRNEPNWTQPQPRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQPPQIPD 55

Query: 2260 RSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGL 2081
                   +DES +A     SDD+SS+NR+    NN  + N GGYVSFN+S+ S+KEL  L
Sbjct: 56   -------VDESPSAA----SDDASSINRR--PQNNHHDFNTGGYVSFNVSSCSKKELIEL 102

Query: 2080 KKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGK 1901
            K RL+ ELE++R L +RIES +F                            H  + S   
Sbjct: 103  KSRLVYELEKIRELKNRIESSDF----------------------------HIGQPSSNF 134

Query: 1900 GSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHG 1721
             S KQ     KVSG+KRP    S  +  + +S P NA+L    MK C Q+L+KLMK K G
Sbjct: 135  SSKKQTSTNKKVSGNKRPFPAPSNFNNFKRSS-PDNAQL----MKNCSQILSKLMKQKLG 189

Query: 1720 WIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYN 1541
            +IFN PVDVVG+ LHDY+ II+ PMDLGTVK+ L  NLY SP DFA+DVRLTFNNA++YN
Sbjct: 190  YIFNTPVDVVGLQLHDYHDIIKNPMDLGTVKTNLSKNLYESPRDFAADVRLTFNNAMKYN 249

Query: 1540 PKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSW 1361
            PKGH+V++ AEQ L RF++++ P   K             ++ +    + +E++   SSW
Sbjct: 250  PKGHEVYILAEQFLTRFQDLYRPIKEKV----------GEDVEEEENDLVQEVQ--ASSW 297

Query: 1360 NHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQL 1181
            +H  R P+   + +   +                       S               KQ 
Sbjct: 298  DHIRREPERVSKIDGDFMPVTAKSDPIGQQQQPTGMNQNPNSVRTPSPMRVPQVKPLKQ- 356

Query: 1180 GVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQH 1001
                      PKPKAKDPNKREM  EEK KL + LQ+LPQEKM+QV+QII KR+  L Q 
Sbjct: 357  ----------PKPKAKDPNKREMNLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHLRQE 406

Query: 1000 GDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQCTT 836
            GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+      AG  + +   NK    
Sbjct: 407  GDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINTNAGATAISEGNNK---- 462

Query: 835  VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
              +VP  +                         EMP S+FPPVEIEKD
Sbjct: 463  --DVPGNDRMEVVNEAKKPKKGDVGDEDVDIGDEMPMSSFPPVEIEKD 508


>ref|XP_004147512.1| PREDICTED: transcription factor GTE7-like [Cucumis sativus]
            gi|449511376|ref|XP_004163939.1| PREDICTED: transcription
            factor GTE7-like [Cucumis sativus]
          Length = 533

 Score =  367 bits (942), Expect = 2e-98
 Identities = 251/591 (42%), Positives = 305/591 (51%), Gaps = 12/591 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV--------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY 2264
            MASA+LA+RNE +W +P+         +M K P SNP     NP  N K+         +
Sbjct: 1    MASAVLANRNEANWPQPRGNGRGTEEGFMGKVPFSNP-----NPKFNKKQ---------F 46

Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESN---HGGYVSFNISAYSRKE 2093
                +G QMD+S A    A SDD+SS+N      ++R+ SN      YVSFN+S+ SRKE
Sbjct: 47   HGEMNGFQMDDSPAVTQSA-SDDASSIN------HHRRLSNGVDFSQYVSFNVSSCSRKE 99

Query: 2092 LKGLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEA 1913
            L  LK RLI+ELEQ+R L SRI S E  SR                       P H  + 
Sbjct: 100  LIELKTRLISELEQIRQLKSRINSGELHSR-----------------------PKHQKKF 136

Query: 1912 SGGKGSTKQKHMTMKVSGSKRPNQLDS-GRDTKRLASDPANAKLLSGMMKKCGQLLTKLM 1736
            S             K  G+KRP    S G + KR  SD  N      ++K C Q+LTKLM
Sbjct: 137  S-------------KTLGTKRPLPTSSNGMELKRSNSDNGN------LLKACSQILTKLM 177

Query: 1735 KHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNN 1556
            KHKHGWIFN PVDVVGMGLHDY  I++ PMDLG+VK KLG + Y SP+DFASDVRLTF N
Sbjct: 178  KHKHGWIFNKPVDVVGMGLHDYYDIVKRPMDLGSVKVKLGKDAYESPYDFASDVRLTFKN 237

Query: 1555 ALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEP 1376
            A+ YNPKGHDVH  AEQLL RFEE+F P     E E     G   EL             
Sbjct: 238  AMTYNPKGHDVHAMAEQLLVRFEELFRPVAEALEEEDRRFCGYQEEL------------- 284

Query: 1375 RRSSWNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXX 1196
              SSWNH        K++ +  V                       S             
Sbjct: 285  PASSWNHSEAERTVKKDNIQKQVVKKTEPMKAP-------------SSSSNPPMMQSPVK 331

Query: 1195 XAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDS 1016
                L        + PKP+AKDPNKREMT EEK KL + LQ+LP EKM+QV+QII KR+ 
Sbjct: 332  TPSPLRAPPVKPLKQPKPRAKDPNKREMTLEEKHKLGIGLQSLPPEKMEQVVQIIKKRNG 391

Query: 1015 KLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCTT 836
             L Q GDEIE+DIE +D ETLWELDR V  +KKM+SK+KRQA+     + + + N    T
Sbjct: 392  HLKQDGDEIELDIEAVDTETLWELDRLVTNWKKMMSKIKRQALI---TAASMKPNGVMPT 448

Query: 835  VDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAAG 683
             +++     T                       EMPASNFPPVEIEKDA G
Sbjct: 449  PEKIEVGSET------KKQRKGEAGEEDVDIGDEMPASNFPPVEIEKDAGG 493


>ref|XP_002518322.1| bromodomain-containing protein, putative [Ricinus communis]
            gi|223542542|gb|EEF44082.1| bromodomain-containing
            protein, putative [Ricinus communis]
          Length = 634

 Score =  363 bits (933), Expect = 2e-97
 Identities = 252/615 (40%), Positives = 326/615 (53%), Gaps = 38/615 (6%)
 Frame = -3

Query: 2422 FMASALLASRNEPHWGEPK---VYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            +MASA+LA+RNE +W +P+    +M K P SNP       NPNP  K S  R        
Sbjct: 54   YMASAVLANRNEANWTQPRGGAKFMGKVPFSNP-------NPNPNSKFSKKR-------- 98

Query: 2251 HGRQMDESAAALV--------LAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRK 2096
               Q   +AAA           A SDD+SS+NR+  +      S+   YV+FNI +YS+K
Sbjct: 99   ---QFQSAAAAPPPVNFDIHPAASSDDASSINRRPAAT----ASDFNSYVTFNIGSYSKK 151

Query: 2095 ELKGLKKRLIAELEQVRNLSSRIESRE-FQSRSGYSATQFSGGHGGREVTSSTRPPTHSP 1919
            EL  LK RL+AELEQ+R L +RI+S + FQ RS      F+G    ++VT + RP    P
Sbjct: 152  ELLELKSRLVAELEQIRQLKNRIDSSQSFQIRS---TPNFNGKKQNKKVTGNKRP---FP 205

Query: 1918 EASGGKGSTKQKHMTMKVSGSKRPNQLDSGRDTKR---LASDPANAKLLSGMMKKCGQLL 1748
             A+   G                       +D KR     S P N +L    MKKCGQ+L
Sbjct: 206  SATTNYGFV--------------------AKDVKRSDLYNSHPENVQL----MKKCGQML 241

Query: 1747 TKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRL 1568
            TKLMKHK G+IFN PVDV  M LHDY +II+ PMDLGTVK KLG+N Y SP DFA+DVRL
Sbjct: 242  TKLMKHKFGYIFNEPVDVERMNLHDYFEIIKKPMDLGTVKKKLGSNEYESPIDFAADVRL 301

Query: 1567 TFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRK-----YESERHSMFGGANELRQRS 1403
            TFNNA++YNPKGH+V+  AEQ L+RFEE+F P   K      + ++  +     E+    
Sbjct: 302  TFNNAMKYNPKGHEVYTFAEQFLSRFEELFRPIREKLGDFVLDDDQDQIVHHDREIEHEQ 361

Query: 1402 AAMAEEM-EPRRSSWNHQSRTP-------DSAKEHERFPVXXXXXXXXXXXXXXXXXXXL 1247
                E++ E + SSW+H S          +  K+ +   +                    
Sbjct: 362  EHEHEQVHEVQASSWDHHSLNRRGGSGDIERVKKDQENVLQITSKSDHPIGKSVPPSVLS 421

Query: 1246 TERSXXXXXXXXXXXXXXAKQLGVRSGTGKQPP--------KPKAKDPNKREMTFEEKQK 1091
              +S                QL VR+ +  + P        KPKAKDPNKREM+ EEK K
Sbjct: 422  NPQS--------------TSQLPVRTPSPMRAPPVKPVKLPKPKAKDPNKREMSLEEKHK 467

Query: 1090 LSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMI 911
            L + LQ+LPQEKM+QV+QII KR+  L Q GDEIE+DIE +D ETLWELDRFV  YKKM+
Sbjct: 468  LGVGLQSLPQEKMEQVVQIIRKRNGHLRQDGDEIELDIEAVDTETLWELDRFVTNYKKMV 527

Query: 910  SKMKRQAI--AAGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXX 737
            SK+KRQA+   A   +  +E NK  +  + +   EA                        
Sbjct: 528  SKIKRQALMGIAPTGNAVSEGNKDVSVNERIDITEA-------KKPKKGDAGDEDVDIGD 580

Query: 736  EMPASNFPPVEIEKD 692
            EMP S+FPPVEIEKD
Sbjct: 581  EMPMSSFPPVEIEKD 595


>ref|NP_001234374.1| PSTVd RNA-binding protein Virp1d [Solanum lycopersicum]
            gi|10179602|gb|AAG13810.1|AF190891_1 PSTVd RNA-binding
            protein Virp1a [Solanum lycopersicum]
            gi|10179604|gb|AAG13811.1|AF190892_1 PSTVd RNA-binding
            protein Virp1b [Solanum lycopersicum]
            gi|10179606|gb|AAG13812.1|AF190893_1 PSTVd RNA-binding
            protein Virp1c [Solanum lycopersicum]
            gi|10179608|gb|AAG13813.1|AF190894_1 PSTVd RNA-binding
            protein Virp1d [Solanum lycopersicum]
            gi|13186132|emb|CAC33448.1| PSTVd RNA-biding protein,
            Virp1 [Solanum lycopersicum] gi|13186134|emb|CAC33449.1|
            PSTVd RNA-biding protein, Virp1 [Solanum lycopersicum]
            gi|13186136|emb|CAC33450.1| PSTVd RNA-biding protein,
            Virp1 [Solanum lycopersicum] gi|13186138|emb|CAC33451.1|
            PSTVd RNA-biding protein, Virp1 [Solanum lycopersicum]
          Length = 602

 Score =  363 bits (932), Expect = 2e-97
 Identities = 242/607 (39%), Positives = 313/607 (51%), Gaps = 28/607 (4%)
 Frame = -3

Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            MASA+LASRNE  W    G    +M K P S+ T    N NPNPK+K    +KQ +  S 
Sbjct: 1    MASAVLASRNESSWAQSGGAGGGFMGKTPYSH-TQLNPNHNPNPKKK----QKQFHHTS- 54

Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072
            +GR MD+S A    A  D  S   R   S  N    N GGY++FN+ +Y++ E+  L+ R
Sbjct: 55   NGRHMDDSPAVTQTASDDAYSFNQRPIESTTNVDGLNFGGYLTFNVVSYNKAEVNELRSR 114

Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892
            L+AE+EQ+RNL  RIES +                      S+T P +        +G +
Sbjct: 115  LMAEVEQIRNLKDRIESGQL---------------------STTNPRS--------QGKS 145

Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-------------LSGMMKKCGQL 1751
            K      K SG+KRP    S +D K+L +   N                   MMK+C Q+
Sbjct: 146  K------KQSGNKRPTPSGSSKDLKKLPNGVENRNFGNPGGVDGVKAIGTESMMKECRQI 199

Query: 1750 LTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVR 1571
            L KLMKHK+GWIFN+PVD   +GLHDY+QII+ PMDLGTVKS L  N Y SP +FA+DVR
Sbjct: 200  LAKLMKHKNGWIFNIPVDAEALGLHDYHQIIKRPMDLGTVKSNLAKNFYPSPFEFAADVR 259

Query: 1570 LTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMA 1391
            LTFNNAL YNPK   V+  AEQLL RFE+MF P     + + + + GG     +R     
Sbjct: 260  LTFNNALLYNPKTDQVNAFAEQLLGRFEDMFRP----LQDKMNKLEGG-----RRDYHPV 310

Query: 1390 EEMEPRRSSWNHQSRTPDSAKEHERFPV---------XXXXXXXXXXXXXXXXXXXLTER 1238
            +E++   SSWNH   TP+  K+ +  PV                               +
Sbjct: 311  DELQ--GSSWNH-IPTPERVKKPKPTPVPNISKKQERMQNHSSASTPSLPVPPPNPPARQ 367

Query: 1237 SXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQE 1058
                             Q   +  T  + PKP+AKDPNKREM  EEK KL + LQ+LPQE
Sbjct: 368  QSPLSTPSPVRAPAAKPQSAAKVPTMGKQPKPRAKDPNKREMNMEEKHKLGVGLQSLPQE 427

Query: 1057 KMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-- 884
            KM Q++QII KR+  LAQ GDEIE+DIE +D ETLWELDRFV  +KKM+SK KRQA+   
Sbjct: 428  KMPQLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALMNN 487

Query: 883  AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVE 704
             G  S +   +   T+V E      +                       + PA++FPPVE
Sbjct: 488  LGPPSASAAASAATTSVAEADGPTTSEKNDSFKKAKKGDVGEEDVEIEDDEPATHFPPVE 547

Query: 703  IEKDAAG 683
            IEKD  G
Sbjct: 548  IEKDEGG 554


>ref|XP_002298808.2| hypothetical protein POPTR_0001s29240g [Populus trichocarpa]
            gi|550348456|gb|EEE83613.2| hypothetical protein
            POPTR_0001s29240g [Populus trichocarpa]
          Length = 474

 Score =  361 bits (927), Expect = 8e-97
 Identities = 233/541 (43%), Positives = 302/541 (55%), Gaps = 15/541 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWGEPKV--------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRK-QI 2267
            MASA+LA+RNEP W +P+         +M K P SNP     NP  + K +   P++ QI
Sbjct: 1    MASAVLANRNEPSWTQPQPQQRGGGAKFMGKIPFSNP-----NPKFSKKRQFQPPQQPQI 55

Query: 2266 YDRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087
             D       +DES +A     SDD+SS+NR+    NN ++ N GG+V+FN+ +YS+KEL 
Sbjct: 56   LD-------VDESPSAA----SDDASSINRR--PQNNHQDFNTGGFVTFNVGSYSKKELI 102

Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907
             LK RL+ ELE++R+L +RIES E Q R                              S 
Sbjct: 103  ELKNRLVHELEKIRDLKNRIESSESQIRQ-----------------------------SS 133

Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHK 1727
                 KQ     KVSG+KRP    S  +  +  S+P NA+L    MK C Q+L+KLMKHK
Sbjct: 134  NFSYKKQTSTNKKVSGNKRPFPAPSNFNNLK-RSNPENAQL----MKNCSQILSKLMKHK 188

Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547
             G+IFN PVDVVGM LHDY+ II+ PMDLGTVKSKL  NLY SP DFA+DVRLTFNNA++
Sbjct: 189  LGYIFNSPVDVVGMQLHDYHDIIKSPMDLGTVKSKLTKNLYESPRDFAADVRLTFNNAMK 248

Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRS 1367
            YNPKGH+V++ AEQ L RFE+ + P   K            ++  +      +E++   S
Sbjct: 249  YNPKGHEVYMLAEQFLTRFEDFYRPIKEKV----------GDDFDEEENDQVQEVQ--AS 296

Query: 1366 SWNHQSRTPDSAKE-HERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXA 1190
            SW+H  R P+   +  + F                      T  +               
Sbjct: 297  SWDHIRREPERVNQIDDDFMQVTAKSDPIGHQMHQQPLQQPTGLNQNPNLVRTPSPMRMP 356

Query: 1189 KQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKL 1010
            +   V+       PKPKAKDPNKREM+ EEK KL + LQ+LPQEKM+QV+QII KR+  L
Sbjct: 357  QVKPVKQ------PKPKAKDPNKREMSLEEKHKLGVGLQSLPQEKMEQVVQIIRKRNGHL 410

Query: 1009 AQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA-----AGQNSTATEENKQ 845
             Q GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+       G  ST+   NK 
Sbjct: 411  RQEGDEIELDIEAVDTETLWELDRFVTNYKKMVSKIKRQALMGINNNVGAISTSEGNNKV 470

Query: 844  C 842
            C
Sbjct: 471  C 471


>emb|CAD43284.1| bromodomain-containing RNA-binding protein 1 [Nicotiana benthamiana]
          Length = 615

 Score =  361 bits (926), Expect = 1e-96
 Identities = 245/608 (40%), Positives = 320/608 (52%), Gaps = 29/608 (4%)
 Frame = -3

Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            MASA+LASRNE  W    G    +M K P S+ T+   N NP PK+K    +KQ +  S 
Sbjct: 1    MASAVLASRNESSWAQSGGAGGGFMGKTPYSH-THLNPNSNPKPKKK----QKQFHHAS- 54

Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072
            +GRQ DES A    A  D  S   R   S  N    N GGY+++N+++Y++ EL  L+ R
Sbjct: 55   NGRQNDESPAVTQTASDDAYSFNQRPIESSTNVDGLNLGGYMTYNVASYNKTELHELRSR 114

Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892
            L+AELEQ+R+L  RIES                   G+  TS+ R    S + SG K +T
Sbjct: 115  LVAELEQIRSLKDRIES-------------------GQLSTSNPRSHGKSKKLSGNKRAT 155

Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-----LSGMMKKCGQLLTKLMKHK 1727
                       SK P +L +G D +   + P    +     +  MMK+C Q+L KLMKHK
Sbjct: 156  PS-------GSSKDPKKLPNGVDNRNFGN-PGGVGVKGIIGMENMMKECRQVLGKLMKHK 207

Query: 1726 HGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALR 1547
             GWIFN PVD   +GLHDY+QII+ PMDLGTVKS L N  Y +P +FA+DVRLTFNNAL 
Sbjct: 208  SGWIFNTPVDAEALGLHDYHQIIKRPMDLGTVKSNLSNCFYPTPFEFAADVRLTFNNALL 267

Query: 1546 YNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRS 1367
            YNPK   VH  AEQLLARFE+MF P     + + + + GG++   +R     +E++    
Sbjct: 268  YNPKTDQVHGFAEQLLARFEDMFRP----IQDKLNKLDGGSD---RRDFHPTDELQ--GI 318

Query: 1366 SWNH-----------QSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXX 1220
            SWNH            +  P  +K+ ER                      + ++S     
Sbjct: 319  SWNHIPTPERVKKPKPTPAPHISKKQERMMQNHSSALTLPVQQPPDNTPVVRQQSLLSTP 378

Query: 1219 XXXXXXXXXAKQLGVRSGT---GKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMD 1049
                       Q  V +     GKQ PKP+AKDPNKREM+ EEK KL + LQ+LPQEKM 
Sbjct: 379  SPVRAPPAPKPQSSVAAKVPPMGKQ-PKPRAKDPNKREMSMEEKHKLGVGLQSLPQEKMP 437

Query: 1048 QVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA--AGQ 875
            Q++QII KR+  LAQ GDEIE+DIE +D ETLWELDRFV  +KKM+SK KRQA+    GQ
Sbjct: 438  QLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALINNLGQ 497

Query: 874  NSTATEENKQCTTVD----EVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPV 707
              +A+      TT      + P                            + PA++FPPV
Sbjct: 498  PPSASAAASAATTTSVAEADAPSTSEKNDSFKKPKKGGDAGDEDDVEIEDDEPATHFPPV 557

Query: 706  EIEKDAAG 683
            EI+KD  G
Sbjct: 558  EIDKDEGG 565


>ref|NP_001275266.1| bromodomain-containing RNA-binding protein 1 [Solanum tuberosum]
            gi|57282314|emb|CAD43283.1| bromodomain-containing
            RNA-binding protein 1 [Solanum tuberosum]
          Length = 602

 Score =  358 bits (920), Expect = 5e-96
 Identities = 242/609 (39%), Positives = 315/609 (51%), Gaps = 30/609 (4%)
 Frame = -3

Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            MASA+LASRNE  W    G    +M K P S+ T    N NPNPK+K    +KQ +  S 
Sbjct: 1    MASAVLASRNESSWAQSGGAGGGFMGKTPFSH-TQLNPNHNPNPKKK----QKQFHHTS- 54

Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072
            +GR +DES A    A  D  S   R   S  N    N GGY++FN+ +Y++ E+  L+ R
Sbjct: 55   NGRHIDESPAVTQTASDDAYSFNQRPIESTTNVDGLNFGGYLTFNVVSYNKGEVNELRSR 114

Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892
            L+AE+EQ+RNL  RIES +                      S+T P +        +G +
Sbjct: 115  LLAEVEQIRNLKDRIESGQL---------------------STTNPRS--------QGKS 145

Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKL-------------LSGMMKKCGQL 1751
            K      K+SG+KRP    S +D K+L +   N                   MMK+C Q+
Sbjct: 146  K------KLSGNKRPTPSGSSKDPKKLPNGVENRNFGNPVGGGGVKAIGTESMMKECRQI 199

Query: 1750 LTKLMKHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVR 1571
            L KLMKHK+GWIFN+PVD   +GLHDY+QII+ P+DLGTVKS L  N Y SP +FA+DVR
Sbjct: 200  LAKLMKHKNGWIFNIPVDAEALGLHDYHQIIKRPIDLGTVKSNLAKNFYPSPFEFAADVR 259

Query: 1570 LTFNNALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMA 1391
            LTFNNAL YNPK   V+  AEQLL RFE+MF P     + + + + GG     +R     
Sbjct: 260  LTFNNALLYNPKTDQVNGFAEQLLGRFEDMFRP----LQDKMNKLEGG-----RRDYHPV 310

Query: 1390 EEMEPRRSSWNHQSRTPDSAKEHERFPV----------XXXXXXXXXXXXXXXXXXXLTE 1241
            +E++   SSWNH   TP+  K+ +  PV                               +
Sbjct: 311  DELQ--GSSWNH-IPTPERVKKPKATPVPHISKKQERMQNHSSASTPSLPVPPPNPPARQ 367

Query: 1240 RSXXXXXXXXXXXXXXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQ 1061
            +S               +        GKQ PKP+AKDPNKR M  EEK KL + LQ+LPQ
Sbjct: 368  QSPLSTPSPVRAPPSKPESAAKVPAMGKQ-PKPRAKDPNKRVMNMEEKHKLGVGLQSLPQ 426

Query: 1060 EKMDQVLQIISKRDSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIA- 884
            EKM Q++QII KR+  LAQ GDEIE+DIE +D ETLWELDRFV  +KKM+SK KRQA+  
Sbjct: 427  EKMPQLVQIIRKRNEHLAQDGDEIELDIEALDTETLWELDRFVTNWKKMVSKTKRQALMI 486

Query: 883  --AGQNSTATEENKQCTTVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPP 710
               G  S +   +   T+V E      +                       + PA++FPP
Sbjct: 487  NNLGPPSASAAASAATTSVAEADGPTTSEKNDSFKKPKKGDVGEEDVEIEDDEPATHFPP 546

Query: 709  VEIEKDAAG 683
            VEIEKD  G
Sbjct: 547  VEIEKDEGG 555


>ref|XP_007148328.1| hypothetical protein PHAVU_006G199500g [Phaseolus vulgaris]
            gi|561021551|gb|ESW20322.1| hypothetical protein
            PHAVU_006G199500g [Phaseolus vulgaris]
          Length = 531

 Score =  358 bits (919), Expect = 7e-96
 Identities = 242/583 (41%), Positives = 306/583 (52%), Gaps = 7/583 (1%)
 Frame = -3

Query: 2419 MASALLASRNEPHW----GEPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYDRSG 2252
            MASA+LA+RNEP+W    G    +M K P SNP     NPN NPK   +S R Q      
Sbjct: 1    MASAVLANRNEPNWPQHRGGGAGFMGKVPFSNP-----NPNSNPK-LANSKRTQ------ 48

Query: 2251 HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKGLKKR 2072
                          + SDD+SS+NR+    +N    +H  YV F+IS+ ++KEL  +K R
Sbjct: 49   --------------SASDDASSINRR----SNDAGVSHSQYVCFSISSCTKKELNDIKNR 90

Query: 2071 LIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGGKGST 1892
            L++ELEQVR   +RIES + Q        Q S GH                         
Sbjct: 91   LVSELEQVRKCRNRIESGKLQPG------QSSNGH------------------------- 119

Query: 1891 KQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKHGWIF 1712
             +K  + KVSG+KRP  L+S ++ KR  S+  N       MK C Q+L KLMKHKHGWIF
Sbjct: 120  MKKPSSKKVSGNKRPLPLNSVKEMKRSHSEVGNT------MKSCSQILQKLMKHKHGWIF 173

Query: 1711 NVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRYNPKG 1532
            NVPVDVVGMGLHDY  II+ PMDLGTVKS L  ++Y++P DFA+DVRLTF NAL YNPKG
Sbjct: 174  NVPVDVVGMGLHDYYDIIKQPMDLGTVKSNLSKSVYSTPSDFAADVRLTFKNALTYNPKG 233

Query: 1531 HDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSSWNHQ 1352
            HDV+  AEQLL RFEE++ P   K +            +RQ       + E + SSW+H 
Sbjct: 234  HDVYTMAEQLLMRFEELYRPMRDKSD----------GWIRQ---DQDYDEELQASSWSH- 279

Query: 1351 SRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQLGVR 1172
               P+  K+ E                       L +                 KQ    
Sbjct: 280  VEPPERVKKKENPIPPAKLQQEPPQPPASSSNPPLLQSPVRTPSPMRAPPVKPLKQ---- 335

Query: 1171 SGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRDSKLAQHGDE 992
                   PKPKAKDPNKREM+ EEK KL L LQ+LP EKM+QV+QII +R+  L Q GDE
Sbjct: 336  -------PKPKAKDPNKREMSLEEKHKLGLGLQSLPAEKMEQVVQIIRRRNGHLKQDGDE 388

Query: 991  IEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNST---ATEENKQCTTVDEVP 821
            IE+DIE +D ETLWELDR V  YKKM+SK+KRQA+    N+    ++  N +    +++ 
Sbjct: 389  IELDIEAVDTETLWELDRLVTNYKKMVSKIKRQALMGNMNNNNEQSSRGNGELAASEKID 448

Query: 820  EAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKD 692
             +   M                      EMP S FPPVEIEKD
Sbjct: 449  GSAVEM-----KKSKEVEAGEEDIDIGDEMPMSMFPPVEIEKD 486


>gb|EXC33022.1| Transcription factor GTE4 [Morus notabilis]
          Length = 559

 Score =  358 bits (918), Expect = 9e-96
 Identities = 245/592 (41%), Positives = 309/592 (52%), Gaps = 13/592 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWG-EPKV------YMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIYD 2261
            MASA+LA+RN+  W  +P+       +M K P +NP     NP  + K+   S   Q + 
Sbjct: 1    MASAVLANRNDTDWPPQPRGGASGAGFMGKVPFANP-----NPKNSSKK---SQFHQFHA 52

Query: 2260 RSG--HGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELK 2087
             SG  +G Q+DES A    A SDD+SS+N +  S  N     H  YVSFNI +YSRKEL 
Sbjct: 53   PSGDFNGCQIDESPAVTQTA-SDDASSINHRRSSEFNL---GHSQYVSFNIGSYSRKELS 108

Query: 2086 GLKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASG 1907
             LK RL++EL+++R L SRIE+ +                         +P    P    
Sbjct: 109  ELKNRLLSELDRIRQLQSRIEASDLDHHH-------------------LQPKKPIP---- 145

Query: 1906 GKGSTKQKHMTMKVSGSKRPNQLDS--GRDTKRLA-SDPANAKLLSGMMKKCGQLLTKLM 1736
                      + K+SGSKRP   +S  G+D+  L  S P NA L    MK C QL+TKLM
Sbjct: 146  ----------SKKLSGSKRPFPTNSNHGKDSSHLKRSHPDNANL----MKNCSQLMTKLM 191

Query: 1735 KHKHGWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNN 1556
            K KH WIFN PVDV+GMGLHDY  II+ PMDLGTVK  L  NLY+SP DFA+DVRLTF N
Sbjct: 192  KQKHAWIFNKPVDVIGMGLHDYFDIIKRPMDLGTVKLNLSKNLYSSPSDFAADVRLTFQN 251

Query: 1555 ALRYNPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEP 1376
            A+ YNPKGHDVH  AEQLL +FEE+F P   K   ER  +F               + + 
Sbjct: 252  AVTYNPKGHDVHAIAEQLLVKFEELFRPVSEKLGDER--LF---------------DDDL 294

Query: 1375 RRSSWNH-QSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXX 1199
            + SSW+H +   P   +E +  P                    + +              
Sbjct: 295  QASSWDHVEPERPKKREEKKPEPPVRAPPVPASSSNPIPNSPPVVQLQSPVRTPSPPMRA 354

Query: 1198 XXAKQLGVRSGTGKQPPKPKAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKRD 1019
               K L        + PKPKAKDPNKREM+ EEKQKL + LQ+LPQEKMDQV+QII KR+
Sbjct: 355  PPVKPL--------KQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDQVVQIIRKRN 406

Query: 1018 SKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQCT 839
              L Q GDEIE+DIE +D ETLWELDR V  +KKM+SK+KRQA+    N+  +       
Sbjct: 407  GHLKQDGDEIELDIEAVDIETLWELDRLVTNWKKMVSKIKRQALMNNANNNNSNVAPNKG 466

Query: 838  TVDEVPEAEATMXXXXXXXXXXXXXXXXXXXXXXEMPASNFPPVEIEKDAAG 683
              +     +                         EMP +NFPPVEIEKD  G
Sbjct: 467  NAELAGSEKIDSVVAEPKRVKKGEGGDEDVDIGDEMPLNNFPPVEIEKDIGG 518


>ref|XP_007042773.1| Global transcription factor group, putative isoform 4 [Theobroma
            cacao] gi|508706708|gb|EOX98604.1| Global transcription
            factor group, putative isoform 4 [Theobroma cacao]
          Length = 483

 Score =  357 bits (917), Expect = 1e-95
 Identities = 234/541 (43%), Positives = 295/541 (54%), Gaps = 14/541 (2%)
 Frame = -3

Query: 2419 MASALLASRNEPHWG-EPKVYMRKNPNSNPTNWRSNPNPNPKEKCSSPRKQIY------- 2264
            MASA+LA+R+E +W  +PK  + K     P    + PNPNPK    + ++Q++       
Sbjct: 1    MASAVLANRSESNWPPQPKSSVAKFMGKVPFT-ATKPNPNPK---FNKKRQLHQHLPPPD 56

Query: 2263 DRSGHGRQMDESAAALVLAKSDDSSSLNRKFVSLNNRKESNHGGYVSFNISAYSRKELKG 2084
            D +GH   +D+S A    A SDD+SS+NRK        + + G YVSF+IS+YSRKEL  
Sbjct: 57   DVAGH--VVDDSPAVTQSAASDDASSINRKL------NDFSSGAYVSFHISSYSRKELID 108

Query: 2083 LKKRLIAELEQVRNLSSRIESREFQSRSGYSATQFSGGHGGREVTSSTRPPTHSPEASGG 1904
            LK RL+AELEQ+R L +RIES +F  RS                                
Sbjct: 109  LKNRLVAELEQIRELKNRIESNDFHVRS-------------------------------- 136

Query: 1903 KGSTKQKHMTMKVSGSKRPNQLDSGRDTKRLASDPANAKLLSGMMKKCGQLLTKLMKHKH 1724
              STK+      +SG+KRP   +  ++ KRL          + +MK C Q+L KLMK K+
Sbjct: 137  -SSTKKPISKKNISGNKRPLPPNFSKELKRLNPQENGKASTTHLMKNCSQILNKLMKQKY 195

Query: 1723 GWIFNVPVDVVGMGLHDYNQIIRLPMDLGTVKSKLGNNLYTSPHDFASDVRLTFNNALRY 1544
            G+IFN PVDVVGMGLHDY  II+ PMDLGTVKS++  N Y SP DFA+DVRLTFNNA+ Y
Sbjct: 196  GYIFNSPVDVVGMGLHDYYDIIKNPMDLGTVKSRMAKNFYGSPLDFAADVRLTFNNAMLY 255

Query: 1543 NPKGHDVHVAAEQLLARFEEMFEPEHRKYESERHSMFGGANELRQRSAAMAEEMEPRRSS 1364
            NPKGH+V++ AEQLLARFEE F P   K E +         E  Q      EE++   SS
Sbjct: 256  NPKGHEVYMLAEQLLARFEEFFRPLSLKLEEQ---------EEPQEKGYYEEELQ--ASS 304

Query: 1363 WNHQSRTPDSAKEHERFPVXXXXXXXXXXXXXXXXXXXLTERSXXXXXXXXXXXXXXAKQ 1184
            W+H        KE ER                                            
Sbjct: 305  WDH-GEADRMKKERERNGERNIDRDDSVNIVARSDKIGGVS-GFVSNPNVPPPQLQMQAP 362

Query: 1183 LGVRSGTGKQPPKP------KAKDPNKREMTFEEKQKLSLNLQNLPQEKMDQVLQIISKR 1022
              V S     P KP      KAKDPNKREM+ EEKQKL + LQ+LPQEKMD V+QII KR
Sbjct: 363  ARVASPVRAPPVKPLKQPKPKAKDPNKREMSMEEKQKLGIGLQSLPQEKMDNVVQIIRKR 422

Query: 1021 DSKLAQHGDEIEVDIEYIDKETLWELDRFVGYYKKMISKMKRQAIAAGQNSTATEENKQC 842
            +  L Q GDEIE+DIE +D ETLWELDRFV  YKKM+SK+KRQA+ A  N  + + N+  
Sbjct: 423  NGHLRQDGDEIELDIEAMDTETLWELDRFVTNYKKMVSKIKRQALMA-NNVVSNDSNRVS 481

Query: 841  T 839
            T
Sbjct: 482  T 482

