Previous Article | Next Article ![]()
Molecular and Cellular Biology, November 2004, p. 9517-9526, Vol. 24, No. 21
0270-7306/04/$08.00+0 DOI: 10.1128/MCB.24.21.9517-9526.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Division of Biomedical Sciences, University of Edinburgh, Edinburgh, United Kingdom
Received 30 July 2004/ Accepted 6 August 2004
|
|
|---|
|
|
|---|
Most bHLH proteins bind to a DNA sequence containing a 6-bp E box (CANNTG). Proneural proteins are class A bHLH factors (32), all of which share, at least in vitro, a preference for a DNA sequence containing the E-box variant CAGSTG (where S is C or G). Among nonneural bHLH proteins, there is evidence for differences in DNA binding site specificities between proteins of different families, corresponding to preferences for certain bases central to and flanking the CANNTG (7, 15-18, 22, 26, 27, 42). The Drosophila proteins bind DNA as heterodimers with the ubiquitously expressed bHLH protein Daughterless (Da). Sc/Da binding sites have been located in a number of proneural target gene enhancers (9, 29, 43, 45), and from these studies a proneural consensus binding site that includes preferred bases flanking the 6-bp E box core (GCAGSTGK [where K is G or T and invariant nucleotides are underlined]) has been deduced. However, studies concerning Sc target genes have not examined the issue of specificity between proneural proteins. Regulation of target genes by Ato/Da is rarely addressed or is implicitly assumed to follow the same rules as for Sc/Da. For example, some well-characterized Sc target genes are expressed widely in early neurogenesis and should be Ato targets too (i.e., common proneural target genes), but it is not known how Ato regulates them. Additionally, possible Ato-specific target genes have not been well investigated. The only Ato target for which there is information is the tachykinin receptor-related gene, TAKR86C (formerly NKD), which has an essential Ato/Da binding site in its enhancer (40). However, that report did not address the issue of selective regulation by Ato versus Sc.
Although the area is little explored, there is little indication of DNA binding differences between proneural proteins from in vitro studies (25). Consistent with this, predicted DNA-contacting residues are completely conserved between the bHLH domains of Sc and Ato (12). This suggested that Sc and Ato may not have distinct binding sequences. On this and other evidence, it has been suggested convincingly that the major determinant of proneural specificity is differential interaction with "specificity cofactors" (5, 12, 24).
Thus, it is important that proneural target genes be analyzed to understand how they are selectively regulated by Sc or Ato. Here, we show that a common proneural target gene, Bearded (Brd), is regulated by Sc/Da and Ato/Da via distinct E-box sites. We have established a preliminary Ato consensus binding sequence based on essential E boxes identified in three Ato-regulated target genes. This consensus (EAto) is different from that established for Sc (ESc), particularly in the flanking bases. Thus, despite their similar DNA binding properties in vitro, Ato and Sc utilize different binding sites in vivo even for common target gene regulation. This conclusion is substantiated by observations that these proneural binding sites are alone sufficient to confer highly specific patterns of expression on a green fluorescent protein (GFP) reporter gene. We demonstrate that differences in DNA binding sequence underlie proneural protein functional specificity in vivo.
|
|
|---|
Reporter plasmid construction. For the Brd enhancer constructs, primers were designed to amplify a 1.3-kb fragment (43) (5'-GTGCTACAAACGCGTGAATTCTC-3' and 5'-CATGGTACCTCTGCAGGTAGTTCGGG-3') (restriction sites are underlined). The resultant fragment was cloned in pBluescript following digestion with EcoRI and Asp718I. Three further PCRs subdivided the 1.3-kb enhancer: Brd-A (531 bp) (5'-AGAGAATTCGGACAGTGAAACCTGCCA-3' and 5'-CATGGTACCTGTCGGCAAACGAGAAAT-3'), Brd-B (493 bp) (5'-GTCGAATTCGGATTGGAATACGAATG-3' and 5'-GTTGGTACCTGTCGGCAAACGAGAAAT-3'), and Brd-C (371 bp) (5'-CCGGAATTCGTCATATTACAACACTCG-3' and 5'-TTAGGTACCGTTCGGGCTCTCCAAGA-3'). These were cloned in pCRIITOPO (Invitrogen). All four fragments were transferred to pHStinger (2) to give Brd-1.3-GFP, Brd-A-GFP, Brd-B-GFP, and Brd-C-GFP for germ line transformation. For sc-SMC reporter constructs, the SMC enhancer (13) was amplified and cloned into pHStinger.
Site-directed mutagenesis. E boxes E2 and E3 in pCRIITOPO-Brd-A were mutated by using the Stratagene QuikChange mutagenesis kit. The Brd-E3 E-box sequence was changed from CATGTG to AATGTT, and the Brd-E2 sequence was changed from CACGTG to AACGTT. The appropriate fragments were then cloned in pHStinger for germ line transformation. Mutation of the sequences flanking ato-E1 and sc-E1 and E2 was carried out in a similar way with either ato-FCO-E-GFP or sc-SMC-E-GFP constructs as templates.
Drosophila germ line transformation.
Transformation plasmids were injected into w;
2-. flies. Transformants were selected and then outcrossed to w1118 to remove the
2-. element. At least two independent insertions were analyzed for each construct.
Protein purification. pRSET-Ato, pRSET-Da, and pRSET-Sc plasmids were used to transform BL21-pLysS cells. Cultures were grown and induced with IPTG (isopropyl-ß-D-thiogalactopyranoside) by standard techniques. Cell pellets were stored at 80°C, and Ni2+-nitrilotriacetic acid columns were used to purify the His-tagged proteins in 8 M urea before refolding by stepwise dialysis.
Gel retardation probes.
The following synthetic complementary 36-bp oligonucleotides containing a central 6-bp E box were used in gel retardation experiments: BrdE1GR, 5'-TGAGAGACCGAGAAACACCTGCGCGCTAGGACTCGT 3'; BrdE3GR, 5' CTCGTTTGCCGACAACATGTGTTTAACGAGGGTCTG 3'; AtoE1GR, 5' TGGTAGTAACCATAACAGGTGGCACGGCAGCCGCAC 3'; TAKR86CE2GR, 5' ATGTAGTTGGGGTATCAGGTGTGCTGAACAAGGGGT 3'; and ScE1GR, 5' CATGGCGACGCGTGGCAGGTGTATTTAGTCGAACGA 3'. E boxes are shown in boldface. In AtoE1GR, a 3' C substitution (underlined) disrupts a Pointed binding site adjacent to the E box (47). Brd-E1,3 mutant probes have the CANNTG changed to AANNTT. For both the titration and competition experiments, the top-strand oligonucleotide was labeled prior to hybridization to the complementary oligonucleotide, which was included at a slight molar excess. The labeling reaction used [
-33P]ATP [Amersham or Perkin-Elmer Life Sciences (UK) Ltd.] and T4 polynucleotide kinase (New England Biolabs). Unincorporated ATP was removed by using ProbeQuant G-50 microcolumns (Amersham Biosciences).
Gel retardation.
-33P-labeled Brd-E1 and Brd-E3 36-bp duplexes were prepared as described above and used at 0.1 nM in 20-µl binding reaction mixtures in binding buffer (10 mM Tris HCl, 1 mM dithiothreitol, and 1 mM EDTA, with 40 mM [low salt] or 100 mM [high salt] NaCl). Ato/Da or Sc/Da heterodimers were prepared by incubating the proteins at 200 nM for 20 min on ice in binding buffer, and then 0 to 100 nM Ato/Da or Sc/Da was incubated with Brd-E1 or Brd-E3 probes for 20 min on ice. An 18-µl portion of each binding reaction mixture was then electrophoresed on a 6% polyacrylamide gel with 0.5x Tris-borate-EDTA at 40 mA for 1 h at room temperature. The gels were dried before phosphorimager analysis (Molecular Dynamics). The percent bound DNA was determined in ImageQuant by quantifying the depletion of free DNA, as this gives a better estimate of complex concentration (39). The percent DNA bound versus protein concentration was plotted, and apparent Kds were determined by fitting the data to a one-site binding equation by using Prism 4 (GraphPad). For competition experiments, 5 to 100 nM unlabeled DNA was mixed with 0.1 nM
-33P-labeled Brd-E1 before addition of Ato/Da or Sc/Da heterodimer to 200 nM.
Multimer constructs. Synthetic oligonucleotides were designed to give, when hybridized, two repeats of 20-bp sequences containing the 6-bp E box. These were flanked by BglII and BamHI sites. For ato-E1, the nucleotides were as follows: GATCT ACCATAACAGGTGGCACGGC ACCATAACAGGTGGCACGGC GA TGGTATTGTCCACCGTGCCG TGGTATTGTCCACCGTGCCG CCTAG (E boxes are underlined).
In this case, a mutation was introduced (boldface) to ensure the disruption of an adjacent Pointed protein binding site (zur Lage et al., submitted). Similar oligonucleotides, with different 20-bp repeats, were designed for TAKR86C-E2 (GGGGGTATCAGGTGTGCTGAA) and sc-E1 (CGCGTGGCAGGTGTATTTAG). Complementary oligonucleotides were hybridized at 50 pmol/µl and phosphorylated with T4 polynucleotide kinase (New England Biolabs) before ligation (Roche). The ligated DNA was digested with BglII and BamHI and separated on a 3.5% MetaPhor agarose gel (Cambrex). Multimerized bands were excised and cloned in pBluescript (Stratagene). Clones were sequenced before transfer to pHStinger.
Immunohistochemistry. For immunohistochemical staining, imaginal disks were dissected from wandering third-instar larvae and fixed in 3.7% formaldehyde (10 min at room temperature). Incubations with primary and secondary antibodies were carried out according to standard procedures. Primary antibodies were affinity-purified rabbit anti-Ato (1:2,000), guinea-pig anti-Sens (1:5,000; provided by H. Bellen) (37), mouse anti-ß-galactosidase (1:200), mouse and rabbit anti-GFP (1:500), and mouse anti-Ac (1:50; provided by the Developmental Biology Hybridoma Bank, Iowa City, Iowa). Secondary antibodies (1:1,000) were obtained from Molecular Probes. Confocal microscopy analysis was on a Leica TCS SP microscope.
RNA in situ hybridization. Primers were designed to amplify the GFP open reading frame. The top-strand primer was 5'-CCATGGTGAGCAAGGGCG-3'. The bottom-strand primer was 5'-GTAATACGACTCACTATAGGGCCTTGTACAGCTCGTCCATG-3', with a T7 RNA polymerase promoter incorporated (boldface). After amplification from pHStinger DNA, the PCR product was then in vitro transcribed (Roche). The antisense RNA probe was hybridized to third-instar larval imaginal disks and detected by a tyramide labeling reaction.
|
|
|---|
![]() View larger version (55K): [in a new window] |
FIG. 1. Separable Sc and Ato-dependent regulation of Brd. (A) Schematic diagram of Brd reporter gene constructs, indicating whether they drive Ato and/or Sc-specific expression patterns. The Brd-E1, -E2, and -E3 E boxes mentioned in the text are marked. (B to I) Brd reporter expression in Ato-dependent cells. Green: ß-galactosidase in B,C and GFP in D-I. Red: Ato. (B) Brd-1.5E1 M-lacZ prothoracic leg disk, showing reporter expression in regions of the FCO tibial chordotonal organ (TCO), and prothoracic chordotonal organ (PCO). Ato is expressed in all of these areas (B'), although it does not always overlap because of dynamic Ato expression and perdurable GFP expression. The Brd fragments also support some nonproneural ectodermal expression (asterisk). (C) Brd-0.18-lacZ leg disk. Reporter expression (in Sc-dependent cells, bracket) does not coincide with Ato expression during FCO SOP formation. (D and E) Brd-1.3-GFP. (D) Eye-antennal disk, showing expression in Johnston's organ precursors (JO) and posterior to the morphogenetic furrow (MF). (E) Leg disk, showing expression in FCO as well as nonproneural ectodermal expression (asterisk). (F) Brd-A-GFP-E2 M leg disk, showing no effect of mutating E2 on Ato-dependent expression (FCO and TCO [this is a mesothoracic disk and does not have PCO]). (G) Brd-A-GFP-E3 M leg disk. Mutating E3 causes loss of FCO, TCO, and PCO expression. (H and I) Effect of Ato misexpression on Brd-1.3-GFP in wing disk. (H) In the wild type, GFP is expressed only in nonproneural ectodermal cells (asterisk). (I) When Ato is misexpressed (109-68Gal4/UAS-ato), ectopic GFP expression occurs in proneural clusters in the thorax (arrows) and third wing vein (box). (J) E-box sequences studied in this report, compared with the consensus sequences (core nucleotides are underlined).
|
Distinct Ato/Da and Sc/Da consensus binding sites. Hence, direct regulation of Brd by Ato in the leg and antennal disks may be mediated by Ato/Da binding to the Brd-E3 site. This site is conserved in Drosophila pseudoobscura. The sequence of the Brd-E3 site differs from the established Sc/Da binding consensus sequence in a core base and also in the 5' base flanking the E-box core (ACATGTGT versus GCAGSTGK) (Fig. 1J). Interestingly, the 5' flanking base also differs in the two other confirmed Ato/Da binding sites. An ato autoregulatory enhancer (known as the ato recruitment enhancer, ato-RE [zur Lage et al., submitted]) contains an Ato/Da binding site (ato-E1) in which the 5' base is also A, although the core sequence conforms to the Sc/Da consensus (ACAGGTGG) (Fig. 1J). Similarly, Ato/Da regulates the TAKR86C gene via an upstream E box (TAKR86C-E2) (40), in which, however, the 5' G is replaced by a T (TCAGGTGT). These deviations from the previously deduced proneural binding consensus sequence suggest that Ato/Da regulates at least some specific downstream targets by interacting with a distinct consensus sequence. Most notably, Ato/Da binding sites differ from Sc/Da binding sites in the 5' flanking base, where an invariant G is replaced by an A or T. Comparison of all three identified Ato/Da binding sites yields a provisional Ato/Da binding consensus sequence of AWCAKGTGK (where W is A or T and K is G or T), compared with the Sc/Da binding consensus of GCAGSTGK (Fig. 1J). We refer to these sequences as variant E boxes, with matches to the former referred to as EAto sites and matches to the former to the latter referred to as ESc sites.
No selective recognition of proneural E boxes by Sc/Da and Ato/Da in vitro. Differences in E-box binding sites between major structural and functional families of bHLH protein (e.g., Twist and Myc) have been characterized. In these cases, differential affinity to E-box variants can be observed in binding assays in vitro (8, 44), pointing to gross structural differences in DNA-protein recognition across families. No detailed comparative analysis of in vitro binding site affinities has been carried out for proteins within the proneural family. To compare the affinities of Ato/Da and Sc/Da to ESc and EAto sites (Brd-E1 and Brd-E3), gel retardation experiments were performed under conditions of limiting DNA target probe to distinguish small differences in affinity (11). In these experiments both protein dimers reproducibly bound to each site with indistinguishable affinities (apparent Kd of ca. 10 nM for each) (Fig. 2A to D). High-salt conditions to minimize nonspecific binding (31) produced results identical to those under low-salt conditions.
![]() View larger version (57K): [in a new window] |
FIG. 2. In vitro DNA binding analysis by gel retardation. (A to D) Relative DNA binding affinities for Brd-E1 and Brd-E3 were assessed by titration. (A) Representative titration gel for Brd-E3 with Ato/Da. (B) Representative titration gel for Brd-E3 with Sc/Da. (C) Binding curves of 0.1 nM Brd-E1 DNA probe (open circles) and 0.1 nM Brd-E3 probe (closed circles) with 0 to 200 nM Ato/Da (in 20 nM increments) in high-salt buffer. (D) Similar plot for Sc/Da. (E to H) Competition gel retardation assays with Ato/Da binding to radiolabeled 0.1 nM Brd-E1 (in high-salt buffer) competed with 0, 5, 10, 15, 20, 30, 40, 50, and 100 nM cold competitor DNA (50 to 1,000-fold competition). In each case, lanes 1 and 2 include no protein, lanes 3 and 4 include 150 nM Ato/Da and no competitor, and lanes 5 to 12 include 150 nM Ato/Da and increasing amounts of competitor DNA. Competitors used: (E) Brd-E3, (F) Brd-E3 M, (G) TAKR86C-E2, and (H) sc-E1.
|
Strong selective discrimination of proneural E boxes in vivo. In the absence of in vitro indications for the importance of E-box sequence differences, we carried out an in vivo functional analysis. To test the contributions that the E-box variants make to enhancer specificity, we asked whether these sites could confer specific patterns of expression (and thus be differentially recognized) out of the context of their enhancers, thereby removing the contribution of other DNA binding factors. Inspired by previous findings (13), we made constructs with the GFP gene driven by artificial enhancers consisting of six to seven tandem repeats of a 20-bp sequence that includes a proneural binding site (6-bp E-box core and 7 bp of 5' and 3' flanking sequence) (Fig. 3A). Such artificial enhancers were constructed for an ESc site (sc-E1) and EAto sites (ato-E1 and TAKR86C-E2) to give [sc-E1]6-GFP, [ato-E1]7-GFP, and [TAKR86C-E2]6-GFP. Alignment of the 20-bp sequences used in these constructs revealed no shared motifs apart from the E boxes and their immediate flanks. Remarkably, these constructs support specific and different GFP patterns in transformed flies (Table 1).
![]() View larger version (51K): [in a new window] |
FIG. 3. [sc-E1]6-GFP and [ato-E1]7-GFP artificial enhancers support Sc- and Ato-dependent expression, respectively, in embryos. (A) Schematic of E-box multimer constructs (ato-E1 in this example). (B and C) Comparison of abdominal segments from stage 11 embryos (GFP, green; Ato, red). (B) [ato-E1]7-GFP, showing clear GFP/Ato overlap. (C) [sc-E1]6-GFP, showing no GFP overlap with Ato. (D and E) High-power view of two embryonic stage 16 abdominal segments (GFP, green; differentiated sensory neurons [22C10], red). (D) [ato-E1]7-GFP, showing GFP perduring in Ato-dependent sensory neurons (labeled). (E) [sc-E1]6-GFP late, showing GFP perduring in Sc-dependent sensory neurons (labeled). "Ectopic" ectodermal expression is also indicated (asterisk). (F) Schematic of abdominal sensory neurons with Sc-, Ato-, and Amos-dependent neurons in blue, green, and red, respectively.
|
|
View this table: [in a new window] |
TABLE 1. Summary of expression patterns of E-box multimer constructs
|
![]() View larger version (48K): [in a new window] |
FIG. 4. [ato-E1]7-GFP and [sc-E1]6-GFP multimers support Ato- and Sc-dependent expression, respectively, in imaginal disks. Disks stained for GFP (green), Ato (red), Ac (red in G), and Sens (blue) as indicated. (A to D) [ato-E1]7-GFP supports weak and patchy expression in Ato-dependent areas. (A) Leg disk, showing some expression in FCO. (B) Eye-antennal disk, showing expression behind the morphogenetic furrow and in some Johnston's organ precursors (arrows). (C) Higher magnification of expression in Johnston's organ precursors. (D) High-magnification view of morphogenetic furrow, showing cytoplasmic GFP mRNA (green) in close association with nuclear Ato (red). (E and F) [sc-E1]6-GFP. (E) Leg disk, showing expression in bristle-producing areas (Sc dependent) but not in the FCO (Ato, red). (F) Eye-antennal disk, showing expression in bristle proneural clusters but generally not behind the morphogenetic furrow or in Johnston's organ. (G to I) Misexpression of Ato but not Sc causes ectopic expression of [ato-E1]7-GFP. The third wing vein is boxed. (G) Wild-type wing disk. GFP is not normally expressed, even in the two locations of wild-type Ato expression (arrows). (H) 109-68-Gal4/UAS-ato. Misexpression of Ato (red) causes GFP misexpression, but this is particularly confined to the third wing vein. (I) 109-68-Gal4/UAS-sc wing disk, showing no GFP misexpression in response to Sc misexpression. (J to L) Misexpression of Sc but not Ato causes ectopic expression of [sc-E1]6-GFP. (J) Wild-type wing disk, showing GFP in Sc-dependent proneural clusters. The third wing vein is boxed. (K) 109-68-Gal4,UAS-ato, showing little extra GFP expression in the third wing vein. (L) 109-68-Gal4,UAS-sc, showing GFP misexpression particularly along the third wing vein.
|
Surprisingly, the EAto site TAKR86E-E2 differed from the ato-E1 site in its ability to drive GFP expression. Strong [TAKR86E-E2]6-GFP expression was observed in the eye disk posterior to the morphogenetic furrow and in Bolwig's organ in the embryo (Fig. 5). No expression was observed in other neural cells, including chordotonal precursors, although nonproneural expression was seen in late embryogenesis in ectodermal stripes. The neural pattern supported by this E box is therefore only a subset of that supported by ato-E1. To some degree ectodermal stripes were also seen for the three other constructs (Fig. 3). This expression is reminiscent of the expression of the Ato superfamily member Delilah in muscle attachment cells and so may represent regulation by that bHLH protein. In summary, there are in vivo differences not only between Sc and Ato binding sites but also between different Ato sites.
![]() View larger version (36K): [in a new window] |
FIG. 5. [TAKR86C-E2]6-GFP multimer supports limited GFP expression (green). (A) Late embryo, showing GFP in epidermal stripes and in Bolwig's organ (arrow). (B) Eye disk, showing expression posterior to the morphogenetic furrow.
|
![]() View larger version (60K): [in a new window] |
FIG. 6. Alteration of E-box flanking nucleotides disrupts enhancer activity. (A) Schematic of the ato-RE and sc-SMC enhancer constructs, with E boxes indicated, the regions of the sequence alterations shown, and an indication of expression pattern supported. (B to I) GFP expression (green) relative to SOPs (Sens, red), in Sc-dependent cells (examples indicated with blue arrows) or Ato-dependent cells (white arrows). (B to E) Leg disks scanned using identical low laser power settings under nonsaturating conditions to show reduction of expression for the mutated enhancer relative to the wild-type enhancer. (E and F) Scanned at low laser power to show loss of expression in mutated version. (B) ato-RE-GFP in FCO precursors of leg disk. (C) ato-RE 5'AA>GG in leg disks, showing strong reduction of FCO expression. (D) sc-SMC in Sc-dependent leg bristle SOPs. (E) 5'NG>AA, showing strong reduction of bristle SOP expression. (E and F) Eye-antennal expression for sc-SMC and sc-SMC 5'NG>AA, respectively. (E) The wild type is expressed in some bristle SOPs. (F) Expression is lost from bristle SOPs in the mutated version but is gained in Ato-dependent Johnston's organ SOPs.
|
|
|
|---|
![]() View larger version (13K): [in a new window] |
FIG. 7. Schematic view of the importance of E-box variants and cofactor interactions for proneural target gene specificity. (A-C) Proneural target gene regulation requires Ato/Da or Sc/Da binding to an appropriate E-box variant (EAto or ESc) in conjunction with appropriate protein-protein interaction with cofactors (hypothetically CFAto or CFSc). (D) As suggested in the text, the need for adjacent cofactor binding sites may be overcome by tandem repeats of the E boxes. Interaction between proneural proteins may stabilize DNA binding and allow the recruitment of cofactors by protein interaction.
|
Significantly, ato can rescue mutations of its mouse orthologue, Math1 (46), and vice versa (3), suggesting that DNA site preferences will be conserved among vertebrate orthologues. A number of functional E boxes have been characterized in vertebrate neural-specific genes, and in two cases the interacting bHLH protein is likely to be an Ato orthologue: an autoregulatory site in the Math1 promoter (TCAGCTGG) (23) and a proposed Xath5 site in the ß3 nAChr promoter (ACAGCTGG) (41). Thus, in these cases the E boxes match EAto in the 5' flanking base.
For correct enhancer function, proneural proteins must interact differentially with other DNA binding factors.
Although E-box consensus differences underlie specificity, enhancer context is usually also crucial for this specificity to be manifest. In swapping EAto and ESc sequences between the sc and ato autoregulatory enhancers, in only one case did we observe a corresponding "swap" in enhancer specificity: Ato could be made to regulate the sc-SMC-E enhancer via an EAto site. Otherwise, alteration of E-box flanking bases resulted in a severe loss of enhancer activity. This suggests that recruiting a different proneural protein cannot alone change the function of an enhancer. Correct proneural target enhancer function requires a combination of the correct E-box sequence and the ability to interact with other factors bound to the enhancer. This is reminiscent of the cooperative interaction between MyoD and MEF2 in myogenesis (34) and of interaction between Sc/Da and Pannier/Chip to activate ac in a specific part of the thorax (38). For the Ato enhancer we have recently shown the requirement for cooperative interaction between Ato/Da and the ETS protein Pointed, bound to a site adjacent to the EAto site (zur Lage et al., submitted). Similarly, neurogenin 2 interacts with LIM factors during the activation of subtype-specific target genes (33). Our finding that EAto and ESc sites encode much specificity in artificial enhancers suggests that tandem E boxes remove the requirement for interaction with factors bound to other DNA sites, perhaps because cooperative binding between proneural proteins themselves is then sufficient (4) and may even allow the recruitment of cofactors by protein interactions alone (Fig. 7D). Interestingly, the converse situation may also occur: for both the ato and sc enhancers, there is a low level of expression remaining after swapping of E boxes. This suggests that the original bHLH protein can be recruited to the "wrong" E-box sequence, inefficiently, by interaction with cofactors. A basis for this can be found with MyoD, where interaction with Sp1 allows MyoD to bind to a nonideal site in the human cardiac
-actin promoter (6).
Another indication of the importance of enhancer context is that parent enhancers support patterns different from those of the isolated E boxes, at least in the case of Ato. [ato-E1]7-GFP is widely expressed in Ato-specific regions in the embryo, whereas the parent ato-FCO-E enhancer is limited to a small subset of chordotonal SOPs (zur Lage et al., submitted). In disks, ato-E1 drives expression relatively poorly in FCO precursors compared with ato-FCO-E. TAKR86C is even more extreme: the TAKR86C enhancer is normally active only in a single embryonic chordotonal precursor (the P cell) (40), but the TAKR86C-E2 site drives Ato-dependent expression in the larval and adult eye and not in the P cell. Clearly, the parent enhancers must have other regulatory inputs that restrict expression.
Proneural selective E-box binding sites. Despite the importance of enhancer context and interaction with other factors, the ESc and EAto sequences support strikingly specific expression patterns when taken out of their enhancers. First, all tandem repeat E-box constructs tested are activated almost exclusively during PNS neurogenesis, despite the presence of some 24 class A factors in Drosophila (32). None are activated during CNS neurogenesis, myogenesis, or mesoderm formation, even though AS-C proteins function during the former two processes. In the case of two sites, sc-E1 and ato-E1, expression is remarkably consistent, with regulation solely by Ato or Sc, respectively, in PNS neurogenesis; the sites alone must contain all of the information necessary for specific recognition. It is remarkable that ato-E1 does not respond in vivo to the Ato-related protein Cato or Amos, even though the latter has a basic region almost identical to that of Ato and might be expected to have the same DNA binding properties (20). The main exception to this specificity is the presence of expression in embryonic ectodermal stripes. These resemble muscle attachment sites, suggesting recognition by the Ato superfamily member Delilah (1, 32). The conclusion is that tandem duplications can overcome the need for DNA binding sites for other factors. Cooperative binding of proneural proteins may negate the need for cofactor interactions, or, as suggested above, cooperative binding may allow the recruitment of cofactors directly (Fig. 7).
There are dramatic differences between the two Ato sites tested. Unlike ato-E1, the TAKR86C-E2 site drives expression in only a subset of Ato locations; it appears to be photoreceptor specific despite containing a good class A core E-box match (CAGGTG). This opens up the possibility that there may be different subtypes of Ato binding sites. The spatially restricted recognition of TAKR86C-E2 also implies that cellular context is important in how different sites are recognized. One may speculate, for instance, that eye-specific DNA binding properties of Ato may be conferred by interaction with PAX6 proteins (36). Interestingly, diversity of E-box expression patterns correlates with variability in the consensus sequences. The ESc consensus sequence (based on some 23 sites) is less variable than the Ato/Da consensus, even though the latter is based on only three sites. We suggest that regulatory fine-tuning by E-box variation is more important for Ato target genes than for Sc target genes.
In summary, the E-box sequences and their flanking bases contain impressively sufficient information for regulation by specific proneural proteins. However, there is further complexity: at least the two Ato sites tested support different patterns and have a different relationship with their parent enhancers. Subtle variations in regulation by proneural proteins may therefore contribute to variations in target gene expression; indeed, there may be no such thing as a typical target site or target gene. This may also be true for common target genes: despite the modular regulation of Brd, we do not rule out the possibility that within the spectrum of proneural E boxes there are some sites that are jointly recognized by Sc and Ato in vivo and that this would be another mechanism for regulating common target genes.
A.P.J. is a Senior Fellow of The Wellcome Trust (grant 042182).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»