Molecular and Cellular Biology, May 2005, p. 3411-3420, Vol. 25, No. 9
0270-7306/05/$08.00+0 doi:10.1128/MCB.25.9.3411-3420.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.

Department of Biochemistry and Molecular Biology and Program in Genetics, Michigan State University, East Lansing, Michigan
Received 29 September 2004/ Returned for modification 8 December 2004/ Accepted 3 February 2005
The early Drosophila melanogaster embryo is a paradigm for developmentally regulated transcriptional control networks. Typical of higher eukaryote systems, the complex cis-regulatory elements of key patterning genes interpret maternal and early embryonic inputs to produce precisely defined transcriptional outputs (62, 70). One of the best-studied complex loci in Drosophila is the even-skipped (eve) gene, which features a series of cis-regulatory elements within a 16-kb region. Five enhancers are responsible for early expression of eve in seven regularly spaced stripes in the blastoderm embryo (29, 65, 72, 73). A key feature of the stripe enhancers is their functional autonomy; the repression of one element does not lead to the general repression of the entire locus (4, 32, 71). This autonomy is based on the properties of short-range transcriptional repressors such as Giant, Krüppel, and Knirps. These proteins block the activity of enhancers when bound within 100 bp of key activator sites or basal promoter elements (4, 31, 32). The magnitude of short-range repression can be modulated by the precise positioning of repressor binding sites and by utilization of C-terminal binding protein (CtBP) cofactor-dependent and -independent activities, providing additional levels of control (34, 38, 75, 79).
In contrast to the fine-tuning offered by short-range repressors, long-range repressors such as the Drosophila Hairy protein block multiple enhancers indiscriminately over distances of several kilobases (7). The molecular mechanisms of short-range and long-range repression are still poorly understood, although the short-range-long-range distinction may result from the recruitment of distinct cofactors. Short-range repressors bind the CtBP corepressor, whereas long-range repressors such as Hairy, and in some contexts Dorsal, interact with the Groucho corepressor (16). Short-range repressors, through CtBP, may mediate localized chromatin modifications, while long-range repressors, via Groucho, may generate extended transcriptionally silent chromatin structures (17, 28, 69, 74).
Traditionally, empirical tests such as analysis of transgenic reporter genes have been used to identify and analyze regulatory elements. Because of the complexity of many regulatory regions, many gene constructs must be tested to provide insights into how an expression pattern is generated. As a result, relatively few higher eukaryotic enhancers have been well characterized, and our understanding of general principles governing cis-regulatory element design remains limited. With the availability of whole genome sequences, bioinformatics methods promise to provide a powerful alternative route to the identification and analysis of cis-regulatory modules on a global scale. Currently used approaches include identification of clusters of putative transcription factor binding sites (11, 53, 60, 61, 66). A high local density of transcription factor binding sites has been used as a convenient signpost for computational identification of known and novel cis elements; however, not all clusters are functional enhancers. The potential for a cluster of binding sites to function as an enhancer also depends on the levels of trans-acting factors, which can often be inferred from gene expression or proteome data. Not only the presence of a binding site but also the sequence context within which it is found is critical. This context can be considered a type of "grammar" of transcriptional code. This grammar is clearly more complex than simply the density of binding sites. Due to cooperative or antagonistic interactions between proteins and synergistic interactions with the transcriptional machinery, the activity of a given binding site can vary. Additional parameters that influence binding site activity include affinities, spacing, and positioning of transcriptional activator and repressor binding sites within cis-regulatory modules (9, 21, 25, 33, 34, 36).
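The site-density approach described above can be illustrated with a minimal sketch. It assumes that putative binding sites have already been mapped to integer coordinates; the function name `dense_windows` and the parameters `window_bp` and `min_sites` are hypothetical and are not taken from any of the cited tools. As noted in the text, a window that passes such a filter is only a candidate, since not all clusters are functional enhancers.

```python
from bisect import bisect_right


def dense_windows(site_positions, window_bp, min_sites):
    """Return (start, count) for each site-anchored window that holds
    at least `min_sites` putative binding sites within `window_bp`."""
    sites = sorted(site_positions)
    hits = []
    for i, start in enumerate(sites):
        # Index one past the last site that still falls inside the window.
        end_idx = bisect_right(sites, start + window_bp)
        count = end_idx - i
        if count >= min_sites:
            hits.append((start, count))
    return hits


# Hypothetical coordinates (bp) of predicted sites along a locus.
example_sites = [100, 120, 150, 900, 2000, 2010]
candidates = dense_windows(example_sites, window_bp=100, min_sites=3)
print(candidates)
```

A filter of this kind captures density only; it ignores the affinity, spacing, and arrangement parameters that the remainder of this study addresses.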
Previous analyses of short-range repressors on native enhancers demonstrated that these proteins can block gene expression when bound within 100 bp of the presumed target, either a basal promoter or activator site. In most cases, however, relevant quantitative values were not determined because short-range repression has been studied mostly in the context of complex, endogenous regulatory elements where the identity, number, relative affinities, order, and spacing of binding sites are often not known. Because of this complexity of cis-regulatory elements, the contributions of individual physical parameters to repression have been difficult to ascertain from previous empirical tests.
Evolutionary conservation of binding sites has also been used to identify enhancers (10, 27, 45, 46, 68). This approach is more likely to work with enhancers that possess rather rigid constraints on factor binding sites due to the high degree of protein-protein cooperativity, as seen with so-called "enhanceosomes" (39, 54, 55, 80). Many other enhancers that possess a more plastic structure ("billboard" or information display enhancers) are less likely to be identified by this approach, however (26, 42, 43, 47-49, 59). Even within more flexibly designed enhancers, the spacing or arrangement of activator and repressor binding sites can still be important (42, 43). In particular, spacing between short-range repressor and activator sites within the cis-regulatory element is critical for dictating repression effectiveness (4, 32, 34, 47, 79). However, there is no general understanding of how alterations in binding sites for short-range repressors or adjacent activators might affect transcription; thus, it is difficult to predict whether sequence changes introduced during evolution would affect enhancer function. Computational searches and phylogenetic comparisons that seek to identify cis-regulatory elements would be greatly facilitated by empirical determination of spatial constraints and other features of transcription factor binding sites within cis-regulatory elements.
The Drosophila segmentation network has provided an important test bed for the development of bioinformatics tools to analyze enhancers. Short-range repressors play a central role in this system; therefore, it is of particular interest to identify and quantify aspects of the cis-regulatory grammar that dictates their action. In this study, we analyze highly defined enhancer elements in which the identity, stoichiometry, and exact arrangement of activator and repressor binding sites are well defined. We show that the notion that short-range repressors block the activity of protein complexes within 100 bp is an oversimplification. By targeted alteration of these defined elements, we define contextual parameters that dictate repression effectiveness, including stoichiometry of activators and repressors, relative affinity, spacing, and position of transcription factor binding sites. We further demonstrate that the cis-regulatory logic appears to be specific to different functional classes of transcriptional regulators, indicating that identification of such class-specific rules will be critical for more detailed bioinformatics analysis.
(ii) Insulated Gal4-Gal4 AD (aa 753 to 881). A 420-bp fragment of DNA with the gypsy insulator with 12 Su(Hw) sites was amplified from Green Pelican green fluorescent protein vector (6) using DA639 (5'-CGG AAT TCC GAA TTG TAA GCG TTA ATG ACT-3') and DA640 (5'-CGG AAT TCC GAT ACA TAC TAG AAT TGA TCG-3').
The fragment was inserted into pTwiggy at the EcoRI site between the twist regulatory elements and the w gene to prevent twist activation of w, allowing us to assay Gal4 activation of w in Fig. 5. The Gal4 AD (aa 753 to 881) KpnI-XbaI fragment was then inserted into this vector.
FIG. 5. The arrangement of short-range repressor binding sites is critical in dictating repression effectiveness. Reporter genes are shown below the corresponding embryos. (A and B) Giant (gt) is able to repress transcription from both the proximal hsp70 lacZ gene and the distal w promoter, which is located 4.5 kb 3', on a gene in which the repressor sites flank the Gal4 binding sites. (C and D) Giant also represses effectively when binding sites are interspersed between the Gal4 sites. (E and F) Giant does not repress when repressor sites are situated 5' of the activator sites, although activators are within 100 bp of the most proximal Giant site. A minimal Gal4 activator was expressed in ventral regions under the control of the twist promoter. In order to distinguish transcription of the white gene 3' of the lacZ reporter from transcription of the white gene present on the Gal4 driver, insulator sequences were inserted between the twist regulatory element and the white gene in the Gal4 driver to prevent direct activation of white, as described in Materials and Methods. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes.
Flies expressing the full-length yeast transcriptional activator Gal4 ubiquitously throughout the embryo under the control of the actin 5C enhancer (act5cGAL4/CyO; stock number 4414) were also obtained from Bloomington. To obtain ubiquitous activation of the lacZ reporter gene in the early embryo, act5cGAL4/CyO females were crossed to males carrying the reporter transgene.
Reporter genes. The stripe 2/2x UAS/eve-lacZ vector (3) containing two Gal4 binding sites and the minimal eve basal promoter driving lacZ expression was modified to include two Giant (14) binding sites (DA127/128 [5'-AAT TCG CAT GCT ATG ACG CAA GAA GAC CCA GAT CTT TTT ATG ACG CAA GAG CAT GCG-3'; the Giant binding sites are underlined]) using EcoRI-BssH2, 5' of the Gal4 sites. Three additional Gal4 binding sites were inserted (DA139/140 [5' TCG GAT TAG AAG CCG CCG TCG CTA GAG GAA GAC TCT CCT CCG ACG TGA ACG CAG GAC ACT CCT GC GCT GCA-3'; the Gal4 binding sites are underlined]) at the PstI site 3' of the existing Gal4 sites. Oligonucleotides with a 50-bp spacer (DA125 [5'-TCG CTA GAC GTG AAT CTC GTA GCT TCC GTA CCA AAT GCG TAT CAG CTG CA-3'] and DA126 [5'-GCT GAT ACG CAT TTG GTA CGG AAG CTA CGA GAT TCA CGT CTA GCG ATG CA-3']) were introduced at the PstI 3' site, yielding H2g5u-50 (Fig. 1C and D) with two Giant binding sites, five tandemly arrayed Gal4 binding sites, a 50-bp spacer, and minimal eve basal promoter driving lacZ expression.
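As a sanity check on the spacer design, the two spacer oligonucleotides listed above (DA125 and DA126) can be tested for complementarity. The sketch below uses the sequences exactly as printed; the helper name `reverse_complement` and the assumption that the last four nucleotides of each oligonucleotide form single-stranded 3' overhangs are ours, inferred from the sequences rather than stated in the text. A 3' TGCA overhang would be compatible with insertion at a PstI site.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    pairs = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(pairs[base] for base in reversed(seq))


# Sequences as printed in the text, written 5' to 3', spaces removed.
DA125 = ("TCG CTA GAC GTG AAT CTC GTA GCT TCC GTA CCA AAT GCG "
         "TAT CAG CTG CA").replace(" ", "")
DA126 = ("GCT GAT ACG CAT TTG GTA CGG AAG CTA CGA GAT TCA CGT "
         "CTA GCG ATG CA").replace(" ", "")

# Assumption: the final 4 nt of each oligonucleotide stay single stranded.
OVERHANG_NT = 4
core_125 = DA125[:-OVERHANG_NT]
core_126 = DA126[:-OVERHANG_NT]

# The two cores should pair perfectly if the oligonucleotides anneal.
duplex_ok = core_125 == reverse_complement(core_126)

print(len(DA125), len(DA126))            # lengths of the two oligonucleotides
print(duplex_ok)                         # True if the cores are complementary
print(DA125[-OVERHANG_NT:], DA126[-OVERHANG_NT:])  # 3' overhang sequences
```

The same check could be applied to other annealed oligonucleotide pairs in this section, provided the overhang length is adjusted to the restriction site used.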
FIG. 1. Context dependence of short-range repression. Giant is dependent on gene context for repression of Gal4 activators. (A) The hsp70 lacZ gene, activated by a cluster of five high-affinity Gal4 binding sites, is not repressed by Giant (gt). (B) lacZ expression is not repressed when the repressor-activator cluster is situated 400 bp further 5' of the basal promoter. (C) Giant effectively represses a cluster of five Gal4 binding sites 5' of the eve basal promoter. Arrows indicate regions of Giant expression in the anterior and posterior domains of the embryo. Striping is thought to be caused by the binding of an unidentified pair-rule regulator. wt, wild type. (D) Repression is abolished in the giant (gtA8) mutant embryo. A minimal Gal4 activator (residues 1 to 93, DNA binding, fused to residues 753 to 881, activation domain) was expressed in ventral regions under the control of the twist promoter. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes. In this figure and later figures, embryos are oriented anterior to the left and dorsal up. The structure of the reporter gene is shown below the corresponding embryos.
FIG. 2. Stoichiometry of activators to repressors influences repression effectiveness. (A and B) Activity mediated by five Gal4 binding sites 5' of the hsp70 basal promoter elements is not repressed by Giant (gt). (C to F) Reducing the number of activator sites from five to three by deletion or replacement with a neutral spacer permits repression by Giant (arrows). Ventrolateral views are shown (B and D). A minimal Gal4 activator was expressed in ventral regions under the control of the twist promoter. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes.
FIG. 4. Weaker activation domains are not more susceptible to repression by Giant. Chimeric Gal4 activators used to drive expression from the reporter gene are indicated to the left of the embryos. The structure of the reporter gene is shown below. Neither the more potent Gal4 and Gal4-VP16 activators (A and B) nor the weaker Gal4-Sp1 and Gal4-hTBP activators (C and D) were repressed by Giant (gt) on a lacZ reporter containing five high-affinity Gal4 binding sites. Activators were expressed in ventral regions under the control of the twist promoter. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes.
FIG. 7. Short-range repressors exhibit similar functional limits, in contrast to a long-range repressor. The reporter genes contain two binding sites for Giant, Knirps, Krüppel, or Hairy and three or five high-affinity Gal4 binding sites. (A, C, E, and G) All repressors (rep) were able to repress the minimal Gal4 activator on genes containing three Gal4 binding sites. Repression in A, C, and E is indicated by arrows and corresponds to the pattern of expression of the repressor proteins. Hairy is expressed in seven stripes at this stage. (B, D, F, and H) None of the short-range repressors repressed a reporter containing five Gal4 binding sites; however, Hairy induces a pronounced striped pattern, indicating effective repression. To drive expression, a minimal Gal4 activator was expressed in ventral regions under the control of the twist promoter. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes.
M2g5u-lacZ was modified to replace the five Gal4 sites with HindIII-SphI oligonucleotides containing three high-affinity Gal4 (12) sites (DA469/470 [5'-AGC TTG CCT GCA GGT CGG AGT ACT GTC CTC CGA GCG GAG TAC TGT CCT CCG AGC GGA GTA CTG TCC TCC GAG GCA TG-3'; the Gal4 sites are underlined]) to give M2g3u-lacZ (Fig. 2C and D). This was further modified by introducing SphI spacer oligonucleotides (DA471 [5'-TCA TAC AAC TGG TCA GTG AGC ATA CAA CTG GTC AGT GAG CAT G-3'] and DA472 [5'-CTC ACT GAC CAG TTG TAT GCT CAC TGA CCA GTT GTA TGA CAT G-3']) equal to the length of two Gal4 sites, resulting in M2g3u2x-lacZ (Fig. 2E and F and see Fig. 6A, C, and E). The two Giant binding sites in M2g3u2x-lacZ were replaced by two Knirps sites (DA319/320), two Krüppel sites (DA694/695), or two Hairy sites (DA604/605) 20 nucleotides 5' of the three Gal4 binding sites. The resulting vectors, named M2k3u2x-lacZ, M2kr3u2x-lacZ, and M2h3u2x-lacZ (see Fig. 7C, E, and G), consist of two Knirps, two Krüppel, or two Hairy binding sites, respectively, followed by three tandemly arrayed Gal4 binding sites and a spacer upstream of the hsp70 TATA box and transcriptional start driving lacZ expression.
FIG. 6. Additional distance dependence of permissive repressor-activator stoichiometries. The reporter gene structure is shown below and chimeric Gal4 activators used to drive expression from the reporter gene are indicated to the left of the embryos. (A, C, and E) Full-length Gal4 protein was expressed ubiquitously, while the minimal Gal4 activator and a Gal4-VP16 activator were expressed in ventral regions under the twist promoter. Giant (gt) represses all three activators when three Gal4 sites are present. (B, D, and F) Giant repression activity is absent when the three Gal4 sites are moved 37 nucleotides from the Giant sites, although still less than 100 bp from the Giant sites. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes.
FIG. 3. Effectiveness of repression correlates with the affinity of Gal4 activator binding sites. Sequences of Gal4 binding sites are shown (left). The high-affinity Gal4 sites were used in reporters shown in Fig. 1A and B and 2, and low-affinity Gal4 binding sites were used here. The fortuitous Bicoid binding site is underlined in gray. (A) For reference, giant expression in the early blastoderm embryo, visualized by in situ hybridization, refines into two stripes anteriorly and one stripe posteriorly. (B and C) Giant (gt) represses lacZ expression driven by a minimal Gal4 activator in ventral regions acting on a cluster of five low-affinity Gal4 binding sites (arrows). These sequences also appear to bind to unidentified pair-rule repressors which confer an overall striped expression pattern on the reporter gene. This pattern made analysis of the Giant repression pattern more difficult; however, lacZ expression was consistently reduced in regions of Giant expression. (D and E) Giant represses Bicoid-mediated activation of the hsp70 lacZ reporter (arrows). Even in the absence of the Gal4 activator, lacZ expression is activated by the transcription factor Bicoid in the anterior region of the embryo from five high-affinity Bicoid sites that overlap the Gal4 sites. Bicoid-mediated activation is refined into two stripes of expression as the embryo develops, in regions where giant is not expressed. (F) lacZ expression in a giant mutant shows unrepressed expression mediated by Bicoid. lacZ expression is no longer refined into a two-stripe pattern in the giant mutant background. The embryo shown in F is of an age comparable to that shown in E. A minimal Gal4 activator was expressed in ventral regions under the control of the twist promoter. Expression patterns were visualized in 2- to 4-h embryos by in situ hybridization with antisense lacZ probes. UAS, upstream activation sequence.
P-element transformation, crosses to reporter genes, and in situ hybridizations. P-element transformation vectors were introduced into the Drosophila germ line by injection of yw67 embryos as described previously (72). Embryos were collected either directly from each transgenic reporter line or from a cross between a reporter line and a line expressing the Gal4 activator chimeric proteins in the ventral regions or ubiquitously throughout the embryo. The embryos were fixed and stained using digoxigenin-UTP-labeled antisense RNA probes to either lacZ or w as described previously (72). Embryos shown are generally representative of at least 90% of scored embryos of the relevant age, except as noted otherwise.
These results indicate that the simple notion that short-range repressors block the activity of all protein complexes within 100 bp is an oversimplification. Clearly, mere proximity is not the only determinant affecting repression by Giant. We set out to systematically define other factors that dictate repression effectiveness to uncover a potential cis-regulatory grammar of short-range repression. The repressed and nonrepressed reporter genes (Fig. 1, compare A and C) differ in the sequence of the activator sites, nature of the basal promoters, and repressor position with respect to the transcriptional start site. Activator binding site affinity or spacing seems likely to be a more important factor, because Giant has been previously shown to be able to repress genes with both types of basal promoter, and the relative spacing of the repressors to +1 should in fact favor repression as shown in Fig. 1A, in which the Giant repressor sites are closer to the start site of transcription. We first tested whether activator binding sites played a role.
Repression sensitivity correlated to the number of activator binding sites or strength of the activating signal. Studies of the hairy gene in Drosophila led to the suggestion that the overall stoichiometry, rather than the absolute number, of activators and repressors may be critical in dictating enhancer output (44). To test whether the stoichiometry of activators to repressors is a critical factor in determining short-range repression levels by Giant, we reduced the number of Gal4 activator binding sites on the hsp70-lacZ reporter from five (Fig. 2A and B) to three (Fig. 2C and D). As anticipated, the levels of transcriptional activation by the minimal Gal4 activator were lower in the transgene containing three Gal4 sites (Fig. 2C and D), leading to a less robust ventral staining pattern. In this context, Giant was able to block transcription of the lacZ gene (Fig. 2C and D). However, the removal of two Gal4 sites also positions the repressors closer to the start of transcription, which may facilitate repression of the basal promoter ("direct repression"). Therefore, to maintain the distance between Giant binding sites and the start of transcription, a neutral spacer was placed downstream of the three Gal4 sites (Fig. 2E and F). Again, Giant was also able to repress the minimal Gal4 activator. These results demonstrate that repression is critically dependent on the number of activator binding sites but do not explicitly differentiate between the overall level of transcriptional activation and binding site number, an issue addressed in Fig. 4. These results are also consistent with previous analyses of the eve stripe 2 element, where the insertion of additional Bicoid binding sites in an otherwise normal stripe 2 enhancer causes a slight anterior expansion of its expression pattern, suggesting that an excess of Bicoid activators can "overwhelm" the Giant repressor (3).
Binding site affinity. Binding site affinity influences threshold responses to activator gradients in the embryo (25, 36, 76), and indeed, transcription factor binding sites of various affinities are typically found in many developmental enhancers that function during early Drosophila development. Such differences in activator site affinity might similarly influence responses to short-range repressors. We tested whether maintaining the number of activator sites but weakening their affinity would in fact change the response to repressors. We replaced the five high-affinity Gal4 binding sites in the hsp70 lacZ reporter with five copies of a site from the Saccharomyces cerevisiae Gal1-Gal10 promoter that has been characterized as a weaker Gal4 binding site (13, 37). The minimal Gal4 activator drives gene expression in a weaker, striped pattern from the lower-affinity Gal4 sites. Anterior and posterior repression by Giant is evident (Fig. 3B and C, arrows), similar to the pattern observed in Fig. 1C. As expected, later in development, when Giant protein is no longer present, lacZ is expressed in a continuous swathe (data not shown). The striped expression of the constructs is thought to be due to the binding of uncharacterized pair-rule repressors to spacer sequences in the reporter (79).
In the process of weakening the Gal4 binding sites, we inadvertently created five high-affinity binding sites for the Bicoid activator, providing an additional opportunity to assay Giant repression activity. Bicoid is maternally deposited in the anterior regions of the embryo, forming an anterior-to-posterior gradient (24). lacZ expression from the hsp70 reporter is activated even in the absence of the Gal4 activator by the Bicoid transcription factor in anterior regions (Fig. 3D and E). As the embryo develops, Giant inhibits Bicoid activation of lacZ, which is thereby progressively refined into a two-stripe pattern (Fig. 3E), in regions where giant is not expressed (Fig. 3A). Analysis of the transgene in a giant mutant background in the absence of Gal4 confirms that refinement of reporter gene expression is due to repression by Giant (Fig. 3F). These results suggest that five Bicoid binding sites are more susceptible to repression than are five high-affinity Gal4 sites, indicating that stoichiometric relationships of repressors to activators in turn may depend on either distinct DNA binding domains or the type of activation domains.
Repression not dependent on the nature of the activation domain. The differential effectiveness of Giant against five Gal4 or five Bicoid binding sites suggests that the nature of the activation domain itself or the DNA binding domain of the transcriptional activator may play a role in dictating the response to repressors. To distinguish between those two possibilities, we compared the activities of a variety of activation domains fused to the DNA binding domain of Gal4. In addition to the Gal4 activation domain, we tested the acidic transcriptional activation domain of the herpes simplex virus activator VP16, the glutamine-rich activation domain of the mammalian transcription factor Sp1, and the hTBP, which has been shown to function as an activator when targeted to the promoter via the Gal4 DNA binding domain (50). We also sought to test the activity of Gal4-Bicoid activators (35), but unfortunately, these chimeras exhibit strong promoter specificity and are not active on the hsp70 promoter, which precluded a direct comparison (data not shown). The Gal4 chimeric proteins were used to drive expression of the hsp70 lacZ reporter from the cluster of five high-affinity Gal4 sites (Fig. 4). Giant could inhibit neither the strong Gal4 (Fig. 4A) and VP16 (Fig. 4B) activators nor the weak activation domains of Sp1 (Fig. 4C) and hTBP (Fig. 4D). These results indicate that the ability to repress does not depend on the strength of the activation domain or the activation pathway. Only those genes in which the number or affinity of Gal4 sites was reduced showed a response to Giant, suggesting that the Gal4 DNA binding domain provides a stable platform that can resist the activity of Giant. These results are consistent with a mechanism for short-range repression that involves blocking activator access to its cognate sites.
The arrangement or distribution of short-range repressor binding sites is critical in dictating repression effectiveness. Statistical models, based on motif clustering, are only partially successful at finding novel cis-regulatory elements in the genome, perhaps because they consider only site density and relative site affinity (11, 52, 53, 66). However, it is probable that specific arrangements of binding motifs also contribute to biological function (51). We tested the effect of alternative arrangements of Giant repressor and Gal4 activator binding sites to determine if different arrangements or combinations resulted in distinct transcriptional outputs. In all reporter arrangements tested, we used four Giant binding sites and five high-affinity Gal4 binding sites, bound by the minimal Gal4 activator. Flanking the five Gal4 activator sites with two Giant sites on either side resulted in repression of the proximal hsp70 lacZ reporter gene (Fig. 5A). Interspersing the Giant repressor binding sites between the Gal4 activator sites also resulted in the inhibition of lacZ expression (Fig. 5C). However, placing all four Giant binding sites 5' of the five Gal4 sites prevented Giant from repressing the hsp70 lacZ expression (Fig. 5E), suggesting again that promoter response cannot be calculated simply from overall activator-to-repressor stoichiometries.
The Giant binding sites in the reporter genes shown in Fig. 5A and C are in close proximity to the basal promoter; therefore, it is possible that Giant directly represses the basal promoter (4, 31, 34). To distinguish between repressor-basal promoter and repressor-activator effects, we measured transcription of the w gene, which is 4.5 kbp 3' of these sites (Fig. 5B, D, and F). Again, we observed that Giant mediated repression only when flanking or interspersed with activators (Fig. 5B and D) but not when situated 5' of the activator sites (Fig. 5F). This result suggests that Giant is acting on the activator cluster rather than only on the basal promoter element.
Previous analysis of the short-range repressor Giant demonstrated that due to the extreme distance-dependent activity of this protein, subtle changes in the spacing of Giant binding sites endowed a promoter with high or low sensitivity to repression (34). We tested whether Giant's ability to repress a smaller cluster of three Gal4 sites could be affected by small changes in spacing between the activator and repressor binding sites. Moving the smaller cluster of three Gal4 sites 37 bp away from the Giant binding sites results in the loss of repression (Fig. 6, compare A and B), suggesting that reducing the amount of activation potential does not guarantee repression by Giant in all cases, even when the activators are located within 100 bp of the repressor sites. In order to ascertain whether the spacing effects we see are specific to this particular activator protein (i.e., Gal4-Gal4 AD), we tested the ability of Giant to block transcription mediated by the full-length Gal4 protein expressed ubiquitously throughout the embryo (Fig. 6C and D) and the Gal4-VP16 fusion protein (Fig. 6E and F). As seen with the minimal Gal4 activation domain, Giant is able to repress lacZ expression mediated by the full-length Gal4 protein (Fig. 6C) and Gal4-VP16 (Fig. 6E) from three sites that are adjacent to the Giant binding sites. Moving the three sites 37 bp further away results in the loss of repression of both Gal4-mediated (Fig. 6B and D) and Gal4-VP16-mediated (Fig. 6F) activation by Giant.
Specificity of regulatory grammar. The contextual dependencies of repression described above were characterized for the Giant repressor. To determine if similar rules applied to other types of repressors, we carried out parallel evaluations of the short-range repressors Giant, Knirps, and Krüppel. To test quantitative similarities or differences between these factors, we created reporters that would compare repressor activity on genes that represented permissive or nonpermissive contexts for the Giant protein. All three of these short-range repressors were unable to inhibit lacZ expression driven by the minimal Gal4 activator from five high-affinity Gal4 sites, indicating a similar limitation of repression on even proximally bound activators (Fig. 7B, D, and F). The Giant and Krüppel factors were active in the corresponding regions of the embryo when tested against three Gal4 sites (Fig. 7A and E). The Knirps repressor was also active in this context, although in general, the levels of repression appeared to be lower (Fig. 7C). In contrast, the long-range repressor Hairy was able to mediate repression of transgenes containing either three or five high-affinity Gal4 sites (Fig. 7G and H). Interestingly, as the embryo aged, repression by Hairy was first attenuated and then completely absent during germ band elongation (data not shown), indicating that this type of repression, though potent, is also transient. The similarity in the activity of the short-range repressors Giant, Knirps, and Krüppel, in contrast to that of Hairy, suggests that the contextual rules for repression are governed by the functional class of repressor and likely reflects mechanistic differences.
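The Giant results described above can be collected into a small lookup structure to make the main point explicit: site counts alone do not determine the outcome. The sketch below is an illustrative summary only. The class `Observation`, its field names, and the arrangement labels are hypothetical conveniences, and each record is restricted to hsp70 lacZ reporters driven by the minimal Gal4 activator as reported for the indicated figure panels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Observation:
    figure: str
    gal4_sites: int
    gal4_affinity: str   # "high" or "low"
    giant_sites: int
    arrangement: str     # position of Giant sites relative to Gal4 sites
    repressed: bool


OBSERVATIONS = [
    Observation("Fig. 2A-B", 5, "high", 2, "5' of activators", False),
    Observation("Fig. 2C-F", 3, "high", 2, "5' of activators", True),
    Observation("Fig. 3B-C", 5, "low", 2, "5' of activators", True),
    Observation("Fig. 5A", 5, "high", 4, "flanking", True),
    Observation("Fig. 5C", 5, "high", 4, "interspersed", True),
    Observation("Fig. 5E", 5, "high", 4, "5' of activators", False),
    Observation("Fig. 6B", 3, "high", 2, "5' of activators, moved 37 bp", False),
]


def outcomes_by_stoichiometry(observations):
    """Map (Gal4 sites, Giant sites) to the set of outcomes observed."""
    table = {}
    for obs in observations:
        key = (obs.gal4_sites, obs.giant_sites)
        table.setdefault(key, set()).add(obs.repressed)
    return table


# Site-count combinations for which both repression and no repression
# were observed, so that stoichiometry alone cannot predict the outcome.
ambiguous = sorted(
    key for key, outcomes in outcomes_by_stoichiometry(OBSERVATIONS).items()
    if len(outcomes) > 1
)
print(ambiguous)
```

Every site-count combination in this summary is associated with both outcomes, which is why affinity, spacing, and arrangement have to be recorded alongside the counts.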
100 bp. Although distance is a critical factor in dictating repression effectiveness, it is not the only one, and in some cases, close proximity alone is not sufficient to ensure regulation by these transcriptional repressors (Fig. 1, 4, 5, and 6 and reference 43). Activators can retain function even when the binding sites are within the previously defined 100-bp effective range of short-range repression. The manipulation of these composite enhancer elements in terms of the number of activator and repressor binding sites, relative affinities, spacing, and distribution of binding sites and the type of activation domains allowed us to define other contextual parameters that dictate repression effectiveness. First, we find that the ratio of activators and repressors is an important factor; in the context of five high-affinity Gal4 sites, four Giant sites can mediate repression but two sites do not. Reducing the number of Gal4 binding sites from five to three allowed two Giant sites to repress the lacZ reporter gene. Second, although the effectiveness of repression depends on stoichiometry between the number of activators and repressors, Giant repression of a smaller cluster of activators can be attenuated by subtle changes (<40 bp) in the spacing between the repressor and activator binding sites, even when activator binding sites in this situation are within the previously defined 100-bp range of repression. Such subtle changes in spacing between Giant and activator sites may explain the internal reconfigurations in enhancer design that have been demonstrated to occur between functionally homologous even-skipped stripe 2 enhancers and presumably many other cis-regulatory elements (47). Third, we find that in order to mediate repression effectively, short-range repressors need to be judiciously placed, either flanking activator sites or interspersed among them, possibly to block multiple modes of activator-promoter interactions.
A fourth finding is that repression effectiveness correlates with activator site affinity, and although binding affinity influences the strength of the activating signal, repression does not depend on the chemical nature of the activation domain. Although we have developed these experiments in the context of Gal4 fusion activators, it is likely that similar principles apply for repression of other activators, as repression of native activators also shows strong context dependence (3, 79). Most likely, quantitative aspects of the relationships we have identified will vary depending on the DNA binding characteristics of different factors, whose characteristics will be established by further empirical tests. Determination of such quantitative factors contributes to our understanding of enhancer design and should find application in bioinformatics analysis of novel gene regulatory sequences as well as providing insights into the evolution and biochemical activity of short-range repressors.
Computational analysis of cis-regulatory elements. Computational approaches have focused on the identification of transcriptional regulatory regions based on patterns of binding sites and evolutionary conservation of sequences. A more ambitious objective is to identify quantitative information about enhancers, including temporal, spatial, and quantitative output of such elements. More sophisticated analytical tools might also involve identification of conserved patterns of binding site stoichiometries, arrangements, and affinities that are not readily discernible by using conventional analyses. Recently, bioinformatics analysis of number and affinity of binding sites for the Knirps and Hunchback repressors was used to successfully predict the relative sensitivity of different regulatory sequences to these factors (20).
In addition to quantitating the number and affinity of factor sites, our study indicates that bioinformatics analysis should also take into account the stoichiometry of activators to repressors, the exact spacing involved, and the nature of the DNA binding domains involved. Clearly, our studies focus on the effects of one class of repressor protein; more comprehensive work will be required to elaborate parameters relevant for other types of repressors and for activators. It is unlikely that particular contextual grammars would apply to all transcription factors; however, it is encouraging that the short-range repressors tested so far show similar characteristics. It therefore appears possible to model the properties of groups of proteins without having to develop distinct cis-regulatory grammar rules for each one. Incremental improvements to current approaches, based on the identification of cis-regulatory grammars, will usefully enhance the power of computational tools and allow the extension of bioinformatics analysis to specific data sets.
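As a purely illustrative sketch of how two of the contextual parameters named above (activator-to-repressor stoichiometry and repressor-activator spacing) might be tabulated from annotated binding sites, the following Python snippet computes them for a toy element. The `Site` record, the `context_features` function, and all coordinates in the example are hypothetical and are not taken from this study; only the 100-bp window corresponds to the short-range distance discussed in the text. The code summarizes features and makes no prediction about whether repression occurs.

```python
from dataclasses import dataclass
from typing import Dict, List

SHORT_RANGE_BP = 100  # short-range repression window discussed in the text


@dataclass(frozen=True)
class Site:
    """Hypothetical annotation of one binding site on a regulatory element."""
    factor: str    # e.g. "Gal4" or "Giant"
    role: str      # "activator" or "repressor"
    position: int  # site center in bp (made-up coordinates)


def context_features(sites: List[Site], window: int = SHORT_RANGE_BP) -> Dict[str, object]:
    """Tabulate stoichiometry and spacing features; this is not a predictor."""
    activators = [s for s in sites if s.role == "activator"]
    repressors = [s for s in sites if s.role == "repressor"]

    # Distance from each activator site to its nearest repressor site
    # (None when the element carries no repressor sites).
    nearest = [
        min((abs(a.position - r.position) for r in repressors), default=None)
        for a in activators
    ]
    within = sum(1 for d in nearest if d is not None and d <= window)

    return {
        "n_activators": len(activators),
        "n_repressors": len(repressors),
        "repressor_per_activator": (
            len(repressors) / len(activators) if activators else None
        ),
        "nearest_repressor_bp": nearest,
        "activators_within_window": within,
    }


# Toy element: five activator sites and two repressor sites.
# The positions are invented for illustration only.
example = [Site("Gal4", "activator", p) for p in (0, 20, 40, 60, 80)]
example += [Site("Giant", "repressor", p) for p in (120, 150)]
features = context_features(example)
```

Keeping the features separate from any scoring rule is deliberate: the quantitative relationships are expected to differ between factors, so thresholds would have to be fitted empirically for each class of repressor rather than fixed in code.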
Mechanisms of repression. The contextual grammar defined in this study presents a phenomenological perspective to short-range repression, but our results also shed light on possible repression mechanisms. Three models have been presented for the action of short-range repressors. First, by binding overlapping sites, these repressors might directly compete with activators for binding to DNA, a situation that can be demonstrated experimentally (56). This mechanism has not been shown to play a role in endogenous enhancers, and where experimentally tested, the DNA binding domain of Knirps was not able to mediate repression in the embryo (75). It is in any event unlikely to be important in cases where the activator and repressor binding sites are separated, as is the case here. Second, repressors might "quench" neighboring activators, inhibiting their access to the DNA or blocking their interaction with other components of the transcriptional machinery. Third, the proteins might not affect activators but directly contact the basal transcriptional machinery. The results obtained in this study and a recent study (43) are most compatible with the second, quenching model of action. We have previously demonstrated that closely spaced factors can simultaneously mediate opposite transcriptional regulatory outputs (43), which would be hard to rationalize in the context of basal machinery interactions but is readily explainable in light of different susceptibilities of activators to chromatin remodeling. In addition, as shown in this study, the sensitivity of activators toward repression appears to be most closely linked to the DNA binding domain and affinity of the binding site rather than the activation domain, which may reflect a limited access to the DNA template under repression conditions.
The apparent lack of activator specificity demonstrated by short-range repressors also suggests that these proteins function via a general mechanism. Giant, Knirps, Krüppel, and Snail can block the activity of a number of activators such as Bicoid, Hunchback, Dorsal, Twist, and D-Stat (3, 32, 73). Many biochemical and genetic analyses suggest that at least some of these activators activate transcription via distinct pathways (40, 58, 81). Here, we have demonstrated that repression effectiveness does not depend on the nature of the activation domain but correlates instead with activator binding site affinity and placement. These findings are consistent with a mechanism that inhibits transcription by blocking access to DNA by transcriptional activators via local chromatin changes.
This model is also consistent with biochemical properties of short-range repressors. These proteins interact with CtBP, which in turn binds chromatin-modifying factors, including histone deacetylases (HDAC1 and HDAC2) and histone methyltransferases (18, 19, 69, 78). We have found that Knirps genetically and physically interacts with Rpd3, the Drosophila homolog of HDAC1 (P. Struffi, unpublished data). The Rpd3 protein in yeast is known to deacetylate histones at an extremely local level, consistent with its role in short-range repression in Drosophila (23). Knirps, Giant, and Krüppel can repress in a CtBP-independent fashion (38, 77), but this activity appears to possess similar properties to that mediated by the Drosophila CtBP-dependent activity, providing a quantitative, rather than qualitative, effect (63, 75, 79). Thus, both the Drosophila CtBP-dependent and -independent activities of the short-range repressors might work via chromatin remodeling.
Our study demonstrates that the Hairy repressor, in addition to working over a longer range, is also a more potent repressor on a local level, presumably because of its distinct biochemical mechanism for repression. By examining the nature of the promoter complexes and the chromatin state before and after repression, the defined transcriptional switch elements used in this study will facilitate further biochemical characterization of short- and long-range repressors.
This work was supported by NIH grant GM56976 to D.N.A.
Present address: Department of Genetics, Harvard Medical School, Boston, MA 02115-6092.