Molecular and Cellular Biology, April 2001, p. 2629-2640, Vol. 21, No. 8
0270-7306/01/$04.00+0 DOI: 10.1128/MCB.21.8.2629-2640.2001
Copyright © 2001, American Society for Microbiology. All rights reserved.
-Globin Locus Control
Region Hypersensitive Sites in the Absence of Chromatin
Assembly

Department of Biochemistry and Molecular Biology, Powell Gene Therapy Center, University of Florida College of Medicine, Gainesville, Florida1; Adolf Butenandt Institute, Ludwig Maximillian University, Munich, Germany2; Department of Biochemistry, Hiroshima University School of Medicine, Hiroshima, Japan3; and Department of Biochemistry, Molecular Biology and Cell Biology, Northwestern University, Evanston, Illinois4
Received 1 September 2000/Returned for modification 26 October 2000/Accepted 24 January 2001
| |
ABSTRACT |
|---|
|
|
|---|
The human
-globin genes are regulated by the locus control
region (LCR), an element composed of multiple DNase I-hypersensitive sites (HS sites) located 5' to the genes. Various functional studies indicate that the LCR confers high-level, position-independent, and
copy number-dependent expression to linked globin genes in transgenic
mice. However, the structural basis for LCR function is unknown. Here
we show that LCR HS sites can be reconstituted in an erythroid
cell-specific manner on chromatin-assembled LCR templates in vitro.
Surprisingly, HS2 and HS3 are also formed with erythroid proteins in
the absence of chromatin assembly, indicating that sensitivity to
nucleases is not simply a consequence of nucleosome reorganization. The
generation of LCR HS sites in the absence of chromatin assembly leads
to the formation of S1- and KMnO4-sensitive regions in HS2
and HS3. These sites are also sensitive to S1 nuclease in erythroid
cells in vivo, suggesting a distorted DNA structure in the LCR core
enhancer elements. Finally, we show that RNA polymerase II initiates
transcription in the HS2 and HS3 core enhancer regions in vitro.
Transcription in both HS2 and HS3 proceeds in a unidirectional manner.
Taken together, the data suggest that erythroid proteins interact with
the core enhancer elements, distort the DNA structure, and recruit
polymerase II transcription complexes. These results further our
understanding of the structural basis for LCR function and provide an
explanation for why the LCR core regions are so extremely sensitive to
nucleases in erythroid cells.
| |
INTRODUCTION |
|---|
|
|
|---|
The human
-globin locus contains
five genes (
, G
, A
,
, and
), which are organized in
sequential order on chromosome 11 (39). The gene order
reflects the timing of expression during erythroid development, with
the embryonic
gene located at the 5' end and the adult
-globin
gene at the 3' end of the locus. Developmental stage-specific
expression is controlled mainly at the transcriptional level by a
variety of gene-proximal or -distal cis elements. The most
prominent distal regulatory element is the
-globin locus control
region (LCR), located from 8 to 22 kb upstream of the
-globin gene
and composed of several subregions that exhibit heightened sensitivity
to digestion with exogenous DNase I (HS sites) in erythroid cells
(14, 18, 41). Originally, five HS sites (HS1 to HS5) were
associated with LCR function, but more recently additional sites were
mapped even further 5' to the LCR (6). Many studies
indicate that the human
-globin LCR is able to confer
position-independent expression to linked globin genes in transgenic
mice (18, 19). It is generally believed that this activity
is based on the ability of the LCR to provide an open accessible
chromatin structure regardless of where the transgenic locus integrates
in the mouse genome. Recent results suggest that the chromatin-opening
function of the LCR may not be the primary activity in the endogenous
mouse or human globin locus, because the LCR can be deleted without
affecting general sensitivity to DNase I (5, 36).
Regardless of mechanism, virtually all studies agree that the LCR is
required for conferring high-level globin gene expression throughout
erythroid development.
We have previously analyzed the role of individual LCR HS core elements
in the context of the whole
-globin gene locus in
-globin yeast
artificial chromosome (YAC) transgenic mice (9, 10). The
results of these experiments showed that in intact, single-copy
transgenes, deletion of individual core HS elements dramatically
reduced expression of all of the globin genes at all developmental
stages. We also found that these mutations impaired the formation of
DNase I hypersensitivity in the LCR and in the adult
-globin gene
promoter (28).
These results support a model in which the HS sites interact to generate a higher-order structure, referred to as the LCR holocomplex (9, 10, 43). Although this model is consistent with the data from many studies addressing the function of wild-type and mutant LCRs, there is currently no direct physical evidence for the formation of an LCR holocomplex. However, recent work by Lee et al. (27) suggests that the erythroid cell-specific transcription factor EKLF (erythroid Krüppel-like factor) binds cooperatively to HS2 and HS3. In addition, Yoshida et al. (45) recently showed that the protein Bach1, which heterodimerizes with small Maf proteins and interacts with MARE (Maf-responsive element) sequences present in HS2, HS3, and HS4, is able to cross-link HS sites, thereby looping out intervening DNA sequences. These results, together with various in vitro studies, suggest that a network of protein-protein and protein-DNA interactions could mediate the formation of a larger LCR complex. Notable among these in vitro studies are observations that GATA factors are able to engage in a variety of protein-protein interactions involving EKLF, Sp1, LMO2, and Tal1 (30, 42). All of these proteins (with the exception of LMO2, which serves as a bridging molecule between GATA factors and Tal1) were shown to interact with LCR core HS sites.
The individual LCR HS core elements are 200 to 400 bp in size and harbor clusters of binding sites for erythroid cell-specific and ubiquitously expressed transcription factors (20). All of the core HS sites (HS1 to HS5) contain binding sequences for GATA factors and GC-rich elements, which can interact with Sp1 or related proteins. HS2, HS3, and HS4 also harbor MARE sequences that interact with transcription factors of the NF-E2 family (NF-E2, Nrf1, Nrf2, and Bach1). HS2 and HS3 contain potential binding sites for EKLF. E-box motifs present in HS2 were shown to be critical for HS2 function and to interact with helix-loop-helix proteins (heterodimers containing either USF or Tal1 [11]).
Several studies addressed the importance of particular transcription
factors and their binding sites for the generation of DNase I
hypersensitivity in individual LCR HS sites. It was shown that
combinations of MARE and GATA sequences are critical components for the
formation of HS2, HS3, and HS4 (35, 40). Furthermore, NF-E2 is critically involved in remodeling the nucleosome structure over the HS2 core region, where it interacts with tandemly arranged MARE sequences (1, 16). Finally, EKLF has been implicated in the formation of HS3, since this site is not formed efficiently in
EKLF-deficient human
-globin locus transgenic mice
(44).
To analyze the mechanism of HS site formation in the human
-globin
LCR, we have developed an in vitro system to reconstitute the HS sites
on chromatin-assembled templates containing the entire LCR. For these
experiments, we subcloned the LCR from a human
-globin YAC by
homologous recombination (gap repair) in yeast. Biotinylated LCR
templates were immobilized on streptavidin-coated beads and incubated
with chromatin assembly extracts in the presence of erythroid proteins.
These data show that LCR HS sites are generated on chromatin in an
erythroid cell-specific manner and are restricted to the core HS
regions. Surprisingly, HS2 and HS3 are also formed on the LCR templates
in the absence of chromatin assembly, indicating that formation of
these sites does not simply reflect a change in chromatin structure.
The in vitro-reconstituted HS sites are also sensitive to S1 nuclease
and reveal extended regions of sensitivity to KMnO4.
Finally, we show that RNA polymerase II-specific transcripts initiate
within the cores of HS2 and HS3 and proceed in a directional manner.
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Subcloning of the human
-globin LCR and mutant
derivatives.
The human
-globin LCR was subcloned into pRS316, a
yeast episomal shuttle vector, by homologous recombination in yeast
harboring the whole
-globin locus on a YAC. For generating the
target construct, we isolated restriction fragments corresponding to
the 5' flanking region of HS5 (EcoRI/SacI) and to
the 3' flanking region of HS1 (SacI/XbaI) from a
lambda phage library (9). These fragments were ligated
into the EcoRI/XbaI site of pRS316 to generate
pRSHS5/HS1. The plasmid was linearized with SacI and used to
transform yeast cells harboring a 155-kb human
-globin YAC
(15). After transformation, the yeast cells were plated
onto selective agar plates lacking uracil (Ura
plates).
Subsequently, transformants were restreaked on Ura
plates. DNA isolated from these clones was digested with
PvuII, separated by electrophoresis, and transfered to nylon
membranes. Hybridization was then carried out with a radiolabeled
restriction fragment from pRS316 (220-bp
EcoRI/PvuII) to identify clones in which the LCR
was linked to the episomal vector (resulting in pRS/LCR). DNA from
positive clones was isolated and used to transform electrocompetent
Escherichia coli (DH10B). For subcloning a mutant LCR
lacking the core enhancer of HS2, the SacI-linearized target vector pRSHS5/HS1 was used to transform yeast cells carrying a mutant
-globin YAC in which the 375-bp core enhancer of HS2 was deleted by
homologous recombination (10). Selection of positive transformants and shuttling of plasmids into E. coli was
carried out as described for the generation of pRS/LCR. We refer to the LCR deletion mutant as pRS/LCR
2. We also generated a plasmid in
which the LCR was linked in cis to the adult human
-globin gene. For generating the target construct, we ligated a
HincII/XbaI fragment containing the
-globin
gene into the HincII/XbaI site of pRSHS5/HS1,
placing the
-globin gene 3' to the HS1 flanking region. This vector,
pRSHS5/HS1-
, was used to transform yeast cells carrying wild-type
human
-globin YACs. Yeast clones in which the LCR had integrated
into the plasmid (referred to as pRS/LCR-
) were selected as
described above, and DNA isolated from these clones was used to
transform E. coli cells.
Immobilization of LCR constructs.
The LCR containing
constructs (pRS/LCR, pRS/LCR
2, and pRS/LCR-
) were linearized with
ClaI and XhoI, and the 5' overhanging ends were
filled in with biotinylated dATP, dGTP, biotinylated dCTP, biotinylated
dUTP, and Klenow polymerase. The biotinylated templates were then
attached to streptavidin-coated magnetic beads using a kilobase binder
kit (Dynal) as recommended by the manufacturer. To monitor the
efficiency of coupling, an aliquot of DNA attached to beads was
digested with EcoRI and analyzed on 1% agarose gel. The DNA
concentration was determined by fluorometry using a Versafluor fluorometer (Bio-Rad).
Chromatin assembly. The preparation of Drosophila chromatin assembly extracts and the assembly of LCR constructs into chromatin was carried out essentially as described by Sandaltzopoulos et al. (38). Briefly, 600 ng of immobilized template DNA was incubated with 300 µg of Drosophila chromatin assembly extract in a reaction mixture (total volume of 70 µl) containing 10 mM HEPES-KOH, (pH 7.6), 50 mM KCl, 3 mM MgCl2 (pH 8.0), 0.5 mM EGTA (pH 8.0), 10 mM glycerolphosphate, 30 mM creatine phosphate, 30 mM ATP, 1 µg of creatine phosphokinase/ml, 10% glycerol, 1 mM dithiothreitol (DTT), and 0.2 mM phenylmethyl sulfonyl fluoride (PMSF) for 6 h at 26°C. After the chromatin assembly reaction, all unbound material was removed using a magnet (Dynal), and the beads were resuspended in buffer A (10 mM HEPES [pH 7.6], 10 mM KCl, 1.5 mM MgCl2, 0.5 mM EGTA [pH 8.0], 10% glycerol, 10 mM glycerolphosphate, 1 mM DTT, 0.2 mM PMSF).
Preparation of protein extracts. Extracts from two different cell types were used in the in vitro experiments: MEL (mouse erythroleukemia) cells and HeLa cells (derived from human cervical carcinoma). Protein extracts used for transcription reactions were prepared as described by Bungert et al. (8). The range of protein concentrations was between 9 and 11 mg/ml. To prepare protein extracts used for nuclease HS site and KMnO4 mapping experiments, 108 cells were pelleted by centrifugation, washed in phosphate-buffered saline (13.7 mM NaCl, 0.27 mM KCl, 0.43 mM Na2HPO4 · 7H2O, 0.14 mM KH2PO4), and resuspended in 1 ml of lysis buffer (20 mM HEPES [pH 7.6], 20% glycerol, 10 mM NaCl, 1.5 mM MgCl2, 0.2 mM EDTA, 0.1% Triton X-100, 1 mM DTT, 1 mM PMSF). After incubation for 3 min on ice and centrifugation at 2,000 rpm (Eppendorf Microcentrifuge) for 10 min at 4°C, the pellets were resuspended by rocking in 1 ml of nuclear extraction buffer (20 mM HEPES [pH 7.6], 20% glycerol, 400 mM NaCl, 1.5 ml MgCl2, 0.2 mM EDTA, 1 mM DTT, 1 mM PMSF) for 1 h at 4°C. After centrifugation at 10,000 rpm for 10 min at 4°C, the supernatant was dialyzed for 16 h in 2 liters of dialysis buffer (20 mM HEPES [pH 7.8], 20% glycerol, 100 mM KCl, 0.2 mM EDTA, 1 mM DTT, 1 mM PMSF). The final protein concentration of different extracts was between 4 and 6 mg/ml.
Nuclease HS site mapping. Immobilized LCR constructs (600 ng) were incubated with 100 µg of MEL or HeLa extracts for 45 min at 30°C in buffer A (total volume, 70 µl). For analyzing DNase I sensitivity on chromatin, the templates were either preincubated (prior to chromatin assembly) or postincubated with MEL or HeLa extracts for 45 min at 30°C. All unbound material was removed by incubating the reactions on a magnet for 1 min at room temperature (RT), and the beads were resuspended in 160 µl of buffer A. The samples were divided into 30-µl aliquots and digested for 1 min at RT with DNase I. For this reaction, a 30-µl mix containing increasing concentrations of DNase I (Sigma) (from 0.001 to 0.02 U for naked DNA and from 0.01 to 0.2 U for chromatin-assembled DNA) in buffer A (plus 5 mM CaCl2 and 10 mM MgCl2) was added to the samples. After addition of 20 µl of stop solution (2.5% sarcosyl, 100 mM EDTA), the samples were incubated with RNase I (1 µg/reaction) for 30 min at 37°C and subsequently overnight with 5 µl of proteinase K (50 µg/reaction) and 8 µl of 2% sodium dodecyl sulfate (SDS). After sequential phenol-chloroform and chloroform extractions, the DNA was precipitated, resuspended in 1× restriction buffer, and digested for 6 h to overnight with EcoRI. The DNA was precipitated, resuspended in Tris-EDTA (pH 7.6), loaded onto 1.3% agarose gels, and separated by electrophoresis for 6 h. After blotting to nylon membranes, the DNA was hybridized to a 32P-labeled 360-bp restriction fragment corresponding to the 3' flanking region of HS2 (a NotI/XbaI restriction fragment from pRS306 containing the HS2 flanking regions embedding the core enhancer of HS3 [10]). The S1 sensitivity mapping experiments were carried out in the same way as described for the DNase I HS site mapping experiments except that instead of being digested with increasing concentrations of DNase I, the DNA was incubated for 2 min with S1 (Promega) (from 0.01 to 1 U in S1 nuclease reaction buffer). The samples were then processed and analyzed as described for the DNase I mapping experiment.
For micrococcal nuclease (MNase) mapping of nucleosomes, the LCR constructs were incubated with chromatin assembly extracts and, after removal of unbound material through use of a magnet, resuspended in 70 µl of buffer A. After addition of 100 µl of MNase mix (buffer A plus 5 mM CaCl and 50 U of MNase [Sigma]), 30-µl aliquots were removed (at 15, 60, and 180 s) and added to 20 µl of stop solution (2.5% sarcosyl, 100 mM EDTA [pH 8.0]). After incubation with RNase I for 30 min at 37°C, 5 µl of proteinase K (10 mg/ml) and 8 µl of SDS (2%) were added to the samples, and incubation was continued overnight at 37°C. For analyzing the efficiency of chromatin assembly, the DNA was precipitated, resuspended, and separated by electrophoresis in 1.3% agarose gels. For mapping the position of nucleosomes, the samples were prepared essentially as described for the DNase I mapping experiments except that the DNA was digested with ApaI and BglII. The procedure for mapping DNase I and S1 sensitivity in vivo has been described previously (28). Briefly, nuclei were isolated from exponentially growing K562 (human erythroleukemia) cells, and aliquots (106 nuclei) were treated with increasing concentrations of either DNase I (0, 0.5, 2, and 10 U/ml) or S1 nuclease (0, 0.25, 1, 5, 10, and 30 U/ml). The DNA was isolated, digested with EcoRI, electrophoresed, and blotted to a nylon membrane. The radioactive probe used in Southern blotting experiments was the same as the one used in the DNase I and S1 nuclease in vitro analysis.Mapping of single-stranded regions with KMnO4.
The template DNA (600 ng of immobilized LCR constructs) was incubated
with HeLa or MEL protein extracts (100 µg) for 45 min at 30°C in a
total volume of 70 µl (in buffer A). After adding KMnO4
(final concentration, 4 mM), the samples were incubated 5 min at RT.
The reaction was stopped by the addition of 200 µl of stop buffer
(consisting of 6%
-mercaptoethanol, 2% SDS, and 100 mM EDTA). The
samples were then incubated with 1 µg of RNase for 30 min at 37°C.
After the addition of 20 µl of proteinase K (10-mg/ml stock solution)
and 3 µl of SDS (20%) the samples were incubated overnight at
37°C. After sequential phenol-chloroform-isoamyl alcohol and
chloroform-isoamyl alcohol extractions the DNA was precipitated and
resuspended in 4 µl of H2O. After addition of 1 µl of
32P-labeled primers (HS2US, 5'
GCATCCTCATCTCTGATTAAATAAGC 3'; HS2DS, 5'
GTCACATTCTGTCTCAGGCATCCAT 3'; HS3 US, 5'
TGGTGTGCCAGATGTGTCTA 3'; HS3DS, 5' GCTGCTATGCTGTGCCTCCC 3')
and 45 µl of PCR mix (5 µl of 10× Taq polymerase
buffer, 1 µl of 100 mM deoxynucleoside triphosphate, 1 µl of 1M
MgCl2, 0.25 µl of Taq polymerase, and 37.75 µl of H2O), DNA was amplified by primer extension-PCR
(eight cycles of 1 min at 92°C, 1 min at 60°C, and 1 min at 72°C)
and then analyzed by electrophoresis in 8% sequencing gels and autoradiography.
In vitro transcription assayed by primer extension.
A
typical transcription reaction mixture contained 300 ng of immobilized
template (pRS/LCR or pRS/LCR-
), 4 µl of transcription buffer
(Promega), 3 mM MgCl2, 40 µM ribonucleoside
triphosphates, 0.5 µl of RNasin (40 U/µl), and 70 µg of protein
extract (55 µg of MEL protein extract/15 µg of HeLa protein
extract) in a final volume of 25 µl and was incubated for 60 min at
30°C. After the addition of 175 µl of stop buffer (Promega) and
phenol-chloroform-isoamyl alcohol extraction, nucleic acids were
precipitated with 500 µl of ethanol for 15 min on dry ice and
pelleted by centrifugation. The pellets were resuspended in 5 µl of
primer extension buffer (Promega) and 5 µl of nuclease-free water.
After addition of 1 µl of 32P-labeled primer, the samples
were denatured (10 min at 70°C and 5 min on ice) and incubated for 20 min at 55°C and 10 min at RT. After addition of 9 µl of buffer
containing 5 µl of primer extension buffer, 2.8 mM sodium
pyrophosphate, 12 ng of actinomycin D, 0.5 µl of RNasin, and 1 µl
of avian myeloblastosis virus reverse transcriptase (1 U/µl;
Promega), the reaction mixtures were incubated for 60 min at 42°C.
The primers used in the primer extension assay were the same as those
described for the KMnO4 mapping experiments with the
exception of the HS3 downstream primer (HS3 DS*), which has the
sequence 5' CATGTCTGCCCTCTACTCATGG 3'. The primer for analyzing transcription of the
-globin gene has the sequence 5'
CGGCAGACTTCTCCTCAGGAGTCAGGTG 3'. After precipitation, the samples were resuspended in 5 µl of loading dye (Promega), loaded on 8% sequencing gels, separated by electrophoresis, and subjected to autoradiography.
| |
RESULTS |
|---|
|
|
|---|
Subcloning and immobilization of the human
-globin LCR.
The
LCR was subcloned from yeast cells carrying human
-globin YACs by
homologous recombination (Fig. 1;
Materials and Methods). A yeast episomal plasmid was generated by
ligating the 5' flanking region of HS5 and the 3' flanking region of
HS1 into pRS316. The plasmid was linearized with SacI and
used to transform yeast cells carrying the human
-globin YAC (A201
F4 [15]). Clones able to grow on Ura
plates were restreaked, and DNA was isolated from single colonies. The
DNA was digested with PvuII, subjected to electrophoresis, blotted to nylon membrane, and hybridized to a 220-bp
EcoRI/PvuII restriction fragment from pRS316.
Plasmids that incorporated sequences from the human
-globin LCR were
transformed into E. coli. DNA from transformed cells was
analyzed by restriction digestion with EcoRI and
HindIII. Using this strategy, we were able to generate pRS constructs containing the wild-type LCR, a mutant LCR lacking the
core enhancer of HS2, and the wild-type LCR linked to the adult human
-globin gene (pRS/LCR, pRS/LCR
2, and pRS/LCR-
, respectively
[Fig. 1B]).
|
2, and pRS/LCR-
were
linearized, biotinylated, and bound to streptavidin-coated magnetic
beads overnight at RT (Materials and Methods).
Erythroid cell-specific reconstitution of DNase I HS sites in
vitro.
To test whether formation of HS sites could be
reconstituted on chromatin-assembled LCR templates in vitro, pRS/LCR
was assembled into chromatin using an extract from
Drosophila embryos enriched in histones and nucleosome
assembly factors (Fig. 2)
(38). Chromatin assembly was monitored by MNase, which
digests nucleosomal DNA preferentially in the linker region, releasing
mono- or oligonucleosomal fragments. As shown in Fig. 2B, the 22-kb LCR
template is efficiently assembled into chromatin, revealing a typical
nucleosome ladder with a repeat length of about 200 bp. For
reconstitution of DNase I HS sites, the LCR was first assembled into
chromatin and then postincubated with MEL or HeLa cell extracts, after
which all unbound proteins were removed by incubating the beads on a
magnet (Fig. 2C). Chromatin assembly was performed as described above. After DNase I followed by EcoRI digestion, which releases a
10.4-kb fragment containing HS2, HS3, and HS4, the DNA was blotted and hybridized to a 32P-labeled probe corresponding to the 3'
flanking region of HS2.
|
|
Generation of erythroid cell-specific LCR HS sites in the absence
of chromatin assembly.
As a control for the experiment shown in
Fig. 2, the immobilized LCR template was incubated with proteins from
HeLa or MEL cells without prior chromatin assembly and then analyzed
with DNase I. Surprisingly, we found that the same sites generated on
the chromatin-assembled template are also generated on naked DNA (Fig.
4B). It should be noted that the MEL and
HeLa cell extracts used in these studies are likely to contain proteins
associated with chromatin or with the regulation of chromatin
structure. However, these proteins are not capable of forming a
nucleosomal structure over the LCR (Fig. 3D and data not shown) and
thus do not generate nucleosomal templates.
|
-like globin genes (Fig.
5). The results of these experiments show
that HS2 and HS3 are also sensitive to S1 nuclease in erythroid cells
in vivo. Importantly, S1 nuclease sensitivity colocalizes with DNase I
sensitivity in HS2 and HS3. Two observations may be of interest. First,
it appears that sensitivity to S1 nuclease in vivo is weaker in HS2
than it is in HS3. Second, the parental 10.4-kbp band disappears
completely with higher concentrations of DNase I but not with
increasing concentrations of S1 nuclease, possibly indicating that S1
nuclease sensitivity in the LCR is restricted to only a fraction of
cells. Together, these data show that one more (and novel) aspect of HS
site formation in the human
-globin LCR is recapitulated in our in
vitro system.
|
|
Transcription of the HS2 and HS3 core enhancers.
To determine
whether the S1-sensitive sites in HS2 and HS3 represent extended
regions of single-stranded DNA, we analyzed sensitivity to
KMnO4 in HS2 and HS3 after incubating the LCR with HeLa or
MEL cell extracts. Permanganate reacts with and oxidizes thymine
residues in regions of single-stranded DNA, leading to strand breaks.
The strand breaks induced by KMnO4 can be monitored by
primer extension-PCR (Materials and Methods). KMnO4 has
previously been used to map single-stranded DNA in promoter regions and
to localize stalled polymerases (26, 31). As shown in Fig.
7A, incubation of the LCR with MEL
extracts leads to the generation of an extended region of
KMnO4 reactivity in the core of HS2. This region is about
130 bp long and encompasses the MARE sequences, a GATA site, and an E
box, previously shown to interact with helix-loop-helix proteins and to
be required for HS2 function (11). We did not detect
KMnO4-reactive sites in HS2 after incubation of the LCR with HeLa extracts, despite the DNase I mapping experiments showing that a subsite in HS2 is formed by HeLa extracts (Fig. 4B). These data
show that the HeLa-induced sensitivity in HS2 does not represent a
region of DNA that is distorted or unwound because it does not react
with KMnO4. The control experiments analyzing
KMnO4 reactivity in the absence of protein extracts, or
primer extension products in the absence of KMnO4, indicate
that a subregion in HS2 is specifically rendered single stranded by
erythroid proteins.
|
-globin gene in a
construct in which the
-globin gene is linked to the LCR
(pRS/LCR-
) was analyzed simultaneously. As shown in Fig. 7B, the
-globin gene linked to the LCR is efficiently transcribed in vitro.
Using a primer that specifically hybridizes to the downstream region of
HS2, we also detected transcripts initiating within the HS2 core
enhancer. A major transcription start site mapped to a region 5' to the
two MARE sequences; this transcript proceeds in a 5'-to-3' direction
(Fig. 7B, lane 2). We detected no transcripts in HS2 that proceed in
the opposite direction (Fig. 7B, lanes 4 to 6). Taken together, the
results indicate that transcription initiates within HS2 and proceeds
in a unidirectional (5'-to-3') manner. To determine which polymerase is
responsible for this transcription, RNA synthesis was analyzed in the
presence of
-amanitin. Transcription in HS2 is inhibited by low
concentrations of
-amanitin, demonstrating that generation of the
HS2 unidirectional transcript is mediated by RNA polymerase II (Fig.
7B, lane 3), as is the
-globin transcript (Fig. 7B, lane 10).
KMnO4-sensitive regions were also analyzed in HS3 (Fig.
8A). Similar to the data obtained from
the analysis of HS2, an extended region of about 90 to 100 bp in HS3
was sensitive to KMnO4 after incubation with MEL cell
extracts. This region in the core of HS3 encompasses a putative binding
site for EKLF (or Sp1), an E box, and two GATA sites. It should be
noted that there is a higher background in the control experiments
using the HS3-specific primers than what was observed in the
experiments using the HS2-specific primers. We have focused our
attention to only those sites that are highly sensitive to
KMnO4 and erythroid cell specific.
|
-amanitin per ml
and proceeded in a 5'-to-3' direction was detected. The smaller primer
extension products seen in Fig. 8A were still detectable in reaction
mixtures incubated with
-amanitin. These smaller products do not
represent transcripts generated by RNA polymerase III, because they
were not inhibited by millimolar concentrations of
-amanitin (data
not shown). The polymerase II-specific transcript in HS3 maps to a
position 5' to a putative EKLF binding site. No transcripts were
detected initiating in HS3 and proceeding in a 3'-to-5' direction
(HS3US [Fig. 8, lanes 4 to 6]).
| |
DISCUSSION |
|---|
|
|
|---|
The human
-globin LCR has long been the subject of intense
study. Recent advances in analyzing the function of the LCR in the
endogenous mouse
-globin locus or in transgenic mice carrying the
whole human globin locus on YAC or cosmid constructs clearly established an important functional role of the LCR in mediating high-level expression of all the
-like globin genes throughout erythroid development (7, 12). Although the LCR in the
endogenous mouse locus does not appear to be required for the
generation of an open, DNase I-accessible chromatin structure, it is
quite clear that the human
-globin LCR has the ability to provide an open accessible chromatin structure in transgenic mice in a
position-independent manner (21). Despite these advances,
however, the mechanism and structural basis for LCR function are poorly
understood. We have established conditions that allow us to
reconstitute and analyze the formation of HS sites in the human
-globin LCR in vitro. We wished to analyze HS site formation in the
context of the entire LCR because previous results indicated that the
HS sites synergize to establish a fully functional LCR (9,
10). Therefore, these studies were initiated by subcloning the
LCR and mutant derivatives from yeast carrying
-globin YACs by
homologous recombination and gap repair. We then established conditions
to first attach the LCR constructs to magnetic beads via
biotin-streptavidin and to then assemble the DNA into chromatin using
Drosophila chromatin assembly extracts (4, 38).
Reconstitution of nuclease HS sites on chromatin-assembled LCR
templates in vitro.
The initial experiments showed that HS sites
can be reconstituted on chromatin-assembled, immobilized LCR templates
in vitro. Two significant observations are of note: (i) formation of
hypersensitivity is erythroid cell specific, and (ii) hypersensitivity
in the LCR is principally restricted to the 200- to 400-bp core
enhancer regions. These results show that two aspects of HS site
formation observed in vivo can be recapitulated in this in vitro
system. HS site formation in HS2 and HS3 was more efficient when the
LCR was preincubated with MEL protein extracts prior to chromatin assembly than in the postincubation experiments (compare Fig. 2C and
D). In addition, the formation of an apparently nucleosome-free HS2
core is also more efficient when the LCR is preincubated with erythroid
proteins prior to chromatin assembly (Fig. 3). An explanation for these
observations could be that once the chromatin structure is formed over
the LCR, erythroid proteins cannot efficiently displace nucleosomes. It
could be argued that MEL cells may not contain activities required for
the remodeling of chromatin structure over the LCR. However, nucleosome
remodeling complexes have been purified from MEL cells and shown to
change the chromatin structure at the
-globin promoter
(2). A plausible alternative explanation is that an active
LCR is established by the interaction of proteins or protein complexes
prior to the formation of chromatin structure. The binding of erythroid
proteins to the core enhancers could prevent the association of a
repressive chromatin structure over the LCR. Data published by Milot et
al. (32) provide evidence that mutations in the LCR may
affect the activity of the globin locus in transgenic mice in a way
such that the globin genes are expressed for only a short time after
cell division and become silenced because a mutant LCR may no longer be
able to resist heterochromatization. In vivo studies of transcription
factor binding and chromatin structure in the Drosophila
hsp70 gene promoter showed that although the transcription factors
were displaced from the promoter in mitotic chromatin, a characteristic
DNase I HS site was still detectable (29). These results
were interpreted to mean that a noncanonical chromatin conformation was
maintained at the hsp70 promoter during mitosis, which then
allows the reassembly of a functional promoter during interphase. All
these data are in agreement with a model proposed by Felsenfeld
(13), according to which regulatory elements, like the
human
-globin LCR, are structurally prepared by the binding of
proteins after replication, and these proteins may then confer
resistance to the formation of repressive chromatin.
Nuclease HS site formation in the absence of chromatin assembly. It is generally believed that DNase I HS sites represent regions in chromatin in which nucleosomes are displaced, excluded, or modified. It was therefore surprising to us that DNase I hypersensitivity in HS2 and HS3 is also formed in an erythroid cell-specific manner in the absence of chromatin assembly. This result indicates that it is not simply the rearrangement of nucleosomes that renders these sites highly sensitive to nucleases. Similar to what was observed on chromatin-assembled DNA, hypersensitivity in the unassembled LCR construct is erythroid cell specific and tightly limited to the core enhancer regions. It could be argued that hypersensitivity in the core regions simply reflects the binding of transcription factors, as in vitro footprinting studies often show that sites immediately flanking protected regions reveal higher sensitivity to DNase I. However, these data cannot be compared to the long-range HS site mapping experiments in our study. There are many transcription factor binding sites in the LCR, not just in the core enhancer elements (23), yet our results show that hypersensitivity to DNase I is mainly restricted to the HS2 and HS3 core regions.
We hypothesized that a specific combination of protein binding sites in the HS2 and HS3 core enhancers could recruit protein complexes that change the topology of the DNA and render these sites highly sensitive to nucleases. In support of this hypothesis, we found that the same regions in HS2 and HS3 that reveal sensitivity to DNase I are also sensitive to S1 nuclease, indicating that the DNA in the core regions is distorted or rendered single stranded by erythroid proteins. Importantly, we found that HS2 and HS3 are also sensitive to S1 nuclease in K562 cells in vivo (Fig. 5). In contrast to the in vitro analysis, in which both HS2 and HS3 are equally sensitive to S1, it appears that HS3 is more sensitive to S1 nuclease than HS2 in K562 cells. The difference could be due to higher-order chromatin structure or to developmental stage-specific differences, as the in vitro experiments were performed with an extract from erythroid cells revealing an adult specific pattern of globin gene expression (MEL cells). To further explore the extent of single stranded regions in vitro, we mapped KMnO4 reactivity in the HS2 and HS3 core enhancers. In HS2, an extended region of about 130 bp is rendered sensitive to KMnO4 after incubation with MEL cell extract. This region encompasses the tandem MARE sequences, two GATA sites, and an E-box motif. The KMnO4-sensitive region in HS3 is about 90 to 100 bp long and extends over a putative EKLF binding site as well as two GATA sites and an E-box motif. Several mechanisms could lead to the formation of S1 nuclease- and KMnO4-sensitive regions in the core enhancer. First, erythroid protein complexes could assemble at the core enhancer regions and distort or unwind the DNA. Second, nuclear protein complexes like 13S condensin have DNA-dependent ATPase activities and change the topology of the DNA, which in turn could create regions sensitive to S1 nuclease or KMnO4 (24). Finally, it is also possible that the assembly of protein complexes could recruit transcription complexes, which would lead to the generation of single-stranded regions in the core enhancer.Transcription of LCR core HS sites.
To test whether
transcripts could be detected in the HS2 and HS3 core enhancer regions,
we assayed for RNA synthesis by primer extension. In HS2, a specific
transcript that initiates just upstream of the two MARE sequences was
detected. Transcription is unidirectional, proceeding in a 5'-to-3'
direction (i.e., we were unable to detect transcripts initiating in HS2
and proceeding in a 3'-to-5' orientation). This result is in agreement
with previously published experiments showing that unidirectional
transcripts initiate in HS2 in erythroid cells (25).
Transcription in HS3 is initiated 5' to a putative EKLF binding site
and also proceeds in a 5'-to-3' direction. Furthermore, we show that
transcription of both HS2 and HS3 core enhancers is mediated by RNA
polymerase II. How the RNA polymerase II transcription complex is
recruited to the core enhancers is not known, but our in vitro system
is ideally suited to analyze the regulatory sequences that recruit
these transcription complexes. There is no obvious TATA sequence in the
vicinity of the putative transcription start site in HS2 or HS3.
However, it is worth noting that an E box is located about 50 bp
downstream of the initiation site in HS2 and about 30 bp downstream of
the transcription start site in HS3. E-box binding proteins have
previously been implicated in transcription complex assembly on a
variety of polymerase II-transcribed genes (37), including
genes specifically expressed in erythroid cells (8). In
this respect, it is interesting that an E box is also located in the
downstream promoter-initiator region of the adult
-globin gene
(K. M. Leach et al., unpublished data). The E-box motif in HS3 is
not phylogenetically conserved (20). Comparison of the
effects of HS3 deletions in the endogenous mouse locus and in the
transgenic human
-globin locus points to functional differences
between the human and mouse HS3 enhancers. Deletion of HS3 in the
murine locus leads to a mild reduction of adult
-globin gene (mouse
maj) expression (22), whereas deletion of
the human HS3 enhancer dramatically reduced expression of both the
embryonic
-globin and adult
-globin genes (9, 33).
Thus, the E box could confer a unique function to the human HS3 enhancer.
-amanitin (data not shown), indicating that transcription per se is
not required for HS site formation. At this point we cannot rule out
the possibility that the formation of a transcription complex, a step
not inhibited by
-amanitin, contributes to the generation of
single-stranded regions and HS site formation. However, it appears more
likely that the generation of DNase I- and S1-sensitive regions in HS2
and HS3 precedes transcription. In other words, protein complexes that
assemble on the individual HS sites may change the topology of the DNA
and may represent attachment sites for RNA polymerase II transcription
complexes. The in vivo analysis of LCR transcription in the human
-globin locus (3, 17) showed that LCR transcripts are
detectable in only a small fraction of erythroid cells, which supports
the notion that these transcripts may play only a transient role in
setting up an active locus. Data presented by Gribnau et al.
(17) suggest that transcription of LCR and intergenic
regions is important for chromatin opening because the appearance of
intergenic transcripts correlates with the localization of
nuclease-sensitive domains. Another possibility is that the LCR
contains multiple attachment sites for RNA polymerase II transcription
complexes and these are somehow delivered to the individual globin gene
promoters. Although the globin genes appear to have stronger promoters
(at least in vitro) and are expected to recruit transcription complexes
more efficiently than the core enhancer elements of the LCR, it is
possible that in vivo the promoter regions may not be as accessible as
the LCR. An indication for this may be that the LCR HS sites are much
more sensitive to nucleases than are the individual globin gene
promoters (14, 28, 41). In this respect, a distorted DNA
conformation in the LCR core enhancer elements could contribute to
higher accessibility and more efficient recruitment of polymerase II
transcription complexes in vivo. For example, it is possible that
changes in DNA topology over the core enhancer elements in vivo could
prevent the association of repressive chromatin, thereby allowing
transcription complexes to be recruited preferentially to the LCR core
regions. Current models view the LCR as a unit in which individual HS
sites interact via protein-protein interactions. Such a large
protein-DNA complex may be very efficient in recruiting transcription
complexes and chromatin remodeling factors, and these activities could
be delivered to individual globin gene promoters by a tracking or looping mechanism (7, 12).
Studies addressing the DNA structure in genes reactivated after mitosis
came to the conclusion that transcription initiation sites in genes
scheduled for reactivation display a distorted DNA conformation,
whereas start sites of those genes that remain repressed are
undistorted (31). It was argued that protein-dependent conformational changes in the DNA structure could mark genes for reexpression. Similarly, conformational changes in the
-globin LCR
could also mark regions that remain active after mitosis, before
chromatin structure is completely reassembled. This hypothesis is
consistent with our observation that HS site formation in chromatin is
more efficient when the LCR is preincubated with erythroid proteins. We
are currently in the process of analyzing the changes in DNA structure
and bound proteins in the LCR during the cell cycle.
In summary, our data indicate that the mechanisms leading to HS site
formation in the human
-globin LCR involve reorganization of
chromatin structure as well as the generation of S1- and
KMnO4-sensitive regions in the core enhancers. These
results further our understanding of the structural basis for LCR
function and also have implications for the structure and function of
other regulatory elements in the globin locus as well as other loci.
| |
ACKNOWLEDGMENTS |
|---|
We are grateful to Gail Green for expert technical assistance. We thank Mike Kilberg and Thomas Yang (University of Florida) for critically reading the manuscript.
This work was supported by grants from the NIH (HL24415 to J.D.E.; DK 52356 to J.B.) and from the Howard Hughes Medical Institute (Research Resources Program, University of Florida, to J.B.).
| |
FOOTNOTES |
|---|
* Corresponding author. Mailing address: University of Florida, Powell Gene Therapy Center, Department of Biochemistry and Molecular Biology, Gainesville, FL 32610. Phone: (352) 392-0121. Fax: (352) 392-2953. E-mail: jbungert{at}college.med.ufl.edu.
Present address: Department of Biochemistry, University of
Cambridge, Cambridge CB2 1GA, United Kingdom.
| |
REFERENCES |
|---|
|
|
|---|
| 1. |
Armstrong, J. A., and B. M. Emerson.
1996.
NF-E2 disrupts chromatin structure at -globin locus control region hypersensitive site 2 in vitro.
Mol. Cell. Biol.
16:5634-5644[Abstract].
|
| 2. | Armstrong, J. A., J. J. Bieker, and B. M. Emerson. 1998. A SWI/SNF-related chromatin remodeling complex, E-RC1, is required for tissue-specific transcriptional regulation by EKLF in vitro. Cell 95:93-104[CrossRef][Medline]. |
| 3. |
Ashe, H. L.,
J. Monks,
M. Wijgerde,
P. Fraser, and N. J. Proudfoot.
1997.
Intergenic transcription and transinduction of the human -globin locus.
Genes Dev.
11:2494-2509 |
| 4. |
Becker, P. B., and C. Wu.
1992.
Cell-free system for assembly of transcriptionally repressed chromatin from Drosophila embryos.
Mol. Cell. Biol.
12:2241-2249 |
| 5. |
Bender, M.,
M. Bulger,
J. Close, and M. Groudine.
2000.
Globin gene switching and DNase I sensitivity of the endogenous -globin locus in mice does not require the locus control region.
Mol. Cell
5:387-393[CrossRef][Medline].
|
| 6. |
Bulger, M.,
J. H. von Doorninck,
N. Saitoh,
A. Telling,
C. Farrell,
M. A. Bender,
G. Felsenfeld,
R. Axel, and M. Groudine.
1999.
Conservation of sequence and structure flanking the mouse and human beta-globin loci: the beta-globin genes are embedded within an array of odorant receptor genes.
Proc. Natl. Acad. Sci. USA
96:5129-5134 |
| 7. |
Bulger, M., and M. Groudine.
1999.
Looping versus linking: toward a model for long-distance gene activation.
Genes Dev.
13:2465-2477 |
| 8. | Bungert, J., I. Kober, F. Düring, and K. H. Seifart. 1992. Transcription factor eUSF is an essential component of isolated transcription complexes on the duck histone H5 gene and it mediates the interaction of TFIID with a TATA-deficient promoter. J. Mol. Biol. 223:885-898[CrossRef][Medline]. |
| 9. |
Bungert, J.,
U. Davé,
K.-C. Lim,
K. H. Lieuw,
J. A. Shavit,
Q. Liu, and J. D. Engel.
1995.
Synergistic regulation of human -globin gene switching by locus control region elements HS3 and HS4.
Genes Dev.
9:3083-3096 |
| 10. |
Bungert, J.,
K. Tanimoto,
S. Patel,
Q. Liu,
M. Fear, and J. D. Engel.
1999.
Hypersensitive site 2 specifies a unique function within the human -globin locus control region to stimulate globin gene transcription.
Mol. Cell. Biol.
19:3062-3072 |
| 11. |
Elnitski, L.,
W. Miller, and R. Hardison.
1997.
Conserved E-boxes function as part of the enhancer in hypersensitive site 2 of the beta-globin locus control region: role of basic helix-loop-helix proteins.
J. Biol. Chem.
272:369-378 |
| 12. | Engel, J. D., and K. Tanimoto. 2000. Looping, linking and chromatin activity: new insights into beta-globin locus regulation. Cell 100:499-502[CrossRef][Medline]. |
| 13. | Felsenfeld, G. 1992. Chromatin as an essential part of the transcriptional mechanism. Nature 355:219-224[CrossRef][Medline]. |
| 14. |
Forrester, W. C.,
S. Takegawa,
T. Papayannopoulos,
G. Stamatoyannopoulos, and M. Groudine.
1987.
Evidence for a locus activation region: the formation of developmentally stable hypersensitive sites in globin-expressing hybrids.
Nucleic Acids Res.
15:10159-10177 |
| 15. |
Gaensler, K. M. L.,
M. Burmeister,
B. H. Brownstein,
P. Taillon-Miller, and R. M. Myers.
1991.
Physical mapping of yeast artificial chromosomes containing sequences from the human -globin gene region.
Genomics
10:976-984[CrossRef][Medline].
|
| 16. |
Gong, Q. H.,
J. C. McDowell, and A. Dean.
1996.
Essential role of NF-E2 in remodeling of chromatin structure and transcriptional activation of the epsilon globin gene by 5' hypersensitive site 2 of the -globin locus control region.
Mol. Cell. Biol.
16:6055-6064[Abstract].
|
| 17. |
Gribnau, J,
K. Diderich,
S. Pruzina,
R. Calzolari, and P. Fraser.
2000.
Intergenic transcription and developmental remodeling of chromatin subdomains in the human -globin locus.
Mol. Cell
5:377-386[CrossRef][Medline].
|
| 18. |
Grosveld, F.,
G. B. van Assendelft,
D. R. Greaves, and G. Kollias.
1987.
Position-independent, high level expression of the human -globin gene in transgenic mice.
Cell
51:975-985[CrossRef][Medline].
|
| 19. | Grosveld, F. 1999. Activation by locus control regions? Curr. Opin. Genet. Dev. 9:152-157[CrossRef][Medline]. |
| 20. | Hardison, R., J. L. Slightom, D. L. Gumucio, M. Goodman, N. Stojanovic, and W. Miller. 1997. Locus control regions of mammalian beta-globin gene clusters: combining phylogenetic analyses and experimental results to gain functional insights. Gene 20:73-94. |
| 21. | Higgs, D. 1998. Do LCRs open chromatin domains? Cell 95:299-302[CrossRef][Medline]. |
| 22. |
Hug, B. A.,
R. L. Wesselschmidt,
S. Fiering,
M. A. Bender,
E. Epner,
M. Groudine, and T. J. Ley.
1996.
Analysis of mice containing a targeted deletion of -globin locus control region hypersensitive site 3.
Mol. Cell. Biol.
16:2906-2912[Abstract].
|
| 23. |
Jackson, J. D.,
W. Miller, and R. C. Hardison.
1996.
Sequences within and flanking hypersensitive sites 3 and 2 of the -globin locus control region required for synergistic versus additive interaction with the -globin gene promoter.
Nucleic Acids Res.
24:4327-4335 |
| 24. | Kimura, K., and T. Hirano. 1997. ATP-dependent positive supercoiling of DNA by 13S condensins: a biochemical implication for chromosome condensation. Cell 90:625-634[CrossRef][Medline]. |
| 25. | Kong, S., D. Bohl, C. Li, and D. Tuan. 1997. Transcription of the HS2 enhancer toward a cis-linked gene is independent of the orientation, position, and distance of the enhancer relative to the gene. Mol. Cell. Biol. 17:3955-3965[Abstract]. |
| 26. |
Krumm, A.,
L. B. Hickey, and M. Groudine.
1995.
Promoter-proximal pausing of RNA polymerase II defines a general rate-limiting step after transcription initiation.
Genes Dev.
9:559-572 |
| 27. |
Lee, J.-S.,
C.-H. Lee, and J. H. Chung.
1999.
The -globin promoter is important for recruitment of erythroid Krüppel-like factor to the locus control region in erythroid cells.
Proc. Natl. Ac |