Previous Article | Next Article ![]()
Molecular and Cellular Biology, April 2004, p. 3497-3504, Vol. 24, No. 8
0270-7306/04/$08.00+0 DOI: 10.1128/MCB.24.8.3497-3504.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Development and Genetics, Evolution Biology Centre, Uppsala University, S-752 36 Uppsala, Sweden,1 Molecular Pathology Section, Laboratory of Immunopathology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892,2 Department of Biological Sciences, University of Essex, Wivenhoe Park, Colchester CQ4 3SQ, United Kingdom,3 Genetisches Institut der Justus-Liebig-Universität, D-35392 Giessen, Germany4
Received 4 September 2003/ Returned for modification 20 November 2003/ Accepted 14 January 2004
|
|
|---|
|
|
|---|
Despite similar functions, the mouse and human H19 ICRs display little sequence similarity with the exception of a CpG-rich element repeated four times in the mouse H19 ICR and seven times in the human H19 ICR (9). We and others have noted that the 11-zinc finger protein CTCF, which associates with all known vertebrate chromatin insulator cis elements (2, 19), interacted with these repeat elements in a methylation-sensitive manner (3, 9, 13). Importantly, CTCF interacts in vivo with only the maternal H19 ICR allele (13), which is consistent with an involvement with the in vivo insulation of the maternal Igf2 from downstream lineage-specific enhancers. To confirm an in vivo role of the CTCF target sites at the Igf2/H19 locus, we previously created a knock-in mouse with mutations in three of the four target sites. This mouse showed robust biallelic expression of Igf2 and substantial gain of methylation at the maternal H19 ICR region (22). Similar results have been reported by another group (24).
While these results underscore the dualistic properties of the CTCF target sites, i.e., they organize long-range insulators and propagate methylation-free domains, it is not known whether these effects depended on the additive or synergistic effects of all the CTCF target sites or to what degree these sites contribute individually to methylation protection and/or insulation. To gain insight into this issue, we created a knock-in mouse with a mutation only in the fourth target site. It was striking that despite the fact that CTCF interacted less efficiently with site 4 than with sites 1 and 3, there was a robust activation of the maternal Igf2 allele. The results reported here significantly add to our understanding of the complexity of the function of the maternally inherited H19 ICR and suggest that the essential features of the H19 ICR reflect cooperativity between the CTCF target sites.
|
|
|---|
![]() View larger version (28K): [in a new window] |
FIG. 1. Schematic illustration of the knock-in strategy. (A) Recombination map of the targeting construct and endogenous H19 ICR. CTCF target site 4 (cerise box) was targeted in the knock-in procedure. Wt, wild-type. (B) Deletion of the neomycin gene by breeding mice of strain 121 with mice harboring a ß-actin promoter-driven Cre recombinase gene to generate the 121* substrain. (C) The recombination event in clone 121 was assessed by using a probe that is positioned 5' of the sequence covered in the targeting construct. The properly recombined insert generated a larger fragment due to the extra sequence information provided by the neomycin gene. The mutated targeting allele replaced the endogenous sequence in clone 121 as confirmed by EcoRV (EV) digestion and Southern blot analysis. The mutated allele of the 121* strain is indistinguishable from the wild-type allele when digested with BamHI (B) and DraI (D). For additional information, see the text.
|
ChIP analysis. Fetal mouse liver cells (embryonic day 16 [E16]) were mechanically dispersed and cross-linked with formaldehyde. Chromatin immunopurification (ChIP) DNA was prepared from fetal mouse liver by using an affinity-purified rabbit antibody against the N-terminal portion of CTCF and analyzed for the presence of H19 ICR sequences as described previously (13).
Expression analyses. To determine the imprinted status of Igf2, homozygous 121* mice were mated with SD7 mice a congenic strain carrying the distal end of chromosome 7 of Mus musculus spretus on a Mus musculus domesticus background (6). RNA was isolated from individual organs of 1- to 2-day-old neonatal mice using TriPure DNA or RNA isolation reagent (Roche). RNA was treated with RQ1 DNase I (Promega) to remove any contaminating genomic DNA. Reverse transcription-PCR (RT-PCR) analyses were performed using the Qiagen one-step RT-PCR kit per the manufacturer's guidelines. Primers and PCR conditions have previously been published (6). The PCR product was purified and digested with BsaAI to determine the allelic origin of Igf2 transcripts as described earlier (6, 22).
For Northern blot hybridization, total RNA was isolated using the TriPure reagent (Roche), according to the manufacturer's instructions. Approximately 10 µg of total RNA was electrophoresed on a 1% agarose-formaldehyde gel and blotted onto a Hybond N+ membrane (Amersham Biosciences). Hybridization was done with a 32P-labeled Igf2 cDNA probe (covering exons 4 to 6 and recognizing all major transcripts). Washed blots were exposed to a PhosphorImager screen (Fuji FLA 3000) for 4 h for quantitation, and later, the blots were exposed to X-ray films for 2 days. The Igf2-hybridized blots were stripped and subsequently hybridized to a glyceraldehyde-3-phosphate dehydrogenase cDNA probe (Ambion).
EMSAs.
Individual DNA fragments containing sites 1, 3, and 4, corresponding to bp -4023 to -3761, -2752 to -2427, and -2193 to -1983, with respect to the H19 transcription start site, were generated by PCR amplification using primer sets. The primer sets follow: site 1 For (5'-TCC TGC TCA CTG TCC ATT CA-3') and site 1 Rev (5'-GGA TTC TTT GCA GAG AGT AAG-3'), site 3 For (5'-CTC AGT GGT CGA TAT GGT TT-3') and site 3 Rev (5'-TGA GTC AAG TTC TCT GGT TC-3'), and site 4 For (5'-ACT CTC CAC GCT GTG CAG ATT T-3') and site 4 Rev (5'-CCC ACC CTC CTG CTT CAC TTC AAA-3'). The DNA fragments containing sites 1, 3, and 4 were then subjected to restriction digestion with HpaII-DdeI, AvaII, and BglII, respectively. The gel-purified fragments were labeled with [
-32P]ATP and subsequently purified on a Sephadex G-50 column. Binding reaction mixtures (20 µl) containing binding buffer (0.1 mM EDTA, 2 mM MgCl2, 75 mM KCl, 40 mM HEPES, 10% glycerol), CTCF, and radiolabeled DNA were analyzed by electrophoretic mobility shift assays (EMSAs) as described previously (13). For competition experiments, unlabeled competitor site 4 DNA was added at different amounts (1-, 5-, 10-, 50-, 100-, 200-, and 500-fold). CTCF-DNA interaction was quantitated by scoring free and bound DNA probe by FLA 3000 PhosphorImager analysis. The normalized ratios of bound DNA versus unbound DNA were plotted against the concentration of the unlabeled competitor DNA.
Methylation analyses. For Southern blot analysis, genomic DNA was isolated from tissues of 2-day-old neonatal mice. Fifteen micrograms of DNA was digested with DraI, BamHI, and BstXI and with or without the methylation-sensitive restriction enzyme HhaI. BstXI has a unique site present in the Dom allele, but not in the SD7 allele, at -3537 bp with respect to the H19 transcription start site (W. Reik, personal communication). DNA was treated with bisulfite using a modified version (22) of an established protocol (20). DNA was purified using the Wizard DNA clean-up system (Promega) per the manufacturer's guidelines. DNA was subjected to PCR amplification using the published primers and conditions (23). Amplified fragments were cloned into the pGEMT-Easy vector (Promega) and subsequently sequenced with the BigDye 3.0 Terminator Cycle sequencing kit (Applied Biosystems).
In vivo and in vitro CTCF association analyses.
In order to elucidate the CTCF
CTCF association in living cells, the myc-tagged full-length chicken-CTCF cDNA (16) was transfected into COS 7 cells. The nuclear lysate of transfected cells was prepared and used for coimmunoprecipitation with the anti-c-myc-tagged monoclonal antibody (9E10) (Sigma Chemical Co.). Coimmunoprecipitation was performed by a modified version of the method of Chernukhin et al. (4). Specifically, a protein A-Sepharose (Sigma Chemical Co.) slurry (20 µl) and c-myc (9E10) antibody (final concentration, 5 µg/ml) were added to the lysate. The anti-
-tubulin antibody (Sigma) and the preimmune serum were used as controls. The suspension was incubated on a rotator at 4°C overnight, the complexes were then washed three times with washing buffer (25 mM Tris-HEPES [pH 8.0], 2 mM EDTA, 1 mM phenylmethylsulfonyl fluoride, 0.25 M NaCl, 0.5% Tween 20), resuspended in 50 µl of 3x sodium dodecyl sulfate (SDS) loading buffer and boiled for 5 min. Reduced complexes were separated on an SDS-10% polyacrylamide gel and then blotted onto a polyvinylidene difluoride nylon membrane, and the Western blot was probed with the CTCF rabbit polyclonal antibody.
For EMSAs designed to detect interactions between two CTCF-DNA complexes, DNA fragments containing site 3 (corresponding to -2691 to -2399 bp; forward primer, 5'-GGTCACTGAACCCCAAAAACCAGC-3'; reverse primer, 5'-CCAGCCACTGACGATCTCGGGCTG-3') and site 4 (corresponding to bp -2285 to -2095; forward primer, 5'-CAATGTTCATAAGGGTCATGGGTG-3'; reverse primer, 5'-CCCAACCTATGCCGCGTCTGCCGAG-3') were PCR amplified with these primers labeled at the 5' ends with [
-32P]ATP and T4 kinase, gel purified, and incubated with in vitro-translated (IVT) (TNT lysate) CTCF (8) either singly or in combination by mixing two preassembled CTCF-DNA complexes in the modified EMSA buffer. The CTCF-DNA complexes were empirically adjusted to maximize CTCF binding and to reduce nonspecific background levels resulting from the fortuitous binding of proteins present in nuclear extracts and even in the TNT reticulocyte lysates to the DNA probes. The DNA probes were more than 80 to 100 bp long and were previously shown to be near a minimal DNA length required for the efficient sequence-specific CTCF-DNA interactions (18). Free and CTCF-bound probes were resolved on 5% acrylamide gels containing 1% glycerol as described previously (13). IVT CTCF was used in a single EMSA binding reaction performed by mixing it with 10 µl of the 2x modified EMSA buffer containing 100 mM HEPES (pH 7.8), 500 mM NaCl, 5 mM MgCl2, 10 mM dithiothreitol, 1 mM ZnSO4, 100 µg of poly(dI-dC) per ml, 0.02% NP-40, and 20% polyethylene glycol 10,000.
To study the interactions between two preassembled (for 30 min) complexes of CTCF and 32P-labeled DNA, one half (10 µl) of the binding reaction mixture with one hot probe was mixed together with the same portion from the binding reaction mixture with the other hot probe, and the resulting 20-µl mixtures with two DNA probes were incubated for an additional 30 min at room temperature. To obtain controls for the duration of each binding reaction mixture containing two DNA probes versus one DNA probe, the second 10-µl portion from each 30-min reaction mixture with a single hot probe was mixed with 10 µl from a 30-min binding reaction mixture containing IVT luciferase in place of the IVT CTCF and incubated for an additional 30 min before loading on the same EMSA gels. Fragments that were resolved were later eluted from the EMSA gel, treated with proteinase K, and reanalyzed on acrylamide gels as shown in Fig. 5C.
![]() View larger version (28K): [in a new window] |
FIG. 5. CTCF interacts with itself in vivo and in vitro, while only heterologous CTCF-DNA complexes interact in vitro. (A) DNA fragments containing CTCF target sites 3 and 4 were labeled with 32P and subjected to CTCF band shift analysis either separately or in combination. Shifted band fragments (bands 1, 2, and 3) were eluted from the gel (left gel), treated with proteinase K, and resolved on a 5% acrylamide gel (right gel). F, free; B, bound. (B) The C terminus of CTCF binds to the 11 ZF domain of CTCF. The latter was initially noticed in the preliminary pull-down assay with various portions of CTCF (data not shown). Next, the IVT radiolabeled full-length CTCF, the 11 ZF domain, and the CTCF-C domain were mixed together at a ratio of approximately 1:1:1 as shown in the Input lane and subjected to a GST pull-down assay with the three bacterially expressed GST fusion proteins, GST-ABL, GST-CTCF-C, and GST-CTCF-N, immobilized on glutathione beads. (C) CTCF interacts with itself in vivo. COS 7 cells were transfected with myc-tagged CTCF, and the myc-tagged (9E10) antibody was used for coimmunoprecipitation. Immunoprecipitates (Immunoprec.) and control lysates were resolved on SDS-10% polyacrylamide gels and probed with the anti-CTCF antibody. The positions of the endogenous CTCF (16) and myc-tagged exogenous CTCF (15) are indicated to the left of the blot. No signal was observed in the controls obtained with the preimmune serum and anti- -tubulin antibody.
|
0.8-kb NdeI-XhoI fragment of the human CTCF cDNA (8) from a pCITE4b-based plasmid into the EcoRI and XhoI ends of the pGEX5X-1 vector (Pharmacia) after the NdeI and EcoRI ends had been filled in with the Klenow fragment. The resulting bacterial expression vector encoded the GST-CTCF-C fusion protein that contained the C-terminal part of human CTCF (CTCF-C) beginning from the middle of the 11th Zn finger and extending to the stop codon. To express the GST-CTCF-C protein in bacteria and to purify it when it was left immobilized on the beads, we used the GST purification kit (Pharmacia) and other methods described in detail earlier by Klenova et al. (15). The purity and concentration of the isolated GST-CTCF-C protein on glutathione beads were both verified by SDS-polyacrylamide gel electrophoresis, and the concentration was adjusted to 1 mg per ml of beads during the final washings and final dilution of the beads in phosphate-buffered saline. The [35S]methionine-labeled proteins, including the full-length polypeptide, CTCF-N (the N-terminal region starting from the first Met and extending downstream to the first Zn finger), the 11 ZF domain (domain with all 11 zinc fingers), and CTCF-C (the C-terminal domain of CTCF), have been produced from the pCITE4b- or pET16b-based constructs described earlier (8, 15) and used as the DNA templates in the Promega coupled transcription-translation TNT system for 35S-labeled proteins. A construct for GST-Abl SH3 fusion protein used as an unrelated control for GST-CTCF-C was obtained from the laboratory of P. Soriano (Fred Hutchison Cancer Research Center, Seattle, Wash.). Binding buffer A (BB-A) for the pull-down assays contained 20 mM HEPES (pH 7.0), 100 mM KCl, 2 mM MgCl2, 1 mM EDTA, 10% glycerol, 1 mM dithiothreitol, 0.1% NP-40, and 1% calf serum. For the binding reaction, 5 to 10 mCi of each 35S-labeled protein was mixed in 300 µl of BB-A with the GST beads taken at volumes giving equal amounts of each immobilized protein as measured by 35S counts of the corresponding SDS-polyacrylamide gel band. Reaction mixtures were sealed, incubated in a cold room for 2 h in a rotating tube platform, and loaded into the upper chambers of the catcher tubes (CytoSignal), which allowed us to wash the beads (with BB-A containing higher salt concentrations [100 to 300 mM KCl] and no MgCl2) by repeatedly collecting the buffer pushed from the upper chamber to the lower chamber by a low-speed centrifugation without losing any material. Radiolabeled proteins from the beads that had been washed six times were eluted by 50 µl of the SDS-polyacrylamide gel loading buffer, run together with the rainbow markers on the 12.5% polyacrylamide Bio-Rad minigels, and visualized on X-ray films by fluorographic enhancement of the 35S signal.
|
|
|---|
A single CTCF target site mutation leads to loss of Igf2 imprinting. RT-PCR analysis was performed to assess the imprinting status of Igf2. A BsaAI polymorphic site between M. m. domesticus and SD7 mice allowed identification of the allelic origin (6). Figure 2A shows that biallelic expression of Igf2 could be documented in a wide range of tissues after maternal transmission of the mutation, while the imprinted status of Igf2 was maintained after paternal transmission of the mutant H19 ICR allele. In all instances, we could not document any changes in the imprinted state of the H19 gene (data not shown).
![]() View larger version (49K): [in a new window] |
FIG. 2. Expression analyses of Igf2. (A) RT-PCR analyses of Igf2 expression pattern in various organs dissected from neonatal offspring after reciprocal crosses between 121 and SD7 mice. A polymorphic BsaAI site in exon 4 of Igf2, present only in SD7 mice, was used to discriminate between the alleles. The animal crosses are presented in the order of female to male. The -RT lane contained RNA and primers but no reverse transcriptase for a control. (B) Northern blot analysis of Igf2 expression in neonatal mice harboring the mutated allele on the maternal chromosome. The blot was stripped and reprobed with the glyceraldehyde-3-phosphate dehydrogenase (GAP) probe to confirm RNA loading. (C) Graph showing Igf2 expression after the levels were normalized to GAP expression. The level of wild-type Igf2 expression was set at 100%. Contr., contributed; Mat. inher., maternally inherited.
|
Differential binding efficiencies between the CTCF target sites. Given the importance of site 4 in in vivo chromatin insulation, we wanted to examine the possibility that this site displayed a particularly high affinity towards CTCF. To this end, we labeled fragments with sites 1, 3, and 4 with the CTCF target site positioned in the middle and examined the ability of a cold fragment encompassing site 4 to compete out CTCF binding in band shift assays. Surprisingly, this approach revealed that site 4 displayed a markedly weaker binding efficiency to CTCF than sites 1 and 3 (Fig. 3). We conclude that the loss of Igf2 imprinting with a mutated site 4 inherited maternally cannot be explained by binding properties alone at individual CTCF target sites.
![]() View larger version (53K): [in a new window] |
FIG. 3. Binding efficiencies of the H19 ICR CTCF target sites. (A) CTCF target sites 1, 2, and 4 were labeled with 32P and competed with cold site 4 in band shift analyses. (B) The graph depicts the mean values for two independent experiments describing the degree of competition on individual sites in relation to fold increases in the amount of the cold competitor.
|
![]() View larger version (57K): [in a new window] |
FIG. 4. CpG methylation analyses. (A) Southern blot analysis of genomic DNA isolated from various neonatal organs of reciprocal crosses between 121 and SD7 mice. The various tissues were digested with HhaI (+) or not digested with HhaI (-). The positions of molecular size markers (in kilobases) are indicated to the right of the blots. The parental alleles were discriminated by exploiting a polymorphic BstXI site, which is shown in the map below the blots. (B) Bisulfite sequencing analyses of the maternally transmitted mutant H19 ICR allele covering all four CTCF target sites. Region A contained sites 1 and 2, while region B contained sites 3 and 4. The wild-type and mutant alleles were discriminated by known single-base polymorphism in region A and by the introduced point mutation in region B. Filled circles represent methylated CpGs, and open circles represent unmethylated CpGs. (C) Graphical representation of the percentage of individual CpG methylation in H19 ICR.
|
In contrast to the more or less dramatic differences in methylation states at sites 3 and 4 of region B, the CTCF target sites (sites 1 and 2) of region A were moderately affected by mutation of site 4 (Fig. 4B and C). Although we note that the methylation pattern of sites 1 and 2 appears to be reciprocal between liver and heart tissue (Fig. 4B and C), the data are not extensive enough to allow a firm conclusion on these lines.
CTCF interactions in vitro and in cultured cells. To explore the possibility of physical interactions between CTCF-DNA complexes, we tested fragments containing sites 3 and 4 independently and in combination in CTCF band shift assays. Figure 5A shows that neither of the sites by themselves revealed any interaction between CTCF-DNA complexes, since the CTCF-DNA interaction was represented by a single band in each instance. Importantly, when CTCF-DNA complexes representative for each of these sites were mixed, an additional and prominent low-mobility band (band 3 in Fig. 5A) could be observed. After elution and purification, it was evident that both sites 3 and 4 were included in this band. We conclude that CTCF-DNA complexes within the H19 ICR interact in a heterologous manner, thereby supporting our proposal that CTCF target sites interact within the H19 ICR to organize chromatin insulator and methylation protection features.
The mechanism underlying the heterologous, but not homologous, association of the CTCF-DNA complexes might reflect conformational changes in CTCF that varies between sites 3 and 4. This supposition draws support from our previous observation that zinc finger utilization differs between sites 3 and 4 (13). Hence, the CTCF-DNA complexes at site 3 and 4 might display a different repertoire of zinc fingers not engaged in DNA binding but available for protein-protein interactions. Support for this interpretation is provided in Fig. 5B, which shows an analysis of the ability of various GST baits to interact with CTCF and its derivatives. Of these three baits, only the beads with immobilized CTCF-C domain showed efficient specific binding to the 11 ZF domain alone or in the context of the full-length CTCF, retaining nearly all of the input of 35S-labeled full-length CTCF and 11 ZF domain proteins during the extensive washing procedure, which completely removed each of the 35S-labeled proteins from the beads with GST-CTCF-N or GST-ABL protein. Since the C-terminal CTCF domain (in GST-CTCF-C) binds to the individual 11 ZF domain and to the full-length CTCF, it is likely that the C terminus in the context of the full-length CTCF is not shielding intramolecularly the same regions of the 11 ZF domain (or any sets of individual zinc fingers) that are required for binding to the C terminus intermolecularly.
To test whether CTCF can interact with itself in living cells, the pHHc-myc plasmid (15) was transfected into COS 7 cells. The cell lysates produced were then used for the coimmunoprecipitation experiment with anti-myc-tagged antibody (9E10). As shown in Fig. 5C, two bands appear after immunoprecipitation with the anti-myc-tagged CTCF antibodies. The
175-kDa upper band represents the myc-tagged CTCF (15), and the second lower band (
130 kDa [16]) represents the endogenous CTCF from the COS 7 cell lysate. No signal was observed in the immunoprecipitates obtained with the preimmune serum and anti-
-tubulin antibodies (Fig. 5C). These analyses document that two CTCF molecules can interact in living cells.
|
|
|---|
Our proposal of a cooperativity of CTCF target sites in chromatin insulation is at odds with a previous report that the imprinted state of Igf2 was maintained despite deletion of sites 3 and 4 (7). To explain this discrepancy, there are several, not mutually exclusive possibilities that come to our minds. For example, genetic deletions might perturb local chromatin conformation to uncover cis-acting elements normally made unavailable by nucleosomes. Such cis elements could potentiate or antagonize the desired effects and yet go unnoticed. Indeed, the entire wild-type H19 ICR allele is covered by positioned nucleosomes, and upsetting their phasing by insertional mutagenesis abolished the insulator function and CTCF binding to its wild-type target site (14). Alternatively, the polarity of the H19 ICR insulator function (12) might be controlled by cis elements other than CTCF target sites. If these are deleted or mutated without intent, such as in large deletion knockout strategies, the polarity of the insulator might switch with ensuing loss of Igf2 imprinting.
Our results indicate strongly that the CTCF target sites within the H19 ICR cooperate to manifest both methylation protection and insulator functions. Since we have also shown that the in vitro binding strength of CTCF to several hundred novel CTCF target sites from mouse fetal liver was directly proportional to their insulating properties (R. Mukhopadhyay et al., unpublished data), we conclude that strong insulation properties might be restricted to regions in the genome displaying in essence a constitutive binding of CTCF. This might be achieved by either multiple CTCF target sites with weaker affinity or by single cis elements, which interact with CTCF in an essentially irreversible manner. However, these considerations fail to explain the strong insulation of sites 1 and 3 in transient-transfection approaches (9, 13) but absence of insulation in the endogenous position. Therefore, we propose that the CTCF target site functions at the H19 ICR are context dependent and involve an aspect of higher-order chromatin conformation that is not recapitulated at ectopic sites.
This work was supported in part by the Swedish Science Research Council (VR), the Swedish Cancer Research Foundation (CF), the Swedish Pediatric Cancer Foundation, and the Wallenberg and Lundberg Foundations, as well as Stiftelsen Wenner-Grenska Samfundet, and the intramural research funding from the National Institute of Allergy and Infectious Diseases of the National Institutes of Health (to V.L.) and the Association for International Cancer Research (IC, EK).
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»