ABSTRACT
The 2.4-kb H19 imprinting control region (H19ICR) is required to establish parent-of-origin-specific epigenetic marks and expression patterns at the Igf2/H19 locus. H19ICR activity is regulated by DNA methylation. The ICR is methylated in sperm but not in oocytes, and this paternal chromosome-specific methylation is maintained throughout development. We recently showed that the H19ICR can work as an ICR even when inserted into the normally nonimprinted alpha fetoprotein locus. Paternal but not maternal copies of the ICR become methylated in somatic tissue. However, the ectopic ICR remains unmethylated in sperm. To extend these findings and investigate the mechanisms that lead to methylation of the H19ICR in the male germ line, we characterized novel mouse knock-in lines. Our data confirm that the 2.4-kb element is an autonomously acting ICR whose function is not dependent on germ line methylation. Ectopic ICRs become methylated in the male germ line, but the timing of methylation is influenced by the insertion site and by additional genetic information. Our results support the idea that DNA methylation is not the primary genomic imprint and that the H19ICR insertion is sufficient to transmit parent-of-origin-dependent DNA methylation patterns independent of its methylation status in sperm.
Genomic imprinting is an unusual form of gene regulation in which expression is restricted to one allele based on its parental origin. Some genes, like H19 (2), are expressed exclusively from the maternally inherited chromosome, while others, like Insulin-like growth factor 2 (Igf2) (11), are expressed specifically when inherited paternally. To date, about 100 imprinted genes have been identified in the mouse, and the majority of them are conserved in other mammals, including humans (http://igc.otago.ac.nz/home.html ). Imprinted expression patterns must be maintained for normal mammalian development (39, 51). In humans, aberrations at imprinted loci are associated with several developmental disorders and with many human cancers (13, 41).
The mouse Igf2 and H19 genes were among the first genes identified as regulated by genomic imprinting. Transcriptional regulation of the two genes has been studied extensively, and the locus remains a key model system for elucidating molecular mechanisms underlying genomic imprinting. Moreover, the molecular basis of Igf2/H19 imprinting is of direct biomedical significance, as a loss of imprinting at this locus is associated with the developmental disorder Beckwith-Wiedemann syndrome (BWS) and with several human cancers (4, 7, 10, 47).
The overall structure of the Igf2/H19 locus is depicted in Fig. 1A. The two genes are about 80 kb apart and share downstream enhancer elements (25, 32). In addition to their distinct gene expression patterns (i.e., maternal chromosome-specific H19 and paternal chromosome-specific Igf2 expression), the parental chromosomes are distinguished by multiple epigenetic marks, including parent-of-origin-specific patterns of DNA methylation (1, 17, 58), histone modification (23, 27, 48, 52), nucleosome organization (28), and DNA looping (29, 42, 59). All of these parent-of-origin differences are dependent on a 2.4-kb DNA sequence, called the H19 imprinting control region (H19ICR), that lies between the two genes, 4.4 to 2.0 kb upstream of the H19 promoter. In mice, deletion of the H19ICR results in the loss of all parent-of-origin differences in the epigenome and in gene expression.
Schematic depictions of the wild-type Igf2/H19, Afp, and CD3 loci, the knock-in mouse models, and the BAC transgene described in this study. Transcriptional start sites are indicated by the horizontal arrows above the chromosomes. (A) Wild-type and mutant alleles at the Igf2/H19 locus. Open and closed circles downstream of the H19 gene indicate the shared endodermal and mesodermal enhancers, respectively. A CpG island (***), defined by 54% cytosine-guanine dinucleotides and an observed versus expected CpG ratio of 0.64 within 500 bp, covers the H19 promoter and was annotated by a computer search using the EMBL-EB1 CpG Plot computer program (http://www.ebi.ac.uk/Tools/emboss/cpgplot/ ). An approximately 450-bp region rich in guanine nucleotides (boxed G) is located between the H19ICR and the CpG island. The H19-BAC1 transgene (26) carries the H19 sequence from kb −7 to +140. The H19R chromosome (26) carries the H19ICR inserted as a 2.4-kb BglII fragment at the +10-kb EcoRI site. All numbering is relative to the H19 transcriptional start site. (B) Wild-type and H19ICR insertion alleles at the Afp locus. Enhancers (gray circles) are located upstream of Afp (49). H19ICR insertions are at the XbaI site at −0.9 kb relative to the Afp transcriptional start. Afp-A (45) carries the H19 sequence from kb−4.4 to −2, Afp-D (45) carries the H19 sequence from kb −9.7 to −0.8, and Afp-DCK (this study) carries the H19 sequence from kb −4.4 to +3.0. (C) Wild-type and H19ICR insertion alleles at the CD3 locus. CD3-CMG (this study) carries the H19 sequence from kb −4.4 to −2.0, inserted at the BpuAI site between CD3 gamma and CD3 delta.
The ICR, not coincidentally, is a differentially methylated region (DMR). It carries 67 CpGs that are each methylated in sperm but remain unmethylated in oocytes (58). Thus, the zygote carries one methylated (paternal) and one unmethylated (maternal) copy of the ICR. This parent-of-origin difference in ICR methylation is maintained throughout development in all cell types, except for the primordial germ cells (21, 22, 58). Moreover, numerous analyses of both mice and humans have confirmed that this paternal chromosome-specific ICR methylation is critical for maintaining the parent-of-origin-specific epigenomes and expression patterns at this locus (15, 46).
Parent-of-origin-dependent gene expression of Igf2 and H19 is regulated in cis by the H19ICR (26, 50, 55). On the maternal chromosome, the unmethylated ICR binds the transcription factor CTCF (3, 24, 27), and this binding organizes the region into loop domains that prevent Igf2 expression by blocking interactions between the Igf2 promoters and the downstream enhancers (29, 34, 42, 59). At the same time, these maternal chromosome-specific loop domains induce enhancer activation of the H19 promoter. On the paternal chromosome, methylation of the ICR prevents its recognition by the CTCF protein, thus inducing alternative DNA looping domains that drive Igf2 expression by permitting interaction of the Igf2 promoter with the shared enhancers. In addition, during early postimplantation development, DNA methylation and heterochromatin structures spread from the methylated ICR into the adjacent paternal H19 promoter, stably and permanently silencing its expression (1, 50).
Given its critical functional importance, there is considerable interest in identifying the cis- and trans-acting elements required for establishing ICR methylation in sperm. Recently, we reported that the 2.4-kb H19ICR is sufficient for parent-of-origin-dependent DNA methylation (45). When inserted upstream of the alpha fetoprotein (Afp) gene on mouse chromosome 5, the ICR becomes methylated on the paternal allele and remains unmethylated on the maternal allele in somatic tissue, thus mimicking the endogenous ICR. However, the timing of the acquisition of paternal methylation does not mimic that of the endogenous ICR. At its endogenous location on chromosome 7, methylation is acquired during spermatogenesis (58), but in Afp knock-in mice, ICR methylation occurs only after fertilization. These results imply that DNA methylation is not the primary imprint but is a consequence of some other epigenetic mark on the paternal chromosome.
In this report, we address the sequence requirements for acquisition of paternal allele-specific methylation of the ICR. We generated and characterized novel ICR insertion mutations to establish the following points. First, we confirm that the 2.4-kb element is an autonomously acting ICR/DMR. That is, the 2.4-kb element is sufficient to supply parent-of-origin identity to its insertion locus. Second, we demonstrate that the timing of methylation acquisition is locus dependent in sperm and requires additional sequence information from the Igf2/H19 locus for full methylation during spermatogenesis. Finally, we show that the methylation acquired at the ectopic H19ICR during spermatogenesis is maintained during the genomewide demethylation wave that occurs after fertilization.
MATERIALS AND METHODS
Generation of mutant mice.All animal research was done according to NIH and Public Health Service (PHS) policy and was approved by the Eunice Kennedy Shriver NICHD Animal Care and Use Committee. The generation of Afp-A, Afp-D, H19R (45), and H19-BAC1 transgenic (26) animals (Fig. 1A and B) has been described previously. To generate the Afp-DCK insertion, a 7.3-kb BglII-SalI fragment carrying the H19ICR and an additional downstream sequence including the entire H19 promoter and mRNA-encoding region was targeted to the XbaI site at −0.9 kb relative to the Afp transcriptional start site (Fig. 1B). The CD3-CMG insertion carries the H19ICR as a 2.4-kb BglII-BglII fragment that has been targeted to the BpuAI site between the CD3 gamma and CD3 delta genes (Fig. 1C). Animals with germ line transmission of the Afp-DCK or CD3-CMG insertion were mated to EIIa-cre transgenic mice (30) to excise the Neor cassette that was used for positive selection during embryonic stem cell mutagenesis.
Southern hybridization.Genomic DNA was isolated from tissues of adult mice by phenol-chloroform extraction, and 30 μg was digested with SacI. Half of the SacI-digested DNA was incubated further with the methylation-sensitive enzyme AciI. SacI and SacI-plus-AciI-digested DNAs were electrophoresed in a 0.8% agarose gel, blotted onto a nylon membrane, and probed with a 32P-labeled 1.6-kb KpnI-HindIII fragment internal to the H19ICR (Fig. 2A).
DNA methylation at the endogenous and ectopic H19ICRs in somatic tissues and in testis. DNAs isolated from kidney or testis were digested with SacI (−) or SacI plus AciI (+) and then analyzed by Southern blotting. The identity of the ICR insertion and its parental origin are indicated above the lanes. (A) The 3.8- and 6.7-kb SacI fragments carrying the H19ICR (thickened line) at the endogenous Igf2/H19 locus (top line) and at the CD3-CMG chromosome (bottom line) are depicted. The arrowhead above the top line indicates the polymorphic SacI site that distinguishes wild-type domesticus (Dom) and castaneus (Cas) H19 alleles, cleaving castaneus H19 alleles into 2.5-kb (Cas1) and 1.3-kb (Cas2) SacI fragments. AciI restriction sites within the SacI fragments are indicated by vertical lines. The 1.6-kb KpnI-HindIII probe used to identify the ICR is indicated. (B) By crossing CD3-CMG animals with DIS7CAS mice (19) carrying the endogenous H19ICR on a castaneus allele, we were able to use SacI digestion to distinguish between the three H19ICRs present in these progeny, i.e., the ICR inserted at CD3 and the maternal and paternal endogenous H19ICRs. The ICR insertion is always of the same parental origin as the endogenous domesticus (Dom) allele. (C) Methylation of the H19ICR insertion in Afp-DCK and H19-BAC1 mice was analyzed in animals homozygous for the H19Δ13 mutation, which deletes the entire H19 gene and 10 kb of upstream sequence, including the endogenous H19ICR. Thus, the insertion is the only H19ICR in these animals.
Embryo collection.Adult female mice were treated for superovulation (43) and mated to adult males homozygous for the CD3-CMG insertion. Female mice were sacrificed at embryonic day 1 (E1) or E3.5 to collect zygotes or morulae and blastocysts, respectively. To collect zygotes, matings were set up so that the endogenous maternal allele was DIS7CAS (19), harboring single nucleotide polymorphisms (SNPs) specific to the castaneus mouse strain, and the endogenous paternal allele was H19Δ13 (31). Morulae and blastocysts were homozygous for H19Δ13 on chromosome 7.
Bisulfite sequencing.DNA methylation in mature sperm, testis, and preimplantation mouse embryos was determined as previously described (21, 35). Briefly, mature sperm was incubated in 500 μl lysis buffer (10 mM Tris-HCl, pH 8, 10 mM EDTA, pH 8, 1% SDS [wt/vol]), 0.8 mg/ml proteinase K, 40 mM dithiothreitol (DTT), and 0.04 mg/ml glycogen at 55°C overnight. Individual pools of embryos were mixed with 100 μl lysis buffer (without DTT), 2 mg/ml proteinase K, and 0.2 mg/ml glycogen and incubated at 55°C for 5 h. Genomic DNA was recovered from sperm and embryos by phenol-chloroform extraction, using the Phase Lock Gel Heavy system (PLG 1.5 ml; Eppendorf North America), and dissolved in sterile water. Genomic testis DNA was isolated by traditional phenol-chloroform extraction. Genomic DNA equivalent to 100 ng of sperm and testis DNA and 2 to 17 blastocysts, 0.5 to 6 morulae, and 7 to 25 zygotes was embedded in low-melting-point agarose beads, and up to seven individual beads per 800 μl 2.5 M bisulfite-hydroquinone solution were incubated at 50°C for 4 h. Single beads were then subjected to nested PCR. Primers were newly designed or adopted from the work of Park et al. (45) and Tremblay et al. (57) and used to amplify five consecutive regions within the H19ICR. For region 1 (GenBank accession no. NT_039448; upper strand), the primers were Bis-CD3-CMG for19, 5′-TGA TTT TTT TGT TGA ATT TGG GGT AT-3′; Bis-CD3-CMG rev19, 5′-AAA ACT TAA CTC ATT CCC TA-3′ (400 bp); Bis-CD3-CMG for21, 5′-TTG GGG TAT TTA AAG TTT TGT T-3′; and Bis-CD3-CMG rev21, 5′-ATC CCA CAT ACT TTA TCA TA-3′ (304 bp). For region 2 (accession no. NT_039448; lower strand), the primers were BDMRTR3, 5′-CTA CCC AAA AAA TAT ATA TTA TAC CAC CCC-3′; BDMRTF7, 5′-ATA TGG TTT ATA AGA GGT TGG AA-3′ (507 bp); BDMRTR4, 5′-CCC TTA TAA ATC ATT AAA TAC TAT ACC TAA-3′; and BDMRTF8, 5′-TAT TTG TGT TTT TGG AGG GGG TT-3′ (450 bp). For region 3 (accession no. U19619; upper strand), the primers were BMsp4t1, 5′-GGA ATT TTA TAT TAA GTT TTG GGT GT-3′; BHha4t2, 5′-AAC CCC CTC CAA AAA CTC AAA T-3′ (455 bp); BMsp4t2, 5′-AGT TAA AAT TAA TTG AAG AGG-3′; and BHha4t3, 5′-ATT CCA ACC TCT TAT AAA CCA TAT-3′ (406 bp). For region 4 (accession no. U19619; upper strand), the primers were BHha2t1, 5′-ATA GTT ATG GGT TTT ATG AGG-3′; BMsp3t2, 5′-CCT CTT CAA TTA ATT TTA ACT-3′ (448 bp); BHha2t2, 5′-AGG GGT TTA TGT TAG TTT TTG ATA A-3′; and BMsp3t, 5′-ACA CCC AAA ACT TAA TAT AAA ATT CC-3′ (403 bp). For region 5 (accession no. U19619; upper strand), the primers were BMsp2t1, 5′-GAG TAT TTA GGA GGT ATA AGA ATT-3′; BHha1t3, 5′-ATC AAA AAC TAA CAT AAA CCC CT-3′ (471 bp); BMsp2t2, 5′-GTA AGG AGA TTA TGT TTA TTT TTG G-3′; and BHha1t4, 5′-CCT CAT TAA TCC CAT AAC TAT-3′ (423 bp). Nested primers amplifying control regions within the Snrpn (420 bp) and Rasgrf1 (284 bp) genes were adopted from the work of Lucifero et al. (36) and Li et al. (33), respectively; they read as follows: Snrpn outside forward, 5′-TAT GTA ATA TGA TAT AGT TTA GAA ATT AG-3′; Snrpn outside reverse, 5′-AAT AAA CCC AAA TCT AAA ATA TTT TAA TC-3′; Snrpn inside forward, 5′-AAT TTG TGT GAT GTT TGT AAT TAT TTG G-3′; Snrpn inside reverse, 5′-ATA AAA TAC ACT TTC ACT ACT AAA ATC C-3′; Rasgrf1 outside forward, 5′-TAA TTT TAG GTG TAG AAT ATG GGG TTG-3′; Rasgrf1 outside reverse, 5′-TAA AAA AAC AAA AAC AAC AAT AAC AAC TAA AAC AAA AAC AA-3′; Rasgrf1 inside forward, 5′-TAG AGA GTT TAT AAA GTT AG-3′; and Rasgrf1 inside reverse, 5′-ACT AAA ACA AAA ACA ACA-3′. PCR conditions for all primers were adopted from the work of Tremblay et al. (57). PCR products were cloned into the pCRII vector of the TOPO-TA cloning system (Invitrogen), and individual clones were sequenced.
RESULTS
Parent-of-origin-specific methylation of CD3-CMG H19ICR insertion on chromosome 9.At its endogenous location, the H19ICR is methylated on the paternally but not on the maternally inherited chromosome. This parent-of-origin difference arises via methylation of the ICR during spermatogenesis and is then maintained in all somatic tissues after fertilization (58). Likewise, an insertion of a second copy of the 2.4-kb H19ICR at the Igf2/H19 locus (H19R) (Fig. 1A) becomes normally methylated in both somatic tissue and testes (45). When the same 2.4-kb H19ICR element is inserted at the Afp locus (Afp-A) (Fig. 1B) on chromosome 5, it becomes methylated successfully upon paternal inheritance in somatic tissue, but this methylation is acquired only after fertilization. That is, when inserted on chromosome 5, all 65 of the assayed CpGs are unmethylated in sperm (45). In this study, we wanted to address the generality of these findings and also wanted to better understand the genetic basis for acquisition of DNA methylation at the ICR. We therefore established and characterized several novel mouse strains.
We first generated a knock-in mouse model (CD3-CMG) which carries the 2.4-kb H19ICR between the CD3 gamma and CD3 delta genes on mouse chromosome 9 (Fig. 1C). CD3 maps at 26 cM, in a nonimprinted region of the mouse genome. The only chromosome 9 genes known to be imprinted are Rasgrf1 (encoding RAS protein-specific guanine nucleotide-releasing factor 1) and A19, which map at 50 cM (12). Therefore, by choosing the CD3 locus, we were able to study the autonomous function of the H19ICR, excluding any likely interactions between the ICR and a neighboring imprinting regulatory element. The CD3 locus does not harbor any CpG islands, based on a computer search using the EMBL-EBI CpGPlot computer program (http://www.ebi.ac.uk/Tools/emboss/cpgplot/ ). The search was based on the definition of a CpG island by Takai and Jones (53). The closest CpG islands are located 60 kb upstream and 157 kb downstream of the inserted H19ICR. CD3 gamma contains 97 CpG dinucleotides and CD3 delta contains 61 CpG dinucleotides, scattered throughout their 10.8-kb and 4.5-kb gene bodies, respectively.
Using Southern hybridization, we studied parent-of-origin DNA methylation patterns of the CD3-CMG insertion in thymus, kidney, testis, and mature sperm. Five animals carried the CD3-CMG insertion on the maternal allele, and five animals carried the insertion on the paternal allele. Our data show that the H19ICR inserted into the CD3 locus is unmethylated on the maternal allele and methylated on the paternal allele in kidney DNA (Fig. 2B) and in the thymus (data not shown), just like the endogenous ICR. Thus, the 2.4-kb element is sufficient to effectively mark parental origin at this ectopic locus. Note that the DNA methylation patterns seen in Fig. 2 are dependent only upon the parent of origin and are entirely independent of the grandparental origin of the ICR insertion (data not shown).
At the endogenous ICR, the paternal allele-specific DNA methylation was already established during spermatogenesis, and the ICR was completely refractory to digestion with the methylation-sensitive enzyme AciI (see “Dom” and “Cas” bands in Fig. 2B). In contrast, the CD3-CMG insertion was digested by AciI, indicating that the ICR is relatively unmethylated in sperm (Fig. 2B). Thus, unlike the case for the endogenous ICR but like that for the ICR insertion at the Afp locus (45), methylation of the CD3-CMG insertion seen in somatic tissue was not simply maintenance of the methylation patterns created in sperm.
We did, however, note a 3.6-kb fragment in testis, but not in kidney (Fig. 2B), that is consistent with partial methylation of the CD3-CMG insertion in sperm. To test this hypothesis and to assess DNA methylation in all CpGs across the ICR, we performed bisulfite sequencing. Genomic DNA was isolated from the testes and mature sperm from four animals homozygous for the CD3-CMG insertion on chromosome 9 and the H19Δ13 deletion on chromosome 7. H19Δ13 is a 13-kb chromosomal deletion at the Igf2/H19 locus that removes the entire H19 gene and 10 kb of additional upstream sequences, including the entire H19ICR (31). Thus, the ectopic ICR insertion is the only copy of the H19ICR in these animals. Following bisulfite treatment, five regions within the H19ICR insertion were PCR amplified to characterize 59 of 67 CpG dinucleotides within the ICR (Fig. 3). Regions 1 and 2, normally proximal to the H19 promoter, showed only minimal methylation. However, promoter-distal regions 4 and 5 were heavily methylated on at least some chromosomes. This finding indicates that the 2.4-kb H19ICR becomes partially methylated in testis when inserted at the CD3 location and thus is somewhat different in its behavior from the Afp-A insertion reported by Park et al. (45), where 65 of the 67 CpGs analyzed were unmethylated in testes and sperm. Thus, the methylation of the H19ICR in sperm is affected by its neighboring sequences. At its endogenous location or when inserted just 12 kb downstream of its normal location (H19R) (Fig. 1A), all CpGs are methylated in sperm, and this methylation is maintained during embryogenesis. In contrast, when the H19ICR is inserted at normally nonimprinted sites, much (CD3 insertion) or all (Afp insertions) of the CpG methylation is acquired only after fertilization.
DNA methylation of individual CpG dinucleotides within the ectopic H19ICR at CD3 in sperm (Sp) and testis (T). Genomic DNAs from animals carrying only the ectopic H19ICR in their genomes were analyzed by bisulfite sequencing. Numbers 1 to 5 indicate regions within the ICR that were individually amplified by PCR. Region 1 shows the most proximal and region 5 shows the most distal region relative to the endogenous H19 promoter. Each lane of circles represents one individual clone. Lanes drawn for different regions do not originate from the same chromosome. Open circles represent unmethylated CpGs, and closed circles represent methylated CpGs. Arrows beneath individual CpGs indicate AciI restriction sites harboring the given CpG within their sequence. The Snrpn and Rasgrf1 DMRs were used as positive controls to test the accuracy of the bisulfite assay. In sperm, the Snrpn DMR (36) is unmethylated and the Rasgrf1 DMR (33) is methylated.
Additional H19 sequences can rescue H19ICR methylation in sperm.We next asked whether adding more sequence from the endogenous imprinted environment would rescue methylation of an exogenous H19ICR in testis. Using the Afp-D mouse model (Fig. 1B), we had already noted that an additional sequence upstream of the H19ICR did not help to methylate the ICR in testis (45). We therefore searched for a candidate sequence downstream of the endogenous H19ICR. The H19 promoter harbors a G box which is very highly conserved in mammals but whose function remains unclear, because targeted deletion of the element on chromosome 7 does not result in any loss-of-imprinting phenotype (56). The H19 promoter also overlaps a conserved CpG island (Fig. 1A). We considered that these elements might function to efficiently target DNA methyltransferases during spermatogenesis.
Therefore, we generated the Afp-DCK mouse by inserting the H19ICR and 5 kb of additional downstream sequence, including the H19 promoter's G box and CpG island and the entire H19 gene body upstream of Afp (Fig. 1B). Parent-of-origin-specific methylation of the ectopic ICR was analyzed in mice in the H19Δ13/H19Δ13 background. In somatic tissue, the Afp-DCK insertion, like the Afp-A, Afp-D, and CD3-CMG insertions, was unmethylated on the maternal allele and methylated on the paternal allele (Fig. 2C, top panel). In testis (Fig. 2C, bottom left panel) and sperm (data not shown), however, the maternally inherited Afp-DCK insertion remained completely unmethylated. Thus, the additional 5 kb of sequence downstream of the endogenous H19ICR did not rescue the H19ICR methylation in testis.
We did note, however, that the paternally inherited Afp-DCK insertion became heavily methylated in a small but visible portion of testicular (Fig. 2C, bottom right panel) and sperm (data not shown) cells.
Finally, we analyzed the ICR's methylation pattern in testes isolated from animals homozygous for a bacterial artificial chromosome (BAC) transgene (H19-BAC1) (26) (Fig. 1A) carrying the H19 sequence from kb −7 to +140. This fragment is, of course, too large to target to a specific locus and so was generated by random insertion of a single-copy BAC. We had previously shown that H19 expression from this transgene is maternal allele specific (26). The H19ICR within this transgene becomes specifically methylated on the paternal allele in somatic tissue (data not shown). In Fig. 2C, we show that this transgene's ICR was fully methylated in testis (Fig. 2C). Thus, when sufficient sequence downstream of the H19 gene body is included, ectopic H19ICR insertions behave like the endogenous element.
Is DNA methylation on ectopic H19ICR inserts maintained after fertilization?The mammalian genome becomes demethylated after fertilization, except for methylation imprints that have been established in the germ line (40). DMRs that are not ICRs, such as the DMRs upstream of and within the Igf2 gene, do not escape the demethylation activity, and their methylation patterns are removed from the parental alleles (16). To test whether the H19ICR inserted at CD3 maintains its methylation acquired during spermatogenesis after fertilization and therefore behaves like the endogenous ICR on chromosome 7, we studied its methylation patterns in zygotes, morulae, and blastocysts (Fig. 4). We assayed the 3′ end of the ICR (region 5 in Fig. 3), which includes 15 CpGs and is the most methylated region in sperm. We found that the methylation patterns in sperm, morulae, and blastocysts were not distinguishable. Thus, the CD3-CMG insertion does escape the genomewide demethylation wave that occurs after fertilization and does indeed behave like the endogenous ICR.
DNA methylation patterns of the exogenous H19ICR at CD3 in zygotes and in E3.5 morulae and blastocysts. The DNA region analyzed by bisulfite sequencing in this study covers 15 CpGs within the H19ICR, from kb −4.0 to −3.5, and includes two CTCF sites. This region corresponds to the promoter-distal region 5, whose methylation patterns in sperm and testis are shown in Fig. 3. Zygotes, morulae, and blastocysts had paternal alleles inherited from CD3-CMG fathers. Morulae and blastocysts were homozygous for the H19Δ13 allele. Thus, the ectopic ICR was the only H19ICR in these genomes. Zygotes were in an H19+ (cas)/H19Δ13 background. Clones associated with the ectopic H19ICR were identified using single nucleotide polymorphisms that distinguish castaneus and domesticus H19ICRs. Each circle represents one CpG, and each lane represents one clone. Unmethylated CpGs and methylated CpGs are depicted as open circles and filled circles, respectively.
DISCUSSION
We previously reported that insertions of the H19ICR at the Afp locus became methylated in somatic tissue (45). Surprisingly, these ectopic ICR insertions were unmethylated in testis, indicating that ICR function as an ICR/DMR might not depend upon its methylation in sperm. This was the first study that analyzed the 2.4-kb H19ICR as a knock-in insertion in a nonimprinted environment.
To generalize these findings, we first inserted the 2.4-kb ICR at the CD3 locus. Regardless of the insertion site, the ICR is highly effective at marking chromosomal origin in somatic tissue. That is, maternally inherited chromosomes never show CpG methylation of the ICR insertion, while paternally inherited ICRs are completely methylated. Thus, the 2.4-kb element is sufficient to imprint the normally nonimprinted CD3 (this study) and Afp (45) loci in somatic tissue. Likewise, Tanimoto et al. and Matsuzaki et al. used a 2.8-kb element carrying the H19ICR to effectively imprint a human β-globin yeast artificial chromosome transgene (37, 54). Our data are also consistent with earlier studies using nontargeted H19 transgenes. In these studies, much larger DNA sequences were randomly inserted into the mouse genome, but any transgene that did carry the 2.4-kb element used in this study was appropriately imprinted (6).
We also characterized the timing of acquisition of the paternal allele-specific methylation. At its endogenous location, the ICR is completely methylated during spermatogenesis, and this methylation is maintained throughout development. At the ectopic loci, the methylation pattern was not fully established until after fertilization. The Afp-A and Afp-D insertions (45), as well as the β-globin insertion (37, 54), were essentially unmethylated in sperm. At the CD3 locus, there was minimal methylation at the 3′ end of the ICR insertion but already heavy methylation at the 5′ end (Fig. 3). In all cases, methylation of the ICR proceeded even during early developmental stages. In sum, we concluded that the 2.4-kb element is sufficient to act entirely autonomously as an ICR and DMR but that these functions do not require that it be methylated during spermatogenesis.
Additional endogenous sequence can facilitate the recognition of the H19ICR as a target for DNA methylation in sperm at an exogenous location. In fact, we saw complete ICR methylation in sperm in two cases: the H19R insertion (Fig. 1A) and the H19-BAC1 transgene (Fig. 2C). In the first case, a second copy of the 2.4-kb ICR was inserted just downstream of the H19 gene, and thus adjacent to other potentially critical regulatory elements. The H19-BAC1 transgene carries the H19 sequence from kb −7 to +140. Together, these experiments show that sperm methylation of the ICR can be achieved if sufficient cis information is provided.
Moreover, data obtained from the analysis of the Afp-DCK insertion hint that much more limited sequence information than the 147 kb from the H19-BAC1 transgene is required to methylate the H19ICR in testis. Unlike all other exogenous insertions at Afp, at least some fraction of Afp-DCK chromosomes were already fully methylated in sperm cells, at least when paternally inherited. Thus, an exogenous H19ICR can become methylated during spermatogenesis when sufficient endogenous sequence has been transferred to the exogenous location.
Additional endogenous sequence increased the ability of the methylation machinery to target the ICR, and a few testicular and sperm cells became methylated. These cells paternally inherited the Afp-DCK insertion. Analogous to what Davis et al. (8, 9) observed by analyzing the developmental methylation patterns in maternal and paternal sperm chromosomes, the time frame of spermatogenesis from E14 to E17 might not be long enough for the exogenous H19ICR to become methylated. The endogenous H19ICR in sperm is fully methylated by E17 on the paternally inherited chromosome but takes until after birth to be methylated completely on the maternally inherited chromosome. The time difference in establishing methylation on the maternal and paternal sperm chromosomes was explained by the need of first removing maternal allele-specific properties from the maternally inherited H19ICR prior to establishing the paternal allele-specific methylation pattern. Because paternally inherited chromosomes already carry sperm-specific properties that do not have to be exchanged, the time frame of spermatogenesis is sufficient to remethylate the H19ICR on paternal sperm chromosomes but insufficient to do the same on maternal sperm chromosomes (8, 9). Analogously, the maternal Afp-DCK insertion may remain unmethylated in testis because the time period of spermatogenesis is not enough to identify the exogenous H19ICR as a target for DNA methylation, exchange maternal allele-specific marks with paternal marks, and remethylate the ICR.
After formation of the zygote, the CD3-CMG insertion stably maintains its methylation acquired during spermatogenesis (Fig. 4), despite the genomewide demethylation that occurs on both maternal and paternal chromosomes (38, 44). This finding indicates that although the ectopic ICR does not become fully methylated during spermatogenesis and differs from its endogenous counterpart in this respect, it does behave like the endogenous ICR after fertilization. That is, the germ line imprint, even if only partially established, escapes demethylation, like the endogenous ICR.
Several lines of evidence now support the idea that DNA methylation is not necessarily the primary genomic imprint. First, as demonstrated here and in previous studies (37, 45, 54), ectopic insertions of the 2.4-kb H19ICR are highly efficient DMR elements whose paternal allele-specific methylation can be established largely or even completely after fertilization. Second, as discussed above, Davis et al. (8, 9) demonstrated that in male gametogenesis, after erasure of the ICR methylation, paternal and maternal ICRs are still not functionally equivalent. Rather, paternal chromosomes are remethylated earlier than the maternal chromosomes. Their results are consistent with the presence of other chromatin marks that can distinguish parental origin even in the absence of DNA methylation differences. Third, El-Maarri et al. (14) have shown that methylation of the SNRPN-SNURF DMR in humans can occur postfertilization (note that Geuns et al. found that methylation is established in female gametes [18]). Finally, Ciccone et al. (5) recently demonstrated that maternal methylation imprints for several loci are dependent upon histone H3K4 demethylase. Altogether, these studies suggest that modified chromatin represents the primary imprint and that differential DNA methylation may be established to replace and/or stabilize this primary mark. The idea of DNA methylation serving as a chromatin stabilizer is supported by the timely coordinated erasure and reestablishment of histones and DNA methylation, as recently reported by Hajkova et al. (20).
In summary, we have shown that the 2.4-kb H19ICR is sufficient to become methylated at exogenous locations in early embryos and somatic tissue but needs additional endogenous sequence downstream of H19 to become methylated in testis. However, the H19ICR does not have to become methylated during spermatogenesis in order to transmit the paternal allele-specific methylation pattern to the offspring. Zygotes maintain and establish the paternal allele-specific methylation pattern of the H19ICR even if the ICR is partially (CD3-CMG) or completely (Afp-A and Afp-D) (data not shown) unmethylated in sperm. Thus, complete germ line methylation is not required for correct parent-of-origin-specific methylation in the offspring.
ACKNOWLEDGMENTS
Claudia Gebert was supported by the intramural program of the National Institutes of Health (NIH) and the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD) and by a Feodor-Lynen Fellowship from the German Alexander von Humboldt Foundation.
FOOTNOTES
- Received 1 May 2009.
- Returned for modification 29 June 2009.
- Accepted 16 December 2009.
- Accepted manuscript posted online 28 December 2009.
- Copyright © 2010 American Society for Microbiology