Previous Article | Next Article ![]()
Molecular and Cellular Biology, February 2006, p. 1051-1062, Vol. 26, No. 3
0270-7306/06/$08.00+0 doi:10.1128/MCB.26.3.1051-1062.2006
Copyright © 2006, American Society for Microbiology. All Rights Reserved.
Sunita Ramanathan,
,
Yukiko Okuno,
,
Chiharu Kumagai,
Seemab S. Shaikh, and
David M. Gilbert*
Department of Biochemistry and Molecular Biology, SUNY Upstate Medical University, 750 East Adams St., Syracuse, New York 13210
Received 7 September 2005/ Returned for modification 5 November 2005/ Accepted 15 November 2005
|
|
|---|
|
|
|---|
A large (50-kb) region downstream of the dihydrofolate reductase (DHFR) gene in Chinese hamster ovary (CHO) cells is the most extensively studied initiation zone, yet the precise distribution of initiation sites within this region has still not been resolved. High-resolution mapping of the locations of small nascent DNA strands within a 12-kb region detected two very specific initiation sites (35), while two-dimensional gel electrophoresis (2D-gel) analysis of replication intermediates (13, 45) identified DNA structures representing replication bubbles within over 20 different fragments throughout the initiation zone. One possibility is that there are multiple discrete sites of initiation that would be revealed by a more comprehensive analysis of small nascent strands or by more recently developed methods of visualizing origins on DNA fibers (5, 50). Regardless of the precise pattern of initiation sites within the zone, the majority of initiation activity appears to be confined to this zone, as no comparably significant initiation has been detected elsewhere throughout approximately 270 kb. Since the initiation pattern at the DHFR locus is similar to several other large intergenic regions in metazoa (11, 20, 28, 39, 50), it is important to understand the mechanism that confines initiation sites to this region.
The act of transcription itself appears to play a role in the specification of origins at the CHO DHFR locus. Of the sites that have been interrogated at this locus, significant initiation activity has been confined to the intergenic region (although some very weak activity has been detected within a downstream, weakly transcribed gene) (12). Deletion of either the promoter (62) or the transcription terminator (31) of the DHFR gene results in a reduction in initiation activity throughout the locus. In the case of a promoter deletion, this is accompanied by the appearance of weak initiation activity within the body of the DHFR gene, resulting in a broadening of the initiation zone to include the silent transcription unit. In the case of the terminator deletion, transcription continues into the intergenic locus, and initiation throughout this extended transcription unit becomes suppressed (44). When integrated at ectopic sites, replication can initiate within either DHFR gene or nongene sequences, unless an active promoter is placed upstream (38). These observations provide strong genetic evidence for the suppression of initiation within active transcription units. Interestingly, almost all eukaryotic replication origins have been localized outside of active transcription units (22, 23, 43).
By incubating nuclei from CHO cells in Xenopus egg extracts, we have established a cell-free system that recapitulates the physiological initiation pattern of DNA replication at the CHO DHFR locus (24, 73). However, physiological origin specification requires nuclei prepared late in G1 phase. With early-G1-phase nuclei, replication appears to initiate at sites distributed throughout the entire locus (70). Hence, at a distinct point during G1 phase, CHO nuclei experience a transition (the origin decision point [ODP]) that selects which of many potential chromosomal sites will function as an origin of replication in the upcoming S phase. Intriguingly, several transcription inhibitors were shown to prevent the selection of origins at the ODP (33). This result is consistent with the hypothesis that initiation of replication is prevented from taking place within active transcription units, and it suggested that the ODP could represent the resumption of transcription within this region after global transcriptional shutdown during mitosis (15, 22, 23). However, the DHFR gene is transcribed from an E2F-regulated promoter that is presumed to be repressed until after the restriction point (9, 18, 49, 68), whereas the ODP takes place prior to the restriction point and is independent of mitogen stimulation (33, 71). Moreover, our previous results interrogated only 17 specific sites throughout the 120 kb surrounding the initiation zone. Hence, we did not have the resolution to conclude that initiation of replication was excluded from transcription units. Surprisingly, despite the amount of effort that has been invested in the CHO DHFR locus, very little of this region has previously been sequenced, and unique probes have only been available for a few scattered regions.
We have now sequenced 121 kb surrounding the DHFR initiation zone and have constructed DNA arrays containing either plasmid or PCR products that, after the removal of repetitive sequences, cover 78.9% of the sequence. We detected initiation activity throughout the intergenic region, with little or no activity within any of the three previously defined transcription units. When pre-ODP nuclei were incubated in Xenopus egg extracts, replication initiated without preference for any site, genic or intergenic; however, with post-ODP nuclei, initiation resembled the pattern seen in vivo. Nuclear run-on analysis with nuclei isolated at different times after mitosis demonstrated that transcription is confined to these previously described genes and that the same concentrations of alpha-amanitin that inhibited transcription of these genes also allowed initiation to take place within transcription units. Surprisingly, transcriptional activity was detected throughout G1 phase with little change at the ODP. These results demonstrate that replication can initiate at a remarkably flexible number of sites within mammalian chromatin, including within sites of ongoing transcription in pre-ODP nuclei. However, at the ODP, initiation of replication is suppressed within transcribed regions. Hence, transcription is necessary but not sufficient for this suppression.
|
|
|---|
Array assembly. (i) Design of the PCR probes. We essentially followed the approach of Rinn et al. (61). RepeatMasker (http://www.repeatmasker.org/) was used to identify repetitive sequences. A total of 78,810 bp of the 121,368-bp sequenced region were identified as nonrepetitive (i.e., 35% repetitive and 65% nonrepetitive). Initially, 90 PCR primer sets that would amplify fragments of the size of 300 to 700 bp were designed using MacVector (Accelrys, Inc.). The primers were selected such that their annealing temperatures were between 55°C and 65°C and they had an average G+C content of 45%. An additional 60 primers were designed manually, by visual inspection of sequences not chosen by MacVector. The sequences of these PCR primers sets will be provided on request.
(ii) PCRs. PCRs were performed with 25 ng of CHOC 400 genomic DNA and 0.4 µM each forward and reverse primer. Control reactions were performed with forward or reverse primers alone during the first round of PCR, and only primer sets that were negative under these conditions were used. The reaction mixture contained 0.2 mM each deoxynucleoside triphosphate, 1.5 mM MgCl2, 1x buffer B (Fisher Scientific), and 1 U of Taq in a 25-µl reaction mixture. PCR product sizes were verified on a 2% agarose gel, and then products were gel purified in a 1.5% agarose gel with a Gel CleanUp kit (Eppendorf). These PCR products were then used as a template (5 ng) for a second round of PCR using the same primer sets to generate probes for the array. The second round of PCR amplification enabled the amplification of large quantities of PCR product free of contamination from genomic DNA and nonspecific amplification products. These PCR products were then diluted to 10 ng/µl in 10:0.1 TE (10 mM Tris-HCl [pH 8.0], 0.1 mM EDTA) and were printed onto Hybond-N+ membranes (Amersham).
(iii) Removal of primer-dimers. To remove primer-dimers that were formed in certain PCRs, PCR products were selectively precipitated from the reaction mixture by the addition of a 0.1 volume of 3 M sodium acetate (pH 5.2) and a 0.7 volume of isopropanol (P. Bertone, personal communication). Tubes were thoroughly mixed and centrifuged at 14,000 rpm at 4°C for 30 min. The pellet was washed twice with 70% ethanol, air dried, and resuspended in 10:0.1 TE.
(iv) Printing of arrays. PCR products were diluted to 10 ng/µl in 10:0.1 TE in a total volume of 100 µl. A total of 20 µl of bromophenol blue in 10:0.1 TE (1 mg/ml) was added to these tubes, after which the DNA was denatured at 96°C for 10 min and quick chilled on ice. The DNA was spotted on Hybond N+ membranes using a DNA replicator (VP Scientific), per the manufacturer's instructions. Probes were cross-linked to the membranes with a UV cross-linker (Stratalinker; Stratagene), after which the arrays were stored in sealed plastic bags for further use. A total of 10 ng (each) of a segment of lambda DNA was spotted at various locations on the array as a negative hybridization control. In addition, 10 ng (each) of the 17 probes used for origin mapping in prior experiments was also included in each array (37); results with these probes were reproducibly consistent with our previously reported results.
(v) Identification of probes that contain repetitive sequences. Although the PCR probes were carefully designed to avoid the repetitive sequences that were identified by RepeatMasker, the probes were further scanned empirically for repetitive sequences, as follows. Arrays were hybridized to labeled genomic DNA from CHOC 400 cells, and the intensities were represented as counts per minute per base pair. These intensities were normalized to the weakest hybridizing probe. Probes whose normalized intensities (in counts per minute per base pair) were 20 times greater than the intensities of the weakest hybridizing probe (61) were categorized as containing repetitive sequences and not used for further analysis.
(vi) Plasmid array. A total of 100 ng of each plasmid obtained from shot gun sequencing was spotted on a Hybond N+ membrane as described above. Plasmids containing highly repetitive sequences were identified by being screened with labeled CHOC 400 genomic DNA, as described above.
ELFH assay. CHOC 400 cells were cultured and synchronized in G0/G1 by isoleucine starvation (71), after which the cells were blocked at the G1/S boundary by the addition of complete medium containing 10-µg/ml aphidicolin or 400 µM mimosine. After 14 h, the cells were released to S phase by replacing the medium with complete warm growth medium for 5 min (aphidicolin) or 20 min (mimosine), and cells were collected by trypsinization. Nuclei were prepared by digitonin permeabilization (73). Early replication intermediates were labeled as previously described (24, 73), except that 5 million nuclei in 35 µl of replication cocktail were labeled at 34°C for 1.5 min. The reaction was stopped by the addition of 470 µl of lysis buffer (50 mM Tris [pH 7.8], 10 mM EDTA, 0.4 M NaCl, 0.6% sodium dodecyl sulfate [SDS]) containing 200-µg/ml proteinase K. DNA was isolated and sonicated as previously described (24, 73). Typically, 2 x 107 cells resulted in 30 million cpm of labeled DNA. Nascent DNA strands at replication forks in asynchronously growing cells were similarly labeled. Radiolabeled DNA was heat denatured and hybridized to four independently printed PCR arrays (and/or one to three plasmid arrays) in hybridization bottles as previously described (73). To normalize for any variation in probe intensity due to the size of the probes, hybridization intensity, or AT content, total genomic DNA was random labeled according to the manufacturer's protocol (Invitrogen) and similarly hybridized. Hybridization was allowed to proceed for 14 h, after which the membranes were washed twice with 2x SSC (1x SSC is 150 mM NaCl and 15 mM sodium citrate [pH 7.0]) and 0.2% SDS for 20 min each wash and with 0.2x SSC-0.2% SDS for 15 minutes each. The signal intensities were measured by exposure to Molecular Dynamics phosphorimager screens. The sizes of nascent strands were determined as previously described (72), except that 0.7% gels were run overnight at 60 V.
Analysis of DNA replication in Xenopus egg extracts. Xenopus egg extracts were prepared and handled as previously described (73), except that 40 nM geminin was added to all reactions prior to the addition of nuclei to extract, to inhibit in vitro assembly of prereplication complexes (pre-RCs) (16, 52). Analysis of replication was essentially as previously described (14, 73). Briefly, cells were synchronized at various time points during G1 phase by mitotic selection and permeabilized with digitonin. For experiments that evaluated the positions of aphidicolin-arrested forks, nuclei were incubated in Xenopus egg extract at 21°C for 90 min at 10,000 nuclei per µl of extract in the presence of 100 µg of aphidicolin/ml. After incubation in Xenopus extract, reactions were stopped by the addition of ice-cold hypotonic buffer (20 mM HEPES [pH 7.5], 5 mM KCl, 1.5 mM MgCl2). Nuclei were pelleted at full speed for 16 s in a Sorvall MC-12V, resuspended in hypotonic buffer, and incubated on ice for 10 min to remove aphidicolin and nucleotide pools. Nuclei were pelleted and resuspended in a labeling mixture that contained 50 µCi of [32P]dATP, and the labeling reaction was performed as previously described for 10 min at 12°C (73). For experiments that labeled nascent DNA directly, nuclei were incubated in Xenopus egg extract supplemented with 2.0-mCi/ml [32P]dATP at 21°C for 40 min at 25,000 nuclei per µl of extract. Ice-cold hypotonic buffer was added to stop the reaction, and nuclei were pelleted and resuspended in lysis buffer with proteinase K as described above. DNA isolation and hybridization were as described above.
Nuclear run-on hybridizations.
We followed the protocol of Hirayoshi and Lis (26), with modifications described by Chasin and colleagues (65). Subconfluent dishes of cells were chilled on ice, and cells were washed twice with ice-cold 1x SSC. Cells were removed from the culture plate using a cell scraper in 10 ml of 1x SSC per dish and collected by centrifugation (500 x g; 5 min). Cell pellets were resuspended in 1 ml of NP-40 lysis buffer (10 mM Tris-HCl [pH 7.4], 10 mM NaCl, 3 mM MgCl2, 0.1 mM EDTA, 0.5% [vol/vol] NP-40) per 107cells and incubated for 5 min on ice. Nuclei were then washed twice with NP-40 lysis buffer, resuspended in 100 µl freezing buffer (50 mM Tris-HCl [pH 8.3], 40% [vol/vol] glycerol, 5 mM MgCl2, 0.1 mM EDTA) per 107 cells, snap frozen in liquid nitrogen, and stored at 80°C in 100-µl aliquots. To perform run-on reactions, aliquots of nuclei were thawed, mixed with 100 µl of 2x reaction buffer (10 mM Tris-HCl [pH 8.0]; 5 mM MgCl2; 300 mM KCl; 0.5 mM each ATP, CTP; and GTP; and 150 µCi [
-P32]UTP [800 Ci/mmol]), and incubated at 30°C for 15 min. A total of 60 U of RNase-free DNaseI (Promega) were added, and the reaction mixture was incubated for an additional 30 min at 37°C. An equal volume of stop buffer (20 mM Tris-HCl [pH 7.4], 2% SDS, 10 mM EDTA, 200-µg/ml proteinase K) was added to the reaction mixture and further incubated at 42°C for 30 min. RNA was then extracted twice with water-saturated phenol:chloroform:isoamyl alchohol (25:24:1). NH4 acetate was added to 2.5 M, and RNA was precipitated with 3.5 volumes of ethanol. RNA pellets were resuspended in 100-µl diethyl pyrocarbonate-treated water, reprecipitated in ethanol, and resuspended again in 100 µl diethyl pyrocarbonate-treated water. Ten units of DNaseI and 10 µl of 10x reaction buffer (400 mM Tris-HCl [pH 8.0], 100 mM MgSO4, 10 mM CaCl2; Promega) was added and incubated at 37°C for 30 min to completely remove all contaminating DNA. Reactions were extracted once with phenol to remove DNaseI, and DNA was precipitated and resuspended in 100 µl of water. DNA was denatured at 65°C for 5 min, added to hybridization buffer, and hybridized as described above.
|
|
|---|
![]() View larger version (58K): [in a new window] |
FIG. 1. Distribution of PCR and plasmid probes used to assemble arrays. (A) Map positions of the sequences determined in silico containing repeats using RepeatMasker and the plasmid and PCR probes remaining after eliminating probes containing repeats, as described in the text. Gray arrows represent the transcription units of Msh3, DHFR, and 2BE2121 genes. The arrowhead indicates the transcription termination site for the gene (the Msh3 termination site is not within the sequence); multiple termination sites for DHFR (63) are indicated by different shades of gray. Promoter elements for DHFR and Msh3 are located in a G+C-rich region near 18 kb. Positions of exons are indicated as black boxes. (B) Histograms illustrate the number and sizes of gaps with arrays constructed from PCR products and plasmids.
|
![]() View larger version (49K): [in a new window] |
FIG. 2. Hybridization pattern of the earliest-labeled nascent strands to an array of PCR probes. (A) Nascent DNA strands in nuclei prepared from cells that were synchronized at the G1/S boundary with either aphidicolin (G1/S Aph.) or mimosine (G1/S Mimo.) were labeled in an in vitro replication cocktail and then separated by size on alkaline agarose gels. Gels were scanned with a phosphorimager, and each lane was traced with Image Quant software. Positions of molecular weights, determined from markers included in the same gel, are shown on the x axis. (B) Nascent strands prepared as in panel A, as well as those prepared similarly from asynchronously growing cells (Asynch.), were hybridized to PCR arrays. Shown are examples of array images after being scanned with a phosphorimager. (C) In each experiment, nascent strands were hybridized to four independently printed arrays to control for printing errors. Raw phosphorimager values for each probe were normalized to the mean of all probes on each array (raw/mean). Shown are the mean values from the four arrays hybridized in a single experiment. Error bars, representing the standard error of the mean, illustrate the reproducibility between independently printed batches of arrays. (Top) Comparison of aphidicolin-synchronized (red) to asynchronous (black) samples; (bottom) mimosine-synchronized (blue) compared to asynchronous (black) samples. The values for labeled gDNA (green), hybridized to the same set of arrays, provide a reference for variability between probes due to hybridization efficiencies. (D) Values for each probe hybridized with nascent strands were divided by the corresponding values with gDNA to correct for differences in probe hybridization efficiency. Shown are the means of two independent experiments synchronizing with mimosine (blue) and three independent experiments synchronizing with aphidicolin (red). Error bars indicate the standard deviation between experiments for each probe. A schematic diagram of the transcription units is provided below the x axis. A higher dynamic range of signals was consistently obtained with nascent strands from aphidicolin-arrested cells (red) compared to mimosine-arrested cells (blue). The reasons for this difference are not clear (similar total counts per minute were labeled) but could indicate that mimosine synchronizes cells at an earlier point in S phase, when fewer DHFR origins have fired relative to the population of earliest replicating origins, resulting in a lower signal-to-noise ratio. Consistent with this interpretation, the total counts per minute across all DHFR probes on arrays hybridized with mimosine-arrested forks were consistently two-thirds of those with aphidicolin-arrested forks, despite the similar total counts per minute labeled.
|
Several sequence analyses of the 121-kb region were performed to determine if there were any unique features of the intergenic sequence that could distinguish it from the transcription units. Previous analysis of an initiation zone in Drosophila melanogaster (28) revealed peaks of AT richness coinciding with peaks of initiation activity. Sliding 200-bp windows in 10-bp steps were analyzed for AT content, revealing two sharp peaks of G+C-rich DNA, one in the bidirectional DHFR-Msh3 promoter region (17) and one at nucleotide positions 82.7 to 85.2 kb. Otherwise, uniform variation in AT content was observed throughout the remainder of the sequence. We also did not detect any matches to a previously reported consensus sequence for mammalian replication origins (57), although we detected occurrences of
16/20 matches throughout the entire 121-kb region. We also did not identify any DNA sequence characteristics that were unique to the intergenic region.
As shown in Fig. 1, the PCR-amplified probes represented in our array naturally contained gaps that were difficult to fill, due to the presence of repetitive sequences. However, one advantage of using CHOC 400 to analyze origin distribution is that the approximately 1,000 copies of the DHFR locus overshadowed the more moderately repetitive sequences, allowing us to interrogate a greater percentage of the DHFR locus. Hence, we constructed an array consisting of overlapping plasmid clones obtained as an outcome of shotgun sequencing. To eliminate probes that contained highly repetitive elements (>1,000 copies per cell), this panel was hybridized to labeled CHOC 400 genomic DNA, and probes that deviated >20 fold from the average counts per minute per base pair were removed from the array, leaving a panel of 122 overlapping plasmid probes with an average insert size of 970 bp, spanning 69.2% of the 121-kb sequenced region. When one considers the coverage of both the PCR and plasmid arrays, 78.9% (95,780 bp) of the total sequenced region (121,368 bp) was covered, leaving very few gaps (Fig. 1B).
The experiments shown in Fig. 2 were repeated, hybridizing labeled nascent DNA from aphidicolin- and mimosine-arrested cells to this panel of plasmid probes (Fig. 3). These results also revealed a nearly uniform distribution of earliest-labeled nascent strands throughout the intergenic region and excluded from the known transcription units. Overlaying the plasmid and PCR arrays revealed a similar pattern of initiation from both arrays, supporting the conclusion that initiation is largely, if not exclusively, confined to the intergenic region.
![]() View larger version (52K): [in a new window] |
FIG. 3. Hybridization pattern of the earliest labeled nascent strands to an array of plasmid probes. Nascent strands from cells synchronized at the G1/S border with aphidicolin (A) or mimosine (B) were labeled as in Fig. 2 and hybridized to a plasmid array (red). Values for each probe were corrected to the corresponding values from arrays hybridized with randomly labeled genomic DNA, as in Fig. 2. Shown are the means from two independent experiments and the standard deviation. To illustrate the coverage obtained by using both PCR and plasmid arrays, the results of the PCR array from Fig. 2C (blue) were plotted alongside those from the plasmid arrays (red).
|
Two different labeling schemes were used to map the sites of initiation of replication in Xenopus egg extracts. In the first, nuclei isolated at various times after mitosis were introduced into geminin-supplemented extracts that also contained aphidicolin to arrest in vitro-generated replication forks near their sites of initiation. After we allowed sufficient time to initiate replication in all nuclei, nuclei were then washed free of aphidicolin-supplemented extract and replication forks were extended in a replication cocktail containing [32P]dATP. Under these conditions, labeled nascent strands were <1 kb (Fig. 4A) (14, 32), suggesting that forks are much more efficiently arrested in the cell-free system than they are in vivo. In the second method, we labeled replication intermediates directly in geminin-supplemented extracts that contained [32P]dATP but no aphidicolin, which labels nascent DNA starting from the first nucleotides to be synthesized but captures a population of nascent strands ranging in size from 1 to 20 kb (Fig. 4A). In both cases, labeled nascent DNA was hybridized to both PCR and plasmid arrays as in Fig. 2 and 3. An example of results using each method with nuclei isolated 5 h after mitosis is shown in Fig. 4. With both methods, the initiation pattern closely resembles that seen when replication initiates in vivo (Fig. 2 and 3), except that slightly more initiation activity was detected within the 2BE2121 gene and there appeared to be a more even distribution of initiation sites within the intergenic region in vitro than in vivo. Both PCR and plasmid arrays gave similar results, consistent with a broad distribution of initiation sites throughout the intergenic region.
![]() View larger version (33K): [in a new window] |
FIG. 4. Sites of initiation of replication in Xenopus egg extracts. Nuclei from cells synchronized at 5 h after mitosis were introduced into a Xenopus egg extract, and nascent strands were either arrested with aphidicolin for 90 min and then labeled after release (in vitro aphidicolin) or were labeled directly in extract for 40 min (i.e., a 20-min lag period plus 20 min of label incorporation). (A) The sizes of nascent strands were determined as in Fig. 2. (B) Nascent strands were hybridized to both PCR (blue) and plasmid (red) arrays and analyzed as in the results shown in Fig. 3. Shown are the means of three independent experiments in which nascent strands were hybridized to four independently printed arrays. A gray line is drawn at 1.0, to indicate the mean of all probes in each array.
|
![]() View larger version (44K): [in a new window] |
FIG. 5. Initiation is redistributed to the intergenic region at the ODP. (A to E) Nuclei from cells synchronized at hourly intervals after mitosis were introduced into Xenopus egg extracts, and the sites of initiation were mapped by the direct labeling method shown in Fig. 4. As in Fig. 4, results with both PCR (blue) and plasmid (red) arrays are coplotted, and a gray line indicates the mean of all probes in each experiment. Shown are mean values from two independent experiments and the standard deviation. (F) The total counts per minute across all DHFR probes for each time point were normalized to the same total for nuclei at 5 h after mitosis. Shown are the mean values from two independent experiments.
|
Because CHO DHFR is a weakly transcribed gene (65), it was very difficult to detect nuclear run-on products using the small probes present on our PCR array, even though the DHFR gene in CHOC 400 cells is present at 1,000 copies, and even after growing CHOC 400 cells for several weeks in 400-µg/ml methotrexate, the drug used to select for a high DHFR copy number. To overcome this obstacle, we pooled groups of PCR-generated nonrepetitive probes representing
10-kb regions across the sequenced region and immobilized these pooled probes on a slot blot (Fig. 6A). Radiolabeled nuclear run-on products were prepared using nuclei isolated from cells synchronized at 2 and 5 h after mitosis (pre- and post-ODP), 12 h after mitosis (S phase), and from asynchronous cells. To compare results from different run-on reactions, equal numbers of total counts per minute from run-on products were used; a cDNA probe from the hamster adenine phosphoribosyltransferase (APRT) gene, which is robustly expressed throughout the cell cycle (47), was included on the slot blot. After quantitative analysis of the hybridized filter, the relative counts per minute for each pool of PCR products were normalized to the corresponding value for the APRT gene (Fig. 5B). When Fig. 6B is compared to Fig. 2 to 5, it is clear that there is an inverse relationship between transcription and origin activity near the 3' end of the DHFR gene, with the probes in the region from nucleotide position 56 to 79 kb being the most active initiation sites and the most weakly transcribed. Weak transcription activity above the background detected with the negative control lambda DNA probe was detected from 80 to 106 kb. Consistently, this region also showed lower activity for replication initiation (Fig. 2 to 5). However, although transcription within the 2BE2121 gene was also weak, suppression of replication initiation activity was considerably more pronounced in this region (Fig. 2 to 5). Moreover, transcription was clearly detected prior to the ODP.
![]() View larger version (51K): [in a new window] |
FIG. 6. Nuclear run-on analysis of transcription during G1 phase. (A) Labeled transcription products from nuclear run-on reactions with nuclei synchronized at 2, 5, or 12 h, as well as from asynchronous (Asynch.) cells, were hybridized to pools of PCR probes covering approximately 10 kb per pool. The arrangement of these probes is schematically illustrated on the left. APRT is transcribed throughout interphase (47) and serves as an internal control for each hybridization (separate 3.8- and 2.7-kb APRT probes). (B) Quantification of the data in panel A, with the relative counts per minute for the larger APRT probe adjusted to 1.0 for each hybridization. The x axis includes a schematic diagram of the probe positions indicated in panel A, the map position boundaries of the probes pooled together in each data point, and gray boxes to indicate the positions of the pooled probe boundaries on the schematic. Shown are the means of three independent experiments and the standard deviation. (C) To verify the polarity of transcripts synthesized in nuclear run-on reactions, labeled products were also hybridized to previously described (24) single-stranded (M13 phage) probes from specific locations along the length of the DHFR locus. For genes, antisense refers to the DNA strand predicted to be transcribed, whereas for intergenic probes, antisense refers to the orientation of the predicted transcribed strand for the DHFR gene. Note that the sense and antisense strands of probe O were inadvertently spotted in the reverse order, as indicated. (D) Quantification of the data in panel C, with the relative counts per minute for the double-stranded APRT probe adjusted to 1.0 as in panel B. Here, gray triangles point to the indicated map position of the single probe on the schematic. Probe C at 60.6 kb is at the position of ori-ß as it has been most precisely defined (35). Shown are the means of three independent experiments and the standard error of the mean. The x axis indicates the map positions of probes, and the gray triangles indicate these positions on the schematic map.
|
Origin specification requires transcription. The genomic regions within which initiation of replication becomes excluded coincide with the known transcription units, with the greatest amount of suppression within the more highly expressed DHFR and Msh3 genes. These data, together with data summarized in the introduction, strongly suggest that although transcription is not sufficient to suppress initiation of replication, it is likely to play some role. To more directly address whether transcription is necessary for focusing initiation to the intergenic region, we examined the effect of inhibiting RNA Pol II transcription on the specification of origin sites at the ODP. Alpha-amanitin binds to the large subunit of Pol II and inhibits transcription by Pol II considerably more effectively than rRNA polymerases I and III (69). CHOC 400 cells were treated with 0-, 1-, 10-, or 100-µg/ml alpha-amanitin at 2 h after mitosis and then incubated an additional 3 h. Nuclei were then isolated, and aliquots of those nuclei were subjected to nuclear run-on assays to measure the amount of DHFR transcription, while the remainder of the nuclei were introduced into Xenopus egg extract to examine the sites of initiation of DNA replication. As illustrated in Fig. 7, the focusing of initiation sites to within the intergenic region at the ODP was strongly prevented between 1- and 10-µg/ml alpha-amanitin (Fig. 7A), and DHFR and Msh3 gene transcription was strongly inhibited within the same range of alpha-amanitin concentrations. Since we have previously shown that origin selection does not require protein synthesis (33), the requirement for transcription is not due to an indirect requirement for the expression of a gene product during this time. These results implicate a pretranslational role for transcription in the suppression of initiation of DNA replication that occurs at the ODP.
![]() View larger version (40K): [in a new window] |
FIG. 7. Alpha-amanitin inhibition of transcription and origin selection during G1 phase. (A) CHOC 400 cells were treated with the indicated concentrations of alpha-amanitin at 2 h after mitosis, nuclei were collected at 5 h, and the sites of initiation of replication in Xenopus egg extracts were mapped as in Fig. 5. (B) Aliquots of the same populations of cells shown in panel A were analyzed for transcription by nuclear run-on assays as in Fig. 6. (C) The total counts per minute hybridized to all DHFR probes for each time point in panel A are expressed as a percentage of that for nuclei isolated from untreated cells, as in Fig. 5F. (D) The total counts per minute incorporated into high-molecular-weight RNA in each of the experiments shown in panel B are expressed as a percentage of that for untreated cells. Since the vast majority of RNA synthesis in nuclear run-on reactions is ribosomal and mediated by RNA polymerases I and III, which are poorly inhibited by alpha-amanitin, the total counts per minute hybridized in the experiments shown in panel B were very similar. Shown are the mean values from two independent experiments and the standard deviation.
|
|
|
|---|
The contribution of DNA sequence in the context of an initiation zone. Our previous results demonstrated that potential sites for the initiation of replication become more focused to the DHFR intergenic region at the ODP. However, the resolution of these prior studies, which interrogated 17 independent sites throughout a 110-kb region, could not distinguish whether initiation within pre-ODP nuclei was taking place at many or only a few additional sites. For example, we have demonstrated that a very specific site within the DHFR gene is highly preferred as an initiation site when topoisomerase II-condensed metaphase chromosomes are introduced into Xenopus egg extracts (37). Activation of just this one additional site could substantially skew the relative distribution of replication forks among these 17 probes. By sequencing the entire DHFR locus, we have provided a novel tool with which to probe initiation in this region, allowing us to demonstrate that there is little or no site preference for initiation throughout the entire 121 kb when replication initiates within pre-ODP nuclei. Thus, under these conditions, it appears that virtually any sequence can function as a site for the initiation of replication. Indeed, it is possible that initiation within the DHFR locus in pre-ODP nuclei is as flexible as initiation in early Xenopus embryos, which initiate replication at random with respect to DNA sequence (27, 42). Although we cannot yet rule out the possibility that many specific sites are spaced at frequent intervals (every few kilobases), DNA sequence specificity is all but ruled out.
In post-ODP nuclei and in vivo, initiation is clearly suppressed within the transcription units, but our results indicate that initiation can take place at a large number of closely spaced sites within the intergenic region. At first glance, this may seem to contradict the precise identification of ori-ß and ori-ß' initiation sites (35). However, these results can be reconciled by considering the resolution of the different studies. These specific sites were identified by interrogating the presence of a few specific sequences in preparations of 0.8- to 1.5-kb nascent DNA strands. Our analysis is more comprehensive but is a lower-resolution study that might not distinguish differences in initiation activity within a few kilobases, due to the sizes of the probes (300 to 1,500 bp) and nascent strands (10 to 20 kb for aphidicolin and 500 to 2,500 bp for mimosine). Moreover, Fig. 2 and 3 suggest some discontinuity in the frequency of site usage within the intergenic region. In fact, when we have hybridized smaller (0.8- to 1.5-kb) nascent strands to our arrays, more-prominent peaks could be observed, including those at ori-ß and ori-ß', but no sites within the intergenic region were completely silent (under these conditions, many peaks were represented by single probes, rendering these data too preliminary to report). Hence, our results are more consistent with 2D-gel analyses that detect some initiation activity in regions that appear silent by the directed PCR method (13). At present, then, the only remaining discrepancy in the initiation pattern at this locus is whether the sites surrounding these peaks are truly silent, as the directed PCR studies suggest, or whether they are used at a lower frequency. Higher-resolution arrays should resolve this long-standing controversy.
Regardless, the large number of sites that can function as replication origins raises the question as to the role of DNA sequences within the context of such a relaxed initiation zone. In fact, deletion of ori-ß or even larger segments of the DHFR intergenic region appears to stimulate the frequency of usage of other sequences in the intergenic region (45), suggesting that redundancy is part of a mechanism to ensure that initiation takes place somewhere within this region. How is it, then, that specific sequences within a small DHFR ori-ß fragment are required for initiation activity when this fragment is moved to ectopic sites (3)? Indeed, this seems almost impossible to reconcile with the results of 2D-gel studies that detect replication bubbles at many sites within the same specific fragment (30). At present, the most parsimonious explanation is that, even within the context of relaxed replicator selection in a large initiation zone, there are localized effects of specific sequences that can influence the frequency with which particular sites are utilized. In this view, transcription contributes to the global exclusion of initiation from transcription units, while other features of chromatin, DNA methylation, topology, and DNA sequence composition contribute to the relative frequency with which replication will initiate at particular sites (22). A challenge for the future is to determine whether the locations of these sites serve some function or whether they are simply the most permissive sites for initiation in that region. Since it is now clear that duplicating genomes once per cell cycle does not require site-specific initiation (23), any putative function for specific origin sites must transcend the basic need to duplicate the genome.
Replication and transcription. One obvious conclusion of our analysis is that replication initiation is largely suppressed within the transcription units of the DHFR locus. This is consistent with a recent genetic study showing that deletion of the DHFR promoter resulted in the initiation of replication at many sites within the gene (62). We have previously suggested a model in which pre-RCs in the path of a moving RNA polymerase are destabilized upon collision with the transcriptional machinery (15, 22, 23). This model accounted for all prior observations, including the occasional replication origin found within transcription units (34, 64), provided that transcription does not occur prior to the initiation of replication at any given origin. In fact, a large inhibitor study revealed that all but one inhibitor of general transcription also inhibited the ODP (33). The one exception was a protein kinase A inhibitor, which likely inhibits transcription through the CREB pathway that does not regulate DHFR transcription (75). However, our results reported here indicate that this model is an oversimplification. First, transcription of the genes within the DHFR locus is active prior to the exclusion of transcription units as sites of initiation at the ODP. No significant increase in the number of RNA polymerase molecules loaded onto DHFR transcription units was detected at the ODP. Hence, the resumption of transcription during G1 phase is not sufficient to inactivate pre-RCs. Furthermore, the DHFR gene is not transcribed very efficiently; the approximately 1,000 copies were transcribed at a frequency similar to that of the single-copy APRT gene (Fig. 6). The 2BE2121 gene was transcribed even less efficiently. Hence, it is possible that only a minority of the gene copies are active in any given cell.
Nonetheless, both the inhibition of transcription (Fig. 7) and deletion of the DHFR promoter (62) have profound affects on origin specification. Inhibitor studies show that the effect does not require translation of RNA (33), and genetic studies suggest that the mechanism linking transcription and origin specification acts in cis at the DHFR locus rather than through a trans-acting, noncoding RNA. Since a sufficient fraction of the total amplified DHFR promoters are occupied with transcription factors to generate an in vivo footprint (54, 67), it is possible that some aspect of the transcriptionally active state, rather than the act of transcription itself, renders the entire transcription unit inaccessible to replication initiation factors. Alternatively, it is possible that only those gene copies that are actively engaged in transcription are competent to initiate replication and that those are also the copies for which transcription has suppressed intragenic initiation. This would imply that transcription plays both positive and negative roles in origin selection. Whatever the nature of this component, the transcriptionally active state on its own is clearly not sufficient to alter the usage of replication origin sites in pre-ODP nuclei. Some additional event, possibly cooperating with the transcriptionally active state, restricts initiation to the intergenic region.
The use of arrays to probe initiation. The remarkable flexibility in the DNA sequences that can function as replication origins in mammalian cells underscores the need for origin mapping approaches that are not biased toward the evaluation of any particular sites predicted to be origin or nonorigin regions. Presently, three such methods have been described. One method is to map the sites of initiation along the length of stretched DNA fibers (5, 50). This method has the distinct advantage of detecting the frequency of multiple initiation events on the same DNA fiber, but it is currently a very demanding and time-consuming method. Another method is to evaluate the presence of many individual sequences positioned at 1-kb intervals in preparations of small nascent DNA strands (58). Carrying out this method one primer set at a time is also extremely laborious. In principle, hybridization of the entire population of small nascent strands to a complete-coverage DNA array should provide a much more convenient and rapid means of evaluating origin usage. Unfortunately, this method has not succeeded in any eukaryotic organism. In fact, the genome-wide identification of origins using DNA arrays has only been indirectly determined in budding yeast by identifying sites of origin replication complex binding (60) or earliest-replicating DNA (59, 74) and in Drosophila by hybridizing earliest-labeled DNA to a high-resolution array of a chromosome arm (43). Here, we present the first similar approach in mammalian cells, hybridizing earliest-labeled G1/S DNA to an array covering a defined 121-kb region. As discussed above, our attempts to hybridize purified small nascent strands to this array have not yet provided conclusive results, a problem that we hope to solve with higher-resolution arrays. However, starting with the amplified DHFR locus in CHOC 400 cells provides the sensitivity afforded by simpler model organisms, while at the same time allowing us to work through the methodology in mammalian cells. By taking advantage of cell lines with fewer and fewer DHFR amplicons (41), we hope to systematically work our way toward the goal of rapidly mapping the distribution of origins at single-copy loci in mammalian cells.
This work was supported by NIH grant GM-57233 to D.M.G. T.S. was supported by a fellowship from the New York State DOH Breast Cancer Initiative.
These authors contributed equally. ![]()
Present address: Roswell Park Memorial Institute, Molecular and Cellular Biology, 666 Elm St., Buffalo, NY 14263. ![]()
Present address: Graduate School of Frontier Biosciences, Osaka University, 1-3 Yamadaoka, Suita, Osaka 565-0871, Japan. ![]()
|
|
|---|
, consists of AT-rich multiple discrete initiation sites. Chromosoma 109:551-564.[Medline]
- and ßA-globin genes show opposite epigenetic modifications. Mol. Cell. Biol. 23:3536-3549.This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»