Mol Cell Biol, January 1998, p. 276-289, Vol. 18, No. 1
0270-7306/98/$04.00+0
Copyright © 1998, American Society for Microbiology. All rights reserved.
Department of Chemistry and Biochemistry, University of California at Los Angeles, Los Angeles, California 90095-1569
Received 28 July 1997/Returned for modification 30 September 1997/Accepted 13 October 1997
| |
ABSTRACT |
|---|
|
|
|---|
We mapped the elements that mediate termination of transcription
downstream of the chicken
H- and
A-globin
gene poly(A) sites. We found no unique element and no segment of
3'-flanking DNA to be significantly more effective than any other. When
we replaced the native 3'-flanking DNA with bacterial DNA, it too
supported transcription termination. Termination in the bacterial DNA
depended on a functional poly(A) signal, which apparently compelled
termination to occur in the downstream DNA with little regard for its
sequence. We also studied premature termination by poorly processive
polymerases close to the promoter. The rate of premature termination
varied for different DNA sequences. However, the efficiencies of
poly(A)-driven termination and promoter-proximal premature termination
varied similarly on different DNAs, suggesting that poly(A)-driven
termination functions by returning the transcription complex to a form
which resembles a prior state of low processivity. The poly(A)-driven
termination described here differs dramatically from the
poly(A)-assisted termination previously described for the simian virus
40 (SV40) early transcription unit. In the SV40 early transcription
unit, essentially no termination occurs downstream of the poly(A) site
unless a special termination element is present. The difference between
the
H-globin and SV40 modes of termination is governed
by sequences in the upstream DNA. For maximum efficiency, the
H-globin poly(A) signal required the assistance of
upstream enhancing sequences. Moreover, the SV40 early poly(A) signal
also drove termination in
H-globin style when it was
placed in a
H-globin sequence context. These studies
were facilitated by a rapid, improved method of run-on transcription
analysis, based on the use of a vector containing two G-free cassettes.
| |
INTRODUCTION |
|---|
|
|
|---|
Our understanding of transcription termination by eukaryotic RNA polymerase II is steadily increasing as more terminators from protein-coding genes are characterized. One theme that has emerged from the characterization of several transcription units is the requirement of a functional poly(A) signal for efficient termination to proceed (11, 32, 36, 66). In many cases, if any part of the poly(A) signal is damaged by mutation, transcription termination is impaired (11, 15, 36, 66). In other cases, however, a correlation between the efficiency of polyadenylation and the efficiency of termination is not apparent (7, 17, 61). One interpretation of poly(A)-dependent termination is that some aspect of the 3'-end processing reaction is involved in potentiating termination by the polymerase. Possibly the 3'-end cleavage event activates a termination factor (11, 51). Alternatively, assembly of the cleavage-polyadenylation complex at the polymerase surface may signal termination (41).
The nature of the actual termination event potentiated by the poly(A) signal is unclear. Logan et al. (36) have suggested that interaction with the poly(A) signal returns the transcriptional apparatus to a prior state of low processivity. Thus altered, the polymerase dissociates stochastically from the template. This model is based on the observation that successful transcription of eukaryotic pre-mRNA-coding genes requires not only the initiation of transcription at the promoter but also the establishment of a processive elongation complex. The commencement of elongation without the establishment of processivity results in premature termination of transcription within the first several hundred base pairs downstream of the promoter (8, 28, 33, 40, 42, 47-49, 52, 53, 67, 69). The model of Logan et al. (36) is consistent with the behavior of transcription units in which transcription terminates heterogeneously, beginning immediately downstream of the poly(A) site (1, 22). However, no candidate Logan-type termination region has yet been definitively characterized.
On the other hand, not explained by the model of Logan et al. (36) are the many instances in which transcription proceeds for great distances past the poly(A) site with little decrease in processivity (12, 27, 39, 44). In the well-characterized case of a chimeric plasmid containing the simian virus 40 (SV40) early transcription unit (10, 11), little termination occurs beyond the poly(A) site unless a special termination element is encountered. However, a functional poly(A) signal is required in order for the downstream termination element to be fully effective (10, 11). Thus, efficient termination in this case is potentiated by a functional poly(A) signal but is realized only upon encountering the additional downstream terminator.
Downstream terminator elements may not be uncommon and apparently take a variety of forms. The one described by Connelly and Manley (10, 12) is the protein binding site, CCAAT, probably bound by CP1. Ashfield et al. (2, 3) have also characterized a protein binding site (for MAZ) that acts as a downstream terminator. Additional downstream terminators that probably act at the level of DNA sequence per se rather than as protein binding sites have previously been described (16, 60). How these elements act is unknown, but their collaboration with poly(A) sites has led to the suggestion that they are pause sites, whose role is to delay the polymerase until polyadenylation has had sufficient time to occur (11, 51). Indeed, independent evidence suggests that one of these elements is in fact a pause site (16).
Although the downstream elements that have been characterized so far depend on functional upstream poly(A) signals for full efficiency, in at least one case, the CCAAT box, the signal is partially effective on its own. Moreover, duplicating the element increases its effectiveness when no poly(A) site is present (12). Thus, as has previously been suggested (14, 55), it is possible that some terminators of processive RNA polymerase II transcription do not require the assistance of a poly(A) signal.
In the present work, we characterized extensively the elements
responsible for termination of chicken
H-globin gene
transcription. We found that termination of transcription for this gene
is best described by the model of Logan et al. (36) and thus
differs from that of most genes so far characterized. Termination was
poly(A) dependent, and no additional downstream termination element was
required. Moreover, termination appeared to commence promptly after the
poly(A) site; no significant span of processive transcription
intervened between the poly(A) signal and the region of termination. In
comparison to the previous work described above, these results
suggest that the following two modes of transcription termination
exist:poly(A)-driven, terminator-independent termination and poly(A)-assisted, terminator-dependent
termination. We therefore carried out a functional comparison of
the 3'-flanking regions of the
H-globin and SV40 early
transcription units. The comparison confirmed the inferred difference
between the two termination modes and showed moreover that the
poly(A)-driven mode of the
H-globin gene depends on the
presence of an upstream termination enhancer which is not present in
the SV40 early region.
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Plasmids pORgf3, pORgf3·2, pORgf7, p<ABCDEF>, p<ABCDE>, pAsv<BsvL>, p<3>1·2, and pHyb. The 376-bp G-free segment for pORgf3 was obtained as a BamHI-EcoRI fragment from the original G-free construct, pC2AT (56), and inserted into the multiple cloning site of pBluescript SKII+ (Stratagene) at the SmaI site in the G-free orientation with respect to the T7 promoter. The multiple cloning site with embedded G-free cassette was excised by BssHII digestion and inserted into the HindIII site of pOR4 (13). This is just downstream of the SV40 origin so that G-free transcription is driven by the SV40 early promoter (21). For pORgf3·2 (Fig. 1A), the 261-bp G-free segment was obtained as a HindIII-BglII fragment from pRL542 (37) and inserted into the HindIII site of pBluescript SKII+ in the G-free orientation with respect to the T3 promoter. This was excised from the multiple cloning site as a BamHI-SalI fragment and inserted into the BstXI site downstream of the 376-bp cassette of pORgf3. For the 750-bp G-free cassette in pORgf7, a 374-bp G-free SstI-SmaI fragment from pORgf3 was reinserted into the SmaI site of pORgf3 to tandemize the G-free cassette.
|
H 3'-flanking region in p<ABCDEF> was reassembled
from the inserts in pCBG11 and pCBG13 (63) as follows.
First, DNA was assembled in the
ClaI-SalI-XhoI-ApaI-KpnI
portion of the pBluescript SKII+ multiple cloning site by inserting a
Bsp1286I fragment (A) into the SalI site (with
regeneration of SalI sites flanking fragment A) and an
MboII fragment (BCDE) into the ApaI site to give
the intermediate pABCDE. Then the ensemble containing fragments A, B,
C, D, and E was transferred as a ClaI-KpnI
fragment into the BamHI site of pORgf3·2 to give
p<ABCDE>. Finally, an F-containing BstXI-XbaI
fragment from p<DEF> (Table 1) (BstXI is
in segment D, and XbaI is in the vector) was used to replace
the homologous segment of p<ABCDE>, thus generating p<ABCDEF>. This
construction route introduces some multiple cloning site nucleotides
(including the XhoI site) between segments A and B and also
results in the repetition of 35 chicken nucleotides at the A-B
junction.
|
Confirmation of plasmid constructs. All constructs used in this work were confirmed by restriction mapping with at least three different digests. Selected plasmids were also subjected to further confirmation as follows. The presence of the G-free cassettes in pORgf3·2 (Fig. 1A) and in p<neo> and p<ABCDE> (Table 1) was confirmed by using a T7 promoter situated 5' of the first G-free cassette to generate G-free transcripts in the presence of T1 RNase. Sequencing was used to confirm the 5' ligation junction of the upstream cassette in pORgf3·2. To verify that the plasmids in a given preparation constituted a homogeneous population (i.e., contained no admixed deletion variants), plasmids pORgf3·2, p<neo>, p<ABC>, p<ABD>, p<ABCDE>, and p<ABDEF> were cut with restriction enzymes to isolate the two G-free cassettes on separate restriction fragments and then analyzed by gel electrophoresis. The ethidium bromide staining pattern was recorded by digital photography and analyzed with NIH Image (version 1.60) to verify that the cassette-containing fragments within each plasmid were present in equal molar ratio.
Transfection and isolation of nuclei.
For our early work
(Fig. 1D), we used DEAE-dextran transfections into COS cells by the
method of Kaufman (30), except that there was no chloroquine
treatment. For most of the work reported here, we used calcium
phosphate transfections by the method of Ausubel et al. (4),
except that COS cells were incubated at 37°C under 5%
CO2 at all times. The DNA-calcium phosphate mix was
incubated at room temperature for exactly 20 min before being added to
cells. After 20 to 24 h, cells were rinsed twice with 5 ml of
serum-free Dulbecco modified Eagle medium and replated with 10 ml of
Dulbecco modified Eagle medium containing 10% fetal bovine serum. For
our most recent experiments, we used Lipofectamine (Gibco BRL) for
transfection, employing the protocol supplied by the manufacturer. Six
micrograms of DNA and 18 µl of Lipofectamine were used on cells that
were 50 to 80% confluent in a 60-mm-diameter tissue culture dish. In
our experience, Lipofectamine gives the most reliable results because
of the improved signal-to-noise ratio that results from the higher
transfection efficiency. In all cases, nuclei were isolated 44 to
48 h after transfection by the method of Ausubel et al.
(4). During the final step, nuclei were washed with nuclei
storage buffer containing 1 mM dithiothreitol and resuspended in a
final volume of 40 µl of nuclei storage buffer (except for the
hybridization run-on procedure, where the final volume was 100 µl).
Nuclei were stored at
70°C for up to several months.
Nuclear run-on transcription and hybridization. Run-on transcription was performed by the method of Ausubel et al. (4). The final RNA sample was resuspended in hybridization buffer (0.25 M Na2HPO4, 0.25 M NaH2PO4, 1 mM EDTA, 1% bovine serum albumin, 7% sodium dodecyl sulfate [SDS]). M13 single-stranded DNAs containing sequences complementary to the pre- and postsequence transcripts of pHyb were prepared by infecting log-phase JM109 cells with M13 clones. The supernatant of the overnight culture was collected, and the phage were precipitated with 3.2% polyethylene glycol 8000-2.4% NaCl and then resuspended in 10 mM Tris-1 mM EDTA (pH 8) for the isolation of DNA by phenol-chloroform extraction.
The Nytran membrane for blotting was soaked first in deionized water and then in 8× SSC (1× SSC is 0.15 M NaCl plus 0.015 M sodium citrate). Together with two pieces of similarly prewet filter paper, the membrane was assembled into the slot blot apparatus. Wells were washed twice with 8× SSC, and then 10 µg of single-stranded DNA in 250 µl of 8× SSC was applied to wells. The blot was washed twice with 250 µl of 8× SSC, and DNA was immobilized on the membrane by UV cross-linking. The blot was prehybridized in hybridization buffer overnight at 65°C, and then the sample was applied to the blot and hybridized overnight at 65°C. The blot was washed twice with 2× SSC-0.1% SDS at 65°C for 15 min and twice with 0.1× SSC-0.1% SDS at 65°C for 15 min. Finally, the blot was wrapped in Saran Wrap on wet Whatman 3MM filter paper for exposure to X-ray film. The resulting image was quantitated by densitometry and captured for publication by using an AGFA Arcus II scanner with FotoLook 95 (version 2.0) software.G-free nuclear run-on transcription assay.
Forty microliters
of nuclei (105 nuclei/µl) was mixed with 40 µl of 2×
transcription buffer to give final concentrations of 5 mM Tris-Cl (pH
8), 2.5 mM MgCl2, 300 mM NH4SO4,
0.5 mM ATP, 0.5 mM UTP, 2.5 mM dithiothreitol, 1,000 U of
T1 RNase, 32 U of RNase inhibitor, and 30 µCi of
[
-32P]CTP and incubated at 37°C for 45 min. The
reaction was then brought up to 2.5 mM CTP (nonradioactive) and 0.025 mM GTP for a 12-min cold chase. The chase is necessary to maximize the
yield of cassette transcripts. Apparently, the concentration of CTP (nanomolar levels) at the labeling step is insufficient to permit quantitative elongation to the ends of the cassettes within an experimentally reasonable length of time. The cold chase presumably allows all cassette transcripts to reach full length for accurate quantitation by gel electrophoresis.
| |
RESULTS |
|---|
|
|
|---|
G-free cassette assay for transcription termination. The method used is based on transfection, followed by run-on transcription with a plasmid such as pORgf3·2 (Fig. 1A). This plasmid contains two G-free cassettes (56) of different lengths arranged in tandem and separated by a multiple cloning site into which foreign DNA can be inserted. When they are transcribed in the G-free orientation, G-free cassettes give rise to transcripts devoid of G residues and immune to digestion by the G-specific nuclease, T1. After transfection, any transcription termination that occurs between the two cassettes in vivo leads to a lower polymerase density in the downstream member of the duo. The relative polymerase densities in the two cassettes can be determined by isolating nuclei, carrying out run-on transcription in the presence of T1 RNase, running the surviving G-free transcripts from the two cassettes on a gel, quantitating the densities of the two G-free bands, and establishing their ratio. A similar rationale has previously been applied to the study of premature termination in vitro in nuclear extracts (33, 37).
To illustrate the use of pORgf3·2 in measuring transcription termination, the results of a typical experiment are shown in Fig. 1B. The ABCDE region of the chicken
H-globin gene
transcription unit (Fig. 2) encompasses
the 3' end of the gene itself and flanking sequences (CDE) in which
transcription termination is known to occur in vivo (50).
The ABCDE region was inserted between the two cassettes of
pORgf3·2 and transfected into COS cells in parallel with a
control plasmid containing a prokaryotic DNA insert (neo, a
segment from the neomycin resistance gene). By our convention, we refer
to these plasmids as p<ABCDE> and p<neo>, where < and > denote the pre- and postcassettes respectively. We call the two
cassettes and the region between them collectively the cassette window.
Figure 1B, lane 1, shows that with neo in the cassette window,
there was little effect on the abilities of polymerases to read through
from one cassette into the next ("uncorrected readthrough").
Moreover, when the p<neo> results were normalized to the results (not
shown) of a parallel transfection with pORgf3 · 2 lacking any
insert, there was no termination that is attributable to the
prokaryotic DNA at all (normalized readthrough) (Fig. 1B). In contrast,
Fig. 1B, lane 2, shows that with the ABCDE region of the
H-globin gene in the cassette window, virtually none of
the polymerases that transited the first cassette succeeded in reaching
the second. These results confirm both the validity of the assay and
the functionality of the
H-globin gene terminator in the
pORgf3·2 background.
|
-32P]-labeled
nucleotides, each polymerase resumes elongation for a short distance,
generating a radioactively labeled transcript tag that signifies its
location. Under the high-salt run-on conditions used here, all
polymerases are believed to resume elongation in vitro, regardless of
whether they were elongating or pausing in vivo, and the results
obtained agree with those of other methods for locating and counting
polymerases (31, 46).
In conventional run-on procedures, the labeling step is followed by
hybridization of the in vitro-labeled RNA to various probes that have
been blotted on membranes. The results of an experiment illustrating
this approach are shown in Fig. 1D. The experiment was exactly parallel
to that of Fig. 1B except that the DNA segments ABCDE and neo were
inserted into pHyb (Fig. 1C) and run-on transcripts were quantitated by
slot blot hybridization rather than by G-free analysis. Figure 1D shows
that there was strong hybridization of the run-on RNA to the
postsequence probe for the neo insert but poor hybridization to
the postsequence probe for the ABCDE insert. These results confirm that
the ABCDE region is a strong terminator of transcription, which is
consistent with the G-free analysis (Fig. 1B).
It is evident from Fig. 1B that gel lanes from the G-free procedure
exhibited some background. Concerned that the background may have been
due to nonspecific degradation of G-free transcripts, we checked
several parameters. First, we found that the background was of
cellular, not plasmid, origin because it was present whether cells had
been transfected with a plasmid or not (Fig.
3A). Second, the transcription of G-free
RNA is
-amanitin sensitive, whereas the transcription of background
RNA is
-amanitin and
5,6-dichloro-1-
-D-ribofuranosylbenzimidazole-resistant (Fig. 3B and unpublished observations). Therefore, G-free transcription is due to RNA polymerase II, whereas background transcription is not.
Finally, in contrast to G-free RNA, the efficiency of labeling of
background RNA did not depend on a cold-CTP chase (data not shown),
suggesting that the background does not arise from long tracts of
genomic G-free sequence. Nevertheless, the background pattern was
fairly consistent, notably the cluster of bands indicated by brackets
in Fig. 3A and B. The T1 resistance of background RNA may
reflect special structural characteristics that protect this RNA from
the T1 nuclease in our procedure.
|
|
Deletion analysis of the
H-globin gene transcription
termination region.
The ABCDEF region of the
H-globin transcription unit (Fig. 2) encompasses the 3'
end of the gene itself (AB) and the termination region (CDEF). In order
to locate the elements required for transcription termination, we
carried out a deletion analysis by progressively removing DNA from the
3' end of the region and then assaying for termination efficiency in
pORgf3·2. Figure 5A shows that the
complete region, ABCDEF (lane 1), acted as an effective terminator when it was placed in the cassette window of pORgf3·2. Sequential
removal of segments F, E, and D had no effect on termination efficiency (Fig. 5A, lanes 2 through 4). However, the removal of segment C
drastically reduced termination efficiency (Fig. 5A, lanes 5 and 6).
Apparently half of the overall termination in ABCDEF occurs within
segment C.
|
H-globin
termination region as previously reported for the murine
maj-globin gene (60). Further deletion
analysis showed that AB was required for efficient termination (Fig.
5A, lane 8) but that EF was not (lane 9). Thus, either segment C or D
can support efficient termination (lanes 4 and 9) but not in the
absence of AB (lane 8). Moreover, both segments A and B are required
for efficient termination; segment B alone is only partially effective
in potentiating termination in segment D (Fig. 5A, lanes 9 through 11).
Thus, the complete terminator is apparently tripartite. Elements in
both segments A and B were necessary for maximum efficiency (Fig. 5A,
lanes 9 through 11), and downstream DNA was required (lanes 4 and 9 versus lane 5). The element in segment B is presumably the poly(A)
signal, whereas that in segment A is an unidentified termination
enhancer. The enhancer could be the 3' splice site contained in segment
A (Fig. 2), which would be expected to enhance termination by
increasing the efficiency of polyadenylation (26, 35, 45,
64), or it could be a relative of another recently described
enhancer of termination (17). An investigation into the
nature of each of these three elements is described in more detail
below.
None of the constructs (Fig. 5A) allowed complete
readthrough. This is what would be expected of a typical RNA
polymerase II transcription unit, where the promoter dispatches
both highly processive and poorly processive polymerases to the
template downstream (see the introduction). Thus, about half of
the polymerases entering the cassette window (located only about 165 bp
downstream of the promoter in pORgf3·2) were apparently of the
poorly processive type, failing to traverse the DNA insert even when it
was taken from the body of the
H-globin gene (i.e.,
segment A) (Fig. 5A, lane 6).
If the failure to exceed 50 to 60% readthrough (Fig. 5A) reflects
premature termination by a separate class of polymerases, then
eliminating that class of polymerases before they enter the cassette
region should allow the remaining polymerases to register complete
readthrough. We used segments A and D to test this prediction. Neither
segment A nor D contains a poly(A) site, yet neither allowed more than
50 to 60% readthrough (Fig. 5A, lanes 6 and 11). We then modified
construct p<A> by inserting segment D between the promoter and the
cassette window. As expected, those polymerases that did traverse D
were also capable of traversing A so that nearly 100% readthrough was
recorded (Fig. 5B, lane 1). We regularly obtained this type of result
with various modular rearrangements of DNA segments in our system
(e.g., Fig. 6B; compare lane 7 with lane
5); therefore, we conclude that this reflects the presence of about
50% poorly processive polymerases in the promoter-proximal regions of
our constructs.
|
A-Globin gene transcription termination region.
The properties of the
A-globin gene transcription
termination region resemble those of the
H-globin gene.
Figure 5C shows the results of a deletion analysis of the WXYZ region
(Fig. 2) of the
A-globin gene. Any combination of
segments X, Y, and Z can be deleted with no impairment of termination,
provided that at least one such segment remains. However, deletion of
the complete XYZ region, leaving only segment W, which contains the
poly(A) signal, substantially impairs termination. Thus, as for the
H-globin gene, there is redundancy of termination
potential in the 3'-flanking region of the
A-globin
gene.
Prokaryotic DNA can support poly(A)-dependent termination.
Since poly(A)-dependent termination appears to occur somewhat
promiscuously in the 3'-flanking DNA of the
H- and
A-globin genes, we wondered whether prokaryotic DNA
would also support termination. Figure 6A shows that three arbitrarily
selected segments of prokaryotic DNA supported substantial amounts of
transcription termination when they were appended to the
H-globin AB segment, which contains the poly(A) site. To
confirm that transcription termination indeed is potentiated in
prokaryotic DNA and does not occur within segment AB itself, we removed
segment AB from the cassette window and placed it upstream, yielding
split constructs (Fig. 6B). Figure 6B, lane 1, shows that neo DNA alone was relatively transparent to the passage of polymerases. Even poorly
processive polymerases got through for the most part. However, when
segment AB was placed upstream of the cassette window, neo became a
respectable terminator (Fig. 6B, lane 2). Note that the 44%
termination registered by pAB<neo> did not include any poorly processive polymerases being dislodged, since most of these were removed by the upstream A segment and never reached the cassettes (see
above). Thus, the even-numbered lanes in Fig. 6B record the actual
percentages of processive polymerases that succumbed to poly(A)-dependent termination. Accordingly, it can be seen that all
three examples of prokaryotic DNA supported termination potentiated by
the AB segment upstream. Figure 6B, lane 7, shows that the presence of
the B segment was required for termination to be potentiated. Since the
poorly processive polymerases are removed by the A region, most of the
polymerases that survive region A also read through cat. Hence,
pA<cat> registered more readthrough (Fig. 6B, lane 7) than did
p<cat>, its parent (lane 5).
H-globin B fragment. Therefore, we replaced fragment
B in some of the constructs described above with a synthetic DNA
fragment containing a 43-bp sequence (P) taken from the core poly(A)
signal of the SV40 early transcription unit (11). Lanes 1 and 2 of Fig. 6C show that the core poly(A) signal from SV40 supported essentially the same level of termination in prokaryotic DNA as did the
B fragment of the
H-globin gene. To further evaluate the
poly(A) dependence of termination in prokaryotic DNA, we mutated the
SV40 poly(A) signal by deleting its downstream U-rich element. This
mutation substantially weakens the poly(A) signal, seriously impairing
its ability to direct 3'-end processing and to potentiate termination
in its native context (11). Fig. 6C, lane 3, shows that this
weakened poly(A) signal (P
) was also seriously impaired in its
ability to direct termination in prokaryotic DNA. Finally, Fig. 6C,
lane 4, shows (as expected) that when the poly(A) signal was
inactivated by inversion, essentially no termination occurred. These
results firmly establish that prokaryotic DNA supports genuine
poly(A)-dependent termination in our system. This contrasts with the
report of Tantravahi et al. (60), who observed no
termination in prokaryotic (cat) DNA when it was placed
downstream of the mouse
maj-globin poly(A) region.
Postprocessive trans-poly(A) polymerases resemble preprocessive polymerases. One model for poly(A)-dependent termination, originally proposed by Logan et al. (36), is that the elongation complex is returned to a state that resembles its original preprocessive form. Since DNA sequences vary in how permissive they are to elongation, a prediction of this model is that promoter-proximal preprocessive polymerases resemble trans-poly(A) postprocessive polymerases in their ability to negotiate different DNA sequences. [We refer to polymerases that have crossed the poly(A) site as trans-poly(A) polymerases.] The results in Fig. 6B confirm this prediction. Figure 6B, lanes 1, 3, and 5, show that the prokaryotic DNA segments neo, lac, and cat decreased in their permissivity to elongation of promoter-proximal polymerases in that order. Similarly, lanes 2, 4, and 6 of Fig. 6B show that the permissivities to elongation of trans-poly(A) polymerases decreased in the same order.
We refer to permissive sequences like neo as being smooth and to poorly permissive ones like cat as being bumpy. This emphasizes that the sequence features responsible for these differences in permissivity are likely to be mundane, multiple, and minor. It is unlikely, for example, that the cat DNA segment, taken from the coding region of a prokaryotic gene, would contain any sequence elements of special significance to transcription termination in a eukaryotic system. Therefore, the similar responses to bumpiness of both promoter-proximal and trans-poly(A) polymerases suggest that the latter are converted to a state that is capable of terminating in response to rather general features of DNA sequence.Termination is negligible within G-free cassette sequences. Since prokaryotic DNA dislodges poorly processive polymerases and supports poly(A)-dependent termination, we decided to evaluate polymerase processivity and termination within the artificial G-free cassette sequences themselves (Fig. 3C and D and 6D). Cassette-mediated termination could conceivably occur both in vivo, thereby decreasing the magnitude of the cassette signal, and in vitro (during the run-on procedure), thereby decreasing the clarity of the cassette signal by smearing the band to lower molecular weights. Figure 3C and D show that there was no detectable termination during the run-on procedure in vitro since even a 750-bp-long segment of G-free DNA yielded a clear, sharp, discrete band.
Figure 6D, lanes 1 through 3, shows that cassette-mediated termination in vivo also was minimal. The construct in Fig. 6D, lane 1, pORgf3·2, provided information on the amount of termination by poorly processive polymerases, whereas the construct in lane 2 provided an estimate of the amount of poly(A)-dependent termination by processive polymerases. In both cases, the effect was only about 10%. Specifically, this means that there is only about 10% excess cassette-mediated termination in the postcassette in vivo. (Excess cassette-mediated termination in the precassette in vivo is invisible in this procedure except as a general decrease in overall signal.) To obtain an additional estimate of the amount of termination that actually occurs within G-free cassette DNA in vivo, we constructed p<3>1·2 (Fig. 6D, lane 3). This construct contains 380 bp of G-free cassette DNA (imbedded in 166 bp of multiple cloning site DNA) as the insert in its cassette window. The construct was also made with a shortened precassette (130 bp), thus moving the insert closer to the promoter to maximize exposure to poorly processive polymerases. Figure 6D, lane 3, shows that inserted G-free cassette DNA and increased proximity to the promoter resulted in only a small additional increment in termination. Thus, G-free cassette DNA is relatively permissive to the passage of polymerases. Cassette-mediated termination in the postcassette introduces, at worst, a correctable systematic overestimate of termination, which is significant only for inserts that are poor terminators. Because the results discussed above show that the effect is small, we did not attempt to correct for this effect in these studies. In addition to the controls described immediately above relevant to the postcassette, we also wished to test directly our premise that variations in termination within the precassette are invisible in this procedure (see above). We therefore measured termination across neo of poorly processive (<neo>) and processive but poly(A)-potentiated (AB<neo>) polymerases by using the shortened (130-bp) cassette in the pre- sequence position. The results (Fig. 6D, lanes 4 and 5) were virtually identical to those previously obtained with the 376-bp precassette (Fig. 6B, lanes 1 and 2). Taken together, these data show that the G-free cassette procedure for measuring termination is quantitatively reliable.Termination commences promptly after the poly(A) site.
It has
previously been suggested that after the poly(A) site of the mouse
maj-globin gene, there is an obligatory spacing
requirement of about 500 bp before termination can begin
(60). Indeed, in many transcription units, the polymerase
may traverse several kilobases downstream of the poly(A) site with
little or no termination at all (10, 44). Since in some of
our constructs termination appeared to occur efficiently with distances
between the poly(A) site and the downstream cassette that were close to
the minimum mentioned above (Fig. 5A, lane 9; Fig. 6A, lane 3) (see
Table 1 for distances), we decided to address this issue. For this
purpose, we prepared p<ABC1> and p<ABC2>,
which contain the upstream and downstream halves, respectively, of the
H-globin 3'-flanking segment C. In both of these
constructs, the distance between the poly(A) site and the downstream
cassette is 500 bp (Table 1). Figure 7A,
lanes 2 and 3, shows that both constructs exhibited efficient
termination, suggesting that any spacing requirement is either small or
absent. Moreover, both constructs exhibited similar levels of
termination, reinforcing the view that there is no specific DNA
sequence requirement for termination in this system.
|
H-globin 3'-flanking region
lacks a discrete terminator, we cannot check directly for a minimum
spacing requirement simply by moving a terminator element progressively closer to the poly(A) site. Moreover, since a specific terminator element is absent, termination occurs gradually over several hundred base pairs of DNA (44a). Therefore, in addition to any
spacing after the poly(A) site that may be required before the
termination process can be initiated (60), a sufficient
length of DNA must be present prior to the downstream cassette in order
to capture a sufficient number of termination events within the
cassette window for reliable measurement. Within these limitations, we pursued further the question of the spacing requirement in the following three ways. First, we carried out a variation of the direct
test (Fig. 7A) by inserting B just upstream of C2 in
pD<C2> to give pD<BC2>. A comparison of
lane 4 to lane 1 of Fig. 7B shows that despite a distance of only about
500 bp between the poly(A) site and the downstream cassette,
significant termination occurred. Second, noting that segment A
enhanced poly(A)-dependent termination (Fig. 5A; compare lanes 9 and
10) but did not itself potentiate any termination (Fig. 6B, lane 7, and
C, lane 4), we placed segment A in front of the cassette window of
pD<BC2> to give pDA<BC2>. A comparison of
lanes 4 and 5 in Fig. 7B shows that the poly(A)-dependent termination
in these constructs was enhanced, once again despite a distance of only
500 bp. For our third test, we reasoned that if there is a spacing
requirement for poly(A)-dependent termination in subfragment
C2, then moving the poly(A) site next to subfragment C2 from farther upstream should reduce termination
efficiency. To test this prediction, we compared termination across
subfragment C2 in pDAB<C2> (Fig. 7B, lane 3)
with that in pDA<BC2> (lane 5), in which the poly(A) site
is closer to subfragment C2 by more than 350 bp. It is
clear that moving the poly(A) site closer to subfragment C2
had no detrimental effect on termination and, if anything, may have
caused termination efficiency to increase. Based on a total of four
independent comparisons, we therefore conclude that any spacing
requirement for poly(A)-dependent termination in the chicken
H-globin gene is either small or absent.
SV40 early transcription unit lacks a termination enhancer and
fails to induce termination in downstream DNA.
The results
described so far reveal the
H-globin system to be
dramatically different from most of the other poly(A)-dependent terminators that have been characterized. For example,
poly(A)-dependent termination after the SV40 early transcription unit
does not commence immediately downstream of the poly(A) site. Rather,
transcription continues with scarcely any decrease until a special
termination element is encountered (10). Thus, both
H-globin transcription termination and SV40 early
transcription termination are poly(A) dependent, but SV40 early
termination does not occur without an additional termination signal in
the downstream DNA. In contrast,
H-globin termination
begins soon after the poly(A) site and requires no additional
downstream signal.
H-globin and SV40 early
transcription units which may contribute to the contrasting modes of
termination are the following: (i) the polyadenylation signal, (ii) the
3'-terminal exon sizes (the size of the final
H-globin
exon [Fig. 2] is less than half the average for terminal exons
[6], whereas the size of the last SV40 early exon is more than three times the average [9], and (iii) the
promoters (transcription of the
H-globin constructs in
our experimental system is driven [ironically] by the SV40 early
promoter, whereas transcription of the SV40 early region in the
constructs of Connelly and Manley [10] is driven by
the adenovirus late promoter). We show below that none of these
differences in the structure of the two transcription units is
responsible for their differences in termination mechanism.
We can already rule out the poly(A) signals themselves as being
responsible for the terminator-dependent (i.e., SV40) versus terminator-independent (i.e.,
H-globin) mechanisms.
Figure 6C shows that in the presence of the A fragment from
H-globin, the SV40 early poly(A) signal potentiated
termination in prokaryotic DNA without the need of any additional
downstream information. Therefore, the SV40 early poly(A) signal can
potentiate terminator-independent transcription termination in the
appropriate sequence background.
To assess the possible role of exon size in directing termination, we
took advantage of pRSVcat (24). This expression vector is
based on a derivative of the SV40 early transcription unit in which the
size of the terminal exon has been reduced by more than 1.3 kb
(25). We inserted a G-free cassette upstream of the SV40
early poly(A) site of pRSVcat, and downstream we inserted some spacer
DNA plus another G-free cassette. These insertions were well away from
the sequences required for full polyadenylation activity
(11). The resulting plasmid was
pAsv<BsvL> (Fig.
8, lane 2), a construct analogous to
pDA<BC2> (Fig. 7B, lane 5, and 8, lane 1). Figure 8, lane
2, shows that pAsv<BsvL> exhibited complete
readthrough, in contrast to pDA<BC2>, which exhibited substantial termination (Fig. 8, lane 1).
|
H-globin mechanisms of termination is therefore
determined by some property of the upstream A portions of these two
transcription units. We have already shown that the A region of the
H-globin gene contains an enhancer of termination.
Apparently, SV40 lacks this enhancer and depends instead on a
downstream element (a CCAAT box in the constructs of Connelly and
Manley [10, 12]) to effect the actual termination
event.
| |
DISCUSSION |
|---|
|
|
|---|
We used an improved method of transcription run-on analysis to
characterize the transcription termination elements of the chicken
H-globin gene. We have shown that transcription
termination for this gene is poly(A) dependent and that it requires a
termination enhancer upstream of the poly(A) site for maximum
efficiency. No additional downstream termination signal is required;
even prokaryotic DNA downstream of the poly(A) site supports
termination.
This mode of RNA polymerase II termination differs dramatically from
that reported for the SV40 early and mouse
maj-globin
genes, in which no detectable termination downstream of the poly(A)
site occurs, even for several kilobases in the case of SV40, until
special termination elements are encountered (10, 12, 60).
We have established that these differences are real by using our
methods of measurement to confirm the lack of termination downstream of
the poly(A) site for SV40 and, conversely, by showing that the same
prokaryotic sequence (cat) that fails to support termination in
the mouse globin system does so efficiently in the chicken
H-globin system. We conclude that there are at least two
distinct types of poly(A)-dependent termination. In poly(A)-assisted
termination (the SV40 type), the poly(A) signal renders the polymerase
susceptible to a downstream termination signal but does not otherwise
impart a significant decrease in processivity to the polymerase. In
poly(A)-driven termination (the
H-globin type), the
poly(A) signal goes beyond assisting downstream termination and
actually revokes the processivity of the transcription complex, thus
guaranteeing rapid, signal-independent termination on almost any
downstream DNA with little regard for sequence.
G-free cassette analysis of run-on transcription. The studies reported here were facilitated by a new method of run-on transcription analysis that makes use of G-free cassettes (56). As described above, there are considerable practical advantages to the G-free method, not the least of which are convenience and turnaround time. In addition, the inherent modularity of the method facilitates the rearrangement of functional modules, as we did throughout this work in order to gain new insights into the interactions among different elements.
There are also substantial theoretical advantages to the G-free procedure. In conventional run-on procedures, the perceived locations of polymerases vary with the duration of the in vitro elongation step because polymerases can move both into and out of probe regions during elongation (58). This effect is unimportant only if polymerase densities and elongation rates are uniform across the entire region being assayed, a condition not commonly fulfilled in transcription termination studies. For G-free analysis, the 32P labeling step in the run-on procedure is carried out in the absence of GTP so that no upstream polymerases can invade the cassettes in vitro and no polymerases in the cassettes can leave. Hence, the signal for each cassette arises strictly from the polymerases that are trapped within the cassette at the time of nuclear isolation. Moreover, since elongation rates within the two cassettes are likely to be essentially the same, we assume that the cassettes act as unbiased receptacles for polymerases in vivo. An additional theoretical advantage of the G-free procedure is that only genuine termination, not pausing, is predicted to give a decrease in the postcassette/precassette ratio. To illustrate, consider a strong pause sequence inserted at the KpnI site of the conventional hybridization vector pHyb (Fig. 1C). Any stacking of polymerases behind a transcription complex immobilized by pausing would result in increased polymerase density on the upstream presequence. The resulting increase in run-on transcript production from this region would then decrease the postsequence/presequence hybridization ratio, a result indistinguishable from termination. In contrast, similar stacking of polymerases into the precassette of the G-free vector pORgf3·2 (Fig. 1A) would reduce rather than increase the precassette signal because run-on transcription for the G-free procedure, unlike that for a conventional hybridization assay, is carried out in the absence of GTP. Any polymerases accumulated over a pause region in inserted DNA would therefore remain immobilized during the run-on step and physically prevent any polymerases that are stacked back into the cassette from yielding full-length cassette transcripts (59). Consequently, the postcassette/precassette ratio for the assay would increase rather than decrease. This scenario illustrates that in the G-free assay, only termination can reduce the postcassette/precassette ratio. We therefore regard any reduction in this ratio to be indicative of genuine termination.Poly(A)-driven transcription termination.
The main conclusion
from this work is that the 3' end of the chicken
H-globin gene directs RNA polymerase II to terminate
transcription beginning directly downstream of the poly(A) site,
without the need for any further termination signals in the downstream
DNA. This mode of termination is consistent with the model for
poly(A)-dependent termination proposed by Logan et al. (36).
The key feature of this model is that an event triggered by the poly(A)
signal causes the processive transcription elongation complex to revert
to a state of poor processivity. An obligatory correlate of the model is that termination downstream of the poly(A) site is a consequence simply of destabilization of the elongation complex and does not require any additional termination signal in the downstream DNA.
H- and
A-globin genes, virtually any segment of the 3'-flanking
DNA can support termination downstream of their respective poly(A)
sites. The implication that termination for these genes may, in fact, be signal independent downstream of the poly(A) site was confirmed by
the results shown in Fig. 6, which demonstrated that even randomly selected segments of prokaryotic DNA supported termination.
The prokaryotic DNA segments neo, lac, and cat varied in
the effectiveness with which they supported the imposed poly(A)-driven termination (Fig. 6A, lanes 2 through 4, and B, lanes 2, 4, and 6).
This suggests that among the influences which increase the probability
of termination of destabilized elongation complexes are some rather
general features of DNA sequence. If the trans-poly(A) poorly
processive transcription complexes resemble the promoter-proximal poorly processive transcription complexes, as suggested by Logan et al.
(36), then one would expect these two poorly processive versions of the transcription complex to respond similarly to these
general DNA sequence features. Figure 6B shows that this was indeed the
case, with promoter-proximal complexes exhibiting decreasing
stabilities on prokaryotic DNAs in the same order (neo, lac,
and cat [lanes 1, 3, and 5, respectively]) as did trans-poly(A) transcription complexes (lanes 2, 4, and 6, respectively). This is
predicted by the model of Logan et al. (36) but is not
expected by (though not inconsistent with) other models (11,
51).
If transcription terminates principally because of general
destabilization of the transcription complex (loss of processivity) rather than in response to a specific termination signal, then one
would expect the polymerase density after the poly(A) site to decay
stochastically to zero. The rate of this decay would determine the
distance required for termination to reach completion. Thus, the length
of a DNA fragment (rather than any specific signal it might contain)
and general features of DNA sequence (as yet undefined) are the
principal operative parameters that govern termination efficiency in an
assay for this type of termination. Accordingly, p<ABC> (Fig. 5A,
lane 4), with 800 bp between the poly(A) site and the postcassette
(Table 1), terminated more effectively than did p<ABC1>
and p<ABC2> (Fig. 7A, lanes 2 and 3, respectively), with
only 500 bp between the poly(A) site and the postcassette. Indeed, 700 to 800 bp appears to be a generally sufficient length of 3'-flanking
DNA within which to allow complete termination to occur (Fig. 5A, lanes
4 and 9, and C, lanes 2, 4, and 5). This is consistent with the results
of genomic run-on transcription of the
H- and
A-globin genes in chick erythrocyte nuclei (Fig. 2) when
the fact that run-on transcripts can extend several hundred base pairs past the positions of stalled polymerases is taken into account (58). Of course, it is not surprising that the 3'-flanking
DNAs of the
H- and
A-globin genes, though
they lack specific termination signals, were somewhat more efficient
than was the prokaryotic DNA we tested in supporting transcription
termination (compare Fig. 5 and 6).
Termination enhancer.
For maximum efficiency,
H-globin termination must be enhanced by an upstream
element in segment A of the
H-globin map (Fig. 5A, lanes
9 and 10, and 7B, lanes 4 and 5). This element did not by itself
potentiate termination (Fig. 6B, lanes 5 and 7, and 7B, lanes 1 and 2)
but acted only through a downstream poly(A) signal (Fig. 6B, lanes 6 and 7, and C, lanes 2 and 4, and 7B, lanes 2 and 3). This element can
legitimately be regarded as an enhancer because it operated similarly
on two unrelated poly(A) sites (Fig. 6B, lane 6, and C, lane 2) and
because it functioned well even when its distance from the poly(A) site was increased by an insertion of more than 375 bp of foreign DNA (Fig.
7B, lane 5).
H-globin poly(A) signal (segment B)
led to no significant loss of chloramphenicol acetyltransferase
activity in expression assays, showing that the
H-globin
poly(A) site is not dependent on a special enhancer for its activity
(34a). These results are consistent with those for the only
other upstream enhancer of termination so far identified (17), which has little discernible effect on polyadenylation efficiency but contributes strongly to transcription termination.
Poly(A)-assisted transcription termination.
Connelly and
Manley, studying the SV40 early transcription unit, were the first to
provide a complete description of the elements required for
poly(A)-dependent termination (10-12). They showed that for
an SV40-adenovirus recombinant transcription unit, both a functional
poly(A) signal and a downstream terminator are required to ensure
complete transcription termination. In contrast to the
H-globin system described here, they showed that there
is essentially no termination downstream of the poly(A) site, as
measured by run-on transcription, unless the terminator is present
(10, 12). Moreover, an S1 protection assay for genome-length
transcripts within nuclear RNA showed that the average polymerase
continued for more than 4 kb past the poly(A) site when the terminator
was removed. Thus, the SV40 early poly(A) site can assist a downstream element in terminating transcription but is ineffective in directing termination on its own. The nature of the interaction between the
poly(A) signal and the transcription apparatus in poly(A)-assisted termination is thus quite different from that in the poly(A)-driven mode of termination, which is characteristic of the
H-globin gene.
cis-acting elements, not the poly(A) signal itself,
govern the mode of poly(A) dependence.
The most thoroughly
characterized of the poly(A)-assisted class of transcription
terminators is the one that contains the SV40 early poly(A) signal
(10-12). In the absence of an auxiliary downstream
termination element, polymerases cross the SV40 early poly(A) signal
and continue for at least several kilobases with no detectable
termination. Nevertheless, we found that a PCR copy of the same signal
directed immediate, signal-independent termination of the
poly(A)-driven type when it was placed in the context of chicken
H-globin upstream sequences (Fig. 6C). Evidently, the
SV40 poly(A) signal, limited to a facilitating function in its native
context (10, 12) (Fig. 8, lanes 2 and 4), became the final
instrument of termination when it was placed in a chicken background
(Fig. 6C).
2a poly(A) signal, drove
substantial downstream termination, whereas the deletion variant SphX,
lacking only a small segment of upstream sequence, exhibited complete
readthrough (17). Similarly, a DNA segment, designated
µm
, containing the immunoglobulin M membrane-form µ chain
poly(A) signal directed efficient downstream termination, whereas an
upstream substitution variant, 3'SP
, did not (61). In
both of these cases, it was verified that polyadenylation efficiency was not altered. Therefore, the upstream sequences are genuine termination elements, not merely modulators of polyadenylation efficiency.
Mutations within the poly(A) signal region can also influence
termination efficiency, as illustrated by a recent study of yeast
(7). Thus, pNU, containing the poly(A) signal from the ura4 gene of Schizosaccharomyces pombe, directed
efficient termination apparently at a discrete element that was about
200 bp downstream of the poly(A) cleavage site. In contrast, pNUM,
bearing several mutations in the vicinity of the poly(A) signal,
terminated only gradually throughout a region that extended an
additional 400 bp downstream. However, pNU and pNUM exhibited
equivalent polyadenylation efficiencies (7).
Taken together, all of these results indicate that the reported
correlation between poly(A) sit