-Globin Gene
Michael J. Dye,
Celina Lafaille, and
Nick J. Proudfoot*
Sir William Dunn School of Pathology, University of Oxford, Oxford, United Kingdom
Received 17 September 2004/ Returned for modification 18 October 2004/ Accepted 23 December 2004
| ABSTRACT |
|---|
|
|
|---|
-globin genes form part of a 5-kb tandem duplication within the ß-globin gene cluster on chromosome 11. Despite a high degree of identity between the two genes, we show that while the upstream G
-globin gene terminates transcription efficiently, termination in the A
gene is inefficient. This is primarily due to the different strengths of the poly(A) signals of the two genes; G
-globin has a functionally stronger poly(A) signal than the A
gene. The probable cause of this difference in poly(A) efficiency characteristics lies with a number of base changes which reduce the G/U content of the GU/U-rich region of the A
poly(A) signal relative to that of G
. The 3' flanking regions of the two
-globin genes have similar abilities to promote transcription termination. We found no evidence to suggest a cotranscriptional cleavage event, such as that seen in the human ß-globin gene, occurs in either
-globin 3' flank. Instead we find evidence that the 3' flank of the G
-globin gene contains multiple weak pause elements which, combined with the strong poly(A) signal the gene possesses, are likely to cause gradual termination across the 3' flank. | INTRODUCTION |
|---|
|
|
|---|
The 3' end of all eukaryotic mRNAs, with the exception of that of the replication-dependent histone-encoding mRNAs, is defined by a tail of adenosine residues (approximately 200 in human mRNAs). These are not encoded but rather are added by the template-independent poly(A) polymerase following cleavage at a site specified in the primary transcript. The poly(A) site lies between a highly conserved hexanucleotide (AAUAAA) and a more poorly defined U- or GU-rich region. The polyadenylation machinery consists of a number of proteins which together recognize the poly(A) site, cleave the transcript, and add the adenosine residues. Cleavage and polyadenylation specificity factor interacts with the AAUAAA hexanucleotide, while cleavage stimulatory factor (CstF) binds to the GU/U-rich region; together, these complexes define the cleavage site. Two additional proteins, cleavage factors I and II, are needed for endonucleolytic cleavage to occur. Following the generation of the 3' end by cleavage, poly(A) polymerase adds approximately 200 A residues to generate the mature mRNA (for a review of the biochemistry of polyadenylation, see reference 30).
It has been appreciated for a number of years that 3' processing is an intimate component of the transcription termination mechanism for protein-coding genes; in the absence of a functional poly(A) signal, transcription does not terminate (11, 15). More-specific experiments using yeast have shown that cleavage, rather than polyadenylation factors, is necessary (8). Termination sites and signals have been mapped for a number of genes and fall into several categories. Pausing at AU-rich sequences has been implicated in termination in the human
-globin (14) and mouse ß-major globin (27) genes. Binding sites for transcription factors, for example, a MAZ binding site downstream of the human complement C2 gene (6), also have pausing effects which lead to termination, although this is by no means ubiquitous among all protein binding sites. Data such as these have led to a number of different transcription termination models, including the antiterminator and the torpedo models. The former proposes that recognition of the poly(A) site causes a conformational change in the transcription complex which renders it competent to terminate (17). The demonstration that components of the poly(A) machinery interact with the carboxyl terminal domain of the largest Pol II (6a, 16a) suggested a biochemical mechanism for the antiterminator model; the departure of poly(A) factors from the transcription complex following the generation of the 3' processing signals would be the change that renders the complex termination competent. The torpedo model suggests a rather different role for the poly(A) signal during termination; following cleavage at the poly(A) site, the downstream, nascent RNA fragment is left with an uncapped 5' end, which renders it susceptible to exonucleases that degrade the molecule, culminating in termination when the degradation reaches the transcription complex (9, 25).
More-recent work has revealed additional aspects of the transcription termination mechanism. With the human ß-globin gene it was shown that a cotranscriptional RNA cleavage (CoTC) event occurs in the 3' flank and is an essential component of transcription termination (12). Meanwhile, in yeast it has been shown that chromatin structure can play a role in termination (2). Models which incorporate these new discoveries are presently being developed (for reviews, see references 24 and 26).
We have now studied transcription termination in the human
-globin genes. Since these genes form part of a recent tandem duplication and are closely related, any similarities or differences in the ways they terminate may provide an insight into termination mechanisms. We have shown that in the G
-globin gene the combination of multiple weak pause sites and a strong poly(A) signal promotes efficient transcription termination. By contrast, termination in the A
-globin gene is less efficient, largely due to its weak poly(A) signal.
| MATERIALS AND METHODS |
|---|
|
|
|---|
-globin minigene clones used in this work were transcribed from the human immunodeficiency virus (HIV) long terminal repeat (LTR) promoter (10) from nucleotides (nt) 161 to +80 relative to the transcription start site. To this were fused exons 1 and 2 and the first 17 bp of exon 3 derived from the A
gene (position +35 to a BstEII site at +1373 relative to the transcription start site), since the two
-globin genes are almost identical over this region. The remainder of exon 3 and various regions of flanking DNA from the two
-globin genes were generated by PCR using PfG DNA polymerase (Stratagene) and primers which introduce unique restriction sites for subsequent cloning into this basic minigene as described below.
The H
A clone contains the region +1374 to +4347 from the A
-globin gene fused to the above-described minigene, while in the H
G (H
G14) clone the equivalent sequence from +1394 (BstEII) to +4730 relative to the G
-globin transcription start site was added. Coordinates of the G
-globin fragments added to the basic clones were as follows: H
G9, +1394 to +3529; H
G7, +1394 to +2722; H
G4, +1394 to +2003; and H
G3, +1394 to +1711. Coordinates of the internal deletions generated in H
G7 were as follows:
6, 2394 to 2620;
5, 2004 to 2409; and
4, 1714 to 2004. Clone H
A7 contained the region +35 to +2557 from the A
gene. The flank exchange clones were generated from H
A7, replacing various regions with the equivalent region of the G
-globin gene, as follows: for H
GpA/A7, +1374 to +1611 of A
was replaced with the equivalent region of G
; for H
ApA/G7, +1611 to +2557 of A
was replaced with the same region of G
; for H
G4/A7, +1374 to +1966 of A
was replaced with G
; and for H
A4/G7, +1966 to +2557 of A
was replaced with G
. Poly(A) competition clones were as follows: H
ApA/SPA contains the region of A
-globin from +35 to +1611 fused to the strong synthetic polyadenylation signal (SPA) (16) via the pBluescript polylinker; H
GpA/SPA was a similar clone in which the region of A
-globin from +1374 to +1630 was replaced with the equivalent region of G
-globin. Fragments of the G
3' flank were cloned into H
ApA/SPA at a blunt-ended SalI site between the two poly(A) sites (fragment A, 1709 to 1982; fragment B, 1982 to 2198; fragment C, 2198 to 2372; fragment D, 2372 to 2722) (all locations relative to the G
-globin transcription start site). The
-globin/SPA clone with inactive method poly(A) signal has been described previously (19); fragments of the A
- and G
-globin genes from BstEII (positions +1373 and +1393, respectively) to DraI (+1611 and +1630) were inserted into a PvuII site between the
-globin gene and SPA. pVa is a pUC18 plasmid containing the VA1 gene and was used as a cotransfection control. The Tat plasmid has been described previously (1).
Cell culture and transfection. HeLa cells were maintained in D-MEM supplemented with 10% bovine serum and 2 mM L-glutamine. Cells were transfected with plasmids as indicated in the figure legends by the use of calcium phosphate precipitation (11) for steady-state analysis or of Effectene according to the instructions of the manufacturer [QIAGEN]) for nuclear run-on (NRO) analysis. At 24 h after transfection cells were harvested by washing and scraping into cold phosphate-buffered saline (PBS).
Steady-state RNA isolation and analysis. Transfected HeLa cells were fractionated into nuclear and cytoplasmic components as described previously (13). Cytoplasmic RNA was purified by phenol-chloroform extraction and precipitated with ethanol. RNA was extracted from isolated nuclei by the hot-phenol method: nuclei were resuspended in PBS and added to a 1:1 mixture of acid phenol and 100 mM NaCl-10 mM Tris-HCl (pH 7.5)-1 mM EDTA-1% sodium dodecyl sulfate. Following centrifugation the aqueous phase was reextracted with phenol-chloroform and chloroform, precipitated with ethanol, and resuspended in R-loop (80% formamide-40 mM PIPES [piperazine-N,N'-bis(2-ethanesulfonic acid] [pH 6.8]-1 mM EDTA-400 mM NaCl).
A labeled riboprobe spanning the HIV transcription start site (161 to +123) was generated, and RNase protection assays were carried out as previously described (5). The poly(A) site competition assay (using S1 nuclease analysis) has been described previously (14).
NRO analysis.
NRO analysis was carried out as described previously (4), with some modifications. Nuclei were isolated at 4°C by lysing the cells with 0.5% NP-40 in 10 mM Tris-HCl (pH 7.5)-10 mM NaCl-2.5 mM MgCl2, followed by centrifugation through a 10% sucrose cushion made in the same buffer. Transcription reactions were carried out in 40 mM Tris-HCl (pH 7.9)-300 mM KCl-10 mM MaCl2-40% glycerol-2 mM dithiothreitol supplemented with 250 µM concentrations each of ATP, CTP, and GTP plus 1.1 MBq of [
-32P]UTP. After 15 min at 30°C, the nuclei were pelleted briefly (30 s) and resuspended in PBS, and RNA was extracted using the hot-phenol approach. The precipitated RNA was treated with RNase-free DNase I, extracted with phenol-chloroform, and reprecipitated as previously described (4). RNA was then hydrolyzed with 0.2 M NaOH, neutralized with 0.5 M Tris-0.5 M HCl, and hybridized to filters containing single-stranded DNA probes as described in Table 1. Selection of RNAs contiguous with a biotinylated riboprobe containing the reverse complement of the region 1424 to 1711 [spanning the G
poly(A) site] was carried out as described for the ß-globin gene (12). Briefly, run-on RNAs were hybridized to the biotinylated probe in R-loop buffer overnight and were selected using streptavidin-coated magnetic beads (Promega). After two washes in 300 mM NaCl-10 mM Tris-HCl (pH 7.5)-5 mM EDTA to reduce nonspecific binding, selected RNAs were released by hydrolysis with 0.2 M NaOH and neutralized with 0.5 M Tris-0.5 M HCl as described for the standard run-on technique.
|
| RESULTS |
|---|
|
|
|---|
-globin genes (G
and A
) are found in tandem within the ß-globin gene cluster on chromosome 11, with a distance of only 3.3 kb between the poly(A) site of the upstream G
gene and the transcription start site of the A
gene. The
-globins are predominantly expressed within fetal erythroid cells, where the upstream G
gene represents the more highly expressed (70:30) of the pair (21).
The two
-globin genes are very closely related, with >95% identity across the proximal promoter and three exons. The two introns show similar degrees of conservation, with a 20-nt insertion in intron 2 of G
relative to A
marking the only major difference between them (Fig. 1a). The DNA flanking the
-globin genes is also well conserved, with approximately 70 to 80% identity across the 1 kb upstream of the proximal promoter and the 2 kb downstream of the poly(A) site (Fig. 1a). Thus, as illustrated in Fig. 1a, the
-globins lie within a 5-kb direct tandem repeat.
|
-globin genes have been complicated by their repetitive nature and by the existence within the ß-globin gene cluster of additional, so-called intergenic transcripts. These unstable transcripts initiate between the globin transcription units and within the locus control region of the ß-globin gene cluster. The extent of intergenic transcription downstream of the two
-globin genes is indicated in Fig. 1a. Previous work (4) has suggested that transcription initiating at the A
and G
promoters gradually reduces over their 3' flanks but does not completely cease prior to the start of intergenic transcription.
The ß-globin cluster intergenic transcripts are erythroid specific (4); however, they can be induced in nonerythroid cells by a variety of methods, including the provision of erythroid transcription factors, disruption of higher-order chromatin structure with histone deacetylase inhibitors (22), and the process of transinduction (4). This latter phenomenon occurs when plasmids which express parts of the ß-globin cluster are transfected into nonerythroid cells; transcribing plasmids localize with the chromosomal ß-globin cluster and trigger the transcription of the intergenic, but not genic, regions of the cluster by an undefined mechanism (4). This further complicates the analysis of transcription termination in the
-globin genes, since termination within the
-globin flank contained within a transfected plasmid can be masked by the stimulation of intergenic transcripts from the chromosome.
Comparison of the efficiencies of transcription termination in the A
- and G
-globin genes.
The ß-globin gene and its 3' flank were cloned downstream of the HIV LTR promoter, and, following cotransfection into HeLa cells with the trans-activator Tat (10), high levels of transcription were detected by NRO. Crucially, the efficiency of termination could be assessed using a probe (U3) which lies immediately upstream of the HIV LTR transcription start site, thus providing a measure of read-through transcription which is uncontaminated by signals derived from the trans-induced chromosomal ß-globin cluster (11).
We have employed this assay to compare the efficiency characteristics of termination in the G
and A
-globin 3' flanks. Since the two genes are almost identical (98%) over the coding region and the introns (with no mismatches in exons or sequences involved in splicing), the first two exons and the 5' end of exon 3 (from position +35 to +1373) derived from the same (A
) gene were cloned downstream of the HIV LTR promoter. Exon 3 and the 3' flank from the G
-globin gene (from position +1394 to +4730, 3.1 kb downstream of the polyadenylation site) were fused into this minigene, which was then assayed as described above. As shown in Fig. 1b, there is very little read round into probe U3, suggesting that efficient termination occurs in this clone even though NRO signals are detected across the entire G
3' flank (probes G4 to G9). By comparison, a similar region of the A
gene (to position +4347, 2.8 kb downstream of the polyadenylation site) shows significant amounts of read round into probe U3, indicating that transcription termination in the A
gene is much less efficient than in G
(Fig. 1b). Interestingly, the overall level of NRO signals derived from the A
clone was much lower than those derived from the G
clone (compare probe PH from the two panels of Fig. 1b). This is probably due to occlusion of the HIV LTR promoter by transcription complexes which read through the promoter and is a further indication of the inefficiency of termination in the A
gene.
A 1.1-kb 3' flanking region of the G
-globin gene is sufficient for efficient termination.
To facilitate the rapid analysis of large numbers of deletion clones we have developed a new transient transfection transcription termination assay based on read round into the HIV LTR promoter. As with the previous assay, transcripts which do not terminate read around the transfected plasmid and into the promoter from which they were initiated. These read-through transcripts are unstable and are not exported from the nucleus (data not shown); however, they can be detected in the nuclear RNA fraction (e.g., Fig. 2b, lanes 4 and 5). Thus, following transfection and 24 h of expression, transcripts which initiate at the HIV promoter can be differentiated from those which read round the plasmid by an RNase protection assay using a probe which spans the HIV promoter (Fig. 2a).
|
3' flank is necessary for efficient termination, a series of deletion clones were generated. As with the minigenes shown in Fig. 1, exons 1 and 2 and the first 17 bp of exon 3 derived from the A
gene were cloned downstream of the HIV LTR promoter and the remainder of exon 3 and various pieces of 3' flank derived from the G
-globin gene were fused into this clone to give the minigenes shown in Fig. 2b. Three clones containing large regions of 3' flank (H
G14, 3.1 kb; H
G9, 1.9 kb; H
G7, 1.1 kb) all gave no detectible read-through transcript, indicative of efficient termination (Fig. 2b, lanes 1 to 3). However, when the 3' flanking region was reduced to 0.4 kb (H
G4), read-through transcripts were readily apparent (lane 5). Interestingly, when the region between 0.4 and 1.1 kb downstream of the G
poly(A) site was replaced with the adjacent region (1.1 to 1.9 kb), generating a clone of a similar size (H
G9
5-7), termination was inefficient. This suggests that sequences important for termination are located within the region 0.4 to 1.1 kb downstream of the G
poly(A) site.
Existence of multiple termination elements within the G
-globin 3' flank.
To more precisely define the sequences needed for efficient termination in the G
gene, we generated a set of clones with deletions within 1.1 kb downstream of the poly(A) site. These were transiently transfected, and the amount of read-through transcript generated by each clone was compared to that of the H
G7 clone, which contains the entire 1.1-kb flank (Fig. 2c, lane 1). When the 3' flank was deleted to within 120 nt of the poly(A) signal (H
G3), termination was very inefficient, with noticeable occlusion of the HIV promoter (Fig. 2c, lane 5; compare the amount of HIV-initiated transcripts to the results seen with larger clones). Smaller deletions of 290, 380, and 226 nt across the 3' flank (lanes 2, 4, and 6) all had a minor effect on the amount of read round detected, and no promoter occlusion was apparent. Thus, no single region appears to be entirely responsible for efficient termination in the G
-globin gene, suggesting there may be multiple, weak elements within 1.1 kb downstream of the poly(A) site which are together sufficient for efficient termination to occur.
Inefficient termination in the A
-globin gene.
In comparison to G
results, it is apparent that termination in the A
-globin gene is less efficient (Fig. 1b). Although the 3' flanks of the
-globin genes are similar (averaging 75% identity over the first 1.1 kb) (Fig. 1a), there is considerable scope for sequences important in termination to differ between these two genes. In particular, the G
-globin gene possesses an insertion of 120 nt at a position 0.5 kb downstream of the poly(A) site, within the probe G5 (Fig. 1).
We have generated a set of clones with hybrid 3' flanks to determine which differences in the 3' flank of the
-globin genes account for their different characteristics with respect to the ability to terminate. Figure 3a, lanes 1 and 2, emphasizes that while 1.1 kb of G
3' flank is sufficient for termination, the equivalent region of A
is not. Surprisingly, when various regions of the 3' flanks of the two
-globin genes were swapped (lanes 3 to 6) it became apparent that the major determinant of the efficiency of termination was the poly(A) signal utilized. Thus, when the entire A
3' flank was linked to the poly(A) signal of the G
gene (lane 3; H
GpA/A7) termination was efficient, while when the G
3' flank was coupled to the A
poly(A) signal significant read round was observed (lane 4; H
ApA/G7). This was the case for every region tested (not all data shown); for example, when the regions from 0.4 to 1.1 kb downstream of the
-globin poly(A) sites were swapped efficient termination was observed in the clone containing the G
poly(A) signal (lane 5; H
G4/A7) but in not that containing the A
poly(A) signal (lane 6; H
A4/G7).
|
G7 and H
ApA/G7) gave opposite results in this assay (Fig. 3a, lanes 2 and 3), strongly suggesting that RNA stability is not a major determinant. In addition, the principle findings of the assay have been verified in later run-on experiments; therefore, we believe this read-round assay to be a useful initial analytical tool.
To investigate the difference between the poly(A) signals of the two
-globin genes we carried out a poly(A) competition assay (Fig. 3b). In this assay the upstream
poly(A) signal competes with a stronger downstream synthetic poly(A) signal, SPA (16). The balance of poly(A) site usage is influenced by the relative strengths of the two poly(A) signals, and so this assay can be used to rank poly(A) sites according to their strengths by comparing each to the SPA in turn (16).
The poly(A) site of each of the
-globin genes was competed against the SPA by cloning the latter at a position 37 nt downstream of the cleavage site of each
-globin gene in the HIV promoter minigene setting used in previous experiments (Fig. 3bi). In this context, the A
-globin poly(A) signal was able to compete at roughly 40% with the SPA (lane 1). By comparison the G
-globin poly(A) signal was much stronger, contributing over 90% of the signal (lane 2). We note the difference in the overall signal obtained from the two competition clones (lanes 1 and 2), and although we cannot presently explain this difference, the result was reproducible (data not shown) and all the usual checks on plasmid integrity revealed no inconsistencies. We have carried out the poly(A) competition assay in a different minigene setting in which transcription of the three exons of
-globin was driven by its own promoter under the additional regulation of the simian virus 40 enhancer. In this case we were able to measure the difference in the strengths of the two poly(A) signals without a change in the overall signal observed (Fig. 3bii).
These experiments suggest an obvious and simple difference between the two
-globin genes. G
has a strong poly(A) signal which enables it, in the presence of its 3' flank, to terminate transcription efficiently. By comparison, A
has a much weaker poly(A) signal, and therefore, despite having a termination competent 3' flank (as demonstrated in Fig. 3a, lanes 3 and 5), this gene does not terminate efficiently. A sequence comparison between the two poly(A) sites (Fig. 3c) shows a number of differences in the downstream sequence element which have the net effect of reducing the G and U content and which would be predicted to reduce the efficiency of cleavage and polyadenylation (39).
Mechanism of termination in the G
-globin gene.
(i) Transcription does not stop within 1.1 kb of G
-globin. For a number of genes (in particular, yeast genes) it has been possible to use nuclear run on to show directly where transcription terminates. We have carried out NRO analysis on a number of the
-globin minigenes we generated, using a panel of probes across the 3' flank (probes A and G3 to G6) and into the plasmid backbone (probe pG7 and A to E) along with probe U3, upstream of the HIV transcription start site. It should be noted that the Tat expression plasmid contains related vector sequences, although transcription in this plasmid runs in the opposite orientation. Transfection of the Tat plasmid alone suggested that it makes a minor contribution to run-on signals across probes A to E (data not shown).
In the H
G7 minigene, transcription continues across the 3' flank and into the plasmid backbone, where it gradually reduces before reaching the HIV promoter (Fig. 4a, top panel). By comparison, the G
deletion clone H
G4 terminates inefficiently, with higher run-on signals across the plasmid and into the HIV promoter (second panel from top). Termination in the A
-globin minigene H
A7 is less efficient still (third panel from top), showing high signals across the 3' flank and into the plasmid as well as over the HIV promoter. The large effect that the poly(A) signal strength has on termination is reiterated in the bottom two panels; the H
ApA/G7 minigene, containing the weak A
-globin poly(A) signal and the G
3' flank, fails to terminate transcription and reads into the plasmid and HIV promoter at high levels, while in the H
GpA/A7 clone the majority of transcription has ceased prior to the HIV promoter.
|
This analysis suggests that there is not a single, strong termination site within the H
G7 minigene but rather that termination occurs in a gradual manner across both the G
-globin 3' flank and the adjacent vector. This is in agreement with our previous finding that multiple regions across the G
-globin 3' flank have weak termination-promoting activities.
(ii) The G
-globin gene shows no evidence of pretermination cleavage.
A novel transcription termination mechanism has been reported for the human ß-globin gene (12), as suggested by hybrid selection nuclear run-on analysis. In this procedure, transcripts are hybridized to a complementary biotinylated RNA probe. Contiguous transcripts are then isolated with streptavidin-coated magnetic beads and, following their release, are detected by hybridization to a panel of single-stranded probes. In the case of the human ß-globin gene, it was shown that there is a region 0.9 to 1.6 kb downstream of the poly(A) signal through which selection does not occur, suggesting that the transcripts in this region are not contiguous. Further work showed that the RNA generated in this region of the ß-globin gene undergoes cleavage and that this event is an important precursor to transcription termination (12). These data were interpreted into a two-step model for termination in which a reiterative cotranscription cleavage (CoTC) event mediates the ultimate release of RNA polymerase following polyadenylation and the release of the mature mRNA (reviewed in reference 26).
We have used hybrid selection nuclear run-on analysis to look for CoTC in the
-globin genes (Fig. 4b). A series of five nuclear run-on probes (G3 to pG7) provide complete coverage of the 1.1-kb region of the G
-globin 3' flank which is required for transcription termination (Fig. 4bi). Using an antisense hybrid selection probe corresponding to region G3, which straddles the G
poly(A) site, we carried out hybrid selection NRO as outlined above. Contiguous transcripts were selected throughout the entire 1.1-kb 3' flanking region (Fig. 4bi, top panel). By the use of the same selection probe, similar results were obtained for the A
-globin gene; contiguous transcripts were selected across 1 kb of the A
-globin 3' flank (Fig. 4bii, top panel). A control selection performed using a probe generated from the ß-globin gene (bottom panels) was carried out to ensure that selection was specific.
These data therefore rule out CoTC as a component of the termination mechanism in the
-globin genes. Thus, the human
-globin genes differ from the ß-globin gene, where strong CoTC elements mediate transcription termination, and also from the human
-globin gene, for which there is evidence that weak CoTC elements have a similar role.
(iii) Transcription pausing in the G
-globin 3' flank.
It has previously been demonstrated for a number of yeast and mammalian genes that transcriptional pausing downstream of the poly(A) signal may contribute towards the ability of Pol II to terminate (3, 7, 27). For example, in the human
-globin gene, a strong pause site 350 nt downstream of the poly(A) signal brings about transcription termination (14). We have analyzed the 3' flank of the G
-globin gene to determine whether a similar mechanism is utilized.
The poly(A) competition assay was developed as a means of detecting transcriptional pausing (14). As described above, when a weak upstream poly(A) signal is placed in close proximity to a stronger downstream poly(A) signal the resulting competition will tend to favor the stronger signal and will reflect their relative strengths. By inserting fragments of DNA between the two poly(A) signals the balance of competition can be pushed in favor of the upstream site, since as larger fragments are inserted it will begin to gain a temporal advantage by virtue of being transcribed first. A transcription pause site effectively has a much larger effect on the balance of poly(A) competition; by slowing the rate of transcription, it has a greater temporal effect. With this assay it is also possible to identify polyadenylation enhancers; indeed, transcription pause sites and terminators in general form one class of poly(A) enhancer.
We have inserted fragments of the G
3' flank, ranging from 180 to 350 nt, between the weak A
poly(A) signal and the strong SPA (Fig. 4c). The fragments were inserted in both orientations to allow us to distinguish pausing effects over and above the distance effects one might expect to see. As shown in Fig. 4c, in the absence of any intervening sequences the upstream A
poly(A) signal contributes 37% of the mRNA produced from this clone (lane 8). Each of the four fragments A to D increased this contribution when inserted in the positive orientation (lanes 2, 5, 7, and 3, respectively), with the largest increase observed when region A [117 to 390 nt downstream of the G
poly(A) site] was analyzed (lane 2). Importantly, the effect of the increase in distance between the two poly(A) signals, assessed by analyzing the largest and smallest fragments (fragments D and C, respectively), is not significant in this assay (lanes 4 and 6).
The data presented above show enhanced use of the weak upstream poly(A) site in the assay, which may be interpreted in three ways. Firstly, polyadenylation enhancement as a result of the binding of positive regulatory factors to the flanking RNA is possible, although since the effect was shown to spread over 1.1 kb of the 3' flank this seems unlikely. Secondly, it is possible that the G
-globin 3' flank forms an RNA secondary structure which enhances polyadenylation, although since the four separate regions of the flank each have an individual effect of poly(A) site usage this again seems unlikely. In addition, since we have shown that the G
-globin poly(A) signal is extremely efficient even in the absence of its 3' flank (Fig. 3b) it seems unlikely that poly(A) enhancers such as these would affect its usage in the whole-gene setting. We therefore favor a third explanation, according to which transcriptional pausing in the G
-globin 3' flank accounts for the enhanced use of the upstream poly(A) site in our assay. This, in combination with the strong G
-poly(A) site, would be expected to contribute toward transcription termination in the G
gene.
| DISCUSSION |
|---|
|
|
|---|
-globin genes form part of a 5-kb tandem repeat within the ß-globin gene cluster and show a high degree of identity across their exons and introns (average > 95%, as shown in Fig. 1a) and also across both 5' and 3' flanks (average, 75 to 80%). In the 5' flank and coding region, this similarity reflects the conserved regulation and function of the two genes. In the 3' flank of the
-globin genes, the similarity may merely be indicative of a recent evolutionary origin; however, it is also possible that here too it reflects conserved regulation, potentially in the termination stage of transcription.
We have compared the transcription profiles of the G
- and A
-globin genes by the use of a transient transfection system. Plasmid minigenes under the control of the HIV LTR promoter were generated for both human
-globins, and these were transfected into HeLa cells and nascent transcription was measured using NRO. In both cases nascent transcripts were detected across the entire flank. However, when a probe immediately upstream of the HIV LTR transcription start site (probe U3) was used it was seen that while transcription of the G
-globin gene terminates prior to the HIV promoter, presumably within the backbone of the plasmid, in the A
-gene termination is inefficient and transcripts across probe U3 are readily apparent (Fig. 1b). This resulted in a noticeable promoter occlusion effect in the A
clone relative to the G
clone results. We predict that the more-efficient termination process for the G
-globin gene may be required to minimize transcriptional interference with the downstream, coexpressed A
-globin gene.
We investigated the difference in the termination efficiency characteristics of the two
-globin genes further by generating a series of heterologous minigenes containing various combinations of the two
-globin genes. These minigenes revealed that the primary difference in the termination efficiency characteristics of the two
-globin genes is due to the different strengths of their poly(A) signals. Thus, while the strong poly(A) signal of the G
-globin gene correlates with efficient transcription termination, the weaker poly(A) signal of the A
gene results in inefficient termination, even in the context of the G
3' flank (Fig. 3). Within the region tested, only seven nucleotide differences occur between the two poly(A) sites; the majority of these lower the GU content in the downstream GU/U-rich region of the A
signal and would therefore be predicted to reduce its strength. The correlation between poly(A) signal strength and transcription termination is in agreement with the results seen with other studies (15b, 27a). Furthermore, in our previous studies using a similar ß-globin minigene, mutation of the poly(A) signal significantly increased the amount of transcriptional read through into the HIV promoter, as assessed by NRO (11). In an alternative approach, plasmid minigenes containing weak, intermediate, or strong poly(A) signals were injected into Xenopus oocytes and it was shown using electron microscopy that poly(A) efficiency correlates both with the percentage of terminating plasmids and with the site of termination (20).
We also attempted to dissect the G
-globin 3' flank to define precisely the sequences required to promote termination and the mechanism by which termination in the G
-globin gene occurs. Sequences up to 1.1 kb downstream of the poly(A) site were sufficient to cause efficient termination in combination with the strong G
poly(A) site (Fig. 2b and 4a). However, our efforts to map more precisely which sequences are necessary for termination suggested that there is no single region responsible but rather that multiple sequences spread across the 3' flank each weakly contribute to the combined termination efficiency of the G
-globin gene (Fig. 2c). These results are supported by our efforts to define the mechanism of termination in the G
-globin gene; using the poly(A) competition assay we have shown that a number of weak pause elements exist across the G
3' flank (Fig. 4c).
Transcriptional pausing has been shown to be an important component of the termination mechanism for a number of mammalian and yeast genes. In the fission yeast Schizosaccharomyces pombe, pause sites in the 3' flanks of the ura4 and nmt2 genes are required for transcription termination (3, 7). Similar pause sites have been identified in the human
2-globin (14), C2 complement (6), and mouse immunoglobulin M (21a) genes. Pausing downstream of the poly(A) site is an integral part of the Torpedo model of termination; transcription complexes which are slowed by pausing are more readily overtaken by the RNA-degrading activity, which eventually causes template release (25). Recent evidence from both yeast and mammalian systems has confirmed that the major nuclear 5'-3 exonuclease (Rat1p in yeast and Xrn2 in mammals) is indeed required for efficient Pol II termination (15a, 27b). Analysis of the C2 pause site in an in vitro-coupled transcription-polyadenylation system showed that pausing can also stimulate polyadenylation (28, 29). In addition the latter studies showed that not all pause sites are able to stimulate polyadenylation; however, they do suggest an additional mechanism by which pausing promotes transcription termination (26).
An alternative type of terminator for the human ß-globin gene has recently been described (12). A termination region was identified 900 to 1,600 nt downstream of the poly(A) site of the ß-globin gene, and hybrid selection analysis of the nascent transcripts showed that they were not contiguous across this region, instead being cleaved in a cotranscriptional event. Although cotranscriptional cleavage (CoTC) is not dependent on polyadenylation, both events are necessary for termination to occur downstream of the site of CoTC (12). We have looked for cotranscriptional cleavage in both the G
- and A
-globin 3' flanks but have found no evidence that it occurs in either gene (Fig. 4b). We therefore predict that while some genes employ CoTC to mediate efficient Pol II termination, other genes utilize pausing mechanisms instead.
The emerging view of transcription termination is one in which events integrated through the carboxy-terminal domain influence the conformation of the transcription complex, RNA processing, and the structure of the chromatin through which transcription occurs (24). Ultimately, this results in transcription termination, not in some random fashion but in a way which is regulated by activities occurring across the genes. Polyadenylation is at the heart of the events leading to transcription termination, and the work on the duplicated
-globin genes in humans presented here highlights this role by directly comparing two poly(A) signals which have different strengths, ultimately leading to markedly different profiles of transcriptional termination.
| ACKNOWLEDGMENTS |
|---|
This work was supported by the Wellcome Trust (program grant to N.J.P.).
| FOOTNOTES |
|---|
Present address: School of Biomedical and Molecular Sciences, University of Surrey, Guildford, Surrey GU2 7XH, United Kingdom. ![]()
| REFERENCES |
|---|
|
|
|---|
2. Alén, C., N. A. Kent, H. S. Jones, O'Sullivan, J., A. Aranda, and N. J. Proudfoot. 2002. A role for chromatin remodeling in transcriptional termination by RNA polymerase II. Mol. Cell 10:1441-1452.[CrossRef][Medline]
3. Aranda, A. and N. J. Proudfoot. 1999. Definition of transcriptional pause elements in fission yeast. Mol. Cell. Biol. 19:1251-1261.
4. Ashe, H. L., J. Monks, M. Wijgerde, P. Fraser, and N. J. Proudfoot. 1997. Intergenic transcription and transinduction of the human beta-globin locus. Genes Dev. 11:2494-2509.
5. Ashe, M. P., P. Griffin, W. James, and N. J. Proudfoot. 1995. Poly(A) site selection in the HIV-1 provirus: inhibition of promoter-proximal polyadenylation by the downstream major splice donor site. Genes Dev. 9:3008-3025.
6. Ashfield, R., A. J. Patel, S. A. Bossone, H. Brown, R. D. Campbell, K. B. Marcu, and N. J. Proudfoot. 1994. MAZ-dependent termination between closely spaced human complement genes. EMBO J. 13:5656-5667.[Medline]
6. Barilla, D., B. A. Lee, and N. J. Proudfoot. 2001. Cleavage/polyadenylation factor IA associates with the carboxyl-terminal domain of RNA polymerase II in Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 98:445-450.
7. Birse, C. E., B. A. Lee, K. Hansen, and N. J. Proudfoot. 1997. Transcriptional termination signals for RNA polymerase II in fission yeast. EMBO J. 16:3633-3643.[CrossRef][Medline]
8. Birse, C. E., L. Minvielle Sebastia, B. A. Lee, W. Keller, and N. J. Proudfoot. 1998. Coupling termination of transcription to messenger RNA maturation in yeast. Science 280:298-301.
9. Connelly, S. and J. L. Manley. 1988. A functional messenger-RNA polyadenylation signal is required for transcription termination by RNA polymerase-ii. Genes Dev. 2:440-452.
10. Cullen, B. R. 1993. Does HIV-1 Tat induce a change in viral initiation rights? Cell 73:417-420.[CrossRef][Medline]
10. Dieci, G., and A. Sentenac. 2002. Detours and shortcuts to transcription reinitiation. Trends Biochem. Sci. 28:202-209.
11. Dye, M. J., and N. J. Proudfoot. 1999. Terminal exon definition occurs cotranscriptionally and promotes termination of RNA polymerase II. Mol. Cell 3:371-378.[CrossRef][Medline]
12. Dye, M. J., and N. J. Proudfoot. 2001. Multiple transcript cleavage precedes polymerase release in termination by RNA polymerase II. Cell 105:669-681.[CrossRef][Medline]
13. Eggermont, J., and N. J. Proudfoot. 1993. Poly(A) signals and transcriptional pause sites combine to prevent interference between RNA polymerase II promoters. EMBO J. 12:2539-2548.[Medline]
14. Enriquez Harris, P., N. Levitt, D. Briggs, and N. J. Proudfoot. 1991. A pause site for RNA polymerase II is associated with termination of transcription. EMBO J. 10:1833-1842.[Medline]
15. Greger, I. H., and N. J. Proudfoot. 1998. Poly(A) signals control both transcriptional termination and initiation between the tandem GAL10 and GAL7 genes of Saccharomyces cerevisiae. EMBO J. 17:4771-4779.[CrossRef][Medline]
15. Kim, M., N. J. Krogan, L. Vasiljeva, O. J. Rando,E. Nedea, J. F. Greenblatt, and S. Buratowski. 2004. The Rat1 exonuclease promotes transcriptional termination by RNA polymerase II. Nature 432:517-522.[CrossRef][Medline]
15. Kim, S. I., and H. G. Martinson. 2003. Poly(A) dependent transcription termination. J. Biol. Chem. 278:41691-41701.
16. Levitt, N., D. Briggs, A. Gil, and N. J. Proudfoot. 1989. Definition of an efficient synthetic poly(A) site. Genes Dev. 3:1019-1025.
16. Licatalosi, D. D., G. Geiger, M. Minet, S. Schroeder, K. Cilli, J. B. McNeil, and D. L. Bentley. 2002. Functional interaction of yeast pre-mRNA 3' end processing factors with RNA polymerase II. Mol. Cell 9:1101-1111.[CrossRef][Medline]
17. Logan, J., E. Falck Pedersen, J. E. Darnell, and T. Shenk. 1987. A poly(A) addition site and a downstream termination region are required for efficient cessation of transcription by RNA polymerase II in the mouse beta maj-globin gene. Proc. Natl. Acad. Sci. USA 84:8306-8310.
18. Mayor, C., M. Brudno, J. R. Schwartz, A. Poliakov, E. M. Rubin, K. A. Frazer, L. S. Pachter, and I. Dubchak. 2000. VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics 16:1046-1047.
19. Moreira, A., M. Wollerton, J. Monks, and N. J. Proudfoot. 1995. Upstream sequence elements enhance poly(A) site efficiency of the C2 complement gene and are phylogenetically conserved. EMBO J. 14:3809-3819.[Medline]
20. Osheim, Y. N., N. J. Proudfoot, and A. L. Beyer. 1999. EM visualization of transcription by RNA polymerase II: downstream termination requires a poly(A) signal but not transcript cleavage. Mol. Cell 3:379-387.[CrossRef][Medline]
21. Papassotiriou, I., R. Ducrocq, C. Prehu, J. Bardakdjian-Michau, and H. Wajcman. 1998. Gamma chain heterogeneity: determination of Hb F composition by perfusion chromatography. Hemoglobin 22:469-481.[Medline]
21. Peterson, M. L., S. Bertolino, and F. Davis. 2002. An RNA polymerase pause site is associated with the immunoglobulin µs poly(A) site. Mol. Cell. Biol. 22:5606-5615.
22. Plant, K. E., S. J. E. Routledge, and N. J. Proudfoot. 2001. Intergenic transcription in the human ß-globin gene cluster. Mol. Cell. Biol. 21:6507-6514.
23. Platt, T. 1998. RNA structure in transcription elongation, termination, and antitermination, p. 541-574. In R. Simons and M. Grunberg-Manago (ed.), RNA structure and function. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
24. Proudfoot, N. 2004. New perspectives on connecting messenger RNA 3' end formation to transcription. Curr. Opin. Cell Biol. 16:272-278.[CrossRef][Medline]
25. Proudfoot, N. J. 1989. How RNA polymerase II terminates transcription in higher eukaryotes. Trends Biochem. Sci. 14:105-110.[CrossRef][Medline]
26. Proudfoot, N. J., A. Furger, and M. J. Dye. 2002. Integrating rnRNA processing with transcription. Cell 108:501-512.[CrossRef][Medline]
27. Tantravahi, J., M. Alvira, and E. Falck Pedersen. 1993. Characterization of the mouse ßmaj globin transcription termination region: a spacing sequence is required between the poly(A) signal sequence and multiple downstream termination elements. Mol. Cell. Biol. 13:578-587.
27. Tran, D. P., S. J. Kim, N. J. Park, T. M. Jew, and H. G. Martinson. 2001. Mechanism of poly(A) signal transduction of RNA polymerase II in vitro. Mol. Cell. Biol. 21:7495-7508.
27. West, S., N. Gromak, and N. J. Proudfoot. 2004. Human 5'->3' exonuclease Xrn2 promotes transcription termination from sites of co-transcriptional cleavage. Nature 432:522-525.[CrossRef][Medline]
28. Yonaha, M., and N. J. Proudfoot. 1999. Specific transcriptional pausing activates polyadenylation in a coupled in vitro system. Mol. Cell 3:593-600.[CrossRef][Medline]
29. Yonaha, M., and N. J. Proudfoot. 2000. Transcriptional termination and coupled polyadenylation in vitro. EMBO J. 19:3770-3777.[CrossRef][Medline]
30. Zhao, J., L. Hyman, and C. Moore. 1999. Formation of mRNA 3' ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol. Mol. Biol. Rev. 63:405-445.
This article has been cited by other articles:
| |||||||||||||||||||||