| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Previous Article | Next Article ![]()
Molecular and Cellular Biology, January 2008, p. 50-60, Vol. 28, No. 1
0270-7306/08/$08.00+0 doi:10.1128/MCB.01251-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
and
Michael R. Lieber*
USC Norris Comprehensive Cancer Ctr., Rm. 5428, and Departments of Pathology, Biochemistry and Molecular Biology, Molecular Microbiology and Immunology, and Biological Sciences, University of Southern California Keck School of Medicine, 1441 Eastlake Ave., MC9176, Los Angeles, California 90089-9176
Received 12 July 2007/ Returned for modification 21 August 2007/ Accepted 9 October 2007
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
In vitro studies of R-loop formation at prokaryotic origins and Ig class switch regions have paralleled many of the observations seen in vivo. In vitro studies utilize prokaryotic RNA polymerases, often the phage T7 or T3 RNA polymerases, and purified plasmid DNA. Ig class switch recombination (CSR) sequences have been the focus of most of these in vitro studies (6, 9, 28, 29, 38), although studies on mitochondrial and prokaryotic replication origins have also been done (18, 41). The R-loops only form when in vitro transcription occurs in the direction that results in a G-rich transcript. There has been no systematic study of how G-rich or how long the regions must be, nor has there been any sequence modification to assess any aspect of these G-rich regions for their propensity to form R-loops.
Ig CSR occurs at switch regions. In mammals, recombination occurs between the Sµ region, which is located upstream of the constant exons encoding Igµ heavy chain, and any one of the downstream switch regions, S
, S
, or S
, which are located upstream of the constant exons encoding the Ig
, Ig
, and Ig
heavy chains, respectively (3, 5, 33, 45). In mammals, the Ig switch regions are usually several kilobases in length, G-rich on the nontemplate strand (thereby generating a G-rich RNA transcript), and repetitive (with a repeat length of between 25 and 80 bp). Many of the G's are in clusters of 2 to 5 nucleotides (nt). Promoters are present in front of each switch region, the transcripts generated from these promoters do not encode any protein (hence, the name sterile transcripts), and removal of the promoter results in the loss of switching to that specific switch region (35, 42). Ig CSR occurs in germinal-center B cells located in the peripheral lymphoid tissues (e.g., lymph nodes, Peyer's patches, and spleen) upon cytokine stimulation. Different cytokines stimulate the promoters upstream of the different switch regions (36).
Ig CSR requires a cytidine deaminase called activation-induced deaminase (AID), which is expressed in activated B cells (24). AID only deaminates C's when these are located in single-stranded DNA (4, 25, 44). CSR at the downstream switch regions occurs within the switch repetitive regions, and recombination at Sµ can sometimes occur upstream (35%) or downstream (8%) of the Sµ switch repeats (8, 21-23). Given that AID requires single-stranded DNA, a key question concerns how any single strandedness is exposed within the switch regions (33). We have shown that R-loops are detectable at the S
3 and S
2b acceptor switch regions (13, 43) and, more recently, at the Sµ switch regions (12). These R-loops can be kilobases in length and provide a ready target for AID at any of the C's within the top strand. Single strandedness on the bottom strand may derive from partial or complete action by endogenous RNase H in its removal of the RNA that is annealed to the bottom strand (43, 45). After AID action on the top and bottom strands, uracil glycosylase (UNG2) converts these to abasic sites (16, 27), and apurinic/apyrimidinic endonuclease (APE1) may nick the phosphodiester backbone 5' of the abasic site (10). Double-strand breaks could arise from nicks that are sufficiently close, or after nucleases, such as Exo 1 (1), resect from the nick toward an adjacent nick.
The mechanism of R-loop formation is of key importance not only for the immune system during CSR but also for any processes in biology where R-loops arise. Here we have focused on three central issues about the mechanism of R-loop formation. First, how does the RNA that forms the R-loop arrive at a position that permits it to reanneal with the template DNA strand? Is it simply an extension of the standard 9-bp RNA-DNA hybrid formed during transcription and known to form within all RNA polymerases? Or does it thread back to anneal with the template DNA strand after traversing the exit pore that exists in all RNA polymerases? Second, is G-quartet formation by the G-rich nontemplate strand essential for R-loop formation? Third, what is the minimum length of Ig switch region DNA necessary for R-loop formation, and can a reduction in G density or a reduction in the extent of G clustering still permit R-loop formation?
| MATERIALS AND METHODS |
|---|
|
|
|---|
3 region. The wild-type switch sequences in pDR3 include 4 of the 41 repeats from the murine S
3 region. These four are repeat numbers 13, 14, 15, and 16 (37), and they were introduced using oligonucleotides and PCR into pKY127. pKY127 is simply a derivative of pBKS(+) with a deletion of the lacZ promoter and replacement with an oligonucleotide that serves as a multipurpose cloning region.
pTW122 (also called pTW-EL91) contains 5.5 repeats and is the same as pTW-SS91 (43) except for minor differences in the multipurpose cloning region. Another four-repeat substrate (pDR18), pDR49 (one repeat), pDR50 (two repeats), pDR51 (three repeats), and substrates containing four modified repeats, pDR22, pDR26, and pDR30, used in our studies, were also made similarly as described above for pDR3. In pDR22, G clusters containing four or five G's in the wild-type sequence were reduced to clusters of three G's, whereas pDR26 was made with four repeats in which all clusters of three, four, or five G's were made to clusters of two G's, while pDR30 has the four S
3 repeats where all clusters of five, four, three, and two G's were disrupted such as no two G's could be adjacent to each other. A transcription substrate called pDR54 was constructed to have 49.7% G density on the nontemplate strand such that every alternate nucleotide is a G. This substrate was constructed to compare the effect of G density on R-loop formation and was made by using a 189-bp XhoI-digested PCR fragment that was cloned so that transcription from the T7 promoter generates a G-rich RNA.
The plasmid DNA was extracted from bacterial cultures of transformed bacterial colonies and purified on CsCl gradients, followed by ethanol precipitation using standard procedures. For the G-quartet experiments, pTW122 was purified by using a CsCl gradient, followed by precipitation and reprecipitation without monovalent salt but with glycogen to preclude introduction of other monovalent cations into the DNA preparation. An ethanol rinse was then done (with no added salt), followed by resuspension in the transcription buffer with the specified cation. Hence, any G quartets formed during transcription would need to form in the presence of the specified cation. In the experiments where this template is linearized, the restriction digestion was done using in a Na+-free buffer containing 10 mM Tris-HCl (pH 8.0) and 10 mM MgCl2.
DR050 and DR051 or DR056 were used for PCR, while DR075, DR076, or DR110 were used as probes to detect bisulfite converted R-loop derivative molecules in the colony lift hybridization assay.
For the sequences of the oligonucleotides and other related information about enzymes, reagents, and the construction of the plasmids, see the supplemental material.
In vitro transcription of switch substrates.
Supercoiled or linearized (restriction enzyme-digested) switch DNA substrates were transcribed with T7 RNA polymerase (Promega, Madison, WI) in the physiological orientation at 37°C for 1 h in accordance with the polymerase manufacturer's instructions. Radiolabeled [
-32P]UTP was added to the reaction wherever specified. For the RNase T1 experiment, 1 µg of RNase A or 100 U of RNase T1 was added (per µg of DNA transcribed) during the transcription or after heat inactivation of the reaction at 65°C for 20 min. Then, 50 ng of purified E. coli RNase H1 was added per µg of DNA transcribed where specified. Unless otherwise specified, RNase H1 and/or RNase A were added after the transcription reaction, followed by incubation at 37°C for 1 h. For the G-quartet test, pTW122 was digested in Na+-free buffer containing 10 mM Tris-HCl (pH 8.0) and 10 mM MgCl2, and transcription was done in buffers that had the same composition as the commercially available buffer from Promega except that the Na+ was replaced with Li+, K+, or Cs+. The transcribed DNA was electrophoresed on 1% agarose gel in 0.5x Tris-borate-EDTA buffer and poststained with ethidium bromide. Radiolabeled species were detected by exposing the gels to phosphorimager screens and scanning them on a Molecular Dynamics Imager 445SI (Sunnyvale, CA). Bands were analyzed with ImageQuant software version 5.0.
Determination of frequency of R-loop formation.
Portions (2 µg) of SalI-digested plasmid switch substrates were transcribed, treated with RNase A, organically extracted, and precipitated with ethanol. Sodium bisulfite treatment was done as described previously (43). PCR amplification on bisulfite-modified DNA was done with DR050 and DR051 or DR056 (all native primers). The PCR fragment was cloned with a TOPO-TA cloning kit (Invitrogen, Carlsbad, CA). Bacterial colonies were lifted onto nylon membranes (13) and probed with oligonucleotides designed to anneal to a region containing C-to-T conversions but not to an unconverted region on the nontemplate strand. Each probe was designed to bind with a region with approximately six C-to-T changes over a length of approximately 25 bp. Oligonucleotide probes DR075 (for pDR18, pDR26, pDR49, pDR50, and pDR51), DR076 (for pDR26), and DR110 (for pDR54) were radiolabeled in the presence of [
-32P]ATP and used for the detection of putative clones with regions containing C-to-T converted sites. The C-to-T conversions found in the region between DR050 and DR051 are shown in appropriate figures. Molecules with
25-nt stretches containing at least three consecutive C-to-T conversions were considered to be regions of single strandedness.
Enrichment of R-loops by cutting the RNA-DNA hybrid bands from an agarose gel. To study the nature of shifted DNA species, we ran 2 µg of restriction-digested and T7-transcribed, RNase A-treated DNA on 0.8% low-melting-temperature agarose gel in 0.5x Tris-borate-EDTA and cut out the shifted fragment seen on the gel or the corresponding position where a shift is expected (marginally above the DNA band containing the switch region). This gel slice was incubated with sodium bisulfite and analyzed further by sequencing.
| RESULTS |
|---|
|
|
|---|
3 switch region consists of 41 consecutive head-to-tail copies of an
49-bp repeat. We have shown previously that shorter forms of S
3 can still support R-loop formation on supercoiled plasmids very efficiently in vitro (43, 46). Supercoiled plasmids form R-loops particularly efficiently because the inherent negative supercoiling favors strand separation. We wanted to eliminate any contribution of negative supercoiling, and thus we have used linear DNA substrates here except where specified; this provides a more stringent test of R-loop formation. Moreover, DNA in the genome, while somewhat negatively supercoiled (17), is unlikely to be as negatively supercoiled as prokaryotic plasmids.
For the studies here, we use prokaryotic RNA polymerases, typically T7 RNA polymerase, and we transcribe for 1 h of incubation at 37°C. The samples are organically extracted, ethanol precipitated, resuspended, and then run on agarose gels. Where indicated, the transcription is done using radiolabeled [
-32P]UTP. The fraction of substrate that forms an R-loop exhibits a mobility difference and runs more slowly than the double-stranded DNA substrate, even on these linear substrates (7). We and others have demonstrated that the DNA in the shifted position is R-looped, based on RNase H1 sensitivity and sodium bisulfite chemical probing for single strandedness (12, 31, 43).
Test of a thread-back model for R-loop formation. We were interested in how the RNA comes to be annealed to the template DNA strand, and there are two major pathways that one can consider (see Fig. S1 in the supplemental material). All RNA polymerases (prokaryotic and eukaryotic) have exit channels where the nascent RNA normally exits. One possibility is that the RNA comes out of the exit pore of the RNA polymerase and then anneals to the template strand before the two DNA strands anneal to one another. Once an initial RNA-DNA association is formed between the G-rich RNA and the template DNA, the rest of the RNA can thread back and form an RNA-DNA hybrid, which is thermodynamically more stable than the double-stranded B-DNA (30). This model of R-loop formation can be termed RNA "thread back" and requires that the RNA be single stranded for a short period of time before hybridizing with the template DNA strand.
A second possibility, which can be called the "extended-hybrid" model, assumes that the transcript that forms upon transcription of the switch sequences fails to denature from the template in the transcription bubble, owing to the high thermodynamic stability of the short G-rich RNA-DNA hybrid. Then, the remainder of the transcript simply extends as the RNA polymerase transcribes and moves forward on the template. If this model applies, then there would be no free-RNA phase, and the RNA would be annealed to the template strand the entire time.
To distinguish between the two models and dissect the mechanism of RNA association with the DNA, we added RNase T1 during the transcription of linearized switch substrates containing four wild type, murine S
3 (switch region) repeats (diagrammed in Fig. 1C). If the nascent RNA is exposed at any time, the RNase T1, which nicks 3' of the G's, will suppress R-loop formation (see Fig. S2 in the supplemental material). For the present study, RNase T1 is superior to RNase A, which cuts only after pyrimidines, because the RNA strand is relatively poor in pyrimidines and may be cut less frequently than with RNase T1.
|
Because the transcription was done with radiolabeled [
-32P]UTP, gel exposure provides information about the location of the RNA on the gel. The radiolabeled RNA is associated in mobility with the linear DNA fragment (Fig. 1B, lanes 17 and 18), and this RNA is not visible in the lanes where RNase T1 is present during transcription (Fig. 1B, lanes 9 and 10). Note that when RNase A is not used at any time in the experiment, then the lanes are obscured by the radiolabeled RNA (lanes 13 and 14).
Therefore, the nascent RNA is vulnerable to single-strand-specific RNases, such as RNase T1, when there are sufficient cut sites at which these can act. This finding is inconsistent with the extended-hybrid model and is quite consistent with the thread-back model (see Fig. S1 and S2 in the supplemental material).
Test of G-quartet formation at Ig class sequence R-loops.
As mentioned above, class switch sequences are repetitive in nature and are extremely rich in G nucleotides on the nontemplate strand (49% G for the mouse S
3 nontemplate strand). In addition, the G nucleotides in the switch regions tend to occur in clusters of three, four, or five G's, making G quartets a possibility (6, 9). Such structures have been proposed to exist at other G-rich sites in the genome, and this raises the possibility that the R-loops found at the switch sequences are dependent upon G-quartet formation at the G-rich nontemplate DNA strand. G quartets have very specific dimensions and are stabilized by K+ or Na+ cations, which have the right size for the cavity in the center of a G quartet or between planes of more than one G quartet. G quartets are destabilized in presence of Cs+, which is too large, or by Li+, which is too small (32).
We wondered whether R-loops are formed as a result of G-quartet formation on the nontemplate DNA strand. If this were true, then the presence of inhibitory cations that inhibit G-quartet formation would also suppress R-looping. To test this, we prepared DNA templates in a manner such that only one type of monovalent cation was present (see Materials and Methods). We transcribed a minimal switch substrate containing 5.5 repeats of murine S
3, either in supercoiled or in linearized form, with T7 RNA polymerase with only Li+, Na+, K+, or Cs+ present in the transcription buffer as the monovalent cation. Hence, if G quartets were to form during the transcription, then they would have to do so in the presence of only these specified cations. We then ran the samples on a gel to assess the transcription-induced mobility-shifted species. We found that in Li+ (Fig. 2, lane 4 for supercoiled substrate and lane 12 for linearized substrate) or in Cs+ (Fig. 2, lane 7 for supercoiled substrate and lane 15 for linearized substrate) the R-loop-induced shift persists (Fig. 2, lanes 5 and 6 for supercoiled and lanes 13 and 14 for linearized substrate). Based on this, R-loop formation does not require G-quartet formation. Moreover, the amount of R-loop formation is similar or higher for all of our buffers containing Li+, Na+, K+, or Cs+ (Fig. 2, lanes 4 to 7 and lanes 12 to 15 and data not shown) relative to the manufacturer's buffer for T7 RNA polymerase (Fig. 2, lanes 2 and 10), which is prepared using Na+ (and has the same composition as our Na+-based transcription buffer). Therefore, the stability of R-loops appears very unlikely to be reliant on G-quartet formation.
|
3 repeats containing one to four repeats downstream of the T7 promoter in the physiological orientation so that transcription generates a G-rich transcript. If the 41 murine S
3 repeats are assigned numbers 1 through 41, then pDR18 contains repeats 13 through 16 (8, 26). pDR49 contains one repeat (repeat 13), pDR50 contains two repeats (repeats 13 and 14), pDR51 contains three repeats (repeats 13 to 15), and pDR16 does not contain any switch DNA and is used as a "no-switch" DNA control. Transcription reactions were done by using linearized substrates in the presence of radiolabeled UTP (as described in Materials and Methods) and run on agarose gels. Agarose gel analysis shows that an RNase H1-sensitive shifted species is present for two, three, and four repeats but not for one or zero repeats (Fig. 3, lanes 2 and 16, lanes 5 and 17, lanes 8 and 18, lanes 11 and 19, and lanes 14 and 20 for pDR16, pDR49, pDR50, pDR51, and pDR18, respectively). The amount of the shifted species (R-loop) is less for the two-repeat substrate (lane 18) than for the three- or four-repeat substrates (lanes 19 and 20). Therefore, R-loops can form on linear segments of DNA containing only two S
3 repeats.
|
We found that R-loops are present for the one-, two-, three-, and four-repeat-containing substrates (Table 1). As expected from the agarose gel and radiolabeling data (Fig. 3), the order of efficiency of R-loop formation is highest for pDR18, containing four repeats (6.7%), and decreases to 5% for three repeats, 1% for two repeats, and 0.37% for one repeat (Table 1 and see Fig. S3 in the supplemental material). The colony lift hybridization assay for the detection of the R-loops appears to be more sensitive than the gel shift assay, which did not show any detectable R-loop formation for the one-repeat substrate (Fig. 3).
|
3, where a subset of R-loops extend downstream for hundreds of base pairs (12, 13). This difference may be a function of how quickly the G density drops after the last switch repeat. In the genome, this drop is very gradual, whereas on the substrates used here, the drop is very sharp.
|
3 is ca. 49% G on the nontemplate strand, which is much higher than the mammalian genomewide average of
20.5% G content. We sought to determine whether R-looping efficiency decreases with a decrease in G clustering on the nontemplate strand. To assess this, we modified the wild-type four-repeat substrate, pDR18, to make three derivative substrates in a manner that reduces the G-cluster size. For pDR22, the clusters of GGGGG were changed to cGGGc, and the clusters of GGGG were changed to GGGc. This amounts to a change of only 10 bp out of the 189-bp four-repeat switch region, and there is no change in GC density; all G changes are to C. More substantial reductions in G density and G clustering were made for pDR26 (Table 2) and pDR30 (see Materials and Methods and the methods described in the supplemental material). We then used the colony lift hybridization assay to determine the amount of R-loop formation. We found that even these minimal changes in pDR22 decrease the R-loop frequency from 6.7% to 0.23% (Table 2). The location of the R-loops is similar to that found for the wild-type four-repeat substrate (Fig. 5A). The additional decrease in pDR26 drops the frequency to an undetectable level (Table 2). Therefore, the clustering of G's is critical for R-loop formation.
|
|
Dissection of G density from G clustering in R-loop formation efficiency.
After observing the effect of G-cluster size on R-loop formation, we sought to determine whether a complete loss of G clustering can still support R-loop formation in the context of a high G density (identical to the levels of G density on the nontemplate strand of wild-type repeats of murine switch
3). Therefore, we constructed a transcription substrate (pDR54) identical in size and length to the four repeats contained in the wild-type substrate (pDR18) and designed it such that the substrate contains 49.7% G's over a length of 189 bp on the nontemplate strand, but with every second nucleotide being G, thereby abolishing any G clustering (no two G's next to one another). In a transcription-induced shift assay, we found that in comparison to the wild-type substrate that had a strong transcription-induced and RNase H1-sensitive gel mobility shift, this "dispersed G" substrate (pDR54) showed no notable shift (Fig. 6A), suggesting that G density alone is not sufficient for the induction of R-looping in templates with a high G density. However, in our colony lift assay, we did identify two R-looped molecules (Fig. 6C). In these two molecules, the R-loops were located in the G-dense region downstream of the promoter. The frequency of such R-looped molecules was only 0.29%, which is substantially lower than the 6.7% R-loop frequency observed for the wild-type (G-clustered) repeat substrate (pDR18). An enrichment method for detection of R-loops (as described above) allowed us to study additional molecules that were in an R-loop conformation. These experiments indicate that G clustering is the most important determinant of R-loop formation. Although R-loops can be independently supported by extremely G-dense regions, high G density is a less important determinant of R-loop formation.
|
| DISCUSSION |
|---|
|
|
|---|
The stability of the R-loop does not require G-quartet formation (Fig. 2). Therefore, whether the nascent RNA remains unassociated with the template DNA strand versus threading back to anneal with the template strand is determined largely by the energy difference between these two states, and this is clearly a function of the DNA sequence of the region. That is, the RNA polymerase itself may have little role in determining the balance between R-loop formation and no R-loop formation. Experiments in which the species of the RNA polymerase was varied, the temperature was varied, or the ribonucleotide concentration was varied had little effect on R-loop formation (K. Yu, T. E. Wilson, G. A. Daniels, and M. R. Lieber, unpublished results). These are all factors that would influence the rate of transcription, and their lack of effect suggests that the rate of the movement of the RNA polymerase is a secondary issue for R-loop formation. In contrast, the use of ITP in place of GTP resulted in no R-loop formation (9; T. E. Wilson and M. R. Lieber, unpublished data), a finding consistent with the energy of the interaction between the RNA and DNA strands being a critical factor.
The data here support the view that clustering of G's is an important determinant of R-loop formation. In line with this, there is some propensity for the R-loops to begin at the first repeat, regardless of whether there are three or four repeats within the switch region. In particular, the R-loops frequently initiate at the GGGGTGCTGGGGTAGG sequence at the beginning of the first repeat (repeat 13) (Fig. 4 and Fig. 6A). However, this sequence alone cannot efficiently form R-loops (data not shown), and this is obvious from the inefficiency of the one-repeat substrate in forming R-loops (Table 1). Therefore, the length and the G density of the region downstream of such an R-loop initiation site probably determines the efficiency of any R-loop formation. This is further supported by our observations that R-loops located on a high-G-density substrate (but with no clusters), pDR54, are contained within the zone of high G density. Removal of G clusters dramatically decreases the efficiency of R-loop formation, even if the overall G density of the nontemplate strand is maintained. Therefore, it is quite apparent that G clusters support R-loop formation much more efficiently relative to merely a corresponding region of high G density.
The role of G clusters experimentally observed here is distinct from the conjectured role of G clusters in G-quartet formation. As we have shown above, G-quartet formation is not necessary for R-loop formation, and we have observed nothing to indicate that G quartets are forming at the R-loops in vitro or in vivo (12, 13, 43; the present study). In fact, although it occurs at low efficiency, R-loop formation does occur with the fully dispersed G-rich substrate, pDR54 (Fig. 6), and this DNA would not form consecutive planes of G quartets.
If it is not for purposes of G-quartet formation, then why is G clustering more important than mere G-richness for R-loop formation. One possibility relates to the initiation of the R-loop. Clearly, the initiation event requires that a segment of the nascent RNA begin to thread back. This thread back must begin at a few nucleotides (a nucleation site) because the template and nontemplate strands of DNA would not be open for a sufficient length to permit a long segment of RNA to anneal all at once. The initiation or nucleation site would optimally contain more than one G. Hence, G clusters would be favored for this R-loop initiation phase rather than for any postinitiation stabilization phase (such as G-quartet formation).
Short R-loops may be less stable because of the ability of the nontemplate DNA strand to branch migrate so as to displace the RNA of the R-loop. When the R-loop achieves sufficient size, displacement of the RNA due to branch migration of the DNA may be inefficient. We saw no evidence of any R-loops extending downstream of the switch sequences in the present study. Whether R-loops extend downstream (or whether branch migration occurs so as to extend R-loops further downstream) is almost certainly a function of the sequence downstream of the switch regions. For all of the substrates here, the G density falls sharply to a random G density (ca. 20 to 25%) immediately after the last switch repeat. This is in contrast to switch regions in vivo, such as Sµ and S
3, where the G density decreases gradually over several hundred base pairs and, hence, where R-loops extend downstream of the core repeat region (12, 13). Therefore, it seems that the downstream endpoint of R-loops is determined by the G density, and this determines whether the nascent RNA or the nontemplate DNA strand is favored for base pairing with the DNA template strand.
The steep dependence of R-loop formation on G clustering and high G density is noteworthy in an evolutionary context. CSR evolved over a hundred million years after AID had evolved for its function in somatic hypermutation (2, 15, 39). Amphibians have class switch regions that are rich in preferred AID sites (WRC) but are not G-rich on the nontemplate strand (47). Mice and humans have switch regions that not only are uniformly G-rich on the nontemplate strand but also contain clusters of G's, and all of these switch regions are G-rich across much of the repetitive core regions. We speculate that the G clustering and the overall G-richness of mammalian switch regions evolved to drive efficient R-loop formation so as to make a more efficient single-stranded DNA target at which AID can act. Based on our studies here, the mammalian switch regions are at approximately the G-clustering and G-density level that is needed to efficiently form R-loops. From an evolutionary standpoint, there would have been little reason for the G clustering and G density to evolve to even higher levels, once they had reached a sufficient level. The fact that the G clustering is close to the minimum needed for efficient R-loop formation is yet another reason for regarding R-loop formation as the basis for the high G clustering of the nontemplate strand. Otherwise, it is unclear why the G clusters would evolve to precisely this critical point.
The contribution of R-loop formation to mammalian class switch recombination may be
4-fold for each switch region, based on the fact that the inversion of S
1 results in a 4-fold drop in CSR (34). The substitution of a Xenopus Sµ region in place of S
1 also shows a fourfold reduction, and this is consistent with the S
1 inversion data because the Xenopus segment does not form R-loops (47). Hence, the downstream (acceptor) switch regions appear to have evolved a G-richness on the top strand to improve their use as targets by the single-strand specific AID enzyme. Although this enrichment may only be
4-fold for each switch region, this may improve overall CSR substantially based on the fact that the ratio of switched isotypes to IgM is often 100-fold or more in mammals but is typically 1 or less in Xenopus.
| ACKNOWLEDGMENTS |
|---|
This study was supported by NIH grants to M.R.L. D.R. was supported in part by the Cellular, Biochemical, and Molecular Training Program.
| FOOTNOTES |
|---|
Published ahead of print on 22 October 2007. ![]()
Supplemental material for this article may be found at http://mcb.asm.org/. ![]()
Present address: Microbiology and Molecular Genetics, Michigan State University, 5175 Biomedical Physical Sciences, East Lansing, MI 48824. ![]()
| REFERENCES |
|---|
|
|
|---|
2. Barreto, V. M., Q. Pan-Hammarstrom, Y. Zhao, L. Hammarstrom, Z. Misulovin, and M. C. Nussenzweig. 2005. AID from bony fish catalyzes class switch recombination. J. Exp. Med. 202:733-738.
3. Basu, U., J. Chaudhuri, R. T. Phan, A. Datta, and F. W. Alt. 2007. Regulation of activation induced deaminase via phosphorylation. Adv. Exp. Med. Biol. 596:129-137.[Medline]
4. Bransteitter, R., P. Pham, M. D. Scharff, and M. F. Goodman. 2003. Activation-induced cytidine deaminase deaminates deoxycytidine on single-stranded DNA but requires the action of RNase. Proc. Natl. Acad. Sci. USA 100:4102-4107.
5. Chaudhuri, J., and F. W. Alt. 2004. Class-switch recombination: interplay of transcription, DNA deamination, and DNA repair. Nat. Rev. Immunol. 4:541-552.[CrossRef][Medline]
6. Daniels, G. A., and M. R. Lieber. 1995. RNA:DNA complex formation upon transcription of immunoglobulin switch regions: implications for the mechanism and regulation of class switch recombination. Nucleic Acids Res. 23:5006-5011.
7. Drolet, M., S. Broccoli, F. Rallu, C. Hraiky, C. Fortin, E. Masse, and I. Baaklini. 2003. The problem of hypernegative supercoiling and R-loop formation in transcription. Front. Biosci. 8:d210-221.[Medline]
8. Dunnick, W. A., G. Z. Hertz, L. Scappino, and C. Gritzmacher. 1993. DNA sequence at immunoglobulin switch region recombination sites. Nucleic Acids Res. 21:365-372.
9. Duquette, M. L., P. Handa, J. A. Vincent, A. F. Taylor, and N. Maizels. 2004. Intracellular transcription of G-rich DNAs induces formation of G-loops, novel structures containing G4 DNA. Genes Dev. 18:1618-1629.
10. Fan, J., Y. Matsumoto, and D. M. Wilson. 2006. Nucleotide sequence and DNA secondary structure, as well as replication protein A, modulate the single-stranded abasic endonuclease activity of APE1. J. Biol. Chem. 281:3889-3898.
11. Gritzmacher, C. A. 1989. Molecular aspects of heavy-chain class switching. Crit. Rev. Immunol. 9:173-200.[Medline]
12. Huang, F.-T., K. Yu, B. B. Balter, E. Selsing, Z. Oruc, A. A. Khamlichi, C.-L. Hsieh, and M. R. Lieber. 2007. Sequence dependence of chromosomal R-loops at the immunoglobulin heavy chain Smu class switch region. Mol. Cell. Biol. 27:5921-5932.
13. Huang, F.-T., K. Yu, C.-L. Hsieh, and M. R. Lieber. 2006. The downstream boundary of chromosomal R-loops at murine switch regions: implications for the mechanism of class switch recombination. Proc. Natl. Acad. Sci. USA 103:5030-5035.
14. Huertas, P., and A. Aguilera. 2003. Cotranscriptionally formed DNA:RNA hybrids mediate transcription elongation impairment and transcription-associated recombination. Mol. Cell 12:711-721.[CrossRef][Medline]
15. Ichikawa, H. T., M. P. Sowden, A. T. Torelli, J. Bachl, P. Huang, G. S. Dance, S. H. Marr, J. Robert, J. E. Wedekind, H. C. Smith, and A. Bottaro. 2006. Structural phylogenetic analysis of activation-induced deaminase function. J. Immunol. 177:355-361.
16. Imai, K., G. Slupphaug, W. I. Lee, P. Revy, S. Nonoyama, N. Catalan, L. Yel, M. Forveille, B. Kavli, H. E. Krokan, H. D. Ochs, A. Fischer, and A. Durandy. 2003. Human uracil-DNA glycosylase deficiency associated with profoundly impaired immunoglobulin class-switch recombination. Nat. Immunol. 4:1023-1028.[CrossRef][Medline]
17. Kramer, P. R., and R. R. Sinden. 1997. Measurement of unrestrained negative supercoiling and topological domain size in living human cells. Biochemistry 36:3151-3158.[CrossRef][Medline]
18. Lee, D. Y., and D. A. Clayton. 1996. Properties of a primer RNA-DNA hybrid at the mouse mitochondrial DNA leading-strand origin of replication. J. Biol. Chem. 271:24262-24269.
19. Li, X., and J. L. Manley. 2005. Inactivation of the SR protein splicing factor ASF/SF2 results in genomic instability. Cell 122:365-378.[CrossRef][Medline]
20. Masukata, H., and J. Tomizawa. 1990. A mechanism of formation of a persistent hybrid between elongating RNA and template DNA. Cell 62:331-338.[CrossRef][Medline]
21. Min, I. M., L. R. Rothlein, C. E. Schrader, J. Stavnezer, and E. Selsing. 2005. Shifts in targeting of class switch recombination sites in mice that lack mu switch region tandem repeats or Msh2. J. Exp. Med. 201:1885-1890.
22. Min, I. M., C. E. Schrader, J. Vardo, T. M. Luby, N. D'Avirro, J. Stavnezer, and E. Selsing. 2003. The Smu tandem repeat region is critical for Ig isotype switching in the absence of Msh2. Immunity 19:515-524.[CrossRef][Medline]
23. Min, I. M., and E. Selsing. 2005. Antibody class switch recombination: roles for switch sequences and mismatch repair proteins. Adv. Immunol. 87:297-328.[Medline]
24. Muramatsu, M., V. Sankaranand, S. Anant, M. Sugai, K. Kinoshita, N. Davidson, and T. Honjo. 1999. Specific expression of activation-induced cytidine deaminase (AID), a novel member of the RNA-editing deaminase family in germinal center B cells. J. Biol. Chem. 274:18470-18476.
25. Petersen-Mahrt, S. K., R. S. Harris, and M. S. Neuberger. 2002. AID mutates Escherichia coli suggesting a DNA deamination mechanism for antibody diversification. Nature 418:99-103.[Medline]
26. Petrini, J., and W. Dunnick. 1989. Products and implied mechanism of H chain switch recombination. J. Immunol. 142:2932-2935.[Abstract]
27. Rada, C., G. T. Williams, H. Nilsen, D. E. Barnes, T. Lindahl, and M. S. Neuberger. 2002. Immunoglobulin isotype switching is inhibited and somatic hypermutation perturbed in UNG-deficient mice. Curr. Biol. 12:1748-1755.[CrossRef][Medline]
28. Reaban, M. E., and J. A. Griffin. 1990. Induction of RNA-stabilized DNA conformers by transcription of an immunoglobulin switch region. Nature 348:342-344.[CrossRef][Medline]
29. Reaban, M. E., J. Lebowitz, and J. A. Griffin. 1994. Transcription induces the formation of a stable RNA:DNA hybrid in the immunoglobulin alpha switch region. J. Biol. Chem. 269:21850-21857.
30. Roberts, R. W., and D. M. Crothers. 1992. Stability and properties of double and triple helices: dramatic effects of RNA or DNA backbone composition. Science 258:1463-1466.
31. Ronai, D., M. D. Iglesias-Ussel, M. Fan, Z. Li, A. Martin, and M. D. Scharff. 2007. Detection of chromatin-associated single-stranded DNA in regions targeted for somatic hypermutation. J. Exp. Med. 204:181-190.
32. Saenger, W. 1984. Principles of nucleic acid structure. Springer-Verlag, New York, NY.
33. Selsing, E. 2006. Ig class switching: targeting the recombinational mechanism. Curr. Opin. Immunol. 18:249-254.[CrossRef][Medline]
34. Shinkura, R., M. Tian, C. Khuong, K. Chua, E. Pinaud, and F. W. Alt. 2003. The influence of transcriptional orientation on endogenous switch region function. Nat. Immunol. 4:435-441.[CrossRef][Medline]
35. Stavnezer, J., and C. T. Amemiya. 2004. Evolution of isotype switching. Semin. Immunol. 16:257-275.[CrossRef][Medline]
36. Stavnezer, J., and S. Sirlin. 1986. Specificity of immunoglobulin heavy chain switch correlates with activity of germline heavy chain genes prior to switching. EMBO J. 5:95-102.[Medline]
37. Szurek, P., J. Petrini, and W. Dunnick. 1985. Complete nucleotide sequence of the murine g3 switch region and analysis of switch recombination sites in two
3-expressing hybridomas. J. Immunol. 135:620-626.[Abstract]
38. Tian, M., and F. W. Alt. 2000. Transcription induced cleavage of immunoglobulin switch regions by nucleotide excision repair nucleases in vitro. J. Biol. Chem. 275:24163-24172.
39. Wakae, K., B. G. Magor, H. Saunders, H. Nagaoka, A. Kawamura, K. Kinoshita, T. Honjo, and M. Muramatsu. 2006. Evolution of class switch recombination function in fish activation-induced cytidine deaminase, AID. Int. Immunol. 18:41-47.
40. Westover, K. D., D. A. Bushnell, and R. D. Kornberg. 2004. Structural basis of transcription: separation of RNA from DNA by RNA polymerase II. Science 303:1014-1016.
41. Xu, B., and D. A. Clayton. 1996. RNA-DNA hybrid formation at the human mitochondrial heavy-strand origin ceases at replication start sites: an implication for RNA-DNA hybrids serving at primers. EMBO J. 15:3135-3143.[Medline]
42. Xu, L., B. Gorham, S. C. Li, A. Bottaro, F. W. Alt, and P. Rothman. 1993. Replacement of germ-line
promoter by gene targeting alters control of immunoglobulin heavy chain class switching. Proc. Natl. Acad. Sci. USA 90:3705-3709.
43. Yu, K., F. Chedin, C.-L. Hsieh, T. E. Wilson, and M. R. Lieber. 2003. R-loops at immunoglobulin class switch regions in the chromosomes of stimulated B cells. Nat. Immunol. 4:442-451.[CrossRef][Medline]
44. Yu, K., F. T. Huang, and M. R. Lieber. 2004. DNA substrate length and surrounding sequence affect the activation induced deaminase activity at cytidine. J. Biol. Chem. 279:6496-6500.
45. Yu, K., and M. R. Lieber. 2003. Nucleic acid structures and enzymes in the immunoglobulin class switch recombination mechanism. DNA Repair 2:1163-1174.[Medline]
46. Yu, K., D. Roy, M. Bayramyan, I. S. Haworth, and M. R. Lieber. 2005. Fine-structure analysis of activation-induced deaminase accessbility to class switch region R-loops. Mol. Cell. Biol. 25:1730-1736.
47. Zarrin, A. A., F. W. Alt, J. Chaudhuri, N. Stokes, D. Kaushal, L. DuPasquier, and M. Tian. 2004. An evolutionarily conserved target motif for immunoglobulin class-switch recombination. Nat. Immunol. 5:1275-1281.[CrossRef][Medline]
This article has been cited by other articles:
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||