Previous Article | Next Article ![]()
Molecular and Cellular Biology, March 2005, p. 1730-1736, Vol. 25, No. 5
0270-7306/05/$08.00+0 doi:10.1128/MCB.25.5.1730-1736.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry and Molecular Biology,1 Departments of Pathology, Molecular Microbiology and Immunology, and Biological Sciences, University of Southern California Keck School of Medicine,2 Department of Pharmaceutical Sciences, University of Southern California School of Pharmacy, Los Angeles, California3
Received 2 October 2004/ Returned for modification 15 November 2004/ Accepted 24 November 2004
|
|
|---|
|
|
|---|
for IgG,
for IgA, and
for IgE, respectively). Class switch recombination is mediated by large (2- to 12-kb) repetitive regions, called switch regions, upstream of each coding constant-region exon (14). Recombination is thought to occur through a DNA double-strand break (DSB) intermediate (25). CSR generates DSBs that can be detected by ligation-mediated PCR (4, 30, 36); moreover, the deleted intervening DNA between the two breaks forms a circular molecule (17, 18, 22, 35), which is also detectable in class-switching B cells. The joining of the two chromosomal ends of the DSB is thought to be mediated by the nonhomologous DNA end-joining pathway (3, 6, 20, 21), although dependence on some components is not certain (1). A key step in committing a certain switch region to recombination is transcription through that switch region from an upstream promoter (called an I exon promoter) (39). These transcripts are called germ line transcripts [to be distinguished from the transcript containing the V(D)J exon] or sterile transcripts, because they do not encode any protein (14, 39). The germ line transcripts are highly G rich, due to an asymmetry between a G-rich nontemplate and a C-rich template DNA strand of the switch region. The two switch regions involved in CSR do not share any extensive homology, and switch junctions do not reveal any consensus sequence motif. This type of DNA recombination event can be best described as regionally specific recombination (19, 39).
When switch regions are transcribed in vitro, the RNA transcripts stay bound to the DNA template, forming an RNA-DNA hybrid (9, 29, 34, 37). Biochemical probing has provided evidence that the RNA-DNA hybrid forms in vivo in switch regions and is most consistent with an R-loop structure (37). R-loop formation depends on transcription through switch regions in the correct orientation. Recent mouse models with inverted S
1 switch regions show markedly reduced CSR efficiency (32). In addition, an artificial DNA fragment can partially replace S
1 in CSR but only in the orientation capable of R-loop formation and not in the other orientation (32). All of these features are consistent with R-loop formation at the switch regions upon transcription.
The only lymphoid-specific factor required for CSR was identified by a subtractive hybridization from a B-cell line stimulated to undergo switching (23, 24). The identified gene product was termed activation-induced cytidine deaminase (AID) because it bears sequence homology to an RNA-editing enzyme, APOBEC1, which deaminates a cytidine residue in the ApoB100 RNA. How AID functions in CSR is currently under debate. One model proposes that AID, like APOBEC1, edits an mRNA which then encodes the active recombinase (15). The other model proposes that AID directly deaminates cytidine in the DNA (11, 26) and the resulting uracil is then processed by the base excision repair components, including uracil DNA glycosylase (UDG) or uracil nucleotide glycosylase (UNG) and the apurinic-apyrimidinic (AP) endonuclease (APE), to generate a nick on the DNA strand. UNG null mice (28) and human patients bearing UNG mutations (16) are impaired for CSR, supporting the DNA deaminase model. A number of biochemical studies have shown that purified recombinant AID protein is capable of deamination of cytidine residues on single-stranded DNA in vitro (2, 8, 10, 33, 38). The elucidation of the R-loop structure in vitro and in vivo immediately implied a possible mechanism for generating DNA breaks in the switch region. The displaced template in an R-loop is single stranded, and that template might be the target for AID (6, 13, 39).
Though it is clear that the displaced strand in an R-loop is accessible to the bisulfite anion, it remains to be tested whether the same single strand is accessible to agents of protein size, specifically the 24-kDa AID, which, even if a globular monomer, would have a theoretical radius of approximately 25 Å. The displaced strand must still wrap around the RNA-DNA duplex, so it is not entirely like a segment of free single-stranded DNA (Fig. 1). The distance between the displaced strand and the RNA-DNA duplex (which itself is likely to be A form) can vary from less than 4 to more than 21 Å. Hence, AID may experience steric hindrance in acting on cytidines within this displaced strand.
![]() ![]() View larger version (61K): [in a new window] |
FIG. 1. Two- and three-dimensional representations of R-loops. (A) Two-dimensional diagrams of R-loops. The thick horizontal lines represent each of the two DNA strands. The thin horizontal line represents the RNA transcript. Vertical dashes indicate base pairing. The short thick arrow above the R-loop indicates the position and direction of the RNA polymerase that is generating the RNA. The upper diagram shows the displaced strand with maximal displacement from the RNA-DNA duplex but with zero twist relative to the RNA-DNA duplex. The lower diagram illustrates the displaced DNA strand with shorter distance of displacement from the RNA-DNA duplex and with some twist relative to the RNA-DNA duplex. (B) Three-dimensional model of an R-loop. One switch repeat is shown configured as an R-loop. On the left, the structure is viewed from one end, with the RNA-DNA duplex (A-form) in the center and the displaced nontemplate strand (green) on the perimeter. On the right, a longitudinal view is shown. The template strand loses two helical turns (720°) as it unwinds and extends away from the RNA-DNA duplex, with a maximum displacement of 21 Å from the duplex. For comparison, a globular 24-kDa protein (the size of an AID monomer) would have a radius of approximately 50 Å. Green, nontemplate strand (displaced R-loop); red, template strand; blue, RNA; yellow, cytosine bases in the R-loop.
|
Here we explore R-loop-AID interactions at sequencing gel resolution. This resolution can be achieved only by limiting the region of R-loop formation to a much shorter length than that of a full switch region. We find that AID does have access to the displaced strand of the R-loop and a much more limited access to the template DNA strand. However, the preferred action of AID at WRC (where W is A or T and R is A or G) sites on single-stranded DNA is altered on the displaced strand of the R-loop, presumably reflecting the steric constraints to AID access.
|
|
|---|
In vitro transcription. One microgram of supercoiled plasmid prepared from a CsCl gradient was transcribed in vitro in a 20-µl reaction mixture with either T7 or T3 RNA polymerase according to a Promega protocol. The reaction mixture was incubated at 37°C for 1 h, and the RNA polymerase was heat inactivated by incubation at 70°C for 20 min. Free RNA was removed by adding 1 µg of RNase A and incubating the mixture for 30 min at 37°C. Nucleic acid was then purified by phenol and chloroform extraction followed by ethanol precipitation.
Detection of AID-mediated cytidine deamination on an R-loop. R-loop DNA (T7-transcribed plasmid) was incubated with purified recombinant mouse AID and bacterial UNG (Invitrogen) for 1 h at 37°C in a buffer containing 25 mM Tris-HCl (pH 8.0), 50 mM NaCl, and 5 mM EDTA. Typically, 50 ng of purified recombinant AID protein (0.8 pmol) was used to treat 100 ng of R-loop plasmid (0.15 pmol, corresponding to approximately 4 pmol of potential C residues on the displaced G-rich strand of the R-loop) in a 10-µl total reaction volume. Nucleic acid was purified by phenol and chloroform extraction followed by ethanol precipitation. Purified nucleic acid was resuspended in Tris-EDTA buffer (pH 8.0). In the primer extension reaction, 10 ng of purified nucleic acid from the AID-treated sample was added to a 10-µl reaction mixture containing a 1 µM concentration of 32P-labeled T3 primer, a 200 µM concentration of each deoxynucleoside triphosphate, 0.5 U of Vent exo, and 1x Thermopol buffer supplied by New England BioLabs. Primer extension was done by incubation of the mixture for 2 min at 95°C, 5 min at 48°C, and 15 min at 72°C with a Robocycler 96 (Stratagene, La Jolla, Calif.). The primer extension mixture was then resolved on a 6% denaturing polyacrylamide gel containing 1x Tris-borate-EDTA and 7 M urea. Primer extension products were visualized by exposing the dried gel to a phosphorimager screen and scanned with a molecular imager FX (Bio-Rad, Hercules, Calif.). Gel quantitation was done with Quantity One v4.2.3 (Bio-Rad).
Three-dimensional model of an R-loop. R-loop structures were built with an in-house nucleic acid-building algorithm, NASDAC (5). Initial base positions were generated through a series of rotational and translational input parameters. A-DNA helical parameters were used for duplex generation. The "unwound" structures were built by allowing the terminal bases of the coding strand to remain as a part of the hybrid while unwinding the remainder of the strand. The nontemplate strand gradually loses two helical turns (720° of twist over 39 bases), which allows it to unwind and extend away from the A-form duplex. A maximum displacement of 21 Å from the duplex axis was achieved; further displacement causes too great a distance between glycosidic N atoms of successive bases in the displaced strand. NASDAC-generated structures were energy minimized in the AMBER force field. Na+ cations were added for electroneutrality. Energy minimization of the structures was carried out in two steps. Initially, all the base atoms were constrained to their initial positions, allowing only the backbone atoms to relax. This constraint was followed by relaxation of all atoms. Three thousand cycles of minimization were performed in each step: 500 cycles of steepest-descent minimization followed by 2,500 cycles of conjugate gradient minimization. Conditions included a distance-dependent dielectric constant and a nonbonded cutoff of 12 Å. Structures were visualized with the Weblab viewer visualization program.
|
|
|---|
3 sequence required for generation of a stable R-loop on a plasmid.
In order to access AID sites of deamination upon sequencing gel resolution, it was necessary to use short lengths of switch sequences that were still sufficient to efficiently form R-loops. We constructed a series of plasmids that contained 1, 1.5, 2, and 2.5 repeats of mouse S
3 (Fig. 2). When transcribed in a physiological orientation by T7 phage RNA polymerase, only plasmids containing 2.5 repeats (pTW-EL54) showed a significant electrophoretic mobility shift (Fig. 2, lane 11), whereas the smaller switch regions were equivalent to plasmids lacking switch region DNA (Fig. 2, lanes 2, 5, and 8). The resulting shift was sensitive to RNase H treatment (Fig. 2, lane 12), indicating the presence of an RNA-DNA hybrid. Bisulfite sequencing of these transcribed 2.5-repeat molecules confirmed that the top strand was single stranded and the bottom strand was base paired, consistent with the R-loop structure that we have demonstrated previously (data not shown) (37). Hence, 2.5 repeats appeared to be sufficient for stable R-loop formation.
![]() View larger version (45K): [in a new window] |
FIG. 2. R-loop formation on short segments of S 3 switch region DNA. Plasmids harboring 1, 1.5, 2, and 2.5 S 3 repeats were transcribed in vitro with T7 RNA polymerase. The transcription products were resolved on an agarose gel stained with ethidium bromide after electrophoresis. The components of each reaction are indicated at the top of the gel. NC, nicked circular; SC, supercoiled.
|
3 repeats was digested with restriction enzymes to determine whether the structure extends outside of the switch region (Fig. 3A). We found that unique enzyme sites (SacI and HindIII) located upstream of the switch sequence but downstream of the T7 promoter could not be digested to completion (Fig. 3A, lanes 4 and 6) unless RNase H was added to the digestion mixture (Fig. 3A, lanes 5 and 7). This finding indicated that a fraction of the R-loop extends upstream of the switch region. However, unique sites upstream of the T7 promoter (PstI) could be digested to completion even in the absence of RNase H, indicating that the R-loop does not extend beyond the promoter. When we digested the R-loop with enzymes (KpnI and XbaI) that have unique sites downstream of the switch region, the digestion always went to completion, even in the absence of RNase H. This finding suggests that R-loop formation terminates promptly by the time that the RNA polymerase reaches the end of the 2.5-repeat switch region.
![]() View larger version (30K): [in a new window] |
FIG. 3. Extent of R-loop formation on a 2.5-S 3 repeat segment. (A) The top panel is a diagram representing a part of pTW-EL54. Shaded thick arrows represent S 3 repeats. The lower panel shows the digestion products of T7 RNA polymerase-transcribed pTW-EL54 (R-loop). Irrelevant lanes between lanes 7 and 8 have been removed. The location of the R-loop was probed by testing for the cleavability of restriction sites upstream and downstream of the switch region. Bands below the linear position represent uncut plasmid due to the R-loop formation. RNase H removes the RNA of the R-loop, thereby permitting the two DNA strands to anneal and become cleavable by the restriction enzymes. The brackets indicate the undigested plasmid due to R-loop formation in the region of the restriction enzyme sites. (B) R-loop status of the middle of the 125-bp switch region. A PvuII restriction site within the middle of the 2.5-repeat switch region was tested for cleavability. NC, nicked circular; L, linear; SC, supercoiled; T7, T7 promoter; P, PstI; S, SacI; H, HindIII; K, KpnI; X, XbaI.
|
AID can deaminate cytidine on the displaced G-rich strand of an R-loop. To detect AID-mediated deamination on an R-loop, we treated a minimal R-loop (Fig. 2, lane 11) with recombinant mouse AID purified from baculovirus-infected insect cells (Fig. 4A). AID-treated R-loops were then incubated with Escherichia coli uracil glycosylase, which removes the U to generate an abasic site. The resulting AP site could be detected by primer extension because the DNA polymerase used here (Vent exo) could not bypass this AP site.
![]() View larger version (50K): [in a new window] |
FIG. 4. Enzyme and strand specificity for AID action on an R-loop. (A) Schematic diagram illustrating the assay for AID action on an R-loop (top strand). R-loops were generated by transcribing supercoiled plasmid with T7 RNA polymerase. Purified R-loops were then treated with AID and UDG, followed by primer extension with a radioactively labeled primer to detect the AP site that results from AID and UDG action. Diagrammed DNA strands are represented by straight lines, and the RNA transcript is represented by a wavy line. The oval represents the switch region. Arrows above the oval indicate switch repeats. The T7 and T3 promoters are indicated by arrows flanking the switch region. A star indicates the radioactive label. (B) Sequence of the G-rich displaced DNA strand of the 2.5-S 3 repeat switch region. All C residues are underlined. Each bold C corresponds to a WRC sequence. (C) AID action on the displaced G-rich strand of an R-loop. Primer extension was carried out with a labeled T3 primer, shown at the bottom left of the gel. Each dot indicates the position of a cytidine residue on the G-rich strand (each closed circle conforms to a WRC sequence, and open circles do not). The components of the reaction are shown at the top of the gel. Single-stranded markers (in nucleotides) are shown on the right. (D) AID action on the C-rich strand of an R-loop, which is base paired with the RNA transcript. Primer extension was carried out with a labeled T7 primer, shown at the bottom left of the gel. All other symbols are the same as described for panel C. The components of the reaction are shown at the top of the gel. Single-stranded markers are shown on the right.
|
When the assay was performed with DNA that does not have an R-loop structure, such as an untranscribed supercoiled plasmid (Fig. 4C, lane 7), a plasmid transcribed with T3 RNA polymerase (nonphysiological orientation) (Fig. 4C, lane 6), or an R-loop that was treated with RNase H (Fig. 4C, lane 5), we did not detect any stalled primer extension product. This finding indicates that AID action is dependent on the R-loop structure. In addition, the primer extension products align exactly with each C residue of the G-rich strand within the switch region (Fig. 4C, lanes 1 and 4; sequencing ladder not shown). When we used another primer (KY439) further downstream for the primer extension, a corresponding shift in size for every band was observed (data not shown), confirming that the primer extension results are attributable to the C deamination sites inferred.
When we analyzed the C-rich strand by using labeled T7 primer as for the extension reaction, we also observed some weak bands (Fig. 4D, lanes 1 and 4), although the intensity of the bands was only slightly higher than the background intensity. From our previous bisulfite sequencing analysis, we knew that the C-rich strand was paired with the RNA transcript. We suspected that this minimal amount of activity on the C-rich strands results from deamination of unpaired C's at the border of the R-loop. Similar to that of the G-rich strand, the primer extension product observed for C-rich strands was also dependent on AID and UDG (Fig. 4D, lanes 2 and 3) and dependent on the R-loop structure (Fig. 4D, lanes 5 to 7). When AID- but not UDG-treated R-loops were transformed into an ung null bacterial strain (NR8052), C-to-T mutations were detected (data not shown), consistent with the action of AID as a cytidine deaminase. However, clones containing these conversions were rare among all cloned molecules.
The deamination intensities conformed largely to the WRCr (where W is A or T, R is A or G, and r indicates a small preference for purine) preferences defined previously for single-stranded DNA. However, there were clear deviations. For example, the third labeled cytidine (Fig. 4D, lanes 1 and 4) was a strong WRC site on purely single-stranded DNA but was barely detectable as a site of AID action. Similarly, the first labeled cytidine should have been deaminated as well as the second labeled cytidine (Fig. 4D, lanes 1 and 4), yet it was not. These deviations are most readily explained by steric constraints within the R-loop conformation that restrict access by AID at these particular sites.
|
|
|---|
Constraints on AID accessibility to the displaced DNA strand of the R-loop. AID has been demonstrated to have quantitative site preferences within single-stranded DNA (27, 38). The two nucleotides upstream and the one nucleotide downstream of the cytidine influence the AID specificity over a 10-fold range (38). If the displaced strand within the R-loop existed simply as single-stranded DNA, then we would expect the profile of relative deaminations to match that seen for single-stranded DNA. Several specific deviations illustrate that this is not entirely the case. This finding indicates that the R-loop conformation does constrain some sites from AID action.
Interestingly, analysis of the mutations around chromosomal switch junctions also does not conform uniformly to the WRCr rule at C residues (12, 31). Our findings here provide a potential explanation for these mutations that is based in the R-loop conformation.
Structure of transcription-induced R-loops. R-loops generated by transcription are quite distinct from those generated by assembly of RNA and DNA oligonucleotides (34). During transcription, positive superhelical tension is generated ahead of the RNA polymerase and negative superhelical tension is generated behind the polymerase. Even more importantly, the displaced DNA strand will wrap around the RNA-DNA duplex with a transient periodicity defined by the size of the RNA polymerase. The greater the distance that the polymerase pushes the displaced strand away from the RNA-DNA duplex, the less frequently the displaced strand will wrap around the RNA-DNA duplex during transcription. At the extreme, the displaced strand would not wrap around the duplex at all, which is how we and others have typically drawn R-loops in two dimensions (Fig. 1A). However, at the other extreme, the displaced DNA strand could wrap within the major groove and have a periodicity that is the same as that of the RNA-DNA duplex. The diameter of the RNA polymerase protein determines the distance with which the displaced strand is transiently pushed away from the RNA-DNA duplex.
After the RNA polymerase has passed through the region, the displaced DNA strand might conceivably change position, given that the positive superhelical tension ahead and the negative superhelical tension behind the R-loop will dissipate by diffusion into the adjacent regions of normal duplex DNA. In addition, topoisomerases will compensate for positive and negative superhelical deviations.
We are confronted with a particularly refractory conformational problem when considering transcription-induced R-loops. These must be generated by transcription, which means that they are too heterogeneous (37) and too large for nuclear magnetic resonance or crystallography. Visual resolution of the single strands by transmission electron microscopy, cryoelectron microscopy, or atomic force microscopy is not possible. In light of the lack of structural approaches, we are limited to chemical and enzymatic probing of the R-loop. We have previously reported sequence-level chemical probing of the bottom and top DNA strands of 2-kb transcription-induced R-loops (37). The present study uses a natural enzyme, AID, to probe the R-loop structure.
The fact that even this 50-kDa protein (24-kDa AID plus the 26-kDa glutathione S-transferase moiety) can gain access to the displaced strand of the R-loop is consistent with the results of our earlier chemical probing methods. The bisulfite chemical probing would be inconsistent with the displaced strand lying in the major groove in a triplex conformation, because stacking of the bases in that strand would prevent a nucleophilic attack by bisulfite. Access by AID to this strand provides an independent confirmation that this is a single strand and is accessible.
Upstream boundary of transcription-induced R-loops and AID action. Our data are from sequencing gels rather than low-resolution gels. This fact permits the first precise correlation of the boundaries of the R-loop and the boundary of the AID action on the top strand of transcription-induced R-loops. The boundary of the R-loop extends further upstream than simply the 2.5 switch repeats. This finding is based on restriction enzyme digestion upstream of the switch region on R-loop plasmids. Restriction enzymes fail to cut for about 20 to 30 bp upstream of the R-loop. Interestingly, AID also deaminates for a distance of about 20 to 30 nucleotides on the displaced strand upstream of the switch region. Why should the R-loop extend upstream of the switch region, even if only for 20 to 30 bp? We are not certain, but branch migration of the R-loop is the most likely explanation. In such migration, the free RNA may anneal with the template DNA strand, thereby displacing the top strand. Further extension of the branch migration may be limited by constraints of superhelical tension.
|
|
|---|
-H2AX focus formation and mutations at sites of class switching. Nature 414:660-665.[CrossRef][Medline]
regions prior to class switch recombination. EMBO J. 22:5893-5903.[CrossRef][Medline]
3 DNA-specific double strand breaks are induced in mitogen-activated B cells and are implicated in switch recombination. J. Immunol. 159:4139-4144.[Abstract]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»