Previous Article | Next Article ![]()
Molecular and Cellular Biology, June 2004, p. 5620-5634, Vol. 24, No. 12
0270-7306/04/$08.00+0 DOI: 10.1128/MCB.24.12.5620-5634.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry, Indian Institute of Science, Bangalore 560012,1 Jawaharlal Nehru Centre for Advanced Scientific Research, Jakkur, Bangalore 560084, India2
Received 6 February 2004/ Returned for modification 4 March 2004/ Accepted 23 March 2004
|
|
|---|
|
|
|---|
Several recombination hot spots have been identified in humans and mice. In humans this has been possible because of the availability of high resolution physical and genetic maps, while analysis of recombinants in crosses between laboratory strains have helped identify the hot spots in mice. Among the few hot spots that have been mapped to a fine resolution, the human hotspots include (i) minisatellite-associated hot spot MS 32 (20), (ii) the CMT1A/HNPP hot spot associated with hereditary neuropathies (39), and (iii) the major histocompatibility complex (MHC)-associated TAP2 and DNA2 hot spots (19, 21). In the mouse, a total of eight hotspots have been identified so far (56), and the ones that have been characterized to a high degree of resolution include (i) the Lmp2/Psmb9 hot spot (32), (ii) the hot spot located in the Eb second intron (57), (iii) the Pb hot spot (18), and (iv) the hot spot located in the Ea fourth intron (22). All the known mouse recombination hot spots have been identified in the MHC region by pedigree analysis, hence not much is known about the hot spot morphology and the recombination processes operating within them, except for the recently published study of the mouse MHC Eb hot spot showing crossover asymmetry and the resulting biased gene conversion of markers (56). More importantly, little is known about the recombinational activity of DNA outside these mouse MHC hot spots.
The primary DNA sequence features associated with known recombination hot spots are long terminal repeat (LTR) elements, transposable sequences, hypervariable minisatellite sequences (HVMS), and GT repeats (29, 46, 53). It is now becoming clear that increased accessibility of DNA within the chromatin to the recombination machinery is an important prerequisite for initiating recombination. Open chromatin structure can manifest because of the intrinsic nature of the DNA sequence (bent DNA sequence, scaffold/matrix-associated regions [S/MARs]) that makes it exclude nucleosomes (ß hot spots) or as a result of transcription factor binding and transcriptional activity in the region (
hot spots) (34, 36). The mechanistic relationship between meiotic recombination and transcription has been substantiated by the global mapping of recombination hotspots in S. cerevisiae (13).
S/MARs are DNA elements 300 bp to several thousand base pairs long which mediate attachment of the chromatin to the nuclear scaffold or matrix and are separated by loops of approximately 60 to 100 kb. Important processes like replication and transcription have been shown to take place at macromolecular complexes located on the nuclear matrix (30). Actively transcribed genes are closely associated with the nuclear matrix, although the actual DNA attachment sites are within the noncoding regions. S/MARs are expected to be recombinogenic due to their significant potential for alternate secondary structure and the presence of curved sequences, which promote base unpairing under superhelical stress (5). This expectation is supported by the observations that transfected S/MAR constructs integrate at much higher copy numbers than their S/MAR-free counterparts and that they are the dominant integration sites for retroviruses (5). S/MARs have therefore been shown to be associated with illegitimate and site-specific recombination, though nothing much is known about their association with meiotic recombination.
The pachytene interval of meiotic prophase is also known to exhibit high levels of DNA repair synthesis (35). DNA polymerase ß has been shown to immunolocalize to discrete sites along homologous chromosomes during zygotene and pachytene intervals (37). More recently, a novel DNA polymerase, polymerase
, has been reported which is expressed only in late pachytene spermatocytes (12), suggesting distinct functions for DNA polymerases ß and
during meiotic recombination. Earlier work from our laboratory, while characterizing the DNA repair sites of pachytene spermatocytes, had resulted in the isolation of a 1.3-kb DNA fragment from the rat genome, which contained several recombination potentiating sequences (38), that is highly conserved in most mammalian species (unpublished data). We have now isolated a larger 17.2-kb fragment from the mouse genome flanking the mouse 1.3-kb sequence and used haplotype and LD analysis to provide evidence that the 17.2-kb locus in the mouse genome does define a recombination hot spot. We also provide the first estimate of the recombination parameter R for any mouse hot spot locus and the recombination intensity and average size of gene conversion tracts for this locus. A close correlation between hot spot activity and the presence of S/MAR elements was observed for the 17.2-kb locus as well as for the three other previously mapped mouse MHC hotspots. Our results therefore suggest S/MAR elements may define an important new molecular feature of meiotic hotspot activity in the mouse genome. We also find a novel 2.4-kb non-protein-coding RNA polymerase II transcript located at the 5' end of the crossover domain in the 17.2-kb hot spot locus.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Sequence of primers used in the present study
|
Chromosome localization by mapping with radiation hybrid panels. DNA samples from 96 mouse-hamster cell lines from the T31 radiation hybrid panel were purchased from Research Genetics. Approximately 50 ng of DNA from each of these samples was used for PCR analysis. We sequenced the 1.3-kb fragment from the hamster genome, and due to the conserved nature of the mouse and hamster 1.3-kb sequences, we used primers 5 and 6, which amplify an 800-bp product specifically from the mouse genome, from a region adjacent to the mouse 1.3-kb position in the 17.2-kb fragment. As an internal control we used primers 7 and 8, which amplify a 260-bp product from both the mouse and hamster 1.3-kb locus. Each of the 96 DNA samples from the radiation hybrid panel was scored for the presence of the 800-bp product specific to the mouse 17.2-kb fragment, as well as for a 260-bp product that amplifies from both the mouse and hamster 1.3-kb sequences. The Jackson Laboratory RH database server was used for interpretation of the data from which the chromosomal location of the 17.2-kb locus was identified.
Northern blotting, RT-PCR, and primer extension.
Poly(A)+ RNA was isolated from mouse liver and testis by standard methods and electrophoretically separated (2 µg/lane) on a 1% agarose gel containing 2.2 M formaldehyde. After transferring to nylon membranes, the blot was probed with each of the 32P-labeled contigs. For multiple-tissue Northern analysis, the mouse MessageMap blot (Stratagene) containing electrophoretically separated poly(A)+ RNA was probed with the 900-bp reverse transcription (RT)-PCR product amplified with primers 9 and 10. ß-Actin cDNA was used as an internal control. RT-PCR to amplify the 2.4-kb transcript was performed using total RNA prepared from mouse testis, which was reverse transcribed into cDNA using avian myeloblastosis virus reverse transcriptase (Invitrogen) with the antisense primer 12 at 52°C, followed by PCR amplification with primers 11 and 12 by standard techniques. The transcription start site for the 2.4-kb transcript was determined by primer extension on mouse testis total RNA by using primer 13 end-labeled with
-32P. The extension products were run on a 6% polyacrylamide gel containing 8.3 M urea and were autoradiographed.
Haplotype diversity analysis.
Genomic DNA for the M. musculus species was obtained from Jackson Laboratories for all of the 33 strains except for Swiss, BALB/c, CD1, C3H, C57, and FVB/n. For these laboratory strains, DNA was isolated from the tails of mice by standard procedures. For strain 129/SvEvTacfBR, the genomic library in
FIX II vector was available. For haplotype analysis, a 1-kb region from 13.3 to 14.3 kb was PCR amplified using the primer pair 18 and 19. The PCR products were gel eluted and sequenced on an ABI 377 sequencer. The sequences from all of the 40 strains were aligned using the CLUSTALW program. The DnaSP version 3.5 software (41) was used for the analysis of polymorphism and LD between parsimony informative sites (sites where the low-frequency variant is present at least two times) as well as to perform the Tajima's D test (50) on the unphased diploid genotype data. The Tajima's D test is used for testing the hypothesis that all mutations or polymorphisms seen in a DNA sequence are selectively neutral. Pairwise LD values were plotted against the physical map using MATLAB software. Recombination events were also identified by the four gamete test (17). Coalescent simulations to simulate the history of our sample for different values of 4Nr were performed using DnaSP. The GENECONV program (44) was used to determine gene conversion tracts (http://www.math.wustl.edu/
sawyer). This program looks for aligned segments (after removal of monomorphic sites) for which a pair of sequences are sufficiently similar to be suggestive of past gene conversion events. The similarity is judged based on an unusually long pairwise identity or an unusually high score (matches count as +1 with a penalty for mismatches) for that pair of sequences.
Association of the 17.2-kb locus and mouse recombination hot spots with the nuclear matrix. Matrix and loop DNA were prepared from mouse total testis. The testis were decapsulated and treated with 0.04% collagenase in 1x phosphate-buffered saline containing 0.2% bovine serum albumin and 0.1% sucrose at 30°C for 20 min to make a cell suspension which was pelleted at 500 x g at 4°C and washed thrice in 1x phosphate-buffered saline. Isolation of nuclei and subsequent preparation of MAR and loop DNA were done according to the low-salt lithium diiodosalicylate (LIS) method of Mirkovitch et al. (31). Briefly, the nuclear pellet was obtained after cell lysis with 20 mM KCl and 0.1% digitonin, and the integrity of the nuclei was confirmed by DAPI (4',6'-diamidino-2-phenylindole) staining. Nuclei were incubated at 37°C for 20 min, followed by the addition of extraction buffer containing 25 mM LIS. The resulting nucleoid structure was pelleted at 2,400 x g for 20 min at room temperature and washed with digestion buffer to remove LIS. Restriction enzyme digestion was performed using EcoRI and NcoI, each at 500 U/ml in combination at 37°C for 6 h. The S/MAR fraction was pelleted from the digested loop DNA by centrifugation at 2,400 x g for 10 min at 4°C, and both fractions were purified using standard DNA extraction techniques. The authenticity of the S/MAR pellet was also checked by analyzing the acid (0.25 N HCl)-extracted proteins on a sodium dodecyl sulfate-12% polyacrylamide gel electrophoresis. Total genomic DNA was isolated from the nuclei by standard procedures. Total input DNA was estimated by solubilizing the nuclei in 2 M NaCl and 5 M urea and measuring the absorbance at 260 nm. Accurate estimates of the MAR, loop, and total DNA concentrations for PCR were obtained using a spectrofluorimeter with the dye Hoechst 33258 (26). In addition to analyzing the partitioning of different regions of the 17.2-kb locus between the matrix and the loop, we have also analyzed the predicted MAR regions of the mouse recombination hot spots. For this purpose, primers 20 and 21 (for Ea), 22 and 23 (for Psmb9), 24 and 25 (for Pb), and 26 and 27 were designed based on published sequence data for amplifying the mapped crossover segments for the Ea, Psmb9, and Pb hot spots and the experimentally known immunoglobulin (Ig) kappa light chain MAR sequence as a positive control (6). The domain C (13,500 to 14,000 bp; also see Fig. 10) corresponding to the crossover region of the 17.2-kb fragment was amplified using primers 28 and 29. A locus upstream and adjacent to the transcribed noncoding RNA which is flanked by EcoRI and NcoI sites with a predicted low MAR potential (domain E, 360 to 800 bp; also see Fig. 10) was amplified using primers 30 and 31 and used as a negative control. PCR was performed on 150 ng of MAR, loop, and total DNA for 25 cycles, where the amplification was seen to be in the linear range. Subsequently, aliquots of the PCR products were electrophoresed on a 1% agarose gel, stained with ethidium bromide, and quantified densitometrically in a Bio-Rad gel documentation system. The enrichment factor of a particular sequence in the MAR fraction was calculated as the band intensity in the matrix divided by the sum of intensities of matrix plus loop fractions (9). Equal efficiency of PCR amplification in the MAR, loop, and total genomic DNA was judged by the ratio between the sum of intensities of MAR plus loop fractions and band intensity of the PCR product with total genomic DNA as an internal control.
![]() View larger version (15K): [in a new window] |
FIG. 10. S/MAR profile of the Ea, Psmb9, Pb, and the 17.2-kb mouse recombination hot spots. (A) MAR profile of the 17.2-kb fragment. The crossover mapped to domain C is from 13,500 to 14,000 bp. (B) Ea hot spot (2.2 kb), accession number K00971. Crossovers mapped to the fourth intron from 1,163 to 1,780 nucleotides. (C) Pb hot spot (15 kb), accession number AF100956. The crossovers are mapped to a 2.4-kb interval (from 4.6 to 7 kb). (D) Psmb9 hot spot (7.2 kb), accession number D43620. The crossovers are mapped to the 2 kb from the poly(A) signal (5,166 bp) of the Psmb9 gene. Arrows indicate regions which can function as S/MARs in all four panels.
|
|
|
|---|
FIX II vector. This analysis yielded a single positive signal having an insert of 17.2 kb (Fig. 1A), which was completely sequenced after subcloning as four smaller contigs of 3.4, 4, 4.5, and 5.3 kb (Fig. 1B and C). The mouse 1.3-kb sequence was present from 4,560 to 5,800 bp in the 4-kb contig (Fig. 1C), and thus we had around 5 and 11 kb of additional sequence information on either side of the mouse 1.3-kb fragment to study the sequence context in which it functions as a meiotic repair site. The 17.2-kb fragment was mapped to mouse chromosome 8 C-D by PCR analysis of the T31 mouse-hamster radiation hybrid panel, which included 96 independent hybrid cell lines. A representative gel pattern of the PCR products is shown in Fig. 1D. Out of the 96 hybrid cell lines, 29 were positive for the presence of the 800-bp mouse-specific product, while all of them yielded the 260-bp product which is common for both mouse and hamster genomes. Analysis of these data by the Jackson Laboratory RH database server gave a logarithm of odds score of 13.2 and 11.1 with respect to the microsatellite markers D8Mit347 and D8Mit79, both of which are located on chromosome 8 at 38.7 and 40 cM, respectively, corresponding to a cytological position of 8 C-D (Fig. 1E).
![]() View larger version (45K): [in a new window] |
FIG. 1. Cloning and chromosomal localization of a 17.2-kb mouse genomic fragment flanking the mouse homolog of the 1.3-kb meiotic repair site. (A) Release of the 17.2-kb insert with NotI digestion from the mouse genomic clone and subsequent Southern hybridization with the 564-bp probe (118 to 682 bp within the 1.3-kb repair fragment). M, /HindIII marker. (B) Generation of four smaller contigs following a BamHI and NotI double digestion. (C) Position of each of the four contigs and the location of the 1.3-kb fragment (=), the 260-bp fragment ( ), and the 800-bp fragment ( ) used for screening the mouse-hamster radiation hybrid panel. (D) Representative gel of the PCR analysis of the mouse-hamster radiation hybrid panel. Lanes 6 and 8 show amplification of the 800-bp product while all lanes show amplification of the control 260-bp product. M, /HindIII marker. The highest two logarithm of odds (L.O.D.) score values of the mouse 800-bp fragment with known mouse markers are also shown. (E) Cytogenetic map of mouse chromosome 8.
|
G value, is at least 25 kcal/mol less than the average value for the whole sequence (95.9 kcal/mol). DUEs are known to be associated with eukaryotic replication origins, as its intrinsic helical instability facilitates replication initiation (8). DUEs are also a feature of S/MARs due to their property of stress-induced duplex destabilization, which is essential for S/MAR function (3). The presence of sequence features corresponding to the transcribed regions analyzed using the NIX suite of programs resulted in the prediction of several exons, two complete gene models, several promoter elements, and poly(A) sites, as shown in Fig. 2. Interestingly, no CpG islands were detected with either the NIX program or the CpG island searcher program. Two matches to the existing Swissprot and EST databases were also seen. The entire 17.2-kb sequence of the mouse locus was not found in the mouse genome database (http://www2.igh.cnrs.fr/Mouse-Genome_DBS.html); however, complete matches to the high-throughput genomic unfinished mouse draft sequence were seen for many regions of the 17.2-kb fragment.
![]() View larger version (19K): [in a new window] |
FIG. 2. Sequence features of the mouse 17.2-kb fragment. Non-LTRs include long interspersed nuclear elements, short interspersed nuclear elements (including the Alu and MIR [mammalian interspersed repeats]), and microsatellite repeats. In addition to individual exons, two complete gene models have been predicted, one on the top strand with three exons (1,643 to 1,673 bp, 5,668 to 5,767 bp, and 10,922 to 11,045 bp) and one on the complementary strand with two exons (14,354 to 14,552 bp and 9,562 to 9,582 bp). Predicted poly(A) and TATA box promoter sites are indicated, as well as matches to existing EST and Swissprot databases. The position of the recombination hot spot deduced from LD analysis is also shown.
|
![]() View larger version (19K): [in a new window] |
FIG. 3. Output of the THERMODYN program. Regions of the 17.2-kb fragment which can function as DUEs are indicated by arrows.
|
![]() View larger version (48K): [in a new window] |
FIG. 4. Transcriptional analysis of the mouse 17.2-kb fragment. (A) Northern analysis carried out on 2 µg of poly(A)+ RNA isolated from mouse liver (L) and testis (T) with each of the four contigs and GAPDH (glyceraldehyde-3-phosphate dehydrogenase) as probes. (B) Amplification of the 2.4-kb RT-PCR product amplified from mouse testis (T) total RNA. M, /HindIII marker. (C) Primer extension carried out on mouse testis poly(A)+ RNA (P) using primer M. M, pUC 19/MspI digest. The primer position is below the 34-bp marker. (D) Northern analysis carried out on a mouse tissue blot having 2 µg of poly(A)+ RNA from brain (B), heart (H), kidney (K), liver (L), lung (Lu), skeletal muscle (M), spleen (S), and testis (T). A 900-bp fragment (from 1.5 to 2.4 kb) from the 3.8-kb contig and ß-actin cDNA were used as probes.
|
![]() View larger version (81K): [in a new window] |
FIG. 5. The 2.4-kb transcript is a non-protein-coding RNA. (A) Sequence of the 3.4-kb contig which contains the transcribed region with the GC box (610 bp), the CAAT box (655 bp), the TATA box (780 bp), and the poly(A) signal sequence (3,170 bp) in bold. The 2.4-kb transcribed region is shown by two arrows from 810 to 3,210 bp. The dotted arrow shows the position of the 3' primer M used for primer extension. The open arrows show the 900-bp fragment used as the probe in Northern analysis. (B) Position of predicted ORFs in the transcript in all of the six reading frames. (C) Predicted secondary structure of the transcribed RNA.
|
|
View this table: [in a new window] |
TABLE 2. Summary of the polymorphisms seen among the 40 strains of mice in the 1-kb sequenced region
|
![]() View larger version (53K): [in a new window] |
FIG. 6. Haplotype diversity analysis of the 17.2-kb mouse fragment. The 31 parsimony informative polymorphic sites (with two variants) marked from P1 to P31 are shown for the 40 strains of mice. Their corresponding nucleotide positions with respect to the 17.2-kb fragment are indicated on top.
|
![]() View larger version (79K): [in a new window] |
FIG. 7. Pairwise LD values for statistically significant combinations of the parsimony informative polymorphic sites along with nucleotide distance between the sites. The lower triangular panel shows LD values while the upper triangular panel shows intermarker nucleotide distances. Open squares indicate pairwise combinations that were not statistically significant. The label 1 indicates 0.01 < P < 0.05, 2 denotes 0.001 < P < 0.01, and 3 denotes P < 0.001 (chi-square test).
|
![]() View larger version (15K): [in a new window] |
FIG. 8. Pairwise LD values as a function of the physical map and four gamete test in the 1-kb hot spot locus. (A) D' measure of LD for all statistically significant pairwise combinations of the parsimony informative polymorphic sites as a function of the physical map from 13,300 to 14,300 bp. (B) Numbers of segregating sites showing presence of all four haplotypes.
|
|
View this table: [in a new window] |
TABLE 3. Gene conversion tracts
|
= 4Nµ) and the recombination parameter (R = 4Nr). The value of the mutation parameter (
) has been estimated to be 19.2/gene or 0.02145/bp (total number of sites, 866) from the number of segregating sites. Estimating R is a little more difficult since it is not directly observable in the sample. However, an observable quantity that is related to R is RM, and one can estimate R by using the coalescent approach to simulate the history of the sample for different values of 4Nr that give the observed RM of 2 (17). Such simulations performed 1,000 times for different values of 4Nr (from 0.5 to 6 in increments of 0.5) gave an R value of 0.00257/bp (based on an average nucleotide distance between the most distant sites of 972.33) (Fig. 9 A and B) for this locus. The error bounds on this estimate are wide, which is a known drawback of this method (16, 17). We have therefore also estimated the recombination parameter (R = 4Nr) based on the variance of the average number of nucleotide differences between pairs of sequences (Sk2 in equations 1 and 4 in reference 16). This gave a similar value of 2.50 per locus (1,000 bp) or 0.0026/bp (average nucleotide distance between the most distant sites being 972.33). Given the large data set and the R/
value of 0.1, which is much less than 4 (17), this would probably represent a reliable estimate of R for this locus. We could not get estimates of the effective population size (N) for the mouse population from the literature, and estimating it from this locus (using
= 4Nµ) would have a significant upper bias because of the contribution of recombination to the nucleotide diversity at this locus. If one were to assume a value similar to the human population of 104 (most eukaryotic populations have N values from 103 to 104) and the R value of 0.0026, we would get an r value of 6.5 x 107, which is 10-fold higher than the average value of 0.6 x 108 (4). With an N of 103, we would get an r value of 6.5 x 107, which is 100-fold higher than the average value of the mouse recombination rate. Hence, our estimate of the recombination intensity value for this locus is around 10- to 100-fold higher than the genome average.
![]() View larger version (11K): [in a new window] |
FIG. 9. Determination of the recombination parameter (R) and recombination intensity for the 1-kb hot spot. (A) Estimation of R from RM by coalescent simulations of the history of the sample for different values of 4Nr. For each value of 4Nr, the RM value obtained is shown. (B) Graphical representation of the expected number of obligate recombination events (RM) as a function of the population recombination parameter 4Nr. Each point is an average of over 1,000 simulated replicates of the data at 95% confidence interval (CI). Vertical bars show the values at lower and upper limits of the confidence interval.
|
![]() View larger version (41K): [in a new window] |
FIG. 11. Experimental detection of predicted MAR regions in matrix DNA preparations from mouse testis. Panels A to F show amplification of PCR fragments corresponding to the predicted MAR regions in the loop (L), matrix (M), and total genomic (T) DNA for (A) 17.2-kb locus domain C, (B) 17.2-kb locus domain E, (C) the Ea hot spot, (D) the Pb hot spot, (E) the Psmb9 hot spot, and (F) Ig kappa MAR. The amount of PCR product formed as a function of the number of PCR cycles (20 to 28), by using 150 ng of mouse testis genomic DNA as template, is also shown for each of the panels.
|
|
View this table: [in a new window] |
TABLE 4. MAR characteristics of DNA fragments tested by PCR-based assay
|
|
|
|---|
|
View larger version (7K): [in a new window] |
FIG. 12. Summary of the important domains in the 17.2-kb fragment. The solid box shows the position of the 1-kb region sequenced from all seven strains of mice of the species M. musculus. A larger region from 9.5 to 14.5 kb, within which most of the sequence variation is localized, is shown by a checkered box. A, B, and C indicate the positions of the transcribed region (arrow shows orientation of the transcript), the 1.3-kb meiotic repair site, and the region showing a recombination breakpoint, respectively. D indicates the experimentally determined S/MAR domain flanked by EcoRI and NcoI sites. E indicates the loop-associated region upstream and adjacent to the noncoding RNA.
|
Very little is known about gene conversion in mammals, despite being an important mechanism in breaking down allelic associations. There are few current experimental estimates of conversion tract lengths in organisms other than yeast. In Drosophila spp., for example, estimates are almost exclusively restricted to the rosy locus, where it was estimated to be around 352 bp by crossing strains with very close markers (15). Studies on the human recombination hot spot DNA2 by sperm PCR have provided evidence for a localized width of recombination hot spots of 1 to 2 kb with crossover asymmetry accompanied by biased gene conversion (21), and similar studies by sperm PCR for the mouse Psmb9 hot spot have shown conversion tracts ranging from 315 to 1,018 bp, with an average length of 480 bp (14). We did not observe tract lengths longer than 627 bp, most likely because recombination would quickly reshuffle the polymorphisms, except for those that are very close together. This is particularly true while inferring conversion tract lengths from population-based DNA sequence data, and previous studies based on DNA sequence data have shown short tract lengths of 10 and 16 bp for the Est-5ß and Est-5 C genes (23) and an average length of 51 bp for the rp49 locus (40) in Drosophila spp., because of conversion events getting broken up by subsequent recombination events. Hence, calculations of conversion tract length from population-based sequence data are likely to be underestimates for regions with a high density of crossover or conversion events. Interestingly, two HVMS sequences positioned at 13,466 and 13,576 bp and an LTR sequence at 13,748 bp are in close proximity to the location of the conversion tracts in the 1-kb hot spot locus between 13,595 and 13,971 bp. HVMS and LTR sequences are known to be potentiators of meiotic recombination in in vitro assays (11, 46, 54).
A comparison of this recombination hot spot with the cluster of four recombination hot spots in the mouse MHC complex reveals several parallel features. All of the four recombination hot spots in the MHC region contain an LTR element, a middle repetitive element of the mouse transcript family, and tandem repeats of a tetrameric sequence resembling HVMS. The concentration of the crossover points is either in the 3' end of the genes as seen in the Psmb9 hot spot or in the intron of the genes as seen in the Eb, Ea, and Pb hot spots, with none of the breakpoints located at the 5' region of the genes as in S. cerevisiae. Our localization of crossover activity follows a similar paradigm, with the crossover regions being present close to LTR and HVMS sequences. Further, they lie at the 3' end of the transcribed region. Unlike the Pb and Psmb9 hot spots, where transcriptional activity is not seen in meiotic cells (18), we have seen that the 2.4-kb noncoding RNA is present in the testis and flanks the MAR domain in the 17.2-kb fragment.
The noncoding RNAs have been implicated in several functions such as transcriptional regulation, chromosomal replication, and protein translocation (10, 48). The mouse genome has also been reported to have an abundance of noncoding RNAs, most of which are unspliced, single-exon RNA polymerase II-mediated transcripts (52). At this stage we do not know what function this RNA serves, but the presence of transcriptional activity close to a recombination hot spot region is in agreement with a role of open chromatin domains in recombinational activity and is suggestive of an
hot spot. There is also a striking correlation between the presence of DUEs at 11.2 kb, a S/MAR domain between 10.4 and 15.8 kb, and recombinational activity nearby. The presence of such an intrinsically open chromatin structure which would be prone to double-strand breaks shows that it can also function as a ß hot spot. The recombination hot spot in the 17.2-kb locus thus has properties of both
and ß hot spots. However, the moderate percentage (38.57%) of G+C content rules out a
hot spot-like nature. While yeast and mammalian recombination hot spots exhibit a global relationship of GC-rich domains and elevated recombination rates (36), the lack of such correlation seen for the 17.2-kb locus is probably due to a high resolution analysis.
Another important new feature of mouse recombination hot spots that has emerged from the present study is the association seen between S/MAR elements and the crossover sites. Both in silico analysis and experimental studies using LIS-extracted matrix DNA have provided evidence for such an in vivo association. LTR elements are also known to integrate preferentially at S/MARs and are the most abundant class of repeats in the 17.2-kb fragment. Thus, the crossover site which has been fine mapped to between 13.3 and 14.3 kb is contained in a S/MAR domain which is characterized by the ease of DNA unwinding, which could be required for different aspects of the meiotic pairing process. The association of meiotic activity with S/MAR elements was also found to be true for the Ea, Pb, and Psmb9 mouse MHC hot spots. These observations suggest that a subset of S/MARs associated with the meiotic chromosome cores are converted into crossover regions, for at least some recombination hot spots, which is a distinct possibility considering that they are capable of undergoing stress-induced duplex destabilization. In conclusion, our studies define a new recombination hot spot in proximity to a novel noncoding RNA transcript. These studies also provide the first estimate for the R value for any hot spot locus in the mouse and also the probable length of gene conversion tracts for a hot spot outside the MHC cluster.
Supplemental material for this article may be found at http://mcb.asm.org. ![]()
|
|
|---|
vectors, p. 6.4-6.11. In Molecular cloning, vol 1. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»