Previous Article | Next Article ![]()
Molecular and Cellular Biology, December 2005, p. 10628-10638, Vol. 25, No. 23
0270-7306/05/$08.00+0 doi:10.1128/MCB.25.23.10628-10638.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
and
Emilio Fernández*
Departamento de Bioquímica y Biología Molecular, Edificio Severo Ochoa Planta Baja, Facultad de Ciencias, Campus de Rabanales, Universidad de Córdoba, 14071 Córdoba, Spain
Received 26 January 2005/ Returned for modification 23 June 2005/ Accepted 23 July 2005
|
|
|---|
|
|
|---|
Class I of mobile elements includes long terminal repeat (LTR) retrotransposons whose genome has structural features very similar to those of retroviruses. They are bounded by two long terminal repeats and possess a coding region that is normally divided in a single gag-pol or two separate gag and pol open reading frames (ORFs) that determine several mature proteins. The former, named gag (specific antigen group), encodes an RNA binding protein (Gag) and other proteins involved in maturation and packaging of retrotransposons, and it usually presents zinc finger CCHC motifs similar to those described in eukaryotic transcription factors. The latter, named pol (polymerase), encodes several proteins, such as protease, integrase, reverse transcriptase, and RNase H. The retrotranscriptase and RNase H activities are required for cDNA synthesis and the integrase activity is required for the integration of the cDNA into a new chromosomal location (3). The pol-encoded protease is necessary for cleavage of the polyproteins into functional peptides.
Two families of LTR elements (Ty1-copia and Ty3-gypsy) have been differentiated in terms of structure and known protein sequences (mainly reverse transcriptase). Both of them are ancient and ubiquitous constituents of eukaryotic genomes. The main difference among them concerns the relative position of the integrase domain with respect to the ORF homologous to the pol gene (28). Some members of the Ty3-gypsy group from invertebrates have recently been suggested to be endogenous retroviruses, since they show a third additional ORF related to the retroviral env gene (5, 27) and at least one of those elements, mdg-4 of Drosophila melanogaster, is infectious (17).
Retrotransposons are particularly abundant in plants and account for about 50 to 80% of repetitive DNA in maize and about 90% in wheat (2, 20). Genome size variations correlate with both the total mass of retrotransposons and the number of different retrotransposons families, like the 13,000 copies of the del family in Lilium speciosum (22) and the 50,000 copies of the Bare-1 element in barley (26). Both LTR and non-LTR retrotransposons account for 70 to 85% of the nuclear genome in maize (42). In contrast to these plants, the Arabidopsis nuclear genome contains few repeated sequences, and therefore it is likely that the content of retrotransposons is about 4 to 10% of total DNA, although many types of both LTR and non-LTR retrotransposons have been reported and appear concentrated on centromeric regions (20).
It is estimated that 30% of the nuclear genome of Chlamydomonas spp. contains repetitive DNA sequences and examples of each of the two main classes of mobile elements (11). In this eukaryotic alga, a TOC1 element classified as an LTR retrotransposon contains an unusual rearrangement of the long terminal repeats (7, 8), and a DNA fragment named CRRE1 encodes a protein (21) with homology to the rvt sequence.
One of the exciting topics in retrotransposons biology concerns the control of activation of its transcription and retrotransposition. Most plant retrotransposons are inactive during normal growth and development but are transcriptionally activated by biotic and abiotic stress factors (10). In addition, the activation of retrotransposons by introduction of foreign DNA is a phenomenon observed in hybrids of marsupials (32), rice (24), and Arabidopsis thaliana (13). Moreover, the retrotransposon activation by DNA introduction is followed by rapid DNA methylation and repression (24). In Drosophila, where DNA methylation does not exist, retrotransposons are located mainly in regions of the heterochromatin (36) and gene silencing, including that of mobile endogenous elements, occurs by a group of polycomb proteins (45).
In this work, we have identified a new LTR retrotransposon in Chlamydomonas reinhardtii named REM1, phylogenetically related to the gypsy family elements, that possesses unique structural and expression features not still described in mobile elements of this group. Foreign DNA integration and/or genetic crosses activated retrotransposition and transcription of REM1. However, integration of CrREM1 was restricted to those events even though the expression of its primary transcript remained highly activated. A regulation of retrotransposition mostly dependent on the DNA integration step is proposed.
|
|
|---|
|
View this table: [in a new window] |
TABLE 1. Chlamydomonas strains useda
|
Screening procedures, DNA and RNA isolation, and hybridization analysis.
The
EMBL4 genomic DNA library (Carolyn F. Silflow, University of Minnesota) was screened according to previously reported methods (38). For Southern blot experiments, chromosomal DNA was extracted after lysis of the cells with 2% sodium dodecyl sulfate (SDS) in 100 mM Tris-HCl (pH 8.0), 400 mM NaCl, and 50 mM EDTA. Following phenol extraction, DNA was precipitated with 2.5 volumes of 95% ethanol, washed in 70% ethanol, and resuspended in 10 mM Tris-HCl, pH 8.0. Upon digestion, 2 µg of DNA per lane was loaded on 0.8% agarose gels. Total RNA was isolated as described by Schloss et al. (43).
Northern blots were performed by loading and fractionation of 15 to 20 µg RNA per sample on 1.2% agarose gels containing 17.5% formaldehyde and transferred onto nylon membranes (Hybond-N+; Amersham) (41). After blotting onto the above-mentioned nylon membranes the fixation was carried out at 80°C for 2 h. Probes used were radioactively labeled with [
-32P]dCTP by the random primer method. The membranes were prehybridized and hybridized in 50% formamide solutions at 42°C (41, 43) and washes were performed at 65°C, with 0.2x SSC (1x SSC is 0.15 M NaCl and 0.015 M sodium citrate) and 0.2% (wt/vol) SDS solution. Probes used were Gag-1, a 650-bp PvuII-EcoRV fragment of the gag gene; Gag-2, a 719-bp SmaI/SmaI fragment corresponding to ORF1 of CrREM1; GP-CrREM1, a 1,753-bp fragment isolated by PvuII/EcoRV digestion from pN1EV plasmid corresponding to the 3' end of ORF1 and 5' of ORF2; reverse transcriptase, a 1,433-bp SalI/SalI fragment from pB3 in the reverse transcriptase coding region of CrREM1; and Nii1, a 642-bp SalI/SalI fragment of the Nii1 gene (39). The intensity of hybridization signals was analyzed with Data Processing Program Quantity-One.
DNA sequencing. Plasmid isolations were obtained in prepacked purification columns (QIAGEN Tip-20; QIAGEN). DNA sequencing was performed with fluorescence-labeled terminators for an automated sequencer (model ABI Prism 373XL; Applied Biosystems) in the Central Service for Research Support of the University of Córdoba. Sequences were normalized using the DNAStar for Windows and MAC software (Lasergene Navigator) and Chromas program and subsequently analyzed as indicated below in "Bioinformatic tools for analysis."
Bioinformatics tools for analysis. Sequence search homologies were conducted using the BLASTN (nucleotide versus nucleotide) and TBLASTX (nucleotide versus trans nucleotide) program available at DDBJ (http://www.ddbj.nig.ac.jp) and ChlamyDB (http://genome.jgi-psf.org/chlre1/chlre1.home.html; http://bahama.jgi-psf.org/prod/bin/chlamy/home.chlamy.cgi). Structural analysis and protein domains described in this article were identified using RPsBLAST or CDD from NCBI (http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) and other programs whose links are in the database http://molbiol-tools.ca. Multiple alignments of protein sequences were created by BioEdit Sequence Alignment Editor (http://www.mbio.ncsu.edu/BioEdit/bioedit.html). Phylogenetic analysis was conducted using the MegAlign program within the DNAStar software.
Nucleotide sequence accession numbers. Sequence data in this article have been deposited in the EMBL/GenBank data libraries under accession numbers as follows: for nucleotide sequences N1, AY227353, and CrREM1, AY227352; and for translation products of pB3 and ORF1p, AA073550; ORF2p, AA073551; and ORF3p, AA073552.
|
|
|---|
![]() View larger version (17K): [in a new window] |
FIG. 1. Structure of pN1EV and pB3 clones. CrREM1 is located between positions 1819 and 7627 of pB3 (shadowed box). Black boxes indicate direct repeats of target site integration, striped boxes are 5' and 3' LTRs and the gray box is the central coding region of CrREM1. This figure shows the three ORFs identified with ORF-Finder of NCBI and the domains that contain each protein. GAG, Retro-gag (pfam03732); Zn, zinc finger type C2H2 (smart00355); AP, aspartic protease (pfam00077); RTase, reverse transcriptase (pfam00078); PHD, PHD-finger domain (pfam00628, smart00249); CHD: chromatin organization modifier domain (smart00298). Ap, ApaI; E, EcoRI; EV, EcoRV; H, HindIII; Pv, PvuII; Sl, SalI; Sm, SmaI; X, XhoI. Plasmid pMN24 is not drawn to scale. Primers deduced from the NR promoter and used in inverse PCR: NDO5 (A) and NUP5 (B).
|
EMBL4 genomic DNA library of the wild-type strain 21gr was screened with the XhoI-ApaI probe. Among 50,000 phages screened, six overlapping clones were recovered and named
A2,
A4,
A6,
B3,
B5, and
B8. Subsequently, the clones were analyzed by EcoRI digestions that released their genomic DNA inserts from the arms of phage
EMBL4 (Fig. 2A), and hybridized with the probe used in the screening of the genomic DNA library to study the fragments which contained the XhoI-ApaI sequence (Fig. 2B). These fragments were cloned in pBKSII-, previously linearized and dephosphorylated. The physical maps of each clone were deduced from band patterns of double and simple digestions with restriction enzymes and the interesting fragments were subcloned.
![]() View larger version (42K): [in a new window] |
FIG. 2. Analysis of six phage-positive clones from strain 21gr of C. reinhardtii. (A) Restriction map from phage clones with endonuclease EcoRI. The black bars correspond to fragments hybridizing to the probe used in the screening of the library. The restriction sites of EcoRI at the ends come from the lambda cloning vector (dashed lines). (B) The 1.2-kb XhoI-ApaI fragment from pN1EV was labeled with digoxigenin-dUTP and used as a probe in the Southern blot analysis of phages. Six hybridization signals corresponding to a fragment of 6 kb in clones B3 and B5 and between 4.5 and 3.5 kb in A2, A4, A6, and B8 are shown.
|
B3). A BLAST database search did not allow us to find significant identity with any known gene or protein. Because of this fact, an adjacent fragment from phage
B3 was also subcloned and sequenced to expand the region affected by the insertion to 7912 bp whose restriction map is shown in the Fig. 1. A new BLAST search revealed the presence of two long direct terminal repeats that would be flanking a long retrotransposon-like sequence. The translation product of pB3 in the databases of GenBank/EMBL/DDBJ/PDB at the NCBI produced a significant degree of similarity with gene products of some LTR retrotransposons of the gypsy type, especially at region 6090 to 6763 of clone pB3 that contains a reverse transcriptase domain. A comparison with the Chlamydomonas genome database (http://genome.jgi-psf.org/chlre1/chlre1.home.html) revealed that this sequence had not yet been reported and corresponds to a new gypsy LTR retrotransposon from C. reinhardtii that we have named REM1.
When the sequence of pN1EV was compared to that of pB3 (Fig. 1) it was found that plasmid pMN24 had inserted at a central region of CrREM1 causing a deletion of 1,590 bp at the 5' end and still maintaining only 52 bp of the 5' LTR (Fig. 1). We also sequenced genomic regions adjacent at both the 5' and 3' LTRs and internal regions of the retrotransposon from
A2,
A4,
A6, and
B8. It was found that the sequences obtained were identical to those in pB3. Thus, these DNAs had the same structural organization and correspond to the same genomic localization, suggesting that strain 21gr bears a single copy of REM1.
Structural characteristics of CrREM1. In general, CrREM1 showed most of the structural characteristics of LTR retrotransposons with some important differences. The analysis of the sequence showed that CrREM1 is a mobile element of 5,812 bp delimited by two LTRs of 286 bp. Sequences at the 5' LTR and 3' LTR of CrREM1 are 100% identical and the LTR tips have the retroviral consensus inverted repeat terminal sequences 5'TG... CA3' that determine the beginning and end of the retrotransposon. CrREM1 is also flanked by the 5-bp direct repeat sequence ATGCT that seems to correspond to the target integration site that duplicated during retrotransposition. The signals controlling the start and end of the transcription of the retrotransposons are found in its LTRs excepting the position of the primer binding site that could not be identified unambiguously in CrREM1.
Three ORFs were also identified (Fig. 1). Coding regions of CrREM1 were confirmed by reverse transcription-PCR with primers designed to anneal to predicted exons by computer analysis (results not shown). The hypothetical protein products in ORF1 and ORF2 of 556 and 776 amino acids, respectively, exhibit significant identity with gene products of known retroelements, especially with plant, fungal, and Drosophila gypsy retrotransposons and would correspond with the genes gag and pol, respectively. ORF1p contains a domain of 96 amino acids (326 to 421) named Retro-gag or the WCCH domain. There is a central motif, QGXXEXXXXXFXXLXXH, common to Gag proteins and a region of 20 amino acids (507 to 526) with the characteristic zinc finger ZnF-C2H2 present in transcription activator/repressor proteins (34). The N-terminal portion of ORF2p includes a folding of 24 amino acids (residues 163 to 186) which show homology with the single domain retroviral aspartic protease (RVP) from retroviruses, retrotransposons, and hepadnaviruses (plant dsDNA viruses). The RVP of CrREM1 shows the highly conserved domain DTGA characteristic of this type of protease that usually form part of a larger polyprotein Pol.
An RVT sequence of 231 amino acids (Fig. 1) was subsequently identified in a final portion of ORF2p (residues 414 to 644), which contains the seven characteristic domains of the retrotranscriptase (46) as well as the conserved sequence YXDDIL that seems to be located at the active site of the reverse transcriptase (44). In the alignment of the Ty3-gypsy type retrotranscriptases from eukaryotic organisms and that from CrREM1 (results not shown), we observed that they share 42% to 49% (using nr, not redundant database) or 34 to 36% (Swissprot database) amino acid identity to reverse transcriptases from photosynthetic eukaryotes.
Analysis of the 3' end of ORF2p failed to identify any RNase H and integrase motifs in CrREM1. Therefore, the domain order did not allow us to confirm that CrREM1 is related to a copia or gypsy retrotransposon. However, an evolutionary study of CrREM1 comparing the amino acid sequence of reverse transcriptases from both groups showed a higher evolutionary distance with retroelements of the group copia than with those of the group gypsy (Fig. 3). Moreover, the translation products of the identified ORFs in CrREM1 present an important similarity with the gene products of gypsy-like LTR retrotransposons, especially with putative polyproteins from C. reinhardtii, Zea mays, and Oryza sativa (Fig. 3 and data not shown).
![]() View larger version (14K): [in a new window] |
FIG. 3. Phylogenetic tree of LTR retrotransposons based on the reverse transcriptases sequence of the following retroelements: retrotransposons in the gypsy group: Oryza sativa P. polyprotein (BAB68106); C. reinhardtii CRRE1 (AB033239); Streptococcus bicolor gypsy (AAD22153); Zea mays Gag-Pol (AAL59229); Arabidopsis thaliana P. Gypsy/Ty3 (1) (NP_671762); Arabidopsis thaliana P. Gypsy/Ty3 (2) (NP_178767); ORF2p of C. reinhardtii REM1 (AA073551); Arabidopsis thaliana Ty3/gypsy (AAG51046); Saccharomyces cerevisiae Ty3B (CAA97115); Schizosaccharomyces pombe Tf2 (Q05654); Drosophila virilis gypsy (P10401); Drosophila melanogaster 17.6 (P04323); and Arabidopsis thaliana 412 (NP_671762). Retrotransposons in the copia group: Zea mays Sto-4 (T17429); Arabidopsis thaliana Ty1/copia (AG51258); Arabidopsis thaliana Ta1-3 (X13291); Saccharomyces cerevisiae Ty1-H3 (B28097); Saccharomyces cerevisiae Ty2B (P25384); Saccharomyces cerevisiae Ty4B (P47024); copia from Drosophila melanogaster (P04146); and Tnt1 from Neurospora crassa (CAA32025).
|
![]() View larger version (58K): [in a new window] |
FIG. 4. Alignment of chromodomains and PHD module from chromatin modifiers proteins and various gypsy-type LTR retrotransposons. (A) The upper group includes authentic chromodomains of chromatin-modifying proteins polycomb (P26017) and HP1 (P05205) of Drosophila melanogaster and the consensus sequence of chromodomain available at the NCBI conserved domain database. The lower group represents chromodomains of CrREM1 and 3'-end integrase of Ty3-gypsy class of LTR retrotransposons. The accession numbers of retrotransposons which have been translated from their DNA sequence are as follows: ORF3p REM1, AA073552; Reina, U69258 and Pyret, AB062507. The accession numbers of retrotransposons available in the protein databases are as follows: Del-like 1 and 3, AC002534 and Z97342; Cft1, AF051915; Maggy, AAA33420; Skippy, AAA88791; Tf2, T38401; and Skipper, AAC48336. (B) The upper block includes authentic PHD-finger domain from chromatin-modifying proteins: chromo domain helicase-DNA-binding protein 4 (CHD4) of Homo sapiens (Q14839), CHD3 of Drosophila melanogaster (AAF49162), and consensus sequence of PHD available in the NCBI conserved domain database. The lower block contains PHD conserved in ORF3p encoded by REM1 of C. reinhardtii (AA073552), ORF1p encoded by non-LTR retrotransposons (CR_DM of Drosophila melanogaster, [15]), and the retrotransposons Q (T43019) and T1 (M93689) of Anopheles gambiae. The asterisks show conserved cysteine and histidine residues matching the PHD domain. Black boxes, identical amino acids; gray boxes, similar amino acids.
|
Filters containing DNA of each strain digested with endonucleases HindIII and SalI were hybridized with the Gag-1 probe (Fig. 5A). The expected 2.6-kb HindIII band was obtained in all the strains. Nevertheless, two additional bands of 8 and 12 kb were identified in N1 and another of 5.8 kb in N10. In the lanes of the SalI digestion, the expected 2.5-kb band was only observed in the wild-type strain 21gr together with additional bands. Larger fragments were found in all the other strains.
![]() View larger version (41K): [in a new window] |
FIG. 5. Study of the transposition and copy number of CrREM1 in strains of C. reinhardtii. (A) The locations of the probes used and the restriction map of CrREM1 are indicated. (B) DNA (2 µg) from mutants N1 (1), N10 (10), and N24 (24), wild-type 21gr (2), and strain 305CW15 (C) was digested with endonuclease HindIII or SalI and analyzed by Southern blot using the Gag-1 probe. (C) gDNA of the indicated strains (2 µg) was digested with SmaI. The different lanes in each strain correspond to different dilutions of SmaI digestion: A, 1; B, 1/5; C, 1/10. The probes used were Gag-2 and Nii1. These probes were subsequently purified, quantified and added together (each at 106 cpm/ml) to the same hybridization solution. The expected band sizes are indicated with arrows.
|
The CrREM1 copy number in these CSA mutants was compared to that in the parental strain 305CW15. A Southern blot of SmaI digestions of DNA from these strains was hybridized with two probes of similar size: one of the Nii1 gene (a single-copy gene control) and another from ORF1 of CrREM1 (Gag-2) (Fig. 5C). Quantification of radioactive signals indicated that strain 305CW15 possesses one to two copies of CrREM1in the genome and the mutants CSA about 1.5 to 2 times more. Thus, after the insertion of the pMN24 plasmid, these three CSA mutants had increased CrREM1 copy numbers.
Thereby, to know whether DNA introduction caused transposition of CrREM1, we studied its mobilization in different strains that, like N1, N24 and N10, had been transformed with plasmids (PAD35 and pMN24) containing the Nia1 gene (Fig. 6A). A Southern blot of total DNA digestions with SalI was hybridized with the GP-CrREM1 probe (Fig. 6B). As shown in Fig. 6C, an increase in CrREM1 copy number and hybridizing bands was again observed in the three CSA mutants, but also in strains that had undergone a greater number of changes in its genome (genetic crosses and transformation), as deduced from comparing strain P3 with its parental strains E18 and 6145c (Fig. 6C). A54 and E18 do not possess copies of CrREM1 since they have a genetic background different from the other strains analyzed. However, this increase in CrREM1 copy number does not occur in all transformants like Tx-7 that showed identical hybridization signals than its parental strains 305 (Fig. 6C). Moreover, the expected 1,429-bp SalI band is observed in all strains while the 2,500-bp band is only observed in strains 21gr, 305, and Tx-7. These results suggest that in those strains with a higher copy number of CrREM1, the region analyzed with the GP-CrREM1 probe would have undergone methylations, to which SalI is sensitive (GTm5CGAC, GTCGm6AC, and Ghm5UCGAC). Some differences in bands intensity, like those between P3 and P2 (Fig. 6C), might also be due to differences in DNA amounts loaded on the gels.
![]() View larger version (29K): [in a new window] |
FIG. 6. Study of CrREM1 mobilization in strains of C. reinhardtii. (A) Relationships of parental-transformant-segregant strains used in the Southern blot analysis. Plasmids pMN24 and pAD35 contain the structural gene of the NR. (B) Probe GP-CrREM1 (a 1,753-bp fragment) was isolated by PvuII/EcoRV digestion from plasmid pN1EV. (C) DNA (1 µg) from the following strains was digested with restriction endonuclease SalI: 2, 21gr; 6, 6145c; C, 305CW15; 1, N1; 10, N10; 24, N24; A, A54; E, E18; P3, P3; P2, P3-NR-2; P4, P3-NR-4; 3, 305; PD, PAD305-2; and T: Tx-7. DNA hybridization was carried out with the GP-CrREM1 probe. The expected band sizes are indicated with arrows.
|
Figure 7 shows hybridization with the probe Gag-1, where we should have observed two bands of 600 and 329 bp if CrREM1 is not methylated. Both hybridization signals were identified in all the lines although in the MspI-digested samples the intensity of these bands was very weak. Moreover, in MspI lines we observed bands with larger sizes than expected (Fig. 7). These results suggest that there exist copies of CrREM1 whose first C residue at the CCGG target site, within the studied region with the Gag-1 probe, would present the methylation type conferring inhibition of MspI digestion. A different methylation pattern was also observed when this hybridization was performed with the reverse transcriptase probe (Fig. 7). We expected three bands, one of 1371 bp only detected in HpaII digestion, another of 481 bp detected in both digestions but less intense in MspI lines, and a band of 403 bp which was not identified probably because it was not transferred to the nylon membrane. In both digestions again we can observe larger bands, indicating that reverse transcriptase of CrREM1 copies, especially in mutants N1, N10, and N24, are highly methylated and that DNA methylation occurred in different regions of this retroelement.
![]() View larger version (63K): [in a new window] |
FIG. 7. Analysis of methylation at regions of CrREM1. Total DNA (2 µg) from strains 305CW15 (C), N1 (1), N10 (10), and N24 (24) were digested with the endonucleases HpaII, MspI, Sau3AI, and MboI, which vary in their sensitivity to methylation, as detailed in the text. The probes used were Gag-1, RTase, and Nii1.
|
Analysis of CrREM1 expression. The first step to control retrotransposition activity of a retrotransposon is by regulation of its transcriptional initiation (12, 20). Thus, we studied the expression of CrREM1 transcripts from 14 parallel subcultures of the wild-type types and transformant strains analyzed above. Figure 8 shows RNA hybridization using the Gag-CrREM1 probe. We detected the overexpression of a transcript in the CSA mutants N1, N10, and N24 and the strains P3, P3-NR-4 and Tx-7 that would correspond with the expected size of a full primary transcript encoding CrREM1 polyproteins (6 kb). We have observed that this overexpression pattern is stably maintained in many of the above strains during several years of growth. In the lane of mutant P3-NR-2 bands corresponding to smaller transcripts were observed that were less intense in strain 305. No transcripts of the expected size were detected in any of the other analyzed strains. However, after a long overexposure of the filters the band of 6 kb could be observed in most of them (results not shown). The transcripts observed in Northern blot analysis showed discrete bands of a hybridizing RNA smear, presumably because of mRNA degradation. We obtained very similar results by using the reverse transcriptase probe (results not shown).
![]() View larger version (122K): [in a new window] |
FIG. 8. Expression of CrREM1 transcripts in wild-type and mutants strains of C. reinhardtii. Cells were grown in TAP medium containing 8 mM ammonium. Total RNA of the indicated strains (abbreviations are shown in the legend to Fig. 6) was extracted and subsequently analyzed in RNA transfer hybridizations using the specific GP-CrREM1 probe (Fig. 6B).
|
The expression of the putative ORF3 was also studied by using an antisense single strand DNA probe; however, no transcript signal was detected. This result suggests that ORF3 would have a very low expression not easily detectable in Northern blots.
|
|
|---|
The hypothetical protein products of the ORFs of CrREM1 include the domains Retro-gag, zinc finger, protease and reverse transcriptase conserved among autonomously active retrotransposons and retroviruses (3, 29), though typical RNase H and integrase domains could not be identified (Fig. 1). In spite of this fact, CrREM1 is functional since its primary transcript is overexpressed and multiple copies can be integrated. On the basis of these results, sequence comparison and phylogenetic study, we conclude that a new and unusual LTR retrotransposon related to the Ty3-gypsy family elements has been identified in C. reinhardtii.
The distinctive feature of CrREM1 is a particular coding region, named ORF3, which possesses a structural order and translational orientation different to that reported in LTR retrotransposons. This ORF3 was identified between the 5' LTR and the gag gene and its transcriptional orientation is reverse to that of the REM1 polyprotein (Fig. 1). In addition ORF3p presents two specific domains, named PHD-finger and chromodomain; each of these functional domains have been described separately in some mobile elements. The characterization of DNA sequences adjacent to the 5' and 3' LTRs in all the isolated phages showed that we had isolated the same copy of CrREM1, probably the only one, present in the 21gr strain of C. reinhardtii. However, if another copy of REM1 with another gene structure is present in this strain, we would have been unable to detect it and PCR experiments designed to that purpose (not shown) would have failed.
The PHD finger has been identified in the ORF1 proteins encoded by CR1 from fruit flies and T1 and Q non-LTR retrotransposons from the African malaria mosquito (15), and the chromodomain is encoded by the Ty3-gypsy group of retrotransposons at the integrase within the C-terminal end of pol (25) (Fig. 9). In the isolated copy of CrREM1, we could not identify the integrase in the order defined for that family of retroelements. Nevertheless, we could identify a chromodomain associated with a PHD-finger domain at the C terminus of ORF3p (Fig. 9) that could correspond to the integrase from CrREM1. A structural organization of ORFs or enzymatic modules similar to those in CrREM1 has not been described in other retrotransposons. This would imply that the presence of both domains in CrREM1 is highly divergent. Alternatively, CrREM1 could borrow integrase and RNase H proteins from other endogenous retrotransposons for its functionality.
![]() View larger version (28K): [in a new window] |
FIG. 9. Schematic structure of CrREM1, non-LTR and LTR retrotransposon with PHD and chromodomain. The boxes represent the ORFs in which the positions of the different domains are shown. PBS, primer binding sites; PPT, polypurine tract; NA, nucleic acid binding motif; ZF, zinc finger motif type C2H2; EN, endonuclease; CP, capsid-like protein; GAG, retro-Gag domain; PR, protease; RT, retrotranscriptase; RH, RNase H; IN, integrase; 5'-UTR, 5' untranslated region; 3'-UTR, 3' untranslated region; IR, inverted direct repeats; DR, flanking target direct repeat; CHD, chromatin organization modifier domain; and PHD, PHD-finger domain. The transcription senses of the LTR retrotransposon, CrREM1 primary transcript, and ORF3 transcript are shown by arrows.
|
We failed to find a sequence similar to CrREM1 in a search against the C. reinhardtii genome database. We have to consider that about 5% of the C. reinhardtii genome sequence not yet released could include regions of heterochromatin, telomeres, repetitive sequences, etc. CrREM1 copies might be at these chromosomal locations. In addition, the sequencing of the Chlamydomonas reinhardtii genome was carried out from the cw92 strain and we obtained the copies of CrREM1 from a genomic DNA library of the wild-type strain 21gr.
Transposition and transcription of CrREM1. REM1 is overexpressed in some C. reinhardtii strains while in others its transcripts could hardly be detected (Fig. 8), only after overexposing the films. These results seem to indicate that the regulation/silencing mechanism of this retroelement is not active in all strains. On the other hand, it could be observed that REM1 is a distinctively low copy number retrotransposon in the C. reinhardtii strains studied and in some mutant strains, mainly in those with major changes in their genome (transformation and genetic crosses), REM1 has increased its copy number. This low copy number might reflect a highly efficient mechanism for inhibiting its transposition and integration in spite of very active transcription.
These results open several questions: Why and how does CrREM1 copy number increase? A54 and E18 do not have any CrREM1 copies in their genome, whereas wild-type strains 21gr and 6145c probably possess a single copy since they come from different genetic backgrounds. Strain P3, a segregant from the genetic cross between 6145c and E18, presents six to eight copies in its genome. Mutants N1, N10, and N24 also possess a CrREM1 copy number higher than that of their parental strain 305CW15, which was transformed with the pMN24 plasmid to generate these CSA mutants (Fig. 6C). In plants, biotic and abiotic stress factors activate transcriptionally retrotransposons that are inactive during normal growth (10). However, the C. reinhardtii strains showing amplification of REM1 had not been exposed to any of these factors. Therefore, the only common feature among those strains is that all underwent either transformation and/or genetic crossover events.
Activation of retrotransposons by introduction of foreign DNA has also been observed in marsupials (32) and plants (13, 24). In fact, when Arabidopsis thaliana is transformed with tobacco retrotransposon Tto1, transpositional activation of Tto1 and the endogenous retrotransposon Tar17 is observed (13). However, the process of transformation or generation of hybrid lines is not always associated to the mobilization of retrotransposons (35). In agreement with this, we also found that the C. reinhardtii transformant strain Tx-7 did not increase the single REM1 copy present in its parental strain 305. Our results lead us to propose that factors participating in the insertion of foreign DNA during transformation of C. reinhardtii and/or meiotic recombination promote transposition of REM1 in a process that might or not be productive in this copy number increase. However, mitotic events do not appear to favor REM1 transposition since cells of the studied strains have grown vegetatively during a very long time (years) with a stable copy number.
Another question is why CrREM1 is overexpressed in some strains. Since retrotransposons transpose by an RNA intermediate, the inhibition of their transcription would be an effective way to minimize and regulate the possible deleterious effects of retrotransposition in the host. DNA methylation might prevent retrotransposition of transposable elements (13, 32, 48). In fact, retrotransposon activation by DNA introduction in rice is followed by rapid repression and DNA methylation (24). The methylation analysis of CrREM1 regions using Gag-1 and reverse transcriptase probes in strains overexpressing CrREM1 transcript indicate that they contain hypermethylated and demethylated copies of CrREM1 and copies which might have internal deletions. Hirochika et al. (13) have shown that Tto1 of Arabidopsis, initially silenced by hypermethylation, became transcriptional and transpositionally reactivated when introduced in a homozygous ddm1 ("decrease DNA methylation 1") background. Therefore, they suggest that this mechanism is necessary for silencing of repeated genes. Liu and Wendel (24) have also proposed this way of silencing by DNA methylation to regulate the reactivation of the element Tos17.
Methylation of CrREM1 might also serve as a control mechanism of its transposition. However, not all CrREM1 copies were silenced, since it was transcriptionally very active. This implies that methylation is not the only mean of silencing retrotransposon in C. reinhardtii. In Drosophila melanogaster, retrotransposons are located mainly in regions of the heterochromatin (36) and are silenced by a group of polycomb proteins (45).
The increase in CrREM1 copy number could be associated with cytosine changes as described in rice (24) but this phenomenon should be more extensively studied. In hybrid lines of rice (Oryza sativa x Zea latifolia), it has been observed that occasional internal deletion of a gypsy retrotransposon is accompanied by the activation of the element (24). This could explain CrREM1 overexpression in mutant N1, where the CrREM1 copy isolated by inverse PCR had a deletion of 1,590 bp which affected partially to the 5' LTR and the gag gene and was probably caused by the pMN24 insertion. This phenomenon could have also occurred in other mutant strains that overexpressed CrREM1 and have been transformed with selectable markers.
In conclusion, we have isolated a new LTR retrotransposon in C. reinhardtii (REM1) which possesses structural features not described to date, and is phylogenetically related to those of the Ty3-gypsy type. An ORF containing a PHD-finger and a chromodomain is present in CrREM1 with a transcriptional orientation reverse to those of the primary retrotransposon and gag-pol transcripts. CrREM1 can amplify its copy number in strains that undergo phenomena of foreign DNA integration and/or genetic crosses. Though extensive methylation might occur in some copies of the retrotransposon another copies are highly active transcriptionally in a stable fashion. However, this fact does not imply an increase in the retrotransposon copy number that is also stably maintained during very long periods of cell growth. It is proposed that elements involved in the integration of foreign DNA and/or meiotic recombination are needed for integration and transposition of this retrotransposon, probably directed to heterochromatin locations.
This work was supported by Ministerio de Ciencia y Tecnología, Spain (MCYT grant no. BMC2002-03325), Junta de Andalucía, Spain (Plan Andaluz de Investigación, CVI-0128), and the Vth Framework Program of the EU (HPRN-CT-2002-00247).
Present address: CIP/Institute de Chimie B6, University of Liege, 4000 Liege-1, Sart Tilman, Belgium. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2010 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»