Previous Article | Next Article ![]()
Molecular and Cellular Biology, January 2004, p. 105-122, Vol. 24, No. 1
0270-7306/04/$08.00+0 DOI: 10.1128/MCB.24.1.105-122.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Department of Integrated Biosciences, Graduate School of Frontier Sciences, University of Tokyo, Kashiwa, Chiba 277-8562, Japan,1 Department of Molecular Biology and Genetics, Johns Hopkins University School of Medicine, Baltimore, Maryland 212052
Received 16 June 2003/ Returned for modification 5 August 2003/ Accepted 25 September 2003
|
|
|---|
|
|
|---|
Considerable numbers of non-LTR retrotransposons, including human L1, have two putative open reading frames (ORFs), whereas some elements have only one ORF (22). In the former non-LTR retrotransposon group, the second ORF (ORF2) encodes endonuclease (EN) and reverse transcriptase (RT) domains. In ORF2, EN and RT are the only characterized domains and are essential for retrotransposition of non-LTR retrotransposons with two ORFs. However, there is very little functional information available on the first ORF (ORF1), because of low levels of sequence similarity among non-LTR retrotransposons. In ORF1, CCHC zinc finger motifs are considered essential (37), although a leucine zipper motif may substitute for the zinc fingers in ORF1 of the human L1 element (15). Moran et al. showed that three regions conserved in the ORF1 sequences of mouse, rat, and rabbit L1s are essential for in vivo retrotransposition, but their functional roles were not yet clarified (26). Moreover, previous studies indicated that ORF1 of non-LTR retrotransposons binds to nucleic acids in vitro (7, 14, 18, 19), multimerizes to form ribonucleoprotein particles (RNPs) (13, 28), acts as a chaperone to nucleic acids to facilitate stable reverse transcription (7, 25), and transports RNPs into the nucleus (31, 32). There are a few results to show which structure in ORF1 is responsible for specific functions (13, 24, 26). At present, however, these roles of ORF1 protein have been demonstrated mostly by in vitro methods and are insufficiently understood.
The ORF1 and ORF2 proteins of non-LTR retrotransposons are translated from their mRNAs, although it is not yet known how the translation of two independent units of overlapping ORFs is regulated. Two putative ORF proteins and the mRNA are assembled into RNPs, and then the RNPs are transported into the nucleus, make a nick in the target DNA by using the EN domain in ORF2, and start the reverse transcription of RNA by using the nick as primer. These processes in the nucleus that are specific to non-LTR retrotransposons are called target-primed reverse transcription (TPRT) (21). There are two hypotheses explaining reverse transcription in the nucleus. One states that the integration event occurs during mitosis, when nuclear transport itself is unnecessary. The other states that active transport of RNPs is controlled by unidentified functional domains in the ORFs. The nuclear transport mechanism of RNPs of non-LTR elements, however, is largely unknown.
To clarify the functional roles of ORF1 in the retrotransposition of non-LTR retrotransposons, we used telomere-specific SART1 of the silkworm Bombyx mori. Recently, we established the retrotransposition system of SART1 in vivo, using baculovirus-mediated expression (37), and therefore it is possible to identify every domain essential for SART1 retrotransposition with mutation-based analyses. SART1 is a sequence-specific non-LTR retrotransposon that integrates into insect telomeric repeats (TTAGG)n (36). Because telomerase activity cannot be detected in the silkworm B. mori (33), SART1 may contribute to telomere maintenance or back up the undetectable levels of telomerase activity in this insect (11).
In this study, by combining an in vivo retrotransposition assay and green fluorescent protein (GFP) fusion analyses, we demonstrate that three domains of ORF1 are required for the retrotransposition of SART1. Each of these domains is essential for either active transport into the nucleus, control of the dotted localization pattern in the nucleus, or putative RNP formation. Furthermore, we also show that baculovirus-expressed ORF1 protein rescues the ORF1-deficient mutants of SART1 in trans in cells, suggesting that the ORF1 protein works as an independent functional unit in multiple forms. This report demonstrates directly that nuclear import signals in ORF1 of SART1 are required for retrotransposition and that several functional domains involved in retrotransposition are clearly localized to ORF1 of non-LTR retrotransposons.
|
|
|---|
(i) Mutation introduction. The deletion and substitution mutants were constructed by inverse PCR and self-ligation with 5'-phosphorylated primers (Table 1). After the inverse PCR, the products with the phosphorylated 5' ends were self-ligated with the TaKaRa DNA ligation kit (version 2) for 16 h. To introduce various mutations, one primer with a 3- or 12-base substitution at its 5' end was used in inverse PCR. Double-mutated constructs were generated from a single mutated construct as template DNA by inverse PCR and self-ligation as described above. Phosphorylated primers were made with T4 polynucleotide kinase (TaKaRa) and ATP (Roche). Deletion mutants for in vivo retrotransposition assays were subcloned between NcoI and BglII sites of the pAcGHLT-B plasmid (PharMingen). SART1ORF1pWT was amplified by PCR with SART1S880 and SART1A3029NotI and subcloned between NcoI and NotI sites of pAcGHLT-B.
|
View this table: [in a new window] |
TABLE 1. Primers
|
Generation of recombinant Autographa californica nuclear polyhedrosis virus (AcNPV). Sf9 cells were cultured at 27°C in TC-100 medium containing 10% fetal bovine serum (Nihon-Nosankougyou Co., Yokohama, Japan) and penicillin-streptomycin (Gibco). The recombinant baculovirus was generated by cotransfection of the wild-type (WT) and mutant SART1-pAcGHLT-B plasmids together with BaculoGold DNA (PharMingen) into the Sf9 cells by using the Tfx-20 lipofection reagent (Promega). Four days later, the recombinant baculovirus was plaque purified and amplified from a single plaque, according to the manufacturers' instructions.
In vivo retrotransposition assay. The in vivo retrotransposition assay of the SART1 element was performed essentially as described by Takahashi and Fujiwara (37). Sf9 cells (106) were infected with a SART1 ORF-containing AcNPV at a multiplicity of infection of 10 PFU. In the trans-complementation experiments, Sf9 cells were infected with two AcNPVs at a multiplicity of infection of 5 PFU. At 24, 36, and 48 h after infection, cells were collected, and the total genomic DNAs were purified with the Puregene DNA isolation kit (Gentra Systems). PCR amplification was carried out with LA-Taq DNA polymerase (TaKaRa) and the primer set SART1S6311-CCTAA+T (Table 1) in the presence of anti-Taq antibody (Clontech) and 10 ng of Sf9 DNA. The reaction mixture was denatured at 96°C for 2 min, followed by 35 cycles of 98°C for 20 s, 62°C for 30 s, and 72°C for 30 s. The PCR products were electrophoresed on 2% agarose gels in Tris-borate-EDTA buffer and visualized by ethidium bromide staining. PCR products were cloned into the pGemT-Easy vector (Promega) and sequenced with the Big Dye terminator cycle sequencing kit (Applied Biosystems) on an ABI 310 genetic analyzer.
GFP microscopy. Sf9 cells were transfected with GFP-expressing plasmids by using the Tfx-20 lipofection reagent. After 72 h, transfected cells were gathered, adhered to coverslips, and then fixed with 4% paraformaldehyde in phosphate-buffered saline (PBS). The slips were washed with PBS and mounted by using Vectashield mounting medium with DAPI (4',6'-diamidino-2-phenylindole) (Vector). The cellular localization of GFP-fused proteins was analyzed and photographed with UV illumination on a Zeiss Axioscop instrument, and GFP images were imported into Adobe Photoshop Elements.
Immunoblot detection of GST-His6 fusion protein. Glutathione S-transferase (GST)-His6-fusion proteins were detected by immunoblot analysis. Total proteins were extracted from Sf9 cells at various times after the AcNPV infection and subjected to sodium dodecyl sulfate-10% polyacrylamide gel electrophoresis. Electrophoresed proteins were transferred to a polyvinylidene difluoride membrane (Nippon Genetics). The membrane was immersed in 4% skim milk in PBS-T (PBS plus 0.1% Tween 20), washed several times with PBS-T, and incubated with anti-His antibody (Amersham) in PBS-T. Band detection was carried out with the ECL Plus Western blotting system (Amersham).
|
|
|---|
![]() View larger version (34K): [in a new window] |
FIG. 1. Structural features of ORF1 of SART1. (A) Schematic structure and sequence specificity of SART1. SART1 inserts between T and A of the telomeric repeats (TTAGG)n. The 5' and 3' untranslated regions (UTR) are shown in gray, and the ORFs are shown as open boxes. APE/RT, endonuclease/RT domain. Vertical lines in ORF1 and ORF2 represent the zinc finger motifs. ORF1 terminates at codon 713. (B) Amino acid sequence of the putative ORF1 protein of SART1 (accession number D85594.1). Asterisks indicate the NLS-like motifs predicted by the PSORT program. Three CCHC-type zinc finger motifs at the C-terminal end are underlined. (C) Electrical properties of the SART1 ORF1 protein. ORF1 is subdivided into five regions. The regional isoelectric points were calculated with the sequence analysis software Vector NTI and are indicated in the open boxes. The amino acid numbers are shown below the diagram. Arrowheads represent the positions of putative NLSs. There are four putative NLSs, N1, N2, N3, and N4.
|
![]() View larger version (53K): [in a new window] |
FIG. 3. The N terminus of SART1 ORF1 protein has a karyophilic domain. (A) Expression constructs in which GFP was fused with each region of SART1 ORF1. The nuclear localization ability of each region in SART1 ORF1 was examined with GFP fusion proteins. Open boxes indicateeach region of SART1 ORF1, and gray boxes indicate the GFP region. The vertical lines in constructs VI, IX, and X represent the zinc finger motifs. Amino acid numbers for the ORF1 region used for each construct are shown below the open boxes. N, nucleus; C, cytoplasm; N/C, both nucleus and cytoplasm. I, GFP without SART1 ORF1; II to VI, the five ORF1 regions (see Fig. 1) fused to GFP; VII to X, large ORF1 regions fused to GFP. (B) Cellular localization of the GFP fused with each region of SART1 ORF1 in Sf9 cells. The left and right pictures show cellular localizations of GFP fusion protein and DNA stained with DAPI, respectively. Bar, 10 µm.
|
![]() View larger version (54K): [in a new window] |
FIG. 2. SART1 ORF1 proteins localize in the nucleus in a dotted pattern. (A) Constructs for analysis of the localization of GFP-SART1 ORF1 fusion protein. Open boxes represent the SART1 ORF1 protein and gray boxes represent the GFP region. Amino acid numbers for the ORF1 deletions are shown below constructs I to IV. The results from panel B are summarized on the right. Vertical lines represent zinc finger motifs. (B) Cellular localization of the GFP fused with each region of SART1 ORF1 in Sf9 cells. Intact SART1 ORF1 protein is localized in the nucleus in a dotted pattern. The left-hand pictures show the localization of GFP fusion protein analyzed by fluorescence microscopy. The center pictures show DNA localization by staining with DAPI. Merged pictures of GFP-fused protein and DNA (for panels I to III) are shown on the right. In panels I, arrowheads indicate GFP signals in the cytoplasm (cytoplasmic cluster). Bar, 10 µm.
|
|
View this table: [in a new window] |
TABLE 2. Localization pattern of GFP-fused SART1 ORF1
|
![]() View larger version (39K): [in a new window] |
FIG.4. Mutational analysis of NLS activity in the N-terminal domain of SART1 ORF1. (A) NLS activity was examined by mutational analysis of SART1 ORF1. Expression constructs in which GFP was fused with each region of SART1 ORF1 are shown on the left. The left and right pictures show the localizations of GFP fusion protein and DNA stained with DAPI, respectively. N, nucleus; N/C, both nucleus and cytoplasm; N>C, many signals in the nucleus. The two putative NLSs (N1 and N2) identified by a computer-based search are represented by closed boxes. In panels V and VI, hatched boxes indicate the alanine substitution (N1, RRKR to AAAA; N2, KRGR to AAAA) within the putative NLS sequences. Bars, 10 µm. (B) Cellular localization of the NLS-less and full-length SART1 ORF1 proteins fused with GFP. The expression construct is shown at the top. Open boxes represent the SART1 ORF1 protein, and gray boxes represent the GFP region. Amino acid numbers for the ORF1 region are shown below the open box. Hatched boxes indicate the alanine substitution (N1, RRKR to AAAA; N2, KRGR to AAAA) within the NLS sequences, exactly as in panels VI in panel A. Vertical lines show zinc finger motifs. The left and right pictures show the localization of GFP fusion protein and DNA stained with DAPI, respectively. The arrowhead indicates the cytoplasmic cluster. Bar, 10 µm.
|
The GFP fusion protein containing residues 1 to 284 of ORF1 was localized only in the nucleus (Fig. 3, panels VII). However, other larger fused proteins, containing residues 143 to 447 (Fig. 3, panels VIII), 448 to 712 (panels IX), or 285 to 712 (panels X), were found only in the cytoplasm (panels VIII and X) or in the whole cellular region (panels IX). This observation indicates that the first region of SART1 ORF1, which includes residues 1 to 284, possesses an authentic NLS. The small GFP molecule fused with residues 580 to 712 of ORF1 entered the nucleus by passive diffusion and was localized in the nucleus. Since a longer GFP molecule fused with residues 285 to 712 distinctly localized in the cytoplasm (Fig. 3, panels X), residues 580 to 712 do not have obvious activity to import the ORF1 protein into the nucleus. It is surprising that there was no NLS activity in the regions 285-447 (Fig. 3, panels IV), 143-447 (panels VIII), and 285-712 (panels X), all of which include the putative NLSs, N3 and N4 (see above). This indicates that N3 and N4, which appear to be typical NLSs with strongly basic domains, do not act as signals for nuclear translocation in SART1.
Localization in the nucleus requires both N1 and N2 signals near the N terminus of SART1 ORF1. Two putative NLSs, N1 and N2, were identified in the N-terminal region by a computer-based search (Fig. 1B). Therefore, we used mutational analysis to study these putative NLSs, which occur in residues 96 to 134 (Fig. 4). When this region was completely deleted from the WT construct (Fig. 4A, panels I), nuclear localization was blocked and GFP signals were observed in both the nucleus and the cytoplasm (Fig. 4A, panels II). In contrast, a minimal region including N1 and N2 (92 to 142) was sufficient to localize GFP signals in the nucleus (Fig. 4A, panels III). These results strongly suggest that the nuclear import activity is encoded within the region from residue 92 to 142. To clarify whether N1 and N2 are essential for nuclear localization, we deleted the N2 region (residues 124 to 142) (Fig. 4A, panels IV) or introduced severe substitution mutations into the basic region of N1 (changing RRKR to AAAA) (Fig. 4A, panels V). In both cases, the GFP signals were observed predominantly in the nucleus, with few in the cytoplasm. When double mutations were introduced into both N1 and N2 (RRKR to AAAA in N1 and KRGR to AAAA in N2), the nuclear localization of GFP was abolished (Fig. 4A, panels VI). These results demonstrate that N1 and N2 work independently as weak NLSs but that both signals are required for nuclear transport activity. It seems that localization of some of the GFP signals shown in Fig. 4A is affected by passive diffusion because the GFP-ORF1 proteins are relatively small.
Furthermore, we analyzed localization of a full-length ORF1 construct without an NLS (Fig. 4B). This construct includes the alanine substitution in both N1 and N2, which is exactly the same as the substitution construct in Fig. 4A, panel VI. GFP signals fused to the full-length ORF1 lacking the NLS were observed only in the cytoplasm and not in the nuclear portion. When 31 examples of this construct were tested (Table 2), all GFP signals were localized only in the cytoplasm, and half of them showed dotted signals in the cytoplasm (Fig. 4B; Table 2). Improper cytoplasmic accumulation of GFP-fused protein is possibly due to blocking of localization into the nucleus. These data demonstrate that N1 and N2 control the nuclear localization of SART1 ORF1 protein.
An N-terminal domain containing NLSs is essential for in vivo retrotransposition of SART1.
Next, we examined the requirement for the NLSs at the N terminus of ORF1 in the retrotransposition of SART1, using a recently developed in vivo retrotransposition assay (37). Highly expressed SART1 under control of the polyhedrin promoter of AcNPV in Sf9 cells can transpose into specific sites at telomeric repeats (TTAGG)n. This event is detectable by PCR with a specific primer set (Fig. 5A). Since the genome of Sf9 cells does not have the Bombyx SART1, the retrotransposed SART1 copy expressed by AcNPV infection could be detected by this analysis. First, we constructed an NLS-deficient mutant of SART1 (
1-142 construct) in the AcNPV vector and infected Sf9 cells with it. At 24 to 48 h after infection, we extracted the genomic DNA and used PCR to amplify the boundaries between the retrotransposed SART1 3' ends and the telomeric repeats. Although retrotransposition of wild-type SART1 occurred actively and increased from 24 to 48 h postinfection (hpi),
1-142 did not show a significant PCR band even at 48 hpi (Fig. 5B). The above result, therefore, shows that the N-terminal region of SART1 ORF1 is essential for the efficient retrotransposition of the SART1 element.
![]() View larger version (40K): [in a new window] |
FIG. 5. In vivo retrotransposition assay of WT SART1 and 1-142 SART1. (A) Schematic overview of constructs and primers used for the in vivo retrotransposition assay of SART1. The SART1 ORF1, ORF2, and 3' untranslated region (UTR) were expressed downstream from the GST-His6 tag under control of the polyhedrin promoter of AcNPV (37). PCR amplification was performed with two primers (open arrows) (see Table 1) between the 3' UTR of retrotransposed SART1 and the telomeric repeats in the Sf9 host genome. In 1-142 SART1, the ORF1 N-terminal residues 1 to 142, which include both N1 and N2, were deleted from WT SART1 AcNPV. Therefore, in 1-142, all regions other than residues 1 to 142 of ORF1 are the same as those of the WT construct. ORF1pWT AcNPV includes only the ORF1 region of SART1 and not ORF2 and the 3' UTR. (B) Results of the in vivo retrotransposition assay. Lane M, size marker with 100-base ladder; Sf9, intact Sf9 genome with no recombinant AcNPV infection. The PCR band, which represents the exact retrotransposition, is 423 bp in length (arrow). (C) Immunoblot detection of ORF1 proteins N-terminally conjugated with the GST-His6 tag in the in vivo retrotransposition assay (see panel A). Total proteins from Sf9 cells infected with each recombinant AcNPV were separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and analyzed by immunoblot assay with anti-His6 antiserum to detect His-tagged proteins. Lanes: 1, WT SART1; 2, 1-142; 3, coinfection of 1-142 and ORF1pWT; 4, mock infection; and 5, noninfected Sf9 cells. The mock infection should express only GST-His6.
|
1-142 mutant could be rescued by normal ORF1 protein by trans-complementation. When the AcNPV virus that expresses only the ORF1 protein of SART1 (Fig. 5A, ORF1pWT) was coinfected with
1-142 mutant virus, a strong PCR band representing highly active retrotransposition was observed 36 to 48 h after coinfection, which seemed slightly delayed compared with WT infection (Fig. 5B). This time lag for retrotransposition between infection with WT SART1 only and ORF1pWT/
1-142 coinfection may indicate that multiple ORF protein complexes are formed less stably among WT ORF1 and N-terminally deficient
1-142 ORF1 proteins or that cis preference is more efficient than trans-complementation in the SART1 element (39). The result clearly shows that the loss of the N-terminal region of
1-142 can be compensated for by WT ORF1 protein.
ORF1 protein of SART1 acts as an independent functional unit.
The above result of the trans-complementation experiment also suggests another important concept described below. Until now, it has not been demonstrated that ORF1 protein in non-LTR retrotransposon acts as an independent functional unit. We usually envisage the ORF1 and ORF2 units based only on sequence information, because we do not know the exact translation mechanisms for producing the two overlapping ORFs in non-LTR retrotransposons. In this study, therefore, we expressed the longest ORF1, which starts at ATG+1 (+880 from the SART1 5' end) and ends at putative stop codon TAA+713 (+3021), conjugated with the GST-His6 tag in the baculovirus expression system (Fig. 5A). This putative ORF1 protein from ORF1pWT complemented the NLS-deficient element
1-142 in trans (Fig. 5B).
To confirm this hypothesis, we further studied this trans-complementation event at the protein level by immunoblotting analyses (Fig. 5C). The total size of the GST-His6 tag fused to ORF1 1-712 of SART1 expressed in the AcNPV system is estimated to be 106 kDa. When the proteins expressed from the WT SART1-AcNPV hybrid element were analyzed by Western blotting with anti-His antibody, we detected a band of about 115 kDa, with no other bands (Fig. 5C, lane 1). Although some readthrough products caused by frameshift translation are usually observed in retroviruses and LTR-type retrotransposons, such a frameshift event did not occur, at least in the non-LTR element SART1. We observed about a 9-kDa increase in size in the actual ORF1 protein, which is probably caused by some modification. This proposition is supported by the observation that the ORF1 protein of
1-142 (calculated to be 91 kDa) was detected as a protein of about 100 kDa (lane 2). Because similar amounts of ORF1 proteins were expressed in both the WT and the
1-142 mutant (Fig. 5C, lanes 1 and 2), a loss of ORF1 function but not a decrease in ORF1 quantity was responsible for the block of retrotransposition in
1-142 (Fig. 5B). When the
1-142 mutant was rescued with ORF1pWT by trans-complementation, two bands of 100 and 115 kDa were expressed (Fig. 5C, lane 3). The band in lane 1 (WT SART1 origin) and the uppermost band (ORF1pWT origin) in lane 3 seem to be almost the same in molecular mass (115 kDa). This indicates that WT SART1 ORF1 (lane 1) is actually translated from the presumed ORF to 712Glu (Fig. 1B).
In this type of trans-complementation experiment, however, it is possible that a WT retrotransposon unit may be produced by recombination (Fig. 6A). To understand what occurs in the trans-complementation of
1-142 and ORF1pWT, we studied the viral genotypes in Sf9 cells by PCR analysis (Fig. 6B). After 72 h of infection with WT (Fig. 6B, lane 1),
1-142 (lane 2),
1-142 plus ORF1pWT (lane 3), or ORF1pWT (lane 4), genomic DNAs were extracted and PCR was performed with two different primer sets. With the primer set pS2941 and S1A1935, the PCR bands were expected to appear in all cases, and in fact we found an approximately 1.0-kb band for the WT ORF1 genome (lanes 1, 3, and 4) and a band of about 0.6 kb for
1-142 genome (lanes 2 and 3). PCR analysis with another primer set, consisting of S1S1153 in the N-terminal region of ORF1 and S1A3235 in the ORF2 region, produced a band only in the WT (lane 1) and not in
1-142 (lane 2),
1-142 plus ORF1pWT (lane 3), or ORF1pWT (lane 4). The result in lane 3 showed that there was no recombination event producing the WT SART1 element when
1-142 and the ORF1pWT virus were coinfected. These results indicate that the translation of SART1 ORF1 ends at a stop codon of TAA713 and that its product has the functional roles in retrotransposition of SART1.
![]() View larger version (30K): [in a new window] |
FIG. 6. Identification of the viral genotype in coinfection. (A) Schematic model for the possible recombinant that could appear in the trans-complementation experiment with 1-142 plus ORF1pWT. Arrows indicate the primers used to analyze the viral genotype that appears during the coinfection (see panel B). The PCR products for the WT genotype should be 2,082 bp for the primer set S1S1153-S1A3235 and 1,006 bp for pS2941-S1A1935. The PCR product for the 1-142 genotype with the pS2941-S1A1935 primer set should be 580 bp. (B) Identification of the viral genotype by PCR. Lanes: 1, WT SART1; 2, 1-142; 3, coinfection of 1-142 and ORF1pWT; and 4, ORF1pWT. Size markers are shown on the left.
|
1-52) (Fig. 7A), in vivo retrotransposition occurred from 24 hpi and increased gradually (Fig. 7B). The similar retrotransposition activities of WT and the
1-52 mutant indicate that about 50 amino acids at the N-terminal end of SART1 ORF1 are not essential under the conditions used in the assay. However, an N1 deletion mutant,
1-105, showed a dramatic decrease in retrotransposition activity, similar to that of
1-142, in which both N1 and N2 are deleted. In these deletion mutants, there is a possibility that incorrect folding of ORF1 may cause the block of their retrotransposition. We then tried to test the substitution mutants. An N1 mutant, N1>A, in which RRKR in the N1 sequence was replaced by AAAA, also exhibited a dramatic decrease in the retrotransposition activity. As described above, the same mutation (with RRKR in the N1 sequence replaced by AAAA) was studied in a GFP fusion experiment and resulted in a weakened nuclear localization signal (Fig. 4, panels V). Therefore, degeneration of the nuclear localization signal in SART1 ORF1 strongly affects the retrotransposition activity of the SART1 element. Similarly, an N2 mutant, N2>A (KRGR to AAAA), and an N1/N2 double mutant (RRKR to AAAA in N1 and KRGR to AAAA in N2) also lacked the retrotransposition activity. In order to confirm that the loss of retrotransposition activity in these N-terminal mutants is not due to destabilization of the ORF1 proteins, we analyzed the proteins expressed in the mutants by the Western blotting assay. In all N-terminal mutants, we identified ORF1 proteins the expected sizes (Fig. 7C, lanes 2 to 7). Moreover, the retrotransposition activity of the N-terminal mutants (lanes 2 to 7) was rescued by coinfection of ORF1pWT expressing only WT SART1 ORF1 (Fig. 7D). This demonstrates that all mutations and deletions at the N terminus of ORF1 did not affect the mRNA stability, ORF2 functions, and RNP formation. These data indicate that the functional defect but not the instability of ORF1 proteins causes the block of retrotransposition for these mutants. In conclusion, N1 and N2 act as NLSs to control the retrotransposition activity of SART1.
![]() View larger version (50K): [in a new window] |
FIG. 7. Retrotransposition activity of SART1 with mutations in the N-terminal and central regions of ORF1. (A) Constructs of various SART1 mutants. WT and mutated SART1 ORF1, ORF2, and 3' untranslated region (UTR) were expressed downstream from the GST-His6 tag by AcNPV-recombinant virus infection of Sf9 cells. Closed boxes indicate putative NLSs. Hatched boxes indicate the substituted regions where several amino acids were replaced by alanine. In constructs N1>A, N2>A, and N1,2>A, RRKR in N1 and KRGR in N2 were replaced with AAAA. The K residues at 383 (K383A) and 440 (K440A) were replaced with A. The results of the in vivo retrotransposition assay (see panel B) are summarized on the right. The numbers at the top represent hpi. +++, very strong signal; ++, strong signal; +, weak signal; ±, very weak signal; -, no signal. (B) PCR amplification of the 3' junctions of the retrotransposed WT SART1 or various mutants. The PCR band, which represents the SART1 retrotransposition, is 423 bp in length. (C) Western blots of mutant proteins. Protein production in mutants was analyzed by immunoblot assay with anti-His6 antiserum to detect His-tagged proteins. Lanes: 1, WT SART1; 2, 1-52; 3, 1-105; 4, 1-142; 5, N1>A; 6, N2>A; 7, N1,2>A; 8, K383,440A; 9, 440-447; 10, 353-447. (D) Rescue of mutant constructs by trans-complementation. At 36 h after coinfection of AcNPV viruses expressing various mutants with a construct producing only SART1 ORF1 (ORF1pWT), Sf9 genomic DNA was extracted and assayed by PCR detection (see Fig. 5B). Lanes: M, size marker (100-base ladder); 1, WT SART1; 2, 1-52 plus ORF1pWT; 3, 1-105 plus ORF1pWT; 4, 1-142 plus ORF1pWT; 5, N1>A plus ORF1pWT; 6, N2>A plus ORF1pWT; 7, N1,2>A plus ORF1pWT; 8, K383,440A plus ORF1pWT; 9, 440-447 plus ORF1pWT; 10, 353-447 plus ORF1pWT; 11, ORF1pWT.
|
354-447). This implies that the deletion may cause incorrect folding of proteins and that a conformation including the highly basic region is important for SART1 retrotransposition. These constructs normally expressed mutated SART1 ORF1 proteins of the expected sizes (Fig. 7C, lanes 8 to 10) and also could retrotranspose to the telomeric repeats by trans-complementation with ORF1pWT (Fig. 7D, lanes 8 to 10). These results show that a highly basic region in the center of ORF1, residues 354 to 447, is essential for SART1 retrotransposition. |
|
|---|
![]() View larger version (19K): [in a new window] |
FIG. 8. Three essential domains in ORF1 required for the retrotransposition activity of SART1. This figure summarizes the results in this paper, indicating the essential ORF1 regions involved in cellular localization and retrotransposition of SART1. N/dot, nuclear and dotted signal distribution; C/broad, cytoplasmic and broad signal distribution; N/broad, nuclear and broad signal distribution. +, active retrotransposition; -, no retrotransposition; -*, very weak retrotransposition (see Fig. 5). Closed boxes and vertical lines indicate N1 and N2 NLSs and the three CCHC-type zinc finger motifs, respectively. The broken line in the first zinc finger motif in the bottom construct shows the substitution mutation site (1H626P, LGHVS to LGPVS), which results in complete loss of retrotransposition activity (37). Deletion of residues 354 to 447 inhibited both the dotted nuclear localization and retrotransposition of SART1.
|
(ii) Central highly basic regions for dotted pattern localization. As shown in Fig. 2, residues 354 to 447 are involved in the formation of the dotted pattern. This localization activity in the nucleus seemed to be dependent on the NLS signals of N1 and N2, because the region from residue 143 to 447, which contains residues 354 to 447 but not N1 and N2, showed only broad cytoplasmic localization for the SART1 ORF1 protein (Fig. 3, panels VIII). A recent report demonstrated that the ORF1 protein of HeT-A and TART, the telomere-specific non-LTR retrotransposons of Drosophila melanogaster, are localized in a similar dotted pattern in the nucleus, like those in this study, whereas the ORF1s of the other three elements (jockey, Doc, and I) in Drosophila showed cytoplasmic localization (30). Colocalization of the ORF1s of HeT-A and the telomere-binding protein HOAP suggests that the ORF1s of these elements play a role in localization to the telomere, although there is no information on the responsible structure in the ORF1s (31). Combining the observations on telomere-specific retrotoransposons in Drosophila, the dotted localization pattern of ORF1 of the telomeric repeat-specific retrotransposon SART1 may also reflect telomere or telomeric repeat binding. Our preliminary results from Southwestern hybridization analysis and GST pull-down assays indicated that full-length ORF1 protein binds specifically to (TTAGG)n repeats (data not shown), although it is unclear at present whether the 354-447 region is involved in the (TTAGG)n binding.
In several site-specific non-LTR retrotransposons, it has been demonstrated that EN domains in ORF2 specifically nick the target DNA sequence (4, 8, 9). We reported that EN of TRAS1, another type of telomeric repeat-specific non-LTR retrotransposon, has a specific nicking activity for (TTAGG/CCTAA)n (1). However, we found that EN of SART1 has a relatively lower specificity for the telomeric repeats (T. Anzai and H. Fujiwara, unpublished data). Basically, a specific enzymatic activity of EN plays a central role in TRAS1 integration into the telomeric repeat, while the SART1 retrotransposition into telomeres seems to be dependent both on the SART1 EN activity in ORF2 and on the telomere-targeting ability in ORF1.
(iii) Zinc finger motif. Most non-LTR retrotransposons, except mammalian LINEs, retain one or more zinc finger motifs in ORF1 (7). At present, however, there is little functional implication of the zinc finger motifs in non-LTR retrotransposons. The zinc finger motifs are also found among Gag proteins in other retroelements, i.e., the LTR-type retrotransposons and retroviruses. These zinc finger motifs (CXXCXXXXHXXXXC, where X is a space), which are an unusual type not seen among many other zinc finger proteins, are specifically conserved among retroelements (7). In human immunodeficiency virus type 1 (HIV-1), these zinc finger motifs in the nucleocapsid are involved in the recognition and packaging of the viral RNA genome (29). In retroviruses and LTR retrotransposons such as Ty1, it is believed that nucleic acid chaperone and matchmaker properties are derived from CCHC zinc fingers or basic amino acid clusters (5, 10). In this study, we found that the SART1 ORF1 protein was localized in the cytoplasm as cytoplasmic clusters, which were observed abundantly in the WT and in the full-length construct lacking an NLS but rarely in the zinc finger-deficient constructs (Fig. 2B [panels I] and 4B and Table 2). Recently, Rashkova et al. have reported that zinc finger-containing regions in ORF1 of the Drosophila telomeric non-LTR retrotransposons were localized with a body or cluster pattern, indicating protein-protein interactions (32). These findings imply that the domain containing zinc fingers in ORF1 of non-LTR retrotransposon has the functional roles in RNP formation, although further studies are necessary to confirm the hypothesis.
ORF1 protein in the process of retrotransposition. Most non-LTR retrotransposons encode two putative ORFs, but there is no report that the ORFs work as substantial and independent units. One reason for this is that we do not yet understand the exact translation mechanisms that produce ORF proteins from the bicistronic non-LTR retrotransposon mRNA. In the SART1 element, ORF2 overlaps ORF1 by 54 nucleotides in the -1 reading frame (36). In retroviruses and LTR retrotransposons, -1 (or rarely +1) translational frameshifting usually produces Gag-Pol fusion proteins from the mRNA. In non-LTR retrotransposons, however, it is unclear whether the two ORFs are translated into one fusion protein by -1 frameshifting or into two independent proteins by specific mechanisms.
Selem et al. (34) detected the putative ORF1 protein of a Drosophila non-LTR element, I factor, by Western blotting and indirect immunofluorescence labeling. Here, we have demonstrated not only that ORF1s of the expected sizes expressed from their mRNAs by both the WT and deletion mutant
1-142 (Fig. 5C) but also that ORF1 protein itself can rescue the NLS-less
1-142 function by trans-complementation (Fig. 5B). This is the direct evidence showing that ORF1 of SART1 is an independent functional unit and is translated by using the putative stop codon (+713; TAA) and not by translational frameshifting. SART1 ORF1 protein expressed with baculovirus in Sf9 was observed to have a 9-kDa increase in size, and we speculate that this is caused by some modification such as glycosylation.
The human L1 element shows cis preference for the ORF proteins produced from its own mRNA (39). SART1 showed trans-complementation, but its lower retrotranspositional efficiency also suggests a cis preference in the SART1 element. These observations indicate that ORF1 and ORF2 of non-LTR retrotransposons are assembled with their mRNAs immediately after translation in the cytoplasm. By analogy with other retroelements, multimerized ORF1 may act as the Gag protein does, to surround and enclose ORF1 and mRNA, but its RNP structure remains unclear. Although the ORF1 proteins of SART1 (this study) and Drosophila TART and HeT-A (30) are actually transported into the nucleus, there is no report of nuclear localization of ORF1 in some other elements, such as mouse LINE-1 (23, 38) and Drosophila I factor (30, 34). Therefore, it is not currently clear whether RNP transport into the nucleus is generally conserved among non-LTR retrotransposons. Compared with that in general non-LTR retrotransposons, the capacity for nuclear transport may be elevated in telomere-specific non-LTR elements, such as SART, HeT-A, and TART, resulting in nuclear localization during the interphase of cell division when large proteins without NLS cannot pass the nuclear membrane.
Comparison of nuclear transport mechanisms in various retroelements. There is a critical difference between non-LTR retrotransposons and other retroelements, such as LTR retrotransposons and retroviruses, in the process of retrotransposition (Fig. 9). The latter elements are reverse transcribed and formed into double-stranded cDNA in the cytoplasm, whereas the non-LTR elements are reverse-transcribed in the nucleus, in a process called TPRT (21). During the TPRT event in the nucleus, the non-LTR retrotransposon machinery includes mRNA, the ORF2 unit (EN and RT), and ORF1 protein, which is required for nuclear localization (Fig. 9A, step III), as shown in this study. In addition to the NLS function, ORF1 may be generally required in the retrotransposition machinery during TPRT. Martin and Bushman (25) reported that the ORF1 protein from mouse LINE-1 promotes the annealing of the cDNA strand and mediates nucleic acid transfer steps during TPRT. Several other studies also reported that the ORF1 proteins from non-LTR retrotransposons have roles in nucleic acid binding activity (7, 14, 18, 19). These data indicate that the ORF1 protein has nucleic acid chaperone activity and brings mRNA and ORF2 together into the RNP in the non-LTR retrotransposition machinery. Three essential domains, i.e., the NLS, telomere-binding, and zinc finger domains, in SART1 ORF1 support this hypothesis that ORF1 protein is necessary in the nucleus. The Gag protein of LTR retroelements and retroviruses also has nucleic acid chaperone activity during reverse transcription in the cytoplasm (5, 10).
![]() View larger version (27K): [in a new window] |
FIG. 9. Comparison of the life cycles of non-LTR retrotransposons and LTR retrotransposons. (A). Non-LTR retrotransposons. I, translated ORF proteins and their own mRNAs assemble to form RNPs. II, RNP moves to the nucleus. SART1 has a remarkable NLS in the ORF1 protein, which results in efficient import of the element into the nucleus, leading to its efficient retrotransposition. III, RNP retrotransposes into the target site of the host genome by TPRT. In TPRT, the ORF1 protein may be required for nucleic acid chaperone function to promote the formation of stable double-stranded DNA, as suggested by Martin and Bushman (25). (B). LTR retrotransposons. I, translated ORF proteins and their own mRNAs form virus-like particles. II, reverse transcription occurs in the cytoplasm to generate a DNA intermediate in the virus-like particle. In this process, Gag protein is thought to exhibit the nucleic acid chaperone activity. III, the DNA intermediate and integrase (IN) from a complex and are imported into the nucleus by the NLS of IN. VI, the IN-DNA complex moves to an insertion site in the host genome, and the insertion event is catalyzed by IN. Closed arrows represent the role of ORF1 (or Gag) as the nucleic acid chaperone in both non-LTR and LTR elements.
|
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»