Previous Article | Next Article 
Molecular and Cellular Biology, January 2000, p. 213-223, Vol. 20, No. 1
0270-7306/0/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
Integration of Bombyx mori R2 Sequences into the 28S
Ribosomal RNA Genes of Drosophila melanogaster
Danna G.
Eickbush,
Dongmei D.
Luan,
and
Thomas H.
Eickbush*
Department of Biology, University of
Rochester, Rochester, New York 14627-0211
Received 9 August 1999/Returned for modification 21 September
1999/Accepted 29 September 1999
 |
ABSTRACT |
R2 non-long-terminal-repeat retrotransposable elements integrate
into a precise location in the 28S rRNA genes of arthropods. The
purified protein encoded by R2 can cleave the 28S gene target site and
use the 3' hydroxyl group generated by this cleavage to prime reverse
transcription of its own RNA, a process called target-primed reverse
transcription. An integration system is described here in which
components from the R2 element of the silkmoth, Bombyx
mori, are injected into the preblastoderm embryo of
Drosophila melanogaster. Silkmoth R2 sequences were readily detected in the 28S rRNA genes of the surviving adults as well as in
the genes of their progeny. The 3' junctions of these insertions were
similar to those seen in our in vitro assays, as well as those from
endogenous R2 retrotransposition events. The 5' junctions of the
insertions originally contained major deletions of both R2 and 28S gene
sequences, a problem overcome by the inclusion of upstream 28S gene
sequences at the 5' end of the injected RNA. The resulting 5' junctions
suggested a recombination event between the cDNA and the upstream
target sequences. This in vivo integration system should help determine
the mechanism of R2 retrotransposition and be useful as a delivery
system to integrate defined DNA sequences into the rRNA genes of organisms.
 |
INTRODUCTION |
Non-long-terminal-repeat (non-LTR)
retrotransposable elements are a widespread and abundant class of
eukaryotic mobile elements. Direct evidence for the mechanism of
non-LTR retrotransposition has been obtained from studies of R2, an
element which exhibits extraordinary insertion specificity for the 28S
rRNA genes of its arthropod host (2, 3). The single open
reading frame (ORF) of the R2 element from the silkmoth, Bombyx
mori, was expressed in bacteria and found to encode a
sequence-specific endonuclease (37). In vitro studies
revealed that the purified R2 protein was capable of synthesizing a
cDNA copy of its own RNA transcript directly onto the 28S target site
(27). As shown in Fig. 1, this
mechanism involves the formation of a specific nick on the noncoding
strand of the target 28S DNA. The exposed 3' hydroxyl group is then
used to prime reverse transcription, a process termed target-primed
reverse transcription (TPRT), before cleavage of the coding strand.
Only RNA molecules containing the 250 bp 3' untranslated region (3'
UTR) of the R2 element can support the TPRT reaction (25).
This reaction has many similarities to the mechanism used by mobile
group II introns to insert into unoccupied target sites (intron homing)
(5, 41).

View larger version (23K):
[in this window]
[in a new window]
|
FIG. 1.
Diagram of the TPRT model for R2 retrotransposition. In
the first step of the reaction the R2 protein cleaves the noncoding
(primer) strand of the target site and uses the released 3' end to
prime reverse transcription. After reverse transcription, cleavage of
the coding (nonprimer) strand occurs. The R2 element does not have
RNase H activity (27); thus, removal of the RNA template
after reverse transcription is conducted by the cellular machinery. It
is not known whether attachment of the cDNA to the upstream target
sequences and synthesis of the second DNA strand of the element is also
catalyzed by the R2 protein or is dependent upon the cellular DNA
repair machinery. Thick lines, DNA target sequences; thin line, RNA
template; dashed lines, synthesized first and second DNA strands of the
new insertion.
|
|
Studies of non-LTR integration mechanisms have also been conducted with
the L1 elements found in mammals (6, 8, 29). These studies
have suggested that L1 elements are also likely to use a TPRT mechanism
of retrotransposition; however, the endonuclease cleavage and RNA
binding of the L1-encoded proteins are considerably less sequence
specific than that of the R2 protein. This indiscriminate choice of RNA
templates and insertion sites by L1 elements has significantly shaped
the human genome. Nearly 30% of the human genome can be attributed to
the reverse transcription of L1 and various other RNA templates
(19, 20, 33).
Although the mechanism by which the R2 protein cleaves the DNA target
and initiates the TPRT reaction has been extensively studied (25,
26, 39, 40), little is known of the subsequent step in the
integration process: attachment of the R2 sequences to the upstream 28S
gene target site. Analysis of endogenous R2 5' junctions from a variety
of arthropods have suggested that 5' attachment is accomplished either
by a recombination event or a jump (template switch) of the reverse
transcriptase from the RNA template to the upstream DNA target site
(3, 10). In either case, complete integration of an R2
element is likely to be highly dependent upon the cell's DNA repair machinery.
An R2 protein-R2 RNA complex can find and cleave the 28S gene target
when incubated with total genomic DNA in vitro (39). In
addition, a 1,000-fold excess of nonspecific RNA does not prevent the
R2 reverse transcriptase from recognizing the 3' end of its own
transcript for TPRT (25). It thus seemed reasonable that an
R2 protein-RNA complex injected into an intact cell would find its 28S
rRNA gene target site and initiate the TPRT reaction. Such an in vivo
system could supply any required cellular DNA repair machinery in
trans, allowing complete R2 integration reactions to occur. In
this report, we describe such an integration system in which RNP
complexes containing R2 protein and R2 RNA from B. mori are
injected into Drosophila melanogaster preblastoderm stage (1 to 2 h) embryos. R2 integration events in the 28S rDNA genes of
the surviving adult flies and their progeny were detected by PCR
amplification assays. This injection system has enabled us to monitor
which RNA sequences may be needed at the 5' end of the R2 transcript to
complete the TPRT reaction. This approach should eventually make it
possible to study engineered R2 elements introduced into their normal
location within the nucleolus.
 |
MATERIALS AND METHODS |
Synthesis of the R2 RNA templates.
The clone used to
synthesize the mini-R2-28S RNA was constructed by separate PCR
amplification of the 5' and 3' ends of an R2Bm insertion in B. mori DNA. The 3' end of the construct was amplified by using
5'-CTAAGTCGACTTGGTTGAGCCTTGCACAG-3' to prime synthesis
within the R2 element (starting 255 bp from the 3' end) and
5'-CTGCAAGCTTGCTAGATAGTAGATAGG-3' to prime synthesis in the downstream 28S gene sequence (ending 81 bp downstream of the R2 insertion site). The PCR product was cloned into the SalI
and HindIII sites of the pBSIISK
(pBluescript)
vector. The 5' half of the construct was amplified by using
5'-CTAACTCGAGGAGTCTCTAGTCGATAG-3' to prime synthesis in the
upstream 28S gene (starting 495 bp upstream of the R2 site) and
5'-CTAAGTCGACCGTTCTAAGGCGGCACT-3' to prime synthesis within
the R2 element (ending 470 bp from the 5' end), and inserted into the
XhoI and SalI sites of the above clone. The clone
used to synthesize the micro-R2-28S RNA was constructed by PCR
amplification of the 5' end of an R2Bm insertion from B. mori DNA by using 5'-CTATGTCGACGGTACCCAGATTAAGACGAC-3'
to prime synthesis in the upstream 28S gene (starting 170 bp
upstream of the R2 site) and 5'-CATTGGTACCTCAGCTCAGAACTGGCACGG-3'
to prime synthesis in the R2 element (ending 50 bp from the 5'
end). This PCR product was inserted into the KpnI and
SalI sites of construct R2Bm249 (25).
For in vitro transcription of RNA, plasmid HR4 (
27) was
linearized with
XmnI, plasmid mini-R2-28S with
HindIII and micro-R2-28S
with
EcoRI. Vector
RNA was synthesized from pBSIISK

linearized
with
ApalI.
DNA templates used for the synthesis of HR4/10R and
HR4/1A RNA were
generated by PCR amplification of the HR4 plasmid
by using the
universal

40 primer and specific primers to generate
DNA with the
appropriate 3' ends as previously described (
25,
26). After
restriction digestion or PCR amplification the DNA
was treated with
proteinase K, extracted with phenol-chloroform,
and ethanol
precipitated. RNAs with 5' methyl-G caps were synthesized
with the T7
mMessage mMachine kit (Ambion). One microgram of linearized
template
DNA was transcribed in a 20-µl volume containing a ratio
of cap
analog [m
7G(5')ppp(5')G] to GTP of 4:1, as well as the
standard concentrations
of T7 polymerase and remaining ribonucleotides.
Reactions were
carried out at 37°C for 2 h. The template DNA was
degraded with
2 U of DNase I for 15 min and phenol extracted. After
isopropyl
alcohol precipitation, the RNA was resuspended in 10 µl of
10
mM Tris-HCl (pH 8)-1 mM EDTA, and the concentration was estimated
on agarose
gels.
Preparation of R2 protein-RNA for microinjection.
R2 protein
was purified from Escherichia coli JM109/pR260 as previously
described (39). For the injection, 200-µl aliquots from
the DNA-cellulose column elutant (approximately 3 µg of protein) was
mixed with 10 to 15 µg of in vitro-transcribed R2 RNA and 120 U of
RNasin (Pharmacia). The mixture was dialyzed at 4°C versus injection
buffer containing 5mM KCl, 0.1 mM dithiothreitol, and 0.1 mM NaPO4 (pH
7.5) for 2 h. It was then concentrated threefold by spinning in a
Centricon-50 column at 6,000 rpm for 15 min and stored on ice
throughout the injection. Negligible reductions in the yield of somatic
integration events were obtained after storage of the RNP complex in
this form for several days. The final concentration of RNP complex in
the injection was approximately 50 µg/ml.
Microinjection and Drosophila maintenance.
Injections were performed as described by Spradling (34)
with minor modifications. W1118 preblastoderm embryos (15)
from 1-h collections on apple juice plates were dechorionated in 50% Clorox, followed by extensive washing with water. The embryos were then
aligned on a second plate with their posterior end near the edge of a
coverslip. After removal of the coverslip, the embryos were transferred
to a microscope slide containing a small amount of glue made by
dissolving double-stick Scotch tape in heptane (31). The
embryos were then dehydrated in a box containing Drierite for 7 to 10 min, covered with halocarbon oil, and injected with the protein-RNA
complex by using a Narishige Microinjector. After injection, embryos
were allowed to develop on apple juice-agarose plates overnight at
18°C in an oxygen box. Yeast was then added, and the plates were left
at 25°C for 2 days. Larvae were subsequently transferred to vials
containing a standard cornmeal media.
Preparation of flies for PCR amplification.
Adult flies were
prepared for PCR as described by Gloor and Engels (12).
Individual flies were placed into 0.5-ml tubes and mashed for 5 to
10 s with a pipette tip containing 50 µl of squishing buffer (10 mM Tris HCl, pH 8.2; 1 mM EDTA; 25 mM NaCl; and 200 µg of proteinase
K per ml). After being mashed, the buffer was expelled from the tip,
mixed with the crushed carcass, and incubated at 37°C for 30 min. The
proteinase digestion was stopped by heating the tube to 95°C for 2 min. To screen for germ line events, groups of 10 flies were mashed in
0.5-ml tubes containing 10 µl of squishing buffer; an additional 400 µl of buffer was then added. After the 30-min incubation the reaction
was stopped by heating the tube to 95°C for 3 min. For both the
somatic and germ line assays, 1 µl of the crude DNA preparation was
directly used in each 25-µl PCR reaction.
Nested PCR and sequencing of the 5' and 3' junctions of the R2Bm
insertions.
For each round of PCR, one primer was specific to
Drosophila rDNA locus (either upstream or downstream 28S
gene or the R1 element of Drosophila), while the second
primer was specific to the R2Bm element (Fig.
2). To amplify the 3' junction of
the R2Bm insertions in a previously uninserted 28S gene, the 28S(+700R) primer (5'-AAGAGCCGACATCGAAGGATC-3') and the
R2Bm(
250) primer (5'-TTGGTTGAGCCTTGCACAG-3') were used for
first-round amplification. The 28S(+350R) primer
(5'-CTCGTGATACTTTGATC-3') and the R2Bm(
190) primer
(5'-AGCTCGCTCCCTTGGC-3') were used for second-round
amplification. To amplify the R2 insertion in a 28S gene already
containing an R1 insertion, the R1Dm(+350R) primer
(5'-CAGTCCAGCAATCGTATGCTCG-3') and the R1Dm(+100R) primer
(5'-TTGCGCACCACTTCCACGGAAC-3') were used in the first and
second rounds of PCR, respectively, in conjunction with the R2Bm(
250)
and R2Bm(
190) primers described above. To obtain the 5' junctions,
the 28S(
500) primer (5'-CCAATATCCGCAGCTGG-3') and the
R2Bm(
3R) primer (5'-TCATCGCCGGATCATC-3') were used for the
first-round amplification, and Dm(
270) and R2Bm(
260R)
(5'-CCAAGGGAGCGAGCTCC-3') were used for second-round
amplification. Only first-round PCR amplifications were conducted
in screens for germ line events. To continually guard against
possible PCR artifacts or DNA contamination every fourth to seventh fly
tested was a fly which had not been injected. For junction sequences,
the second-round PCR products were cloned into mp18T2 (4),
and individual clones were sequenced.

View larger version (25K):
[in this window]
[in a new window]
|
FIG. 2.
Diagram of the R2 insertion site and the location of PCR
primers used to detect R2Bm insertions. Shown at the top of the figure
is a diagram of the D. melanogaster rDNA unit with the
location of the R1 and R2 insertion sites indicated. Shown at the
bottom are diagrams of a complete HR4 sequence inserted into a 28S gene
with or without an R1 insertion. Location and orientation of the PCR
primers are indicated with the arrows. All primers were arranged as
nested pairs to monitor somatic insertions, with only the external set
used to identify germ line events. R2Bm primers are identified by their
position relative to the 3' end of the element; 28S primers are
identified relative to the R2 insertion site, and R1Dm primers are
identified relative to the 5' end of a full-length element.
Oligonucleotides oriented to prime opposite that of the 28S gene
transcript are indicated with the letter R. Dark boxes, rRNA gene
sequences; thin line, spacer regions of the rDNA unit; open boxes, R2Bm
sequences; stippled box, endogenous R1Dm element.
|
|
 |
RESULTS |
Establishment of an in vivo integration system.
Our decision
to establish a heterologous integration system, in which RNP complexes
encoded by the B. mori R2 element are injected into embryos
of D. melanogaster, was based on several perceived
advantages. First, this system would enable us to take advantage of the
procedures developed for the injection of P elements into the
preblastoderm embryos of D. melanogaster (32,
35). Injection into the continuous cytoplasm at the posterior end
of a preblastoderm stage embryo allows material to be incorporated into
multiple embryonic cells, including the primordial germ cells, during
subsequent cellularization of the embryo. This approach has enabled the
development of a number of DNA-mediated transposable element
transformation systems in addition to the P element (9, 24,
30). A second advantage of injecting B. mori R2
components into D. melanogaster is that PCR primers and
hybridization probes readily differentiate all segments of a B. mori R2 insertion (R2Bm) from the endogenous D. melanogaster R2 elements (R2Dm) already present in the ribosomal
DNA (rDNA) locus. A homologous system would be difficult to assay since
no strains of B. mori or D. melanogaster have
been identified that lack endogenous R2 elements (17, 38).
Finally, the critical 28S gene sequences required for R2 endonuclease
recognition are identical in all eukaryotes. Therefore, if injected
B. mori RNP complexes functioned in the heterologous cells
of D. melanogaster, then they are likely to function in the
cells of most other insects.
To conduct the embryo injections, R2 protein purified from
E. coli (
27) was mixed with a four- to sixfold molar
excess of
R2 RNA, concentrated and injected into the posterior end of
preblastoderm
embryos (see Materials and Methods). Our initial
injection experiments
utilized RNAs corresponding to the 800-nucleotide
(nt) RNA transcript
(HR4) used in our in vitro studies (
27).
This RNA contains the
final 550 nt encoding the R2 ORF and the 250 nt
3' UTR. We assayed
for the insertion of R2Bm sequences into the 28S
rRNA genes of
the surviving animals by PCR amplification as shown in
Fig.
2.
For each amplification, one primer was complementary to
sequences
within the 3' UTR of the R2Bm element, while the second
primer
was complementary to DNA sequences either upstream or downstream
of the 28S gene insertion site. Only a small fraction of the cells
in
the surviving animals contained the R2Bm integrations; thus,
the assays
used two rounds of PCR amplification with nested primers.
Both larval
and adult stages were initially tested for R2Bm integrations.
Because
the number and reliability of identifying the insertions
was highest
with adult DNA, all studies in this report are based
on the assays with
adult
tissues.
We initially assayed for the 3' junctions of R2Bm elements with the
D. melanogaster 28S gene sequences. Approximately 15%
of
the rDNA units in the w1118 strain of
D. melanogaster used
in the injections already contain an R2 element insertion and
were
unavailable for integration. The remaining
D. melanogaster rDNA units are approximately equally divided between rDNA units
with no
insertions and units already containing an R1 element
insertion.
R1 is a distantly related non-LTR retrotransposon which
inserts
into the 28S rRNA gene at a site 74 bp downstream of the
R2 insertion
site (
16). Adult flies which survived the injections
were
tested with PCR primers complementary to the downstream 28S
gene
sequences and with primers complementary to sequences near
the 5' end
of the R1Dm elements (Fig.
2).
Typical results from two of our initial injection experiments are shown
in Table
1. In experiment 1, the injected
HR4 RNA
was synthesized without a 5' methyl-G cap. Of the 32 adults
tested,
13 (41%) contained R2Bm sequences inserted into rDNA units
without
R1 insertions, and three (9%) had R2Bm insertions in rDNA
units
already containing an R1 element. As will be discussed below,
the
R2Bm insertions generated in this injection experiment contained
extensive deletions of their 5' sequences, presumably due to the
degradation inside the embryo of the uncapped HR4 RNA. Therefore,
in
all subsequent experiments, RNA was synthesized containing
methyl-G-capped 5' ends (see Materials and Methods). When capped
HR4
RNA-R2 protein complexes were injected (Table
1, experiment
2), the
frequency of adults with insertions was found to be 53%
for uninserted
units and 30% for R1 inserted units. A significant
fraction of the
adults contained R2Bm insertions in rDNA units
both with and without an
R1 insertion, indicating that multiple
R2Bm insertions had occurred in
the same embryo.
To demonstrate that the injected R2 protein was responsible for
initiating the integration events scored, HR4 RNA was injected
in the
absence of R2 protein (experiment 3). No R2Bm integrations
were
observed. To demonstrate that only R2 RNA sequences are utilized
in the
TPRT reactions catalyzed by this injected R2 protein, the
R2 protein
was complexed with RNA corresponding to vector sequences
(experiment
4). Again no integrations were observed, indicating
that both the
injected R2 protein and R2 RNA are required for
the in vivo integration
events.
Isolation of germ line events.
To ensure that the DNA products
derived from the PCR amplifications represented stable R2Bm
integrations into the rDNA locus of D. melanogaster,
injected embryos which survived to adulthood (G0
generation) were mass mated, and individual females were allowed to lay
eggs. From each of these lines, 20 G1 progeny (two groups of 10 flies) were tested by PCR for R2Bm integrations. Because all
cells of a positive G1 fly should contain the R2Bm
integration, only single-round PCR assays were necessary. The frequency
of germ line insertions was variable and difficult to estimate because of the low numbers of animals tested. In our most extensive experiment, five G1 positives were obtained from the progeny of 56 G0 females. However, based on several smaller data sets
this experiment probably yielded a higher than average number of germ
line events. From lines in which positive G1 progeny were
found, additional sibling pairs were allowed to lay eggs before being
tested by PCR. The G2 progeny of the
G1-positive pairs were pairwise mated and monitored by PCR,
and the process was continued until homozygous lines were obtained. By
this means, a total of 11 germ line events have been recovered as
separate lines from a total of 14 lineages initially scored as
positive. Southern blots of three representative lines probed with the
3' UTR sequences of R2Bm are shown in Fig.
3A. Two of the lines (HR4-1 and 10R36)
gave rise to restriction fragments predicted for an 800-nt segment of
the R2Bm element inserted into a 28S gene of D. melanogaster
(Fig. 3B). Southern and subsequent PCR analysis of the third line
(10R21) indicated that insertion of this R2Bm sequence into the R2 site
of a 28S gene was accompanied by a large deletion of the upstream rDNA
sequences (Fig. 3B). As will be discussed below, deletions of upstream
28S gene sequences were seen in many of the integration events
resulting from HR4 RNA. Of the 11 germ line insertions analyzed to
date, 5 were found to be insertions into the rDNA locus on the X
chromosome and 6 were inserted into the rDNA locus on the Y chromosome.

View larger version (49K):
[in this window]
[in a new window]
|
FIG. 3.
Genomic blot analysis of three germ line events
resulting from the HR4 injections compared to wild type (w1118). (A)
Approximately 2 µg of adult DNA was digested with the restriction
enzyme indicated, and the DNA was separated on a 1% agarose gel,
transferred to nitrocellulose paper, and hybridized with a
32P-labeled probe. The probe was the R2Bm 3' UTR generated
from clone pBmR2-249 and labelled by random priming (25).
DNA size markers are shown at the left. (B) Diagram of the D. melanogaster rDNA repeat, with the location of the
EcoRI (E), ClaI (C), and HindIII
(H) restriction sites as shown. Lines 10R36 and HR4-1 gave rise to
restriction fragments consistent with the insertion of an 800-bp
fragment of R2Bm into a typical rDNA unit. Line 10R21 gave rise to
restriction fragments indicating that a large deletion of the rDNA unit
had accompanied the insertion (stippled box below the diagram).
|
|
R2Bm insertions outside the 28S genes were not recovered.
While the endogenous insertion of R2 elements is extremely specific,
several R2 elements in the B. mori genome have been found outside the rDNA units (38). In most instances these
insertions have found a target site with sequence identity to the 28S
target site. The sequences of these non-rDNA R2 insertions revealed
numerous substitutions and deletions compared to the uniform population of R2 elements in the rDNA loci, indicating that they represent infrequent events that slowly accumulate mutations over time. Because
the high concentrations of R2 RNP complexes in our injection experiments might result in a lower specificity for the 28S gene insertion site, we attempted to identify R2Bm integration events that
had occurred outside the rDNA units. The insertion of elements outside
the rDNA locus will only be a complication if they occurred in lineages
used for future transcription and retrotransposition analyses; thus, we
conducted this search in the flies used to isolate the germ line events
described in the previous section. The G1 progeny of the 56 G0 females were screened with PCR primers that were both
complementary to the injected sequences. Positive products were
obtained only in those lines already known to contain insertions within
the rDNA locus. Southern blots of these lines revealed only single
insertions in the rDNA loci (see Fig. 3).
We conclude that the R2 injections give rise to the frequent
integration of R2Bm sequences into the 28S genes of
D. melanogaster,
with few (if any) insertions occurring outside the
rDNA units.
As will be shown below, the insertion mechanism used in
these
germ line events appears to be similar to that used in the
somatic
events scored in the G
0 flies. Because of the
greater difficulty
in isolating large numbers of germ line events, most
of integrations
characterized in this report are somatic events
isolated from
the adult tissues of the injected
animals.
3' junctions of the integrated R2Bm elements.
To determine the
exact location of the R2Bm insertions within the 28S gene we separately
cloned and sequenced the amplified DNA corresponding to the 3' junction
of R2Bm elements from individual flies. A total of 39 3' junctions
obtained from four different injections of HR4 RNA are shown in Fig.
4A. Most junctions were of somatic R2Bm
insertion events into previously uninserted rDNA units; however,
insertions into rDNA units already containing an R1Dm insertion (*), as
well as several germ line insertion events (+), are also included. R2
elements in all species insert into the identical site in the 28S gene
(3), which corresponds to the initial nick observed in the
in vitro TPRT reaction (27). The 28S sequence immediately
downstream of this site is TAGCCAA. As shown in Fig.
4A, all R2Bm insertions obtained from our injections occurred
within 2 bp of this site. Twelve insertions were precisely at the R2
site, six occurred 2 bp upstream of the R2 site, and twenty-one
insertions occurred 2 bp downstream of this site. It should be noted
that because of the ambiguity in the origin of an A nucleotide, these
latter insertions could also be interpreted as 1 bp downstream of the
R2 site.

View larger version (51K):
[in this window]
[in a new window]
|
FIG. 4.
R2Bm 3' junctions obtained with the various HR4 RNA
templates. (A) Junctions obtained with HR4 RNA. (B) Junctions obtained
when the HR4 RNA contained downstream 28S sequences (HR4/10R). (C)
Junctions obtained when the HR4 RNA was missing the last three
nucleotides (HR4/1A). Shown at the top of each panel is the 3' end of
the RNA used in the injection, with the dashed vertical line indicating
the normal border between R2 and 28S gene sequences. At the bottom of
each panel are the junction sequences derived from randomly selected
animals which gave rise to PCR products. 28S gene sequences are shown
to the right of the solid vertical line, R2Bm sequences are shown to
the left of the vertical line, with shaded nucleotides representing
sequences not predicted on the basis of simple reverse transcription of
the RNA. Three junctions in panel C are derived from initiations within
the HR4 R2 template. The number of animals with each junction are shown
at the right. *, R2Bm elements inserted into rDNA units already
containing an R1 element; +, R2Bm junctions obtained from germ line
events.
|
|
All 39 sequenced integration events indicated that reverse
transcription initiated near the 3' end of the HR4 RNA. The HR4
RNA
used in the injection experiments was designed to end with
the sequence
GAAAA. However, the T7 RNA polymerase used to generate
this RNA by
runoff transcription frequently adds an additional
A; thus, about half
of the RNA molecules end in GAAAAA (
27).
As shown
in Fig.
4A, most of the integrations end with four or
five As,
suggesting that TPRT began at the 3'-terminal nucleotide
of the HR4
template. In the remaining insertions, reverse transcription
either
started 1 to 3 bp internal to the 3' end or contained additional
A
nucleotides not present at the 3' end of the HR4 RNA template.
The
sequence variation at these 3' junctions is highly similar
to that seen
in our in vitro TPRT reactions (
27). The additional
nucleotides are believed to be a result of nontemplated additions
by
the R2 RT to the target site before the enzyme fully engages
the HR4
RNA
template.
Our previous in vitro data indicated that the efficiency and accuracy
of the TPRT reaction is significantly affected by changes
at the
extreme 3' end of the RNA template. Addition of 28S rRNA
sequences
downstream of the R2 sequences reduced the efficiency
of the reaction
but increased the accuracy with which the reverse
transcriptase can
initiate synthesis of the cDNA at the precise
R2-28S gene junction
(
26). Deletion of nucleotides from the
end of the HR4
template also reduced the efficiency of the TPRT
reaction, and nearly
all products contained nontemplated nucleotides
at their 3' junctions.
To determine whether the injection assay
would give analogous results
in vivo, two RNA templates were tested.
One template contained 10 nt of
downstream 28S gene sequences
(HR4/10R), while the second lacked the
last three nucleotides
of the HR4 RNA (HR4/1A). As shown in Table
1,
the efficiency
at which we were able to recover R2 integrations by
using these
RNAs (experiments 5 and 6) was similar to that obtained
with HR4
RNA. The 3' junction sequences obtained from these injections
are shown in Fig.
4B and C. The sequences at these junctions were
the
same as those seen in our in vitro assay (see reference
26;
Fig.
3). In the case of the HR4/+10R construct,
20 of 23 sequenced
events contained a precise 3' junction. In the case
of the HR4/1A
construct, almost all integrated products contained
nontemplated
additions varying from 1 to 10 nt in length. These
nontemplated
nucleotides were usually homopolymeric runs of As or Ts.
Three
clones derived from the HR4/1A RNA had initiated reverse
transcription
from a location within the HR4 template. These junctions
are very
similar to those seen in our in vitro assays with HR4/1A (see
reference
25; Fig.
4).
We conclude that the initiation of the TPRT reactions in our embryo
injections were similar to those we have scored in vitro
by using the
same RNA templates. In both assays, the addition
of downstream 28S gene
sequences increased the accuracy of the
TPRT reaction, while
nontemplated nucleotides were added if the
R2 RNA template was deleted
at its 3' end. The major difference
between the in vitro and in vivo
assays was that in the in vivo
assays cleavage of the target DNA
frequently occurred 2 bp upstream
and 2 (or 1) bp downstream of the
target site defined by endogenous
R2 elements in arthropods
(
3). We have not detected significant
cleavage at these
upstream or downstream sites in
vitro.
5' junctions of the integrated R2Bm elements.
In the TPRT
reactions characterized in vitro we did not observe attachment of the
synthesized cDNA to the DNA upstream of the target site
(27). Our embryo injection system, on the other hand,
requires that the integrated products survive until assayed either 10 days later in adults or weeks later in the progeny of these adults.
Therefore, some means of attaching the 5' end of the R2Bm sequence to
the upstream target sequences must have occurred or the chromosome
would have been lost. We cloned the 5' junctions of the R2Bm elements
by using the nested PCR primers shown in Fig. 2. Because R1 insertions
are located downstream of the R2 site, all R2Bm insertions could be
assayed with the same set of upstream 28S gene primers.
Analysis of the 5' junctions from one of our initial injections with
uncapped HR4 RNA (Table
1, experiment 1) revealed only
PCR products
approximately 300 bp in length, which was much shorter
than the 800-bp
fragment expected for the integration of a complete
HR4 reverse
transcript (Fig.
2). A summary of the sequence of
four of these short
PCR products is shown in Fig.
5A. The
amplified
DNAs correspond to insertions of R2Bm sequences containing
only
the 3' UTR of the element. Because our in vitro results suggested
that, once initiated, the R2 reverse transcriptase rapidly extends
to
the end of the RNA template, we assumed that the injected HR4
RNA had
been largely degraded by cellular nucleases with only
the 3' UTR
protected by the bound R2 protein (
25). To better
stabilize
the injected RNA within the embryo, we synthesized HR4
RNA containing
5' methyl-G caps (Table
1, experiment 2). The
majority of the
5'-junction PCR products obtained with this capped
RNA were also
approximately 300 bp in length. However, in about
30% of the animals
with R2Bm insertions, PCR fragments were seen
that varied from 600 to
800 bp. The sequences of the longer PCR
products from 11 such animals,
as well as shorter PCR products
sometimes found in these same animals,
are shown in Fig.
5B. The
5' ends of three germ line events are also
shown in Fig.
5B. It
should be noted that, while we have often
sequenced both the 5'
and 3' junctions of R2Bm insertions from the same
fly, because
multiple insertions can occur in these flies, only in the
case
of the germ line events do we know which junctions represent the
ends of the same insertion.

View larger version (40K):
[in this window]
[in a new window]
|
FIG. 5.
R2Bm 5' junctions obtained with the HR4 RNA
injections. (A) Junctions obtained with uncapped RNA. (B) Junctions
obtained with capped RNA. Almost all junctions contained
deletions of both the 28S and R2Bm sequences. Boxes to the left of the
vertical lines represent the 28S gene deletions, with the precise
length of each deletion indicated by the numbers to the right of each
box. Boxes to the right of the vertical lines are representative of the
length of the R2Bm insertions, with shaded areas representing the 3'
UTR. The exact length of the R2Bm deletions are indicated by the
numbers within or to the left of these boxes. Extra nucleotides at the
junction that were not derived from either the 28S gene or HR4 RNA
are given between the vertical lines. +, junctions obtained from
three germ line events.
|
|
Only 2 of the 19 R2Bm insertions shown in Fig.
5 represented events
resulting from the integration of a complete copy of the
HR4 RNA
template at the target site. The remaining insertions
contained either
short (2- to 142-bp) or long (422- to 550-bp)
deletions of the HR4
sequences. 5' truncations are common in R2
elements of certain species
and in general represent one of the
most diagnostic features of non-LTR
retrotransposable elements.
However, the 5' junctions generated by our
injection experiments
were not like those seen in endogenous R2
elements found in either
B. mori or
D. melanogaster (
3,
10). In particular, the R2Bm
integrations derived from the injections contained large deletions
of
the upstream 28S gene sequences. Deletions of the 28S gene
are detected
at the 5' end of endogenous R2 elements, but these
deletions are
usually only a few base pairs in length and seldom
extend to more than
30 bp (
3,
10). Indeed, the sample of
28S deletions presented
in Fig.
5 represents an underestimate
of the deletions associated with
R2Bm insertions, since approximately
one-fourth of the injected animals
that contained an R2Bm insertion
based on the 3' junction PCR assays
did not give rise to an amplifiable
5' junction. In these instances, it
is likely that the 28S deletion
that accompanied the insertion included
the region 270 bp upstream
of the insertion site that anneals to the
PCR primer. The deletions
of upstream sequences accompanying the germ
line events were similar
to the somatic events shown in Fig.
5B (three
sequences indicated
with a "+"). In the cases of the other eight
germ line events,
PCR assays indicated that five contained deletions of
at least
80 bp and three had deletions more than 270 bp, with the
largest
deletion including almost an entire rDNA unit (see Fig.
3).
We suggest that, based on these results the R2 protein in our injection
assay is incapable of attaching the end of the HR4
sequence to the
upstream target DNA. The junctions we observed
presumably resulted from
the DNA repair processes having ligated
the free ends of the broken
chromosome. We suggest the long (420-
to 575-bp) deletions of R2Bm
sequences result from degradation
of the injected RNA, while the
shorter (2- to 150-bp) deletions
of R2Bm sequences occur after reverse
transcription just prior
to ligation of the two chromosomal
ends.
Injections with a mini-R2Bm-28S cotranscript.
RNA transcripts
of R2 elements are rare and represent cotranscription of the 28S genes
(21). Consistent with a cotranscription model, we have been
unable to identify a promoter at the 5' end of R2 elements
(11). We therefore generated a pBluescript construct which
enabled us to synthesize a model 28S-R2 cotranscript for injection into
embryos. A full-length cotranscript would be nearly 8,000 nt in length
and unlikely to fold into its proper conformation in vitro. Therefore,
we generated a mini-R2Bm-28S construct (Fig. 6A) containing 480 nt of the upstream 28S
gene, the 450-nt 5' UTR and 250-nt 3' UTR of the R2Bm element, and 80 nt of the downstream 28S gene. The flanking 28S gene sequences in this
construct comprise domain V of the 28S rRNA (14). Based on
the predicted secondary structure of this domain, the 28S sequences
downstream of the R2 insertion site should base pair with sequences
near the 5' end of the transcript. It should be noted that these
flanking 28S sequences were derived from B. mori. The level
of nucleotide identity between the 28S genes of D. melanogaster and B. mori exceeds 95% in the region
immediately upstream and downstream of the R2 insertion site, as well
as the extreme 5' end of domain 5. However, the region from 170 to 400 nt upstream of the R2 site corresponds to the expansion region of
domain V, where the B. mori and D. melanogaster
28S sequences exhibit little sequence similarity. The PCR primers used
to amplify the 5' R2Bm junctions generated in this injection experiment
are either outside of domain V (first primer) or within the expansion
domain (second primer) and thus would only anneal to the D. melanogaster 28S gene (see Fig. 2).

View larger version (49K):
[in this window]
[in a new window]
|
FIG. 6.
Full-length R2Bm 5' junctions obtained with the
mini-R2Bm-28S RNA injections. (A) Diagram of the injected
mini-R2Bm-28S RNA. The RNA contains that portion of the B. mori 28S sequence corresponding to domain V (14), as
well as the entire 5' and 3' UTRs of an R2Bm element. Domain V of the
B. mori and D. melanogaster 28S genes are highly
similar in sequence except for the expansion domain ending 170 bp
upstream of the R2 insertion site (dotted region). (B) Diagram of the
full-length R2Bm junctions obtained with the mini-R2Bm-28S RNA. Only
the 170-bp region immediately upstream of the R2 insertion site is
diagrammed, with the nucleotide differences between the B. mori and D. melanogaster sequences indicated. For each
R2 element the origin of the 28S sequences upstream of the insertion is
indicated as being derived from B. mori (RNA template, open
box) or D. melanogaster (DNA target, shaded box). One of the
insertions contained a 23-bp tandem duplication of the 28S sequences
immediately upstream of the insertion site. The only other variation at
the 5' junction was a 1-bp deletion of the 28S gene in one insertion
(solid box, three lines from the bottom). The number of animals
containing each insertion type is indicated by the numbers on the
right.
|
|
Of the 324 flies injected with the 5'-capped mini-R2Bm-28S RNA and
tested by PCR for R2Bm 5' junctions, 108 were scored as
positive. Of
these 108 flies, 20 gave rise to PCR products that
were the size
predicted for a precise integration of a full-length
mini-R2 element
into the target site. The sequence of the 17 insertions
recovered by
cloning are summarized in Fig.
6B. With two exceptions,
these sequences
revealed precise 5' junctions with neither deletions
in the R2Bm or 28S
gene sequences. Of the two exceptions, one
contained a 23-bp
duplication of the upstream 28S gene (discussed
below), and the second
contained a 1-bp deletion common in endogenous
insertions of R2Bm
(
3). The 170-bp region of the 28S gene immediately
upstream
of the R2 insertion site contains eight nucleotide differences
between
B. mori and
D. melanogaster. These eight
differences enabled
us to determine whether the 28S sequences upstream
of these R2Bm
insertions corresponded to the
D. melanogaster
28S target sequences
or were derived by reverse transcription from the
B. mori template
RNA. Seven of the 17 full-length insertions
contained upstream
28S gene sequences corresponding solely to those
derived from
the
D. melanogaster target site (shaded bar).
The remaining 10
clones contained various levels of upstream
B. mori sequences
introduced via the injected RNA.
Thus, some means of recombination
between the
B. mori and
D. melanogaster 28S gene sequences appears
to be responsible
for the attachment of the 5' end of the R2Bm
insertions.
Minimum sequences required for precise integrations: the
micro-R2Bm-28S construct.
If the attachment of the 5' end of the
R2Bm sequences to the upstream DNA target involves simple recombination
between the homologous sequences present at the target site and on the
synthesized cDNA, then this attachment may not require that the RNA
template mimic a complete 28S-R2 cotranscript. The minimum sequences
required of the injected RNA to support insertion could include only a short region of the upstream 28S sequences at the 5' end to stimulate recombination and the 3' UTR of the R2 element to bind the R2 protein
and initiate TPRT. To test this possibility the micro-R2Bm-28S construct shown in Fig. 7A was generated.
This construct contained only the 170-nt 28S upstream region that is
similar between B. mori and D. melanogaster, 50 nt of the R2Bm 5' UTR, and the 3' UTR sequences.

View larger version (51K):
[in this window]
[in a new window]
|
FIG. 7.
Full-length R2Bm 5' junctions obtained with the
micro-R2Bm-28S RNA injections. (A) Diagram of the injected
micro-R2Bm-28S RNA. The RNA contains 170 bp of upstream B. mori 28S genes sequences, 50 nt from the 5' end of the R2 5' UTR,
and the 250-nt 3' UTR. (B) Diagram of the full-length R2Bm junctions
obtained with the micro-R2Bm-28S RNA. As in Fig. 6 the 170-bp region
upstream of the R2 insertion site is shown, with the nucleotide
differences between the B. mori and D. melanogaster sequences indicated. An additional G residue
difference was introduced via PCR during the generation of the
micro-R2Bm-28S clone. For each R2 element the origin of the 28S
sequences upstream of the insertion is indicated as being derived from
B. mori (RNA template, open box) or D. melanogaster (DNA target, shaded box). One of the 5' junctions
contained 25 bp of 28S downstream sequences at the insertion site,
presumably resulting in a 25-bp target site duplication. The number of
animals containing each insertion type is indicated by the numbers on
the right.
|
|
The results from the injection of the capped micro-R2Bm-28S construct
were similar in many respects to that seen with the
mini-R2Bm-28S
construct. Of the 272 flies tested by PCR for the
presence of a 5'
junction, 70 were scored as positive for a somatic
integration event.
This 28% recovery of flies with insertions
(and the 33% recovery with
the mini-R2Bm-28S RNA in the preceding
section) is somewhat lower than
that observed for the HR4 constructs
shown in Table
1. It is not known
whether this reduced recovery
is due to minor variations in the
activity of the isolated protein
or the greater difficulty of the R2
protein transporting an RNA
containing 28S gene sequences back into the
nucleolus of a cell.
In spite of this lower total level of insertion,
the fraction
of full-length insertions was much higher. Remarkably, of
the
70 integration events scored, 58 appeared to be full-length events.
The near absence of R2Bm insertions containing large 5' truncations
suggestive of RNA degradation would also suggest that the
micro-R2Bm-28S
RNA was more stable within the injected embryo. This
greater resistance
to degradation may be the result of the binding of
cellular protein
(e.g., ribosomal protein), the secondary structure of
the upstream
28S sequences, or the binding of this RNA by the R2
protein
itself.
The PCR products from 24 individual flies scored as having a
full-length insert were cloned and sequenced. As shown in Fig.
7B the
sequences of these junctions revealed that all but one
were precise.
The single junction that was not precise contained
a duplication of
downstream 28S gene sequences at the 5' end of
the insertion. This
insertion contained in effect a 25-bp target
site duplication. There
were two differences between the junctions
generated by the
mini-R2Bm-28S construct (Fig.
6) and those generated
by the
micro-R2Bm-28S construct (Fig.
7). First, in the micro-R2Bm-28S
injection the 170-bp upstream region in 8 of the 24 junctions
contained
the entire or nearly entire sequence derived from the
injected RNA. In
the case of the mini-R2Bm-28S construct, only
two of the insertions
contained this entire region from the injected
RNA. Second, two clones
had short regions of the
B. mori 28S sequence
embedded
within the target 28S gene sequences from the
D. melanogaster target site. While the number of cases is low, these
differences
suggest a slightly different mechanism for the
recombination that
attaches the 5' end of the cDNA resulting from these
RNAs to the
target
site.
 |
DISCUSSION |
We have shown that RNP complexes derived from the R2 element of
B. mori are capable of inserting into their 28S rRNA gene target site when injected into Drosophila preblastoderm
embryos. The efficiency of these integrations was such that 28 to 63%
of the surviving animals contained R2Bm insertions in their adult tissues. The high efficiency of these R2 integrations was also evident
in the significant fraction of flies recovered with more than one
somatic insertion. The remarkable specificity of endogenous R2
retrotransposition events for the 28S gene was duplicated in these
injection experiments since no evidence was found for R2Bm insertions
outside the 28S target sites.
Selection and cleavage of the target site.
Is the ability of
the R2 protein to recognize the DNA sequences of the target site the
only factor responsible for this insertion specificity? When incubated
in large excess (~100-fold), purified R2 protein is capable of
finding the 28S target in purified genomic DNA (39). While
similar molar excesses of protein to target site are probably present
in our injection experiments, it is not known whether the chromosomal
protein bound to genomic DNA within a cell helps or hinders the DNA
recognition process. Secondly, the long history of R2 insertion into
arthropod rDNA units (2) suggests that they may have
evolved, or acquired, a nucleolar localization signal. Such signals
have been found for both proteins and small RNAs that must enter the
nucleolus (22, 36). The presence of a nucleolar localization
signal can be tested in future injection experiments by introducing the
28S gene target site outside the nucleolus and monitoring R2Bm
insertions at these sites relative to the endogenous 28S target sites
within the nucleolus. The effects of chromatin proteins on the ability
of the R2 endonuclease to find the target site can also be assayed in
vitro by comparing the ability of purified or assembled chromatin to
serve as a target site relative to that of naked DNA.
The 3' junctions of the R2Bm insertions resulting from the injections
of various RNA templates (Fig.
4) indicate that the
initial steps in
the in vivo R2 integration reaction are similar
to those defined for
the in vitro TPRT reaction. If the 3' end
of the injected RNA template
terminates at the junction of the
R2 element with the 28S gene (HR4),
then reverse transcription
usually initiates at the 3'-terminal
nucleotide of the RNA (
27).
However, both initiation at
internal nucleotides and the addition
of nontemplated nucleotides are
sometimes found. If the injected
RNA template contains downstream 28S
gene sequences at its 3'
end (HR4/10R), then the reverse transcriptase
initiates synthesis
at the precise junction between the R2 element and
28S gene sequences
(
26). Finally, if the injected RNA
template has the last few
nucleotides of the R2 element deleted
(HR4/1A), then reverse transcriptase
adds a series of nontemplated
nucleotides to the target site before
engaging the RNA template
(
25).
The only qualitative difference detected between the initial steps of
the in vitro and in vivo TPRT reactions was the surprising
finding
that, in vivo, cleavage of the DNA strand used to prime
reverse
transcription was sometimes 2 bp upstream or 1 to 2 bp
downstream of
the site of cleavage seen in vitro. (This was not
due to a sequence
difference between the 28S genes of
B. mori and
D. melanogaster since these sequences are identical in the
50-bp
region surrounding the target site.) We have sequenced the
3' junctions
of nearly 200 R2 elements from a variety of arthropods
and have found
that all but one corresponded to the precise location
of the initial
cleavage site as determined in our in vitro experiments
(
3,
7,
23). The single exception was found in
D. sechellia,
where an insertion was 2 bp upstream of the normal site (
7).
The most likely explanation for the variation in cleavage site
generated by the injected RNP is the chromatin structure of the
DNA
target site within the embryo. It is possible that when endogenous
R2
elements retrotranspose, the rDNA units are in a more accessible
(transcriptionally active) conformation. The injections are conducted
prior to cellularization of the embryo, when the rDNA units are
not
being transcribed (
28). We can approach the question of
the
effects of chromatin structure on the selection of the cleavage
site by
again using isolated chromatin, or short DNA molecules
with positioned
nucleosomes, as targets for our in vitro TPRT
reaction.
Attachment of the R2 5' end.
While the steps involved in the
integration of the 3' end of R2 have been elucidated, little is known
about the means by which the 5' end of non-LTR elements is attached to
the DNA target during retrotransposition. The only clear characteristic
of these insertions is that the elements are frequently 5' truncated.
Such 5' truncations suggest either that attachment occurs before the
reverse transcriptase has reached the 5' end of the RNA template or
that reverse transcription can occur from incomplete RNA transcripts.
The uniformity of the 5' junctions of endogenous R2 elements varies
considerably in different species. In species such as B. mori, most junctions are identical. In Drosophila
species the R2 5' junctions frequently contain short deletions of the
28S gene and/or the insertion of nontemplated sequences (3,
10). 5'-truncated R2 elements have junctions similar to those of
full-length elements, suggesting that no specific 5' sequences are
required for integration (10). Therefore, the initial R2 RNA
templates we tested in our injection assay contained only sequences
corresponding to the 3' end of the R2 element, i.e., those sequences
required for RNA binding and the initiation of TPRT. Unfortunately, the
large 28S gene deletions associated with these RNAs were not
characteristic of the endogenous R2 elements of D. melanogaster or of B. mori. We suggest that these 5'
junctions were generated by a cellular repair process which reseals the
cleaved chromosome. Extensive studies have been conducted of the
cellular responses to chromosome breakage induced by rare cutting
endonucleases (reviewed in references 13 and
18). In a variety of organisms, including D. melanogaster (1), the cut appears to be enlarged and
repaired by direct end joining. The 5' ends of the R2Bm insertions
resulting from the HR4 RNA injections appear to be similar to those
reported in these studies.
Because R2 elements are believed to be expressed as cotranscripts with
the 28S gene, we also tested R2 RNA templates that
contained flanking
28S gene sequences. A construct that might
mimic an R2 transcript
embedded in the 28S rRNA sequence, mini-R2Bm-28S
RNA (Fig.
6),
resulted in about 20% of the insertions with precise
5' junctions.
Based on the nucleotide differences between the
D. melanogaster 28S gene and the
B. mori 28S sequences on
the
RNA template, it was shown that in these precise junctions a
variable
length of the upstream 28S gene sequences was derived from the
RNA template rather than the target DNA sequences (Fig.
6). One
possible model to explain this "coconversion" is diagrammed in
Fig.
8A. Homologous recombination is
postulated to occur between
the newly made cDNA and the target DNA
sequences upstream of the
insertion site. For simplicity, the
recombination in Fig.
8A is
shown occurring on the target DNA after
second-strand cleavage.
Alternatively this recombination could occur at
the chromosomal
nick before second-strand cleavage (see Fig.
1).
Because the 28S
sequences in the mini-R2Bm-28S RNA included an
expansion segment
that is not similar between
B. mori and
D. melanogaster, only
the 170-bp region immediately upstream
of the cleavage site can
anneal to the cDNA. DNA repair of this
annealed complex, which
can undergo branch migration over this 170-bp
region, would explain
the coconversion-like gradient seen.

View larger version (23K):
[in this window]
[in a new window]
|
FIG. 8.
Possible recombination models to explain the 5'
junctions generated by the mini- and micro-R2Bm-28S RNAs. (A) Reverse
transcription proceeds beyond the 170-bp region of similarity between
the RNA and the DNA target in the mini-R2Bm-28S RNA. The cDNA strand
anneals to the upper strand of the target DNA. Branch migration within
the 170-bp region of identity accounts for the gradient of B. mori sequences accompanying the insertions seen in Fig. 6. (B)
Reverse transcription proceeds to the end of the micro-R2Bm-28S RNA.
The cDNA strand displaces the lower DNA strand of the target site,
accounting for the greater proportion of insertions with the entire
170-bp derived from B. mori (Fig. 7). DNA repair can use
either the upper or lower strand to repair the mismatched bases,
accounting for the patches seen in some insertions.
|
|
A more minimal RNA construct was next tested containing only the R2 3'
UTR and the 170 bp of upstream 28S gene sequences,
which are similar
between
B. mori and
D. melanogaster (Fig.
7).
This micro-R2Bm-28S RNA resulted in more than 80% precise 5'
junctions,
suggesting it is a better template for these injection
experiments.
As shown in Fig.
8B, the higher frequency of precise
junctions
and a greater percentage of these junctions that contained
the
entire upstream region derived from
B. mori can be
explained if
the newly made cDNA strand from this RNA template more
stably
displaces the lower DNA strand of the upstream target site. The
patchwork pattern of
D. melanogaster and
B. mori
28S sequences
seen in two of the integrations (Fig.
7B) clearly adds
support
to a heteroduplex intermediate between the cDNA and the
upstream
DNA target site. It is not clear whether the R2 protein
actively
contributes to the formation of such a
heteroduplex.
Do endogenous R2 retrotransposition events also use homologous
recombination to attach the 5' end of the element to the target
site?
Homologous recombination like that shown in Fig.
8A or B
would lead to
5' sequence uniformity; thus, it can explain those
species such as
B. mori with uniform 5' junctions (see reference
3 for various examples). What about species with R2
elements
containing variable 5' junctions? Based on an analysis of R2
elements
in those species with extensive 5' variation, we have
postulated
a template jump of the reverse transcriptase from the R2 RNA
template
onto the 28S DNA at the cleavage site (
3). Such
template jumps
can explain short deletions of the DNA target and
insertion of
nontemplated sequences at the precise R2-28S boundary. A
template
jump model also explains two other types of sequence
duplications
found at the 5' junctions of R2 elements. If the template
jump
to the cleaved DNA target site occurs a short distance after the
reverse transcriptase has passed the R2-28S junction of the RNA
template, then a tandem duplication of the 28S gene is generated.
We
have detected such tandem duplications in several arthropod
species,
including 24- to 26-bp duplications in ca. 10% of the
R2 elements in
B. mori (
3). The second type of sequence
variation
found associated with R2 insertions in certain species is
target
site duplications. To explain these duplications, we suggest
that
cleavage of the upper DNA strand can sometimes occur downstream
of
the lower-strand cleavage (instead of the normal location 2
bp upstream
of the lower-strand site). A template switch onto
a target site in
which the upper band is cleaved downstream of
the insertion site would
result in a target site duplication (
3).
While the 5' junctions generated with the mini- and micro-R2Bm-28S
RNAs were extremely uniform, we did see examples of a 1-bp
deletion
(Fig.
6), a 23-bp tandem duplication of the 28S gene
(Fig.
6),
and a 25-bp target site duplication (Fig.
7). Whether
this
variation results from a template switch or another more
complicated recombination-repair mechanism is not
known.
We are hopeful that the R2 integration system developed in this report
will enable us to study other aspects of the retrotransposition
mechanism used by R2 and that it will eventually allow us to introduce
engineered (activated) R2 elements into the rDNA loci of
D. melanogaster and potentially other arthropod species. A major
issue in these
studies is whether the inserted R2 element will be
transcribed.
Northern blots of RNA isolated from our germ line events
have
indicated that, like endogenous R2 (and R1) insertions, R2Bm
sequences
inserted into the 28S gene of
D. melanogaster are
not transcribed
(D. Eickbush, unpublished data). However, to maintain
their numbers
in a species R2 elements need only be transcribed for
short periods
in the development of ovaries or testis and not even at
each generation.
We now should have the tools necessary to directly
address these
transcription
questions.
 |
ACKNOWLEDGMENTS |
This work was supported by National Institutes of Health grant
GM42790 to T.H.E.
We thank Robert Fleming for sharing his expertise on injections and fly
development, Yi Gu for showing us how to conduct the injections, and
especially Janet George and Karen Gentile for help in aligning embryos.
We thank William Burke, Janet George, Cesar Perez-Gonzalez, Harmit
Malik, and Jin Yang for comments and encouragement.
 |
FOOTNOTES |
*
Corresponding author. Mailing address: Department of
Biology, University of Rochester, Rochester, NY 14627. Phone: (716)
275-7274. Fax: (716) 275-2070. E-mail:
eick{at}uhura.cc.rochester.edu.
Present address: Institute of Molecular Biology, University of Hong
Kong, Pokfulam, Hong Kong.
 |
REFERENCES |
| 1.
|
Bellaiche, Y.,
V. Mogila, and N. Perrimon.
1999.
I-SceI endonuclease, a new tool for studying DNA double-strand break repair mechanism in Drosophila.
Genetics
152:1037-1044[Abstract/Free Full Text].
|
| 2.
|
Burke, W. D.,
H. S. Malik,
W. C. Lathe, and T. H. Eickbush.
1998.
Are retrotransposons long term hitchhikers?
Nature
239:141-142.
|
| 3.
|
Burke, W. D.,
H. S. Malik,
J. P. Jones, and T. H. Eickbush.
1999.
The domain structure and retrotransposition mechanism of R2 elements are conserved throughout arthropods.
Mol. Biol. Evol.
16:502-511[Abstract].
|
| 4.
|
Burke, W. D.,
F. Müller, and T. H. Eickbush.
1995.
R4, a non-LTR retrotransposon specific to the large subunit rRNA gene of nematodes.
Nucleic Acids Res.
23:4628-4634[Abstract/Free Full Text].
|
| 5.
|
Cousineau, B.,
D. Smith,
S. Lawrence-Cavanagh,
J. E. Mueller,
J. Yang,
D. Mills,
D. Manias,
G. Dunny,
A. M. Lambowitz, and M. Belfort.
1998.
Retrohoming of a bacterial group II intron: mobility via complete reverse splicing, independent of homologous DNA recombination.
Cell
94:451-462[CrossRef][Medline].
|
| 6.
|
Dhellin, O.,
J. Maestre, and T. Heidmann.
1997.
Functional differences between the human LINE retrotransposon and retroviral reverse transcriptases for in vivo mRNA reverse transcription.
EMBO J.
16:6590-6602[CrossRef][Medline].
|
| 7.
|
Eickbush, D. G., and T. H. Eickbush.
1995.
Vertical transmission of the retrotransposable elements R1 and R2 during the evolution of the Drosophila melanogaster species subgroup.
Genetics
139:671-684[Abstract].
|
| 8.
|
Feng, Q.,
J. V. Moran,
H. H. Kazazian, and J. D. Boeke.
1996.
Human L1 retrotransposon encodes a conserved endonuclease required for retrotransposition.
Cell
87:905-916[CrossRef][Medline].
|
| 9.
|
Garza, D.,
M. Medhora,
A. Koga, and D. L. Hartl.
1991.
Introduction of the transposable element mariner into the germline of Drosophila melanogaster.
Genetics
128:303-310[Abstract].
|
| 10.
|
George, J. A.,
W. D. Burke, and T. H. Eickbush.
1996.
Analysis of the 5' junctions of R2 insertions with the 28S gene: implications for non-LTR retrotransposition.
Genetics
142:853-863[Abstract].
|
| 11.
|
George, J. A., and T. H. Eickbush.
1999.
Conserved features at the 5' end of Drosophila R2 retrotransposable elements: implications for transcription and translation.
Insect Mol. Biol.
8:3-10[CrossRef][Medline].
|
| 12.
|
Gloor, G. B., and W. R. Engels.
1992.
Single fly DNA preps for PCR.
Drosophila Inform. Service
71:148-149.
|
| 13.
|
Haber, J. E.
1995.
In vivo biochemistry: physical monitoring of recombination induced by site-specific endonucleases.
Bioessays
17:609-620[CrossRef][Medline].
|
| 14.
|
Hancock, J. M.,
D. Tautz, and G. A. Dover.
1988.
Evolution of the secondary structures and compensatory mutations of the ribosomal RNAs of Drosophila melanogaster.
Mol. Biol. Evol.
5:393-414[Abstract].
|
| 15.
|
Hazelrigg, T.,
R. Levis, and G. M. Rubin.
1984.
Transformation of white locus DNA in Drosophila: dosage compensation, zeste interaction, and position effects.
Cell
36:469-481[CrossRef][Medline].
|
| 16.
|
Jakubczak, J. L.,
Y. Xiong, and T. H. Eickbush.
1990.
Type I (R1) and Type II (R2) ribosomal DNA insertions of Drosophila melanogaster are retrotransposable elements closely related to those of Bombyx mori.
J. Mol. Biol.
212:37-52[CrossRef][Medline].
|
| 17.
|
Jakubczak, J. L.,
M. K. Zenni,
R. C. Woodruff, and T. H. Eickbush.
1992.
Turnover of R1 (Type I) and R2 (Type II) retrotransposable elements in the ribosomal DNA of Drosophila melanogaster.
Genetics
131:129-142[Abstract].
|
| 18.
|
Jasin, M.
1996.
Genetic manipulation of genomes with rare-cutting endonucleases.
Trends. Genet.
12:224-228[CrossRef][Medline].
|
| 19.
|
Jurka, J.
1997.
Sequence patterns indicate an enzymatic involvement in integration of mammalian retroposons.
Proc. Natl. Acad. Sci. USA
94:1872-1877[Abstract/Free Full Text].
|
| 20.
|
Kazazian, H. H., and J. V. Moran.
1998.
The impact of L1 retrotransposition on the human genome.
Nat. Genet.
19:19-24[CrossRef][Medline].
|
| 21.
|
Kidd, S. J., and D. M. Glover.
1981.
Drosophila melanogaster ribosomal DNA containing type II insertions is variably transcribed in different strains and tissues.
J. Mol. Biol.
151:645-662[CrossRef][Medline].
|
| 22.
|
Lange, T. S.,
M. Ezrokhi,
A. V. Borovjagin,
R. Rivera-Leon,
M. T. North, and S. A. Gerbi.
1998.
Nucleolar localization elements of Xenopus laevis U3 small nucleolar RNA.
Mol. Biol. Cell
9:2973-2985[Abstract/Free Full Text].
|
| 23.
|
Lathe, W. C., III, and T. H. Eickbush.
1997.
A single lineage of R2 retrotransposable elements is an active, evolutionarily stable component of the Drosophila rDNA locus.
Mol. Biol. Evol.
14:1232-1241[Abstract].
|
| 24.
|
Loukeris, T. G.,
I. Livadaras,
B. Arca,
S. Zabalou, and C. Savakis.
1995.
Gene transfer into the medfly, Ceratitis capitata, with a Drosophila hydei transposable element.
Science
270:2002-2005[Abstract/Free Full Text].
|
| 25.
|
Luan, D. D., and T. H. Eickbush.
1995.
RNA template requirements for target DNA-primed reverse transcription by the R2 retrotransposable element.
Mol. Cell. Biol.
15:3882-3891[Abstract].
|
| 26.
|
Luan, D. D., and T. H. Eickbush.
1996.
Downstream 28S gene sequences on the RNA template affect the choice of primer and the accuracy of initiation by the R2 reverse transcriptase.
Mol. Cell. Biol.
16:4726-4734[Abstract].
|
| 27.
|
Luan, D. D.,
M. H. Korman,
J. L. Jakubczak, and T. H. Eickbush.
1993.
Reverse transcription of R2Bm RNA is primed by a nick at the chromosomal target site: a mechanism for non-LTR retrotransposition.
Cell
72:595-605[CrossRef][Medline].
|
| 28.
|
McKnight, S. L., and O. L. Miller, Jr.
1976.
Ultrastructural patterns of RNA synthesis during early embryogenesis of Drosophila melanogaster.
Cell
8:305-319[CrossRef][Medline].
|
| 29.
|
Moran, J. V.,
S. E. Holmes,
T. P. Naas,
R. J. DeBerardinis,
J. D. Boeke, and H. H. Kazazian.
1996.
High frequency retrotransposition in cultured mammalian cells.
Cell
87:917-927[CrossRef][Medline].
|
| 30.
|
O'Brochta, D. A.,
W. D. Warren,
K. J. Saville, and P. W. Atkinson.
1996.
Hermes, a functional non-drosophilid insect gene vector from Musca domestica.
Genetics
142:907-914[Abstract].
|
| 31.
|
Ramos, R. G. P.,
B. G. Grimwade,
K. A. Wharton,
T. N. Scottgale, and S. Artavanis-Tsakonis.
1989.
Physical and functional definition of the Drosophila Notch locus by P element transformation.
Genetics
123:337-348[Abstract/Free Full Text].
|
| 32.
|
Rubin, G. M., and A. C. Spradling.
1982.
Genetic transformation of Drosophila with transposable element vectors.
Science
218:348-353[Abstract/Free Full Text].
|
| 33.
|
Smit, A. F. A.
1996.
The origin of interspersed repeats in the human genome.
Curr. Opin. Genet. Dev.
6:743-748[CrossRef][Medline].
|
| 34.
|
Spradling, A. C.
1986.
P element-mediated transformation, p. 175-197.
In
D. B. Roberts (ed.), Drosophila: a practical approach. IRL Press, Oxford, United Kingdom
|
| 35.
|
Spradling, A. C., and G. M. Rubin.
1982.
Transposition of cloned P elements into Drosophila germ line chromosomes.
Science
218:341-347[Abstract/Free Full Text].
|
| 36.
|
Tuteja, R., and N. Tuteja.
1998.
Nucleolin: a multifunctional major nucleolar phosphoprotein.
Crit. Rev. Biochem. Mol. Biol.
33:407-436[CrossRef][Medline].
|
| 37.
|
Xiong, Y., and T. H. Eickbush.
1988.
Functional expression of a sequence-specific endonuclease encoded by the retrotransposon R2Bm.
Cell
55:235-246[CrossRef][Medline].
|
| 38.
|
Xiong, Y.,
W. D. Burke,
J. L. Jakubczak, and T. H. Eickbush.
1988.
Ribosomal DNA insertion elements R1Bm and R2Bm can transpose in a sequence specific manner to locations outside the 28S genes.
Nucleic Acids Res.
16:10561-10573[Abstract/Free Full Text].
|
| 39.
|
Yang, J., and T. H. Eickbush.
1998.
RNA-induced changes in the activity of the endonuclease encoded by the R2 retrotransposable element.
Mol. Cell. Biol.
18:3455-3465[Abstract/Free Full Text].
|
| 40.
|
Yang, J.,
H. S. Malik, and T. H. Eickbush.
1999.
Identification of the endonuclease domain encoded by R2 and other site-specific, non-long terminal repeat retrotransposable elements.
Proc. Natl. Acad. Sci. USA
96:7847-7852[Abstract/Free Full Text].
|
| 41.
|
Zimmerly, S.,
H. Guo,
P. S. Perlman, and A. M. Lambowitz.
1995.
Group II intron mobility occurs by target DNA-primed reverse transcription.
Cell
82:545-554[CrossRef][Medline].
|
Molecular and Cellular Biology, January 2000, p. 213-223, Vol. 20, No. 1
0270-7306/0/$04.00+0
Copyright © 2000, American Society for Microbiology. All rights reserved.
This article has been cited by other articles:
-
Eickbush, D. G., Ye, J., Zhang, X., Burke, W. D., Eickbush, T. H.
(2008). Epigenetic Regulation of Retrotransposons within the Nucleolus of Drosophila. Mol. Cell. Biol.
28: 6452-6461
[Abstract]
[Full Text]
-
Babushok, D. V., Ostertag, E. M., Courtney, C. E., Choi, J. M., Kazazian, H. H. Jr.
(2006). L1 integration in a transgenic mouse model. Genome Res
16: 240-250
[Abstract]
[Full Text]
-
Kojima, K. K., Fujiwara, H.
(2005). Long-Term Inheritance of the 28S rDNA-Specific Retrotransposon R2. Mol Biol Evol
22: 2157-2165
[Abstract]
[Full Text]
-
Anzai, T., Osanai, M., Hamada, M., Fujiwara, H.
(2005). Functional roles of 3'-terminal structures of template RNA during in vivo retrotransposition of non-LTR retrotransposon, R1Bm. Nucleic Acids Res
33: 1993-2002
[Abstract]
[Full Text]
-
RUSCHAK, A. M., MATHEWS, D. H., BIBILLO, A., SPINELLI, S. L., CHILDS, J. L., EICKBUSH, T. H., TURNER, D. H.
(2004). Secondary structure models of the 3' untranslated regions of diverse R2 RNAs. RNA
10: 978-987
[Abstract]
[Full Text]
-
Fujimoto, H., Hirukawa, Y., Tani, H., Matsuura, Y., Hashido, K., Tsuchida, K., Takada, N., Kobayashi, M., Maekawa, H.
(2004). Integration of the 5' end of the retrotransposon, R2Bm, can be complemented by homologous recombination. Nucleic Acids Res
32: 1555-1565
[Abstract]
[Full Text]
-
Kojima, K. K., Fujiwara, H.
(2004). Cross-Genome Screening of Novel Sequence-Specific Non-LTR Retrotransposons: Various Multicopy RNA Genes and Microsatellites Are Selected as Targets. Mol Biol Evol
21: 207-217
[Abstract]
[Full Text]
-
Perez-Gonzalez, C. E., Burke, W. D., Eickbush, T. H.
(2003). R1 and R2 Retrotransposition and Deletion in the rDNA Loci on the X and Y Chromosomes of Drosophila melanogaster. Genetics
165: 675-685
[Abstract]
[Full Text]
-
Eickbush, D. G., Eickbush, T. H.
(2003). Transcription of Endogenous and Exogenous R2 Elements in the rRNA Gene Locus of Drosophila melanogaster. Mol. Cell. Biol.
23: 3825-3836
[Abstract]
[Full Text]
-
Kapitonov, V. V., Jurka, J.
(2003). The Esterase and PHD Domains in CR1-Like Non-LTR Retrotransposons. Mol Biol Evol
20: 38-46
[Abstract]
[Full Text]
-
Pyatkov, K. I., Shostak, N. G., Zelentsova, E. S., Lyozin, G. T., Melekhin, M. I., Finnegan, D. J., Kidwell, M. G., Evgen'ev, M. B.
(2002). Penelope retroelements from Drosophila virilis are active after transformation of Drosophila melanogaster. Proc. Natl. Acad. Sci. USA
99: 16150-16155
[Abstract]
[Full Text]
-
Chambeyron, S., Brun, C., Robin, S., Bucheton, A., Busseau, I.
(2002). Chimeric RNA transposition intermediates of the I factor produce precise retrotransposed copies. Nucleic Acids Res
30: 3387-3394
[Abstract]
[Full Text]
-
Perez-Gonzalez, C. E., Eickbush, T. H.
(2001). Dynamics of R1 and R2 Elements in the rDNA Locus of Drosophila simulans. Genetics
158: 1557-1567
[Abstract]
[Full Text]
-
Chaboissier, M.-C., Finnegan, D., Bucheton, A.
(2000). Retrotransposition of the I factor, a non-long terminal repeat retrotransposon of Drosophila, generates tandem repeats at the 3' end. Nucleic Acids Res
28: 2467-2472
[Abstract]
[Full Text]