Darcy Birse,2,
Yang Hong,1,
and
Victor Ambros1*
Dartmouth Medical School, Department of Genetics, Hanover, New Hampshire 03755,1 Stockholm University, Department of Biochemistry and Biophysics, SE-106 91 Stockholm, Sweden2
Received 4 July 2005/ Returned for modification 8 August 2005/ Accepted 16 September 2005
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
LIN-14 protein is localized to the nuclei of most somatic cells in L1 larvae (50, 59), in an anatomical pattern consistent with the cell types affected by lin-14 mutations. LIN-14 is a novel protein that shows no obvious homology to proteins in nonnematode species (20, 41, 59). The C terminus is sufficient for the nuclear localization of the protein and appears to contain a relatively expanded nuclear targeting signal, unlike classical nuclear localization signals (20). The only other predicted structural motifs in LIN-14 are an amphipathic helix in the C-terminal domain of the protein in exon 11 (59) and an alpha helix in exon 10 (P. Olsen, personal communication). These properties of LIN-14 have suggested that LIN-14 regulates stage-specific gene expression in the nucleus. To gain an understanding of the possible mechanism(s) by which lin-14 helps cells establish their temporal identity, we employed a series of assays using the native LIN-14 protein from isolated nuclei and bacterially expressed LIN-14 to characterize LIN-14 biochemical activities. Here we show that native LIN-14 exists in at least two populations in the nucleus, one of which is associated with chromosomal DNA. We show that a bacterially expressed C-terminal fragment of LIN-14 binds double-stranded DNA and can select sequences containing a putative consensus sequence from a pool of randomized double-stranded oligonucleotides. These results support the conclusion that LIN-14 could function as a DNA binding transcriptional regulator.
Based on lin-14 phenotypes, hypothetical lin-14 transcriptional targets can be broadly divided into two classes: L2-specific genes, which would be normally repressed by lin-14 during the L1 stage, and L1-specific genes, which would be normally activated by lin-14 during L1. To identify candidate LIN-14 targets, we employed DNA microarray hybridization. We compared mRNA abundance of 11,899 C. elegans genes in RNA samples from the L1 and L2 stages of wild-type worms, lin-14 loss-of-function (lf) mutants, and lin-4 loss-of-function mutants. We identified 16 genes with characteristics of potential lin-14 targets; these genes display opposite changes in mRNA abundance in response to reduced lin-14 [lin-14(lf)] and elevated lin-14 [lin-4(lf)]. We confirmed that at least one of these genes, ins-33, is a bona fide transcriptional target of LIN-14; ins-33 promoter activity is temporally regulated by lin-14 in vivo, and LIN-14 binds in vitro to ins-33 upstream sequences that are essential for negative developmental regulation by lin-14. These results demonstrate that lin-14 encodes a novel class of transcription factor that controls developmental timing via direct DNA binding.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Nuclear isolation. Nuclei from L1 larvae were isolated as described (29) except that worms were homogenized in buffer A (250 mM sucrose, 10 mM Tris-HCl, pH 8.0, 10 mM MgCl2, 1 mM EGTA, 1 mM DTT, 2 µg/ml aprotinin, 1 µg/ml leupeptin, 1 µg/ml pepstatin A, 2 mM Pefabloc). Nuclei were stored at 80°C in 10% glycerol and buffer B (buffer A plus 60 mM KCl, 15 mM NaCl, 15 mM Tris-HCl, pH 7.4, 3 mM MgCl2, 2 µg/ml aprotinin, 1 µg/ml leupeptin, 1 µg/ml pepstatin A, 2 mM Pefabloc) (13). Nuclear DNA concentration was estimated by optical density at 260 nm of an aliquot in 1% sodium dodecyl sulfate (SDS).
MNase assay. Nuclei were thawed on ice and recovered by centrifugation in a refrigerated microcentrifuge at 11,500 rpm for 10 min. Nuclei were resuspended in micrococcal nuclease (MNase) buffer (10 mM HEPES, 10 mM KCl, 0.5 mM MgCl2, 2 mM CaCl2, 2 µg/ml aprotinin, 1 µg/ml leupeptin, 1 µg/ml pepstatin A, 2 mM Pefabloc) to a DNA concentration of 1 µg/µl (10). MNase (Worthington Biochemical) was added at concentrations from 10 to 150 units/100 µg DNA, and the nuclei were incubated on ice for 35 to 45 min. Digestion was stopped by the addition of an equal volume of MNase buffer supplemented with 10 mM EDTA. After a 10-min spin at 4,000 rpm in a refrigerated microcentrifuge, the supernatant was either loaded onto a sucrose gradient (see below) or subjected to centrifugation at 100,000 x g for 1 h in a Beckman tabletop ultracentrifuge (adapted from Liang and Stillman [28]). For RNase digestion, DNase-free RNase A (Sigma) was used under the same incubation conditions as those for MNase, and the digestion of RNA was verified by agarose gel electrophoresis after RNA isolation using Trizol (GibcoBRL).
For sucrose gradient analysis, 5 ml of 20 to 40% sucrose gradients in MNase stop buffer was centrifuged for 3.25 h at 40,000 rpm in a Beckman ultracentrifuge (SW 55 Ti rotor). Fractions (275 µl each) were collected by hand and split; two-thirds of each fraction was precipitated with trichloroacetic acid and resuspended in SDS-polyacrylamide gel electrophoresis (PAGE) loading buffer, while one-third was phenol extracted and ethanol precipitated to recover DNA.
Nuclear matrix assays. (i) High salt procedure. Nuclei were thawed on ice and recovered by centrifugation at 4°C and 11,500 rpm for 10 min in a refrigerated microcentrifuge. Nuclear matrix was prepared from the pelleted nuclei according to published protocols (16, 43). As a control, high-salt-extracted pellets were digested with DNase-free RNase A (Sigma).
(ii) Low-salt procedure. For the low-salt procedure, adapted from Smith et al. (53) and Mirkovitch et al. (35), nuclei were thawed on ice and recovered by centrifugation for 10 min at 4°C, 11,500 rpm, in a microcentrifuge. Nuclei were resuspended in low-salt buffer (5 mM HEPES, pH 7.9, 2 mM EDTA, 2 mM KCl, 0.25 mM spermidine, 2 µg/ml aprotinin, 1 µg/ml leupeptin, 1 µg/ml pepstatin A, 2 mM Pefabloc) at a DNA concentration of 2 µg/µl and incubated at room temperature for 15 min. (As a control, in at least one experiment the nuclei were pelleted after this step to analyze the effect of low-salt buffer alone on nuclear protein release). An equal amount of low-salt buffer supplemented with lithium 3,5-diiodosalicylate to 50 mM was added to the nuclei to a final lithium 3,5-diiodosalicylate concentration of 25 mM, and the nuclei were incubated for another 5 min at 22°C. Extracted nuclei were recovered by centrifugation at 5,000 x g for 3 min at 22°C and washed three times in buffer containing 20 mM Tris, pH 7.4, 20 mM KCl, 70 mM NaCl, 0.5 mM spermine, 0.125 mM spermidine, 2 µg/ml aprotinin, 1 µg/ml leupeptin, 1 µg/ml pepstatin A, 2 mM Pefabloc. Nuclei were resuspended in wash buffer and incubated with RNase-free DNase I (Boehringer Mannheim) at 0.5 U/µg DNA for 15 min at 37°C. After a final 3-min 5,000 x g spin, the nuclear matrix was resuspended in 8 M urea.
LIN-14 fusion protein expression and purification. LIN-14 coding sequences were amplified from a plasmid containing lin-14 cDNA and were cloned into the glutathione transferase (GST) expression vector pGEX-2T (GE Healthcare) to create a fusion protein with the GST module at the N terminus of the fusion protein. Primers were chosen to amplify sequences coding amino acids 292 to 539 or amino acids 284 to 465 [in the case of GST::LIN-14(292-539) or GST::LIN-14(284-465), respectively] (Fig. 1A). Purification of the GST fusion proteins was carried out as previously described (14), except that cells were lysed by sonication, sarcosyl was omitted from the buffer, and an additional clarifying spin step was carried out at 100,000 x g. Protein bound to glutathione-Sepharose beads (GE Healthcare) was either frozen as a 50% bead slurry in 10% glycerol, 50 mM HEPES, pH 7.9, 150 mM NaCl, and 5 mM dithiothreitol (DTT) or eluted from the beads in the above buffer supplemented with 15 mM glutathione. Elution fractions containing the fusion protein were stored at 80°C. Amino acids 284 to 465 were fused to a six-His tag by PCR amplification with appropriate primers, and the His-tagged fusion protein was purified from Escherichia coli by metal affinity resin. Protein fractions were analyzed using 11% SDS-PAGE gels, and proteins were visualized by Coomassie blue staining.
|
Binding site selection assay (SELEX). DNA sequences for consensus site selection were generated from the following oligonucleotide input sequence: 5' AGA GGG ATC CGA TTG CAG-N20-GTG TAG GAA TTC GCC GTG 3' (PO66), where the middle 20 bases (N20) were randomized during synthesis. These sequences have BamHI and EcoRI restriction sites (underlined) in the 5' and 3' ends, respectively, for subsequent cloning. Two PCR primers complementary to the 5' and 3' constant regions were also synthesized (PO67 and PO68). Double-stranded oligonucleotides were synthesized from the PO66 pool by extension with Klenow (NEB) using primer PO68.
Binding reactions were performed in a 250-µl volume for 30 min at 4°C (adapted from Chittenden et al. [11]). Twenty microliters of frozen glutathione-Sepharose beads complexed with GST::LIN-14(284-465) was diluted in binding buffer (50 mM HEPES, 100 mM NaCl, 2 mM MgCl2, 0.5 mM EDTA, 5 mM DTT, 10% glycerol, 100 ng/ml) containing double-stranded oligonucleotides. After incubation, the beads were washed three to four times with 1 ml of cold binding buffer to eliminate unbound DNA. Elution of DNA bound to the immobilized protein was carried out as described (7). Approximately 10% of the recovered DNA was amplified via PCR to prepare DNA for the next round of binding.
Amplification conditions were optimized and the following protocol was used: 95°C for 5 min and then 19 cycles of 94°C for 30 s, 55°C for 30 s, and 72°C for 140 s, with a final step of 72°C for 5 min. After the fourth and fifth rounds of selection, oligonucleotide pools were digested with EcoRI and BamHI and cloned in Bluescript SK vector. Colonies with inserts were selected by blue/white selection and sequenced using a T7 primer. Most colonies contained multiple concatamerized independent inserts. The randomized sequences were compared and aligned manually.
EMSAs. Probes were end labeled with 32P and T4 polynucleotide kinase, and approximately 20 fmol of labeled probe was added to each binding reaction (2 nM final concentration). Electrophoretic mobility shift assays (EMSAs) were performed essentially as described (7). Binding reactions were performed in binding buffer (50 mM HEPES, pH 7.9, 100 mM NaCl, 2 mM MgCl2, 1 mM DTT, 10% glycerol, dI-dC at 0.02 µg/µl final concentration, bovine serum albumin at a final concentration of 1 µg/µl) for 20 min at room temperature. A fusion protein consisting of a six-His-tagged version of LIN-14 containing amino acids 284 to 465 [6HIS::LIN-14(284-465)] was included in the binding reaction at concentrations of 8 nM to 320 nM.
Binding reactions were fractionated by electrophoresis in 3.5% or 4% (acrylamide/bisacrylamide ratio of 60:1) nondenaturing polyacrylamide gels. All gels were run at 4°C at 30 to 35 mA for 1.5 to 3 h in 0.5x Tris-borate-EDTA. Gels were exposed to a storage phosphor screen and images were captured on a Typhoon 8600 Variable Mode Imager using the ImageQuant Software (Molecular Dynamics). Images were further processed using Adobe Photoshop.
DNase I footprinting. The binding reaction for the DNase I footprinting analysis was carried out using the same reaction conditions as for EMSA except that protein [6HIS::LIN-14(284-465)] was added in molar excess to probe at quantities sufficient to retard all of the free probe in EMSA experiments. DNase I (Boehringer Mannheim) was then added to the binding reactions at a concentration of 0.01 U/ng of radiolabeled DNA probe, and the reaction mixtures were incubated for 5 min at room temperature. Digestion was stopped by the addition of an EDTA/EGTA cocktail to a final concentration of 2 mM each, and the samples were heated at 65°C for 10 min, dialyzed against dH2O, lyophilized in a Speedvac, and then resuspended in denaturing loading buffer. Samples were then heated and fractionated by electrophoresis through a 6% urea acrylamide gel (SequaGel; National Diagnostics) in 0.5x or 1x Tris-borate-EDTA buffer at 55°C for approximately 1 to 3 h. After drying, gels were exposed to a storage phosphor screen, and images were captured and processed as described above.
Western blots. Western blot analysis was performed according to standard procedures (7) using Renaissance Enhanced Luminol Chemiluminescence Reagent (New England Nuclear). Proteins were transferred onto Millipore brand Immobilon-P membranes. Primary antibody incubations were performed in 1x phosphate-buffered saline supplemented with 5% nonfat dry milk for 1 to 2 h at room temperature or at 4°C overnight. LIN-14 antisera were generated (Covance Research Products, Inc.) against 6HIS::LIN-14(284-465) (Fig. 1A). Antisera were affinity purified using purified GST::LIN-14(292-539) (Fig. 1A) according to established procedures (18) using cyanogen bromide-activated Sepharose (Amersham Pharmacia Biotech). Whenever reprobing of membranes was necessary, blots were stripped by incubating at 68°C for 30 min in 100 mM Tris, pH 7.4, 100 mM ß-mercaptoethanol, and 2% SDS.
Commercially available monoclonal antibodies were used to detect core histones and H1 (Chemicon International). Anti-green fluorescent protein (GFP) antibodies were purchased from Clonetech (Living Colors A.V.). Donkey anti-rabbit horseradish peroxidase-conjugated secondary antibodies were obtained from Amersham Life Sciences. Monoclonal antisera against C. elegans-TATA binding protein were kindly provided by N. Hernandez (48). Polyclonal antisera against C. elegans-lamin (lmn-1) were generously provided by Y. Gruenbaum (26, 45). Polyclonal antisera against LIN-26 were a gift from M. Labouesse (25).
RNA isolation and mRNA purification.
Total worm RNA was isolated using Trizol (GibcoBRL) as follows. Frozen worm pellets were thawed in the presence of 4 volumes of Trizol and vortexed for 10 min; the mixture was spun at 14,000 rpm for 10 min at 4°C, and the supernatant was removed to a fresh tube, to which one-fifth volume CHCl3 was added; the mixture was vortexed and centrifuged for 15 min at 14,000 rpm at 4°C. The aqueous phase was removed to a new tube, and the RNA was precipitated by the addition of an equal volume of isopropanol. RNA samples were stored at 80°C as an ethanol precipitate. mRNA was prepared from total worm RNA using oligo(dT) cellulose (Sigma) and following published procedures (51). mRNA samples were shipped as dried pellets on dry ice to Stuart Kim's laboratory at Stanford University for cDNA synthesis and microarray hybridization (procedures available at http://cmgm.stanford.edu/
kimlab/wmdirectorybig.html).
Microarray data and statistics. The data from six experiments [three repeats of N2 L1s versus lin-14(n179ts) L1s and three repeats of N2 L2 larvae versus lin-4(e912) L2 larvae] are stored on the Stanford Microarray Database (http://genome-www4.stanford.edu/cgi-bin/SMD/login.pl; the identifier is "experimenter: MHRISTOVA"). The first three experiments (N2 versus lin-14) were performed using microarrays containing 63% of predicted C. elegans open reading frames (42), while the second three experiments (N2 versus lin-4) were performed using microarrays containing 94% of predicted open reading frames (22). For the purposes of this paper, only the genes that were analyzed in all six experiments will be discussed.
Raw data from each experiment were downloaded from the Stanford Microarray Database into Excel files and processed as follows: (i) sort by Spot Flag and discard any rows where the Spot Flag value was nonzero, indicating a bad PCR; (ii) sort by Failed and discard any rows where the Failed value was nonzero, indicating abnormal hybridization; (iii) import into a common file for each type of experiment (i.e., lin-14 or lin-4) the columns from each raw experimental file [RAT2(R/G), which shows a log base 2 transformed ratio of normalized red/green signal for each spot; name of spot (Wormbase designation); chromosome location and description (www.wormbase.org)]; (iv) calculate an average RAT2(R/G) based on the 2 or 3 values (avg; any rows which had only one good experimental value were discarded); (v) calculate a standard deviation (stdev) for the average value; (vi) calculate a t value for each spot by using the formula t = avg*[sqrt(n 1)]/stdev, where n is the number of experiments for which good data exist, sqrt is square root, and stdev is standard deviation; (vii) sort by absolute t value and discard any rows with a t value below 4.303 (below 95% confidence interval for three experiments) or below 12.706 (below 95% confidence interval for two experiments); (viii) sort by absolute average value and discard any rows with average values below 1.0 (less than twofold change compared to control).
Northern blot hybridization. Northern blot hybridization using the formaldehyde method of gel electrophoresis was performed on poly(A)-selected RNA following established procedures (7). After exposure to a storage phosphor screen, images were captured on a Typhoon 8600 Variable Mode Imager using the ImageQuant Software (Molecular Dynamics). Images were subsequently processed using Adobe Photoshop.
Reporter fusions. A 2.0-kb fragment of the ins-33 promoter was amplified and cloned into the GFP expression vector pPD95.67 containing a nuclear targeting signal (13a) to generate the ins-33::GFP vector pVT396. Subsequently, a version of this vector containing only 452 bp of the ins-33 upstream region (for sequence, see Fig. S3 available at http://banjo.dartmouth.edu/Hristova_etal_Sup/) was generated (pVT397) by removing intervening sequences of the promoter by digestion using an internal ClaI site and the HindIII pPD95.67 cloning site. To introduce mutations at putative LIN-14 binding sites within the 452-bp fragment, PCR using oligonucleotides with the mutations was carried out, and the new sequence verified by sequencing.
Germ line transformation and analysis of GFP reporter expression. Worms were transformed by microinjection as previously described (34). Transgenes expressing various regions of LIN-14 protein fused to GFP were previously described (20). The construct carrying the lin-14(ma135) loss-of-function allele was generated by PCR from genomic DNA from mutant animals. The mutation was verified by sequencing. The transgenic strains were generated by microinjection of purified plasmid DNA to a concentration of 20 ng/µl, DraI-digested N2 genomic DNA to a concentration of 100 ng/µl, and pVT301 (col-19::GFP) to a concentration of 15 ng/µl as a coinjection marker (extrachromosomal array maEx177). The following extrachromosomal arrays were generated: maEx171 (plasmid pVT396), maEx172 (plasmid pVT397), and maEx173 (plasmid pVT397mut). Expression of GFP reporters in live transgenic worms was characterized using a Zeiss Axioscop fluorescence microscope. Worms were anesthetized using 1 mM levamisole, and images were captured using an Optronics DEI750 integrating color charge-coupled-device video camera and a Scion CG-7 RGB video capture board on a Power Macintosh G4. Images were further processed using Adobe Photoshop.
RNAi. RNA-mediated interference (RNAi) was performed as previously described (15, 56) except that the isopropyl-ß-D-thiogalactopyranoside concentration in the NGM plates was 5 mM. To make the ins-33 RNAi vector, the coding region of ins-33, including a kilobase of intronic sequence, was amplified and cloned into the feeding vector L4440 (56). Ten L4 or young adult worms were placed onto a plate seeded with the bacterial strain HT115 expressing the RNAi plasmid of interest. After 24 to 30 h, the adults were removed, and progeny were scored 16 to 18 h later for viability and visible phenotypes.
| RESULTS |
|---|
|
|
|---|
|
Among a set of truncated LIN-14::GFP fusion proteins that we tested in transgenic worms, lin-14 rescuing activity correlated precisely with chromosome association, as revealed during mitotic chromosome condensation (Fig. 1; Table 2). In transgenic worms expressing truncated versions of LIN-14 fused to GFP, fluorescence was diffusely distributed within interphase nuclei, but during mitosis, fluorescence becomes concentrated in association with mitotic chromosomes (20) (Fig. 1B to D). For example, in Fig. 1B and C, LIN-14D5::GFP fluorescence can be seen in association with mitotic chromosomes. LIN-14D5::GFP displayed mitotic chromosome association in lin-14(null) animals (Fig. 1C and Table 2), indicating that endogenous lin-14 activity is not required for the chromatin association of a LIN-14::GFP fusion protein.
|
Bacterially expressed LIN-14 protein can bind DNA. To explore further the biochemical activity of LIN-14, we generated a GST-tagged LIN-14 fusion protein [GST::LIN-14(244-465)] containing the LIN-14 sequences from the D9-GFP construct (Fig. 1A). D9 had previously been shown by Hong et al. (20) to be sufficient for lin-14 activity in vivo (Table 2). The GST module at the N terminus of the protein was used to purify GST::LIN-14(244-465) using glutathione beads (GE Healthcare) (14).
To investigate whether LIN-14 can bind nucleic acids, purified GST::LIN-14(244-465) was applied to a double-stranded DNA cellulose column in low salt (50 to 75 mM NaCl), the column was washed with 150 mM NaCl, and bound protein was eluted step-wise with increasing concentrations of NaCl (Fig. 2A). Eluted fractions were collected and run on an 11% SDS-PAGE gel. The fusion protein bound to DNA and was eluted with 400 mM salt, a concentration typical of other DNA-binding proteins (Fig. 2A, lane 5) (55). The fusion protein can also bind single-stranded DNA and elutes at similar salt concentrations (data not shown). A similar LIN-14 fusion protein tagged with a six-histidine element [Fig. 1A, 6HIS::LIN-14(244-465)] also showed similar affinity to double-stranded DNA (data not shown), indicating that the LIN-14 moiety is responsible for the in vitro DNA-binding activity of these fusion proteins.
|
Overall, 188 sequences were recovered from rounds 4 and 5 of the selection process. A total of 163 of these sequences were manually aligned by their core consensus GAAC, which was identified after visual inspection. The results of these sequence alignments (Fig. 2B) (see also Tables S1 and S2 at http://banjo.dartmouth.edu/Hristova_etal_Sup/) suggest a LIN-14 consensus binding sequence GAACRY (complement, RYGTTC).
Genes affected in mRNA abundance in lin-4(lf) and lin-14(lf) mutants. To identify candidate LIN-14 target genes whose transcription would be repressed or activated by lin-14 in the L1 stage, we performed microarray analysis of C. elegans transcript levels at the L1 and L2 stages in RNA prepared from synchronized populations of wild type, lin-14(lf), and lin-4(lf) larvae. The results from two complementary sets of microarray experiments are summarized in Table 3. Of the 11,899 genes assayed, 463 (classes A to D, G, and H) showed at least a twofold statistically significant change in expression in lin-14(lf) mutants compared to wild-type animals; of those 463 lin-14-responsive genes, 311 genes (classes B, D, and H) were elevated in the absence of lin-14, and 152 genes (classes A, C, and G) were reduced. A total of 442 genes (classes C to H) showed at least a twofold statistically significant change in expression in lin-4(lf) mutants compared to wild-type animals; of the 442 lin-4-responsive genes, 334 genes (classes C, E, and H) were elevated in the absence of lin-4, and 108 genes (classes D, F, and G) were reduced. Twenty-one genes (classes C, D, G, and H) were affected in both lin-14(lf) and lin-4(lf) mutants.
|
Although any of the 16 class C and class D genes could be direct targets of LIN-14, we applied further criteria to narrow the set of candidates. First, we compared the list of 16 candidates to published microarray gene expression data that analyzed mRNA levels in RNA from C. elegans larval stages (19). Four of the class C genes (F41C3.5, C50F2.6, W01F3.3, and C44H4.3) were found to be more abundant in L1 larvae than in L2 larvae in a study by Hill et al. (19) (Table 4). This is consistent with our results that these four genes are reduced in mRNA abundance in lin-14(lf) L1 larvae compared to wild type, and hence could be L1-enriched genes that are normally under positive temporal regulation by LIN-14. One of the class D genes (W09C5.4) was detected at a higher level in L2 larvae than in L1 larvae by Hill et al. (19), which is consistent with our finding that W09C5.4 is elevated in lin-14(lf) L1 larvae compared to the wild type, and hence could be an L2-enriched gene that is normally under negative temporal regulation by LIN-14.
|
W09C5.4 encodes a developmentally regulated insulin/insulin-like growth factor (IGF) homolog, ins-33. W09C5.4 encodes a predicted protein of 124 amino acids that is homologous to members of the C. elegans-specific alpha group of insulin/IGF-like proteins (40). In our microarray experiments, W09C5.4 transcripts were at least four times more abundant in lin-14(lf) L1 larvae compared to wild-type L1 larvae and at least three times lower in lin-4(lf) L2 larvae compared to wild-type L2 larvae. These data suggest that W09C5.4 may be an L2-specific gene that is repressed by lin-14 in the L1 stage of the wild type. To confirm this supposition, the developmental profile of ins-33 was examined using Northern blot analysis of RNA from wild-type larval stages. ins-33 transcripts are absent in the L1 and then accumulate in the L2 stage (Fig. 3, compare first and second lanes). In lin-14(lf) L1 animals, ins-33 mRNA accumulates precociously (Fig. 3, compare third and fourth lanes). In lin-4(lf) L2 larvae, where LIN-14 levels are abnormally high, ins-33 message is reduced compared to wild type (data not shown). These results indicate that ins-33 gene expression is developmentally regulated and that lin-14 negatively regulates ins-33 during the first larval stage.
|
Faint expression of the maEx172 reporter was observed in wild-type L1 animals in the head and in one or two cells in the posterior intestine (Fig. 4A). In contrast to this relative inactivity of the ins-33 promoter in L1 larvae, beginning at the end of the L1 and continuing during the L2 and later stages, expression of ins-33::GFP was dramatically up-regulated in the head and tail hypodermis, the ventral hypodermis, and the main body hypodermis, hyp-7 (although not in the hypodermal seam cells) (Fig. 4B). These observations indicate that the 452 bp of the ins-33 upstream region contained in maEx172 can mediate transcriptional upregulation of ins-33 expression in hyp-7 after the L1 stage of the wild type.
|
Bacterially expressed LIN-14 binds specifically to two sites in the ins-33 upstream sequences. The above results indicate that lin-14 regulates ins-33 expression at the level of transcription. This regulation could occur directly, through direct binding of the LIN-14 protein to ins-33 regulatory sequences in the DNA, or indirectly, by the binding of LIN-14 to another protein or gene that in turn regulates the ins-33 promoter. LIN-14 is a novel protein, and hence its structure does not easily allow prediction of biochemical function. However, evidence in favor of a hypothetical direct DNA binding activity for LIN-14 comes from its nuclear localization (50) and our findings (described herein) that bacterially expressed LIN-14 can bind double-stranded DNA and can select a specific binding site via SELEX. To test if LIN-14 protein can directly bind to ins-33 upstream sequences, we performed EMSA using bacterially expressed LIN-14 and ins-33 upstream DNA. The fusion protein 6HIS::LIN-14(284-465), which contains a fragment of LIN-14 that is sufficient to rescue lin-14 function (20) (Fig. 1A), was incubated with an end-labeled 452-bp ins-33 DNA fragment (see Fig. S3 at http://banjo.dartmouth.edu/Hristova_etal_Sup/) corresponding to the sequence that we found to confer lin-14-dependent expression of a GFP reporter in transgenic worms (Fig. 4). LIN-14 protein bound to the 452-bp DNA fragment, resulting in a species of altered electrophoretic mobility (Fig. 5A, lanes 2 to 6). At higher protein concentrations, an apparent aggregate between the protein and DNA is evident (Fig. 5A, lane 7). To further delineate the location of LIN-14 binding, a shorter fragment (see Fig. S3 at http://banjo.dartmouth.edu/Hristova_etal_Sup/) of 365 bp was tested. This fragment also shifted in the presence of LIN-14 protein (Fig. 5B, lane 2). The binding was due specifically to LIN-14 protein, as the addition of anti-LIN-14 antibody to the binding reaction caused a supershift of the probe (Fig. 5B, lane 3), while addition of preimmune serum had no effect (Fig. 5B, lane 4).
|
These results indicate that the LIN-14 protein can directly bind to at least two sites in the ins-33 upstream regulatory region. Moreover, the presence of identical nucleotides found at the core of each protected sequence strongly suggests that these nucleotides are required for binding of LIN-14 to the DNA. To test this hypothesis, we introduced mutations at six of the positions in the proximal protected element (Fig. 6A). EMSA assays using 50-bp oligonucleotides with sequences containing the wild-type or mutated proximal protected element were performed to test binding specificity. The 6HIS::LIN-14(284-465) protein bound the oligonucleotides containing the wild-type but not the mutated sequence (Fig. 6B, compare lanes 2 and 8). Moreover, the mutated sequences did not compete away the binding between 6HIS::LIN-14(284-465) and the wild-type sequences (Fig. 6B, lanes 5 and 6), whereas the wild-type sequence did (Fig. 6B, lanes 3 and 4). These results support the conclusion that LIN-14 can bind specifically to two sequences in the ins-33 upstream regulatory region and that residues in the core sequence G(TT/AA)C, possibly in conjunction with residues outside the core, are necessary for this binding.
|
|
| DISCUSSION |
|---|
|
|
|---|
Further evidence in support of a DNA binding activity for LIN-14 in vivo is our finding that LIN-14 exhibits sequence specificity of DNA binding in vitro. We performed a PCR-based binding site selection assay (SELEX) and recovered the putative consensus GAACRY (complement, RYGTTC). This consensus sequence does not appear to match previously known transcription factor binding sites, suggesting that our results identify LIN-14 as a transcriptional regulatory factor of novel sequence specificity.
Also in support of in vivo DNA binding by LIN-14 is the apparent chromatin association of endogenous full-length LIN-14 and LIN-14::GFP fusion protein. We found that a small fraction of endogenous LIN-14 is released from the bulk of chromatin by digestion of worm nuclei with MNase (see Fig. S1 available at http://banjo.dartmouth.edu/Hristova_etal_Sup/). For LIN-14::GFP fusion proteins, the evidence for functional interaction with chromatin is from a correlation between the in vivo rescuing activity of the transgenes and the association of the corresponding fusion proteins with mitotic chromosomes (Fig. 1). Other proteins known to interact directly with DNA and that also interact with condensed mitotic chromatin include ATRX (8) and the C. elegans SMAD, daf-3 (38). By contrast, some DNA binding proteins such as Sp1 and HSF1 appear to be excluded from mitotic chromatin (reference 33 and references therein). Although we find that mitotic chromosome association correlates with function for truncated, overexpressed forms of LIN-14::GFP, the endogenous LIN-14 protein was not detected in association with metaphase chromosomes (data not shown). Accordingly, we interpret the mitotic chromatin binding behavior of the LIN-14 carboxy-terminal domain to represent a low-affinity activity of the native LIN-14, which is not detected unless the protein is present at abnormally abundant amount.
Evidence supports the idea that the association of LIN-14 with DNA and/or chromatin may not represent the only in vivo activity of LIN-14. In particular, we found that a significant fraction of LIN-14 in worms is tightly bound to nuclei, perhaps in association with the nuclear matrix (see Fig. S2 at http://banjo.dartmouth.edu/Hristova_etal_Sup/), and at least some of the LIN-14 released from nuclei by MNase digestion appeared to sediment apart from polynucleosomes (see Fig. S1 at http://banjo.dartmouth.edu/Hristova_etal_Sup/). Perhaps LIN-14 exists in distinct pools in normal nuclei. We propose that one nuclear pool of LIN-14 functions to regulate developmental timing and is in equilibrium between chromatin-bound and chromatin-unbound fractions and that another pool is tightly bound (likely matrix associated). Our results suggest that nuclear matrix association may be secondary to the primary function of LIN-14. In particular, we observed that a truncated LIN-14::GFP fusion protein that can rescue lin-14 heterochronic mutant phenotypes in the hypodermis nevertheless is quantitatively eluted from nuclei with low-salt incubation. Thus, this LIN-14 variant with virtually no detectable nuclear matrix binding still retains at least partial function. We therefore conclude that the pool of LIN-14 strongly associated with the nuclear matrix in vivo does not participate directly in the regulation of gene expression, at least not in hypodermal cells, and may serve as a reservoir of LIN-14 that could be recruited for function in some physiological or developmental circumstances. It remains to be determined whether the loosely bound and tightly bound nuclear pools of LIN-14 are normally both found within the same individual nucleus and, if so, whether there is dynamic interchange among them.
LIN-14 is a transcription factor. The assignment of transcription factor activity to LIN-14 is based on (i) its apparent chromosome association in vivo, (ii) its in vitro DNA biding activity and, in particular, (iii) its functional and biochemical relationship to a canonical LIN-14 target, ins-33. Several characteristics of ins-33 suggest that ins-33 is a direct transcriptional regulatory target of LIN-14 during worm development. First, the ins-33 mRNA is developmentally up-regulated at the L2 stage of wild-type development (Fig. 3) (19), consistent with repression by LIN-14, which is high in the L1 stage and reduced in L2. Second, the level of ins-33 mRNA is dramatically elevated in the L1 stage of animals lacking lin-14 (Fig. 3 and Table 4) and is reduced in the L2 of animals with elevated LIN-14 [lin-4(lf)] (Table 4). Third, the stage-specific expression of ins-33 mRNA in the wild type and its behavior in lin-14(lf) animals are recapitulated by a reporter transgene containing 452 bp of the ins-33 upstream region (Fig. 4). Since this 452-bp region contains no ins-33 coding sequence, these results indicate that ins-33 stage-specific expression and the regulation of ins-33 by lin-14 result from transcriptional regulation via the 452-bp upstream sequence. Finally, two specific sequences within this 452-bp sequence are protected from DNase I digestion in the presence of purified LIN-14 protein (Fig. 5); these sequences are necessary for LIN-14 binding to the ins-33 452-bp region in vitro (Fig. 6) and for lin-14-dependent transcriptional repression in vivo (Fig. 7). Taken together, these results provide strong evidence that LIN-14 is a direct repressor of ins-33 transcription in wild-type worms.
Sixteen putative lin-14 targets. Because our objective has been to test the hypothesis that LIN-14 is a transcription factor, we applied strict criteria to identify a high-confidence candidate target to use as a test case. Therefore, we have not attempted to comprehensively catalog LIN-14 targets. The genes we identified by microarray hybridization as lin-14-responsive undoubtedly include direct targets as well as indirect targets. It is also quite likely that we may have missed numerous targets in our microarray screens for lin-14-responsive genes. One reason could be that a target might be broadly expressed in the animal but regulated by lin-14 in only a subset of cells. An example is cki-1 (T05A6.1), which has been shown to be regulated by lin-14 in the vulva precursor cells (VPCs) (21). By our microarray assays, cki-1 mRNA was essentially unchanged in lin-14(lf) and lin-4(lf). This is most likely because cki-1 is expressed in diverse cell types besides the VPCs, while the VPCs are the only cell type were cki-1 expression is affected by lin-14 mutations (21).
A key criterion for classifying genes as potential LIN-14 direct targets was the occurrence of a sequence fitting the consensus LIN-14 site derived by SELEX. Five of the 16 genes contain a good match to a 12-nucleotide consensus derived from the SELEX data. However, the sequence specificity of LIN-14 binding has not been explored in detail, and so the stringent consensus that we used to search for sites may not encompass all bona fide LIN-14 sites. Many of the 16 predicted target genes contain core G(AA/TT)C sequences in their upstream sequences and, hence, could be targets with sites that more loosely fit the search consensus (for sites see Table S2 at http://banjo.dartmouth.edu/Hristova_etal_Sup).
If the genes listed in Table 4 are bona fide lin-14 targets, how might they be involved in cell fate decisions controlled by lin-14 in L1 versus L2? The lin-14 mutations affect a diverse set of cells and cellular behaviors, including the hypodermal cell division pattern, intestinal cell cycle behavior, neuroblast identity (1), neuronal polarity (17), dauer larva arrest (30), and VPC cell cycle (21). The precise nature of the cellular behaviors controlled by lin-14 depends on cell type, and so it is reasonable to expect that LIN-14 may repress or activate target genes in a cell type-specific manner. Among the 16 high-probability LIN-14 targets listed in Table 4, only one, B0412.1, encodes a predicted transcription factor, suggesting that lin-14 does not chiefly function by regulating the expression of other transcription factors. None of the putative lin-14 targets in Table 4 has been shown to have a loss-of-function heterochronic phenotype. Several of these candidate targets have RNAi phenotypes that include general growth problems, suggesting that these genes function in one or more essential cellular processes. But since their phenotypes are not simply developmental timing defects as with lin-14(lf) or lin-4(lf), it would appear that the products of these genes may function as effectors of the cellular behaviors whose timing is controlled by lin-14.
The ins-33 gene and the function of lin-14 in developmental timing. The evidence presented here strongly suggests that stage-specific expression of ins-33 in hypodermal cells of wild-type worms is controlled by LIN-14 on the level of transcription, suggesting a role for lin-14 as a transcription factor controlling the timing of an L2-specific insulin/IGF signal. One developmental event that is known to be regulated by both lin-14 and by insulin signaling in C. elegans is the expression of the optional third larval stage, the dauer larva. Dauer larvae are developmentally arrested and specialized larvae formed in crowded or starved cultures. In wild-type worms, dauer larva formation can occur only at the L2 molt. The lin-14 loss-of-function mutations affect the timing of dauer formation, including the execution of the dauer remodeling program by the hypodermis (30). lin-14(lf) animals can become dauer larvae precociously, at the end of the L1 stage; moreover, they form partial dauer larvae, where the hypodermis does not remodel properly in patches of the animal. The animal's decision to arrest development as a dauer larva is controlled by at least two environmental signaling pathways: a transforming growth factor ß cascade, which detects a worm-specific pheromone and thus responds to crowding, and an insulin/IGF cascade, which likely reflects the metabolic state of the animal and thus responds to nutrient availability (44). Since dauer larva development is in part controlled by insulin signaling, ins-33 could function to couple the dauer larva program to temporal signals. Thirty-seven insulin-related ligands have been predicted in the worm, compared to three in vertebrates (40). DAF-2 is the C. elegans homolog of the insulin and IGF-1 receptors from vertebrates (24). It appears that daf-2 can mediate different insulin signals for different life history events; for instance, overexpression of ins-1 affects dauer formation, larval growth, and longevity, while overexpression of ins-9 and ins-31/19 affects larval growth but not dauer formation or longevity (40). It is possible that daf-2 also mediates different insulin signals in different cell types. These signals would then collaborate to influence the dauer arrest decision. This collaboration hypothesis is consistent with the finding that, for dauer development, daf-2 functions cell nonautonomously in a number of tissues, including neurons and hypodermis (4). The precise expression pattern of daf-2 has not been reported. Perhaps ins-33 is the insulin signal or one of the insulin signals from the hypodermis, which contribute(s) to the final dauer larva initiation decision.
A phenotype was previously reported for ins-33 loss-of-function by RNAi (15). The phenotype included embryonic lethality and slow-growing and uncoordinated larvae. The authors did not report heterochronic or dauer-related phenotypes. However, the original RNAi bacterial strain used in the study by Fraser et al. (15) has been lost (J. Ahringer, personal communication), and we did not observe an ins-33 phenotype in our own RNAi experiments (data not shown). Since a negative result in RNAi experiments could reflect factors other than the actual requirement for the gene, we cannot say at present whether ins-33 is required for the normal timing or execution of dauer larva development. It is possible that lin-14 controls the timing of dauer larva development through the regulation of ins-33 transcription in concert with another, redundant factor.
Complexity of regulatory pathways involving lin-14 and lin-4. If lin-14 were the only target of the microRNA product of lin-4, then one simple expectation would be that all the genes affected by lin-14(lf) should be reciprocally affected by lin-4(lf). Genetic analysis has shown that mutations in lin-14 are epistatic to lin-4(lf) (1), which would suggest that most, and perhaps all, of the lin-4(lf) phenotype depends on lin-14 activity. However, our results described here suggest that the lin-4/lin-14 pathway is more complex than the simplifying expectation, and may diverge significantly. For example, we observed that numerous genes were significantly altered in lin-4(lf) that were unaffected in lin-14(lf) mutants (Table 3). These genes could include bona fide lin-14 targets that respond to LIN-14 levels in the L2 stage but are independent of lin-14 in the L1 stage. However, lin-4 microRNA is known to have at least one other target, lin-28 (36), which encodes an RNA binding protein that could regulate mRNA levels posttranscriptionally. Many of the genes affected by lin-4(lf) and not by lin-14(lf) in our microarray experiments, indeed, may not be targets of LIN-14 but, rather, could be regulated by lin-4 via lin-28 or other unidentified targets of lin-4.
Some bona fide LIN-14 target genes could be among the genes that did not respond reciprocally to lin-14(lf) and lin-4(lf). In particular, genes of classes A and B (Table 3) changed in the lin-14(lf) microarray experiments but were not affected by lin-4(lf). For many of these genes, expression might have been altered as an indirect consequence of the physiology of the lin-14(lf) mutant. Others could be direct LIN-14 targets whose expression is activated by LIN-14 in the L1 stage but becomes governed by other factors in the L2 stage (rendering them insensitive to removal of lin-4).
The results reported here establish that LIN-14 likely regulates the timing of developmental events through the direct binding to specific recognition sequences of diverse target genes. Down-regulation of LIN-14 levels from the L1 to the L2 stage likely results in the derepression of genes, such as ins-33, that are negatively regulated by LIN-14. It remains to be determined whether most LIN-14 target genes are repressed by LIN-14 binding or whether LIN-14 can also activate transcription in some situations. LIN-14 presumably acts in combination with cell type-specific transcriptional regulators to affect repression or activation of target genes in conjunction with regulating L1 and L2 developmental programs. A complete understanding of how the lin-4/lin-14 circuit regulates diverse developmental events in C. elegans larvae will require a comprehensive catalog of all the genes targeted by LIN-14 in vivo; knowledge of where, when, and how they are affected by lin-14 activity; and genetic analysis of their functional relationships to lin-14.
| ACKNOWLEDGMENTS |
|---|
This work was supported by PHS grant GM34028 to V.A.
| FOOTNOTES |
|---|
Present address: Evanston Northwestern Healthcare Department of Medicine 2650 Ridge Ave., Evanston, IL 60201. ![]()
Present address: MDS Pharma Services, Biopharmaceuticals, 22011 30th Drive Southeast, Bothell, WA 98021-4444. ![]()
Present address: Howard Hughes Medical Institute, University of California San Francisco, 1550 4th Street, Room GD-481, San Francisco, CA 94143-0725. ![]()
| REFERENCES |
|---|
|
|
|---|
2. Ambros, V., and H. R Horvitz. 1987. The lin-14 locus of Caenorhabditis elegans controls the time of expression of specific postembryonic developmental events. Genes Dev. 1:398-414.
3. Ambros, V. 2000. Control of developmental timing in Caenorhabditis elegans. Current Opin. Genet. Devel. 10:428-433.
4. Apfeld, J., and C. Kenyon 1998. Cell nonautonomy of C. elegans daf-2 function in the regulation of diapause and life span. Cell 95:199-210.[CrossRef][Medline]
5. Arasu, P., B. Wightman, and G. Ruvkun. 1991. Temporal regulation of lin-14 by the antagonistic actin of two other heterochronic genes, lin-4 and lin-28. Genes Dev. 5:1825-1833.
6. Ashrafi, K., F. Y. Chang, J. L. Watts, A. G. Fraser, R. S. Kamath, J. Ahringer, and G. Ruvkun. 2003. Genome-wide RNAi analysis of Caenorhabditis elegans fat regulatory genes. Nature 421:268-272.[CrossRef][Medline]
7. Ausubel, F. M., R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, J. A. Smith, and K. Struhl. 1994. Current protocols in molecular biology. John Wiley and Sons, New York, N.Y.