Previous Article | Next Article ![]()
Molecular and Cellular Biology, December 2007, p. 8352-8363, Vol. 27, No. 23
0270-7306/07/$08.00+0 doi:10.1128/MCB.02132-06
Copyright © 2007, American Society for Microbiology. All Rights Reserved.

Department of Microbiology and Molecular Genetics, University of California, Irvine, Irvine, California 92697,1 Developmental Biology Center and Department of Developmental and Cell Biology, University of California, Irvine, Irvine, California 92697,2 Network and Academic Computing, University of California, Irvine, Irvine, California 926973
Received 14 November 2006/ Returned for modification 16 January 2007/ Accepted 5 September 2007
|
|
|---|
|
|
|---|
83% of colon tumors and since CDX1 expression is also often observed in colon cancer, we hypothesize that E-tail isoforms of TCFs are important mediators of Wnt target gene regulation in these tumors.
![]() View larger version (41K): [in a new window] |
FIG. 1. A 30-amino-acid motif in the TCF E-tail is highly conserved and related to a sequence in two heterologous transcription factors. (A) A general domain structure of LEF/TCFs includes the N-terminal ß-catenin binding domain, the high-mobility group DNA binding domain (which also includes the nuclear localization signal [HMG box + NLS]), and the alternatively spliced E-tail. Partial alignment of homologous E-tail sequences in TCF orthologs and the unrelated Huntington's Disease Binding Proteins 1 and 2 shows a high level of sequence conservation of basic, aromatic, and cysteine residues. Regions of sequence conservation are shaded, and a consensus sequence is shown below the alignment, with the four conserved cysteines highlighted in red. hTCF-1E and hTCF-4E are human family members; dTCF, pop-1, hydra, and Ciona represent the single TCF orthologs in Drosophila, C. elegans, H. magnipapillata, and Ciona intestinalis, respectively. (B) Three luciferase reporter plasmids used in this study comprise a multimerized Wnt response element (core sequence shown in red) next to the minimal herpesvirus tk promoter.
|
94%) feature of these proteins, it has been assumed that all LEF/TCF proteins can bind identically and equivalently to any WRE and activate transcription with ß-catenin recruitment. We revisited this issue when it became clear that LEF/TCFs were not equivalent in their abilities to activate a natural Wnt target gene such as LEF1 or CDX1 and that the alternative E-tail provided a special activity (3). An early model suggested that the E-tail recruited the transcription coactivator p300 for activation of CDX1 transcription (20). We therefore tested the hypothesis that the E-tail recruits a cofactor for LEF1 transcription as well as the hypothesis that the E-tail modifies DNA bending or binding. We found that the 30-amino-acid motif within the E-tail functions as a sequence-specific auxiliary DNA interaction domain and that selected Wnt response elements require it for stable binding and ß-catenin activation. We also show, as others have, that overexpression of dominant-negative (dn) TCFs can halt the growth of colon cancer cells, but here we also show that the E-tail is crucial for this activity. We conclude that E-tail isoforms of TCFs are the most potent forms of TCFs in that they have two side-by-side DNA binding domains and are thus able to regulate specific Wnt target genes with lower-affinity WREs. This more potent target activation is important for control of colon cancer cell growth. |
|
|---|
Transient transfection assay. COS-7 cells (2.5 x 105) were transfected using 0.5 µg of reporter plasmids with different, multimerized WREs in the TOPtk backbone (LEF1 promoter 1 [–672, +314; Lop, Top, Lop/Top [Fig. 1B]), 0.1 µg of cytomegalovirus ß-galactosidase plasmid, and FuGENE 6 transfection reagent according to the protocol of the manufacturer (Roche Molecular Biochemicals). Reporter plasmids were cotransfected with expression vectors for ß-catenin (pCS2-ß-catenin; gift of B. Gumbiner) (0.4 µg), LEF/TCFs (0.2 µg), and other expression constructs indicated in the figure legends. Cells were harvested 18 to 20 h posttransfection and assayed for luciferase activity, and activities were normalized using ß-galactosidase activity values. In all cases experiments were performed with duplicate samples, and at least three or more independent experiments were performed for calculation of standard deviation values for error bars.
Western blot analysis. Cell lysates of 1 x 105 DLD1 cells treated with doxycycline (D7p11 cells; 6 x 10–4 µg/ml for dn wild-type TCF-1E [dnTCF-1EWT] induction and 1 µg/ml for mutant dnTCF-1E [dnTCF-1Emt] induction) were analyzed for induced expression of dnTCF-1 by Western blotting with a polyclonal LEF-1 antibody that detects all LEF/TCF isoforms (21) and secondary anti-rabbit immunoglobulin G conjugated to horseradish peroxidase. Blots were developed with ECL reagent (Amersham).
Sulforhodamine B cell growth assay. A 6-day growth curve experiment was performed with DLD-1 cells, which inducibly express dnTCF-1EWT or dnTCF-1Emt. Cells were plated on 96-well plates as eight replicates for each experimental time point. Every 2 days cells were provided with fresh media containing either doxycycline (concentrations indicated above for the Western blot protocol) or water (mock treatment control). Cells were fixed and stained according to published protocols (36). Optical density readings were performed at 490 nm with Spectra Max 340 from the Molecular Devices Corporation. Growth curves were carried out for 6 days, and the standard deviation generated for eight replicates is shown.
EMSA. Electrophoretic mobility shift assay (EMSA) reactions were carried out with 1 to 2 pmol (approximately 200 cps) of oligonucleotide (see sequence listed in Fig. 5) in a final reaction volume of 15 µl containing 50 mM HEPES (pH 8.0), 12.5 mM EDTA, 50% glycerol, 100 mM KCl, 2 mM MgCl2, 1 µg of poly(dI-dC), and 10 mM dithiothreitol. Extracts from COS-1 cells transiently transfected with expression vectors for full-length human TCF-1EWT and TCF-1Emt (CR1 mutant) were used for the assays. Lysates from COS-1 cells were prepared 48 h after transfection by swelling cells on ice, immersing for 15 min in hypotonic lysis buffer (10 mM Tris [pH 7.9], 50 mM KCl, 10 mM Mg2Cl, 0.01 mM EDTA, 1 mM dithiothreitol, 0.01 mM EGTA, 1 mM phenylmethylsulfonyl fluoride, protease inhibitor cocktail), and douncing. A 10% volume of glycerol was added to the cell lysate for storage at –80°C. Lysates from Colo320 colon cancer cells, which express TCF-1E and TCF-4E, were prepared with the same protocol.
![]() View larger version (44K): [in a new window] |
FIG. 5. The E-tail is a sequence-specific DNA binding motif in the context of full-length TCF-1. (A) EMSA with extracts from COS-1 cells expressing full-length histidine-tagged TCF-1E TCF-1EWT or TCF1Emt (10 µg protein) showed that the CR1 mutation in the E-tail (see Fig. 3C for sequence of mutation) decreases TCF-1E binding to an extended probe that encodes the second Wnt response element (bold italics in sequences at bottom of figure) in the LEF1 promoter and a GC-rich motif (underlined) seven nucleotides downstream of the core (LOP2+GC). A Western blot probed with antibody for the histidine tag shows equal levels of wild-type and mutant proteins in the extracts. (B) Purified recombinant GST/E-tail (GST/E-tailWT) can bind directly to the LOP2 + GC probe, but a mutant GST/E-tail with the CR1 mutation is inactive for DNA binding (GST/E-tailmt). A Coomassie-stained gel shows amounts of purified recombinant proteins equal to those used in the EMSA. (C) In a competition assay for binding to LOP2+GC, the indicated molar excess of cold competitor oligonucleotides shows that the downstream GC-rich sequence is not specifically recognized by recombinant purified GST/E-tail (GCSC refers to a "scrambled" mutation of this GC motif). These data are consistent with the SPR experiments (Fig. 4), which show the E-tail binds double-stranded DNA in a non-sequence-specific manner. (D) Extracts of Colo320 colon cancer cells (15 µg protein), which express high levels of TCF-1E and TCF-4E, were used in a competition EMSA as described for panel C. In this competition with endogenous full-length TCF-E isoforms, mutation of the GC element (as in LOP+GCSC and GCSC) reduces the ability of the oligonucleotide to compete for binding to the LOP2+GC probe. Both the WRE and the GC element can compete for some but not all of the binding activity.
|
Duplex DNAs (10 nM to 100 nM) were flowed over the immobilized GST E-tail protein. All the Biacore data were collected at 25°C with running buffer at a constant flow of 40 µl/min. Apparent association and dissociation rate constants were calculated using BiaEvaluation software version 3.0 supplied by the vendor (Biacore Inc, Uppsala, Sweden). Values were derived from three independent experiments using two different recombinant protein preparations. For the GST/E-tailWT versus GST/E-tailmt binding and competition assays, biotinylated LOP was immobilized on the streptavidin chip. The LOP mt sense strand (MOP) was used for competition with single-stranded DNA, and the following oligonucleotides were used for double-stranded DNA competition: sense, 5'-AGATCTACGGAGGACTGTCCTCCGT-3'; and antisense, 5'-ACGGAGGACAGTCCTCCGTAGATCT-3'.
CASTing analysis. CASTing (Cyclic Amplification and Selection of Targets) was performed as described previously with some modifications (details available upon request) (28, 45). A 70-mer GCGTCGACTCTAGACTGCAG-N30-GAATTCGGATCCCTCGAGCG sequence was synthesized to generate the target oligonucleotide library. A 25-µg volume of the oligonucleotide library was converted to double-stranded DNA, and 5 µg was incubated with 15 µg of COS cell extract (see above [EMSA paragraph]). Binding conditions for the first CASTing experiment were as follows: 15 mM Tris (pH 7.5), 50 mM KCl, 5% glycerol, 5 µM EGTA, 5 µM EDTA, and 5.75 mM MgCl2. Binding conditions for the second CASTing experiment were as follows: 35 mM Tris (pH 7.5), 25 mM KCl, 60 mM NaCl, 0.25% NP-40, 2.5 mM beta-mercaptoethanol, and 0.75 mM MgCl2. The second CASTing experiment used COS-1 cell extracts of overexpressed TCFs in which the cells had been lysed in 50 mM Tris-HCl (pH 7.5)-120 mM NaCl-0.5% NP-40-2 mM Pefabloc SC (AEBSF; Roche) protease inhibitor-5 mM beta-mercaptoethanol-8 mM MgCl2. Overexpressed LEF/TCFs were epitope tagged with either TAP tags (LEF-1) or histidine tags (TCF1-EWT and TCF1-Emt). After a 20-min incubation at room temperature, 2 µl of Talon Dynabeads (Invitrogen) or streptavidin beads (Invitrogen) was added and incubated for 10 min at room temperature. Bead-DNA-protein complexes were washed with 500 µl of phosphate-buffered saline containing 0.1% NP-40 and 0.1% bovine serum albumin and resuspended in 100 µl of PCR mixture. The enriched oligonucleotide pool was amplified by PCR using primers complementary to the fixed flanking sequence (5'-GCGTCGACTCTAGACTGCAG-3' and 5'-CGCTCGAGGGATCCGAATTC-3'). Sample aliquots of the PCR were removed after 6, 9, 12, and 15 cycles and analyzed on agarose gels. A 10-µl volume from the aliquot which contained the first visible PCR products was used in the next round of CASTing. From the second round on, cytosolic extract was also preincubated with 1 µg poly(dI-dC) as a nonspecific competitor. After six rounds of CASTing, the selected DNA was cloned into a TOPO vector (Invitrogen) and individual colonies were selected for sequencing. Typically, 70 to 96 colonies were grown in LB in 96-well plates, sealed, and sent to Agencourt for Miniprep treatment and sequencing.
|
|
|---|
![]() View larger version (12K): [in a new window] |
FIG. 2. The E-tail is necessary to activate transcription from weak Wnt response elements. Cos-1 cells were transiently transfected with TOP (A), LOP (B), or LOP/TOP (C) luciferase reporters with expression vectors for the indicated LEF/TCFs and ß-catenin (cotransfected in all conditions). (D) TOP and LOP were cotransfected with increasing concentrations (200, 400, 800, and 1,000 ng) of LEF-VP16 expression vector. At the highest concentration of LEF-VP16, activation of LOP was increased twofold. Error bars represent standard deviations derived from the results of three or more experiments.
|
![]() View larger version (31K): [in a new window] |
FIG. 3. A cysteine motif in the E-tail cooperates with the HMG DNA binding domain for LEF1 regulation. (A) Activation of Wnt responsive reporter plasmids LEF1, TOP, LOP, and LOP/TOP requires the HMG DNA binding domain. A two-amino-acid substitution in the HMG DNA binding domain (KK changed to EE) prevents reporter gene activation even though the E-tail remains the wild type. The E-tail alone is not sufficient to recruit ß-catenin to any of the reporter plasmids (ß-cat/Gal4/E-tail denotes a fusion protein in which the Gal4 DNA binding domain is fused at the N terminus to the ß-catenin binding domain and at the C terminus to the E-tail). (B) Transient transfection assays showed that coexpression of increasing amounts of a Gal4/E-tail fusion protein (Gal4 DNA binding domain fused to the 138-amino-acid E-tail of TCF-1E) does not inhibit the ability of TCF-1E and ß-catenin to activate transcription from the Wnt response elements in the LEF1 promoter 1. (C) A novel cysteine motif is required for LEF1 regulation but not TOP reporter regulation. Cos cells were cotransfected with ß-catenin and mutant TCF-1E expression vectors and the LEF1 reporter plasmid (left panel) or the TOPtk reporter plasmid (right panel). Mutations are single amino acid substitutions in the positions indicated in red type in an E-tail alignment (shown below). Error bars indicate standard deviations for the results of three experiments. Mock, mock treatment.
|
The E-tail contains a novel cysteine motif. The five-amino-acid CRARF substitution that destroys transcription activation removes basic residues and the first of four conserved cysteine residues (Fig. 3C). The four cysteines are 100% conserved in their sequences and spacing between all TCF orthologs (except for TCF-3), suggesting that these residues are important for functioning. The spacing between cysteines does not conform to known metal chelating finger motifs but could be important for a novel structure or finger. To determine whether individual cysteines are essential for activity and whether histidine residues could perform substitutions, the second and third conserved cysteines were individually changed to either alanine or histidine within the context of full-length TCF-1E and tested for activity in transient transfection assays. Single amino acid substitution of the cysteines completely abrogates the ability of these mutants to activate the LEF1 promoter but not TOPtk (Fig. 3C). Histidine substitution also completely inactivated the proteins, suggesting that replacing cysteines with alternative residues that can chelate metals is either not sufficient or not compatible with a functional structure. These results suggest that the cysteines play a fundamental role in TCF-1E action by establishing a tertiary structure optimal for reporter gene activation.
The E-tail is a novel DNA interaction domain. An alternative possibility for a mechanism for E-tail action is that it augments binding of the HMG DNA binding domain to nucleic acids. Support for this possibility was provided by a recent study which identified Huntington's Disease Binding Protein 1 (HDBP1) and HDBP2, two highly related transcription factors that bind to a unique GC motif (GCCGGCG) in the HD promoter (37). The DNA binding domain was delimited to a region that has remarkable amino acid sequence similarity to the 30-amino-acid cysteine motif in the TCF-1 and TCF-4 E-tails (see alignment in Fig. 1A). Interestingly, GC-rich sequences are near each of the two WREs in the LEF1 promoter (CCCGGG shown in Fig. 1B and GCCGGCG seven nucleotides downstream of the second WRE element in the LEF1 promoter; see Fig. 5). Thus, a novel type of DNA binding domain may lie very close to the well-described HMG DNA binding domain and may have sequence-specific recognition properties.
SPR measurements can detect real-time interactions between proteins and nucleic acids, and we used this method to assess E-tail interactions with nucleic acids. We fused the coding sequences of GST to the coding sequences for 125 residues of the E-tail and produced this recombinant fusion protein in bacteria. We modified the surface of SPR chips by covalent conjugation of purified GST or GST/E-tail protein and introduced increasing concentrations of double-stranded oligonucleotides with different sequences, including LOP (which includes the CCCGGG element downstream), the consensus WRE (TOP), and the mutant version of the LOP site (MOP). Rapid binding to all three oligonucleotides was detected with GST/E-tail but not GST (Fig. 4A and data not shown). A quantitative analysis of the binding shows nearly identical kinetics and affinity for binding to all three oligonucleotides, yielding an overall equilibrium dissociation constant (KD) = 1.55 (± 0.33) x 10–8 M (Fig. 4A). The overall KD places binding affinities in the nanomolar range, but the association and dissociation rates are both rapid. The fast on-off rates may explain why our DNase I footprinting reactions with TCF/E-tail isoforms and GST/E-tail protein do not detect protection of the flanking GC sequences in the LEF1 promoter (reference 3 and data not shown). Similar binding kinetics results for any double-stranded DNA sequence demonstrate that the E-tail, at least as a separate recombinant protein fragment, has no sequence specificity in its interaction. Interestingly, we observed that for all three profiles, binding is completely eliminated with excess heterologous double-stranded DNA and is only minimally affected by excess single-stranded DNA or tRNA (Fig. 4B and data not shown). These data suggest that the E-tail preferentially binds to specific structural features of double-stranded DNA and not to other nucleic acids. To more rigorously test the idea that residues in the 30-amino-acid motif are important for DNA interactions, the five-amino-acid CRARF substitution that destroys E-tail-dependent activation of transcription (3) (CRARF mutant; Fig. 3C) was introduced into the GST/E-tail protein and tested using SPR. The binding profile presented in Fig. 4C shows that this five-amino-acid substitution destroys DNA binding. We conclude from these experiments that the cysteine-rich motif in the E-tail is a DNA binding motif.
![]() View larger version (16K): [in a new window] |
FIG. 4. The E-tail is a new type of DNA binding domain. (A) Overlay sensorgrams for SPR analysis of TOP, LOP, or mutant LOP sequence (MOP) binding to immobilized GST E-tail. DNAs were injected at concentrations ranging from 10 nM to 100 nM over immobilized GST/E-tail at a flow rate of 40 µl/min at 25°C. Results are expressed in resonance units (RU) as a function of time in seconds. Increases in RU indicate binding of the double-stranded oligonucleotides to the GST/E-tail-modified surface. (B) Binding competition with a concentration of up to 1,000 nM of either single-stranded DNA or double-stranded DNA was performed with GST/E-tail bound to 100 RUs of immobilized DNA. (C) Equivalent concentrations of purified, recombinant GST/E-tail wild-type protein and GST/E-tailmt mutant protein were compared with respect to their binding activities with 1,800 RUs of immobilized LOP DNA. The E-tail mutation carries the CRARF substitution shown in Fig. 3C. Sensorgrams are representative of specific interactions.
|
The remarkable amino acid sequence similarities between the cysteine motif in the E-tail and the DNA binding domain of HDBP1/HDBP2 suggested that the E-tail could specifically recognize the GC element present in LOP2+GC. We designed a competition EMSA experiment with GST/E-tail to test this possibility. We observed that the results with respect to competition for GST/E-tail binding were similar at 10-fold and 50-fold molar excess no matter what cold competitor was used, even when oligonucleotides where the GC element was scrambled or the WRE was missing were used (Fig. 5C [compare LOP2+GCSC with GC and GCSC]). In contrast, EMSA studies with full-length TCF-1E protein showed that effective competition for TCF-1E binding was best with wild-type LOP2+GC, which included both the WRE and an intact GC element (Fig. 5D [LOP2 + GC]). An oligonucleotide with a scrambled GC element in the context of the WRE (LOP2+GCsc) was a less effective competitor at 10- and 50-fold molar excess. Surprisingly, even the GC element alone, but not a scrambled mutant version, competed for TCF-1E binding (Fig. 5D [compare competition at 10-fold molar excess between GC and GCsc]). Competition by GC appeared to be more effective than competition by the WRE alone, a result that matches the large drop in binding activity seen when the "cysteine clamp" (C-clamp) is destroyed (see Fig. 5A) and our assessment that LOP is a weak WRE on its own. However, neither the WRE alone (LOP+GCSC) nor the GC element alone was as effective for competition as the wild-type sequence which carried both elements. These results suggest that stable binding of TCF-1E to a variant or weaker WRE such as the one encoded in LOP2+GC requires recognition of both the WRE and the GC element, with a heavy reliance on the presence of the GC sequence. This dual requirement is entirely consistent with our transient transfection data, showing that both the HMG DNA binding domain and the C-clamp motif are required for binding to the reporter plasmid in vivo (Fig. 2).
Taking the SPR and EMSA data together, we conclude that the cysteine motif in the E-tail functions as a DNA interaction motif. Its unique arrangement of cysteines and basic residues suggests that it is a novel nucleic acid binding structure. We propose calling this motif a "C-clamp" to refer to its conserved cysteines and role as an auxiliary DNA binding domain. As an isolated, independent protein fragment, the C-clamp does not exhibit sequence specificity. However, in its native context in TCF-1E, where it is juxtaposed to the HMG DNA binding domain, this motif can augment overall DNA binding by providing additional nucleic acid contacts with specificity for GC-rich sequences. For some WREs, C-clamp function may be as important as the HMG DNA binding domain.
CASTing analysis of LEF/TCFs shows the C-clamp has sequence selectivity. If the C-clamp exhibits sequence selectivity within the context of full-length TCF-1 protein, then recognition of Wnt target sequences may be significantly influenced. We used CASTing to assess whether the DNA binding specificity of TCF-1 was altered by the neighboring C-clamp motif (see Materials and Methods). Previous systematic evolution of ligands by exponential enrichment experiments have been used to define LEF/TCF consensus binding sequences, but in every case, purified HMG DNA binding domain fragments were used and neither the E-tail domain nor any other part of LEF/TCF protein was included. In the CASTing experiment reported here, epitope-tagged, full-length LEF-1, TCF-1EWT, and TCF-1Emt proteins were overexpressed in COS-1 cells and extracts of these cells used for in vitro binding assays with a randomized library of oligonucleotides. The proteins were left unpurified to maintain maximum specific activity for DNA binding. The oligonucleotide library was designed with fixed 5' and 3' sequences for amplification, and these regions flanked a randomly synthesized core of 30 nucleotides. After six rounds of epitope-tag purification of the LEF/TCFs and PCR amplification of copurified oligonucleotides, the enriched oligonucleotide pool was subcloned and individual clones were sequenced. An extensive set of independent sequences (108 for LEF-1, 83 for TCF-1EWT, and 94 for TCF-1Emt) were aligned with ClustalW. Alignment files were submitted to WebLogo (http://weblogo.berkeley.edu/) to generate sequence logos that depict base frequencies at each position (Fig. 6A, left panel). The results show that an optimal recognition sequence for LEF-1 TCF-1EWT and TCF-1Emt is an extended 14-nucleotide T-rich sequence with only minor differences in nucleotide frequencies between the three motifs. This extended sequence is highly similar to the consensus sequence previously determined with a recombinant fragment of the TCF-1 HMG DNA binding domain (Fig. 6A [TCF-1 HMG]) but different in flanking positions from the positions determined in a serial analysis experiment (examining chromatin occupancy) which identified ß-catenin occupancy sites in the genome (Fig. 6A [WRE core]) (46). The extended sequence is much more constrained than is the case with native WREs and therefore represents an optimal binding site (perhaps it also represents T/A-rich oligonucleotides with phased A tracts for DNA bending). Importantly, an independent repeat of the CASTing experiment with different extracts of TCF-1EWT and TCF-1Emt produced an interesting modification to the consensus sequence. In contrast to the procedure employing hypotonic lysis of cells in the first experiment, cells expressing TCF-1EWT and TCF-1Emt were lysed with nonionic detergent and reducing agent. This second CASTing protocol yielded 66 and 72 independent sequences and identical 14-nucleotide T-rich sequences for TCF-1EWT and TCF-1Emt (Fig. 6A, right panel). However, in more than half of the sequences enriched with wild-type TCF-1EWT, a short element (5'-RCCG-3') was present downstream of the core WRE with variable spacing of up to three nucleotides. None of the sequences isolated with TCF-1Emt had this small motif. It is notable that this RCCG sequence matches the first half of the HDBP response element recognized by HDBP1/HDBP2. Taking these results together with the EMSA data highlighting the importance of the downstream GC element, we conclude that within the context of full-length TCF-1E, the C-clamp can engage in sequence-selective interactions with DNA. A sequence alignment of experimentally determined Wnt response elements from human target genes showed that the occurrence of a RCCG motif is infrequent but appears downstream of each Wnt response element in the LEF1 and CDX1 promoters (Fig. 6B).
![]() View larger version (50K): [in a new window] |
FIG. 6. CASTing analysis of full-length LEF-1 and TCF-1E. (A) Crude cell extracts containing overexpressed, epitope-tagged LEF-1 and TCF-1E were used in a CASTing analysis with a randomly synthesized library (see Materials and Methods). Two independent CASTing analyses were performed (left and right panels). The number of independent sequences used in the sequence alignment is shown for each replicate. Alignments are summarized as sequence logos in which the height of the nucleotide designation represents the frequency of occurrence of that nucleotide. A small GC element was enriched with wild-type TCF-1E for the second replicate CASTing (34 out of 66 independent sequences contained the RCCG motif downstream with variable spacing between 0 to 3 nucleotides). The consensus sequence from a previous systematic evolution of ligands by exponential enrichment experiment is shown (TCF-1 HMG) (39) as well as an affinity profile obtained with a TCF-4/Renilla luciferase fusion protein (TCF-4 profile) (18). W and S are International Union of Pure and Applied Chemistry nomenclature for A or T nucleotides (W) and C or G nucleotides (S). (B) Nucleotide alignment of experimentally validated Wnt response elements. Information about these response elements was derived from studies of LEF1 (3), CDX1 (20), CMYC (19), cyclin D1 (35), AXIN 2 (24), and MMP7 (17). Matches to the TOP sequences are shaded gray; mismatched nucleotides are shown in red. RCCG motifs are boxed. Asterisks denote two weak WRE motifs in the human MMP7 promoter that do not respond to LEF-1/ß-catenin when multimerized in a reporter similar to TOPtk and LOPtk (17). ¶ refers to a WRE core sequence that has been experimentally validated in a recent genome-wide survey of sequences occupied by ß-catenin (46).
|
![]() View larger version (16K): [in a new window] |
FIG. 7. The E-tail is involved in regulation of colon cancer cell growth. The results of doxycycline (DOX)-induced expression of dominant-negative TCF-1EWT and TCF-1Emt in DLD-1 colon cancer cells are shown. Quantitation of cell number with or without doxycycline was performed using a sulforhodamine B cell proliferation assay (see Materials and Methods). Error bars depict standard deviations of the results obtained with eight replicates. The inset shows the results of a Western blot analysis of the induced levels of expression of dnTCF-1EWT and dnTCF-1Emt proteins.
|
|
|
|---|
A transcription factor with two DNA binding domains is unusual but not unique. For example, alternative splicing at the 3' end of the p53 tumor suppressor gene generates a longer protein that carries a second DNA binding domain called the CTD (for C-terminal domain) (reviewed in reference 1). The CTD region is enriched in basic residues and binds to DNA without any sequence specificity. Originally thought to be a negative regulator of p53 DNA binding, the CTD has since been shown to stabilize DNA binding when presented with chromatinized templates, long naked DNA fragments, minicircular plasmids, or stem-loop structures (12, 15, 30). The CTD also promotes linear diffusion along DNA and efficient recognition and occupancy of p53 target sites in vivo (12, 29). Results of a study using a recent model for the structure of full-length p53 binding to DNA suggested that the CTD discourages binding to nonspecific sites but stabilizes binding to specific p53 elements (25). It is tempting to speculate that the E-tail plays a similar role for TCF-1 and TCF-4.
A new class of DNA binding domain. The C-clamp motif is highly similar to a 30-amino-acid sequence in the extreme C terminus of HDBP-1 and HDBP-2, two transcription factors that bind to the Huntingtin gene promoter (37). This motif may be uncommon, because a comprehensive BLAST search does not detect similarities to proteins in the database other than the HDBP transcription factors. HDBPs were first cloned as DNA binding proteins that bind to a promoter in the papillomavirus promoter (PRF-1) (7) and an enhancer in the GLUT4 gene (GEF) (31). However, through analysis of HDBP1/HDBP2 binding to specific sequences in the Huntingtin promoter, a DNA binding domain was delimited to the region that includes the cysteine-rich 30-amino-acid motif (37). EMSA as well as DNase I footprinting experiments with a GST fusion of the C-terminal 75 aa or 37 aa from HDBP-1 and HDBP-2 showed that both of these small fusion proteins specifically recognize a 7-bp recognition element (GCCGGCG) in the HD promoter. We assessed whether the TCF C-clamp could bind to specific sequences by use of a fusion of GST to the E-tail of TCF-1 in surface plasmon resonance and EMSA studies. Both approaches showed that the E-tail binds equally well to double-stranded oligonucleotides that encode mutant or wild-type WREs as well as a double-stranded oligonucleotide encoding a Gal4 binding element (data not shown). Therefore, at least as an independent, recombinant protein, the C-clamp does not bind to any specific sequence motif and carries moderate affinity for any double-stranded DNA sequence. There may be other activities carried out by the C-clamp, such as interaction with the transcription coactivator p300. It would be interesting to determine whether HDBPs also interact with p300 through this domain (20).
Sequence variation in Wnt response elements. Despite a lack of sequence specificity in SPR assays and EMSAs, the C-clamp exhibits preferential binding to a GC element near the WRE core sequence when in the context of full-length TCF-1E protein. Matches to the GCCGGCG Huntingtin sequence element recognized by HDBP-1 and HDBP-2 are present in the LEF1 promoter downstream of each of the two WRE elements by one and seven nucleotides (LOP, CCCGGCT; LOP2, GCCGGCG) (22). Also, the CDX1 promoter, another E-tail requiring Wnt target, has significant matches to the GCCGGCG element one and seven nucleotides downstream of two of its WREs (CGCGGCC and CCCGCCT) (20). At least for LOP2, this element is important for full-length TCF-1E binding, because its mutation in LOP2 reduced binding in the EMSA (Fig. 5D [LOP2+GCSC]). We did not test this element in a reporter plasmid, because the GC element is too far downstream to permit multimerization of the element and still maintain identical spacing with the TOPtk reporter plasmid. Instead, we multimerized the neighboring WRE from LEF1, which has a partial GC element (CCCGGCT). Mutation of this GC-rich sequence to match the flanking sequences in TOPtk does not prevent TCF-1E from activating this reporter but does reduce the level of activation by the chimeric LEF-1E (see Fig. 1B and 2C). Thus, the GC element may enhance target recognition and regulation when the core WRE is itself a variant. An alignment of the LEF1 and CDX1 WREs with other known WREs in target genes shows that most other targets do not have a flanking GC element (Fig. 6B). The sequences of these elements coincide more closely with the extended consensus derived from HMG fragments of TCF-1 and TCF-4. There is no natural sequence variation in the CTTTG core but some variation in the two nucleotides that follow this core. In fact, sequence deviation from "AT" of these next two positions correlates with WREs that have been experimentally determined to be weaker WREs (LEF1, CDX1, and hMMP7) (this study and reference 17, 20). Thus, variation in the two positions that follow the CTTTG core comes at a cost but does not preclude Wnt regulation. A recent survey of genomic sites occupied by ß-catenin in colon cancer cells identified putative WREs with the core sequence CTTTGWW. Equal frequencies of sequence variations at the "WW" positions were observed followed by preferred nucleotides at 3' positions (46). One might predict that variation at these positions confers LEF/TCF/ß-catenin regulation only through cooperative interactions with specific LEF/TCFs and neighboring factors or via TCF/E-tail isoforms that are more potent and independent. Indeed, LEF-1 regulates MMP7 via cooperative interactions with c-Jun and PEA3 (9, 11), it cooperatively activates CDX1 with CDX-1 itself, and it activates its own LEF1 expression by cooperating with PITX2 (38). Determining how common the downstream GC element is in natural targets, and whether it predicts E-tail-specific regulation, will require identification of a larger set of TCF/E-tail targets.
We used EMSA analysis to test whether the TOP sequence is a stronger WRE than that of LOP or LOP2+GC. We observed that TOP was indeed a stronger binding site (approximately 10-fold and 3-fold more probe shifted than with LOP and LOP2+GC, respectively [F. A. Atcha and B. Wu, data not shown]), supporting the model based on the idea that strong WRE sequences do not require the additional C-clamp. It should be pointed out, however, that the observed decrease in DNA binding with a mutant C-clamp is less dramatic than the effects seen in the transient transfection assays where there is a complete loss of transcription regulation. Likewise, LEF-1 protein can bind to these variant LOP WREs in EMSA and DNase I footprinting assays (3; F. Atcha, unpublished observations), but it is clearly unable to bind productively on its own in vivo. Obviously, recognition of WREs presents more of a challenge in vivo, and therefore the DNA binding activities of the E-tail may be more pronounced with supercoiled, chromatinized templates. This is the case for the auxiliary DNA binding domain in p53, where the CTD actually inhibits DNA binding in vitro but facilitates strong binding to chromatin in vivo (12, 15, 30).
DNA binding affinities and bending of full-length LEF/TCFs.
Analysis of recombinant GST/E-tail in surface plasmon resonance experiments showed that the E-tail binds double-stranded DNA directly with an affinity 10- to 20-fold weaker (approximately 16 nM) than the affinity of the HMG DNA binding domain for WREs (approximately 1 nM). Two side-by-side DNA binding domains might be predicted to give E-tail isoforms of TCFs very high affinities for WREs, and we did observe differences in the binding of wild-type and mutant TCF-1E to LOP2+GC (Fig. 5A and D) but not differences that suggest that binding is cooperative. It will be important to measure the DNA binding affinity of full-length TCF-1EWT and TCF-1Emt, but so far this has proven technically challenging. Only one study has been published in which DNA binding affinities of full-length, purified LEF/TCFs, including TCF-3E (which does not have an intact C-clamp), were measured (33). KD values for all of the proteins fell in the micromolar range, whereas, interestingly, removal of the E-tail from TCF-3 slightly improved binding (1.5 µM versus 4.5 µM). Micromolar KD values are much weaker than the nanomolar affinities measured for purified HMG DNA binding domains. This large disparity in DNA affinity measurements could be due to the fact that full-length LEF/TCF proteins suffer a large loss (down to approximately
0.1%) in specific activity when they are purified, a property that stems from the fact that purification yields unfolded, unstructured polypeptides (10, 27). The goal of future studies will be to overcome these technical issues so that the contribution of the C-clamp domain to DNA binding can be measured.
A hallmark activity of HMG boxes is that they introduce sharp bends in DNA. Bending measurements have been carried out with protein fragments of the HMG DNA binding domain but not with full-length proteins and never with the juxtaposed E-tail (14, 16, 26). We tested whether the C-clamp influenced DNA bending by use of circular permutation EMSAs with full-length LEF-1, LEF-1E, TCF-1EWT, and TCF-1Emt (F. Atcha, data not shown). Full-length LEF-1 protein induces a bend angle of 132° ± 6.8°, and fusing an E-tail onto the end of LEF-1 does not significantly alter DNA bending (121° ± 6.6°). Interestingly, full-length TCF-1EWT bends DNA at roughly half the angle seen with LEF-1 protein (i.e., 67° ± 2.0° and 70.3° ± 1.0° in independent determinations), but the CR1 mutation does not alter this bending (66.5° ± 1.9° and 70.5° ± 1.37° in independent determinations). Differences in bending could certainly influence the architectural roles of LEF-1 and TCF-1 at sites where this activity is important. Indeed, TCF-1 bending is more similar to the 70° and 80° bend angles induced by SRY, SOX-5, and SOX-2 than it is to that seen with LEF-1 (8, 34, 44). However, we conclude that bending is not a critical part of the differences between LEF-1 and TCF-1E action with respect to LEF1 and CDX1 promoter activation because the E-tail, which is the only observed requirement, does not influence DNA bending. Once again, this is similar to the p53 CTD results in that its presence does not appear to alter the conformation of the sequence-specific DNA binding domain (4).
Biological relevance of the C-clamp to LEF/TCF actions. Our findings do not rule out the possibility that LEF-1, TCF-3, and other isoforms of TCF-1 and TCF-4 missing the C-clamp are able to regulate Wnt targets such as LEF1 or CDX1. Rather, our study showed that LEF/TCF isoforms without the E-tail must achieve stable interactions through other mechanisms, such as cooperative interactions with other factors. This is highly likely, since LEF/TCFs are known to be context-dependent transcription regulators that interact with other proteins at various targets (for examples, see reference 2). In fact, a recent study by Béland et al. showed that even though the E-tail-requiring CDX1 promoter cannot be activated by LEF-1 in transient transfection experiments, LEF-1 can activate CDX1 transcription in vivo through cooperative interactions with the CDX-1 protein itself (6). Others have shown that LEF-1 can activate transcription of its own promoter when it is coexpressed with PITX2 isoforms (38). Whether these examples of cooperativity are due to stabilization of LEF-1 DNA binding or whether they involve a different mode of complex formation is not known. However, for CDX1, a LEF-1/ß-catenin fusion protein was not active on its own unless coexpressed with CDX-1, implying that CDX-1 might indeed be functioning to stabilize LEF-1 interactions with this promoter.
It is becoming increasingly clear that vertebrate LEF/TCFs exhibit unique patterns of activities and that the C-clamp is a feature of these specializations. Since the C-clamp motif is as conserved among ancient orthologues as the ß-catenin binding domain and the HMG DNA binding domain, one can hypothesize that there are cell phenotypes dependent on the presence of this motif. We asked here whether the E-tail carries any biological significance for TCF activities in DLD-1 colon cancer cells. DLD-1 cells contain very high levels of ß-catenin in the nucleus and high concentrations of ß-catenin/TCF complexes. Overexpressed dominant-negative TCFs compete with the high concentrations of ß-catenin/TCF complexes and downregulate Wnt target gene expression, including that of genes involved in the cell cycle and cell growth (5, 42). Overexpression of dominant-negative TCF-1E (dnTCF-1EWT) in DLD-1 cells causes a stall in the cell cycle but overexpression of dominant-negative TCF with a mutation in the C-clamp (dnTCF-1Emt) does not (Fig. 6). This means that some of the growth control genes downregulated by dnTCF-1EWT contain Wnt response elements that require an intact E-tail for stable association. Since overactivation of growth control genes by TCF/ß-catenin complexes can lead to cancer, interference with the function of the C-clamp might undercut these oncogenic actions. LEF-1 and TCF-3, vertebrate family members that do not have an intact C-clamp, either have lost the ability to carry out these essential functions or have compensated by acquiring the ability to cooperate with other factors and pathways. Further studies of the structure of the C-clamp and its influence on target gene selection will be important for fully understanding its role in Wnt signaling.
This work was funded by National Institutes of Health grants CA108697 and CA096878 to M.L.W. and HD36081 to J.L.M. We gratefully acknowledge the support of the Optical Biology Shared Resource and the Cancer Center Support grant (CA-62203) at the University of California, Irvine, for the SPR studies.
Published ahead of print on 24 September 2007. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»