Previous Article | Next Article ![]()
Molecular and Cellular Biology, November 2005, p. 10148-10158, Vol. 25, No. 22
0270-7306/05/$08.00+0 doi:10.1128/MCB.25.22.10148-10158.2005
Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry, Center for Molecular Toxicology, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, Tennessee 37232
Received 13 April 2005/ Returned for modification 6 June 2005/ Accepted 28 August 2005
|
|
|---|
|
|
|---|
1,850 TFs encoded within the human genome, and the comprehensive examination of these proteins and their associated regulatory pathways is required to further understand the molecular basis of disease (71). The elucidation of complex transcriptional cellular networks (the identification of TFs, binding sites within the genome, and regulated genes) is an area of intense research focus in the post-human genome-sequencing era. Many techniques such as subtractive hybridization, differential display, microarrays, and serial analysis of gene expression have been employed to monitor gene expression changes on a genome-wide level. However, a limitation of these methods is that the observed gene expression changes may be due to direct or indirect regulation, making it challenging and time-consuming to delineate the primary gene targets. To this end, chromatin immunoprecipitation (ChIP), the localization of genomic TF binding sites through the cross-linking and immunoprecipitation of an endogenous TF from cells, is a valuable tool used to identify regulatory regions directly bound by a TF in the context of the native chromatin structure. A method widely used to study TF binding is ChIP followed by microarray analysis (ChIP-chip); this method often involves microarray hybridization of the ChIP DNA fragments to specific, known promoter regions or to genomic CpG island regions (35, 41, 47, 55). One critical limitation of some ChIP-chip analyses is that only segments of promoter regions are analyzed. Thus, other genomic regions that may be bound by TFs (distal regions and enhancers) are excluded from these analyses. By using ChIP and high-density oligonucleotide arrays to analyze Sp1, c-Myc, and p53 binding on chromosomes 21 and 22, investigators found that a majority of the binding sites for these transcription factors do not reside within 1 kb of a CpG island or the 5' exon of a gene (8) and, thus, would have been overlooked using limited promoter fragments on a microarray chip. ChIP-chip involving high-density oligonucleotide arrays is beneficial for whole-genome profiling of TF binding, but currently only single chromosomes are being analyzed, limiting global pathway analyses (19, 48). To circumvent some of these issues, we developed an experimental approach to assay the genome for TF binding sites that link to transcriptionally regulated target genes and used the TF p53 to demonstrate the effectiveness of the method.
The tumor suppressor p53 binds DNA with high affinity in a sequence-specific manner (40) and regulates multiple signaling pathways involved primarily in growth arrest, DNA repair, and apoptosis in response to various cellular stresses (64). To date, more than 100 genes directly regulated by p53 have been identified, leading to a better understanding of the mechanism of p53 tumor suppression (14, 30, 50). However, it is estimated that there are
1,600 sites to which p53 can bind in the human genome (8). Thus, it seems the genome contains many unidentified p53 target genes, and the identification of these genes will give further insight to complex p53 signaling networks.
We used ChIP to capture p53 binding to genomic DNA on a global level after damage, and the resulting DNA fragments were screened for functional p53 binding and transactivation of a reporter gene using a modified yeast one-hybrid system (69). Our screen yielded not only valuable information regarding the sequence and characteristics of functional p53 binding sites within the genome but also novel target genes and would be a valuable approach to pursue for any number of TFs.
|
|
|---|
Western analyses. Western analysis was performed as described previously (72) using anti-MDM2 SMP14, anti-p53 PAb1801 (Santa Cruz Biotechnology, Santa Cruz, CA), and anti-p21Waf1/Cip1 Ab-1 (Oncogene Research Products, Cambridge, MA) antibodies.
Formaldehyde cross-linking and chromatin immunoprecipitation. Cells were formaldehyde cross-linked in a 1.6% solution for 10 min as described previously (66). For every 2 mg of protein extract processed for ChIP, 10 µg of mouse immunoglobulin G bound to protein A-Sepharose (Pharmacia Biotech, Piscataway, NJ) was used to preclear for 1 h with rocking at 4°C. The extracts were immunoprecipitated with 1 µg of anti-p53 PAb1801 (Santa Cruz) and 1 µg of anti-p53 PAb421 (Oncogene Research Products, Cambridge, MA) or 2 µg of anti-cyclin B1 GNS1 antibodies (Santa Cruz) by rocking overnight at 4°C. For the library generation, a total of 75 mg of MCF-10A cell or HMEC protein lysates was immunoprecipitated, and for the ChIP analyses shown in Fig. 3 and 4, 2 mg of protein lysates was immunoprecipitated from the MCF-10A cells, HMEC, or HK cells. Immunocomplexes were washed eight times as described previously (66), and the protein was degraded in digestion buffer (120 µg/ml proteinase K, 10 mM Tris [pH 7.5], 5 mM EDTA, and 0.5% sodium dodecyl sulfate) at 56°C overnight and then incubated at 65°C for 30 min. The DNA was phenol-chloroform extracted, and ethanol precipitated.
![]() View larger version (56K): [in a new window] |
FIG. 3. Analysis of p53 in vivo binding to consensus binding sites. Three sets each of MCF-10A cells, HMEC, and HK cells were identically processed: one set was treated with ADR (350 nM for 5 h) and formaldehyde cross-linked (ADR +, X-L +), another set was not treated with ADR and was formaldehyde cross-linked (ADR , X-L +), and a final set was treated with ADR and not formaldehyde cross-linked (ADR +, X-L ). The DNA for PCRs was derived from p53-specific and cyclin B1-specific immunoprecipitations (IP) and amplified using primers flanking the p53 response elements in genes encoding the indicated proteins. PCRs were resolved with polyacrylamide gel electrophoresis, and the gels were stained with ethidium bromide. The cyclin B1-specific IPs were included to assess any DNA fragments purified from cross-linked lysates nonspecifically. Input +, genomic input; input , water control. PCR results with primers directed to the coding region of GAPDH serve as a control for nonspecific DNA IP by p53-specific antibodies.
|
![]() View larger version (41K): [in a new window] |
FIG. 4. Comparative analysis of p53 binding to sites in promoter regions of known and candidate target genes. In the left panels, four sets of HMEC were processed as follows: one set was treated with ADR (350 nM for 5 h), formaldehyde cross-linked, and immunoprecipitated with a p53 antibody (solid bars); another set was treated with ADR, formaldehyde cross-linked, and immunoprecipitated with a cyclin B1 antibody (open bars); a third set was treated with ADR, not formaldehyde cross-linked, and immunoprecipitated with a p53 antibody (dotted bars); and a final set was formaldehyde cross-linked but not treated with ADR and then immunoprecipitated with a p53 antibody (gray bars). Quantitative real-time PCR was performed, and each sample was normalized to the same genomic DNA that was isolated from cells that were cross-linked and processed the same with the exception that the immunoprecipitation step was not performed. The binding sites shown are those that were recovered from the library screen (LS), those that were previously reported in the literature (RS), and those that were potential binding sites found by gene analysis using the p53MH algorithm (PS). The base pair match of the binding site to the p53 consensus is shown in parentheses. The results, shown as percentages of input DNA, are from at least three independent experiments, with the error bars representing standard deviations. The right panels show schematics of the genomic structure and localization of known and putative p53 binding sites analyzed. The bar shading indicates species conservation as indicated. Exons are indicated with an E followed by the exon number in either an open box or a shaded box (representing the terminal exon). The sequences of the binding sites present in the regions analyzed are shown, and in parentheses the distances of the binding sites from the start of exon 1 are given.
|
The pBM947-based library DNA fragments were amplified from the yeast that grew in a p53-dependent manner using a modified yeast colony PCR technique (7). Briefly, 50 cycles of PCR were performed after an initial 4 min at 95°C; each cycle consisted of 1 min at 95°C, 1 min at 56°C, and 1.5 min at 72°C, before a final elongation step of 10 min at 72°C. PCR DNA products were resolved in a 1% agarose gel, stained with ethidium bromide, purified, and sequenced.
RNA preparation and reverse transcription. mRNA was harvested from the HIp53 and HK cells as previously described (66). Total RNA was harvested from the HCT116 cells using the Aurum total RNA minikit (Bio-Rad Laboratories, Hercules, CA). Reverse transcription of 100 ng of poly(A) RNA from the HIp53 and HK cells and 250 ng of total RNA from the HCT116 cells was performed using the TaqMan reverse transcription reagent kit (Applied Biosystems, Foster City, CA).
PCR amplification. The ChIP PCR amplifications for CDKN1A (p21) site 1 and DDB2 (p48) were performed in 16.6 mM (NH4)2SO4, 0.67 mM Tris (pH 8.8), 6.7 mM MgCl2, 10 mM ß-mercaptoethanol, 10% dimethyl sulfoxide, 1.5 mM nucleotides, and 1.25 U of Taq polymerase (Fig. 3). A total of 175 ng of each primer was used per 25-µl reaction mixture. The ChIP PCR amplifications for EDN2, PPM1J, RPS27L, UBTD1, PDGFC, and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) were performed in 10 mM Tris (pH 9.0), 50 mM KCl, 0.1% Triton X-100, 0.5 mM MgCl2, 0.25 mM nucleotides, and 1.25 U of Taq polymerase (Fig. 3). Each primer was used at 0.4 µM per 25-µl PCR mixture. Primer sequences and PCR conditions are provided in Table S1 in the supplemental material.
Quantitative real-time PCR (Q-RT-PCR) was performed for p53 binding analyses (Fig. 4) using chromatin-immunoprecipitated DNA (described above) and normalized to a genomic template that was cross-linked and sonicated, but not immunoprecipitated. Q-RT-PCR was performed for the gene expression analyses (Fig. 5) using 2 ng of reverse-transcribed mRNA or 5 ng of reverse-transcribed total RNA template. Q-RT-PCR SYBR-Green Supermix (Bio-Rad) reaction mixtures contained 200 nM of each primer and were run on an iCycler thermal cycler (Bio-Rad). Primers were designed using the Beacon Designer software (Bio-Rad). Target gene expression was normalized to GAPDH, and changes were calculated relative to control samples. For primer sequences and PCR conditions see Table S1 in the supplemental material.
![]() View larger version (31K): [in a new window] |
FIG. 5. p53-dependent regulation of representative candidate target gene expression. The isogenic pair of HCT116 p53+/+ and p53/ cells were treated with ADR (350 nM) for 0, 6, 12, and 24 h; the HIp53 cells (p53) and corresponding vector control cell line (Ø) were treated with ponasterone A (10 µM) for 24 h, and the HK cells were infected with a GFP- or p53-expressing adenovirus for 30 h. Total RNA from HCT116 cells and mRNA from HIp53 and HK cells was purified and reverse transcribed and quantitative real-time PCR performed. The samples were normalized to GAPDH, and the results are presented as changes relative to either the 0-h HCT116 p53+/+ sample (left panel), the HIp53 vector control cells treated with ponasterone (middle panel), or the HK cells infected with a GFP-expressing adenovirus (right panel). The results are the means of three independent experiments (MCF-10A cells and HMEC) or duplicate experiments (HK cells), with error bars representing the standard deviations. Note that the y axes are set to 7.0 with the exceptions of the left and middle panels for the CDKN1A gene and the left and right panels for the EDN-2 gene.
|
|
|
|---|
![]() View larger version (32K): [in a new window] |
FIG. 1. Analysis of p53, MDM2, and p21 protein levels in MCF-10A cells and HMEC after ADR treatment. Shown is Western blot analysis of p53, MDM2, and p21 protein harvested from MCF-10A and primary HMEC that were not treated () or treated (+) with ADR (350 nM) for 8 h.
|
600- to 1,000-bp fragments. The chromatin was immunoprecipitated using p53-specific antibodies, the cross-linking was reversed, and the DNA fragments were purified and cloned into the pBM947 yeast expression vector upstream of a minimal promoter that controls the expression of the HIS3 gene. The pBM947-based library of putative p53-binding DNA fragments was transformed into an auxotrophic histidine-deficient yeast strain with a galactose-inducible yeast expression vector containing the human p53 gene (pRS314SN). Yeast transformants containing both the pRS314SN and pBM947 vectors were able to grow on galactose-containing, histidine-deficient media if the expressed p53 bound to the human genomic DNA fragment in the pBM947 vector and activated transcription of the HIS3 gene. To rule out false-positive clones that grew in a p53-independent manner, we also replica plated all clones on glucose-containing, histidine-deficient media. The clones that grew in the presence of glucose and, thus, in the absence of p53, were considered false positives and not further analyzed (Fig. 2).
![]() View larger version (90K): [in a new window] |
FIG. 2. Yeast selection system. The candidate upstream activating sequences (UAS) recovered from ChIP were cloned into the pBM947 reporter vector containing the HIS3 gene under the control of a basal GAL1 promoter and a URA1 marker. The pBM947-based library was transformed into an auxotrophic His-deficient yeast strain containing the pRS314SN vector, which expresses a galactose-inducible human wild-type p53 and a TRP1 marker. Yeasts containing both the vectors were grown on galactose-containing, histidine-deficient media (SG-Trp-Ura-His) to assay for the ability of p53 to bind to the potential UAS in the pBM947 vector and activate transcription of the HIS3 gene. Replica plating of all clones on glucose-containing, histidine-deficient media (SD-Trp-Ura-His) was performed to rule out false-positive clones. The clones that grew in the presence of glucose were considered false positive, and only the clones that grew on galactose, and presumably in a p53-dependent manner, were analyzed further. A clone containing a fragment of the p21 promoter encompassing site 1 is indicated as an example of a positive result.
|
3.2 x 106 yeast transformants were screened, and 187 clones grew in a p53-dependent manner. After sequencing, 51 nonredundant sequences were identified. For the HMEC library,
1.7 x 106 yeast transformants were screened, resulting in 74 nonredundant positive clones that grew in a p53-dependent manner. A control library generated from immunoprecipitation of identically processed lysates using an isotype-matched antibody (cyclin B1) was also screened to determine the level of nonspecifically bound DNA fragments that contained sequences conforming to the p53 consensus. There was a sevenfold reduction in the number of colonies from the control library that grew in a p53-dependent manner; of those only four clones had putative p53 binding sites as identified by the p53MH algorithm (34). Thus, the majority of the DNA fragments present in the experimental libraries resulted from specific interactions with p53. Analysis of genomic DNA fragments for p53 consensus binding sites and the identification of candidate target genes. The DNA fragments isolated from the combined screening of the HMEC and MCF-10A cell libraries were analyzed for the presence of p53 consensus DNA binding sites using the p53MH algorithm (34). The canonical p53 binding site consists of two repeats of the 10-bp sequence PPPCWWGYYY (where P = A/G, W = A/T, and Y = T/C) separated by a 0- to 13-bp spacer region, (15). Of the fragments isolated from the HMEC and MCF-10A cell libraries, 99% contained putative p53 binding sites. A representative list is shown in Table 1; for the additional sites see Table S2 in the supplemental material.
|
View this table: [in a new window] |
TABLE 1. Representative p53 consensus binding sites and candidate target genes
|
100 candidate target genes were identified, including both known and predicted genes. The chromosomal location, the location of the binding site in relation to the known or candidate target gene, and the distance of the binding site from the beginning of exon 1 of the associated known or candidate gene are shown (Table 1; see Table S2 in the supplemental material). The genes identified from the screen encode proteins involved in numerous cellular pathways including protein kinase/phosphatase signaling, protein transport, growth factor signaling, RNA binding, and chromatin remodeling. Numerous predicted genes with unknown functions were also identified. Of note, less than 25% of the DNA fragments aligned to regions of the genome that did not contain known or predicted genes within the criterion range. We recovered binding sites that were both 5' to the candidate target genes and intronic, and, interestingly, we also recovered p53 binding sites 3' of putative target genes. In addition to novel genomic p53 binding sites, the library screen (LS) yielded multiple known p53 binding sites. For instance, both of the p53 binding sites from the CDKN1A (p21) promoter were isolated (17), as well as the p53 binding sites from the MDM2 (77) and DDB2 (68) genes. The recovery of known p53 response elements that regulate key p53 target genes validated our method and demonstrates that the screen selects for functionally relevant binding sites and physiologically important target genes. Analysis of in vivo binding of p53 to novel consensus binding sites. To further validate p53 binding to the novel binding sites recovered from the library screen, we generated pools of DNA fragments from ChIP of ADR-treated HMEC and MCF-10A cells and primary cultures of HK cells. The latter were included in our analyses to compare binding results across primary cell types from two human tissues. The ChIP DNA was used as a template for PCR amplification of the genomic regions containing response elements obtained from the library screen. All of the genomic binding sites shown in Table 1 were present in these replicate pools of DNA fragments, and representative ChIP-PCR results from the HMEC, MCF-10A cell, and HK cell lysates are shown in Fig. 3. The binding sites shown were chosen due to their differential locations with regard to exon 1 of the associated candidate gene, as well as the varied cellular functions of the target genes. Response elements in CDKN1A (p21) and DDB2 (p48) genes, known p53 target genes, as well as the binding sites associated with EDN-2, PPM1J, RPS27L, and PDGFC genes were bound by p53 before ADR treatment as well as after ADR treatment in all cell systems analyzed. The UBTD1 gene-associated p53 response element was bound by p53 before ADR treatment in the MCF-10A cells, but binding was not detectable in the HMEC and HK cells. However, binding of p53 to the UBTD1 gene response element was detected after ADR treatment in all cell lines analyzed.
To analyze the specificity of p53 binding to the sites that we recovered from the libraries, we performed Q-RT-PCR on ChIP DNA fragments using primers specific to regions containing the binding sites identified from our screen, as well as other putative binding sites within 20 kb upstream or downstream of the start of exon 1 identified by the p53MH algorithm (34), but not recovered from our screen (Fig. 4). Cells were treated as outlined in the Fig. 3 legend, and the ChIP DNA was analyzed by Q-RT-PCR. All samples were normalized to a sample of genomic DNA from treated, cross-linked, and sonicated lysates that were not chromatin immunoprecipitated, and the results are shown as percentages of that input. Schematics showing the structure of each gene and the locations of binding sites that were either identified through the library screen, previously reported, or identified using the p53MH algorithm are shown in the right panels of Fig. 4. Also shown for these sites is their conservation across species. For CDKN1A, four potential p53 binding sites were analyzed along with the two binding sites that were recovered from the library screen. These two binding sites are the two previously reported p53 binding sites for the CDKN1A gene (16). Significant binding of p53 was observed only at both of the previously reported sites. Of note, all of the algorithm-derived, potential binding sites matched the consensus to a higher degree than the RS/LS2 site (12 of 20); however, none of the potential sites exhibited more binding than this site. For EDN-2, five potential binding sites, including the one site isolated from our screen, were analyzed. Binding was greatest at the intronic site recovered from the library screen that fit the consensus to a high degree, 18 of 20 bp. Interestingly, this site was not one of the two sites localized nearest to exon 1. For the PPM1J gene, we analyzed the site recovered from our library as well as three other potential p53 binding sites. The only sample that showed appreciable binding was that isolated from the library screen. Interestingly, both the EDN-2 and PPM1J sites isolated from the library screen are not as well conserved (only in rats) as the other binding sites analyzed for these genes. For the RPS27L gene, two potential binding sites and the site isolated by the screen were analyzed, and for the PDGFC gene, three potential sites were analyzed as well as the library-generated site. In both genes, p53 binding was seen predominantly in the library-generated sites and was observed both before and after treatment with ADR. Although several primer sets were used, the UBTD1 gene could not be reliably analyzed by Q-RT-PCR.
p53-dependent regulation of candidate target gene mRNA. To determine if p53 binding to response elements correlated with p53-dependent transcription for the genes shown in Fig. 3, we performed Q-RT-PCR analysis on RNA isolated from three cell model systems: (i) the isogenic pair of HCT116 p53+/+ and p53/ cells (6) treated with ADR, (ii) an inducible H1299 cell line in which p53 expression was under the control of the ecdysone promoter and induced by ponasterone A addition (HIp53), and (iii) primary cultures of HK cells infected with a p53- or GFP-expressing adenovirus (Fig. 5).
p21 and p48 are encoded by known p53 target genes; they function in cell cycle arrest and DNA repair (64), respectively, and were included in the analysis for comparison of expression between novel and known targets. p21 was induced by p53 in all cell lines examined, and p48 was regulated in only two of the three cell lines. The EDN-2 gene, a gene encoding a secreted vasoactive peptide (ET-2) with a proposed role in cell survival after hypoxia in breast epithelial cells (2, 27), was elevated in a p53-dependent manner in all cell types analyzed. PPM1J, a member of the protein phosphatase type 2C family (52), was also induced in a p53-dependent manner in all cell types. As shown in Table 1, another gene, the RHOC (ras homolog gene family, member C protein [9]) gene, is located in close proximity to the PPM1J gene and the associated p53 binding site. However, p53-dependent regulation of the RHOC gene was not observed (data not shown). The RPS27L gene, a gene similar to the 40S ribosomal subunit S27 gene (65, 78), was also regulated in a p53-dependent manner in all three systems analyzed. Similar to the p48 gene, the UBTD1 gene (9, 65) was regulated by p53 in two of the three cell systems analyzed. Another gene, the ANKRD2 (ankyrin repeat domain 2 protein [39]) gene, is located in close proximity to the UTBD1 gene and the p53 binding site; however, we did not observe p53-dependent regulation of the ANKRD2 gene (data not shown). The gene encoding PDGFC, a member of the PDGF growth factor family (44), was not regulated in a p53-dependent manner in any of the cell systems used in this study. There were no open reading frames within 20 kb upstream or downstream of the binding site, thus ruling out the possibility that a neighboring gene within this distance was the target. Our analysis of PDGFC provides additional evidence that p53 binding does not necessarily equate to transcriptional regulation. In sum, the genes analyzed can be divided into three categories: those that were p53 regulated in all cell systems (CDKN1A, EDN-2, PPM1J, and RPS27L genes), those that were regulated in a subset (DDB2 and UBTD1 genes), and those that were bound but not regulated by p53 under the conditions analyzed in this study (PDGFC gene).
|
|
|---|
B binding to the human genome using ChIP followed by a genomic microarray of human chromosome 22, they also found a significant proportion of binding occurred in the intronic areas (48). We also recovered a binding site that was 3' of the candidate target PPM1J gene, the first report of a functional 3' binding site for a putative p53 target gene. These data reiterate the importance of analyzing the whole genome, not only proximal promoters, when searching for functional TF regulatory regions. A further advantage of using the ChIP-based screen described herein over a screen of random genomic DNA is that the former identifies binding sites that a TF "is" bound to at a specific time under a certain physiological condition versus a screen of random genomic DNA that only identifies fragments of DNA that a TF "can" bind to, but may not under any physiologically relevant condition. Under the conditions used to generate the libraries of DNA fragments for our screen, ADR treatment of breast epithelial cells, we recovered three known p53 target genes, the functions of which are consistent with the biological end point we observed. It was not anticipated that the screen would identify all known p53 target genes given that the ChIP performed would select only for the genomic loci that p53 was bound to after ADR treatment.
After analyzing the characteristics of the binding sites recovered in this study, we found that, for the MCF-10A library, 45% of the putative binding sites matched the consensus site at greater than or equal to 17 of 20 bp, and, for those binding sites, 56% had spacer regions of 2 bp or less. For the HMEC library, 83% of the putative binding sites obtained from the positive clones matched the consensus site at greater than or equal to 17 of 20 bp, and for those binding sites, 83% had spacer regions of 2 bp or less. In addition,
25% of the isolated fragments had p53 binding sites that fit the p53 consensus perfectly (a match of 20 of 20 bp). Preceding this study only two p53 target genes had been identified with binding sites that fit the consensus at 20 of 20 bp, the DDB2 and type IV collagenase genes (4, 68). Interestingly, the binding site of one of the novel target genes identified by our screen, the PPM1J gene, is a 20- of 20-bp fit to the canonical binding sequence. Although a majority of the binding sites recovered were in close concordance with the consensus binding site, highly degenerate response elements were also identified. For example, the highly degenerate, but physiologically relevant, second p53 response element in p21 (17), which matches the consensus at only 12 of 20 bp, was isolated. In addition, we observed trends involving the lengths of the binding site spacer regions; a majority of the novel binding sites contained a 0- to 2-bp spacer, indicating that p53 binding to genomic DNA under the conditions assayed was more prevalent at sites that have short spacers or none at all. This observation is supported by a previous finding showing that artificially constructed binding sites containing a 4- or 14-bp spacer have greater-than-90%-reduced activity in a yeast reporter assay system than did the same binding site with a 0-bp spacer (69). Our data also correlate with the characteristics of known p53 target gene binding sites,
70% of which contain 0- to 2-bp spacers (54).
The in vivo specificity of the binding sites recovered from our screen was evidenced by the fact that p53 preferentially bound to these sites even though some of the p53MH algorithm-derived binding sites in the region matched the consensus to an equal or higher degree, were closer to exon 1, or were more conserved. Our results show that p53 binding is not solely dependent upon the fit of the binding site to the consensus, and matching the consensus is not sufficient for p53 binding. Other factors could also play a role in dictating binding site specificity, including the sequence context surrounding the binding site as well as access of p53 to the DNA due to chromatin structure. In addition, posttranslational modifications of the p53 protein, such as phosphorylation and acetylation, and cofactor interactions could affect its DNA binding characteristics (5, 12).
In our analyses, we highlighted four novel p53 target genes involved in diverse signaling pathways, each potentially contributing to our further understanding of p53 biology. EDN-2 is a member of the endothelin family, first discovered as potent vasoactive peptides (2). The protein ET-2 can function as an autocrine survival factor in response to hypoxia in breast cells (27) and as a macrophage cell chemoattractant (28). The identification of the EDN-2 gene as a p53 target gene, in combination with the findings that p53 can regulate other genes involved in growth factor and survival signaling, such as genes encoding HB-EGF (heparin-binding epidermal growth factor) (20), EGFR (epidermal growth factor receptor) (46), HGF (hepatocyte growth factor) (49), TGF-
(transforming growth factor alpha) (61), and, most recently, p53CSV (p53-inducible cell survival factor) (53), strongly suggests that, in addition to the well-defined role p53 plays in promoting cell viability through cell cycle arrest and DNA repair, the protein can also play an active role in initiating survival pathway signaling in the cell.
The identification of the gene encoding PPM1J, a PP2C family member, as a p53 target gene contributes to the growing list of phosphatase genes regulated by p53, including the PTEN (62), DUSP5 (70), PAC1 (75), and WIP-1 (21) genes. Interestingly, WIP-1, also known as PPM1D, is also a family member of the protein phosphatase 2C family and inhibits UV-induced apoptosis by negatively regulating the p38 mitogen-activated protein kinase-p53 signaling pathway (67). The exact role of PPM1J in human cells is not known, but its gene is conserved in mice and rats, and the murine homolog, PP2CZ, can associate with the ubiquitin-conjugating enzyme 9 (Ubc9). Ubc9 is involved in the sumoylation of protein substrates, and the PP2CZ-Ubc9 interaction is enhanced in the presence of the small ubiquitin-related modifier (SUMO) protein (38). These observations suggest that PPM1J may be involved in the regulation of sumoylation processes in the cell. Since sumoylation regulates the localization and activity of proteins involved in numerous cellular functions including transcription, DNA repair, and chromatin structure (26), the involvement of p53 in the regulation of the gene could dramatically impact cellular dynamics.
Another gene identified as a p53 target in this study was the RPS27L gene (65, 78). Although the function of human RPS27L is unknown, data show that ARS27A, one of the three ribosomal protein S27 homologues in Arabidopsis thaliana, is involved in genotoxic stress-dependent mRNA degradation. Loss of ARS27A gene expression in Arabidopsis thaliana leads to hypersensitivity to methyl methanesulfate treatment that is accompanied by inhibition of seedling growth and formation of tumor-like structures instead of auxiliary roots. The ARS27A protein is not required for translation; instead it is needed for the degradation of damaged mRNA in response to UV (56). It is possible that the RPS27L gene could play a similar role in humans. Recently, another p53 target gene, encoding MCG10, a KH-domain containing RNA-binding protein, has been identified (80). These genes link p53 to pathways involved in RNA binding and regulation, and this aspect of p53 function has yet to be explored.
As mentioned previously, we identified candidate target genes whose functions are currently unknown. One such example is the UBTD1 gene, a gene encoding a protein containing an ubiquitin domain (9, 65). p53 is well linked to the ubiquitination pathway through its transcriptional regulation of four ubiquitin ligases, MDM2 (31, 42), COP1 (13), Siah1b (22), and Pirh2 (43). However, based on the limited information available for UBTD1, we cannot speculate on the role its gene may play in p53 signaling. Also, the UBTD1 gene is regulated in only a subset of the cell model systems assayed. This example illustrates the discriminatory nature of p53 target gene selectivity, which is dependent on various factors including, but not limited to, the cell types analyzed and the specific signals leading to p53 activation.
A number of candidate target genes identified in our screen were not regulated in a p53-dependent manner, although we validated binding to DNA regions adjacent to the genes through additional ChIP analyses (the PDGFC gene being the example shown). There are many potential reasons for this observed lack of transcriptional regulation. For instance, it is likely that p53-dependent expression of these genes could be tissue/cell type specific and/or specific to the type of cellular stress activating p53, which has been reported from other studies analyzing the regulation of p53-inducible genes (76, 79). In addition, it is possible that p53 regulates antisense or noncoding RNAs not analyzed in this study. The transcriptome consists of an RNA population that is larger than what would be expected if only taking into account the protein-coding transcripts (10, 37, 57, 60). By analyzing a subset of the novel transcripts in the transcriptome, it was concluded that they represent RNA that is most likely not protein encoding, and some of the transcripts even had an orientation that was antisense to a known protein-coding region (37). In addition, when Cawley et al. examined TF (Sp1, cMyc, and p53) binding on chromosomes 21 and 22 using high-density oligonucleotide arrays, they found that many binding sites were significantly correlated with noncoding RNAs. They concluded that there are about the same number of protein-coding and noncoding genes in the transcriptome and that these two types of genes can be bound and regulated by common TFs (8). Alternatively, p53 occupancy at certain response elements (without measurable gene regulation) may affect chromatin remodeling in select regions of the genome. It is well established that p53 is required for global genomic repair in the nucleotide excision repair pathway following DNA damage (29). Further, p53-dependent global chromatin relaxation occurs in response to low doses of UV that are not high enough to activate the protein's activity as a transcription factor (45, 58, 59, 74). Also, p53 can impact histone H3 posttranslational modifications under normal cellular conditions (1). These data suggest that p53 can play a role in chromatin relaxation and remodeling.
The lack of identification of p53-repressed genes was not surprising, given that the majority of p53-repressed genes lack a consensus p53 binding site (33) and thus would not have been isolated. Also, we did not identify DNA fragments adjacent to genes involved in apoptotic signaling, consistent with our finding that ADR (350 nM) does not induce apoptosis in HMEC and MCF-10A cells and the findings of other studies showing lack of constitutive p53 binding to proapoptotic genes (18, 66).
The screen presented herein can be used to directly identify novel genes regulated by other TFs of interest (that maintain activity in yeast) on a genome-wide scale. Comparative analyses of functional binding sites occupied by transcriptional activators under various cellular conditions can be performed with this type of screen. As more genomic, proteomic, and bioinformatic advances are made, the hope is that ChIP-based methods will allow further characterization of protein complexes and chromatin structure at binding sites identified by methods such as those described in this study.
We thank the members of the Pietenpol laboratory for helpful discussions and critical reading of the manuscript.
Supplemental material for this article may be found at http://mcb.asm.org/. ![]()
|
|
|---|
B-binding sites across human chromosome 22. Proc. Natl. Acad. Sci. USA 100:12247-12252.
Np63
phosphoprotein binds the p21 and 14-3-3
promoters in vivo and has transcriptional repressor activity that is reduced by Hay-Wells syndrome-derived mutations. Mol. Cell. Biol. 23:2264-2276.
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»