Previous Article | Next Article ![]()
Molecular and Cellular Biology, March 2008, p. 2047-2058, Vol. 28, No. 6
0270-7306/08/$08.00+0 doi:10.1128/MCB.01861-07
Copyright © 2008, American Society for Microbiology. All Rights Reserved.
,
Diletta Dolfini,
Daniele Merico,
Raffaella Gatta,
Alessandra M. Viganò,
Giulio Pavesi, and
Roberto Mantovani*
Dipartimento di Scienze Biomolecolari e Biotecnologie, Università di Milano, Via Celoria 26, 20133 Milano, Italy
Received 12 October 2007/ Returned for modification 5 November 2007/ Accepted 27 December 2007
|
|
|---|
|
|
|---|
Different entities contain the word CCAAT in their acronyms, but several types of evidence indicate that NF-Y, also termed CBF and HAP2/3/5 for Saccharomyces cerevisiae, is the CCAAT regulator. (i) Highly specific antibodies were used in supershift electrophoretic mobility shift assays and chromatin immunoprecipitations (ChIPs) with a plethora of different promoters (33; D. Dolfini and R. Mantovani, unpublished data). (ii) Specific dominant negative NF-YA vectors were employed in cotransfection and adenovirus infection experiments. (iii) Nucleotides flanking the CCAAT box emerged in the bioinformatic studies cited above, with perfect matches to NF-Y preferences, as assayed by in vitro binding studies (15). It is therefore reasonable to conclude that NF-Y is by far the major protein that regulates this element.
NF-Y is a ubiquitous heteromeric transcription factor (TF) composed of three subunits, NF-YA, NF-YB, and NF-YC, all necessary for DNA binding (34). NF-YB and NF-YC contain conserved histone fold motifs composed of three alpha helices separated by short loop/strand regions (42). NF-YB/NF-YC association is essential for NF-YA binding and sequence-specific DNA interactions. Interestingly, CCAAT boxes are present at a specific location within promoters, typically between –60 and –100 bp from the transcriptional start site (TSS). Within this context, NF-Y is not a powerful activator but rather a promoter organizer that cooperates with the activity of neighboring TFs. The emergence of genome-wide technologies now allows a look at TF binding in a more systematic and unbiased way. Previous studies, performed with CpG island arrays and with an oligonucleotide array representative of a small set of human promoters, left us with an inconclusive picture as to the widespread NF-Y distribution in vivo (12, 51).
Moreover, the simple annotation of TF binding sites is not per se sufficient to obtain satisfactory functional information, since most TFs are known to act in a peculiar chromatin environment, defined by patterns of histone posttranslational modifications. Some of them are associated with accessible, transcriptionally active chromatin, others with repressive heterochromatin, either constitutive or facultative (reviewed in references 4 and 43). In particular, H3K4 trimethylation (H3K4me3) and H3K9-14 acetylations (H3K9-14ac) are associated with an active chromatin environment in regions and promoters that are transcribed or poised for rapid induction by external stimuli (26, 44, 46). Their presence in vivo has been detailed at the single-gene level, and location analysis confirmed their widespread distribution in the proximity of promoters (2, 6, 23, 25, 26, 30, 31, 39, 40, 41). Interestingly, while the presence of these marks precedes gene activation, an increase in their levels is generally noticed in systems of inducible transcription. In NF-Y-dependent endoplasmic reticulum stress promoters, for example, a substantial increase in H3 acetylation and H3K4me3 was seen after induction, while NF-Y binding was detailed before (3, 14). On the other hand, posttranslational modifications such as H3K9, H3K27, and H4K20 methylations are known to be associated with inactive or actively repressed areas of the genome (4, 18, 38, 47). In general, understanding the relationship between histone modifications and TF binding is quite relevant; it is not completely clear, at the moment, whether the binding of certain factors is required for specific modifications to be brought in through recruitment of histone-modifying enzymatic machines or whether the binding of TFs is allowed only by a preexisting nucleosomal environment with an appropriate pattern of histone modifications. To shed light on the relationship between NF-Y and active histone marks, we used high-density tiling arrays of chromosomes 20, 21, and 22 in ChIP-on-chip experiments.
|
|
|---|
ChIP, amplicon generation, and ChIP on chip. ChIPs were performed essentially as described previously (51). Briefly, 5 x 106 cell equivalents of chromatin, <0.5 kb, were immunoprecipitated with 15 µg of anti-YB, anti-H3K4me3 (Abcam), and anti-H3K9-14ac (Upstate) antibodies. Immunoprecipitation-enriched DNAs were used to generate amplicons for hybridization experiments. Parallel ChIPs were run with a Flag control antibody (Sigma), and bona fide NF-Y targets were used to check enrichments before and after the amplification steps. The generation of amplicons from the individual ChIPs was performed by following the protocol of ligation-mediated PCR previously described (9, 54). Design of the oligonucleotides, preparation of the slides, hybridization, and scanning of the fluorescence intensities were performed by Nimblegen. Validations of the results by semiquantitative PCR were performed on independent, nonamplified ChIPs using primers listed in File S1 in the supplemental material. Negative-histone-mark ChIPs were also performed as described previously (51) using anti-H3K9me3 (Abcam), -H3K27me3 (Abcam), and -H4K20me3 (Upstate) antibodies.
ChIP-on-chip data analysis. The Cy5 and Cy3 raw data obtained from each experiment were normalized by subtracting the median and then adding back the average median of the two channels. The independent median normalization was chosen since channel distributions were not parametric and therefore the mean was not suitable. This approach does not perturb the detection of biologically relevant peaks, while allowing efficient removal of noise due to unbalance between the two channels (not shown). After single-channel normalization, the Cy5/Cy3 ratio was calculated for each probe and then converted in the corresponding Z score. This kind of transformation is useful when seeking to compare the relative standings of items from distributions with different means and/or standard deviations and fitted our requirement of performing comparison between different arrays. Box-and-whisker plots clearly showed that Z-score conversion allowed for a direct comparison of independent data sets derived from biological replicates (not shown). Peak search was then performed essentially as previously reported (9), with the so called "peak first" approach. A given percentile threshold was chosen for all experiments, and all probes with enrichment values greater than the percentile were selected. An NF-Y "peak" was defined as a stretch of at least three adjacent probes (spaced by no more than 150 bp on the genomic sequence) exceeding the percentile threshold. In H3K9-14ac and H3K4me3 experiments, five adjacent probes were considered to define a peak. A hypergeometric P-value-based distribution was then associated with the peaks obtained. Finally, peaks obtained from the different experiments were merged, and a given peak was predicted as biologically relevant if present in at least two out of four experiments for NF-Y or two out of three experiments for active histone marks, with corresponding probability values computed with a binomial distribution.
Profiling experiments. The GEO GSE6022 data set, containing a total of three untreated HeLa cell samples, each being a biological replicate, was used for HeLa profiling experiments. The GEO GSE6207 data set, containing seven untreated samples, was used for HepG2 profiling experiments. The data set from Sato et al. (45), containing three untreated samples, each being a biological replicate (Affimetrix HG-U133-A platform), was used for human embryonic stem (ES) cell profiling experiments. The GEO GSE8884 (Affimetrix HG-U133 plus 2.0 platform) was also used for the latter analysis, yielding essentially the same results despite the larger coverage of the platform (not shown).
Expression signals for microarray data were calculated according to the rma algorithm and used for quality control. Absent (A) and present (P) calls were also calculated according to the open-source version of the MAS5 algorithm and used both for quality control and comparisons. A gene was called P if it was present at least in two out of three of the samples. The same rules were applied for A calls. A gene was called UN if it did not fall in the previous categories. The A, P, and UN groups have no overlaps. The details of the analysis are available upon request.
|
|
|---|
![]() View larger version (60K): [in a new window] |
FIG. 1. NF-Y binding to CCAAT promoters in vivo. Twenty-five randomly picked CCAAT promoters from chromosome 21 were analyzed by ChIP with anti-NF-YB antibody on chromatin derived from seven different cell lines. Enrichment over that for an irrelevant Flag control antibody was assessed by semiquantitative PCR and classified as high (>30-fold), medium (5- to 30-fold), or low (2- to 5-fold). Colored boxes indicate positivity and gray boxes indicate no enrichment in ChIPs. The bottom line refers to percentages of NF-Y-positive sites for each cell line. The expression patterns of the respective genes were analyzed according to available transcriptome data (UniGene build no. 186) and reported on the right. UB (ubiquitous) refers to genes expressed in at least 10 out of 45 body sites of UniGene's EST Profile Viewer, while TS (tissue specific) refers to genes showing expression in less than 10 body sites.
|
|
View this table: [in a new window] |
TABLE 1. NF-Y location analysis: binding sites and TUs
|
Independent ChIP validations were performed on 35 loci derived from the less-stringent criteria: all NF-Y promoter loci tested scored positive in standard ChIPs and so did 11/15 sites among the "genes" cohort (see File S3 in the supplemental material). Thus, based on the prevalidation and validation ChIPs, the highest stringency reported in Table 1 highly underscored the extent of NF-Y binding and the 1.43–4 stringency reflected more closely the actual targets, with 10 to 15% false positives/negatives. Moreover, since the co-occurrence of nearby positive probes is expected to be enriched in location analysis, we monitored our peak finding procedure by randomizing each experimental track; as expected, this procedure dramatically reduced the number of loci identified (see File 4 in the supplemental material), confirming the robustness of our approach. Therefore, we pursued further analysis mainly focusing on this stringency.
We next analyzed NF-Y sites in terms of transcriptional units (TUs), defined as University of California, Santa Cruz, RefSeq-annotated genes (hg17 assembly) with their respective promoters; overall, there are 907 TUs (with no redundant promoter) within the regions considered here. As expected, by comparing the number of NF-Y promoter locations with that of the corresponding TUs (191 versus 196), we could recover, on average, one location per positive promoter. A different picture emerged for NF-Y GE category sites, since the number of NF-Y+ TUs was significantly lower than the overall number of locations: 430 versus 1,020. Considering that some of these locations referred to overlapping or divergent units, this implies that on average two or three NF-Y binding sites were found per positive TU.
We also analyzed in greater detail the distribution of NF-Y sites (Fig. 2A). Within PR, most sites reside in the core area, as expected. Within GE, the site distribution shows a relatively steep decline as a function of the distance from the TSS, in agreement with a relevant role for NF-Y at the 5' ends of genes. Interestingly, we found that more than 50% of NF-Y+ promoters possess an additional NF-Y site within the body of the gene (Fig. 2B). Within the EL category, one-third of the locations overlap (20%) or are nearby (9% + 4%) GenBank-annotated mRNAs with no RefSeq definition, which are most likely sites of Pol II activity (Fig. 2A).
![]() View larger version (22K): [in a new window] |
FIG. 2. Categories and locations of NF-Y binding sites. (A) NF-Y promoter sites were plotted according to their positions with respect to the TSS. The frequencies of GE sites versus their distance from the TSS are also represented. EL sites were classified as present within 5 kb to the 5' or 3' end of an annotated gene, as present within a GenBank-annotated human mRNA not corresponding to a RefSeq (hmRNAs), or far from annotated genes (other). (B) Venn diagrams showing the overlap between NF-Y TUs containing a site in the promoter and those containing a site within the body of the gene.
|
|
View this table: [in a new window] |
TABLE 2. NF-Y sites containing at least one CCAAT box
|
![]() View larger version (11K): [in a new window] |
FIG. 3. CCAAT boxes in NF-Y locations. Average numbers of CCAAT boxes present in the different NF-Y location categories were plotted for the four stringencies initially considered. This number was calculated based on the actual mean dimensions of the identified NF-Y binding sites, ranging from 460 to 560 bp depending on the stringency, divided by the theoretical occurrence of the pentanucleotide CCAAT (once per 512 bp).
|
|
View this table: [in a new window] |
TABLE 3. Active histone mark location analysis
|
![]() View larger version (29K): [in a new window] |
FIG. 4. Correlation between NF-Y binding and active histone marks. (A) Percentages of NF-Y+ TUs that scored positive for either H3K9-K14ac or H3K4me3 or for both histone marks. (B) Percentages of H3K9-K14ac- or H3K4me3-positive TUs and of H3K9-K14ac/H3K4me3 double positives that shared at least one NF-Y location. (C) Position of NF-Y binding sites and histone modification islands within a representative cluster of five genes.
|
Correlation between NF-Y, H3K9-14ac, H3K4me3, and gene expression. Next, we interrogated the list of NF-Y+ TUs with respect to both histone marks and expression using available HeLa profiling data sets obtained on the Affymetrix U133 2.0 Plus platform. Overall, 59% of the 907 TUs analyzed here were in the P set and 39% in the A set. As expected, the majority, 75%, of NF-Y+ promoters followed P calls (Fig. 5A). NF-Y+ genes showed modest skewing toward expression, with P calls at 67% (Fig. 5B). We then integrated the active histone mark data into this analysis. Within promoters, the NF-Y+ H3K9-14ac+ and NF-Y+/H3K4me3+ clusters showed clear skewing toward P calls, and even more did the triple-positive NF-Y+/H3K9-14ac+/H3K4me3+ clusters; only a residual 10% of TUs were scored as A in the latter class (Fig. 5A). On the other hand, the NF-Y+/H3K9-14ac–/H3K4me3– promoters were strongly enriched in A calls (Fig. 5A), indicating (i) that NF-Y is bound at these promoters in the absence of active histone marks and (ii) that this cohort of genes is not expressed. A somewhat similar situation emerged for NF-Y+ genes: enrichment of NF-Y+/H3K9-14ac+/H3K4me3+ clusters toward expression was less evident, but once again the majority of NF-Y+/H3K9-14ac–/H3K4me3– genes were found in A calls (Fig. 5B). To gain insights into this dual behavior, NF-Y-positive TUs were subjected to functional enrichment analysis after dividing them into those containing at least one active histone mark and those devoid of both modifications. Pathways expected to be positively affected by NF-Y (cell cycle, cell signaling) were present in the NF-Y+/H3K9-14ac+/H3K4me3+ cohort of genes, but no single function was overwhelmingly enriched; this is consistent with the notion that NF-Y is a broad transcriptional regulator, with specific roles in certain cellular functions (see File S7A in the supplemental material). A different picture emerged with NF-Y+ TUs devoid of active marks, since functions expected to be repressed in HeLa cells, sensory perception and immune response, were enriched (see File S7B in the supplemental material). A graphic representation of an NF-Y+/H3K9-14ac–/H3K4me3– locus, SUHW1, is shown in Fig. 5C, and the complete list of NF-Y+/H3K9-14ac–/H3K4me3– TUs is reported in File S8 in the supplemental material. Altogether, these data indicate that NF-Y binding is associated with two different chromatin states and opposite functional outcomes: in the presence of the two active histone marks, NF-Y is bound to expressed loci; binding within areas devoid of these modifications is coupled to inactive genes.
![]() View larger version (18K): [in a new window] |
FIG. 5. Correlation between NF-Y binding, active histone marks, and gene expression. (A) NF-Y+ promoters were scored for the presence (P) or absence (A) in the GSE6022 data set after the intersection with histone mark experiments. (B) Same as panel A, except that NF-Y+ genes were analyzed. (C) An NF-Y locus within the promoter of a representative nonexpressed gene, devoid of active histone marks, is shown.
|
![]() View larger version (20K): [in a new window] |
FIG. 6. ChIP with negative histone marks in nonexpressed NF-Y+ loci. Twelve NF-Y+/H3K9-K14ac–/H3K4me3– sites, randomly selected from the list of targets not expressed in HeLa cells (see File S7 in the supplemental material) were analyzed by ChIP with anti-NF-YB, anti-H3K27me3, anti-H3K9me3, and anti-H4K20me3 antibodies. Enrichment over that for an irrelevant Flag control antibody was assessed by semiquantitative PCR in duplicate experiments; enrichments greater than twofold were considered significant. The chromosome 11 satellite centromeric region and the promoter of the transcribed NF-Y target SON were used as internal controls.
|
NF-Y is an activator as well as a repressor. The binding of a TF to a target DNA sequence does not necessarily imply direct transcriptional regulation. To assay the role of NF-Y binding in gene expression, we infected HeLa and HCT116 cells with adenoviral vectors coding for an NF-YA dominant negative protein (Ad-YA-m29) containing a mutation in the DNA binding domain; the mutant still associates with the histone fold NF-YB/NF-YC dimer but renders the trimer incapable of CCAAT association (see reference 27 and references therein). In parallel, we infected cells with wild-type Ad-NF-YA and Ad-GFP viruses. Under these conditions, expression of CCAAT-containing, NF-Y-dependent promoters was previously shown to be crippled (51). Figure 7 shows RT-PCR analysis of expressed NF-Y targets associated with positive histone marks (top); dominant negative YA treatment led to decrease in expression of the genes analyzed, while little effects were observed in the control wild-type Ad-GFP or Ad-NF-YA infections. The mRNA levels of control, NF-Y-independent genes such as GAPDH were unaffected. Similar results were obtained with the HCT116 cells (Fig. 7, top right). In parallel, we also analyzed NF-Y loci not expressed and associated with negative histone marks; some of these genes, notably CYP24A1 and SERPIND, were clearly and specifically up-regulated by the YA-m29 treatment, both in HeLa and in HCT116 cells. Note that the former had high levels of H3K27me3 and low H4K20me3; SERPIND showed the opposite pattern. SUHW1, which was low on both these marks, was very modestly induced by YA-m29. Interestingly, all these genes contain prototypical CCAAT boxes in their promoters. We conclude that the presence of an active NF-Y trimer is required both for the expression of active genes and for repression of some of the NF-Y targets associated with negative histone marks.
![]() View larger version (22K): [in a new window] |
FIG. 7. Effects of NF-Y removal on gene expression. HeLa cells (left) were infected in parallel with control Ad-GFP, wild-type Ad-NF-YA, and the dominant negative Ad-YAm29 adenovirus. RT-PCR analyses of infected cells were performed in the linear range of amplification for the indicated genes. HCT116 cells (right) were analyzed under the same conditions. Western blot analysis confirmed equivalent overexpression of wild-type NF-YA and of YAm29 proteins (not shown).
|
|
|
|---|
NF-Y location analysis. Several bioinformatic studies have found CCAAT boxes in >60% of human promoters (19, 35, 48, 49); specific pathways, such as those involving cell cycle- and endoplasmic reticulum stress-regulated genes, were found to be quite enriched for CCAAT-containing genes (16, 50, 57). Interestingly, analysis of microRNA control sequences identifies NF-Y as one of the few TFs involved in the regulation of all of them (29).
The first issue tackled here is how frequent are NF-Y binding sites. The answer to this depends, to some extent, upon the stringency of the analysis. Monitoring 25 promoters (Fig. 1) and validations of additional loci helped us fine-tune our analysis, so that we "lose" only 10 to 15% of the positive locations, keeping the false-positive sites at the same level. The two extremes range from 757 to 2,120 sites; the former is an absolute minimum that considers only very-high-affinity sites, the latter is a wider look at low-affinity sites or at sites present only in a subcategory of cells, as ChIPs were made from a heterogeneous population. Extrapolation of these numbers to the whole genome brings NF-Y sites to 35,000 to 80,000, which is high compared to extrapolations performed for other TFs such as E2F1 (20,000 to 30,000 binding sites [9]). However, it should be noted that there are at least 200,000 NF-YB molecules within the nuclei of various cell lines (R. Mantovani, unpublished data). These data are consistent with the idea that NF-Y is involved in a wide range of RNA production procedures, both positive and negative; indeed, the TUs that contained at least one NF-Y site are a majority, 519 out of 907.
The second issue tackled by this work concerns the presence of the CCAAT consensus in the identified NF-Y sites. Previous location analysis experiments performed with similar platforms reported a low rate of consensus sites near locations of TFs (9, 11, 17), while others suggested significant variation from the in vitro-derived consensus (56). It was found, using stringent criteria, that >60% of NF-Y binding sites contain a CCAAT consensus, particularly promoter sites. Furthermore, 1.5 to 2.5 CCAAT boxes per locus are found; thus, the presence of one or more CCAAT boxes is in general important for NF-Y binding, in agreement with biochemical in vitro work (15). Note that the total number of CCAAT-containing promoters in the cluster analyzed here is 463 out of 907 (51%), which is lower than the 60 to 67% derived from other studies (48). Nonetheless, around 35% of the identified NF-Y loci are apparently devoid of CCAAT boxes. There are explanations for this finding. (i) Variation of a single nucleotide in the CCAAT pentanucleotide was reported to be compatible with NF-Y binding, especially when other functionally important overlapping sites are involved (20, 37). A degenerate CCAAT box would be missed in our stringent analysis. (ii) The ChIP-on-chip experiments were performed with an anti-NF-YB antibody; since NF-YB and NF-YC subunits are in excess with respect to NF-YA (Mantovani, unpublished), some CCAAT-less sites devoid of the sequence-specific NF-YA might have emerged.
The last point raised by our analysis is related to the genomic location of the identified NF-Y binding sites. The CCAAT box has long been considered almost exclusively as a promoter element crucial for Pol II recruitment (24). In keeping with expectations (33), the vast majority of NF-Y PR sites, 70%, are positioned near the TSS, and there is clear enrichment of sites near the TSS also within the GE cohort (Fig. 2). However, NF-Y locations appear to be more scattered, and roughly 50% of NF-Y-positive promoters contain an extra site within the body of the gene (Fig. 2). It is possible that their role in many such cases is related to promoter-enhancer connections, while other GE locations could be alternative promoters of the same gene or sites of divergent RNA production. Moreover, within the EL cohort, a large number of sites are located in mRNA-expressing areas, sites of Pol II activity. This accounts for 20% of the 909 EL peaks (Table 1 and Fig. 2A). If we apply to these Pol II/NF-Y+ units the same ratio of 2 sites/unit, as measured in bona fide NF-Y RefSeq GE sites, at least 100 additional Pol II units would be added. Extrapolation to the whole genome gives 4,500 NF-Y+ units outside of the annotated RefSeq genes. The remaining EL peaks could either be enhancer sequences located at a distance from transcribed regions or TUs whose RNAs have not been annotated yet.
NF-Y, active histone marks, and transcription. It is clear that H3K4me3 acetylation and H3K9-14 acetylation are hallmarks of active genes (6, 40, 43). Our data for HeLa cells confirm previous results for HepG2, obtained using similar criteria of analysis (6). First of all we find an excellent correlation between active mark islands and gene expression in HeLa cells (see File S5 in the supplemental material). Second, as the overlap between expressed and not expressed genes in HeLa and HepG2 cells is quite high, 85%, we found a remarkable degree of coincidence between histone mark sites in the two cell lines, with deviations only for H3K4me3 within genes, an effect due to cell type-specific patterns and/or to our underestimation of such a category (see File S3 in the supplemental material). A similar picture emerged also by comparison with the data set of Guenther et al. (23), obtained with human ES cells (see File S5I to K in the supplemental material). As expected, skewing toward expression was less evident for H3K4me3-positive promoters in ES cells.
NF-Y binding has been so far mostly associated with two possible transcriptional states: an actively transcribing gene and a poised promoter, with a prebound NF-Y ready to help recruit whichever TF is specifically responsible for full activation, following a stimulus. Indeed, the majority of NF-Y-bound units contain H3K9-14ac and H3K4me3 sites and are expressed. Note that active genes are expected to be readily discernible in Affymetrix profiling data, while the levels of the inducible ones might not, thus underscoring this already significant correlation.
It has long been known that the binding of TFs and cofactors to promoters is a hallmark of expression, by signaling to the Pol II transcription machinery the positional coordinates (52). It would seem logical, therefore, to postulate that positive histone marks are positioned according to a code determined by the sequence-specific TFs. Reassuringly, several studies have confirmed that TF binding indeed correlates with the presence of "active" marks and with gene expression. In a thorough study performed with quantitative PCR on the correlation between >30 histone modifications and MYC sites, Guccione et al. (22) concluded that the presence of H3K4me2/3, H3K9-14ac, and H3K79me2 is a prerequisite for MYC binding. Interestingly, elimination of MYC had little effect on the levels of these modifications, unlike what was found for H4 acetylations, which were decreased (22). Clearly, this is not the case for NF-Y, since our unbiased analysis identified NF-Y regions devoid of these active histone marks.
The most surprising result is the identification of the discrete cohort of NF-Y+ TUs in which H3K9-14ac and H3K4me3 islands are absent and specifically enriched in A calls (almost 80%); this indicates that NF-Y is not always associated with active/inducible transcription. This cluster comprises mostly tissue-specific and developmentally regulated genes (see File S7 in the supplemental material for a complete list) and is associated with repressive histone marks such as H4K20me3 and H3K27me3. While hints at a negative role for NF-Y in transcription had previously been reported (5, 27, 32, 53), we are intrigued by the extent of this phenomenon, which involves 20 to 25% of the total binding sites. Significantly, we find that removal of the NF-Y trimer leads to activation of these loci. Among the derepressed, CYP24A1, but not SERPIND, is H3K27me3+. It is known that this modification is associated with Polycomb (47); hints to a possible mechanism of repression through deposition of this negative mark come from recent experiments with Caenorhabditis elegans, in which ceNF-Y function has been genetically linked to Polycomb through direct interaction with the ESC/E(Z) component (13).
The structural resemblance of NF-YB-NF-YC to H2A-H2B should be remembered when considering the bifunctional behavior of NF-Y. The first sign of an opening chromatin cluster, and one specifically required for H3K4me3, is monoubiquitination of H2B on K123 (28); we noted that lysines are present in the corresponding region of the H2B-like NF-YB, and indeed NF-YB is monoubiquitinated (G. Donati and R. Mantovani, data not shown). On the other hand, H2A is monoubiquitinated by Polycomb components, and the functional significance of this modification is opposite to that of H2B-ubiquitin, leading to repression of transcription (55). Thus, H2A-ubiquitin could serve as a signal to recruit H3K27 methyltransferase. Given the dual dominant NF-Y function uncovered here, we are tempted to speculate that a code of posttranslational modifications may exist for the histone-like NF-Y.
The bivalent behavior of NF-Y and its independence from a specific pattern of activating versus repressive histone modifications might have important consequences for transcriptional regulation at bivalent loci that have been recently mapped in both ES and lineage-committed cells. At these sites, in fact, the activating H3K4me3 and the repressive H3K27me3 are present together on areas of the genome "poised" for alternative developmental fates (1, 7, 36). Interestingly, the role of NF-Y in stem cells has been highlighted by two recent studies: (i) NF-YA was shown to be required for maintenance of hematopoietic stem cells (58) and (ii) the CCAAT box was found to be specifically enriched in conserved regions of genes highly expressed in mouse and human ES cells, with NF-Y being required both for regulation of these elements and for cell survival (21). Moreover, a switch in the two major isoforms of NF-YA was noticed upon differentiation, with the "short" form being highest in stem cells and decreasing in embryonic bodies. Note that this isoform is specifically required for stemness in the hematopoietic system (58). These results, considered together with the widespread bifunctional role of NF-Y and its independence from specific methyl marks shown here in committed HeLa cells, suggest that NF-Y might be associated with bivalent sites in ES cells, possibly even regulating their positioning. Because of its structure and histone interactions (10), NF-Y would be ideal to maintain nucleosome-free areas, accommodate accessibility of nearby TFs, and recruit modifying complexes, positive or negative. This hypothesis and the cause-and-effect relationships between the positioning of histone marks and NF-Y binding will now be investigated with appropriate genetic experiments.
Published ahead of print on 22 January 2008. ![]()
Supplemental material for this article may be found at http://mcb.asm.org/. ![]()
These authors equally contributed to this work. ![]()
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»