Previous Article | Next Article ![]()
Molecular and Cellular Biology, December 2003, p. 8773-8785, Vol. 23, No. 23
0270-7306/03/$08.00+0 DOI: 10.1128/MCB.23.23.8773-8785.2003
Copyright © 2003, American Society for Microbiology. All Rights Reserved.
Department of Molecular Genetics, The University of Illinois at Chicago, Chicago, Illinois 60607,1 Department of Tumor Cell Biology, St. Jude Childrens Research Hospital, Memphis, Tennessee 38105,2 MUNIN Corporation, Chicago, Illinois 606123
Received 22 April 2003/ Returned for modification 16 June 2003/ Accepted 21 August 2003
| ABSTRACT |
|---|
|
|
|---|
gene's G+C-rich promoter. In situ binding assays, immunoprecipitation, and Western blot analyses demonstrated that mGPBP is a nuclear factor that can form complexes with TATA-binding protein, TFIIB, TFIIF, RNA polymerase II, and P300/CBP both in vitro and in intact cells. In cotransfection assays, increased mGPBP expression transactivated the murine Ada gene's promoter. Sequestering of GPBP present in HeLa cell nuclear extract by immunoabsorption completely and reversibly suppressed extract-dependent in vitro transcription from the murine Ada gene's G+C-rich promoter. However, transcription from the human Topo II
gene's TATA box-containing G+C-rich promoter was only partially suppressed and the adenovirus major late gene's classical TATA box-dependent promoter is totally unaffected under identical assay conditions. These results implicate GPBP as a requisite G+C-rich promoter-specific transcription factor and provide a mechanistic basis for distinguishing transcription initiated at a TATA box-deficient G+C-rich promoter from that initiated at a TATA box-dependent promoter. | INTRODUCTION |
|---|
|
|
|---|
Transcription initiation of a large number of mammalian genes, including most housekeeping and central nervous system genes and many highly regulated genes that control cell growth and differentiation, is under the control of G+C-rich promoters (2, 20, 23, 26, 28, 31). This class of promoter is not found in either the Drosophila melanogaster or yeast genome. Because a common characteristic of this type of promoter is the presence of a noncanonical TATA box and one or more Sp1 binding sites upstream of the major transcription initiation site, several reports have claimed that those sequences are the key functional elements in G+C-rich promoters (6, 18). However, reports that challenged these claims, in some cases even with respect to the same promoter, also have been published (2, 24, 25). The lack of obvious conserved sequence motifs shared among different G+C-rich promoters, with the exception of the Sp1 binding sites and the "noncanonical" TATA boxes, provides further impetus for investigations to elucidate how transcription can initiate nonrandomly at these precise genome locations.
The murine Ada gene has an archetypal G+C-rich promoter that contains multiple Sp1 binding sites and the noncanonical TATA-like element TAAAAAA. Neither the Sp1 binding sites nor the TAAAAAA sequence is required for basal or enhancer-activated promoter function (1, 2; C.-Y. Yeung et al., unpublished data). Instead, the minimal self-sufficient promoter activity resides within a 48-bp minimal self-sufficient promoter element (MSPE) that displays an imperfect dyad symmetry (2). Many other G+C-rich promoters contain elements with similar secondary structure-forming potential around their transcription initiation sites (2). We previously demonstrated that there are nuclear protein-binding sites within the Ada gene's MSPE (2). This observation led us to attempt the cloning and characterization of the nuclear protein responsible for directing the assembly of the RNA polymerase II transcription initiation complex at this promoter.
A multimerized MSPE probe was used to identify and clone a full-length cDNA that encodes a ubiquitously expressed 66-kDa murine protein. This 66-kDa protein, designated the murine G+C-rich promoter binding protein (mGPBP), can bind specifically to both the murine Ada gene's MSPE and the nonhomologous human Topo II
gene's G+C-rich promoter, as revealed by competitive gel mobility shift assays. The coding sequence of mGPBP displays domain homologies with those encoding several known transcription factors. Purified recombinant mGPBP generated from Escherichia coli was used to raise polyclonal rabbit antiserum against the protein. In situ immunostaining studies showed that mGPBP is localized in the nucleus. Immunoprecipitation and Western blot analyses revealed that GPBP can form a complex with TBP, transcription fraction IIB (TFIIB), TFIIF, RNA polymerase II, and P300/CAAT binding protein (CBP) both in vitro and in vivo. In cotransfection experiments performed on various mammalian cells, elevated mGPBP expression was found to transactivate a murine Ada gene promoter-controlled reporter gene. Immunoabsorption-induced sequestering of GPBP from HeLa cell nuclear extract resulted in a complete suppression of extract-dependent in vitro transcription directed by the mouse Ada gene's G+C-rich promoter. This suppression was reversed by the replenishment of the nuclear extract with purified recombinant mGPBP. The GPBP requirement for transcription initiated at this G+C-rich promoter is promoter specific. Parallel experiments showed that similar immunoabsorption of GPBP in HeLa nuclear extract had no adverse effect on transcription initiated at the adenovirus major late gene's classical TATA box-dependent promoter and only a partial and reversible suppressive effect on the human Topo II
gene's G+C-rich promoter, which contains a perfect TATAAA consensus element.
These results thus describe the discovery of a novel requisite promoter-specific transactivating transcription factor with demonstrable abilities to both bind a typical G+C-rich promoter and interact with multiple transcription factors that comprise the core mammalian RNA polymerase II transcription initiation complex. Since GPBP is not required for transcription directed by the adenovirus major late gene's classical TATA box-dependent promoter, the results also provide definitive proof that transcription initiated at the murine Ada gene's G+C-rich promoter is mechanistically distinct from that initiated at a classical TATA box-dependent promoter.
| MATERIALS AND METHODS |
|---|
|
|
|---|
gt11 bacteriophage expression library of 3 x 107 clones was screened with a 32P-labeled multimerized Ada promoter fragment C (2) probe. The restriction fragment C was isolated from a plasmid, gel purified, and multimerized by ligation end to end in the presence of T4 ligase. Multimers containing four or five copies of the fragment were gel purified, end labeled with polynucleotide kinase and [
-32P]ATP, and used to screen the expression library according to the method of Singh et al. (34). The full-length mGPBP clones were generated by using, for the 5' rapid amplification of cDNA ends (RACE), the oligonucleotide primer 5'-CAGGCTGGAGCAAAGTCATGCTGCGCC-3' and the AP1 primer in the first PCR and the oligonucleotide primer 5'-AGGTCCAGTCTCTCCAACTCAGTGAAAC-3' and the AP2 primer in the nested PCR. For the 3' RACE, the first PCR was performed with the oligonucleotide primer 5'-CTGATCTGTGACTTCAAGTTTGGACC-3' and the AP1 primer and the oligonucleotide primer 5'-TGACGATGTGTGAAGGAGATCCTCACAGC-3' with the AP2 primer were used for the nested PCR according to the recommendations of the manufacturer of the Marathon-ready mouse brain cDNA templates and Advantage cDNA PCR kit (Clontech), whose products were used for the PCRs. Northern blot analyses. Mouse multiple-tissue Northern blots (MTN; Clontech) were hybridized with mGPBP cDNA probes and washed under standard conditions (29). The probes used were gel-purified, individually cloned mGPBP cDNA restriction fragments. The blots used were stripped as previously described (C.-Y. Yeung et al., unpublished data) and reprobed with a ß-actin cDNA probe as a loading control. All probes were synthesized by the random hexanucleotide-priming method (14) and purified with a Bio-Gel P-30 column (Bio-Rad).
mGPBP expression constructs. The mGPBP cDNA's open reading frame (ORF) sequence was inserted downstream of the hexahistidine tag coding sequence of pET28a (Novagen) to yield the bacterial expression vector pETmGPBP, which can synthesize recombinant His-tagged mGPBP upon IPTG (isopropyl-ß-D-thiogalactopyranside) induction. The bacterial expression vector used to synthesize the recombinant glutathione S-transferase (GST)-mGPBP fusion protein was generated by inserting the mGPBP cDNA's ORF downstream of the GST cDNA sequence in plasmid pGEX-4T-1 (Pharmacia) to yield construct pGEX-mGPBP. These bacterial expression constructs were transformed into E. coli strains BL21/DE3 and BL21, respectively, for the production of recombinant mGPBP. The eukaryotic mGPBP expression vector pCDNA-mGPBP was generated by inserting the mGPBP cDNA's ORF into the pCDNA3 expression vector (Invitrogen). The hemagglutinin antigen (HA)-tagged (15) mGPBP eukaryotic expression construct pCDNAHA-mGPBP was generated by inserting the HA tag coding sequence upstream of, and in frame with, the mGPBP cDNA's ORF in pCDNA-mGPBP.
Purification of recombinant mGPBP. Recombinant mGPBP (His tagged) used to raise anti-mGPBP antiserum was purified under denaturing conditions with Ni-nitrilotriacetic acid (NTA) agarose (Qiagen) by following the vendor's instructions and then isolated as a single excised band following sodium dodecyl sulfate-8.5% polyacrylamide gel electrophoresis (SDS-8.5% PAGE). The recovered mGPBP protein was concentrated by using ULTRAFREE-MC 10,000NMWL filter units (Millipore).
Recombinant His-tagged mGPBP used for functional transcription or DNA binding assays was purified under native conditions with Qiagen's Ni-NTA agarose by following the manufacturer's protocol with minor modifications. After incubation of bacterial lysate with a 50% Ni-NTA agarose slurry at 4°C for 2 h, the resin-bound protein was eluted from the agarose matrix and the protein fractions containing the purified protein were then pooled and dialyzed twice for 60 min each, first against 500 ml of dialysis buffer (20 mM Tris-HCl [pH 7.5], 50 mM KCl, 10 mM MgCl2, 10 µM ZnSO4, 1 mM EDTA, 20% glycerol, 0.5 mM dithiothreitol [DTT], 0.2 mM phenylmethylsulfonyl fluoride [PMSF]) containing 0.25 M NaCl and then against 500 ml of the same dialysis buffer without NaCl. The GST-mGPBP fusion protein used for the pull-down assay was purified according to a published protocol (16) with minor modifications. After IPTG induction, the bacterial cells were pelleted and resuspended in phosphate-buffered saline (PBS) containing 10 mM DTT, 0.5 mM PMSF, and 2 µg each of leupeptin, pepstain A, and antipain/ml. The cells were then sonicated and subsequently treated with 1/10 volume of 10% Triton X-100. The GST fusion protein in the supernatant was then purified by one-step affinity chromatography using glutathione-Sepharose beads (Pharmacia) according to the vendor's recommendations.
EMSAs.
The probes were isolated as restriction fragments from plasmids and radiolabeled with [
-32P]dCTP by using Klenow fragments. Electrophoretic mobility shift assays (EMSAs) were performed as previously described (10) with minor modifications. Binding-reaction mixtures containing purified bacterially expressed recombinant proteins were prepared in binding buffer (10 mM Tris-HCl [pH 7.5], 60 mM KCl, 5 mM MgCl2, 0.5 mM DTT, 0.1 mM EDTA, 10 µM ZnCl2, 0.05% NP-40, 12 to 15% glycerol) containing 0.5 µg of poly(dI/dC) (total volume, 20 µl). After the binding reaction mixtures had been incubated for 10 min at room temperature, the radiolabeled probe was added for an additional 20 min of incubation at room temperature. The mixture was then separated in 4% acrylamide-bisacrylamide (29:1) nondenaturing polyacrylamide gel containing 1x Tris-borate-EDTA buffer.
Generation of anti-mGPBP antiserum and purification of the antibodies. Immunization was performed at the Immunological Resource Center at the University of Illinois. Purification of antibodies from the antiserum was carried out with the ImmunoPure(A/G) immunoglobulin G purification kit (Pierce), as suggested by the manufacturer. Eluted antibodies were combined, dialyzed against PBS, and concentrated with centrifugal filtration units (Millipore).
Antibodies. The antibodies used were anti-HA monoclonal antibodies (clone 12CA5; Boehringer Mannheim), anti-RNA polymerase II C-terminal domain monoclonal antibodies (Promega), anti-HA monoclonal antibodies conjugated with agarose, rabbit anti-TFIIB polyclonal antibody C-18, rabbit anti-TFIIF RAP30 polyclonal antibody C-17, anti-TBP monoclonal antibody 58C9, rabbit anti-CBP polyclonal antibody A-22 (Santa Cruz Biotechnology), and anti-nucleoporin p62 monoclonal antibody clone 53 (Transduction Laboratories).
Indirect immunofluorescence.
HeLa cells (5 x 105) were plated in Dulbecco's modified Eagle medium (DMEM) containing 10% fetal calf serum in 60-mm-diameter dishes containing glass coverslips and incubated at 37°C overnight. The following day the cells were transfected with 5 µg of either the pCDNAHA-mGPBP expression construct or the pCDNA-HA control vector by a calcium phosphate transfection protocol as described previously (2). Sixteen hours after transfection, the cells were rinsed extensively with PBS, treated with fresh medium, and incubated for an additional 48 h. The cells were then rinsed with PBS and fixed with 4% paraformaldehyde for 30 min at room temperature. After fixation the cells were permeabilized with 0.2% Triton X-100-PBS for 5 min, rinsed, and stained with either the anti-mGPBP (
N-80) antiserum- or preimmune antiserum-derived antibodies that had been purified by binding to protein A-conjugated beads or an antibody against HA (2 µg/ml) at 37°C for 1 h. After being washed four times (5 min/wash) with PBS, the cells were stained with either rabbit or mouse fluorescein-conjugated secondary antibodies (1:200 dilution) (Amersham) at 37°C for an additional hour. The coverslips were then washed extensively, mounted in antifade solution (Vector Lab) containing 0.25 µg of DAPI (4',6'-diamidino-2-phenylindole) to counterstain the nuclei, and photographed with a Zeiss Axiovert microscope with an attached Princeton Instruments charge-coupled device camera.
Preparation of nuclear extracts. Nuclear extracts were prepared as previously described (4) with minor modifications. Mouse Cl-1D cells were collected and washed with 10 volumes of PBS once. The cells were then suspended in 5 volumes of cold buffer A (10 mM HEPES-KOH [pH 7.9]; 10 mM KCl; 1.5 mM MgCl2; 0.1 mM EGTA; 0.5 mM DTT; 0.5 mM PMSF; 2 µg each of antipain, leupeptin, and pepstain/ml) and incubated on ice for 15 min. Cells were lysed by 20 strokes of a Dounce homogenizer (B pestle) and a further 2 to 5 strokes in the presence of 0.3 to 0.4% Nonidet P-40 (NP-40). The lysed cells were centrifuged at 1,300 x g for 10 min. The pelleted nuclei were washed three times with buffer A without NP-40, centrifuged again as described above, resuspended in 1 to 2 volumes of cold buffer B (10 mM HEPES-KOH [pH 7.9], 0.1 mM EGTA, 0.5 mM DTT, 400 mM NaCl, 5% glycerol, 0.5 mM PMSF), and incubated on ice for 60 min. The solution was then centrifuged at 48,000 x g for 1 h. The supernatant was dialyzed twice against 500 ml of dialysis buffer (20 mM HEPES-KOH [pH 7.9], 75 mM NaCl, 0.1 mM EDTA, 0.5 mM DTT, 20% glycerol, 0.5 mM PMSF) for 1 h and cleared by centrifugation in a microcentrifuge for 15 min. HeLa cell nuclear extract was from Promega.
Western blotting. Cells were lysed in radioimmunoprecipitation assay buffer (25 mM Tris [pH 8.2], 50 mM NaCl, 0.5% Nonidet P-40, 0.5% sodium deoxycholate, 0.1% SDS, 0.1% sodium azide) containing 1 mM PMSF, 10 µg of aprotinin/ml, and 10 µg of leupeptin/ml. Cell lysates were incubated on ice for 15 min and centrifuged at 12,000 x g for 10 min at 4°C. The protein concentration of the cleared lysates was determined with the Bio-Rad protein assay kit. Equal amounts of protein per lane were analyzed by SDS-8.5% PAGE. Western blotting was carried out with Amersham polyvinylidene difluoride membranes according to the vendor's recommendations.
Pull-down assay. Immobilization of bacterially expressed GST and GST-mGPBP proteins was accomplished by incubating 1 to 5 µg of the proteins with 100 µl of a 50% slurry of glutathione-Sepharose beads (Pharmacia) for 1 h at 4°C. The protein-bound beads were collected and washed three times with PBS buffer. The beads were then resuspended in binding buffer (20 mM HEPES-KOH [pH 7.9], 75 mM NaCl, 0.1 mM EDTA, 0.5 mM DTT, 20% glycerol, 0.5 mM PMSF) (17). About 1/60 of the beads were subjected to SDS-8.5% PAGE to confirm that GST proteins were bound to the beads. The rest of the beads were incubated with 100 µg of Cl-1D cell nuclear extract on a rotator overnight at 4°C. The beads were collected and washed three times with washing buffer (20 mM Tris-HCl [pH 8.0], 150 mM NaCl, 1 mM EDTA, 0.1 to 0.3% NP-40). The bound proteins were eluted with SDS sample buffer and subjected to SDS-4 to 20% PAGE. Western blotting was then carried out on the gels.
Coimmunoprecipitation analyses. Cells were collected by centrifugation in PBS 48 h after DNA transfection. Cells from each 100-mm-diameter dish were treated with 60 µl of lysis buffer containing 20 mM HEPES, pH 7.9, 400 mM NaCl, 1 mM EDTA, 0.1% NP-40, 1 mM DTT, 0.5 mM PMSF, 10% glycerol, and 1.5 µl of a protease inhibitor cocktail (Sigma) (17). Cell lysates were incubated for 30 min at 4°C and centrifuged at 13,000 x g for 10 min at 4°C. The supernatant from each 100-mm dish was treated with 300 µl of lysis buffer without NaCl and incubated with the agarose bead-bound anti-HA antibody (Santa Cruz Biotechnology) overnight at 4°C. The beads were pelleted and washed three or four times with 1 ml of buffer W (20 mM Tris-HCl [pH 8.0], 100 mM NaCl, 0.1 to 0.3% NP-40, 1 mM EDTA, 1 mM DTT, 0.5 mM PMSF). Bound proteins were analyzed by Western blotting using a 4 to 20% gradient polyacrylamide precast SDS-PAGE gel (BioWittaker Molecular Applications).
Cell culture and transfections. Monolayer cultures of murine Cl-1D LM (thymidine kinase-) fibroblast cells derived from bone marrow stromal cells of a (C57BL/6J x C3H/HeJ)F1 mouse and human embryonic kidney (HEK) 293 cells were maintained in 10% fetal calf serum in DMEM. Transfections into 293 and Cl-1D cells for transactivation assays were carried out with Lipofectamine Plus (for 3 h), as suggested by the manufacturer (Gibco BRL). The transfected cells were maintained in medium for about 24 to 36 h before being collected for luciferase assays. All transfection assays were repeated at least three times with different DNA preparations. All coimmunoprecipitation assays were performed using Cl-1D cells that had been transfected by the calcium phosphate coprecipitation technique as previously described (2). The coimmunoprecipitation assays were performed on the cells 45 to 48 h after transfection.
Luciferase assay. For each 60-mm plate of Cl-1D or 293 cells, 0.2 µg of the luciferase reporter gene under the control of the murine Ada gene promoter was cotransfected with various amounts of the mGPBP expression plasmid (pCDNA-mGPBP) and 0.1 µg of Rous sarcoma virus-ß-galactosidase (ß-Gal) plasmid. The total amount of plasmid used per plate was brought up to 2 µg with the empty pCDNA3 vector. The transfected cells were washed with PBS and then cultured in 10% fetal calf serum in DMEM for 24 to 36 h prior to being harvested for reporter gene expression analyses. The luciferase activities of the transfected-cell lysates were measured with a luciferase assay system (Promega) and the VICTOR2 multilabel counter (Wallac). The obtained values were then normalized according to the ß-Gal activity as previously published (29).
In vitro transcription assays.
In vitro transcription reactions were carried out as previously described (12) with minor modifications. Supercoiled template DNA was purified by banding twice in centrifuged CsCl gradients (29). The templates used consist of G-less cassette reporters either without a promoter (construct pC2AT- [30]), with the adenovirus major late promoter (construct PMLC2AT [30]), with the murine Ada gene promoter (pmADAPC2AT; generated by PCR), or with the human Topo II
gene promoter (see Fig. 8A). Transcription reaction mixtures (total volume, 25 µl) containing 8 mM HEPES (pH 7.9), 40 mM KCl, 6 mM MgCl2, 0.08 mM EDTA, 0.2 mM DTT, 8% glycerol, 30 U of RNase T1, 100 ng of template DNA, 9 to 10 µg of HeLa nuclear extract (Promega), and different amounts of antibodies and purified recombinant protein were incubated at 30°C for 10 min prior to the addition of 0.2 mM ATP, CTP, and GTP to 0.2 mM each, of UTP to 8 nM, of 3'-O-methyl-GTP (Pharmacia) to 0.05 mM, and of [
-32P]UTP (800 Ci/mmol) to 1 µM. The reaction mixtures were then incubated for 1 h at 30°C. The reactions were stopped by the addition of 175 µl of stop solution (0.3 M Tris-HCl [pH 7.4], 0.3 M sodium acetate, 0.5% SDS, 2 mM EDTA, 25 µg of tRNA/ml) and an extraction control 200-bp end-labeled DNA fragment. The reaction solutions were extracted twice with phenol-chloroform-isoamyl alcohol (25:24:1, vol/vol/vol). The nucleic acids in the reaction were ethanol precipitated, air dried, and dissolved in 4 µl of nuclease-free water. An equal volume of loading dye (98% formamide, 10 mM EDTA, 0.1% xylene cyanol, 0.1% bromophenol blue) was added to the solution of in vitro-transcribed RNA. These RNA solutions were then heated at 90°C for 10 min and electrophoresed in a 6% denaturing (7 M urea) polyacrylamide gel containing 0.5x Tris-borate-EDTA buffer. The amounts of radiolabeled transcript and extraction control bands were then quantified with a PhosphorImager (Molecular Dynamics).
|
| RESULTS |
|---|
|
|
|---|
gt11 bacteriophage mouse brain cDNA expression library for cloned proteins that bind to this MSPE. Screening with the negative control Sp1 binding consensus motif probe (TGGGCGGGGC) eliminated clones expressing proteins that bind nonspecifically to random G+C-rich sequences. Out of 3 x 107 clones, 2 clones were identified after tertiary screens and were further examined by subcloning, restriction digestion, and sequence analysis.
|
, which interacts with RNA polymerase II in transcription initiation complex assembly (3, 19); yeast ARS binding factor 1 (yABF-1), which is a known transcription factor involved in DNA replication initiation in yeast (5, 22); mouse high mobility group 1-like structure-specific recognition protein 1 (SSRP1), which apparently binds to DNA through structural recognition (7, 33) and which participates in both transcription activation (13, 38) and elongation (27); and helicases, which can unwind double-stranded DNA for transcription and replication (reviewed in reference 40). Based on our expectations of the properties of the gene sought, this fragment was deemed to be a good candidate clone and was used to generate two full-length cDNA clones, 3.0 and 3.5 kb in size, by 3' and 5' RACE. The entire cDNA sequence of the 3,451-bp clone, including the deduced 493-amino-acid (aa) sequence encoded by the 1,479-bp ORF and the 815-bp 3' untranslated region (UTR), is shown in Fig. 1A. This cloned gene was named the mGPBP gene.
The mGPBP transcript is ubiquitously expressed as two mRNA species differing in 3' polyadenylation site utilization.
Because G+C-rich promoters control practically all housekeeping genes, which are essential for cell survival, we predicted that, if mGPBP is critical for transcription of these genes, it would be ubiquitously expressed. The full-length mGPBP cDNA sequence was labeled and used as a probe to determine the distribution in tissue and mRNA sizes of the gene in a Northern blot analysis. The results (Fig. 2A) showed that the gene is highly expressed in all mouse tissues analyzed as two mRNA species of 3.0 and 3.5 kb. Because we had two 3' RACE products that differed in size by
500 bp and because each product contained consensus AATAAA polyadenylation signaling motifs located appropriately upstream of the poly(A) tracts, we surmised that the 3.0- and 3.5-kb mRNA species may represent the same RNA with different polyadenylation site utilization. This prediction was confirmed when we cloned out a 2.0-kb fragment from the 5' end and a 350-bp fragment from the 3' end of the longer cDNA clone and used them as probes on the same Northern blot, which was stripped between each hybridization experiment. The results (Fig. 2B) revealed that the 5' probe hybridized with both the 3.0- and 3.5-kb mRNA species, whereas the 3' probe hybridized only with the 3.5-kb mRNA species. These observations indicate that both these cross-hybridizing mRNAs contain the entire ORF of the mGPBP gene and differ only in their choice of 3' polyadenylation site usage.
|
|
|
|
N-80 to immunostain fixed and permeabilized human HeLa cells, which were then counterstained with DAPI to locate the nuclei. The anti-mGPBP signal was localized to the nucleus (Fig. 5A). In control experiments using protein A affinity-purified antibodies derived from preimmune antiserum as a probe, immunoreactive signals were negligible even upon prolonged photographic exposure (Fig. 5B). The nuclear localization of GPBP was confirmed by directly transfecting the mGPBP expression vector into human HeLa cells and immunostaining for mGPBP with our anti-mGPBP antibodies (Fig. 5C, top). Definitive proof that our recombinant mGPBP was localized to the nucleus was provided by transfecting an mGPBP-HA tag fusion protein expression vector into human HeLa cells and immunostaining for the tagged protein with anti-HA antibodies (Fig. 5C, bottom). These in situ immunohistochemical analyses demonstrated that both the endogenous human GPBP and the exogenous cDNA-encoded recombinant mGPBP could translocate into the cell nucleus. GPBP interacts with multiple key factors that participate in mammalian RNA polymerase II transcription initiation complex assembly. If GPBP binding to the G+C-rich promoter's MSPE can lead to the assembly of the transcription initiation complex at that location, GPBP should be able to interact with one or more transcription factors that normally participate in transcription initiation complex formation. This expectation was tested by assaying whether various nuclear transcription factors that comprise the RNA polymerase II transcription initiation complex could form complexes with immobilized recombinant mGPBP. Recombinant mGPBP fused in-frame to the GST tag was immobilized by binding the GST moiety to glutathione covalently linked to beads. Mouse Cl-1D nuclear extract proteins were then allowed to bind to the bead-immobilized mGPBP. Both the unbound proteins in the supernatant and the proteins that remained bound to the immobilized mGPBP after extensive washing were then analyzed by Western blotting using antibodies against various transcription factors as probes. The results showed that known transcription initiation complex factors, such as TBP, TFIIB, TFIIF RAP30, and RNA polymerase II, as well as the transcription factor P300/CBP, all formed complexes with the immobilized recombinant mGPBP (Fig. 6A). None of these proteins formed complexes with only the GST tag immobilized onto the same glutathione beads, and the unbound supernatant recovered from both binding assays showed that the same proteins were similarly present in both binding assays. The nuclear membrane protein nucleoporin p62, which does not participate in transcription initiation complex formation, showed no affinity to either immobilized mGPBP or GST (Fig. 6A). These in vitro results demonstrate that mGPBP can form complexes specifically with several key transcription initiation factors.
|
Transcription from the mouse Ada gene's G+C-rich promoter requires GPBP. To examine whether our cloned mGPBP could transactivate a luciferase reporter gene controlled by the mouse Ada gene's G+C-rich promoter, we cotransfected both mouse Cl-1D cells and human 293 cells with a constant amount of the reporter construct and increasing amounts of the mGPBP expression vector. For both cell lines, reporter gene expression increased linearly with the amount of the mGPBP expression vector added in a dose-dependent manner (Fig. 7A). These results showed that increasing the intracellular GPBP level transactivated the G+C-rich promoter-driven reporter transcription in both cell types.
|
The G+C-rich Topo II
gene promoter, which contains a canonical TATA box, can also bind specifically to mGPBP.
To examine whether mGPBP is a general requisite G+C-rich promoter-dependent transcription factor, we also examined whether this protein can bind to other G+C-rich promoters, especially one that contains a consensus TATA box and G+C-rich regions that display no obvious homology to the murine Ada gene promoter, as in the case of the human topoisomerase II
gene promoter (Fig. 8A) (41). EMSAs using the labeled Topo II
promoter as the probe showed that the Topo II
gene promoter can also bind specifically to purified recombinant mGPBP in the absence of any other mammalian transcription factors. This binding can be competed out by either excess unlabeled probe or excess unlabeled fragments containing four copies of the murine Ada gene MSPE but not by excess copies of a 200-bp fragment derived from the pUC2H vector, which does not adopt non-B form DNA structures under negative supercoiling conditions.
Transcription initiation at the Topo II
gene promoter can be only partially suppressed by immunosequestration of mGPBP.
In HeLa cell nuclear extract-dependent in vitro transcription assays, we demonstrated that transcription initiated at the murine Ada gene promoter required the presence of GPBP whereas transcription initiated at a consensus TATA box-dependent adenovirus major late gene promoter did not. Since this Topo II
gene promoter binds specifically to mGPBP but contains a canonical TATA box, we also examined how this promoter functioned when the GPBP in the nuclear extract was sequestered by immunoabsorption. Sequestering of GPBP in the HeLa nuclear extract with anti-GPBP antibodies under conditions that totally suppressed transcription initiated by the murine Ada gene promoter showed only a partial suppressive effect on transcription initiated by the Topo II
gene promoter (Fig. 8C). This suppression can also be fully reversed by the addition of purified recombinant mGPBP to the nuclear extract. The results in Fig. 8 thus demonstrated that mGPBP can indeed bind to other G+C-rich promoters and that the presence of a canonical TATA box rendered the Topo II
gene promoter only partially dependent on GPBP for transcription initiation, in contrast to the total dependence of the murine Ada gene promoter on GPBP for transcription in the absence of such a canonical element.
| DISCUSSION |
|---|
|
|
|---|
gt11 phage cDNA expression clone that encodes a GPBP. Because G+C-rich promoters are present upstream of many essential mammalian housekeeping genes but are not found in either the yeast or Drosophila genome, we predicted that, if GPBP expression is critical to transcription from G+C-rich promoters, it would be ubiquitously expressed in mammalian cells and would be absent from both yeast and Drosophila genomes. Northern (Fig. 2) and Western (Fig. 3B) blot analyses of various mouse tissues and mammalian cell lines confirmed the former prediction. Although there is some variability in GPBP mRNA levels in different tissues, with the highest levels being found in kidney, heart, liver, and brain (Fig. 2), the variability shows no correspondence with Ada gene expression in those tissues. This observation is consistent with our prediction that GPBP is a general transcription factor and would therefore function similarly to other general transcription factors such as TBP, which also does not show tissue-specific expression patterns that necessarily match the expression levels of all its many target promoters. This is because the attenuation of expression control usually does not reside within the minimal basal promoter element, as has been observed for the ada gene (2; R. J. Gum et al., unpublished data; Hsu et al., submitted). A sequence homology search of the DNA sequence database (which includes the entire yeast and Drosophila genome sequences) revealed no obvious mGPBP orthologs in any of the nonmammalian genomes, although small domain-specific homologies to a number of transcription factors were found (Fig. 1B). The domain-specific homology to the SSRP1 gene (Fig. 1B) is particularly intriguing. Since SSRP1 is known to bind DNA in a DNA structure-dependent manner (7, 33), it is possible that GPBP may also share that unusual property and bind to MSPE via structural recognition.
Surprisingly, although no obvious mGPBP orthologs were uncovered by the sequence search, the murine Ada gene's G+C-rich promoter functions when it is introduced into yeast cells (data not shown). We note with interest that yABF-1, which has some domain-specific homology with mGPBP (Fig. 1B), can reportedly transactivate the yeast TRP3 gene promoter, which possesses a "suboptimal" TATA box (22). Because yeast origins of replication do not function as such in mammalian cells and since no one has been able to identify a mammalian yABF-1 ortholog by low-stringency hybridization screening, yABF-1's ability to transactivate a yeast gene promoter with a noncanonical TATA box and its sequence homology with certain mGPBP domains raise the possibility that yABF-1 and mGPBP may have similar cellular functions but recognize and bind to highly divergent DNA sequences. It is certainly formally possible that yABF-1, which has no known mammalian orthologs, may also promote transcription at the mouse Ada gene's G+C-rich promoter in mGPBP-deficient yeast cells.
The predicted size of mGPBP based on the cDNA ORF sequence was confirmed by Western blot analysis of lysates of various mammalian cell lines derived from diverse tissues of origin (Fig. 3). The other immune cross-reacting proteins seen in all cell lysates (Fig. 3B) may be either degradation products of mammalian GPBP or other members of a GPBP gene family.
Purified recombinant mGPBP was shown to bind specifically to the murine Ada gene's MSPE and the sequence-divergent G+C-rich human Topo II
gene promoter in competitive EMSA, in the absence of any other mammalian proteins. This observation is consistent with GPBP potentially being responsible for mediating the initial assembly of the transcription initiation complex.
In situ binding assays using anti-mGPBP antiserum as a probe demonstrated that endogenous human GPBP is localized to the nucleus in HeLa cells (Fig. 5A). These results are consistent with GPBP's proposed role as a mediator of transcription initiation complex assembly at the promoter, a role which would require nuclear localization.
Western blot analysis of mouse Cl-1D cell nuclear extract proteins that either form a complex with immobilized mGPBP or are coimmunoprecipitated with HA-tagged mGPBP expressed in intact cells revealed that mGPBP can form complexes specifically with TBP, TFIIB, TFIIF, RNA polymerase II, and P300/CBP both in vitro and in vivo. Since all these mGPBP-associated transcription initiation factors are known to participate in the formation of the RNA polymerase II transcription initiation complex (8, 9), these observations are consistent with the proposed role of GPBP as a mediator of transcription initiation complex assembly, following its binding to target G+C-rich promoters.
In cotransfection assays, mGPBP was found to transactivate the murine Ada gene's G+C-rich promoter (Fig. 7A) in the two different cell lines (Cl-1D and 293) used. Because all mammalian cells tested contain significant levels of endogenous GPBP (Fig. 3B), it is not surprising that the transactivating effect of the exogenous recombinant mGPBP in the transfected cells is only in the 2.5-fold range. More significantly, in the presence of high copy numbers of the cotransfected reporter gene in both Cl-1D and 293 cells, the transactivating effect of the mGPBP expression vector remained linear over the more-than-eightfold range of levels of the GPBP expression vector used (Fig. 7A). Thus, in the presence of an excess of G+C-rich promoter sequences in the cell, GPBP appeared to be rate limiting for reporter gene transcription. Moreover, GPBP activity was shown to be essential for transcription directed by the mouse Ada gene's G+C-rich promoter. In in vitro transcription assays, sequestering of GPBP in human HeLa cell nuclear extract by immunoabsorption caused a proportional and ultimately complete suppression of transcription from the Ada gene's G+C-rich promoter (with the level of transcription declining to levels indistinguishable from that for a promoterless reporter construct; Fig. 7B and C). This suppression was reversed by replenishing the GPBP-depleted nuclear extract with purified recombinant mGPBP (Fig. 7B and C). In striking contrast to the G+C-rich promoter's absolute requirement for GPBP in the in vitro transcription assay, transcription activity from the adenovirus major late gene's classical TATA-box-dependent promoter was totally unaffected by the immunoabsorption-mediated sequestering of GPBP in the HeLa cell nuclear extract (Fig. 7B and C). The G+C-rich human Topo II
gene promoter that contains a canonical TATAAA element, on the other hand, showed only partial (and totally reversible) dependence on the presence of GPBP in similar experiments, which probably reflects this promoter's dual-promoter characteristics (i.e., both TATA box dependent and G+C rich). These results established that GPBP is required only for transcription initiated at the G+C-rich promoter but is totally dispensable for transcription initiated at a classical TATA box-dependent promoter. The observed linearity of GPBP's transactivation effect (Fig. 7A) and its absolute requirement in TATA box-deficient G+C-rich-promoter-directed transcription (Fig. 7C) are both consistent with the predicted role of GPBP as a rate-limiting factor for transcription initiated at a TATA box-deficient G+C-rich promoter.
Recently (39), the same in vitro transcription assays (with the same reporter constructs described above as templates) were used to examine transcription elongation suppression caused by immunoabsorption of a factor that associates with members of each of the three various classes (11) of transcription elongation complexes. In contrast to the results described for the immunoabsorption of GPBP, transcript production from both the G+C-rich and TATA box-dependent promoters was similarly inhibited by the addition of antibodies directed against that proposed transcription elongation factor (39). This observation suggests that elongation of transcripts initiated at both G+C-rich and TATA box-dependent promoters may utilize common elongation complexes and implicates the differential transcription-inhibitory effects displayed by GPBP sequestration as interference with a different transcription-associated process. Since GPBP can bind to a G+C-rich promoter's MSPE, can associate with all the known transcription initiation assembly factors tested, and is functionally required for transcription initiated at that promoter, the anti-GPBP antibodies probably interfered with a TATA box-independent transcription initiation complex assembly step. Whether these anti-mGPBP antibodies would interfere with transcription at non-G+C-rich, TATA box-deficient initiator element-dependent promoters and whether GPBP is similarly required for transcription initiated at all mammalian TATA box-deficient G+C-rich promoters remain unresolved questions that have become tractable with the new reagents we now have on hand.
| ACKNOWLEDGMENTS |
|---|
This work was supported in part by NIH grant AG11623 and MUNIN Corp. award MC99-2 to C.-Y.Y., NIH grant CA72572 to J.M.L., RO1 GM44088 to V.J.K., and NIH SBIR grants CA78044, CA91376, and CA89777 to J.A.G. L.-C.H. was supported by a UIC University fellowship. Generous support for these studies was also provided by the American Lebanese Syrian Associated Charities and NIH (Cancer Center core grant 5 PO1 CA21765) to SJCRH for J.M.L. and V.J.K.
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
2. Ackerman, S. L., A. G. Minden, and C.-Y. Yeung. 1993. The minimal self-sufficient element in a murine G+C-rich promoter is a large element with imperfect dyad symmetry. Proc. Natl. Acad. Sci. USA 90:11865-11869.
3. Aso, T., H. A. Vasavada, T. Kawaguchi, F. J. Germino, S. Ganguly, S. Kitajima, S. M. Weissman, and Y. Yasukochi. 1992. Characterization of cDNA for the large subunit of the transcription initiation factor TFIIF. Nature 355:461-464.[CrossRef][Medline]
4. Berger, S. L., and A. R. Kimmel (ed.). 1987. Methods in enzymology: guide to molecular cloning techniques, vol 152. Academic Press, Inc., San Diego, CA.
5. Biswas, E. E., M. J. Stefanec, and S. B. Biswas. 1990. Molecular cloning of a gene encoding an ARS binding factor from the yeast Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. USA 87:6689-6692.
6. Blake, M. C., R. C. Jambou, A. G. Swick, J. W. Kahn, and J. C. Azizkhan. 1990. Transcriptional initiation is controlled by upstream GC-box interactions in a TATAA-less promoter. Mol. Cell. Biol. 10:6632-6641.
7. Bruhn, S. L., P. M. Pil, J. M. Essigmann, D. E. Housman, and S. J. Lippard. 1992. Isolation and characterization of human cDNA clones encoding a high mobility group box protein that recognizes structural distortions to DNA caused by binding of the anticancer agent cisplatin. Proc. Natl. Acad. Sci. USA 89:2307-2311.
8. Buratowski, S., S. Hahn, L. Guarente, and P. A. Sharp. 1989. Five intermediate complexes in transcription initiation by RNA polymerase II. Cell 56:549-561.[CrossRef][Medline]
9. Chan, H. M., and N. B. La Thangue. 2001. p300/CBP proteins: HATs for transcriptional bridges and scaffolds. J. Cell Sci. 114:2363-2373.
10. Christy, B., and D. Nathans. 1989. DNA binding site of the growth factor-inducible protein Zif268. Proc. Natl. Acad. Sci. USA 86:8737-8741.
11. Conaway, J. W., A. Shilatifard, A. Dvir, and R. C. Conaway. 2000. Control of elongation by RNA polymerase II. Trends Biochem. Sci. 25:375-380.[CrossRef][Medline]
12. Dignam, J. D., R. M. Lebovitz, and R. G. Roeder. 1983. Accurate transcription initiation by RNA polymerase II in a soluble extract from isolated mammalian nuclei. Nucleic Acids Res. 11:1475-1489.
13. Dyer, M. A., P. J. Hayes, and M. H. Baron. 1998. The HMG domain protein SSRP1/PREIIBF is involved in activation of the human embryonic beta-like globin gene. Mol. Cell. Biol. 18:2617-2628.
14. Feinberg, A. P., and B. Vogelstein. 1984. A technique for radiolabeling DNA restriction endonuclease fragments to high specific activity. Anal. Biochem. 137:266-267.[CrossRef][Medline]
15. Field, J., J. Nikawa, D. Broek, B. MacDonald, L. Rodgers, I. A. Wilson, R. A. Lerner, and M. Wigler. 1988. Purification of a RAS-responsive adenylyl cyclase complex from Saccharomyces cerevisiae by use of an epitope addition method. Mol. Cell. Biol. 8:2159-2165.
16. Guan, K. L., and J. E. Dixon. 1991. Eukaryotic proteins expressed in Escherichia coli: an improved thrombin cleavage and purification procedure of fusion proteins with glutathione S-transferase. Anal. Biochem. 192:262-267.[CrossRef][Medline]
17. Hayes, S., P. Shiyanov, X. Chen, and P. Raychaudhuri. 1998. DDB, a putative DNA repair protein, can function as a transcriptional partner of E2F1. Mol. Cell. Biol. 18:240-249.
18. Innis, J. W., D. J. Moore, S. F. Kash, V. Ramamurthy, M. Sawadogo, and R. E. Kellems. 1991. The murine adenosine deaminase promoter requires an atypical TATA box which binds transcription factor IID and transcriptional activity is stimulated by multiple upstream Sp1 binding sites. J. Biol. Chem. 266:21765-21772.
19. Killeen, M. T., and J. F. Greenblatt. 1992. The general transcription factor RAP30 binds to RNA polymerase II and prevents it from binding nonspecifically to DNA. Mol. Cell. Biol. 12:30-37.
20. Kreidberg, J. A., and T. J. Kelly. 1986. Genetic analysis of the human thymidine kinase gene promoter. Mol. Cell. Biol. 6:2903-2909.
21. Lichtsteiner, S., J. Wuarin, and U. Schibler. 1987. The interplay of DNA-binding proteins on the promoter of the mouse albumin gene. Cell 51:963-973.[CrossRef][Medline]
22. Martens, J. A., and C. J. Brandl. 1994. GCN4p activation of the yeast TRP3 gene is enhanced by ABF1p and uses a suboptimal TATA element. J. Biol. Chem. 269:15661-15667.
23. McGrogan, M., C. C. Simonsen, D. T. Smouse, P. J. Farnham, and R. T. Schimke. 1985. Heterogeneity at the 5' termini of mouse dihydrofolate reductase mRNAs. Evidence for multiple promoter regions. J. Biol. Chem. 260:2307-2314.
24. Means, A. L., and P. J. Farnham. 1990. Sequences downstream of the transcription initiation site modulate the activity of the murine dihydrofolate reductase promoter. Mol. Cell. Biol. 10:1390-1398.
25. Means, A. L., and P. J. Farnham. 1990. Transcription initiation from the dihydrofolate reductase promoter is positioned by HIP1 binding at the initiation site. Mol. Cell. Biol. 10:653-661.
26. Melton, D. W., D. S. Konecki, J. Brennand, and C. T. Caskey. 1984. Structure, expression, and mutation of the hypoxanthine phosphoribosyltransferase gene. Proc. Natl. Acad. Sci. USA 81:2147-2151.
27. Orphanides, G., W. H. Wu, W. S. Lane, M. Hampsey, and D. Reinberg. 1999. The chromatin-specific transcription elongation factor FACT comprises human SPT16 and SSRP1 proteins. Nature 400:284-288.[CrossRef][Medline]
28. Rauth, S., K. G. Yang, A. M. Seibold, D. E. Ingolia, S. R. Ross, and C.-Y. Yeung. 1990. GC-rich murine adenosine deaminase gene promoter supports diverse tissue-specific gene expression. Somat. Cell Mol. Genet. 16:129-141.[CrossRef][Medline]
29. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a laboratory manual, 2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
30. Sawadogo, M., and R. G. Roeder. 1985. Factors involved in specific transcription by human RNA polymerase II: analysis by a rapid and quantitative in vitro assay. Proc. Natl. Acad. Sci. USA 82:4394-4398.
31. Sehgal, A., N. Patil, and M. Chao. 1988. A constitutive promoter directs expression of the nerve growth factor receptor gene. Mol. Cell. Biol. 8:3160-3167.
32. Shaw, G., and R. Kamen. 1986. A conserved AU sequence from the 3' untranslated region of GM-CSF mRNA mediates selective mRNA degradation. Cell 46:659-667.[CrossRef][Medline]
33. Shirakata, M., K. Huppi, S. Usuda, K. Okazaki, K. Yoshida, and H. Sakano. 1991. HMG1-related DNA-binding protein isolated with V-(D)-J recombination signal probes. Mol. Cell. Biol. 11:4528-4536.
34. Singh, H., J. H. LeBowitz, A. S. Baldwin, Jr., and P. A. Sharp. 1988. Molecular cloning of an enhancer binding protein: isolation by screening of an expression library with a recognition site DNA. Cell 52:415-423.[CrossRef][Medline]
35. Smale, S. T., and D. Baltimore. 1989. The "initiator" as a transcription control element. Cell 57:103-113.[CrossRef][Medline]
36. Smale, S. T., M. C. Schmidt, A. J. Berk, and D. Baltimore. 1990. Transcriptional activation by Sp1 as directed through TATA or initiator: specific requirement for mammalian transcription factor IID. Proc. Natl. Acad. Sci. USA 87:4509-4513.
37. Sopta, M., Z. F. Burton, and J. Greenblatt. 1989. Structure and associated DNA-helicase activity of a general transcription initiation factor that binds to RNA polymerase II. Nature 341:410-414.[CrossRef][Medline]
38. Spencer, J. A., M. H. Baron, and E. N. Olson. 1999. Cooperative transcriptional activation by serum response factor and the high mobility group protein SSRP1. J. Biol. Chem. 274:15686-15693.
39. Trembley, J. H., D. Hu, L. C. Hsu, C.-Y. Yeung, C. Slaughter, J. M. Lahti, and V. J. Kidd. 2002. PITSLRE p110 protein kinases associate with transcription complexes and affect their activity. J. Biol. Chem. 277:2589-2596.
40. van Brabant, A. J., R. Stan, and N. A. Ellis. 2000. DNA helicases, genomic instability, and human genetic disease. Annu. Rev. Genomics Hum. Genet. 1:409-459.[CrossRef][Medline]