Previous Article | Next Article ![]()
Molecular and Cellular Biology, November 2004, p. 9274-9285, Vol. 24, No. 21
0270-7306/04/$08.00+0 DOI: 10.1128/MCB.24.21.9274-9285.2004
Copyright © 2004, American Society for Microbiology. All Rights Reserved.
Samuel Lunenfeld Research Institute, Mount Sinai Hospital,1 Department of Molecular and Medical Genetics, University of Toronto, Toronto, Ontario, Canada2
Received 21 April 2004/ Returned for modification 19 May 2004/ Accepted 11 August 2004
|
|
|---|
|
|
|---|
Overexpression of CA150 reduces the ability of the human immunodeficiency virus type 1 (HIV-1) transactivator (Tat) to mediate production of transcripts from the long terminal repeat (LTR) promoter (53). This function requires core promoter elements such as an intact TATA box, suggesting a role for CA150 in HIV-1 gene regulation, and is dependent on both the WW and FF domains of CA150, which can associate with the pre-mRNA splicing factor SF1 and RNAPII, respectively (11, 21). Such interactions may bridge splicing complexes to actively transcribing RNAPII. Although there is excellent evidence for transcriptional coupling to the spliceosome (32, 34, 35, 44), there have been few examples of direct interactions between known elongation and splicing factors; thus, the ability of CA150 to engage both the transcriptional and splicing machinery is intriguing. Furthermore, the domains of CA150 orthologs from Caenorhabditis elegans to humans are highly conserved, arguing that they control biologically important protein interactions.
While WW domains have been extensively characterized (51), the functions of the more recently described FF domain (6) are not as well understood. FF domains are usually found in repeats of 4 to 6 and are present in several nuclear proteins related to splicing and transcription, as well as the p190 RhoGAP family of GTPases. The solution structure of a single FF repeat from HYPA/FBP11 revealed three
-helices arranged in an orthogonal bundle with the N and C termini at opposite ends, a feature that could facilitate the linking of multiple functional repeats (1). FF domains from CA150, HYPA/FBP11, and the yeast splicing factor Prp40 associate with the C-terminal domain (CTD) of the large subunit of RNAPII in a phospho-dependent manner (1, 11, 36).
The mammalian CTD contains 52 tandem repeats of the heptad amino acid sequence Y1S2P3T4S5P6S7 and is alternatively phosphorylated on Ser5 in promoter regions and Ser2 during elongation (27, 28). Several cellular kinases have the capacity to phosphorylate CTD residues (3, 14, 30, 31, 33, 37, 45, 46), and the first FF domain of HYPA/FBP11 binds a CTD peptide phosphorylated on both serines with a dissociation constant of 50 µM (1). It is still unclear, however, exactly which serine residues must be modified to facilitate FF domain binding.
Here we present evidence that the FF domains of CA150 can associate with multiple transcription- and splicing-related proteins, notably Tat-specific factor 1 (Tat-SF1). Tat-SF1 is essential for Tat-activated transcription from the HIV-1 LTR promoter and recruits U small nuclear ribonucleoproteins (snRNPs) to active sites of transcription (18, 60). Its yeast ortholog CUS2 plays an important role in splicing as well (59). CA150 FF domains can recognize multiple weak binding sites within Tat-SF1, and individual domains within the FF repeats show equivalent and noncooperative binding properties. We also identify a consensus FF domain binding motif which suggests that these domains can recognize both phosphorylated and nonphosphorylated targets.
|
|
|---|
Mouse monoclonal anti-Flag and anti-Tat-SF1 were from Sigma and Transduction Laboratories (BD Biosciences). Rabbit polyclonal anti-RNAPII and horseradish peroxidase-conjugated anti-GST antibodies were from Santa Cruz Biotechnology (Santa Cruz, Calif.), as was goat polyclonal anti-CA150. Rabbit polyclonal anti-GST was a gift from Sigal Gelkop (Ben Gurion University, Beer-Sheva, Israel). Texas Red-conjugated anti-mouse antibody and green fluorescent protein (GFP)-conjugated anti-goat antibody were purchased from Molecular Probes (Eugene, Oreg.), and rabbit polyclonal anti-GFP was purchased from Abcam (Cambridge, Mass.).
Cell culture and immunofluorescence. Human embryonic kidney 293 (HEK 293) cells were maintained in Dulbecco's modified Eagle's medium containing 10% heat-inactivated fetal calf serum and antibiotics. For exogenous expression of proteins, cells were transiently transfected with cDNAs by the calcium phosphate precipitation method (13).
For immunofluorescence, cells grown on glass coverslips were washed with phosphate-buffered saline and fixed with 4% paraformaldehyde. Samples were permeabilized with 0.25% Triton X-100 and blocked with 3% bovine serum albumin. Staining with anti-Tat-SF1 was detected with Texas Red-conjugated anti-mouse immunoglobulin (Ig), and CA150 was detected with GFP-conjugated anti-goat Ig. All samples were mounted with p-phenylenediamine to retard photobleaching. Microscopy was carried out with a Leica (Heerbrugg, Switzerland) DMIRE2 inverted microscope equipped with fluorescence and transmitted light optics and Openlab software (Quorum Technologies, Guelph, Ontario, Canada).
The images in Fig. 2B were obtained on live cells by using an Olympus 1X-70 inverted microscope equipped with fluorescent optics and Deltavision Deconvolution Microscopy software (Applied Precision, Bratislava, Slovakia).
![]() View larger version (35K): [in a new window] |
FIG. 2. CA150 and Tat-SF1 colocalize in cells and interact both in vivo and in vitro in an FF domain-dependent manner. (A) CA150 and Tat-SF1 coimmunoprecipitate. HEK 293 cells were transfected with plasmid expressing N-terminal Flag-tagged CA150 or with empty vector. CA150 and Tat-SF1 were immunoprecipitated with antibodies against either Flag or endogenous Tat-SF1. Associated proteins were subsequently detected by immunoblotting. Whole-cell lysates are shown as controls. (B) Recombinant Tat-SF1 and CA150 FF domains interact in vitro. GST alone (0.1 or 1 µM) or GST-tagged FF domains 1 to 3 were mixed with His-tagged Tat-SF1 immobilized on beads. The 1 µM inputs were loaded in the first two lanes on the left. Immunoblotting with antibodies against GST allowed detection of bound proteins. Recombinant proteins containing the CA150 FF domains were retained on the beads, whereas GST alone was not. (C) CA150 and Tat-SF1 colocalize in the nucleus. HEK 293 cells were fixed, permeabilized, and stained with antibodies against endogenous CA150 and Tat-SF1. The proteins colocalized to punctate nuclear structures resembling speckles. (D) Representation of CA150 N-terminal (WW domain) and CA150 C-terminal (FF domain) fragments used for examining the localization and Tat-SF1 binding properties of CA150. Constructs overlap at the nuclear localization sequence (NLS). (E) CA150 FF domains are coimmunoprecipitated with Tat-SF1 while the WW domain-containing N terminus is not. HEK 293 cells were transfected with pEYFP-C1 vector alone or YFP-tagged fragments encompassing the N or C terminus of CA150 (as in panel D). Tat-SF1 immunoprecipitations were probed with antibodies against GFP to identify associated proteins. (F) CA150 localization to nuclear speckles is dependent on its FF domains. Plasmids expressing YFP-tagged full-length, C-terminal, or N-terminal fragments of CA150 were transfected into HEK 293 cells. Images of single nuclei in live cells, taken by deconvolution microscopy, show localization of full-length CA150 or the C-terminal FF domains to nuclear speckles but not the WW domains. Red bar, 10 µm.
|
Immunoprecipitation, GST pull downs, and Western blotting. Transfected HEK 293 cells were lysed in NP-40 buffer (50 mM Tris-HCl [pH 7.4], 75 mM NaCl, 10% glycerol, 0.5% NP-40, 2 mM EDTA, 1 mM sodium vanadate, 1 mM PMSF, 10 µg of aprotinin ml1, and 10 µg of leupeptin ml1). Proteins were immunoprecipitated for 1 to 6 h at 4°C and washed multiple times with NP-40 buffer. For GST pull downs, HEK 293 cells lysed with TCL buffer were incubated with glutathione beads carrying recombinant GST fusion proteins for 1 to 2 h at 4°C. All bound proteins were separated by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) and transferred to a nitrocellulose membrane (Schleicher and Schuell Bioscience, Keene, N.H.). Membranes were blocked in Tris-buffered saline containing 5% skim milk and immunoblotted. Primary antibodies were detected with anti-mouse Ig or anti-rabbit Ig antibodies conjugated to horseradish peroxidase followed by treatment with enhanced chemiluminescence (Pierce, Rockford, Ill.).
Identification of interacting proteins by mass spectrometry. HEK 293 cells were lysed in TCL buffer and cleared by ultracentrifugation. The lysate was passed through a 0.45-µm-pore-size filter (Pall Corporation, Ann Arbor, Mich.) and precleared with glutathione beads before incubation at 4°C for 2 h with 6 µg of recombinant GST fusion proteins. After separation by SDS-PAGE, proteins were visualized with colloidal Coomassie (GelCode Blue Stain Reagent; Pierce). Bands were excised, reduced, alkylated, and digested with trypsin by using a ProGEST tryptic digestion robot (Genomic Solutions, Ann Arbor, Mich.). Mass spectrometry analysis was performed on a Finnigan LCQ Deca XP ion trap (Thermo Finnigan, Mississauga, Ontario, Canada) and by database searching with the Sonar MS/MS search engine (ProteoMetrics; Genomic Solutions).
Peptide spot array synthesis. Peptide arrays were produced according to the spot synthesis method (19). Acid-hardened cellulose membranes prederivatized with a polyethylene glycol spacer (Intavis, Cologne, Germany) and standard 9-fluorenylmethoxy carbonyl (Fmoc) chemistry were used for synthesis. Fmoc-protected and -activated amino acids (Intavis) were spotted in high-density 24-by-18 arrays on 130- by 90-mm membranes with an AbiMed (Langfield, Germany) ASP422 robot. Membranes were blocked overnight in 10% skim milk, incubated with 1 µM purified GST fusion proteins in Tris-buffered saline with Tween for 2 h at 4°C, washed three times, and probed with rabbit polyclonal anti-GST antibody. Primary antibodies were detected by horseradish peroxidase-conjugated anti-rabbit antibody followed by enhanced chemiluminescence (Pierce).
Peptide synthesis and fluorescent polarization. Peptides were synthesized on an AbiMed 431 synthesizer with the standard Fastmoc protocol. The authenticity of the product was confirmed by matrix-assisted laser desorption ionization mass spectrometry and amino acid analysis. Equilibrium binding constant determination was carried out on a Beacon fluorescence polarization system (Pan Vera, Wis.) equipped with a 100-µl sample chamber. Fluorescein-labeled probes were prepared through the reaction of carboxy-terminally labeled peptides with 5- (and 6)-carboxyfluorescein succinimidyl ester (Molecular Probes). Binding studies were conducted with 5 nM fluorescein-labeled probes dissolved in phosphate-buffered saline containing 100 µg of bovine serum albumin ml1 and 1 mM dithiothreitol. Reaction mixtures were allowed to equilibrate for 30 to 60 min at room temperature. All fluorescent polarization measurements were conducted at 22°C.
|
|
|---|
![]() View larger version (42K): [in a new window] |
FIG. 8. Binding to mock-phosphorylated RNAPII CTD is consistent with the defined FF domain interaction motif. (A) Heptad amino acid sequence repeated 52 times in mammalian RNAPII CTD. Phosphorylation of serine residues at positions 2 and 5 plays an important role in transcriptional progression. (B) A requirement for two negatively charged residues and Tyr for CTD peptides binding to CA150 FF domains. Peptide arrays were synthesized with CTD repeats containing Asp to mimic pSer and incubated with recombinant GST-tagged FF1 to 3. Asp (in boldface type) was used at either the Ser2 or Ser5 position alone (peptides 1 and 2), both positions together (peptide 3), or Ser2 with Thr4 (peptide 4). The first peptide on each membrane contains the wild-type sequence (listed horizontally across the top). Below each residue are peptides incorporating alanine substitutions. (C) Only CTD peptides with two negative charges are able to compete out CA150 FF domain binding to RNAPII in pull-down assays. Whole-cell lysate from HEK 293 cells was mixed with recombinant GST-tagged FF1 to 3 bound to glutathione beads. One millimolar synthetic peptides corresponding to two repeats of the RNAPII CTD were added. Peptides either contained two wild-type repeats, had Asp substitutions at the Ser2 or Ser5 positions, or had both together. Proteins retained on beads after washing were separated by SDS-PAGE, and the associated RNAPII was detected with specific antibody. -RNA Pol II, anti-RNAPII; WB, Western blotting.
|
![]() View larger version (39K): [in a new window] |
FIG. 1. Identification of cellular proteins binding to FF domains of CA150. (A) Domain architecture of CA150, which includes three WW domains at its N terminus followed by six FF domains. FF1-6 and FF5-6 represent segments of the protein used to identify FF domain binding partners. (B) Recombinant proteins encompassing either the FF1-6 or FF5-6 CA150 domain were used to precipitate cellular proteins from HEK 293 whole-cell lysate. Interacting proteins were separated by SDS-PAGE and detected by staining with colloidal Coomassie. The addition of lysate to GST alone (lane 1) or GST-FF fusion proteins minus lysate (lanes 2 to 3) served as controls. Protein bands identified as uniquely binding to FF domains were excised and subsequently identified by mass spectrometry.
|
|
View this table: [in a new window] |
TABLE 1. Nuclear proteins identified by mass spectrometry as interacting with the FF domains of CA150
|
The ability of CA150 FF domains to bind Tat-SF1 was insensitive to RNase treatment in vitro, which suggested a direct protein-protein interaction. To pursue this association, HEK 293 cells were transfected with murine CA150 tagged with an N-terminal Flag epitope. Upon immunoprecipitation with anti-Flag or anti-Tat-SF1 antibodies, we observed coprecipitation of CA150 with endogenous Tat-SF1 (Fig. 2A). This provides evidence that CA150 and Tat-SF1 do interact, potentially in a higher-order complex containing RNAPII and other transcription or splicing factors such as those identified as in vitro FF domain-binding partners.
To examine whether purified Tat-SF1 binds directly to CA150 FF domains, recombinant full-length Tat-SF1 protein containing an N-terminal histidine (His) tag was purified, bound to metal affinity beads, and incubated with either purified GST-tagged FF domains 1 to 3 or GST alone (Fig. 2B). FF domains were specifically precipitated by Tat-SF1 from solutions containing either 1 or 0.1 µM recombinant FF domains (lanes 5 to 6), indicating that Tat-SF1 can associate directly with the CA150 FF domains in vitro.
To test whether they could interact in vivo, we examined HEK 293 cells that endogenously express both proteins. Immunofluorescence staining of CA150 and Tat-SF1 revealed that these proteins colocalize in a heterogeneous speckle pattern within the nucleus (Fig. 2C), consistent with the possibility that CA150 and Tat-SF1 do interact in cells. We further examined the determinants of CA150 localization and Tat-SF1 binding by using YFP-tagged fragments encompassing either the N- or C-terminal region containing the WW or FF domains, respectively (Fig. 2D). These fragments overlap by only 35 amino acids in a region containing a putative nuclear localization sequence. Expression of these constructs in HEK 293 cells and immunoprecipitation with anti-Tat-SF1 antibodies revealed coprecipitation of the FF domain-containing C-terminal region of CA150 but not the N-terminal fragment (Fig. 2E). This is consistent with the idea that CA150 interacts through its FF domains with Tat-SF1 in vivo. Using deconvolution microscopy, we also examined the localization of these protein fragments and determined that the C-terminal FF domain-containing region consistently localized to speckle structures, whereas the N-terminal region encompassing the WW domains was observed uniformly throughout the nucleus (Fig. 2F). These data suggest that FF domains are specifically involved in targeting CA150 to nuclear speckles, where interactions with Tat-SF1 would occur.
FF domains of CA150 bind Tat-SF1 peptide motifs containing acidic and aromatic residues. To delineate the regions of Tat-SF1 recognized by FF domains, we synthesized a spot peptide array spanning residues 1 to 561 of murine Tat-SF1 with 12-mer peptides and a moving window of five amino acids. The membrane was incubated with 1 µM recombinant GST-tagged CA150 FF domains 1 to 3, and bound protein was detected with anti-GST antibodies (Fig. 3A). Multiple Tat-SF1 peptides interacted with the FF domains in this assay (Fig. 3B). These sequences encompassed 10 nonoverlapping regions of the protein and therefore appeared to represent multiple distinct sites for FF domain recognition. The interacting peptides were all rich in the acidic amino acids Asp and Glu, suggesting that this may be a common feature of FF domain-binding sites.
![]() View larger version (59K): [in a new window] |
FIG. 3. The FF domains of CA150 bind to multiple motifs of Tat-SF1 rich in aromatic and acidic residues. (A) Recombinant GST-tagged CA150 FF domains 1 to 3 bind to several peptide sequences derived from Tat-SF1. Overlapping 12-mer peptides spanning residues 1 to 561 of murine Tat-SF1 were synthesized on a peptide spot array. The resulting membrane was incubated with either the FF domains or GST alone as a control (spots circled in grey). Immunoblotting with antibodies against GST identified interacting peptides. (B) Complete sequences of peptides identified as interacting with FF domains 1 to 3. Sequences with overlapping regions, seen as adjacent spots, are grouped together with brackets and numbered accordingly. Acidic residues Asp or Glu are shown in boldface type. Peptides binding to GST alone or the anti-GST antibody were omitted. (C) FF domain binding to Tat-SF1-derived peptides is dependent on interactions with aromatic and acidic residues. Four peptides able to bind FF domains were selected for mutational analysis. Each residue in a given sequence was systematically mutated to alanine on a spot array. The wild-type sequence is shown above the membrane, with spots below each amino acid representing the incorporation of alanine at that position. Blots were probed with GST-tagged FF domains 1 to 3.
|
FF domains bind to multiple (D/E)2/5-F/W/Y-(D/E)2/5 motifs with low affinity. To expand the definition of an FF domain ligand, spot arrays were synthesized in which all 20 natural amino acids were substituted at each position of specific Tat-SF1 peptide sequences. The membranes were probed with FF domains 1 to 3 (Fig. 4A). Asp and Glu were accepted at the majority of positions in every peptide sequence, as were Phe, Trp, and Tyr, confirming that these amino acids are important for peptide interactions with FF domains. Each sequence contains several acidic residues that potentially contribute to binding, yet only in SGWFHVEEDRNT are these contiguous. The involvement of negatively charged amino acids in FF domain binding is most obvious in this peptide, since only acidic residues are tolerated in this core region.
![]() View larger version (107K): [in a new window] |
FIG. 4. FF domains bind preferentially to the consensus sequence (D/E)2/5-F/W/Y-(D/E)2/5. (A) All 20 natural amino acids were substituted at each position of three representative Tat-SF1 peptide ligands with a spot array. Sequences of the wild-type Tat-SF1 peptides are indicated vertically at the left of each array. Amino acid substitutions are listed horizontally across the top. Membranes were incubated with GST-tagged CA150 FF domains 1 to 3, and bound protein was detected with antibodies against GST. A strong selection for acidic amino acids, aromatic residues, and against Lys can be seen. (B) Consensus FF domain binding sequence. Tat-SF1 motifs recognized by FF domains 1 to 3 of CA150 (Fig. 3) are listed. Sequences conforming to the parameters for FF domain binding identified in panel A are underlined. Aromatic residues of Tyr, Phe, or Trp (shown in black boxes) are found in 8 of 10 motifs but may be replaced with the smaller leucine, valine, or isoleucine (shown in gray boxes). Either N- or C-terminal to the hydrophobic residues are two or more of the acidic amino acids Glu or Asp (shown in boldface type). In 7 of 10 sequences, a basic Lys or Arg residue is also observed in close proximity to the binding motif (shown in italics). (C) Potential FF domain binding sites in full-length murine Tat-SF1 (residues 1 to 757) conforming to the (D/E)2/5-F/W/Y-(D/E)2/5 consensus motif. Shown are the number of interaction motifs identified by peptide arrays as well as other potential sites found in Tat-SF1 that may influence FF domain binding, such as those created or enhanced by 19 putative DNAPK or casein kinase II phosphorylation sites (*).
|
Comparison of wild-type Tat-SF1 peptide sequences found to bind FF domains was used to establish a consensus domain-binding motif in conjunction with the amino acid scans. Eight of the 10 FF binding motifs of Tat-SF1 have aromatic residues surrounded by multiple acidic amino acids (Fig. 4B). This suggested a consensus binding motif of (D/E)2/5-F/W/Y-(D/E)2/5, where two of the five residues either preceding or following an aromatic residue are negatively charged Glu or Asp. In some cases, substitution of F/W/Y with the smaller hydrophobic residues Leu or Val may be tolerated.
While 10 Tat-SF1 sites able to bind FF domains in vitro were identified by spot arrays, an additional 16 possible FF-binding motifs are present in the sequence of full-length murine Tat-SF1 (Fig. 4C). These consist of sites that interacted with a negative control on the spot scan, were not present in the arrays (residues 562 to 757), or could potentially be generated by serine-threonine phosphorylation to create negatively charged motifs. This latter possibility is plausible, as Tat-SF1 is known to be phosphorylated on serine (60). Serine phosphorylation could also enhance the affinity of several of the identified FF domain-binding motifs.
To determine FF domain affinities for Tat-SF1 peptide ligands, we quantified binding of recombinant FF domains 1 to 3 to synthetic peptides in vitro by fluorescence polarization. Fluorescently labeled peptides representing three of the sequences from Tat-SF1 had modest affinities for FF domains. The SGWFHVEEDRNT motif bound with a dissociation constant of 61 µM (Fig. 5), whereas the other two peptides bound more weakly. For the additional two sequences, we tested whether substitutions based on the results of the amino acid scans could increase their affinities. A Lys mutation to Ala in PDDTPYEWDLDA (Fig. 4A, top panel) conferred a 207 µM affinity, whereas the addition of two Tyr residues and incorporation of a Gly and Ser in YGNDEFDEGLYS (Fig. 4A, bottom panel) gave an affinity of 170 µM. Hill coefficients of 1 indicated that there was no cooperative binding between individual domains, signifying that a single domain-peptide interaction does not increase the affinity of adjacent domains for these peptide ligands. To further examine the FF domain-binding motifs, Tat-SF1 peptides with only the core consensus motif were synthesized (YEWDLD and WFHVEED). These peptides bound to FF domains 1 to 3 with slightly tighter affinity than the extended sequences (132 and 48 µM, respectively). Finally, we found that FF domains interacted with a minimal motif of WDD with a Kd of 128 µM and a Hill coefficient of approximately 1, indicating that one aromatic residue followed by two negatively charged aspartic acids is sufficient to confer a single-site, low-affinity FF domain interaction. These data are consistent with the notion that CA150 FF domain interactions with Tat-SF1 could occur due to the avidity of multiple FF domains for multiple low-affinity binding sites. These sites are composed of the (D/E)2/5-F/W/Y-(D/E)2/5 motif but may have residues that impair affinity, such as basic residues within or adjacent to the core motif or small hydrophobic amino acids in place of F/W/Y.
![]() View larger version (21K): [in a new window] |
FIG. 5. FF domains bind with low affinity to several Tat-SF1-derived peptides. (A, B, and C) CA150 FF domain binding to Tat-SF1 peptides as measured by fluorescence polarization. Recombinant protein containing FF domains 1 to 3 was incubated with fluorescently labeled peptides representing optimal Tat-SF1 binding motifs. Peptide sequences are indicated at the top left. Mutations increasing affinities to panels A and C were made according to peptide spot analysis. Wild-type Tat-SF1 residues are shown below the sequence (black boxes) where mutations were employed. Michaelis-Menten (left), Scatchard (inset), and Hill (right) plots for peptide interactions with FF domains are shown. (D) Summary of FF domain peptide binding data displaying observed Kd and Hill coefficient values. In addition to peptides represented in the top panels, binding results for fluorescently labeled peptides corresponding to minimal FF domain binding motifs are also shown.
|
![]() View larger version (21K): [in a new window] |
FIG. 6. Individual CA150 FF domains bind target motifs with similar kinetics and affinity. (A) Recombinant protein containing GST-tagged FF domain 2 of CA150 interacts with Tat-SF1 peptide as measured by fluorescent polarization. The Tat-SF1 motif WFHVEED (identified as an FF domain ligand by spot arrays) was fluorescently labeled and mixed with increasing concentrations of FF2. Michaelis-Menten (top), Scatchard (inset), and Hill (bottom) plots are shown. (B) Summary of kinetic data obtained from individual CA150 FF domains interacting with peptides. In addition to FF2, interaction data for the first and third domains binding to Tat-SF1 peptide WFHVEED were gathered. Similarly, the affinity of individual domains for a minimal FF domain motif-peptide (WDD) was also measured. Equilibrium constants (Kd) for the interactions are listed.
|
450 µM) and is not able to precipitate the CA150 FF domains when immobilized on streptavidin beads. A construct containing an N-terminal His tag fused to five YEWDLD repeats (Fig. 7A) was examined for its ability to precipitate GST-tagged FF domains 1 to 3 or GST alone in a mixing experiment (Fig. 7B). The CA150 FF domains were able to interact with the multimeric motifs while GST alone could not. Indeed, the amount of FF domains precipitated by 5x YEWDLD beads was similar to that pulled down by beads carrying an equivalent quantity of full-length Tat-SF1 (Fig. 2B). This indicates that a minimal YEWDLD sequence is sufficient to mediate a stable interaction with multiple FF domains when oligomerized in a fashion similar to CA150 FF domain-binding sites in Tat-SF1. To determine whether this multimeric motif could outcompete the FF domain interaction with full-length Tat-SF1, we generated a construct containing the 5x YEWDLD oligomer fused to the C terminus of FF domains 1 to 3. The oligomeric YEWDLD motifs were able to successfully compete for the FF domain interaction with Tat-SF1 (Fig. 7C). Interestingly, the addition of a peptide containing only a single YEWDLD motif to a pull down with FF domains 1 to 3 was not able to inhibit binding to full-length Tat-SF1, even at peptide concentrations up to 2 mM (data not shown). Therefore, though the YEWDLD sequence is sufficient to generate an interaction with FF domains, this weak association requires multiple copies of the target motif to mediate stable binding.
![]() View larger version (42K): [in a new window] |
FIG. 7. FF domain repeats interact with multimeric ligand. (A) Sequence of recombinant His-tagged protein containing five copies of the FF domain binding motif YEWDLD, each separated by a five-residue linker (GAGAG). (B) Recombinant YEWDLD oligomer and CA150 FF domains interact in vitro. Constructs expressing the His-tagged YEWDLD motifs were expressed in bacteria. Either 1 or 5 µM GST alone or GST-tagged FF domains 1 to 3 were incubated with the immobilized, multimeric binding sites. One micromolar inputs were loaded in the first two lanes on the left. Immunoblotting with antibodies against GST allowed detection of bound proteins. (C) Oligomerized motifs can compete for Tat-SF1 interaction. Beads carrying either GST alone or recombinant GST-tagged proteins containing FF domains 1 to 3 with or without the tethered binding motifs were mixed with HEK 293 whole-cell lysate. Interacting protein was detected with antibodies against endogenous Tat-SF1. -Tat-SF1, anti-Tat-SF1. (D) Domain architecture of Tat-SF1, which has two RRM domains. Several regions of the protein, including parts of the N terminus, RRM domain 1 or 2, and C terminus were used to map the interaction with the CA150 FF domains. A point mutation abolishing binding to the FF domain target site YEWDLD (YEAALD) was also made. Fragments are shown at the bottom, with open boxes denoting RRM domains. The number of potential FF domain binding motifs in each segment is shown in parentheses. (E) The FF domains of CA150 are able to associate with several regions of Tat-SF1. Flag-tagged fragments of Tat-SF1 were transfected into HEK 293 cells, and expression was monitored by immunoblotting with anti-Flag ( -Flag) antibodies (top panel). Lysate was incubated with recombinant GST-tagged FF1 to 3 bound to glutathione beads. Proteins retained after washing were detected by immunoblotting (bottom panel). WB, Western blotting.
|
FF domain-peptide interactions are consistent with binding to an RNAPII CTD phosphorylated on serines 2 and 5. Although recruitment of several proteins to the CTD requires phosphorylation of either serine 2 or 5, no known CTD-interacting proteins have an absolute requirement for phosphorylation of both sites. Binding of the nuclear matrix protein SCAF8 to RNAPII appears to favor a doubly phosphorylated CTD, but this interaction is not well understood (42). An individual FF domain from HYPA/FBP11 was shown to bind the CTD heptad amino acid sequence (Fig. 8A) containing pSer at both the 2 and 5 positions with a binding constant of 50 µM (1). As our consensus binding motif suggested that two negatively charged residues are needed to generate an FF domain ligand, we investigated whether this was also the case for binding to the CTD. To address this, we synthesized spot peptide arrays containing 12-mer CTD peptides in which Asp was incorporated at potential Ser or Thr phosphorylation sites, owing to the inefficient synthesis of peptides containing multiple pSer residues; these arrays were then probed with CA150 FF domains 1 to 3 (Fig. 8B). While no binding was observed to peptides with Asp at only the second or fifth positions, those with substitutions for both Ser residues or Ser2 in conjunction with Thr4 associated with the CA150 FF domains. Ala substitutions of specific amino acids within each of these peptides resulted in an almost complete loss of binding, notably, substitution of Tyr1 and Asp at position 2 or 5 (acting as pSer mimics) interfered with FF domain-binding.
To pursue the notion that two negatively charged amino acids in a CTD repeat are preferred for effective binding to FF domains, we examined the capacity of these CTD peptide ligands to compete for the binding of CA150 FF domains to RNAPII. Synthetic peptides corresponding to two repeats of the CTD heptad amino acid sequence were prepared in an unmodified form or with Asp at position 2, 5, or both together to mimic pSer. These peptides (1 mM) were mixed with HEK 293 cell lysate and GST-tagged FF domains 1 to 3 on beads. Bound RNAPII was detected with antibodies to the endogenous protein (Fig. 8C). While the addition of peptides with no Asp or those with substitutions at the second or fifth positions alone did not significantly lower FF domain binding to RNAPII, peptides with Asp at both positions inhibited this interaction. The joint inclusion of peptides charged at either the second or fifth position separately had no effect as well. This result argues that two negative charges within the CTD motif enhance FF domain binding and that peptides with acidic residues at the Ser2 or Ser5 position alone have no, or significantly reduced, individual or cooperative effects on CA150 FF domain binding to RNAPII. These data, coupled with the CTD binding studies on HYPA/FBP11, indicate that FF domain interactions with RNAPII occur at sites that conform to the consensus binding motif identified by peptide arrays and could represent a CTD interaction requiring phosphorylation of Ser 2 and 5 together.
|
|
|---|
The identities of several FF domain-binding partners are consistent with a potential ability to link transcription and splicing. The CA150 FF domains isolated UBF-1, BR140, and the PD2/HRPT2 complex, which are implicated in the regulation of transcription from RNAPII promoters and represent connections to initiation and elongation (23, 50, 55). Likewise, the identification of Prp8, PSF, and p54nrb/NonO as FF domain-binding proteins suggests an interaction with the spliceosome, especially since purified spliceosome fractions contain CA150 (7, 17, 22, 39, 49). Among other in vitro binding partners for the CA150 FF domains, Btf and FBP2 have roles in the transcriptional control of apoptosis, another process to which CA150 has been linked (26, 48, 58), and the RS proteins SRm300, SC35, SRP55, and SRP75 are involved in alternative splicing (8, 9, 20, 29, 47, 57).
Consistent with the functional relevance of this proteomic analysis, several of the FF domain-interacting proteins, such as SC35, PSF, and Prp8, localize to nuclear speckles, as observed for CA150 and Tat-SF1. Nuclear speckles are rich in pre-mRNA processing components and contain hyperphosphorylated RNAPII (16) along with fractions of HIV-1 Tat and PTEF-b (24, 43). The observation that the CA150 FF domains, when targeted to the nucleus, localize to speckles correlates with their binding to transcriptional regulators that associate with these nuclear structures.
Analysis of the interaction between the CA150 FF domains and their binding motifs in Tat-SF1 has suggested a model in which an avidity for multiple weak sites can result in stable binding between two protein partners. We identified multiple FF domain-binding sites in Tat-SF1 that yielded a consensus binding motif of (D/E)2/5-F/W/Y-(D/E)2/5. Although several of the proteins listed in Table 1 may not interact directly with the CA150 FF domains, many of them contain multiple copies of this target sequence, with Tat-SF1 possessing the largest number of potential FF domain binding sites. The average number of motifs present in these proteins is well above that calculated for a randomly generated list of protein sequences and would be further enhanced with the inclusion of potential Ser phosphorylation sites.
Full amino acid scans on several peptides showed that incorporation of Asp or Glu at most positions in each peptide resulted in an increased affinity for CA150 FF domains 1 to 3, and the same held true for Phe, Trp, and Tyr. Spacing constraints between the hydrophobic and negatively charged residues are not stringent, yet there is a definite requirement for at least two acidic residues in these peptide ligands. There are a number of conserved basic residues within the six CA150 FF domains and on the surface of the first HYPA/FBP11 FF domain (1). Each of these domains possesses an overall pKa of approximately 9.5, and it is possible that the large number of basic residues (22% on average) could drive a broad selection for acidic residues in the ligand. Interestingly, among the family of FF domains, only those from the p190 RhoGAPs do not possess these conserved basic charges and instead have a pKa of 5. This implies that these domains could have a divergent mechanism of ligand recognition.
An interesting aspect of several FF domain-binding sites on Tat-SF1 is their close proximity to disfavored Lys residues, yielding sites that are suboptimal. This is reminiscent of interactions between yeast Sic1 and the WD40 domain-containing F-box protein Cdc4 (38), in which a single Cdc4 binding site exists for 9 suboptimal phosphorecognition motifs in Sic1. The situation with FF domains, however, is likely to be different due to their regular arrangement in repeated arrays. Whereas the WD40 domain of Cdc4 is able to bind to optimal phosphopeptides with comparatively high affinity (Kd of
1 µM), it seems likely that FF domain binding, even to the most favorable ligands, is relatively weak. This may explain the requirement for multiple FF domains, which would provide a sufficiently high avidity for an interaction to occur with multiple weak target sites.
Binding data for the CA150 FF domains indicates that individual domains possess roughly equal affinities for Tat-SF1 peptides or for a minimal WDD motif. Hill coefficients also indicated that each domain has only one recognition site and that there does not appear to be any cooperative effect on domain binding, whereby interactions at one site would increase the affinity of adjacent binding events. Instead, these data suggest that each FF domain within a repeated structure possesses an independent binding activity, conferring an ability to engage multiple targets through weak interactions. This is in contrast to repeated motifs such as TPR or WD40 repeats, which pack together to form tertiary structures, creating a composite binding surface for a polypeptide ligand (2, 15). The ubiquitin-interacting motifs of the yeast endocytic adaptor protein Vps27 may represent a binding mode analogous to FF domains (54). These 15-residue motifs fold autonomously into single
-helices capable of noncooperative interactions with monoubiquitin at modest affinities (Kd of 117 to 277 µM), resulting in a stable interaction with ubiquitinated proteins.
The regulation of FF domain interactions is an intriguing issue. The FF domain was originally identified as a pSer-binding module. However, in contrast to other interaction domains that recognize phosphorylated Ser/Thr or Tyr sites and absolutely require a phosphoamino acid for binding, CA150 FF domains can recognize peptides with either pSer or acidic Glu/Asp residues and therefore have a strong selection for negative charges (in the context of an aromatic residue) rather than a strict dependence on phosphorylation. These data suggest two potential modes by which FF domains can recognize ligands. We observed interactions with several Tat-SF1-derived peptides containing acidic residues flanking an aromatic amino acid and obtained evidence that stable binding requires interaction of multiple CA150 FF domains with multimeric aromatic acidic motifs. CA150 FF domains also recognize the RNAPII CTD, but in this case, the requisite negative charges must be generated by phosphorylation of the Ser2 and Ser5 residues, and the interaction is therefore phospho dependent. Of note, we observe no binding of the CA150 FF domains to unphosphorylated CTD peptides by fluorescence polarization and only slightly greater affinities for CTD peptides phosphorylated on Ser2. Peptides phosphorylated on Ser5 bind FF domains with a slightly higher affinity but still more weakly than those which are phosphorylated at both sites, consistent with the possibility that two negatively charged residues are important for stabilizing the interaction (M. J. Smith, unpublished data). The phospho-CTD has 52 potential FF domain binding sites, and a doubly phosphorylated CTD motif was shown to bind a single HYPA/FBP11 FF domain with a Kd of 50 µM (1). This is stronger than the interaction observed between unphosphorylated and acidic Tat-SF1 peptides and single CA150 FF domains and comparable to the described binding of a single WW domain from the prolyl isomerase Ess1/Pin1 to differentially phosphorylated CTD peptides (56). These results, combined with our peptide array and competition data, suggest that an RNAPII CTD containing doubly phosphorylated motifs is potentially a preferred binding partner for FF domains compared with Tat-SF1.
These observations raise the speculation that CA150 FF domains may switch between different binding partners, depending on their phosphorylation status. For example, CA150 FF domains could bind partners such as Tat-SF1 until RNAPII progresses from an initiation mode to a processive complex capable of elongation. This is when it is most likely that both Ser2 and Ser5 would be phosphorylated within individual CTD repeats, creating binding sites for FF domains. FF domain interactions with these phosphorylated sites may therefore allow the recruitment of elongation and splicing factors, such as Tat-SF1, to active sites of transcription as they are released from CA150 due to its stronger affinity for the phospho-CTD. Additionally, weak interactions between the RNAPII CTD and WW domains (which are regularly found in conjunction with FF domains) may also stabilize the complex, as several WW domains have been shown to bind differentially phosphorylated forms of the CTD (12, 56).
We have described the biochemical characteristics of an interaction between the FF domains of the mammalian elongation factor CA150 and the transcription and splicing factor Tat-SF1. It will be of interest to explore the consequences of this interaction for the transcription of mammalian genes or for those of pathogens like HIV-1. FF domains comprise a class of repeated interaction domains, with novel and versatile binding properties.
|
|
|---|
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»