Molecular and Cellular Biology, February 2009, p. 943-952, Vol. 29, No. 4
0270-7306/09/$08.00+0 doi:10.1128/MCB.02085-07
Copyright © 2009, American Society for Microbiology. All Rights Reserved.
Institute of Physiology and Center for Integrative Human Physiology, University of Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
Received 21 November 2007/ Returned for modification 2 January 2008/ Accepted 26 November 2008
α(1-2)glucosyltransferase enzymes. We have identified two collagen galactosyltransferases using affinity chromatography and tandem mass spectrometry protein sequencing. The two collagen β(1-O)galactosyltransferases corresponded to the GLT25D1 and GLT25D2 proteins. Recombinant GLT25D1 and GLT25D2 enzymes showed a strong galactosyltransferase activity toward various types of collagen and toward the serum mannose-binding lectin MBL, which contains a collagen domain. Amino acid analysis of the products of GLT25D1 and GLT25D2 reactions confirmed the transfer of galactose to hydroxylysine residues. The GLT25D1 gene is constitutively expressed in human tissues, whereas the GLT25D2 gene is expressed only at low levels in the nervous system. The GLT25D1 and GLT25D2 enzymes are similar to CEECAM1, to which we could not attribute any collagen galactosyltransferase activity. The GLT25D1 and GLT25D2 genes now make it possible to address the biological significance of collagen glycosylation and the importance of this posttranslational modification in the etiology of connective tissue disorders.
After synthesis in the endoplasmic reticulum (ER), three procollagen subunits associate to build a right-handed triple helix. However, before the formation of the triple-helix structure, the nascent procollagen polypeptides undergo several posttranslational modifications. These modifications comprise the hydroxylation of selected proline (20) and lysine (33) residues, which is catalyzed by three prolyl-4-hydroxylases (17), one prolyl-3-hydroxylase (46), and three lysyl hydroxylases (43). Hydroxylysine can be further modified by the addition of the monosaccharide Gal(β1-O) or the disaccharide Glc(α1-2)Gal(β1-O) (39).
Whereas the glycosylation of collagen was first described by Grassmann and Schleich in 1935 (9) and the structure of the glycan was determined by Spiro in 1967 as being Glc(α1-2)Gal(β1-O)Hyl (40), the molecular nature of the collagen glycosyltransferase enzymes has remained elusive up to now. Collagen galactosyltransferase (ColGalT) and glucosyltransferase activities have been characterized using partially purified proteins (24, 31, 32), which appeared to be unstable. Recently, the lysyl hydroxylase 3 (LH3) enzyme has been shown to have modest galactosyltransferase and glucosyltransferase activities, suggesting that this enzyme is a combined hydroxylase and glycosyltransferase (12).
Prolyl and lysyl hydroxylation contribute to the stability of the collagen triple helix, where hydroxylysine is essential for the cross-linking of collagen molecules, thus ensuring the strength of collagen fibrils (28). In contrast, the biological significance of collagen glycosylation is still unclear. The collagen domains of adiponectin and mannose-binding lectin also carry glycosylated hydroxylysine residues, which appear to be important for the oligomerization and proper secretion of these proteins (6, 29).
The importance of collagen posttranslational modifications is reflected by the diseases caused by defective collagen-modifying enzymes. Mutations of the lysyl hydroxylase 1 (LH1) gene lead to the connective tissue disorder Ehlers-Danlos syndrome type VI (14), and mutations in the lysyl hydroxylase 2 (LH2) gene lead to Bruck syndrome (44). A deficiency in the prolyl 3-hydroxylase 1 gene causes a severe form of osteogenesis imperfecta (5). The availability of the collagen glycosyltransferase genes will enable comprehensive investigation of this posttranslational modification in cellular and animal models and possibly in human diseases.
MS peptide analysis. The eluted fractions from the affinity chromatography were desalted and concentrated with Amicon Ultra 10 cartridges (Millipore). Two-microgram portions of protein were reduced in 0.6 M Tris (pH 8.5)-50 mM DTT for 5 min at 80°C and alkylated for 40 min at room temperature in the dark by the addition of iodoacetamide (final concentration 200 mM; Sigma-Aldrich) and desalted by adding 9 volumes of ice-cold methanol for 18 h on ice. Alkylated proteins were digested for 18 h at 37°C with 0.01 µg trypsin (Roche). ZipTip (Millipore) purified peptides were then analyzed by liquid chromatography-mass spectrometry (MS). The desalted peptide digest was adjusted to 0.2% formic acid-3% acetonitrile (ACN) and directly injected onto a custom-packed 80-mm by 0.075-mm ProntoSil-Pur C18 AQ (3 µm, 200 Å) column (Bischoff GmbH, Leonberg, Germany), connected to an LTQ-ICR-FT mass spectrometer (Thermo Scientific, Bremen, Germany). The peptides were eluted with a binary gradient of solvents A (3% ACN, 0.2% formic acid) and B (80% ACN, 0.2% formic acid) using an Eksigent-Nano high-performance liquid chromatography (HPLC) system (Eksigent Technologies, Dublin, Ireland). The column was flushed for 16 min at a flow rate of 500 nl/min with 100% buffer A. Buffer B was increased to 3% over 5 min, to 60% over 50 min, and to 100% over 3 min and then held at 100% for 7 min. During gradient elution, the flow rate was maintained at 200 nl/min. The mass spectral data were acquired in the mass range of 300 to 2,000 m/z. Data-dependent MS/MS spectra for up to four of the most intense ions with a higher charge state than 1+ were recorded using collision-induced dissociation. Target ions already selected for MS/MS were dynamically excluded for 60 s. Peptide signals exceeding 500 counts were subjected to collision-induced dissociation with a normalized collision energy of 32%.
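The elution program described above can be written out as a small lookup, which makes the total run time and the solvent composition at any time point easy to check. The following is a minimal Python sketch, assuming linear ramps within each step (the text gives only the end points); the names SEGMENTS, percent_b, and TOTAL_MIN are illustrative and not part of the original method.

```python
# Gradient steps taken from the text: (duration in min, %B at the end of the step).
# The first step is the 16-min flush at 100% buffer A, i.e., 0% B.
SEGMENTS = [(16, 0.0), (5, 3.0), (50, 60.0), (3, 100.0), (7, 100.0)]

def percent_b(t_min):
    """Return %B at time t_min, assuming a linear ramp within each step."""
    if t_min < 0:
        raise ValueError("time must be non-negative")
    start_t, start_b = 0.0, 0.0
    for duration, end_b in SEGMENTS:
        end_t = start_t + duration
        if t_min <= end_t:
            frac = (t_min - start_t) / duration
            return start_b + frac * (end_b - start_b)
        start_t, start_b = end_t, end_b
    return start_b  # after the last step, keep the final composition

# Total programmed time in minutes (flush + ramps + hold).
TOTAL_MIN = sum(duration for duration, _ in SEGMENTS)
```

Under these assumptions, the program runs for 81 min in total, and the midpoint of the 50-min ramp (46 min after the start) corresponds to 31.5% B.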
MS and MS/MS data were searched using Mascot Server 2.1 (Matrix Science, London, United Kingdom) as the search engine. Modifications used include carbamidomethylation (Cys, fixed) and oxidation (Met, variable). The monoisotopic masses of +2 and +3 charged peptides were searched with a peptide tolerance of 2 ppm and an MS/MS tolerance of 0.8 Da. MS/MS spectra were searched against the UniRef100 20051018 database (2,764,545 sequences; 1,015,909,965 residues) downloaded from the European Bioinformatics Institute (http://www.ebi.ac.uk/uniprot/database/download.html) and the Gallus gallus predicted protein database (ftp://ftp.ensembl.org/pub/release-51/fasta/gallus_gallus/pep/).
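To give a sense of how narrow a 2-ppm peptide tolerance is, the sketch below converts a neutral monoisotopic mass into the m/z of a multiply protonated ion and computes the accepted window. It is an illustration only: the 1,500-Da peptide mass used in the usage note is hypothetical, and the function names are not taken from Mascot.

```python
PROTON_MASS = 1.007276  # Da

def mz(neutral_mass, charge):
    """m/z of an ion carrying `charge` protons."""
    return (neutral_mass + charge * PROTON_MASS) / charge

def ppm_window(value, tol_ppm=2.0):
    """Lower and upper bounds of a +/- tol_ppm window around `value`."""
    delta = value * tol_ppm / 1e6
    return value - delta, value + delta

# Hypothetical doubly charged peptide of 1,500.0 Da (not from the paper).
example_mz = mz(1500.0, 2)
low, high = ppm_window(example_mz)
```

For this hypothetical ion at m/z of about 751.007, the window spans roughly ±0.0015 m/z units, which is far tighter than the 0.8-Da tolerance applied to the MS/MS fragments.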
Cloning and protein expression. The GLT25D2, LH3, and MBL cDNAs were purchased from the RZPD repository (Berlin, Germany). The GLT25D1 and cerebral endothelial cell adhesion molecule 1 (CEECAM1) cDNAs were cloned by reverse transcription-PCR (RT-PCR) from human fibroblast total RNA using the primers 5'-ATCTGAATTCCCTTTAAGGCGCGGCCAGAGTC-3' and 5'-ATGTCTAGATGGAGCCTGGGCCACCGATG-3' for GLT25D1 and 5'-CGTAGAATTCGAGAGCTCCGGGGGCCGCT-3' and 5'-GACTATCTAGAGTAGTGGCCTGCTCCTGGAC-3' (Microsynth, Switzerland) for CEECAM1. The RT-PCR products were subcloned as EcoRI-XbaI fragments into the pFastBacI baculovirus transfer vector (Invitrogen). The MBL cDNA was subcloned into the EcoRI site of the pFmel-protA vector (48) to yield a protein A fusion protein. The corresponding 732-bp MBL fragment was amplified with the primers 5'-ATCGAATTCATGGTGGCAGCGTCTTACTC-3' and 5'-ATCGAATTCAGGAGGGCCTGAGTGATATG-3'. Recombinant baculoviruses were produced in Spodoptera frugiperda Sf9 cells as described previously (13). Protein A-tagged MBL was coexpressed together with LH3, purified from the supernatant of infected Sf9 cells by immunoglobulin G Sepharose chromatography (48), and subsequently used as an acceptor for the enzymatic activity assay. Expression of the recombinant enzymes was analyzed by 10% sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE). Prior to electrophoresis, proteins were enriched by concanavalin A Sepharose (GE Healthcare) chromatography. Protein bands were excised from the SDS-PAGE gel, digested in gel with trypsin according to the method of Shevchenko et al. (34), and identified by MS peptide analysis.
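Because the RT-PCR products were subcloned as EcoRI-XbaI fragments, each forward primer should carry an EcoRI site (GAATTC) and each reverse primer an XbaI site (TCTAGA). The short Python sketch below checks this for the GLT25D1 and CEECAM1 primers listed above. The dictionary keys (for example GLT25D1_fwd) are labels introduced here for illustration; the sequences and recognition sites are the only inputs taken from the text or from standard enzyme specifications.

```python
# Recognition sequences of the two enzymes used for subcloning (both palindromic).
SITES = {"EcoRI": "GAATTC", "XbaI": "TCTAGA"}

# Primer sequences as given in the text (5' to 3'); labels are illustrative.
PRIMERS = {
    "GLT25D1_fwd": "ATCTGAATTCCCTTTAAGGCGCGGCCAGAGTC",
    "GLT25D1_rev": "ATGTCTAGATGGAGCCTGGGCCACCGATG",
    "CEECAM1_fwd": "CGTAGAATTCGAGAGCTCCGGGGGCCGCT",
    "CEECAM1_rev": "GACTATCTAGAGTAGTGGCCTGCTCCTGGAC",
}

def find_sites(primer):
    """Map each enzyme whose site occurs in the primer to its 0-based position."""
    return {
        enzyme: primer.find(site)
        for enzyme, site in SITES.items()
        if site in primer
    }

# Summary of the sites found in every primer.
SITE_MAP = {name: find_sites(seq) for name, seq in PRIMERS.items()}
```

With these inputs, both forward primers contain only the EcoRI site and both reverse primers contain only the XbaI site, which is consistent with directional cloning into the transfer vector.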
Preparation of ColGalT acceptors. Bovine Achilles collagen type I, bovine nasal septum collagen type II, and human placenta collagen types III, IV, and V (Sigma) were deglycosylated by trifluoromethane sulfonic acid (TFMS)-mediated cleavage (7, 38). Acceptor proteins (50 µg) were lyophilized, followed by an incubation in a dry ice-ethanol bath for 20 min. Proteins were dissolved in 50 µl TFMS-toluene (16.6:1 [vol:vol]) (Sigma-Aldrich). Reactions were subsequently incubated at –20°C for 24 h and then neutralized with 150 µl pyridine-H2O (2:1 [vol:vol]) in the dry ice-ethanol bath, followed by 15 min of incubation on ice. The sample was mixed with 400 µl 50 mM ammonium acetate and dialyzed overnight against 2.5 liters of 50 mM ammonium acetate.
Collagen glycosyltransferase assays. Baculovirus-infected Sf9 cells were lysed in 1% Triton X-100-TBS (pH 7.4) for 10 min on ice, and the postnuclear supernatant was used as an enzyme source. Collagen was heat denatured for 10 min at 60°C in sodium acetate (pH 6.8) and rapidly cooled to 0°C before use. Assays were performed with 10 µl of Sf9 postnuclear supernatant in a final volume of 100 µl containing 0.5 mg/ml collagen acceptors, 60 µM UDP-Gal or UDP-Glc, 50,000 cpm UDP-[14C]Gal or UDP-[14C]Glc (GE Healthcare), 10 mM MnCl2, 20 mM NaCl, 50 mM morpholinepropanesulfonic acid (pH 7.4), and 1 mM DTT. Reactions were incubated for 3 h at 37°C and stopped by the addition of 500 µl of ice-cold 5% TCA-5% phosphotungstic acid. Enzymatic activity assays for Km analysis were performed as described above but with various amounts of either collagen type I or UDP-Gal as substrates.
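The assay composition above fixes the specific radioactivity of the donor sugar, which is what allows precipitated counts to be converted into amounts of sugar transferred. The following Python sketch shows that arithmetic. It assumes that the mass of the radiolabeled tracer is negligible compared with the 60 µM unlabeled donor and that a blank value is subtracted; the sample and blank counts in the usage note are hypothetical and are not data from this study.

```python
def specific_radioactivity(cpm_added, donor_um, volume_ul):
    """cpm per pmol of donor sugar in the assay (µM x µl = pmol)."""
    donor_pmol = donor_um * volume_ul
    return cpm_added / donor_pmol

def transferred_pmol(sample_cpm, blank_cpm, cpm_per_pmol):
    """Sugar transferred to the acceptor, after blank subtraction."""
    net_cpm = max(sample_cpm - blank_cpm, 0.0)
    return net_cpm / cpm_per_pmol

# Assay conditions from the text: 50,000 cpm, 60 µM donor, 100 µl final volume.
CPM_PER_PMOL = specific_radioactivity(50_000, 60, 100)

# Hypothetical counts (not from the paper): 2,500 cpm sample, 500 cpm blank.
example_pmol = transferred_pmol(2_500, 500, CPM_PER_PMOL)
```

Under these assumptions, the 100-µl reaction contains 6 nmol of donor at about 8.3 cpm/pmol, so the hypothetical net signal of 2,000 cpm would correspond to 240 pmol of transferred sugar.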
Amino acid analysis. The reaction products of the collagen galactosyltransferase assays were hydrolyzed in 4 M NaOH for 72 h at 105°C, and the resulting single amino acids were derivatized with 9-fluorenylmethoxy carbonyl according to the method of Bank et al. (2). Reverse-phase HPLC (LaChrom Hitachi; Merck) of single amino acids was performed on an ODS Hypersil column (150 by 3 mm, 3-µm particle size; Thermo Electron Corporation) at 40°C. The galactosylated Hyl (GHyl) and galactosyl-glucosylated Hyl (GGHyl) standards were kindly provided by Ruggero Tenni (University of Pavia) (42). Amino acids were separated at a flow rate of 0.2 ml/min using a gradient elution with the solvents 0.5 M citric acid, 5 mM (CH3)4NCl, pH 2.85 (A); 80% of 20 mM sodium acetate trihydrate, 5 mM (CH3)4NCl, pH 4.5, 20% of methanol (B); and 100% of ACN (C). Radiolabeled [3H]Val and [14C]Tyr (Moravek Biochemicals and Radiochemicals) were used as internal standards. Radioactivity was counted in a β counter (Tri-Carb 2900TR; Packard). For β-galactosidase digestion of GHyl, hydrolyzed amino acids were loaded on AG 50W-X8(H+) resin (Bio-Rad), washed with 0.8% acetic acid, and eluted with 5% ammonia. After removal of ammonia by lyophilization, the samples were digested with 10 mU of bovine testis β-galactosidase (QA-Bio, San Mateo, CA) in 100 mM sodium citrate, pH 4.3, for 16 h at 37°C. Liberated Gal was separated from GHyl by passage through AG 50W-X8(H+), whereas GHyl was released by elution with 5% ammonia.
RNA interference. Lentivirus particles expressing the short hairpin RNA constructs TRCN0000034884, TRCN0000034885, TRCN0000034887, and TRCN0000034888, targeting human GLT25D1 (MISSION shRNA NM_024656; Sigma) were produced in HEK293T cells as described previously (10). Aliquots of 500 µl of lentivirus-producing HEK293T cell supernatants were added to 60,000 HeLa cells for 24 h. Cells expressing the short hairpin RNA constructs were selected by treatment with 2.5 µg/ml of puromycin for 10 days. Silencing of the GLT25D1 gene was monitored by quantitative RT-PCR (SYBR Green JumpStart Taq ReadyMix; Sigma) using the primers 5'-ATTGCGCGCCCACAGCAC-3' and 5'-GGTGGGAGCCGAGATGAAGC-3'. The expression of the GLT25D2 and glyceraldehyde-3-phosphate dehydrogenase genes in HeLa cells was determined using the primers 5'-GATAACATTGACCAGGCTCAG-3', 5'-CCCAAAAGGATTGGCTCCAAC-3', 5'-ATGCTGGCGCTGAGTACGTCGTG-3', and 5'-GTGATGGCATGGACTGTGGTCAT-3', respectively.
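The text does not state how the quantitative RT-PCR signals were normalized. One common approach, shown here purely as an assumption, is the comparative threshold cycle (2^-ΔΔCt) method with glyceraldehyde-3-phosphate dehydrogenase as the reference gene. The Python sketch below implements that calculation; all Ct values in the usage note are hypothetical.

```python
def relative_expression(target_ct, ref_ct, control_target_ct, control_ref_ct):
    """Fold change of the target gene versus the control sample (2^-ddCt).

    Assumes roughly 100% amplification efficiency for both primer pairs.
    """
    delta_sample = target_ct - ref_ct
    delta_control = control_target_ct - control_ref_ct
    return 2.0 ** -(delta_sample - delta_control)

# Hypothetical Ct values (not from the paper): the target crosses the
# threshold two cycles later in silenced cells than in wild-type cells,
# while the reference gene is unchanged.
knockdown_level = relative_expression(26.0, 18.0, 24.0, 18.0)
```

With these hypothetical values, the silenced cells would retain 25% of the wild-type transcript level.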
Northern blotting. The GLT25D1, GLT25D2, and CEECAM1 cDNA probes were synthesized by PCR using the primer pairs 5'-GATGAGGCCGAGAGCTTCATGC-3' and 5'-GCATGAAGCTCTCGGCCTCATC-3', 5'-AAGCAGGCATCCAGATGTACC-3' and 5'-TCCAGCTGAGCCTGGTCAATG-3', and 5'-GTGGATGGCTGGATGCTCAAC-3' and 5'-GACTATCTAGAGTAGTGGCCTGCTCCTGGAC-3', respectively. The resulting 676-bp-long GLT25D1, 559-bp-long GLT25D2, and 785-bp-long CEECAM1 probes were labeled with [α-32P]dCTP (Hartman Analytic, Germany) by random priming (Stratagene). Multiple human tissue RNA blots (MTE array 3 [BD Bioscience] and First Choice Northern Human Blot 1 [Ambion]) were prehybridized with the QuikHyb hybridization solution (Stratagene) containing 100 µg/ml ultra-pure herring sperm DNA (Invitrogen) for 1 h at 65°C and then hybridized with 5 × 10^5 cpm of each labeled probe overnight at 65°C. The arrays were washed in 0.1x SSC (1x SSC is 0.15 M NaCl plus 0.015 M sodium citrate)-0.1% SDS up to 60°C and exposed on BioMax XAR film (Kodak) for 24 h at –80°C.
Nucleotide sequence accession numbers. The nucleotide sequences reported in this paper correspond to the GenBank/EBI data bank entries with the accession numbers NM_024656, NM_015101, and NM_016174.
One of the candidate proteins identified by tandem MS peptide sequencing was the putative glycosyltransferase GLT25D2 (Fig. 1A) (see Fig. S1 in the supplemental material). GLT25D2 is a type II transmembrane protein of 626 amino acids, including four N-glycosylation sites and the ER retrieval signal RDEL at the C terminus (Fig. 1B). No enzymatic activity had previously been attributed to GLT25D2, but database annotations pointed to sequence homology with bacterial enzymes involved in lipopolysaccharide biosynthesis. Proteins similar to chicken GLT25D2 could be deduced from all metazoan genomes. In the human genome, GLT25D2 was found to be strongly similar to two proteins, namely, GLT25D1 and CEECAM1. The three proteins contained N-glycosylation sites and the ER retrieval signal RDEL at the C terminus and shared more than 50% sequence identity (Fig. 2).
FIG. 1. ColGalT identification by mass spectrometry. Proteins isolated by affinity chromatography were analyzed by liquid chromatography-MS. (A) Peptide fragment spectra of two peptides identifying GLT25D2. (B) Protein sequence of Gallus gallus GLT25D2. The two identifying peptides are shaded in gray, the four potential N-glycosylation sites are underlined, and the ER retrieval signal is shown in bold.
FIG. 2. Protein alignment. The three putative human ColGalT enzymes share a high degree of sequence identity (63% between GLT25D1 and GLT25D2, 50% between GLT25D2 and CEECAM1, and 55% between GLT25D1 and CEECAM1). The proteins include the C-terminal RDEL ER retrieval motif. Black squares represent amino acids identical or similar in all three proteins; gray squares represent amino acids identical or similar in two of the proteins.
TABLE 1. ColGalT activities measured in Sf9 cell lysates
FIG. 3. ColGalT activity toward MBL. MBL was produced in Sf9 cells coinfected with a baculovirus expressing LH3. A ColGalT activity assay was performed as described in Materials and Methods. Bars indicate the means for four assays. Error bars indicate the standard deviations.
α(1-2)glucosylation of Hyl (12, 47). Surprisingly, we could not detect any significant ColGalT activity for LH3 under our assay conditions using bovine Achilles collagen type I as an acceptor (Fig. 4A). However, as described previously (12, 47), we did measure a low collagen glucosyltransferase activity for LH3, whereas GLT25D1, GLT25D2, and CEECAM1 failed to show any significant collagen glucosyltransferase activities (Fig. 4B). Although no collagen glycosyltransferase activity could be attributed to CEECAM1, we did confirm that the recombinant protein was expressed in Sf9 cells as were GLT25D1, GLT25D2, and LH3, as shown by SDS-PAGE (Fig. 4C). To confirm the identities of the GLT25D1, GLT25D2, CEECAM1, and LH3 proteins, the corresponding bands were excised from the gel, digested with trypsin, and submitted to tandem MS peptide sequencing (data not shown). However, even though CEECAM1 was shown to be expressed, it is still possible that its levels were too low for activity to be detected.
FIG. 4. Time course of baculovirus-mediated protein expression in Sf9 cells. ColGalT activity (A) or collagen glucosyltransferase activity (B) was measured in cells expressing GLT25D1, GLT25D2, CEECAM1, or LH3. Bovine Achilles collagen type I was used as an acceptor substrate. The activity measured in Sf9 cells infected with an empty baculovirus is shown in both panels with filled squares. Values indicate the means for four assays. Error bars indicate the standard deviations. (C) SDS-PAGE of recombinantly expressed proteins. Arrows indicate the recombinant protein bands, as confirmed by liquid chromatography-MS-mediated protein sequencing.
FIG. 5. Determination of the apparent Km values of GLT25D1 and GLT25D2. (A) Lineweaver-Burk plot for GLT25D1 on collagen, with the calculated Michaelis-Menten constant of 13.6 g/liter. (B) Lineweaver-Burk plot for GLT25D1 on UDP-Gal, with the calculated Michaelis-Menten constant of 18.77 µM. (C) Lineweaver-Burk plot for GLT25D2 on collagen type I, with the calculated Michaelis-Menten constant of 9.8 g/liter. (D) Lineweaver-Burk plot for GLT25D2 on UDP-Gal, with the calculated Michaelis-Menten constant of 33.53 µM.
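A Lineweaver-Burk analysis of the kind summarized in Fig. 5 fits a straight line to 1/v versus 1/[S]; the intercept gives 1/Vmax and the slope gives Km/Vmax. The Python sketch below illustrates the calculation with an ordinary least-squares fit. The data points are synthetic, generated from the Michaelis-Menten equation with the Km reported for GLT25D1 on UDP-Gal (18.77 µM) and an arbitrary Vmax of 100; they are not the measurements underlying the figure, and the function name is illustrative.

```python
def lineweaver_burk(substrate, velocity):
    """Estimate (Km, Vmax) from a least-squares line through 1/v vs 1/[S]."""
    xs = [1.0 / s for s in substrate]
    ys = [1.0 / v for v in velocity]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / intercept   # intercept = 1 / Vmax
    km = slope * vmax        # slope = Km / Vmax
    return km, vmax

# Synthetic, noise-free data (assumed Vmax = 100 in arbitrary units).
TRUE_KM, TRUE_VMAX = 18.77, 100.0
conc = [5.0, 10.0, 20.0, 40.0, 80.0]  # µM UDP-Gal, hypothetical series
rate = [TRUE_VMAX * s / (TRUE_KM + s) for s in conc]
km_est, vmax_est = lineweaver_burk(conc, rate)
```

On noise-free data the fit returns the input constants. With real measurements, the double-reciprocal transform gives the most weight to the points at the lowest substrate concentrations, so the choice of concentration series affects the estimate.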
FIG. 6. Product identification by reverse-phase HPLC. The first panel represents an amino acid standard containing the standards for GHyl and GGHyl. The second and third panels show the amino acid profiles of bovine collagen type I and type II hydrolysates, respectively. The lower two panels show the radioactive trace obtained after reaction of collagen type I with GLT25D1 and GLT25D2. [3H]Val and [14C]Tyr were used as internal amino acid standards. Amino acids are marked in single-letter code. Hyp, hydroxyproline.
TABLE 2. β-Galactosidase digestion of GLT25D1 and GLT25D2 reaction products
FIG. 7. Silencing of the GLT25D1 gene. (A) RT-PCR detection of GLT25D1 and GLT25D2 expression in HeLa cells. (B) GLT25D1 mRNA levels in wild-type HeLa cells (black bar) and in GLT25D1-silenced HeLa cells (KD #1 to KD #3, white bars). (C) Relative ColGalT activity in wild-type HeLa cells (black bar) and in GLT25D1-silenced HeLa cells (white bars). (D) Comparison between GLT25D1 mRNA levels and ColGalT activity in wild-type HeLa cells (set to 100%) and those in GLT25D1-silenced HeLa cells.
FIG. 8. Tissue Northern blotting. The mRNA expression patterns of GLT25D1, GLT25D2, and CEECAM1 were analyzed in 10 human tissues (A) or in 36 human tissues and cell lines (B); a representative collection of additional tissues and cell types is shown in Fig. S2 in the supplemental material. PBL, peripheral blood leukocytes.
Alternatively, it is possible that CEECAM1 represents a ColGalT acting on a limited set of substrates. The screening of additional proteins that include collagen domains, such as adiponectin, the acetylcholinesterase complex COLQ, and the complement protein C1q, might confirm this possibility. Finally, CEECAM1 may have lost any enzymatic activity over the course of evolution. In fact, CEECAM1 was first described as an adhesion protein (41) which might function as a carbohydrate-binding protein at the cell surface. However, the presence of the C-terminal RDEL motif would suggest that CEECAM1 is maintained in the ER.
The lysyl hydroxylase LH3 protein has been previously reported to possess three enzymatic activities, namely, a lysyl hydroxylase, a ColGalT, and a collagen glucosyltransferase activity (47). The glycosyltransferase activities attributed to LH3 were very low, casting doubt on their biological significance (27). The ColGalT activity of LH3 reported previously reached approximately twice the levels of endogenous ColGalT activity measured in Sf9 cells (47). It is possible that we could not distinguish the ColGalT activity of LH3 from the background activity levels in our assays. By comparison, the strong ColGalT activities described here for GLT25D1 and GLT25D2 imply that these proteins indeed represent true ColGalT enzymes. The dual glycosyltransferase activity of LH3 certainly requires closer attention, since it is expected that the catalysis of both β(1-O) and α(1-2) linkages would require distinct domains responsible for the retaining α(1-2) and inverting β(1-O) glycosyltransferase activities (26).
The identification of the GLT25D1 and GLT25D2 genes as encoding two ColGalT enzymes opens new ways to investigate the biological significance of collagen glycosylation. Genes similar to the human GLT25D1 and GLT25D2 genes can be found in all metazoan genomes sequenced to date. For example, the Caenorhabditis elegans gene D2045.9 represents the probable ortholog of the human GLT25D1 and GLT25D2 genes. Knockdown of the D2045.9 gene by RNA interference yields multiple abnormalities, such as deformed mating organs, slow growth, and uncoordinated locomotion (15, 36). By comparison, the loss of the lysyl hydroxylase gene let-268 leads to a lethal phenotype associated with a defect in collagen type IV secretion (21, 25). The conservation of collagen glycosylation throughout animals and the essential role of collagen glycosylation in worms emphasize the importance of this modification. In humans, the GLT25D1 and GLT25D2 genes are found on human chromosome 19p13 and chromosome 1q25, respectively. The involvement of ColGalTs in the pathogenesis of connective tissue disorders linked to chromosomes 19p13 and 1q25, such as psoriasis (19) and epidermolysis bullosa (8, 18), can now be straightforwardly documented.
This work was supported by Telethon Action Suisse, the Wolfermann-Nägeli Foundation, and grants from the Swiss National Science Foundation (PP00A-106756 and 3100A0-116039).
Published ahead of print on 15 December 2008.
Supplemental material for this article may be found at http://mcb.asm.org/.