Previous Article | Next Article ![]()
Molecular and Cellular Biology, March 2002, p. 1402-1411, Vol. 22, No. 5
0270-7306/02/$04.00+0 DOI: 10.1128/MCB.22.5.1402-1411.2002
Copyright © 2002, American Society for Microbiology. All Rights Reserved.
Department of Biochemistry, University of Nebraska, Lincoln, Nebraska 68588,1 Section on the Molecular Biology of Selenium, Basic Research Laboratory, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 208922
Received 31 July 2001/ Returned for modification 10 September 2001/ Accepted 28 November 2001
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
Despite having its own codon and specific factors for its biosynthesis and insertion into protein, Sec is a rare amino acid. Only 17 proteins in mammals have been reported to contain Sec. These proteins do not have a common amino acid sequence motif, biological function, tissue expression, or intracellular localization. Hence, sequence analysis is generally not sufficient to determine the presence of Sec in protein. Thus far, the only common feature in eukaryotic selenoprotein-encoding genes (besides Sec-encoding UGA) is the presence of a stem-loop structure, called the Sec insertion sequence (SECIS) element, in the 3' untranslated region (UTR) (19). This structure is essential for insertion of Sec in response to UGA.
The functional part of a eukaryotic SECIS element is composed of a helix containing non-Watson-Crick base pairs UGAN. . . . .NGAN (designated the quartet or core), an unpaired A preceding the quartet, and an unpaired AA motif in the apical loop or bulge that is separated from the quartet by 11 to 12 nucleotides (3, 24). While having low sequence conservation, the secondary structure and free energy of eukaryotic SECIS elements are strictly conserved and can help in the identification of these stem-loop structures in nucleotide sequence databases (14, 18). It has been established that the quartet is involved in the interaction with SECIS-binding protein 2 (SBP2) (5), which is, in turn, essential for the formation of a complex with the Sec-specific elongation factor and Sec tRNA (7, 22). This complex functions by inserting Sec in response to in-frame UGA codons and preventing termination of translation at this site (1). The role of the unpaired AA motif in this process has not been established, although it is invariantly present in every eukaryotic selenoprotein mRNA identified thus far and its mutation results in a dramatic decrease in Sec incorporation into nascent polypeptide chains (2). In this work, we identified a new eukaryotic selenoprotein, designated selenoprotein M (SelM). This periplasmic protein was not detected by major public and private genome projects. We found that Sec is inserted into this protein in response to a new form of SECIS element that lacks the invariant adenosines.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Constructs with GFP.
Green fluorescent protein (GFP)-SelM (wild-type SelM), GFP-SelM(Sec TGA>TGT) (the Sec codon, TGA, was mutated to a cysteine codon, TGT), GFP-SelM(quartet TGA>ACT) (the TGA triplet in the SECIS element was changed to ACT to prevent Sec incorporation), GFP-SelM(CCdel) (the CC at positions 567 and 568 was deleted), GFP-SelM(mouse loop CC>AA) (the CC at positions 567 and 568 was changed to AA), GFP-SelM(mouse loop CC>GG) (the CC at positions 567 and 568 was changed to GG), GFP-SelM(mouse loop CC>TT) (the CC at positions 567 and 568 was changed to TT), and GFP-SelM(human loop CC>AA) (the CC at positions 567 and 568 was changed to AA, and in addition, the apical loop of the mouse SECIS element was replaced with a human sequence) constructs were developed by using expression vector pEGFP-C3 (Clontech). The mouse SelM cDNA and mutant plasmids were amplified with primers Sel-15-2M (5'-CGCAACGTCGACATGAGCATCCTACTGTCG-3') and T3 and cloned into the XhoI/Bsp120I sites of pEGFP-C3. GFP-SelM(CC>AA), GFP-SelM(CCdel), GFP-SelM(CC>TT), GFP-SelM(CC>GG), and GFP-SelM(human loop CC>AA) were obtained by the QuickChange site-directed mutagenesis method (Stratagene) using primers SelM(AA>CC) (5"-GAATGAAGCGCTCAGTATAACGGGAGCATCTCCCTTG-3" and 5"-CAAGGGAGATGCTCCCGTTATACTGAGCGCTTCATTC-3"), SelM(human loop AA>CC) (5"-GCGCTCAGCATAACGGGAATACTTCTCTTGCTGAGGGCCGA-3" and 5"-TCGGCCCTCAGCAAGAGAAGTATTCCCGTTATGCTGAGCGC-3"), SelM(CCdel) (5"-GAATGAAGCGCTCAGTATCGGGAGCATCTCC-3" and 5"-GGAGATGCTCCCGATACTGAGCGCTTCATTC-3"), SelM(CC>TT) (5"-GAATGAAGCGCTCAGTATTTCGGGAGCATCTCC-3" and 5"-GGAGATGCTCCCGAAATACTGAGCGCTTCATTC-3"), and SelM(CC>GG) (5"-GAATGAAGCGCTCAGTATGGCGGGAGCATCTCC-3" and 5"-GGAGATGCTCCCGCCATACTGAGCGCTTCATTC-3"), respectively. The GFP-SelMh construct was obtained by amplification of the mutant (U48C) human SelM cDNA with primers Sal-15-2H (5"-CGCATCGTCGACATGAGCCTCCTGTTGCCTCCGCTGG-3") and T3 and cloning of the product into the XhoI/Bsp120I sites of pEGFP-C3. The SelMh-GFP construct was obtained by amplification of a mutant (U48C) human SelM cDNA with primers T7 and Xho-15-2H (5"-GCCACTCGAGGTCAGCGTGGTCCGAAG-3") and cloning of the product into the XhoI site of pEGFP-N1 (Clontech). The fragment encoding N-terminal sequences of SelM was obtained by amplification of human cDNA with primers T7-Nhe (5"-CGATGCTAGCTAATACGACTCACTATAGGG-3") and Age-15-2H (5"-CGAGACCGGTAGGCCGCTCAGACGGTTCCAGTC-3"). The N-GFP-SelMh and N-GFP constructs were made by cloning this fragment into the NheI/AgeI sites of GFP-SelMh and pEGFP-N1, respectively. The N-GFP-SelMh
construct, which codes for a SelM form lacking four C-terminal residues, was obtained by mutagenesis of N-GFP-SelMh with 15-2h-142stopF (5"-CCAGAGGAAACTTCGGACTAGGCTGACCTGTAGGTCCG-3") and 15-2h-142stopR (5"-CGGACCTACAGGTCAGCCTAGTCCGAAGTTTCCTCTGG-3"). All plasmids were transformed into Escherichia coli strain NovaBlue (Novagen), and the plasmids were isolated with a Plasmid Maxi Kit (Qiagen).
Cell growth, transfection, and metabolic labeling with 75Se. Transfection of CV-1, NIH 3T3, and human embryonic kidney (HEK) 293 cells and metabolic labeling of cells with 75Se were carried out as previously described (14). For CV-1 cells, 5 µg of DNA and 30 µl of Lipofectamine (Gibco BRL) were used for transfection of each 60-mm-diameter plate. For NIH 3T3 cells, transfection was carried out with 4 µg of DNA, 20 µl of Lipofectamine, and 12 µl of PLUS Reagent (Gibco BRL). HEK 293 cells were transfected by the calcium phosphate method (21) using 8.8 µg of DNA. The samples were analyzed on sodium dodecyl sulfate (SDS)-10% NuPAGE gels (Novex). 75Se-labeled proteins were visualized with a Storm PhosphorImager system (Molecular Dynamics).
Dual fluorescence imaging confocal microscopy. CV-1 cells cultured in 60-mm-diameter culture dishes were transfected with the appropriate constructs in the presence of Lipofectamine (Gibco BRL) and incubated for 12 h in a CO2 incubator. A fluorescent ceramide was used as a reference marker for perinuclear structures (endoplasmic reticulum [ER] and Golgi). This reagent has been shown to accumulate in the ER and Golgi and has been previously used to study protein trafficking (11, 12). The transfected cells were rinsed with serum-free Dulbecco modified Eagle medium-10 mM HEPES and then incubated for 25 min at room temperature in the same medium containing 2 µM BODIPY TR ceramide (Molecular Probes). The cells were washed twice in serum-free Dulbecco modified Eagle medium-10 mM HEPES and immediately used for image collection. Double-labeled images of live cells were collected with a water immersion lens using a dual excitation/emission and dual-channel mode on a Bio-Rad MRC1024ES laser scanning microscope.
Northern blot analysis.
A Mouse Adult Tissue Blot (Seegene) was probed with a 0.7-kb 32P-labeled XhoI-BamHI fragment of mouse SelM. Northern Territory-Human Tumor Panel Blots IV and V (Invitrogen) were probed individually with a labeled 0.7-kb human SelM cDNA. Probes were generated by a Rediprime II random prime labeling system (Amersham Pharmacia Biotech) in accordance with the manufacturer's protocol. To analyze mRNA expression of GFP-SelM constructs, total RNA was isolated from transfected CV-1 cells (
106) with an RNAqueous Kit (Ambion). RNA was loaded onto a denaturing agarose gel and transferred to a Zeta-Probe Blotting Membrane (Bio-Rad). The membrane was probed with human SelM cDNA as a probe and, after stripping, with a 32P-labeled DECAtemplate-ß-actin-mouse template (Ambion) as an internal control.
Nucleotide sequence accession numbers. The sequence data obtained in this study were submitted to the DDBJ/EMBL/GenBank databases under accession numbers AY043487 (human SelM) and AY043488 (mouse SelM).
| RESULTS |
|---|
|
|
|---|
|
1.5% of the human genome and is rich in protein-encoding genes (545 genes have been identified) and pseudogenes (134 pseudogenes have been identified). However, the SelM-encoding gene was not correctly annotated by either the public Human Genome Project (17) or the Celera private project (23).
|
44-kDa selenoprotein. The fusion selenoprotein was designed such that its mobility on SDS-polyacrylamide gel electrophoresis would be different from those of major naturally occurring selenoproteins expressed in mammalian cells. Indeed, metabolic labeling of cells with 75Se revealed a 44-kDa selenoprotein band (wild-type lanes in Fig. 2A to C). This band was not present in cells transfected with a construct in which the Sec-encoding TGA codon was mutated to a cysteine codon (Sec TGA>TGT in Fig. 2A to C).
|
|
We further tested whether the formation of a typical SECIS element by replacement of CC with AA would change the efficiency of Sec insertion into SelM. For this purpose, we developed a SelM construct containing a mouse SelM SECIS element in which AA was present in place of CC (Fig. 4B). A construct was also developed in which the mouse SECIS element had AA in place of CC, and in addition, its apical loop and minihelix were replaced with the corresponding human sequences (Fig. 4C). Interestingly, these changes had little effect on Sec insertion into mouse SelM expressed in CV-1, NIH 3T3, and HEK 293 cells (Fig. 2). However, other mutations, including deletion of the CC motif (Fig. 4D) and replacement of the CC with UU (Fig. 4E) and GG (Fig. 4F), completely disrupted Sec insertion into SelM (Fig. 2C). Thus, unpaired cytidines are essential for SelM SECIS element function and adenosines, but not other residues, could be tolerated at this position.
|
Further analysis revealed that within SelM-encoding genes, the CC-containing form of the SECIS element was restricted to mammalian genes. Indeed, in contrast to the type 2 SECIS-like CC-containing mammalian structure, the zebra fish SelM-encoding gene contained a typical type 1 SECIS element that exhibited 60% identity to the human SelM SECIS element but had an AA sequence in place of CC and lacked a minihelix (Fig. 3D). This SECIS element was easily recognized when the zebra fish database of ESTs (dbEST) was analyzed with SECISearch. In fact, zebra fish SelM was independently identified as a selenoprotein by applying SECISearch to zebra fish dbEST (Kryukov et al., unpublished). It is possible that, during evolution, SelM SECIS elements not only evolved into either type 1 or type 2 structures but also gave rise to a structure that differs from any other known SECIS element by the presence of cytidines in the loop. These observations and the apparent evolutionary linkage between mammalian and zebra fish SelM SECIS elements provided further support for the conclusion that mammalian SelM has a new form of SECIS element.
SelM is distantly homologous to Sep15, but its Sec-containing motif resembles those of SelW and SelT. SelM exhibited no homology to any known protein in the nonredundant database when analyzed by default BLAST programs. However, the use of advanced sequence analysis tools revealed a distant homology to the 15-kDa selenoprotein (Sep15) (31% identity in a 73-amino-acid overlap). Moreover, the location of Sec was conserved between these two proteins (Fig. 5A). However, Sep15 had a Cys-Gly-Sec-Lys motif whereas SelM contained Sec in a Cys-Gly-Gly-Sec motif. The latter was similar to sequences found in eukaryotic selenoproteins SelW and SelT. Interestingly, one of the zebra fish SelW forms also had the Cys-Gly-Gly-Sec motif (15).
|
-helix (Fig. 5B). The presence of CxxU and CxxC motifs upstream of an
-helix is rare in proteins and is often characteristic of a redox center (V. N. Gladyshev, unpublished data). For example, thioredoxins, protein disulfide isomerase, glutaredoxins, and other disulfide oxidoreductases contain the redox CxxC motif that is located upstream of an
-helix and serves as the protein active center (20). Expression of SelM in mammalian tissues. Analyses of the GenBank EST database revealed the presence of 91 full or partial cDNA sequences that matched the human SelM cDNA and more than 50 mouse ESTs. These clones were derived from a variety of different organs and tissues, suggesting a very broad spectrum of moderate expression of SelM mRNAs in many cell types. Direct analyses of SelM mRNA expression in mouse tissues by Northern blot assays also revealed expression of SelM in various tissues. The highest levels of SelM mRNA were observed in the brain (Fig. 6A).
Since SelM is a distant homolog of Sep15, we were interested in comparing the ways in which these two proteins are expressed. Interestingly, Sep15 has been implicated in the role of selenium in cancer prevention and exhibits altered expression in several cancers (10, 16). Thus, we compared matched pairs of tumors and normal samples derived from various human tissues for expression of SelM mRNA and analyzed Sep15 mRNA expression in parallel (Fig. 6B). We found that the expression patterns of the SelM and Sep15 mRNAs were different in human tissues. In addition, although mRNAs for both proteins had altered expression levels in several of the tumors tested (compared to matched control tissues), these changes in mRNA levels did not always correlate between the two proteins.
|
|
|
| DISCUSSION |
|---|
|
|
|---|
CC mutation disrupts Sec insertion (S. V. Novoselov and V. N. Gladyshev, unpublished data). In contrast to all other known eukaryotic SECIS elements, the SECIS element in mammalian SelM does not have adenosines in the apical loop and, instead, contains a CC motif. Nevertheless, this stem-loop structure was functional, as demonstrated by the incorporation of 75Se into the selenoprotein in response to the wild-type SelM SECIS element, but not in response to the structure containing a mutated quartet sequence. The CC motif in the apical loop and the overall SECIS element sequence were conserved among mammalian SelM mRNAs and resembled the type 2 SECIS element (9). In the zebra fish SelM-encoding gene, however, a typical type 1 SECIS element containing the AA sequence was present. It is possible that compensatory changes were responsible for the observed accommodation of CC in the unpaired apical bulge in mammalian sequences.
Our study suggests that the absolutely conserved primary sequence in SECIS elements is limited to the UGA. . . . .GA motif in the quartet, which serves as a recognition site for SBP2. Besides the quartet, the only other recognition feature of the SECIS element is its actual three-dimensional structure. These two factors of the stem-loop structure appear to be important for SECIS element function. In addition to an unpaired motif in the apical loop or bulge, a nucleoside preceding the quartet was thought to be conserved throughout eukaryotic SECIS elements. However, this previously invariant A is replaced with G in the Caenorhabditis elegans thioredoxin reductase (TR)-Se-encoding gene (4) and several other eukaryotic selenoprotein-encoding genes (8) and with C in the mouse TR2-encoding gene (unpublished observations). In addition, replacement of A with G, U, or C supported Sec insertion at
70, 70, and 30% of A, respectively (8). These observations suggest a new consensus structure of the eukaryotic SECIS element, as shown in Fig. 3C.
The identification of SelM is itself of great interest, as only 17 proteins in mammals are known to contain Sec in their polypeptide chains. The actual numbers of selenoproteins in mammalian genomes are not known, but the steady increase in the number of selenoproteins in recent years illustrates the importance of this class of proteins.
SelM appears to be distantly related to Sep15 and has similarities to selenoproteins containing the CxxU motif. The Sec location is conserved in the Sep15 and SelM sequences, but the Sec-flanking sequences are organized differently. In Sep15, Sec is separated from a conserved Cys residue by only a single residue whereas Sec in SelM is separated from a Cys residue by two glycines. The latter tetrapeptide sequence is similar to the Sec centers of SelT and SelW and is, in fact, identical to that of zebra fish SelW (15). In addition, these motifs and secondary structure patterns relate these selenoproteins to thiol/disulfide oxidoreductases. It is thus possible that Sec and Cys in SelM, SelT, and SelW form a reversible selenosulfide bond.
SelM is located in the perinuclear structures, a rare location for selenoproteins. Only its distant homolog Sep15 has been demonstrated to reside in the ER/Golgi structures. Interestingly, Sep15 was found to be associated with UDP-glucose glycoprotein glucosyltransferase, an ER-resident protein that is involved in the quality control of protein folding (13). Sep15 has also been implicated in cancer prevention (10, 16). Analyses of expression patterns of Sep15 and SelM in matched tumor and control samples, described in this report, revealed changes in mRNA expression linked to malignant transformation. However, whether SelM has any role in cancer prevention remains to be established.
It is also of interest that the human SelM-encoding gene is located on human chromosome 22, which is the first chromosome to be sequenced by the Human Genome Project and is known to contain at least 545 protein-encoding genes. However, the SelM-encoding gene was not previously correctly identified, possibly because currently available gene annotation programs recognize in-frame TGA codons as stop signals. The use of such programs is expected to miss selenoprotein-encoding genes, especially those that, like the SelM-encoding gene, have short coding regions and contain Sec in their N-terminal sequences.
| ACKNOWLEDGMENTS |
|---|
We thank You Zhou for helping with microscopy and protein localization experiments and Gregory Kryukov for discussions and help with computer analyses.
This work was supported by NIH grants CA80946 and GM61603 (V.N.G.).
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
2. Berry, M. J., L. Banu, J. W. Harney, and P. R. Larsen. 1993. Functional characterization of the eukaryotic SECIS elements which direct selenocysteine insertion at UGA codons. EMBO J. 12:3315-3322.[Medline]
3. Berry, M. J., G. W. Martin, 3rd, and S. C. Low. 1997. RNA and protein requirements for eukaryotic selenoprotein synthesis. Biomed. Environ. Sci. 10:182-189.[Medline]
4.
Buettner, C., J. W. Harney, and M. J. Berry. 1999. The Caenorhabditis elegans homologue of thioredoxin reductase contains a selenocysteine insertion sequence (SECIS) element that differs from mammalian SECIS elements but directs selenocysteine incorporation. J. Biol. Chem. 274:21598-21602.
5. Copeland, P. R., J. E. Fletcher, B. A. Carlson, D. L. Hatfield, and D. M. Driscoll. 2000. A novel RNA binding protein, SBP2, is required for the translation of mammalian selenoprotein mRNAs. EMBO J. 19:306-314.[CrossRef][Medline]
6. Dunham, I., et al. 1999. The DNA sequence of human chromosome 22. Nature 402:489-495.[CrossRef][Medline]
7. Fagegaltier, D., N. Hubert, K. Yamada, T. Mizutani, P. Carbon, and A. Krol. 2000. Characterization of mSelB, a novel mammalian elongation factor for selenoprotein translation. EMBO J. 19:4796-4805.[CrossRef][Medline]
8.
Fagegaltier, D., A. Lescure, R. Walczak, P. Carbon, and A. Krol. 2000. Structural analysis of new local features in SECIS RNA hairpins. Nucleic Acids Res. 28:2679-2689.
9. Grundner-Culemann, E., G. W. Martin 3rd, J. W. Harney, and M. J. Berry. 1999. Two distinct SECIS structures capable of directing selenocysteine incorporation in eukaryotes. RNA 5:625-635.[Abstract]
10.
Hu, Y. J., K. V. Korotkov, R. Mehta, D. L. Hatfield, C. N. Rotimi, A. Luke, T. E. Prewitt, R. S. Cooper, W. Stock, E. E. Vokes, M. E. Dolan, V. N. Gladyshev, and A. M. Diamond. 2001. Distribution and functional consequences of nucleotide polymorphisms in the 3'-untranslated region of the human Sep15 gene. Cancer Res. 61:2307-2310.
11. Ilgoutz, S. C., K. A. Mullin, B. R. Southwell, and M. J. McConville. 1999. Glycosylphosphatidylinositol biosynthetic enzymes are localized to a stable tubular subcompartment of the endoplasmic reticulum in Leishmania mexicana. EMBO J. 18:3643-3654.[CrossRef][Medline]
12. Kok, L. W., T. Babia, K. Klappe, G. Egea, and D. Hoekstra. 1998. Ceramide transport from endoplasmic reticulum to Golgi apparatus is not vesicle-mediated. Biochem. J. 333:779-786.
13.
Korotkov, K. V., E. Kumaraswamy, Y. Zhou, D. L. Hatfield, and V. N. Gladyshev. 2001. Association between the 15 kDa selenoprotein and UDP-glucose:glycoprotein glucosyltransferase in the endoplasmic reticulum of mammalian cells. J. Biol. Chem. 276:15330-15336.
14.
Kryukov, G. V., V. M. Kryukov, and V. N. Gladyshev. 1999. New mammalian selenocysteine-containing proteins identified with an algorithm that searches for selenocysteine insertion sequence elements. J. Biol. Chem. 274:33888-33897.
15. Kryukov, G. V., and V. N. Gladyshev. 2000. Selenium metabolism in zebra fish: multiplicity of selenoprotein genes and expression of a protein containing 17 selenocysteine residues. Genes Cells 5:1049-1060.[Abstract]
16.
Kumaraswamy, E., A. Malykh, K. V. Korotkov, S. Kozyavkin, Y. Hu, S. Y. Kwon, M. E. Moustafa, B. A. Carlson, M. J. Berry, B. J. Lee, D. L. Hatfield, A. M. Diamond, and V. N. Gladyshev. 2000. Structure-expression relationships of the 15-kDa selenoprotein gene: possible role of the protein in cancer etiology. J. Biol. Chem. 275:35540-35547.
17. Lander, E. S., et al. 2001. Initial sequencing and analysis of the human genome. Nature 409:860-921.[CrossRef][Medline]
18.
Lescure, A., D. Gautheret, P. Carbon, and A. Krol. 1999. Novel selenoproteins identified in silico and in vivo by using a conserved RNA structural motif. J. Biol. Chem. 274:38147-38154.
19. Low, S. C., and M. J. Berry. 1996. Knowing when not to stop: selenocysteine incorporation in eukaryotes. Trends Biochem. Sci. 21:203-208.[CrossRef][Medline]
20. Martin, J. L. 1995. Thioredoxin-a fold for all reasons. Structure 3:245-250.[Medline]
21. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a laboratory manual, 2nd ed., p. 16.32-16.36. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
22. Tujebajeva, R. M., P. R. Copeland, X.-M. Xu, B. A. Carlson, J. W. Harney, D. M. Driscoll, D. L. Hatfield, and M. J. Berry. 2000. Decoding apparatus for eukaryotic selenocysteine insertion. EMBO Rep. 1:158-163.[CrossRef][Medline]
23. Venter, J. C., et al. 2001. The sequence of the human genome. Science 16:1304-1351.
24. Walczak, R., E. Westhof, P. Carbon, and A. Krol. 1996. A novel RNA structural motif in the selenocysteine insertion element of eukaryotic selenoprotein mRNAs. RNA 2:367-379.[Abstract]
This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| J. Bacteriol. | J. Virol. | Eukaryot. Cell |
|---|
| Microbiol. Mol. Biol. Rev. | Clin. Vaccine Immunol. | All ASM Journals |
|---|