Previous Article | Next Article ![]()
Molecular and Cellular Biology, August 2007, p. 5835-5848, Vol. 27, No. 16
0270-7306/07/$08.00+0 doi:10.1128/MCB.00363-07
Copyright © 2007, American Society for Microbiology. All Rights Reserved.
,
Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan, Republic of China,1 Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan, Republic of China,2 Institute of Bioinformatics, School of Medicine, National Yang-Ming University, Taipei, Taiwan, Republic of China3
Received 28 February 2007/ Returned for modification 4 April 2007/ Accepted 1 June 2007
|
|
|---|
|
|
|---|
Alternative splicing has recently emerged as a major mechanism of posttranscriptional regulation in the human genome and occurs in >60% of human genes (3); the high frequency of this alternative splicing in human is supported by expressed sequence tag (EST)-based database analysis (4, 16). Recently, Wen et al. (38) reported that very short alternative splicing in the human genome might alter protein structures and thus influence protein function. A subtle variation of transcripts may come from wobble splicing at the 5' splice site GTNGT or the 3' splice site NAGNAG sequence, thus creating single-amino-acid insertion and deletion (InDel) isoforms (22, 35). Recently, Hiller et al. (14) and Tadokoro et al. (34) also demonstrated genome-wide distribution of the NAGNAG sequence at the 3' splice site in human genes, which results in protein isoforms with one amino acid insertion or deletion. Such wobble splicing is predicted to exist in 30% of human genes and is active in at least 5% of genes, according to an EST database search. Although wobble splicing only subtly changes the protein sequence, it might increase the functional diversity of proteins; examples include PAX3, PAX7, and insulin-like growth factor receptor (10, 18, 37). Alternative splicing at the 5' or 3' tandem splice site may play important roles during the progression of diseases, as the reported cases include the human WT1 and ABCA4 genes (11, 26).
Alternative splicing involving a 5' or a 3' splice site choice can be regulated in different cell types or at different stages during development (24). However, wobble splicing at tandem NAGNAG motifs is independent of tissue type and may not be specifically regulated (34, 35). Although the NAGNAG motif is common in human genes, only a small number of the genes have been confirmed to generate wobble splicing isoforms (12, 34, 35). In most cases, one AG in the 3' tandem splice site is constitutively used, suggesting that the NAGNAG motif is not sufficient to determine wobble splicing sites. Therefore, the answer to how the 3' splice site is determined in wobble splicing remains elusive.
According to the linear scanning mechanism model (7, 9, 32, 33), the spliceosome recognizes the branch point and scans downstream for the first AG. Although the distance between two adjacent AGs is less than 6 nucleotides, the distal 3' splice site is primarily selected (9). A recent report indicates that several features of wobble splicing are different from those of constitutive splicing, such as high conservation of the intron sequence upstream of the tandem splice site and the abundance of cis elements near the tandem splice site (1). Because the regulation of NAGNAG wobble splicing has not been well characterized in the past, we experimentally addressed this question and demonstrated that the NAGNAG motif is not sufficient for determining wobble splicing and that the intronic sequence may make a significant contribution.
|
|
|---|
Plasmid constructs. The genomic DNA of ING4 exons 4 to 5 (exon 4-5) was amplified by PCR from the AZ-521 cell line genomic DNA (obtained from the ATCC) using the primer pair ING4-exon 4-F (forward) and ING4-exon 5-R (reverse). The amplified fragments were cloned into the pGEM-T easy vector (Promega, Madison, WI). Following cloning, several clones were randomly selected, and their sequences were determined by an autosequencer. Subsequently, the minigene construct was generated by subcloning the genomic DNA of ING4 exon 4-5 into the EcoRI site of the pEGFP-C1 vector (Clontech). The intron-exchanging minigene constructs were generated by overlapping PCR experiments as follows: the plasmid human minigene was used as a template for the first PCR with primers ING4-exon 4-F/ING4-exon 4-R or ING4-exon 5-F/ING4-exon 5-R, and these two PCR products were combined. The second PCR was performed using the mouse minigene as template and subcloned into the pGEM-T easy vector and pEGFP-C1 expression vector. The mutant constructs were generated by PCR with primers (see Table S1 in the supplemental material) containing the desired mutations using human and mouse minigenes as templates. The PCR fragments containing mutations were then subcloned into the pEGFP-C1 expression vector. The sequences of all the plasmids were verified using an autosequencer.
Splicing analysis in vivo. The minigene plasmids were introduced into human AZ-521 and mouse B16-F10 cells using Lipofectamine 2000 (Invitrogen) according to the manufacturer's specifications. Forty-eight hours after transfection, total RNA was extracted, and the wobble splicing products were analyzed by capillary electrophoresis (35) using primer pair EGFP-F/specific gene primers as described above (see Table S2 in the supplemental material).
Capillary electrophoresis analysis. PCRs were performed to amplify cDNA in a 20-µl mixture containing 10x PCR buffer, fluorescently labeled primers (6-carboxyfluorescein [FAM])/complementary primers, deoxynucleoside triphosphates, and Takara Taq DNA polymerase (Takara Shuzo Company, Shiga, Japan). PCR was conducted at 94°C for 5 min followed by 26 cycles at 94°C for 1 min, 58°C for 1 min, and 72°C for 1 min, and extended at 72°C for 10 min. The samples were then cooled at 4°C. One microliter of the PCR mixture was diluted to 10 µl with formamide (Applied Biosystems, Foster City, CA), and 1 µl of ROX 350 fluorescent size standard (Applied Biosystems) was added. The mixture was then denatured at 95°C for 5 min and cooled at 4°C. Amplified PCR products were separated on an ABI 3100-avant DNA analyzer using polymer 3100 POP4 (Applied Biosystems) and then quantified with GeneScan 3.7 software. Ratios of the wobble splicing isoforms was determined as the peak area for the individual isoform/total area for wobble splicing isoforms x 100.
Sequence validation of cDNA single nucleotide polymorphisms. Genomic DNA was prepared from 11 human cancer cell lines, including AGS, AZ521, KatoIII, NUGC, TSGH, Hep3B, HepG2, HEL299, huh7, LNCap, and HeLa. The splice junction fragments of the LAP1B, C14orf105, TLR3 and AP1GBP1 genes were amplified by PCR using genomic DNA as a template with individual primer pairs (LAP1B, LAP1B-1F/LAP1B-R; C14orf105, C140rf105-F/C14orf105-R; TLR3, TLR3-1F/TLR3-R; and AP1GBP1, AP1GBP1-F/AP1GBP1-R). Gene-specific PCR was conducted at 94°C for 5 min and then 35 cycles of 94°C for 20 s, 58°C for 30 s, and 72°C for 30 s, and a final extension phase at 72°C for 10 min using a PCR thermocycler and Takara Taq polymerase (Takara; Shiga, Japan). Following PCR amplification, the products were purified using a PCR clean-up kit (QIAGEN, Hilden, Germany) and subsequently sequenced by an autosequencer.
|
|
|---|
![]() View larger version (43K): [in a new window] |
FIG. 1. ING4 minigene splicing assay. (A) Schematic illustration of the pGFP-C1-ING4 minigene in which the ING4 genomic DNA fragment from exon 4 to exon 5 is placed downstream of green fluorescent protein (GFP) and is controlled by the cytomegalovirus (CMV) promoter. The PCR primers (EGFP-F/ING4-R) are indicated by arrows. Four different isoforms of ING4 (ING4GKKKG, ING4GKKS, ING4EG, and ING4G) are caused by alternative splicing at the 5' and 3' tandem motifs. (B) The 5' and 3' tandem splice sites were mutated in the human ING4 minigenes. After transfection into AZ521 cells, total RNA was isolated and analyzed by PCR and capillary electrophoresis. The splicing profiles of the wild-type (left panel), 5' GC(N)7GT GG(N)7GT (middle panel), and 3' TAGAAG TGGAAG (right panel) are shown by blue peaks. (C) A representative capillary electrophoresis scan shows the percentages of four ING4 wobble splicing isoforms (GKKKG, 182 bp; GKKS, 179 bp; EG, 173 bp; G, 170 bp) in human and mouse lung tissues (blue peaks). The reference peaks of the GS350 ROX internal size standard (Applied Biosystems) are indicated in red. (D) The ING4 splicing pattern in adult human tissues (brain, liver, lung, heart, and kidney) was compared to that in adult mouse tissues. The values are the means ± standard deviations obtained from three independent experiments (**, P < 0.01 using the Kruskal-Wallis test). (E) Alignment of the human and mouse genomic sequences of ING4 exon 4-5. Exon 4 is indicated by a dashed brown outline; exon 5 is indicated by a dashed blue outline. The putative branch point (AACCCAT) is indicated by an arrowhead. The 5' (GC(N)7GT) and 3' (TAGAAG) tandem splice sites are boxed with a solid green line and a red line, respectively.
|
|
View this table: [in a new window] |
TABLE 1. Summary of subtle wobble splicing that utilizes two successive 3' splice sites
|
To decipher cis elements involved in ING4 wobble splicing, we constructed human and mouse ING4 minigene vectors containing exon 4, intron 4, and exon 5 (H-H-H and M-M-M, respectively). Since the intron was less conserved than the exon (Fig. 1E), we made the hybrid minigene patterns H-M-H and M-H-M, in which the human and mouse introns were exchanged reciprocally (Fig. 2A). The wobble splicing pattern of M-H-M was similar to that of H-H-H and human endogenous ING4 transcripts (Fig. 2B and C). Moreover, the H-M-H minigene generated a splicing pattern that was similar to that of M-M-M and mouse endogenous ING4 transcripts (Fig. 2B and C). This suggests that the cis elements that regulate ING4 wobble splicing are primarily located in the intron. Nevertheless, the splicing pattern of these minigenes occurred irrespective of the cell line used (Fig. 2C, panels b and c), suggesting that the variant wobble splicing pattern is not due to variation of trans-acting factors.
![]() View larger version (28K): [in a new window] |
FIG. 2. The splicing assay using the hybrid ING4 minigenes. (A) Schematic illustration of the human (H-H-H), mouse (M-M-M), and hybrid ING4 minigenes (H-M-H, hExon 4-mIntron 4-hExon 5; M-H-M, mExon4-hIntron 4-mExon 5). (B) Panels a and b show the distribution of the ING4 wobble splicing isoforms in AZ521 and B16-F10 cells. (c to f) Transient transfection of human AZ521 cells was performed using the ING4 minigenes. (C) Transient transfection of human AZ521 or mouse B16-F10 cells was performed using the ING4 minigenes. The relative expression level of the ING4 four isoforms was calculated using GeneScan 3.7. The values are the average of two independent experiments. The black and gray letters represent the human and mouse experiments, respectively.
|
GG(N)7GT] abolished 5' alternative splicing, and therefore wobble splicing occurred only at the 3' splice site, generating two transcripts by wobble selection of the 3' proximal or distal AG (Fig. 1B). Such 5' splice site mutation minigenes allow us to better reveal the 3' tandem NAGNAG site choice patterns. Transient expression of the H-H-H minigene in AZ521 cells showed that the H-H-H minigene with the mutated 5' splice site used the proximal (45.1%) and distal AG (54.9%) sites equally, whereas the mutant M-M-M minigene preferentially selected the distal AG (88.1%) (Fig. 3A and Table 2). Consistent with what was observed for the endogenous splicing pattern, the result obtained from the minigenes suggested that the 3' wobble splicing of ING4 differs between that of human and that of mouse. The hybrid minigene M-H-M with a 5' splice site mutation showed a somewhat similar splicing pattern to that of the human (H-H-H) minigene construct, that is,
40% proximal AG and
60% distal AG usage. However, when the intron of the H-H-H minigene was replaced by the mouse sequence, the distal AG was preferred (Fig. 3A, H-M-H, and Table 2). According to the above-described result, the intron sequence may play a decisive role in 3' splice site selection for wobble splicing.
![]() View larger version (35K): [in a new window] |
FIG. 3. The 3' NAGNAG wobble splicing of the ING4 and NM_015179 minigenes is regulated by cis elements. (A) Four ING4 minigenes with 5' splice site mutations (H-H-H, M-M-M, M-H-M, and H-M-H) were each transfected into AZ-521 cells for splicing analysis. A representative splicing profile for the 5' mutant minigenes is shown, and the percentage of splicing resulting from distal (arrowhead) and proximal AG (arrow) is shown at the top of each peak. (B and C) The splicing pattern profiles of ING4 (exon 5-6) and NM_015179 (exon 32-33) were analyzed with different cell lines (AZ521, Huh7, and MCF7). (D and E) Schematic illustrations of the ING4 exon 4-5 (E4-I4-E5) and exon 5-6 (E5-I5-E6) and the NM_015179 (E32-I32-E33) minigenes and their fusion constructs (ING4-E5-I4-E6 and NM_015179-E32-I4-E33). The minigene constructs were introduced into AZ521 cells, and the splicing profile was analyzed by capillary electrophoresis. The arrowheads and arrows in each panel indicate the use of distal and proximal AG, respectively. The percentage of wobble splicing isoforms is shown at the top of each panel. A representative data set is shown; similar results were obtained from two independent experiments.
|
|
View this table: [in a new window] |
TABLE 2. Swap test of the wobble splicing intron and its bilateral ING4 exons in cells
|
CAGCAG) in both the I32 and the I4 intron, utilization of the proximal AG was considerably enhanced (Fig. 3E, panels d and e). These observations indicate that both the NAGNAG motif and the intron sequence contribute to wobble splicing. The sequence between the BPS and the NAGNAG motif affects 3' splice site selection. To further confirm that the intron sequence is an important factor in wobble splicing, we analyzed EST databases to evaluate additional cases of wobble splicing for which the splicing pattern is clearly different between that of human and that of mouse. We selected five genes (SIPA1L1 [NP_056371], MAPK8IP3 [NP_055948], RAGE [NP_055041], ARID1A [NP_060920], and BRD7 [NP_037395]) that showed possible species-specific expression patterns and another five genes (TCF12 [NP_003196], SMARCA4 [NP_003063], FUBP1 [NP_003893], VASP [NP_003361], and PUM1 [NP_055491]) with no noted difference by EST comparison (Table 3). Using capillary electrophoresis, we confirmed that five genes have species-specific wobble splicing and three genes do not significantly differ among species (Fig. 4A and B). In all of these cases, the exons flanking the wobble splice junction are highly conserved (>90%), whereas the intron sequences are less conserved. Thus, this result indicated that the intron sequence could indeed influence NAGNAG-based wobble splicing.
|
View this table: [in a new window] |
TABLE 3. Sequence conservation between human and mouse within 10 wobble splicing intron-exon sets
|
![]() View larger version (28K): [in a new window] |
FIG. 4. Analysis of 3' wobble splicing of selected human and mouse genes. (A) Analysis of 3' wobble splicing of eight genes (NP_056371, NP_055948, NP_055041, NP_037395, NP_060920, NP_003196, NP_003063, and NP_003893) in human and mouse lung tissue, using capillary electrophoresis. The relative percentages of the two isoforms were calculated using GeneScan 3.7 (B). (C) Schematic illustration of the human SIPA1L1 (exon 20-intron 20-exon 21) (a) and human RAGE (exon 6-intron 6-exon 7) (b) minigenes and their fusion constructs (human-mouse hybrid minigene). The minigene constructs were introduced into AZ521 cells, and the splicing profiles were analyzed by capillary electrophoresis (lower panels). Arrowheads and arrows indicate the use of distal and proximal AG, respectively. One representative data set is shown for the use of the distal and proximal AG, respectively. Stars indicate statistical differences in the splicing patterns between human and mouse.
|
60 bp) upstream of the NAGNAG sequence was relatively poorly conserved (Table 3). To further examine the intronic sequence 60 bp upstream of the tandem NAGNAG motif, we constructed the human-mouse hybrid SIPA1L1 minigene in which this intronic region of human SIPA1L1 was replaced by the mouse equivalent sequence. As shown in Fig. 4C (panel a), the preferential AG usage was shifted from proximal to distal in this hybrid minigene. Similarly, the distal AG usage was considerably suppressed by replacing the 60-bp intronic sequence of the RAGE gene with that of the mouse (Fig. 4C, panel b). These results further demonstrated that the intronic 60-bp sequence is indeed involved in determining 3' splice site NAGNAG usage. Therefore, we focused further on the intronic region between the BPS and the NAGNAG site to study its roles in wobble splicing regulation. The PPT and BPS could affect 3' site selection. The above data indicated that the intronic sequence may play an important role in NAGNAG 3' splice site choice, especially the 60 bp preceding the tandem motif (Fig. 3 and Fig. 4). A previous report has indicated that the average distance between the BPSand the 3' splice site is 33 to 34 bp (19). We thus propose that the region between the BPS and the NAGNAG site is a key factor for 3' wobble splicing regulation. To determine whether the sequence between the BPS and the NAGNAG motif plays a role in 3' splice site choice, this region of human ING4 intron 4 was replaced with that of the mouse equivalent sequence (H-Hm-H). As shown in Fig. 5, the splicing pattern of the H-Hm-H minigene (proximal, 21.7%; distal, 78.3%), which contains the mouse BPS-to-NAGNAG region, exhibited a pattern similar to that of the mouse M-M-M minigene (Fig. 3A and Table 2). However, the M-Mh-M minigene significantly increased the level of the proximal AG isoforms (36.1%) (Fig. 5 and Table 2). Insertion of three nucleotides (three T nucleotides) into the mouse BPS-to-NAGNAG region to conform the length of human ING4 intron 4 changed the splicing pattern only slightly (Fig. 5B and Table 2). Similarly, removal of three nucleotides (three T nucleotides) did not significantly change the splicing. This result suggests that the length between the BPS and the NAGNAG region is not a key factor mediating ING4 wobble splicing. Furthermore, when we randomly mutated the BPS-to-NAGNAG region of the H-H-H minigene, 3' NAGNAG wobble splicing was only slightly affected (Table 2). Because the BPS sequence is conserved between human and mouse, the PPT sequence may contain putative cis elements for 3' tandem splice site selection.
![]() View larger version (33K): [in a new window] |
FIG. 5. Sequence of PPT within the BPS-to-NAGNAG region mediates 3' NAGNAG wobble splicing. (A) Schematic illustration of the ING4 minigenes containing the mutated 5' wobble splice site and the modified sequence at the 3' end of intron 4, including M-Mh-M, H-Hm-H, M-M(+3nt)-M, M-M(–3nt)-M (where nt is nucleotide), and four random mutant constructs (9c/g, 13t/a, 15t/g, and 1c/g 15t/g). The positions of random mutant constructs are underlined (lower panel). Constructs were transfected into AZ-521 cells. (B) A representative expression profile of the minigenes is shown.
|
AACCCGT) was mutated, only a minor switch of the 3' splice site from the distal to proximal AG was observed (Fig. 6D). The PPT sequence seems to be more important in determining splice site usage than the BPS in ING4. Our result indicates that the sequence of the BPS plays an important role in regulating 3' NAGNAG-based wobble splicing in the cases we examined.
![]() View larger version (42K): [in a new window] |
FIG. 6. Influence of the intronic sequence on 3' NAGNAG wobble splicing. (A) Alignment of the human, mouse and rat ARID1A (NP_060920) intron 12 sequence near the 3' GAGCAG tandem motif. The putative BPS is predicted using the BPS predictive program (http://ast.bioinfo.tau.ac.il/). The BPS region is underlined, and the putative branch point adenosine is marked in italics. The 3' tandem splice site is marked by bold letters. (B) Splicing pattern profiles of the human, mouse, and rat minigenes containing the genomic fragment of NP_060920 exon 12-13. The splicing assay of the human, mouse, and rat were performed using capillary electrophoresis as shown in Fig. 1. The endogenous transcripts are shown in the upper panels, and the minigenes are shown in the lower panels. (C) The splicing assay was performed using the human-derived sequences shown in the upper panels and the rat-derived sequences shown in the lower panels. (D) Minigenes of ING4, SIPA1L1, and RAGE and its putative branch point A mutant constructs were analyzed by capillary electrophoresis. The usage percent of distal and proximal AG, respectively, is indicated at the top of each panel.
|
60-bp region). It is noteworthy that 9 of the 127 SNPs were located within the NAGNAG motif and 78 were found in the intron sequences near NAGNAG (see Table S4 in the supplemental material). Next, we selected four samples from the above-described 127 SNPs to examine their effect on wobble splicing. The LAP1B gene (NP_056417) possesses a 3' tandem splice site, TAGCAG, which results in one amino acid insertion or deletion in the encoded protein. Our reverse transcription (RT)-PCR analysis of four human cell lines showed that the SNPs (rs2245425) at the third position of the proximal 3' splice site almost completely destroyed wobble splicing, particularly in homozygotic alleles with the TAACAG sequence (Fig. 7A). We obtained a similar phenomenon with the SNP (rs1152522) at a second position of the proximal 3' splice site of the C14orf105 gene (NP_060638) which also completely abolished wobble splicing (Fig. 7B).
![]() ![]() View larger version (81K): [in a new window] |
FIG. 7. The influence of SNPs on 3' NAGNAG wobble splicing. (A) The SNPs (rs2245425) were identified at the TAGCAG motif in the boundary between exon 2 and exon 3 of the human LAP1B gene (NP_056417). A representative splicing profile of LAP1B with different cell lines (AZ521, KatoIII, TSGH, and Hep3B) is shown (upper panel). (B) The splicing pattern of the C14orf105 (NP_060638) gene with different cells (AGS, HepG2, Huh7, and Hep3B) was analyzed. (C and D) The SNP occurred at the N nucleotide of the tandem motif. The human Toll-like receptor 3 (NP_003256) and AP1GBP1 (NP_542117) genes were analyzed in the four differently genotyped cell lines. Green arrows indicate the position of the SNP at the NAGNAG tandem motif. The percentage of wobble splicing isoforms is shown at the top of each panel. The arrowheads and arrows represent usage of the distal and proximal AG, respectively. The genomic sequence of the wobble splice sites was identified by an autosequencer. The black arrowhead indicates the splice junction (lower panel).
|
CAGGAG) completely abolished the distal AG choice (Fig. 7D). These results are consistent with our above-described findings that the NAGNAG motif influences wobble splicing. |
|
|---|
Infrequent use of GAG as the 3' splice site was coincident with its rarity in the 3' tandem splice site motif (1.4% GAGNAG and 6.1% NAGGAG among 441 cases) (see Table S3 in the supplemental material). Perhaps inefficient splicing at the GAG site has driven its loss during evolution, resulting in its scarcity in modern genomes. However, an exception observed here is the human ARID1A gene, in which GAG could compete for the use of CAG as the 3' splice site (Fig. 6B and C). Moreover, we observed that the distal AG was preferred in the YAGAAG and the duplicate (NAGN' AG; N = N') tandem motifs of the human ING4 exon 4-5 boundary. In other words, C/UAG may lose its priority when located upstream of AAG, at least in human ING4 (Table 1). Interestingly, human and mouse ING4 intron 4 showed different preferences in the use of the UAGAAG tandem site, that is, AAG was more preferred in mouse than in human (Fig. 3A); such a difference is probably due to the intronic cis elements that differ between the two species. Moreover, the intron strength may also explain the use of inert GAG in 3' wobble splicing of the minigenes with either the NAGGAG or the GAGNAG sequence (Table 1, ING4 minigene containing the AAGGAG tandem splice site and Fig. 6B, human ARID1A minigene containing the GAGCAG tandem motif). Nevertheless, future studies are required to elucidate the detailed mechanism.
Our observation suggested that the intronic sequence immediately upstream of the NAGNAG motif might mediate the selection of the 3' tandem splice site. Akerman and Mandel-Gutfreund (1) compared constitutive and alternative splicing at the 3'-NAGNAG acceptor by using a bioinformatics approach and concluded that tandem splice site-containing genes possess a relatively conserved intronic sequence upstream of 3'-NAGNAG. This result suggests an abundance of cis elements nearby the 3' tandem splice site. Zavolan et al. (40) suggest that the wobble splicing is the result of stochastic binding of the spliceosome at neighboring splice sites. Based on this hypothesis, Chern et al. recently developed a simple physical model that could predict whether splicing occurs only at one site or at two alternative sites at the tandem splice site (8). In our study, except for the 3' tandem splice site and its adjacent intronic sequence governing splice site selection, we demonstrated that exonic sequences can also determine splice site selection. Thus, our results agree that wobble splice site recognition may involve thermodynamic interactions between various cis elements.
Exonic SNPs have a significant effect on protein function and impact cell physiology and disease progression. However, there is increasing evidence that both exonic and intronic SNPs affect pre-mRNA splicing, which could alter gene expression patterns and expand protein diversity (6, 25). Moreover, wobble splicing may particularly cause insertion or deletion of one amino acid in proteins. For example, an SNP in the ABCR gene (2588G
C) is frequently found in patients with Stargardt disease 1 (STGD1), resulting in an active TAGC2588AG motif, which generates two wobble splicing isoforms (26). In this study, we identified nine SNPs in the NAGNAG 3' tandem splice site and demonstrated that SNPs interfered with wobble splicing in LAP1B and Toll-like receptor 3 (see Table S4 in the supplemental material). Using a genome-wide screen, Hiller et al. (13) recently identified 121 SNPs at the 3' tandem splice site and found that 64 SNPs may affect alternative NAGNAG splicing, of which 18 are associated with known disease genes. Experiments are still required to confirm that these SNP sites are involved in wobble splicing regulation. Since our data indicated that the region between the BPS and the PPT is also important for wobble splicing (Fig. 5, Fig. 6, and Table 2), SNPs in this region may cause an imbalance between the wobble mRNA isoforms. Accordingly, we identified SNPs within the BPS and the PPT (8 SNPs in the BPS and 29 SNPs in the PPT) that might affect NAGNAG-based wobble splicing; this finding needs to be examined by future experiments (see Table S4 in the supplemental material).
The 3' wobble splicing at the NAGNAG tandem motif occurs with a higher frequency than 5' GTNGT splicing, since the 3' end of introns has a more intricate set of regulatory elements. However, the exact mechanism of the 3' NAGNAG-based wobble splicing is not well understood. Using a prototypical example of 3' wobble splicing, ING4, we have illustrated the important region between the BPS and the tandem motif. In mammalian introns, both the BPS and PPT are essential for splicing (2, 39). Mutations or SNPs within these elements disturb splicing or induce alternative splice site utilization (21, 29, 31). A recent genome-wide study shows that mutations in putative BPSs cause a shift from constitutive to alternative splicing and alter the exon inclusion/skipping ratio (19). Our present study shows that mutations at a putative branch point of ARID1A, SIPA1L1, and RAGE had no apparent effect on splicing efficiency (data not shown) but affected wobble splicing (Fig. 6C and D), suggesting that the BPS may also play a role in determining 3' splice site selection. Selection of the 3' splice site depends on both the BPS-to-AG distance and the distance between the two adjacent AGs (9). Therefore, the BPS mutations may activate an aberrant branch point and thus alter the BPS-to-AG distance, changing the wobble splicing pattern. However, the minor change of distance (4 bp) between the new BPS (ACTTAAC) and the NAGNAG sequence did not alter AG selection. Therefore, our results suggest that the PPT sequence is more important in determining the 3' splice site usage of ING4 than the BPS-to-NAGNAG distance (Fig. 5 and Fig. 6D). We considered that the BPS-to-NAGNAG distance and PPT strength may serve as major factors to determine 3' NAGNAG wobble splicing. However, this hypothesis needs to be examined further.
In summary, this study provides experimental evidence showing that intronic cis elements, particularly in the region between the BPS and the NAGNAG 3' splice site, play an important role in the 3' tandem splice site selection. Moreover, SNPs within these regions may directly affect 3' splice site selection and thus alter the mRNA isoform pattern generated by wobble splicing.
This study was supported in part by grants from the Academia Sinica and the National Science Council, Taiwan, Republic of China (94-2311-B001-033 and 95-2311-B-001-012, respectively).
Published ahead of print on 11 June 2007. ![]()
Supplemental material for this article may be found at http://mcb.asm.org/. ![]()
|
|
|---|
C mutation in the ABCR gene is a mild frequent founder mutation in the Western European population and allows the classification of ABCR mutations in patients with Stargardt disease. Am. J. Hum. Genet. 64:1024-1035.[CrossRef][Medline]This article has been cited by other articles:
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2009 by the American Society for Microbiology. For an alternate route to Journals.ASM.org, visit: http://intl-journals.asm.org | More Info»