Intronic Alus Influence Alternative Splicing
Examination of the human transcriptome reveals higher levels of RNA editing
than in any other organism tested to date. This is indicative of extensive
double-stranded RNA (dsRNA) formation within the human transcriptome. Most of
the editing sites are located in the primate-specific retrotransposed element
called Alu. A large fraction of Alus are found in intronic sequences, implying
extensive Alu-Alu dsRNA formation in mRNA precursors. Yet, the effect of these
intronic Alus on splicing of the flanking exons is largely unknown. Here, we
show that more Alus flank alternatively spliced exons than constitutively
spliced ones; this is especially notable for those exons that have changed
their mode of splicing from constitutive to alternative during human evolution.
This implies that Alu insertions may change the mode of splicing of the
flanking exons. Indeed, we demonstrate experimentally that two Alu elements
that were inserted into an intron in opposite orientation undergo base-pairing,
as evidenced by RNA editing, and affect the splicing patterns of a downstream
exon, shifting it from constitutive to alternative. Our results indicate the
importance of intronic Alus in influencing the splicing of flanking exons,
further emphasizing the role of Alus in shaping the human transcriptome
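The base-pairing of two oppositely oriented copies of an element can be illustrated with a minimal sketch. It assumes a perfect reverse-complement match, whereas real Alu pairs align imperfectly; the sequence and the helper names `reverse_complement` and `can_pair` are hypothetical and not taken from the study.

```python
# Toy illustration: a repeat inserted in sense and antisense orientation
# within one transcript can fold back into a double-stranded stem.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def can_pair(a, b):
    """True if b is the exact reverse complement of a (perfect duplex).

    This is a simplifying assumption; real inverted Alus pair imperfectly.
    """
    return reverse_complement(a) == b.upper()


# Arbitrary placeholder sequence standing in for a repeat element.
element = "GGCTCACGCCTGTA"

# Hypothetical intron carrying the element in both orientations.
intron = "AAAA" + element + "TTTTTTTT" + reverse_complement(element) + "CCCC"

sense = intron[4:4 + len(element)]
antisense = intron[4 + len(element) + 8:-4]
print(can_pair(sense, antisense))
```

Two copies in the same orientation would not satisfy `can_pair`, which is why the opposite orientation matters for dsRNA formation.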
A phylogenetic generalized hidden Markov model for predicting alternatively spliced exons
BACKGROUND: An important challenge in eukaryotic gene prediction is accurate identification of alternatively spliced exons. Functional transcripts can go undetected in gene expression studies when alternative splicing only occurs under specific biological conditions. Non-expression based computational methods support identification of rarely expressed transcripts. RESULTS: A non-expression based statistical method is presented to annotate alternatively spliced exons using a single genome sequence and evidence from cross-species sequence conservation. The computational method is implemented in the program ExAlt and an analysis of prediction accuracy is given for Drosophila melanogaster. CONCLUSION: ExAlt identifies the structure of most alternatively spliced exons in the test set and cross-species sequence conservation is shown to improve the precision of predictions. The software package is available to run on Drosophila genomes to search for new cases of alternative splicing
Muscleblind-Like 1 Knockout Mice Reveal Novel Splicing Defects in the Myotonic Dystrophy Brain
Myotonic dystrophy type 1 (DM1) is a multi-systemic disorder caused by a CTG trinucleotide repeat expansion (CTGexp) in the DMPK gene. In skeletal muscle, nuclear sequestration of the alternative splicing factor muscleblind-like 1 (MBNL1) explains the majority of the alternative splicing defects observed in the HSALR transgenic mouse model which expresses a pathogenic range CTGexp. In the present study, we addressed the possibility that MBNL1 sequestration by CUGexp RNA also contributes to splicing defects in the mammalian brain. We examined RNA from the brains of homozygous Mbnl1ΔE3/ΔE3 knockout mice using splicing-sensitive microarrays. We used RT-PCR to validate a subset of alternative cassette exons identified by microarray analysis with brain tissues from Mbnl1ΔE3/ΔE3 knockout mice and post-mortem DM1 patients. Surprisingly, splicing-sensitive microarray analysis of Mbnl1ΔE3/ΔE3 brains yielded only 14 candidates for mis-spliced exons. While we confirmed that several of these splicing events are perturbed in both Mbnl1 knockout and DM1 brains, the extent of splicing mis-regulation in the mouse model was significantly less than observed in DM1. Additionally, several alternative exons, including Grin1 exon 4, App exon 7 and Mapt exons 3 and 9, which have previously been reported to be aberrantly spliced in human DM1 brain, were spliced normally in the Mbnl1 knockout brain. The sequestration of MBNL1 by CUGexp RNA results in some of the aberrant splicing events in the DM1 brain. However, we conclude that other factors, possibly other MBNL proteins, likely contribute to splicing mis-regulation in the DM1 brain
A General Definition and Nomenclature for Alternative Splicing Events
Understanding the molecular mechanisms responsible for the regulation of the transcriptome present in eukaryotic cells is one of the most challenging tasks in the postgenomic era. In this regard, alternative splicing (AS) is a key phenomenon contributing to the production of different mature transcripts from the same primary RNA sequence. As a plethora of different transcript forms is available in databases, a first step to uncover the biology that drives AS is to identify the different types of reflected splicing variation. In this work, we present a general definition of the AS event along with a notation system that involves the relative positions of the splice sites. This nomenclature univocally and dynamically assigns a specific “AS code” to every possible pattern of splicing variation. On the basis of this definition and the corresponding codes, we have developed a computational tool (AStalavista) that automatically characterizes the complete landscape of AS events in a given transcript annotation of a genome, thus providing a platform to investigate the transcriptome diversity across genes, chromosomes, and species. Our analysis reveals that a substantial part—in human more than a quarter—of the observed splicing variations are ignored in common classification pipelines. We have used AStalavista to investigate and to compare the AS landscape of different reference annotation sets in human and in other metazoan species and found that proportions of AS events change substantially depending on the annotation protocol, species-specific attributes, and coding constraints acting on the transcripts. The AStalavista system therefore provides a general framework to conduct specific studies investigating the occurrence, impact, and regulation of AS
Alternative Splicing at a NAGNAG Acceptor Site as a Novel Phenotype Modifier
Approximately 30% of alleles causing genetic disorders generate premature termination codons (PTCs), which are usually associated with severe phenotypes. However, bypassing the deleterious stop codon can lead to a mild disease outcome. Splicing at NAGNAG tandem splice sites has been reported to result in insertion or deletion (indel) of three nucleotides. We identified such a mechanism as the origin of the mild to asymptomatic phenotype observed in cystic fibrosis patients homozygous for the E831X mutation (2623G>T) in the CFTR gene. Analyses performed on nasal epithelial cell mRNA detected three distinct isoforms, a considerably more complex situation than expected for a single nucleotide substitution. Structure-function studies and in silico analyses provided the first experimental evidence of an indel of a stop codon by alternative splicing at a NAGNAG acceptor site. In addition to contributing to proteome plasticity, alternative splicing at a NAGNAG tandem site can thus remove a disease-causing UAG stop codon. This molecular study reveals a naturally occurring mechanism where the effect of either modifier genes or epigenetic factors could be suspected. This finding is of importance for genetic counseling as well as for deciding appropriate therapeutic strategies
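The three-nucleotide indel at a NAGNAG tandem acceptor can be sketched as follows. This is a toy example under stated assumptions: the sequence is invented, not the CFTR sequence, and the helper names `find_nagnag` and `acceptor_choices` are hypothetical.

```python
import re

# Lookahead pattern so that overlapping NAGNAG motifs are all reported
# (N = any nucleotide).
NAGNAG = re.compile(r"(?=([ACGT]AG[ACGT]AG))")


def find_nagnag(seq):
    """Return (0-based start, motif) for every NAGNAG occurrence in seq."""
    seq = seq.upper().replace("U", "T")
    return [(m.start(), m.group(1)) for m in NAGNAG.finditer(seq)]


def acceptor_choices(seq, start):
    """Return the downstream sequence for each of the two AG acceptors.

    Splicing after the first AG keeps the second NAG in the exon;
    splicing after the second AG removes those three nucleotides.
    """
    seq = seq.upper().replace("U", "T")
    with_nag = seq[start + 3:]
    without_nag = seq[start + 6:]
    return with_nag, without_nag


# Invented intron/exon boundary in which the second NAG is TAG (UAG in mRNA).
toy = "TTTCTCTTTCAGTAGGTACCA"

for start, motif in find_nagnag(toy):
    long_form, short_form = acceptor_choices(toy, start)
    print(start, motif, long_form, short_form)
```

In this toy sequence the isoform spliced at the first AG begins with TAG, while the isoform spliced at the second AG omits it, which mirrors how acceptor choice can include or remove a stop codon.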
Width of Gene Expression Profile Drives Alternative Splicing
Alternative splicing generates an enormous amount of functional and proteomic diversity in metazoan organisms. This process is probably central to the macromolecular and cellular complexity of higher eukaryotes. While most studies have focused on the molecular mechanism triggering and controlling alternative splicing, as well as on its incidence in different species, its maintenance and evolution within populations have been little investigated. Here, we propose to address these questions by comparing the structural characteristics as well as the functional and transcriptional profiles of genes with monomorphic or polymorphic splicing, referred to as MS and PS genes, respectively. We find that MS and PS genes differ particularly in the number of tissues and cell types where they are expressed. We find a striking deficit of PS genes on the sex chromosomes, particularly on the Y chromosome where it is shown not to be due to the observed lower breadth of expression of genes on that chromosome. The development of a simple model of evolution of cis-regulated alternative splicing leads to predictions in agreement with these observations. It further predicts the conditions for the emergence and the maintenance of cis-regulated alternative splicing, which are both favored by the tissue-specific expression of splicing variants. We finally propose that the width of the gene expression profile is an essential factor for the acquisition of new transcript isoforms that could later be maintained by a new form of balancing selection
A Novel Conserved Isoform of the Ubiquitin Ligase UFD2a/UBE4B Is Expressed Exclusively in Mature Striated Muscle Cells
Yeast Ufd2p was the first identified E4 multiubiquitin chain assembly factor. Its vertebrate homologues, later referred to as UFD2a, UBE4B or E4B, were also shown to have E3 ubiquitin ligase activity. UFD2a function in the brain has been well established in vivo, and in vitro studies have shown that its activity is essential for proper condensation and segregation of chromosomes during mitosis. Here we show that 2 alternative splice forms of UFD2a, UFD2a-7 and -7/7a, are expressed sequentially during myoblast differentiation of C2C12 cell cultures and during cardiotoxin-induced regeneration of skeletal muscle in mice. UFD2a-7 contains an alternate exon 7, and UFD2a-7/7a, the larger of the 2 isoforms, contains an additional novel exon 7a. Analysis of protein or mRNA expression in mice and zebrafish revealed that a similar pattern of isoform switching occurs during developmental myogenesis of cardiac and skeletal muscle. In vertebrates (humans, rodents, zebrafish), UFD2a-7/7a is expressed only in mature striated muscle. This unique tissue specificity is further validated by the conserved presence of 2 muscle-specific splicing regulatory motifs located in the 3′ introns of exons 7 and 7a. UFD2a interacts with VCP/p97, an AAA-type ATPase implicated in processes whose functions appear to be regulated, in part, through their interaction with one or more of 15 previously identified cofactors. UFD2a-7/7a did not interact with VCP/p97 in yeast 2-hybrid experiments, which may allow the ATPase to bind cofactors that facilitate its muscle-specific functions. We conclude that the regulated expression of these UFD2a isoforms most likely imparts divergent functions that are important for myogenesis
Predicting Functional Alternative Splicing by Measuring RNA Selection Pressure from Multigenome Alignments
High-throughput methods such as EST sequencing, microarrays and deep sequencing have identified large numbers of alternative splicing (AS) events, but studies have shown that only a subset of these may be functional. Here we report a sensitive bioinformatics approach that identifies exons with evidence of a strong RNA selection pressure ratio (RSPR), i.e., evolutionary selection against mutations that change only the mRNA sequence while leaving the protein sequence unchanged, measured across an entire evolutionary family, which greatly amplifies its predictive power. Using the UCSC 28 vertebrate genome alignment, this approach correctly predicted half to three-quarters of AS exons that are known binding targets of the NOVA splicing regulatory factor, and predicted 345 strongly selected alternative splicing events in human, and 262 in mouse. These predictions were strongly validated by several experimental criteria of functional AS such as independent detection of the same AS event in other species, reading frame-preservation, and experimental evidence of tissue-specific regulation: 75% (15/20) of a sample of high-RSPR exons displayed tissue-specific regulation in a panel of ten tissues, vs. only 20% (4/20) among a sample of low-RSPR exons. These data suggest that RSPR can identify exons with functionally important splicing regulation, and provide biologists with a dataset of over 600 such exons. We present several case studies, including both well-studied examples (GRIN1) and novel examples (EXOC7). These data also show that RSPR strongly outperforms other approaches such as standard sequence conservation (which fails to distinguish amino acid selection pressure from RNA selection pressure), or pairwise genome comparison (which lacks adequate statistical power for predicting individual exons)
Identification and Classification of Conserved RNA Secondary Structures in the Human Genome
The discoveries of microRNAs and riboswitches, among others, have shown functional RNAs to be biologically more important and genomically more prevalent than previously anticipated. We have developed a general comparative genomics method based on phylogenetic stochastic context-free grammars for identifying functional RNAs encoded in the human genome and used it to survey an eight-way genome-wide alignment of the human, chimpanzee, mouse, rat, dog, chicken, zebra-fish, and puffer-fish genomes for deeply conserved functional RNAs. At a loose threshold for acceptance, this search resulted in a set of 48,479 candidate RNA structures. This screen finds a large number of known functional RNAs, including 195 miRNAs, 62 histone 3′UTR stem loops, and various types of known genetic recoding elements. Among the highest-scoring new predictions are 169 new miRNA candidates, as well as new candidate selenocysteine insertion sites, RNA editing hairpins, RNAs involved in transcript auto regulation, and many folds that form singletons or small functional RNA families of completely unknown function. While the rate of false positives in the overall set is difficult to estimate and is likely to be substantial, the results nevertheless provide evidence for many new human functional RNAs and present specific predictions to facilitate their further characterization
