136 research outputs found

    Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

    Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. 
    Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

    Fanconi Anemia Caused by Biallelic Inactivation of BRCA2 Can Present With an Atypical Cancer Phenotype in Adulthood

    Inherited biallelic pathogenic variants (PVs) in BRCA2 cause Fanconi Anemia complementation group D1 (FA-D1), a severe pediatric bone marrow failure and high-risk cancer syndrome. We identified biallelic BRCA2 PVs in a young adult with multiple basal cell carcinomas, adult-onset colorectal cancer and small cell neuroendocrine carcinoma, without bone marrow failure. No PVs were identified in any other known cancer susceptibility gene, and there was no evidence of reversion mosaicism. The proband's deceased sister had a classic FA-D1 presentation and was shown to carry the same biallelic BRCA2 PVs. A lymphoblastoid cell line derived from the proband demonstrated hypersensitivity to DNA-damaging agents, and bone marrow showed aberrant RAD51 staining. Family expansion demonstrated the presence of BRCA2-related cancers in heterozygous family members. Our data highlight the striking phenotypic differences which can be observed within FA-D1 families and expand the clinical spectrum of FA-D1 to include adult presentation with a constellation of solid tumors not previously thought of as characteristic of Fanconi Anemia. Early recognition of this syndrome in a family could prevent further morbidity and mortality by implementation of hereditary breast and ovarian cancer screening and treatment strategies for heterozygous family members

    Alternative splicing of the mouse embryonic poly(A) binding protein (Epab) mRNA is regulated by an exonic splicing enhancer: a model for post-transcriptional control of gene expression in the oocyte

    Embryonic poly(A) binding protein (EPAB), expressed in oocytes and early embryos, binds and stabilizes maternal mRNAs, and mediates initiation of their translation. We identified an alternatively spliced form of Epab lacking exon 10 (c.Ex10del) and investigated the regulation of Epab mRNA alternative splicing as a model for alternative splicing in oocytes and early preimplantation embryos. Specifically, we evaluated the following mechanisms: imprinting; RNA editing and exonic splicing enhancers (ESEs). Sequence analysis led to the identification of two single nucleotide polymorphisms (SNPs): one was detected in exon 9 (rs55858A/G), and served as a marker for the parental origin of the alternatively spliced form, and the other was found in exon 10 (rs56574G/C), and co-segregated with the exon 9 SNP. We found that the presence of rs56574G in exon 10 led to the formation of an ESE, leading to efficient exclusion of exon 10. Real-time RT–PCR results revealed a 5-fold increase in the expression of the c.Ex10del alternative splicing variant in animals carrying rs56574G/G in exon 10 compared with rs56574C/C at the same locus. Our findings suggest that SNPs may alter the ratio between alternative splicing variants of oocyte-specific proteins. The role that these subtle differences play in determining individual reproductive outcome remains to be determined

    Does Selection against Transcriptional Interference Shape Retroelement-Free Regions in Mammalian Genomes?

    BACKGROUND: Eukaryotic genomes are scattered with retroelements that proliferate through retrotransposition. Although retroelements make up around 40 percent of the human genome, large regions are found to be completely devoid of retroelements. This has been hypothesised to be a result of genomic regions being intolerant to insertions of retroelements. The inadvertent transcriptional activity of retroelements may affect neighbouring genes, which in turn could be detrimental to an organism. We speculate that such retroelement transcription, or transcriptional interference, is a contributing factor in generating and maintaining retroelement-free regions in the human genome. METHODOLOGY/PRINCIPAL FINDINGS: Based on the known transcriptional properties of retroelements, we expect long interspersed elements (LINEs) to be able to display a high degree of transcriptional interference. In contrast, we expect short interspersed elements (SINEs) to display very low levels of transcriptional interference. We find that genomic regions devoid of long interspersed elements (LINEs) are enriched for protein-coding genes, but that this is not the case for regions devoid of short interspersed elements (SINEs). This is expected if genes are subject to selection against transcriptional interference. We do not find microRNAs to be associated with genomic regions devoid of either SINEs or LINEs. We further observe an increased relative activity of genes overlapping LINE-free regions during early embryogenesis, where activity of LINEs has been identified previously. CONCLUSIONS/SIGNIFICANCE: Our observations are consistent with the notion that selection against transcriptional interference has contributed to the maintenance and/or generation of retroelement-free regions in the human genome

    Trichomonas Transmembrane Cyclases Result from Massive Gene Duplication and Concomitant Development of Pseudogenes

    Trichomonas vaginalis is the only medically important protist (single-cell eukaryote) that is sexually transmitted. The ∼160-Mb Trichomonas genome contains more predicted protein-encoding genes (∼60,000) than the human genome. To begin to understand why there are so many copies of some genes, we chose here to study a large family of genes encoding unique transmembrane cyclases. Our most important results include the following. More than 100 transmembrane cyclase genes do not result from chromosomal duplications, because for the most part only the coding regions of the genes, rather than flanking sequences, are duplicated. Almost half of the transmembrane cyclase genes are pseudogenes, and these pseudogenes are polymorphic among laboratory strains of Trichomonas. Messenger RNAs for numerous transmembrane cyclases are expressed simultaneously, and representative cyclase domains have adenylyl cyclase activity. In summary, the large family of Trichomonas genes encoding transmembrane adenylyl cyclases results from massive gene duplication and concomitant development of pseudogenes

    The complex genetic landscape of familial MDS and AML reveals pathogenic germline variants.

    The inclusion of familial myeloid malignancies as a separate disease entity in the revised WHO classification has renewed efforts to improve the recognition and management of this group of at-risk individuals. Here we report a cohort of 86 acute myeloid leukemia (AML) and myelodysplastic syndrome (MDS) families with 49 harboring germline variants in 16 previously defined loci (57%). Whole exome sequencing in a further 37 uncharacterized families (43%) allowed us to rationalize 65 new candidate loci, including genes mutated in rare hematological syndromes (ADA, GP6, IL17RA, PRF1 and SEC23B), reported in prior MDS/AML or inherited bone marrow failure series (DNAH9, NAPRT1 and SH2B3) or variants at novel loci (DHX34) that appear specific to inherited forms of myeloid malignancies. Altogether, our series of MDS/AML families offers novel insights into the etiology of myeloid malignancies and provides a framework to prioritize variants for inclusion into routine diagnostics and patient management

    Characterization of LINE-1 Ribonucleoprotein Particles

    The average human genome contains a small cohort of active L1 retrotransposons that encode two proteins (ORF1p and ORF2p) required for their mobility (i.e., retrotransposition). Prior studies demonstrated that human ORF1p, L1 RNA, and an ORF2p-encoded reverse transcriptase activity are present in ribonucleoprotein (RNP) complexes. However, the inability to physically detect ORF2p from engineered human L1 constructs has remained a technical challenge in the field. Here, we have employed an epitope/RNA tagging strategy with engineered human L1 retrotransposons to identify ORF1p, ORF2p, and L1 RNA in a RNP complex. We next used this system to assess how mutations in ORF1p and/or ORF2p impact RNP formation. Importantly, we demonstrate that mutations in the coiled-coil domain and RNA recognition motif of ORF1p, as well as the cysteine-rich domain of ORF2p, reduce the levels of ORF1p and/or ORF2p in L1 RNPs. Finally, we used this tagging strategy to localize the L1–encoded proteins and L1 RNA to cytoplasmic foci that often were associated with stress granules. Thus, we conclude that a precise interplay among ORF1p, ORF2p, and L1 RNA is critical for L1 RNP assembly, function, and L1 retrotransposition

    Proteins with Complex Architecture as Potential Targets for Drug Design: A Case Study of Mycobacterium tuberculosis

    Lengthy co-evolution of Homo sapiens and Mycobacterium tuberculosis, the main causative agent of tuberculosis, resulted in a dramatically successful pathogen species that presents considerable challenge for modern medicine. The continuous and ever increasing appearance of multi-drug resistant mycobacteria necessitates the identification of novel drug targets and drugs with new mechanisms of action. However, further insights are needed to establish automated protocols for target selection based on the available complete genome sequences. In the present study, we perform complete proteome level comparisons between M. tuberculosis, mycobacteria, other prokaryotes and available eukaryotes based on protein domains, local sequence similarities and protein disorder. We show that the enrichment of certain domains in the genome can indicate an important function specific to M. tuberculosis. We identified two families, termed pkn and PE/PPE that stand out in this respect. The common property of these two protein families is a complex domain organization that combines species-specific regions, commonly occurring domains and disordered segments. Besides highlighting promising novel drug target candidates in M. tuberculosis, the presented analysis can also be viewed as a general protocol to identify proteins involved in species-specific functions in a given organism. We conclude that target selection protocols should be extended to include proteins with complex domain architectures instead of focusing on sequentially unique and essential proteins only

    A Comprehensive Map of Mobile Element Insertion Polymorphisms in Humans

    As a consequence of the accumulation of insertion events over evolutionary time, mobile elements now comprise nearly half of the human genome. The Alu, L1, and SVA mobile element families are still duplicating, generating variation between individual genomes. Mobile element insertions (MEI) have been identified as causes for genetic diseases, including hemophilia, neurofibromatosis, and various cancers. Here we present a comprehensive map of 7,380 MEI polymorphisms from the 1000 Genomes Project whole-genome sequencing data of 185 samples in three major populations detected with two detection methods. This catalog enables us to systematically study mutation rates, population segregation, genomic distribution, and functional properties of MEI polymorphisms and to compare MEI to SNP variation from the same individuals. Population allele frequencies of MEI and SNPs are described, broadly, by the same neutral ancestral processes despite vastly different mutation mechanisms and rates, except in coding regions where MEI are virtually absent, presumably due to strong negative selection. A direct comparison of MEI and SNP diversity levels suggests a differential mobile element insertion rate among populations