791 research outputs found

    DeepBrain: Functional Representation of Neural In-Situ Hybridization Images for Gene Ontology Classification Using Deep Convolutional Autoencoders

    Full text link
    This paper presents a novel deep learning-based method for learning a functional representation of mammalian neural images. The method uses a deep convolutional denoising autoencoder (CDAE) for generating an invariant, compact representation of in situ hybridization (ISH) images. While most existing methods for bio-imaging analysis were not developed to handle images with highly complex anatomical structures, the results presented in this paper show that functional representation extracted by CDAE can help learn features of functional gene ontology categories for their classification in a highly accurate manner. Using this CDAE representation, our method outperforms the previous state-of-the-art classification rate, by improving the average AUC from 0.92 to 0.98, i.e., achieving 75% reduction in error. The method operates on input images that were downsampled significantly with respect to the original ones to make it computationally feasible

    How simple can a model of an empty viral capsid be? Charge distributions in viral capsids

    Full text link
    We investigate and quantify salient features of the charge distributions on viral capsids. Our analysis combines the experimentally determined capsid geometry with simple models for ionization of amino acids, thus yielding the detailed description of spatial distribution for positive and negative charge across the capsid wall. The obtained data is processed in order to extract the mean radii of distributions, surface charge densities and dipole moment densities. The results are evaluated and examined in light of previously proposed models of capsid charge distributions, which are shown to have to some extent limited value when applied to real viruses.Comment: 10 pages, 10 figures; accepted for publication in Journal of Biological Physic

    Male breast cancer

    Get PDF
    Male breast cancer (MBC) is a rare disease representing less than 1% of all breast cancers (BC) and less than 1% of cancers in men. Age at presentation is mostly in the late 60s. MBC is recognized as an estrogen-driven disease, specifically related to hyperestrogenism. About 20% of MBC patients have family history for BC. Mutations in BRCA1 and, predominantly, BRCA2, account for approximately 10% of MBC cases. Because of its rarity, MBC is often compared with female BC (FBC). Based on age-frequency distribution, age-specific incidence rate patterns and prognostic factors profiles, MBC is considered similar to late-onset, postmenopausal estrogen/progesterone receptor positive (ER+/PR+) FBC. However, clinical and pathological characteristics of MBC do not exactly overlap FBC. Compared with FBC, MBC has been reported to occur later in life, present at a higher stage, and display lower histologic grade, with a higher proportion of ER+ and PR+ tumors. Although rare, MBC remains a substantial cause for morbidity and mortality in men, probably because of its occurrence in advanced age and delayed diagnosis. Diagnosis and treatment of MBC generally is similar to that of FBC. Men tend to be treated with mastectomy rather than breast-conserving surgery. The backbone of adjuvant therapy or palliative treatment for advanced disease is endocrine, mostly tamoxifen. Use of FBC-based therapy led to the observation that treatment outcomes for MBC are worse and that survival rates for MBC do not improve like FBC. These different outcomes may suggest a non-appropriate utilization of treatments and that different underlying pathogenetic mechanisms may exist between male and female BC

    Statins are underused in recent-onset Parkinson's disease with increased vascular risk: findings from the UK Tracking Parkinson's and Oxford Parkinson's Disease Centre (OPDC) discovery cohorts.

    Get PDF
    BACKGROUND: Cardiovascular disease (CVD) influences phenotypic variation in Parkinson's disease (PD), and is usually an indication for statin therapy. It is less clear whether cardiovascular risk factors influence PD phenotype, and if statins are prescribed appropriately. OBJECTIVES: To quantify vascular risk and statin use in recent-onset PD, and examine the relationship between vascular risk, PD severity and phenotype. METHODS: Cardiovascular risk was quantified using the QRISK2 calculator (high ≥20%, medium ≥10 and <20%, low risk <10%). Motor severity and phenotype were assessed using the Movement Disorder Society Unified PD Rating Scale (UPDRS) and cognition by the Montreal cognitive assessment. RESULTS: In 2909 individuals with recent-onset PD, the mean age was 67.5 years (SD 9.3), 63.5% were men and the mean disease duration was 1.3 years (SD 0.9). 33.8% of cases had high vascular risk, 28.7% medium risk, and 22.3% low risk, while 15.2% of cases had established CVD. Increasing vascular risk and CVD were associated with older age (p<0.001), worse motor score (p<0.001), more cognitive impairment (p<0.001) and worse motor phenotype (p=0.021). Statins were prescribed in 37.2% with high vascular risk, 15.1% with medium vascular risk and 6.5% with low vascular risk, which compared with statin usage in 75.3% of those with CVD. CONCLUSIONS: Over 60% of recent-onset PD patients have high or medium cardiovascular risk (meriting statin usage), which is associated with a worse motor and cognitive phenotype. Statins are underused in these patients, compared with those with vascular disease, which is a missed opportunity for preventive treatment. TRIAL REGISTRATION NUMBER: GN11NE062, NCT02881099

    Large introns in relation to alternative splicing and gene evolution: a case study of Drosophila bruno-3

    Get PDF
    Background: Alternative splicing (AS) of maturing mRNA can generate structurally and functionally distinct transcripts from the same gene. Recent bioinformatic analyses of available genome databases inferred a positive correlation between intron length and AS. To study the interplay between intron length and AS empirically and in more detail, we analyzed the diversity of alternatively spliced transcripts (ASTs) in the Drosophila RNA-binding Bruno-3 (Bru-3) gene. This gene was known to encode thirteen exons separated by introns of diverse sizes, ranging from 71 to 41,973 nucleotides in D. melanogaster. Although Bru-3's structure is expected to be conducive to AS, only two ASTs of this gene were previously described. Results: Cloning of RT-PCR products of the entire ORF from four species representing three diverged Drosophila lineages provided an evolutionary perspective, high sensitivity, and long-range contiguity of splice choices currently unattainable by high-throughput methods. Consequently, we identified three new exons, a new exon fragment and thirty-three previously unknown ASTs of Bru-3. All exon-skipping events in the gene were mapped to the exons surrounded by introns of at least 800 nucleotides, whereas exons split by introns of less than 250 nucleotides were always spliced contiguously in mRNA. Cases of exon loss and creation during Bru-3 evolution in Drosophila were also localized within large introns. Notably, we identified a true de novo exon gain: exon 8 was created along the lineage of the obscura group from intronic sequence between cryptic splice sites conserved among all Drosophila species surveyed. Exon 8 was included in mature mRNA by the species representing all the major branches of the obscura group. To our knowledge, the origin of exon 8 is the first documented case of exonization of intronic sequence outside vertebrates. Conclusion: We found that large introns can promote AS via exon-skipping and exon turnover during evolution likely due to frequent errors in their removal from maturing mRNA. Large introns could be a reservoir of genetic diversity, because they have a greater number of mutable sites than short introns. Taken together, gene structure can constrain and/or promote gene evolution

    Predicting cell types and genetic variations contributing to disease by combining GWAS and epigenetic data

    Get PDF
    Genome-wide association studies (GWASs) identify single nucleotide polymorphisms (SNPs) that are enriched in individuals suffering from a given disease. Most disease-associated SNPs fall into non-coding regions, so that it is not straightforward to infer phenotype or function; moreover, many SNPs are in tight genetic linkage, so that a SNP identified as associated with a particular disease may not itself be causal, but rather signify the presence of a linked SNP that is functionally relevant to disease pathogenesis. Here, we present an analysis method that takes advantage of the recent rapid accumulation of epigenomics data to address these problems for some SNPs. Using asthma as a prototypic example; we show that non-coding disease-associated SNPs are enriched in genomic regions that function as regulators of transcription, such as enhancers and promoters. Identifying enhancers based on the presence of the histone modification marks such as H3K4me1 in different cell types, we show that the location of enhancers is highly cell-type specific. We use these findings to predict which SNPs are likely to be directly contributing to disease based on their presence in regulatory regions, and in which cell types their effect is expected to be detectable. Moreover, we can also predict which cell types contribute to a disease based on overlap of the disease-associated SNPs with the locations of enhancers present in a given cell type. Finally, we suggest that it will be possible to re-analyze GWAS studies with much higher power by limiting the SNPs considered to those in coding or regulatory regions of cell types relevant to a given disease

    708 Common and 2010 rare DISC1 locus variants identified in 1542 subjects:analysis for association with psychiatric disorder and cognitive traits

    Get PDF
    A balanced t(1;11) translocation that transects the Disrupted in schizophrenia 1 (DISC1) gene shows genome-wide significant linkage for schizophrenia and recurrent major depressive disorder (rMDD) in a single large Scottish family, but genome-wide and exome sequencing-based association studies have not supported a role for DISC1 in psychiatric illness. To explore DISC1 in more detail, we sequenced 528 kb of the DISC1 locus in 653 cases and 889 controls. We report 2718 validated single-nucleotide polymorphisms (SNPs) of which 2010 have a minor allele frequency of &lt;1%. Only 38% of these variants are reported in the 1000 Genomes Project European subset. This suggests that many DISC1 SNPs remain undiscovered and are essentially private. Rare coding variants identified exclusively in patients were found in likely functional protein domains. Significant region-wide association was observed between rs16856199 and rMDD (P=0.026, unadjusted P=6.3 × 10-5, OR=3.48). This was not replicated in additional recurrent major depression samples (replication P=0.11). Combined analysis of both the original and replication set supported the original association (P=0.0058, OR=1.46). Evidence for segregation of this variant with disease in families was limited to those of rMDD individuals referred from primary care. Burden analysis for coding and non-coding variants gave nominal associations with diagnosis and measures of mood and cognition. Together, these observations are likely to generalise to other candidate genes for major mental illness and may thus provide guidelines for the design of future studies. © 2014 Macmillan Publishers Limited

    KoVariome: Korean National Standard Reference Variome database of whole genomes with comprehensive SNV, indel, CNV, and SV analyses

    Get PDF
    High-coverage whole-genome sequencing data of a single ethnicity can provide a useful catalogue of population-specific genetic variations, and provides a critical resource that can be used to more accurately identify pathogenic genetic variants. We report a comprehensive analysis of the Korean population, and present the Korean National Standard Reference Variome (KoVariome). As a part of the Korean Personal Genome Project (KPGP), we constructed the KoVariome database using 5.5 terabases of whole genome sequence data from 50 healthy Korean individuals in order to characterize the benign ethnicity-relevant genetic variation present in the Korean population. In total, KoVariome includes 12.7M single-nucleotide variants (SNVs), 1.7M short insertions and deletions (indels), 4K structural variations (SVs), and 3.6K copy number variations (CNVs). Among them, 2.4M (19%) SNVs and 0.4M (24%) indels were identified as novel. We also discovered selective enrichment of 3.8M SNVs and 0.5M indels in Korean individuals, which were used to filter out 1,271 coding-SNVs not originally removed from the 1,000 Genomes Project when prioritizing disease-causing variants. KoVariome health records were used to identify novel disease-causing variants in the Korean population, demonstrating the value of high-quality ethnic variation databases for the accurate interpretation of individual genomes and the precise characterization of genetic variation

    Transcriptomes and expression profiling of deep-sea corals from the Red Sea provide insight into the biology of azooxanthellate corals

    Get PDF
    Despite the importance of deep-sea corals, our current understanding of their ecology and evolutionis limited due to difficulties in sampling and studying deep-sea environments. Moreover, a recent reevaluation of habitat limitations has been suggested after characterization of deep-sea corals in the Red Sea, where they live at temperatures of above 20 °C at low oxygen concentrations. To gain further insight into the biology of deep-sea corals, we produced reference transcriptomes and studied gene expression of three deep-sea coral species from the Red Sea, i.e. Dendrophyllia sp., Eguchipsammia fistula, and Rhizotrochus typus. Our analyses suggest that deep-sea coral employ mitochondrial hypometabolism and anaerobic glycolysis to manage low oxygen conditions present in the Red Sea. Notably, we found expression of genes related to surface cilia motion that presumably enhance small particle transport rates in the oligotrophic deep-sea environment. This is the first study to characterize transcriptomes and in situ gene expression for deep-sea corals. Our work offers several mechanisms by which deep-sea corals might cope with the distinct environmental conditions present in the Red Sea. As such, our data provides direction for future research and further insight to organismal response of deep sea coral to environmental change and ocean warming.Tis work was supported by King Abdullah University of Science and Technology (KAUST), baseline funds to CRV and Center Competitive Funding (CCF) Program FCC/1/1973-18-01
    corecore