
    Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change

    Mapping cis-acting expression quantitative trait loci (cis-eQTL) has become a popular approach for characterizing proximal genetic regulatory variants. In this paper, we describe and characterize log allelic fold change (aFC), the magnitude of expression change associated with a given genetic variant, as a biologically interpretable unit for quantifying the effect size of cis-eQTLs and a mathematically convenient approach for systematic modeling of cis-regulation. This measure is mathematically independent from expression level and allele frequency, additive, applicable to multiallelic variants, and generalizable to multiple independent variants. We provide efficient tools and guidelines for estimating aFC from both eQTL and allelic expression data sets and apply it to Genotype Tissue Expression (GTEx) data. We show that aFC estimates independently derived from eQTL and allelic expression data are highly consistent, and identify technical and biological correlates of eQTL effect size. We generalize aFC to analyze genes with two eQTLs in GTEx and show that in nearly all cases the two eQTLs act independently in regulating gene expression. In summary, aFC is a solid measure of cis-regulatory effect size that allows quantitative interpretation of cellular regulatory events from population data, and it is a valuable approach for investigating novel aspects of eQTL data sets
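The idea of a log allelic fold change can be illustrated with a minimal sketch. It assumes a base-2 logarithm and a simple additive two-haplotype model in which each haplotype contributes independently to total expression; the abstract states neither detail, and the function names `expected_expression` and `afc_from_allelic_counts` are hypothetical. This is not the authors' tool.

```python
import math


def expected_expression(e_ref, afc_log2, n_alt):
    """Expected total expression of a gene for one individual.

    Assumptions (not from the abstract): a haplotype carrying the reference
    allele contributes e_ref, a haplotype carrying the alternative allele
    contributes e_ref * 2**afc_log2, and the two haplotypes add up.
    n_alt is the number of alternative alleles (0, 1, or 2).
    """
    if n_alt not in (0, 1, 2):
        raise ValueError("n_alt must be 0, 1, or 2")
    e_alt = e_ref * 2.0 ** afc_log2
    return (2 - n_alt) * e_ref + n_alt * e_alt


def afc_from_allelic_counts(alt_count, ref_count):
    """Log2 ratio of alternative-allele to reference-allele read counts
    observed in a heterozygous individual (illustrative estimator only)."""
    if alt_count <= 0 or ref_count <= 0:
        raise ValueError("counts must be positive")
    return math.log2(alt_count / ref_count)


if __name__ == "__main__":
    # With aFC = 1 the alternative haplotype is expressed twice as strongly.
    for n_alt in (0, 1, 2):
        print(n_alt, expected_expression(10.0, 1.0, n_alt))
    print(afc_from_allelic_counts(200, 100))
```

Under these assumptions the effect size is a property of the haplotype ratio alone, which is why it does not depend on the baseline expression level `e_ref` or on how common the allele is.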

    Association of Human iPSC Gene Signatures and X Chromosome Dosage with Two Distinct Cardiac Differentiation Trajectories.

    Despite the importance of understanding how variability across induced pluripotent stem cell (iPSC) lines due to non-genetic factors (clone and passage) influences their differentiation outcome, large-scale studies capable of addressing this question have not yet been conducted. Here, we differentiated 191 iPSC lines to generate iPSC-derived cardiovascular progenitor cells (iPSC-CVPCs). We observed cellular heterogeneity across the iPSC-CVPC samples due to varying fractions of two cell types: cardiomyocytes (CMs) and epicardium-derived cells (EPDCs). Comparing the transcriptomes of CM-fated and EPDC-fated iPSCs, we discovered that 91 signature genes and X chromosome dosage differences are associated with these two distinct cardiac developmental trajectories. In an independent set of 39 iPSCs differentiated into CMs, we confirmed that sex and transcriptional differences affect cardiac-fate outcome. Our study provides novel insights into how iPSC transcriptional and X chromosome gene dosage differences influence their response to differentiation stimuli and, hence, cardiac cell fate

    A Gene-Based Association Method for Mapping Traits Using Reference Transcriptome Data

    Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys the benefits of gene-based approaches such as reduced multiple-testing burden and a principled approach to the design of follow-up experiments. Our results demonstrate that PrediXcan can detect known and new genes associated with disease traits and provide insights into the mechanism of these associations
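The two steps described above (imputing expression from genotype, then correlating it with the phenotype) can be sketched as follows. This is a toy illustration, not the PrediXcan software: the linear weighted-sum predictor, the use of a Pearson correlation as the association statistic, and the names `impute_expression` and `pearson_r` are all assumptions made for the example, and the genotype dosages and weights are made up.

```python
from math import sqrt


def impute_expression(dosages, weights):
    """Predict genetically regulated expression per individual as a
    weighted sum of allele dosages (one row per individual, one column
    per variant). The weights stand in for a pre-trained prediction model."""
    return [sum(d * w for d, w in zip(row, weights)) for row in dosages]


def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical data: 4 individuals, 2 variants, dosages in {0, 1, 2}.
    dosages = [[0, 1], [1, 1], [2, 0], [2, 2]]
    weights = [0.5, -0.2]  # hypothetical model weights for one gene
    imputed = impute_expression(dosages, weights)
    phenotype = [0.1, 0.9, 2.2, 1.4]  # hypothetical trait values
    print(imputed, pearson_r(imputed, phenotype))
```

Because the test is run once per gene rather than once per variant, the number of tests is much smaller than in a variant-level scan, which is the reduced multiple-testing burden the abstract refers to.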

    Simultaneous enumeration of cancer and immune cell types from bulk tumor gene expression data.

    Immune cells infiltrating tumors can have an important impact on tumor progression and response to therapy. We present an efficient algorithm to simultaneously estimate the fraction of cancer and immune cell types from bulk tumor gene expression data. Our method integrates novel gene expression profiles from each major non-malignant cell type found in tumors, renormalization based on cell-type-specific mRNA content, and the ability to consider uncharacterized and possibly highly variable cell types. Feasibility is demonstrated by validation with flow cytometry, immunohistochemistry and single-cell RNA-Seq analyses of human melanoma and colorectal tumor specimens. Altogether, our work not only improves accuracy but also broadens the scope of absolute cell fraction predictions from tumor gene expression data, and provides a unique novel experimental benchmark for immunogenomics analyses in cancer research (http://epic.gfellerlab.org)

    Identification of a Bipolar Disorder Vulnerable Gene CHDH at 3p21.1

    Genome-wide analysis (GWA) is an effective strategy for discovering extreme effects that surpass genome-wide significance levels in studies of complex disorders; however, when sample size is limited, true effects may fail to achieve genome-wide significance. In such cases, there may be authentic results among the pools of nominal candidates, and an alternative approach is to consider nominal candidates that are replicable across different samples. Here, we found that mRNA expression of the choline dehydrogenase gene (CHDH) was uniformly upregulated in the brains of bipolar disorder (BPD) patients compared with healthy controls across different studies. Follow-up genetic analyses of CHDH variants in multiple independent clinical datasets (including 11,564 cases and 17,686 controls) identified a risk SNP, rs9836592, showing consistent associations with BPD (P_meta = 5.72 × 10^-4), and the risk allele was associated with increased CHDH expression in multiple neuronal tissues (lowest P = 6.70 × 10^-16). These converging results may identify a nominal but true BPD susceptibility gene, CHDH. Further exploratory analysis revealed suggestive associations of rs9836592 with childhood intelligence (P = 0.044) and educational attainment (P = 0.0039), a 'proxy phenotype' of general cognitive abilities. Intriguingly, the CHDH gene is located at chromosome 3p21.1, a risk region implicated in previous BPD genome-wide association studies (GWAS), but CHDH lies outside the core GWAS linkage disequilibrium (LD) region, and our studied SNP rs9836592 is ∼1.2 Mb 3' downstream of the previous GWAS loci (e.g., rs2251219) with no LD between them; thus, the association observed here is unlikely to be a reflection of previous GWAS signals. In summary, our results imply that CHDH may play a previously unknown role in the etiology of BPD and also highlight the informative value of integrating gene expression and genetic code in advancing our understanding of the biological basis of BPD

    The protocadherin 17 gene affects cognition, personality, amygdala structure and function, synapse development and risk of major mood disorders

    Major mood disorders, which primarily include bipolar disorder and major depressive disorder, are the leading cause of disability worldwide and pose a major challenge in identifying robust risk genes. Here, we present data from independent large-scale clinical data sets (including 29 557 cases and 32 056 controls) revealing brain expressed protocadherin 17 (PCDH17) as a susceptibility gene for major mood disorders. Single-nucleotide polymorphisms (SNPs) spanning the PCDH17 region are significantly associated with major mood disorders; subjects carrying the risk allele showed impaired cognitive abilities, increased vulnerable personality features, decreased amygdala volume and altered amygdala function as compared with non-carriers. The risk allele predicted higher transcriptional levels of PCDH17 mRNA in postmortem brain samples, which is consistent with increased gene expression in patients with bipolar disorder compared with healthy subjects. Further, overexpression of PCDH17 in primary cortical neurons revealed significantly decreased spine density and abnormal dendritic morphology compared with control groups, which again is consistent with the clinical observations of reduced numbers of dendritic spines in the brains of patients with major mood disorders. Given that synaptic spines are dynamic structures which regulate neuronal plasticity and have crucial roles in myriad brain functions, this study reveals a potential underlying biological mechanism of a novel risk gene for major mood disorders involved in synaptic function and related intermediate phenotypes

    The Human Skeletal Muscle Proteome Project: a reappraisal of the current literature

    Skeletal muscle is a large organ that accounts for up to half the total mass of the human body. A progressive decline in muscle mass and strength occurs with ageing and in some individuals configures the syndrome of 'sarcopenia', a condition that impairs mobility, challenges autonomy, and is a risk factor for mortality. The mechanisms leading to sarcopenia as well as myopathies are still poorly understood. The Human Skeletal Muscle Proteome Project was initiated with the aim of characterizing muscle proteins and how they change with ageing and disease. We conducted an extensive review of the literature and analysed publicly available protein databases. A systematic search of peer-reviewed studies was performed using PubMed. Search terms included 'human', 'skeletal muscle', 'proteome', 'proteomic(s)', 'mass spectrometry', and 'liquid chromatography-mass spectrometry (LC-MS/MS)'. A catalogue of 5431 non-redundant muscle proteins identified by mass spectrometry-based proteomics from 38 peer-reviewed scientific publications from 2002 to November 2015 was created. We also developed a nosology system for the classification of muscle proteins based on localization and function. Such an inventory of proteins should serve as a useful background reference for future research on changes in the muscle proteome, assessed by quantitative mass spectrometry-based proteomic approaches, that occur with ageing and disease. This classification and compilation of the human skeletal muscle proteome can be used for the identification and quantification of proteins in skeletal muscle to discover new mechanisms for sarcopenia and specific muscle diseases that can be targeted for prevention and treatment