Search CORE

570 research outputs found

LIMIX: genetic analysis of multiple traits

Author: Casale F.P.
Lippert C.
Rakitsch B.
Stegle O.
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 21/05/2014
Field of study

Multi-trait mixed models have emerged as a promising approach for joint analyses of multiple traits. In principle, the mixed model framework is remarkably general. However, current methods implement only a very specific range of tasks to optimize the necessary computations. Here, we present a multi-trait modeling framework that is versatile and fast: LIMIX enables to exibly adapt mixed models for a broad range of applications with different observed and hidden covariates, and variable study designs. To highlight the novel modeling aspects of LIMIX we performed three vastly different genetic studies: joint GWAS of correlated blood lipid phenotypes, joint analysis of the expression levels of the multiple transcript-isoforms of a gene, and pathway-based modeling of molecular traits across environments. In these applications we show that LIMIX increases GWAS power and phenotype prediction accuracy, in particular when integrating stepwise multi-locus regression into multi-trait models, and when analyzing large numbers of traits. An open source implementation of LIMIX is freely available at: https://github.com/PMBio/limix

Crossref

MDC Repository

Accelerating Bayesian hierarchical clustering of time series data with a randomised algorithm

Author: A Schliep
David L. Wild
E Cooke
Emma J. Cooke
G Brock
K Heller
L Bauwens
L Hubert
M Eisen
Magnus Rattray
NA Heard
NA Heard
O Stegle
P Ma
Paul D. W. Kirk
PDW Kirk
Q Liu
R Cho
Richard S. Savage
Robert Darkins
RS Savage
RS Savage
S Datta
S Frühwirth-Schnatter
W Chu
Z Bar-Joseph
Z Bar-Joseph
Zoubin Ghahramani
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 02/04/2013
Field of study

We live in an era of abundant data. This has necessitated the development of new and innovative statistical algorithms to get the most from experimental data. For example, faster algorithms make practical the analysis of larger genomic data sets, allowing us to extend the utility of cutting-edge statistical methods. We present a randomised algorithm that accelerates the clustering of time series data using the Bayesian Hierarchical Clustering (BHC) statistical method. BHC is a general method for clustering any discretely sampled time series data. In this paper we focus on a particular application to microarray gene expression data. We define and analyse the randomised algorithm, before presenting results on both synthetic and real biological data sets. We show that the randomised algorithm leads to substantial gains in speed with minimal loss in clustering quality. The randomised time series BHC algorithm is available as part of the R package BHC, which is available for download from Bioconductor (version 2.10 and above) via http://bioconductor.org/packages/2.10/bioc/html/BHC.html. We have also made available a set of R scripts which can be used to reproduce the analyses carried out in this paper. These are available from the following URL. https://sites.google.com/site/randomisedbhc/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Warwick Research Archives Portal Repository

Using the past to estimate sensory uncertainty

Author: Beierholm U.
Ferrari A.
Noppeney U.
Rohe T.
Stegle O.
Publication venue: eLife Sciences Publications
Publication date: 01/01/2020
Field of study

To form a more reliable percept of the environment, the brain needs to estimate its own sensory uncertainty. Current theories of perceptual inference assume that the brain computes sensory uncertainty instantaneously and independently for each stimulus. We evaluated this assumption in four psychophysical experiments, in which human observers localized auditory signals that were presented synchronously with spatially disparate visual signals. Critically, the visual noise changed dynamically over time continuously or with intermittent jumps. Our results show that observers integrate audiovisual inputs weighted by sensory uncertainty estimates that combine information from past and current signals consistent with an optimal Bayesian learner that can be approximated by exponential discounting. Our results challenge leading models of perceptual inference where sensory uncertainty estimates depend only on the current stimulus. They demonstrate that the brain capitalizes on the temporal dynamics of the external world and estimates sensory uncertainty by combining past experiences with new incoming sensory signals

Durham Research Online

Crossref

OPEN FAU Online-Publikationssystem der Friedrich-Alexander-Universität Erlangen-Nürnberg

Radboud Repository (Radboud Univ.)

MPG.PuRe

Normalizing single-cell RNA sequencing data: challenges and opportunities

Single-cell transcriptomics is becoming an important component of the molecular biologist's toolkit. A critical step when analyzing data generated using this technology is normalization. However, normalization is typically performed using methods developed for bulk RNA sequencing or even microarray data, and the suitability of these methods for single-cell transcriptomics has not been assessed. We here discuss commonly used normalization approaches and illustrate how these can produce misleading results. Finally, we present alternative approaches and provide recommendations for single-cell RNA sequencing users

Crossref

UCL Discovery

eScholarship - University of California

Archivio istituzionale della ricerca - Università di Padova

Detection of regulator genes and eQTLs in gene networks

Author: A Butte
A Chatr-Aryamontri
A Clauset
A Joshi
A Joshi
A Kundaje
AA Shabalin
AJ Enright
AJ Walhout
AS Dimas
B Schwanhausser
B Zhang
B Zhang
C Cenik
CO Daub
D Koller
DA Cusanovich
DM Greenawalt
E Bonnet
E Ravasz
E Segal
EC Neto
EC Neto
EC Neto
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EJ Foss
F Grubert
F Yue
FA Cubillos
FW Albert
G Hemani
G Nicholson
GD Smith
GH Golub
H Foroughi Asl
H Talukdar
HN Kadarmideen
J Millstein
J Qi
J Zhu
J Zhu
J Zhu
JE Aten
JF Ayroles
JJ Faith
JL Björkegren
JS Liu
K Basso
K Qu
KG Ardlie
L Wu
LA Hindorff
LH Hartwell
LS Chen
M Ashburner
M Civelek
M Georges
M Gerstein
M Medvedovic
M Schmidt
M Scutari
MA Schaub
MB Eisen
MD Ritchie
ME Goddard
MEJ Newman
MEJ Newman
MV Rockman
MV Rockman
N Friedman
N Friedman
N Friedman
N Laird
O Stegle
P Langfelder
P Langfelder
P Langfelder
P Lu
R Sharan
R Sharan
RB Brem
RW Williams
S Lee
S Roy
S Tavazoie
SI Lee
SM Waszak
SS Rao
T Lappalainen
T Michoel
TA Manolio
TF Mackay
The ENCODE
TS Furey
VG Cheung
W Cookson
W Zhang
Y Chen
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2016
Field of study

Genetic differences between individuals associated to quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated to differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and softwares to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.Comment: minor revision with typos corrected; review article; 24 pages, 2 figure

arXiv.org e-Print Archive

Crossref

Cardiac cycle estimation for BOLD-FMRI

Author: C Chang
GH Glover
K Murphy
K Shmueli
ME Wagshul
O Stegle
R Dürichen
TD Verstynen
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2018
Field of study

Previous studies [1, 2] have shown that slow variations in the cardiac cycle are coupled with signal changes in the blood-oxygen level dependent (BOLD) contrast. The detection of neurophysiological hemodynamic changes, driven by neuronal activity, is hampered by such physiological noise. It is therefore of great importance to model and remove these physiological artifacts. The cardiac cycle causes pulsatile arterial blood flow. This pulsation is translated into brain tissue and fluids bounded by the cranial cavity [3]. We exploit this pulsality effect in BOLD fMRI volumes to build a reliable cardio surrogate estimate. We propose a Gaussian Process (GP) heart rate model to build physiological noise regressors for the General Linear Model (GLM) used in fMRI analysis. The proposed model can also incorporate information from physiological recordings such as photoplethysmogram or electrocardiogram, and is able to learn the temporal interdependence of individual modalities

Crossref

UCL Discovery

King's Research Portal

Genome-Wide Association Study and Gene Expression Analysis Identifies CD84 as a Predictor of Response to Etanercept Therapy in Rheumatoid Arthritis

Author: A Parker
AI Catrina
AL Price
Alison Motsinger-Reif
AM van Gestel
Ann W. Morgan
Anne Barton
Anthony G. Wilson
Atsuo Taniguchi
B Devlin
Barbara E. Stranger
BE Stranger
BJ Scallon
C Liu
C Miceli-Richard
Chikashi Terao
Corinne Miceli
Cornelia F. Allaart
D Aeberli
D Plant
D Plant
DL Scott
Dorothee Diogo
EA Stahl
EA Stahl
EJ Toonen
Eli A. Stahl
Elizabeth W. Karlson
FM Batliwalla
Fumihiko Matsuda
Gert Jan Wolbink
GM Cooper
Gosia Trynka
H Canhao
Helena Canhao
Henk-Jan Guchelaar
Hisashi Yamanaka
IB McInnes
Irene E. van der Horst-Bruinsma
J Agnholt
J Cui
J Ernst
J Marchini
J. Bart A. Crusius
Jing Cui
Johan Askling
John D. Isaacs
João Eurico Fonseca
Katsunori Ikari
Kimme L. Hyrich
Koichiro Ohmura
L Klareskog
L Padyukov
Larry W. Moreland
LD Ward
Leonid Padyukov
Lindsey A. Criswell
M Martin
Manik Kuchroo
Marieke E. Doorenspleet
Marieke Herenius
Marieke J. H. Coenen
Mart van de Laar
Maša Umiċeviċ Mirkov
Michael E. Weinblatt
MJ Coenen
ML Prevoo
MM Soliman
Namrata Gupta
Nancy A. Shadick
Niek de Vries
O Stegle
Paul-Peter Tak
Peter K. Gregersen
Philip L. De Jager
PI de Bakker
Piet L. C. M. van Riel
PL De Jager
R Prajapati
Rene E. M. Toes
RM Plenge
Robert M. Plenge
Robert P. Kimberly
S Gudbrandsdottir
S Purcell
S. Louis Bridges
SA Gauthier
Saedis Saevarsdottir
Sara Wedrén
SG Tangye
Shigeki Momohara
Soumya Raychaudhuri
Tom W. J. Huizinga
Towfique Raj
Tsuneyo Mimori
Xavier Mariette
Y Okada
Yukinori Okada
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Anti-tumor necrosis factor alpha (anti-TNF) biologic therapy is a widely used treatment for rheumatoid arthritis (RA). It is unknown why some RA patients fail to respond adequately to anti-TNF therapy, which limits the development of clinical biomarkers to predict response or new drugs to target refractory cases. To understand the biological basis of response to anti-TNF therapy, we conducted a genome-wide association study (GWAS) meta-analysis of more than 2 million common variants in 2,706 RA patients from 13 different collections. Patients were treated with one of three anti-TNF medications: etanercept (n = 733), infliximab (n = 894), or adalimumab (n = 1,071). We identified a SNP (rs6427528) at the 1q23 locus that was associated with change in disease activity score (ΔDAS) in the etanercept subset of patients (P = 8×10-8), but not in the infliximab or adalimumab subsets (P>0.05). The SNP is predicted to disrupt transcription factor binding site motifs in the 3′ UTR of an immune-related gene, CD84, and the allele associated with better response to etanercept was associated with higher CD84 gene expression in peripheral blood mononuclear cells (P = 1×10-11 in 228 non-RA patients and P = 0.004 in 132 RA patients). Consistent with the genetic findings, higher CD84 gene expression correlated with lower cross-sectional DAS (P = 0.02, n = 210) and showed a non-significant trend for better ΔDAS in a subset of RA patients with gene expression data (n = 31, etanercept-treated). A small, multi-ethnic replication showed a non-significant trend towards an association among etanercept-treated RA patients of Portuguese ancestry (n = 139, P = 0.4), but no association among patients of Japanese ancestry (n = 151, P = 0.8). Our study demonstrates that an allele associated with response to etanercept therapy is also associated with CD84 gene expression, and further that CD84 expression correlates with disease activity. These findings support a model in which CD84 genotypes and/or expression may serve as a useful biomarker for response to etanercept treatment in RA patients of European ancestry. © 2013 Cui et al

Directory of Open Access Journals

D-Scholarship@Pitt

Leiden University Scholary Publications

The University of Manchester - Institutional Repository

White Rose Research Online

The Francis Crick Institute

Crossref

Harvard University - DASH

PubMed Central

Radboud Repository (Radboud Univ.)

Methods to study splicing from high-throughput RNA Sequencing data

Author: A Ameur
A Bhasi
A Dobin
A Mortazavi
A Oshlack
A Roberts
A Roberts
AM Mezlini
AN Brooks
B Jackson
B Kakaradov
B Langmead
B Li
B Li
BJ Haas
BJ Haas
C Trapnell
C Trapnell
C Trapnell
D Hiller
D Singh
DL Wood
DW Bryant
E Eyras
E Lee
E Turro
ET Wang
F Birzele
F Bona De
F Denoeud
F Tang
G Robertson
G Xu
GA Sacomoto
GR Grant
GS Slater
H Bao
H Jiang
H Jiang
H Kim
H Richard
J Behr
J Du
J Feng
J Hu
J Lovén
J Martin
J Salzman
J Seok
J Seok
J Wu
J Wu
JE Allen
JJ Li
JP Venables
K Schneeberger
K Wang
KD Hansen
KF Au
KL Howe
KM Borgwardt
L Chen
L Chen
L Wang
L Wang
LY Chen
M Aschoff
M Fiume
M Garber
M Griffith
M Guttman
M Stanke
M Stanke
M Sultan
MC Ryan
MF Rogers
MG Grabherr
MH Schulz
MT Dimon
N Cloonan
N Cloonan
N Deng
N Leng
N Nicolae
N Philippe
N Vijay
NA Fonseca
O Stegle
P Drewe
P Glaus
PL Martelli
PP Labaj
Q Liu
Q Liu
Q Pan
QY Zhao
R Bohnert
R Guigó
R Li
S Anders
S Djebali
S Filichkin
S Heber
S Huang
S Lee
S Mangul
S Marco-Sola
S Shen
S Sonnenburg
S Srivastava
S Tang
S Zheng
SB Montgomery
SH Nagaraj
SK Lou
T Bonfert
TA Clark
TD Wu
TD Wu
W Li
W Li
W Wang
WJ Kent
Y Hu
Y Katz
Y Li
Y Liao
Y Surget-Groba
Y Xing
Y Xing
Y Zhang
Z Xia
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/07/2015
Field of study

The development of novel high-throughput sequencing (HTS) methods for RNA (RNA-Seq) has provided a very powerful mean to study splicing under multiple conditions at unprecedented depth. However, the complexity of the information to be analyzed has turned this into a challenging task. In the last few years, a plethora of tools have been developed, allowing researchers to process RNA-Seq data to study the expression of isoforms and splicing events, and their relative changes under different conditions. We provide an overview of the methods available to study splicing from short RNA-Seq data. We group the methods according to the different questions they address: 1) Assignment of the sequencing reads to their likely gene of origin. This is addressed by methods that map reads to the genome and/or to the available gene annotations. 2) Recovering the sequence of splicing events and isoforms. This is addressed by transcript reconstruction and de novo assembly methods. 3) Quantification of events and isoforms. Either after reconstructing transcripts or using an annotation, many methods estimate the expression level or the relative usage of isoforms and/or events. 4) Providing an isoform or event view of differential splicing or expression. These include methods that compare relative event/isoform abundance or isoform expression across two or more conditions. 5) Visualizing splicing regulation. Various tools facilitate the visualization of the RNA-Seq data in the context of alternative splicing. In this review, we do not describe the specific mathematical models behind each method. Our aim is rather to provide an overview that could serve as an entry point for users who need to decide on a suitable tool for a specific analysis. We also attempt to propose a classification of the tools according to the operations they do, to facilitate the comparison and choice of methods.Comment: 31 pages, 1 figure, 9 tables. Small corrections adde

arXiv.org e-Print Archive

Crossref

Joint modelling of confounding factors and prominent genetic regulators provides increased accuracy in genetical genomics studies.

Author: A Myers
A Nica
A Price
BE Stranger
C Lippert
D Balding
D Locke
E Schadt
EN Smith
G Churchill
H Kang
HM Kang
HM Kang
J Listgarten
J Pickrell
J Yu
JT Leek
Matthew Stephens
MC Teixeira
MI McCarthy
Neil D. Lawrence
Nicoló Fusi
O Stegle
O Stegle
Oliver Stegle
R Breitling
RB Brem
V Plagnol
WE Johnson
X Gan
Publication venue: PLoS Comput Biol
Publication date: 01/01/2012
Field of study

Expression quantitative trait loci (eQTL) studies are an integral tool to investigate the genetic component of gene expression variation. A major challenge in the analysis of such studies are hidden confounding factors, such as unobserved covariates or unknown subtle environmental perturbations. These factors can induce a pronounced artifactual correlation structure in the expression profiles, which may create spurious false associations or mask real genetic association signals. Here, we report PANAMA (Probabilistic ANAlysis of genoMic dAta), a novel probabilistic model to account for confounding factors within an eQTL analysis. In contrast to previous methods, PANAMA learns hidden factors jointly with the effect of prominent genetic regulators. As a result, this new model can more accurately distinguish true genetic association signals from confounding variation. We applied our model and compared it to existing methods on different datasets and biological systems. PANAMA consistently performs better than alternative methods, and finds in particular substantially more trans regulators. Importantly, our approach not only identifies a greater number of associations, but also yields hits that are biologically more plausible and can be better reproduced between independent studies. A software implementation of PANAMA is freely available online at http://ml.sheffield.ac.uk/qtl/

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Publikationsserver der Universität Tübingen

Apollo (Cambridge)

White Rose Research Online

MPG.PuRe

The Francis Crick Institute