Search CORE

75 research outputs found

Genomic prediction of health traits in humans: demonstrating the value of marker selection.

Author: Agakov Felix
Bermingham Mairead
Campbell Harry
Haley Chris
Hayward Caroline
Navarro Pau
Pong-Wong Ricardo
Rudan Igor
Spiliopoulou Athina
Wilson Jim
Wright Alan
Publication venue
Publication date: 17/08/2014
Field of study

Kernelized Infomax Clustering

Author: Agakov Felix
Barber David
Publication venue: IDIAP
Publication date: 10/03/2006
Field of study

We propose a simple information-theoretic clustering approach based on maximizing the mutual information I(\sfx,y) between the unknown cluster labels

y

and the training patterns \sfx with respect to parameters of specifically constrained encoding distributions. The constraints are chosen such that patterns are likely to be clustered similarly if they lie close to specific (unknown) vectors in the feature space. The method may be conveniently applied to learning the optimal affinity matrix, which corresponds to learning parameters of the kernelized encoder. The procedure does not require computations of eigenvalues or inverses of the Gram matrices, which makes it potentially attractive for clustering large data sets

Infoscience - École polytechnique fédérale de Lausanne

Variational Information Maximization in Gaussian Channels

Author: Agakov Felix
Barber David
Publication venue: Rue de Simplon 4, Martigny, CH-1920, Switerland, IDIAP
Publication date: 10/03/2006
Field of study

Recently, we introduced a simple variational bound on mutual information, that resolves some of the difficulties in the application of information theory to machine learning. Here we study a specific application to Gaussian channels. It is well known that PCA may be viewed as the solution to maximizing information transmission between a high dimensional vector and its low dimensional representation . However, such results are based on assumptions of Gaussianity of the sources. In this paper, we show how our mutual information bound, when applied to this arena, gives PCA solutions, without the need for the Gaussian assumption. Furthermore, it naturally generalizes to providing an objective function for Kernel PCA, enabling the principled selection of kernel parameters

Infoscience - École polytechnique fédérale de Lausanne

Kernel multi-task learning using task-specific features

Author: Agakov Felix V.
Bonilla Edwin V.
Williams Christopher K.I.
Publication venue
Publication date: 01/01/2007
Field of study

In this paper we are concerned with multitask learning when task-specific features are available. We describe two ways of achieving this using Gaussian process predictors: in the first method, the data from all tasks is combined into one dataset, making use of the task-specific features. In the second method we train specific predictors for each reference task, and then combine their predictions using a gating network. We demonstrate these methods on a compiler performance prediction problem, where a task is defined as predicting the speed-up obtained when applying a sequence of code transformations to a given program

Edinburgh Research Explorer

Computational Semantics with Functional Programming, by Jan van Eijck and Christina Unger

Author: Agakov Felix
Orchard Peter
Storkey Amos
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 27/09/2013
Field of study

One of the fundamental tasks of science is to find explainable relationships between observed phenomena. One approach to this task that has received attention in recent years is based on probabilistic graphical modelling with sparsity constraints on model structures. In this paper, we describe two new approaches to Bayesian inference of sparse structures of Gaussian graphical models (GGMs). One is based on a simple modification of the cutting-edge block Gibbs sampler for sparse GGMs, which results in significant computational gains in high dimensions. The other method is based on a specific construction of the Hamiltonian Monte Carlo sampler, which results in further significant improvements. We compare our fully Bayesian approaches with the popular regularisation-based graphical LASSO, and demonstrate significant advantages of the Bayesian treatment under the same computing costs. We apply the methods to a broad range of simulated data sets, and a real-life financial data set

arXiv.org e-Print Archive

Crossref

Kent Academic Repository

Kernel multi-task learning using task-specific features

Author: Agakov Felix V.
Bonilla Edwin V.
Williams Christopher K.I.
Publication venue
Publication date: 01/01/2007
Field of study

CiteSeerX

Edinburgh Research Explorer

Automated pathway and reaction prediction facilitates in silico identification of unknown metabolites in human cohort studies

Author: Agakov
Ahluwalia
Allen
Anne M. Evans
Baldassarre
Beger
Borodulin
Boudonck
Breitling
Carlsson
Chen
Creek
Evans
Felix Agakov
Frainay
Fuhrer
Gabi Kastenmüller
Grapov
Helen C. Looker
Helen M. Colhoun
Höllering
Jan D. Quell
Jan Krumsiek
Kanehisa
Kim
Krumsiek
Krumsiek
Lacroix
Leif C. Groop
Marco Colombo
Orchard
Paul McKeigue
Pearson
R Core Team
Robert Mohney
Romero
Ruttkies
Schaefer
Shin
Sumner
Thiele
Ulf de Faire
van Buuren
van der Hooft
van Helden
Veikko Salomaa
Werner
Werner Römisch-Margl
Yin
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Identification of metabolites in non-targeted metabolomics continues to be a bottleneck in metabolomics studies in large human cohorts. Unidentified metabolites frequently emerge in the results of association studies linking metabolite levels to, for example, clinical phenotypes. For further analyses these unknown metabolites must be identified. Current approaches utilize chemical information, such as spectral details and fragmentation characteristics to determine components of unknown metabolites. Here, we propose a systems biology model exploiting the internal correlation structure of metabolite levels in combination with existing biochemical and genetic information to characterize properties of unknown molecules. Levels of 758 metabolites (439 known, 319 unknown) in human blood samples of 2279 subjects were measured using a non-targeted metabolomics platform (LC-MS and GC-MS). We reconstructed the structure of biochemical pathways that are imprinted in these metabolomics data by building an empirical network model based on 1040 significant partial correlations between metabolites. We further added associations of these metabolites to 134 genes from genome-wide association studies as well as reactions and functional relations to genes from the public database Recon 2 to the network model. From the local neighborhood in the network, we were able to predict the pathway annotation of 180 unknown metabolites. Furthermore, we classified 100 pairs of known and unknown and 45 pairs of unknown metabolites to 21 types of reactions based on their mass differences. As a proof of concept, we then looked further into the special case of predicted dehydrogenation reactions leading us to the selection of 39 candidate molecules for 5 unknown metabolites. Finally, we could verify 2 of those candidates by applying LC-MS analyses of commercially available candidate substances. The formerly unknown metabolites X-13891 and X-13069 were shown to be 2-dodecendioic acid and 9-tetradecenoic acid, respectively. Our data-driven approach based on measured metabolite levels and genetic associations as well as information from public resources can be used alone or together with methods utilizing spectral patterns as a complementary, automated and powerful method to characterize unknown metabolites

Crossref

Lund University Publications

Julkari

Edinburgh Research Explorer

PuSH

Model Selection Approach Suggests Causal Association between 25-Hydroxyvitamin D and Colorectal Cancer

Author: A Tenesa
Albert Tenesa
BW Zanke
C Jarzynski
CF Garland
D Nitsch
DA Lawlor
DC Thomas
E Theodoratou
EE Schadt
EP Martens
Evropi Theodoratou
F Agakov
F Agakov
F Agakov
Felix Agakov
G Celeux
G Davey Smith
Harry Campbell
I Tomlinson
IP Tomlinson
J Pearl
J Yang
JE Lee
JM Lappe
K Wu
L Zgaga
Lina Zgaga
M Bochud
M Touvier
MA Hernan
Malcolm G. Dunlop
MJ Bolland
ML Neuhouser
MW Seeger
N Lartillot
NA Sheehan
NJ Timpson
P Trostel
Paolo Peterlongo
Paul McKeigue
PM McKeigue
PM Sleiman
R Tibshirani
RM Neal
RS Houlston
RS Houlston
S Gandini
S Knox
SA Lamprecht
SA Lamprecht
SB Mohr
Susan M. Farrington
TJ Wang
TM Palmer
V Didelez
WB Grant
WB Grant
Y Chen
Z Lagunova
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 24/05/2013
Field of study

Vitamin D deficiency has been associated with increased risk of colorectal cancer (CRC), but causal relationship has not yet been confirmed. We investigate the direction of causation between vitamin D and CRC by extending the conventional approaches to allow pleiotropic relationships and by explicitly modelling unmeasured confounders.Plasma 25-hydroxyvitamin D (25-OHD), genetic variants associated with 25-OHD and CRC, and other relevant information was available for 2645 individuals (1057 CRC cases and 1588 controls) and included in the model. We investigate whether 25-OHD is likely to be causally associated with CRC, or vice versa, by selecting the best modelling hypothesis according to Bayesian predictive scores. We examine consistency for a range of prior assumptions.Model comparison showed preference for the causal association between low 25-OHD and CRC over the reverse causal hypothesis. This was confirmed for posterior mean deviances obtained for both models (11.5 natural log units in favour of the causal model), and also for deviance information criteria (DIC) computed for a range of prior distributions. Overall, models ignoring hidden confounding or pleiotropy had significantly poorer DIC scores.Results suggest causal association between 25-OHD and colorectal cancer, and support the need for randomised clinical trials for further confirmations

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

The Francis Crick Institute

Apolipoprotein CIII and N-terminal prohormone b-type natriuretic peptide as independent predictors for cardiovascular disease in type 2 diabetes

Author: Aroner
Bassam Farran
Battistoni
Charlton-Menys
Colhoun
Colhoun
Crosby
D'Agostino
D.John Betteridge
Daniels
de Lemos
Di Angelantonio
Felix Agakov
Gerstein
Gerstein
Gori
Hamm
Helen C. Looker
Helen M. Colhoun
Hillis
Hiukka
Huelsmann
Jensen
Kawakami
Kengne
Lee
Levey
Looker
Luo
M.Julia Brosnan
Marco Colombo
Morton
Naveed Sattar
NICE
Paul M. McKeigue
Paul N. Durrington
Paul Welsh
Pollin
R Core Team
Reinhard
Sattar
Saunders
Scheffer
Sehayek
Shona Livingstone
Soedamah-Muthu
Taskinen
van Capelleveen
van Dieren
Volpato
Welsh
Wong
Woodward
Wyler von Ballmoos
Xiong
Publication venue: 'Elsevier BV'
Publication date: 01/07/2018
Field of study

Background and aims: Developing sparse panels of biomarkers for cardiovascular disease in type 2 diabetes would enable risk stratification for clinical decision making and selection into clinical trials. We examined the individual and joint performance of five candidate biomarkers for incident cardiovascular disease (CVD) in type 2 diabetes that an earlier discovery study had yielded. Methods: Apolipoprotein CIII (apoCIII), N-terminal prohormone B-type natriuretic peptide (NT-proBNP), high sensitivity Troponin T (hsTnT), Interleukin-6, and Interleukin-15 were measured in baseline serum samples from the Collaborative Atorvastatin Diabetes trial (CARDS) of atorvastatin versus placebo. Among 2105 persons with type 2 diabetes and median age of 62.9 years (range 39.2–77.3), there were 144 incident CVD (acute coronary heart disease or stroke) cases during the maximum 5-year follow up. We used Cox Proportional Hazards models to identify biomarkers associated with incident CVD and the area under the receiver operating characteristic curves (AUROC) to assess overall model prediction. Results: Three of the biomarkers were singly associated with incident CVD independently of other risk factors; NT-proBNP (Hazard Ratio per standardised unit 2.02, 95% Confidence Interval [CI] 1.63, 2.50), apoCIII (1.34, 95% CI 1.12, 1.60) and hsTnT (1.40, 95% CI 1.16, 1.69). When combined in a single model, only NT-proBNP and apoCIII were independent predictors of CVD, together increasing the AUROC using Framingham risk variables from 0.661 to 0.745. Conclusions: The biomarkers NT-proBNP and apoCIII substantially increment the prediction of CVD in type 2 diabetes beyond that obtained with the variables used in the Framingham risk score

Crossref

Edinburgh Research Explorer

The University of Manchester - Institutional Repository

Enlighten

Discovery Research Portal

Serum kidney injury molecule 1 and β2-microglobulin perform as well as larger biomarker panels for prediction of rapid decline in renal function in type 2 diabetes

Author: AS Levey
AS Levey
B Yilmaz
Bassam Farran
BC Astor
C Donadio
Charles Turner
CK Yeung
Colin N. A. Palmer
David Dunger
E Critselis
Emma Ahlqvist
ER Pearson
F Fonseca-Wollheim da
Felix Agakov
HC Gerstein
HC Looker
HC Looker
Helen C. Looker
Helen M. Colhoun
HM Colhoun
HM Colhoun
J Siwy
JC Dickinson
JD Herrero-Morin
JE Lee
John Betteridge
K Wakabayashi
LA Inker
LA Inker
Leif Groop
MA Niewczas
Marco Colombo
Mary Julia Brosnan
Max Wong
MC Foster
ME Pavkov
MK Kim
ML Alter
NM Panduru
Paul Durrington
Paul M. McKeigue
R Core Team
R. Neil Dalton
S Abd El Dayem
Shona Livingstone
Sibylle Hess
TS Ahluwalia
VS Sabbisetti
WJ Austin
X Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Aims/hypothesis: As part of the Surrogate Markers for Micro- and Macrovascular Hard Endpoints for Innovative Diabetes Tools (SUMMIT) programme we previously reported that large panels of biomarkers derived from three analytical platforms maximised prediction of progression of renal decline in type 2 diabetes. Here, we hypothesised that smaller (n ≤ 5), platform-specific combinations of biomarkers selected from these larger panels might achieve similar prediction performance when tested in three additional type 2 diabetes cohorts. Methods: We used 657 serum samples, held under differing storage conditions, from the Scania Diabetes Registry (SDR) and Genetics of Diabetes Audit and Research Tayside (GoDARTS), and a further 183 nested case–control sample set from the Collaborative Atorvastatin in Diabetes Study (CARDS). We analysed 42 biomarkers measured on the SDR and GoDARTS samples by a variety of methods including standard ELISA, multiplexed ELISA (Luminex) and mass spectrometry. The subset of 21 Luminex biomarkers was also measured on the CARDS samples. We used the event definition of loss of >20% of baseline eGFR during follow-up from a baseline eGFR of 30–75 ml min−1 [1.73 m]−2. A total of 403 individuals experienced an event during a median follow-up of 7 years. We used discrete-time logistic regression models with tenfold cross-validation to assess association of biomarker panels with loss of kidney function. Results: Twelve biomarkers showed significant association with eGFR decline adjusted for covariates in one or more of the sample sets when evaluated singly. Kidney injury molecule 1 (KIM-1) and β2-microglobulin (B2M) showed the most consistent effects, with standardised odds ratios for progression of at least 1.4 (p < 0.0003) in all cohorts. A combination of B2M and KIM-1 added to clinical covariates, including baseline eGFR and albuminuria, modestly improved prediction, increasing the area under the curve in the SDR, Go-DARTS and CARDS by 0.079, 0.073 and 0.239, respectively. Neither the inclusion of additional Luminex biomarkers on top of B2M and KIM-1 nor a sparse mass spectrometry panel, nor the larger multiplatform panels previously identified, consistently improved prediction further across all validation sets. Conclusions/interpretation: Serum KIM-1 and B2M independently improve prediction of renal decline from an eGFR of 30–75 ml min−1 [1.73 m]−2 in type 2 diabetes beyond clinical factors and prior eGFR and are robust to varying sample storage conditions. Larger panels of biomarkers did not improve prediction beyond these two biomarkers

Lund University Publications

Crossref

Edinburgh Research Explorer

The University of Manchester - Institutional Repository

Discovery Research Portal

Swepub