590 research outputs found

Encoding of low-quality DNA profiles as genotype probability matrices for improved profile comparisons, relatedness evaluation and database searches

Author: Balding David J.
Ryan K.
Williams D. Gareth
Publication venue: 'Elsevier BV'
Publication date: 14/09/2016

Many DNA profiles recovered from crime scene samples are of a quality that does not allow them to be searched against, nor entered into, databases. We propose a method for the comparison of profiles arising from two DNA samples, one or both of which can have multiple donors and be affected by low DNA template or degraded DNA. We compute likelihood ratios to evaluate the hypothesis that the two samples have a common DNA donor, and hypotheses specifying the relatedness of two donors. Our method uses a probability distribution for the genotype of the donor of interest in each sample. This distribution can be obtained from a statistical model, or we can exploit the ability of trained human experts to assess genotype probabilities, thus extracting much information that would be discarded by standard interpretation rules. Our method is compatible with established methods in simple settings, but is more widely applicable and can make better use of information than many current methods for the analysis of mixed-source, low-template DNA profiles. It can accommodate uncertainty arising from relatedness instead of or in addition to uncertainty arising from noisy genotyping. We describe a computer program GPMDNA, available under an open source license, to calculate LRs using the method presented in this paper. Comment: 28 pages. Accepted for publication 2-Sep-2016 in Forensic Science International: Genetics.
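As a rough illustration of the kind of calculation the abstract describes, the sketch below computes a single-locus likelihood ratio for "common donor" versus "unrelated donors" from two genotype probability distributions. It is not taken from GPMDNA. It assumes each distribution is a posterior obtained under the same population genotype frequencies, in which case Bayes' theorem gives LR = sum over genotypes g of p1(g) p2(g) / f(g). The function name, the two-allele locus and all the numbers are hypothetical.

```python
def common_donor_lr(post1, post2, prior):
    """Single-locus LR for 'same donor' versus 'unrelated donors'.

    post1, post2: genotype -> posterior probability for the donor of
                  interest in each sample (assumed computed under `prior`).
    prior:        genotype -> population genotype frequency.
    """
    lr = 0.0
    for genotype, freq in prior.items():
        if freq > 0.0:
            lr += post1.get(genotype, 0.0) * post2.get(genotype, 0.0) / freq
    return lr


# Hypothetical locus with allele frequencies 0.1 (A) and 0.9 (B);
# genotype frequencies follow Hardy-Weinberg proportions.
prior = {"AA": 0.01, "AB": 0.18, "BB": 0.81}
certain_ab = {"AB": 1.0}                        # clean single-source profile
uncertain = {"AA": 0.2, "AB": 0.7, "BB": 0.1}   # noisy low-template profile

lr_clear = common_donor_lr(certain_ab, certain_ab, prior)    # 1 / 0.18
lr_noisy = common_donor_lr(uncertain, certain_ab, prior)     # 0.7 / 0.18
lr_uninformative = common_donor_lr(prior, uncertain, prior)  # close to 1
```

When both profiles are unambiguous the expression reduces to the familiar 1 / (genotype frequency). A profile that carries no information (posterior equal to the prior) gives an LR of 1. Under the further assumption of independent loci, per-locus LRs would be multiplied.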

arXiv.org e-Print Archive

Crossref

UCL Discovery

Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity

Author: Balding DJ
Cook SA
Ruklisa D
Walsh R
Ware JS
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015

BACKGROUND: With the advent of affordable and comprehensive sequencing technologies, access to molecular genetics for clinical diagnostics and research applications is increasing. However, variant interpretation remains challenging, and tools that close the gap between data generation and data interpretation are urgently required. Here we present a transferable approach to help address the limitations in variant annotation. METHODS: We develop a network of Bayesian logistic regression models that integrate multiple lines of evidence to evaluate the probability that a rare variant is the cause of an individual's disease. We present models for genes causing inherited cardiac conditions, though the framework is transferable to other genes and syndromes. RESULTS: Our models report a probability of pathogenicity, rather than a categorisation into pathogenic or benign, which captures the inherent uncertainty of the prediction. We find that gene- and syndrome-specific models outperform genome-wide approaches, and that the integration of multiple lines of evidence performs better than individual predictors. The models are adaptable to incorporate new lines of evidence, and results can be combined with familial segregation data in a transparent and quantitative manner to further enhance predictions. Though the probability scale is continuous, and innately interpretable, performance summaries based on thresholds are useful for comparisons. Using a threshold probability of pathogenicity of 0.9, we obtain a positive predictive value of 0.999 and sensitivity of 0.76 for the classification of variants known to cause long QT syndrome over the three most important genes, which represents sufficient accuracy to inform clinical decision-making. A web tool APPRAISE [http://www.cardiodb.org/APPRAISE] provides access to these models and predictions. 
CONCLUSIONS: Our Bayesian approach provides a transparent, flexible and robust framework for the analysis and interpretation of rare genetic variants. Models tailored to specific genes outperform genome-wide approaches, and can be sufficiently accurate to inform clinical decision-making.
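Two quantities mentioned in this abstract can be made concrete with a short sketch: updating a probability of pathogenicity with an independent likelihood ratio (for example from segregation data), and summarising performance at a probability threshold. This is generic Bayesian arithmetic, not the APPRAISE implementation. The function names and the toy data are hypothetical.

```python
def update_probability(prob, likelihood_ratio):
    """Combine a probability with an independent likelihood ratio via odds."""
    if not 0.0 < prob < 1.0:
        raise ValueError("prob must lie strictly between 0 and 1")
    posterior_odds = prob / (1.0 - prob) * likelihood_ratio
    return posterior_odds / (1.0 + posterior_odds)


def threshold_summary(probs, is_pathogenic, threshold=0.9):
    """Positive predictive value and sensitivity when calling prob >= threshold."""
    tp = fp = fn = 0
    for prob, truth in zip(probs, is_pathogenic):
        called = prob >= threshold
        if called and truth:
            tp += 1
        elif called and not truth:
            fp += 1
        elif truth:
            fn += 1
    ppv = tp / (tp + fp) if tp + fp else float("nan")
    sensitivity = tp / (tp + fn) if tp + fn else float("nan")
    return ppv, sensitivity


# Hypothetical variants: model probabilities and known status.
probs = [0.95, 0.92, 0.60, 0.97, 0.10]
is_pathogenic = [True, True, True, False, False]
ppv, sensitivity = threshold_summary(probs, is_pathogenic)

# A variant at 0.5 supported by a likelihood ratio of 9 moves to about 0.9.
updated = update_probability(0.5, 9.0)
```

Working on the odds scale keeps the combination transparent: each independent line of evidence contributes a multiplicative factor.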

Crossref

Springer - Publisher Connector

PubMed Central

Spiral - Imperial College Digital Repository

University of Melbourne Institutional Repository

A Genome-Wide Association Study of Neuroticism in a Population-Based Sample

Author: Antoniades A
Balding DJ
Calboli FCF
Galwey NW
Johnson MR
Mooser V
Muglia P
Preisig M
Tozzi F
Vollenweider P
Waeber G
Waterworth D
Publication venue: 'Public Library of Science'
Publication date: 01/01/2010

Neuroticism is a moderately heritable personality trait considered to be a risk factor for developing major depression, anxiety disorders and dementia. We performed a genome-wide association study in 2,235 participants drawn from a population-based study of neuroticism, making this the largest association study for neuroticism to date. Neuroticism was measured by the Eysenck Personality Questionnaire. After quality control, we analysed 430,000 autosomal SNPs together with an additional 1.2 million SNPs imputed with high quality from the HapMap CEU samples. We found a very small effect of population stratification, corrected using one principal component, and some cryptic kinship that required no correction. NKAIN2 showed suggestive evidence of association with neuroticism as a main effect (p < 10^-6) and GPC6 showed suggestive evidence for interaction with age (p ≈ 10^-7). We found support for one previously reported association (PDE4D), but failed to replicate other recent reports. These results suggest common SNP variation does not strongly influence neuroticism. Our study was powered to detect almost all SNPs explaining at least 2% of heritability, and so our results effectively exclude the existence of loci having a major effect on neuroticism.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

UNIL IRIS | Institutional Research Information System

PubMed Central

UCL Discovery

Spiral - Imperial College Digital Repository

University of Melbourne Institutional Repository

The Francis Crick Institute

Diffusional Relaxation in Random Sequential Deposition

Author: Abramowitz M
Asher Baram
Balding D
Bartelt M C
Eli Eisenberg
Kurrat R
Privman V
Ramsden J J
Publication venue: 'IOP Publishing'
Publication date: 25/10/1996

The effect of diffusional relaxation on the random sequential deposition process is studied in the limit of fast deposition. Expressions for the coverage as a function of time are derived analytically for both the short-time and long-time regimes. These results are tested and compared with numerical simulations. Comment: 9 pages + 2 figures

arXiv.org e-Print Archive

Crossref

Model of Cluster Growth and Phase Separation: Exact Results in One Dimension

Author: A. A. Lushnikov
A. J. Bray
B. Hede
D. C. Torney
D. J. Balding
J. D. Gunton
J. G. Amar
J. T. Cox
M. Bramson
M. Scheucher
P. Clifford
P. Meakin
R. Holley
V. Kuzovkov
Vladimir Privman
Z. Racz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/07/1992

We present exact results for a lattice model of cluster growth in 1D. The growth mechanism involves interface hopping and pairwise annihilation supplemented by spontaneous creation of the stable-phase, +1, regions by overturning the unstable-phase, -1, spins with probability p. For cluster coarsening at phase coexistence, p=0, the conventional structure-factor scaling applies. In this limit our model falls in the class of diffusion-limited reactions A+A->inert. The +1 cluster size grows diffusively, ~t**(1/2), and the two-point correlation function obeys scaling. However, for p>0, i.e., for the dynamics of formation of stable phase from unstable phase, we find that structure-factor scaling breaks down; the length scale associated with the size of the growing +1 clusters reflects only the short-distance properties of the two-point correlations. Comment: 12 pages

arXiv.org e-Print Archive

Crossref

Accurate Liability Estimation Improves Power in Ascertained Case Control Studies

Author: AL Price
AL Price
C Lippert
C Widmer
Christoph Lippert
D Golan
D Welter
Dan Geiger
David Heckerman
DJ Balding
ER Dempster
J Listgarten
J Yang
LA Hindorff
LC Tsoi
M Fakiola
N Fusi
N Patterson
N Zaitlen
Omer Weissbrod
S Sawcer
S Wright
SH Lee
X Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2015

Linear mixed models (LMMs) have emerged as the method of choice for confounded genome-wide association studies. However, the performance of LMMs in non-randomly ascertained case-control studies deteriorates with increasing sample size. We propose a framework called LEAP (Liability Estimator As a Phenotype, https://github.com/omerwe/LEAP) that tests for association with estimated latent values corresponding to severity of phenotype, and demonstrate that this can lead to a substantial power increase.

arXiv.org e-Print Archive

Crossref

MDC Repository

Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

Author: Anney RJL
Avbersek A
Balding D
Baum L
Becker F
Berkovic SF
Bradfield JP
Cherny SS
Publication venue: 'Elsevier BV'
Publication date: 01/01/2014

HKU Scholars Hub

Superselectors: Efficient Constructions and Applications

Author: A. Bonis De
A.G. D’yachkov
B.S. Chlebus
D. Eppstein
D.J. Balding
D.Z. Du
E. Porat
G. Cormode
J. Wolf
M. Mitzenmacher
N. Alon
N. Linial
P. Erdös
Piotr Indyk
R. Clifford
R. Kumar
S. Ganguly
T. Cover
T. Moran
V. Grebinsky
W.H. Kautz
Y. Cheng
Publication date: 01/01/2010

We introduce a new combinatorial structure: the superselector. We show that superselectors subsume several important combinatorial structures used in the past few years to solve problems in group testing, compressed sensing, multi-channel conflict resolution and data security. We prove close upper and lower bounds on the size of superselectors and we provide efficient algorithms for their construction. Although our bounds are very general, when they are instantiated on the combinatorial structures that are particular cases of superselectors (e.g., (p,k,n)-selectors, (d,\ell)-list-disjunct matrices, MUT_k(r)-families, FUT(k, a)-families, etc.) they match the best known bounds in terms of the size of the structures (the relevant parameter in the applications). For appropriate values of the parameters, our results also provide the first efficient deterministic algorithms for the construction of such structures.

arXiv.org e-Print Archive

Crossref

Catalogo dei prodotti della ricerca

Archivio della Ricerca - Università di Salerno

Anisotropic Diffusion-Limited Reactions with Coagulation and Annihilation

Author: A. A. Lushnikov
António M. R. Cadilhe
B. P. Lee
D. ben-Avraham
D. C. Torney
D. J. Balding
D. Toussaint
H. Takayasu
I. Ispolatov
I. M. Sokolov
J. G. Amar
J. L. Spouge
K. Kang
M. Bramson
M. Lawrence Glasser
P. Krapivsky
R. Kopelman
R. Kroon
S. A. Janowsky
S. Cornell
S. N. Majumdar
T. Liggett
V. Kuzovkov
V. Privman
Vladimir Privman
Z. Racz
Publication venue: 'American Physical Society (APS)'
Publication date: 12/03/1995

One-dimensional reaction-diffusion models A+A -> 0, A+A -> A, and A+B -> 0, where in the latter case like particles coagulate on encounters and move as clusters, are solved exactly with anisotropic hopping rates and assuming synchronous dynamics. Asymptotic large-time results for particle densities are derived and discussed in the framework of universality. Comment: 13 pages in plain TeX

arXiv.org e-Print Archive

Crossref