590 research outputs found
Encoding of low-quality DNA profiles as genotype probability matrices for improved profile comparisons, relatedness evaluation and database searches
Many DNA profiles recovered from crime scene samples are of a quality that
does not allow them to be searched against, nor entered into, databases. We
propose a method for the comparison of profiles arising from two DNA samples,
one or both of which can have multiple donors and be affected by low DNA
template or degraded DNA. We compute likelihood ratios to evaluate the
hypothesis that the two samples have a common DNA donor, and hypotheses
specifying the relatedness of two donors. Our method uses a probability
distribution for the genotype of the donor of interest in each sample. This
distribution can be obtained from a statistical model, or we can exploit the
ability of trained human experts to assess genotype probabilities, thus
extracting much information that would be discarded by standard interpretation
rules. Our method is compatible with established methods in simple settings,
but is more widely applicable and can make better use of information than many
current methods for the analysis of mixed-source, low-template DNA profiles. It
can accommodate uncertainty arising from relatedness instead of or in addition
to uncertainty arising from noisy genotyping. We describe a computer program
GPMDNA, available under an open source license, to calculate LRs using the
method presented in this paper.Comment: 28 pages. Accepted for publication 2-Sep-2016 - Forensic Science
International: Genetic
Bayesian models for syndrome- and gene-specific probabilities of novel variant pathogenicity
BACKGROUND: With the advent of affordable and comprehensive sequencing technologies, access to molecular genetics for clinical diagnostics and research applications is increasing. However, variant interpretation remains challenging, and tools that close the gap between data generation and data interpretation are urgently required. Here we present a transferable approach to help address the limitations in variant annotation. METHODS: We develop a network of Bayesian logistic regression models that integrate multiple lines of evidence to evaluate the probability that a rare variant is the cause of an individual's disease. We present models for genes causing inherited cardiac conditions, though the framework is transferable to other genes and syndromes. RESULTS: Our models report a probability of pathogenicity, rather than a categorisation into pathogenic or benign, which captures the inherent uncertainty of the prediction. We find that gene- and syndrome-specific models outperform genome-wide approaches, and that the integration of multiple lines of evidence performs better than individual predictors. The models are adaptable to incorporate new lines of evidence, and results can be combined with familial segregation data in a transparent and quantitative manner to further enhance predictions. Though the probability scale is continuous, and innately interpretable, performance summaries based on thresholds are useful for comparisons. Using a threshold probability of pathogenicity of 0.9, we obtain a positive predictive value of 0.999 and sensitivity of 0.76 for the classification of variants known to cause long QT syndrome over the three most important genes, which represents sufficient accuracy to inform clinical decision-making. A web tool APPRAISE [http://www.cardiodb.org/APPRAISE] provides access to these models and predictions. CONCLUSIONS: Our Bayesian framework provides a transparent, flexible and robust framework for the analysis and interpretation of rare genetic variants. Models tailored to specific genes outperform genome-wide approaches, and can be sufficiently accurate to inform clinical decision-making
A Genome-Wide Association Study of Neuroticism in a Population-Based Sample
Neuroticism is a moderately heritable personality trait considered to be a risk factor for developing major depression, anxiety disorders and dementia. We performed a genome-wide association study in 2,235 participants drawn from a population-based study of neuroticism, making this the largest association study for neuroticism to date. Neuroticism was measured by the Eysenck Personality Questionnaire. After Quality Control, we analysed 430,000 autosomal SNPs together with an additional 1.2 million SNPs imputed with high quality from the Hap Map CEU samples. We found a very small effect of population stratification, corrected using one principal component, and some cryptic kinship that required no correction. NKAIN2 showed suggestive evidence of association with neuroticism as a main effect (p<10(-6)) and GPC6 showed suggestive evidence for interaction with age (p approximate to 10(-7)). We found support for one previously-reported association (PDE4D), but failed to replicate other recent reports. These results suggest common SNP variation does not strongly influence neuroticism. Our study was powered to detect almost all SNPs explaining at least 2% of heritability, and so our results effectively exclude the existence of loci having a major effect on neuroticism
Diffusional Relaxation in Random Sequential Deposition
The effect of diffusional relaxation on the random sequential deposition
process is studied in the limit of fast deposition. Expression for the coverage
as a function of time are analytically derived for both the short-time and
long-time regimes. These results are tested and compared with numerical
simulations.Comment: 9 pages + 2 figure
Model of Cluster Growth and Phase Separation: Exact Results in One Dimension
We present exact results for a lattice model of cluster growth in 1D. The
growth mechanism involves interface hopping and pairwise annihilation
supplemented by spontaneous creation of the stable-phase, +1, regions by
overturning the unstable-phase, -1, spins with probability p. For cluster
coarsening at phase coexistence, p=0, the conventional structure-factor scaling
applies. In this limit our model falls in the class of diffusion-limited
reactions A+A->inert. The +1 cluster size grows diffusively, ~t**(1/2), and the
two-point correlation function obeys scaling. However, for p>0, i.e., for the
dynamics of formation of stable phase from unstable phase, we find that
structure-factor scaling breaks down; the length scale associated with the size
of the growing +1 clusters reflects only the short-distance properties of the
two-point correlations.Comment: 12 page
Accurate Liability Estimation Improves Power in Ascertained Case Control Studies
Linear mixed models (LMMs) have emerged as the method of choice for
confounded genome-wide association studies. However, the performance of LMMs in
non-randomly ascertained case-control studies deteriorates with increasing
sample size. We propose a framework called LEAP (Liability Estimator As a
Phenotype, https://github.com/omerwe/LEAP) that tests for association with
estimated latent values corresponding to severity of phenotype, and demonstrate
that this can lead to a substantial power increase
Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies
published_or_final_versio
Superselectors: Efficient Constructions and Applications
We introduce a new combinatorial structure: the superselector. We show that
superselectors subsume several important combinatorial structures used in the
past few years to solve problems in group testing, compressed sensing,
multi-channel conflict resolution and data security. We prove close upper and
lower bounds on the size of superselectors and we provide efficient algorithms
for their constructions. Albeit our bounds are very general, when they are
instantiated on the combinatorial structures that are particular cases of
superselectors (e.g., (p,k,n)-selectors, (d,\ell)-list-disjunct matrices,
MUT_k(r)-families, FUT(k, a)-families, etc.) they match the best known bounds
in terms of size of the structures (the relevant parameter in the
applications). For appropriate values of parameters, our results also provide
the first efficient deterministic algorithms for the construction of such
structures
Anisotropic Diffusion-Limited Reactions with Coagulation and Annihilation
One-dimensional reaction-diffusion models A+A -> 0, A+A -> A, and $A+B -> 0,
where in the latter case like particles coagulate on encounters and move as
clusters, are solved exactly with anisotropic hopping rates and assuming
synchronous dynamics. Asymptotic large-time results for particle densities are
derived and discussed in the framework of universality.Comment: 13 pages in plain Te
- …
