1,846 research outputs found
Minimum Density Hyperplanes
Associating distinct groups of objects (clusters) with contiguous regions of
high probability density (high-density clusters), is central to many
statistical and machine learning approaches to the classification of unlabelled
data. We propose a novel hyperplane classifier for clustering and
semi-supervised classification which is motivated by this objective. The
proposed minimum density hyperplane minimises the integral of the empirical
probability density function along it, thereby avoiding intersection with high
density clusters. We show that the minimum density and the maximum margin
hyperplanes are asymptotically equivalent, thus linking this approach to
maximum margin clustering and semi-supervised support vector classifiers. We
propose a projection pursuit formulation of the associated optimisation problem
which allows us to find minimum density hyperplanes efficiently in practice,
and evaluate its performance on a range of benchmark datasets. The proposed
approach is found to be very competitive with state of the art methods for
clustering and semi-supervised classification
Assessing identity, redundancy and confounds in Gene Ontology annotations over time
MOTIVATION: The Gene Ontology (GO) is heavily used in systems biology, but the potential for redundancy, confounds with other data sources and problems with stability over time have been little explored. RESULTS: We report that GO annotations are stable over short periods, with 3% of genes not being most semantically similar to themselves between monthly GO editions. However, we find that genes can alter their 'functional identity' over time, with 20% of genes not matching to themselves (by semantic similarity) after 2 years. We further find that annotation bias in GO, in which some genes are more characterized than others, has declined in yeast, but generally increased in humans. Finally, we discovered that many entries in protein interaction databases are owing to the same published reports that are used for GO annotations, with 66% of assessed GO groups exhibiting this confound. We provide a case study to illustrate how this information can be used in analyses of gene sets and networks. AVAILABILITY: Data available at http://chibi.ubc.ca/assessGO. CONTACT: [email protected] SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
Progress and challenges in the computational prediction of gene function using networks: 2012-2013 update
In an opinion published in 2012, we reviewed and discussed our studies of how gene network-based guilt-by-association (GBA) is impacted by confounds related to gene multifunctionality. We found such confounds account for a significant part of the GBA signal, and as a result meaningfully evaluating and applying computationally-guided GBA is more challenging than generally appreciated. We proposed that effort currently spent on incrementally improving algorithms would be better spent in identifying the features of data that do yield novel functional insights. We also suggested that part of the problem is the reliance by computational biologists on gold standard annotations such as the Gene Ontology. In the year since, there has been continued heavy activity in GBA-based research, including work that contributes to our understanding of the issues we raised. Here we provide a review of some of the most relevant recent work, or which point to new areas of progress and challenges
Using predictive specificity to determine when gene set analysis is biologically meaningful
Gene set analysis, which translates gene lists into enriched functions, is among the most common bioinformatic methods. Yet few would advocate taking the results at face value. Not only is there no agreement on the algorithms themselves, there is no agreement on how to benchmark them. In this paper, we evaluate the robustness and uniqueness of enrichment results as a means of assessing methods even where correctness is unknown. We show that heavily annotated ('multifunctional') genes are likely to appear in genomics study results and drive the generation of biologically non-specific enrichment results as well as highly fragile significances. By providing a means of determining where enrichment analyses report non-specific and non-robust findings, we are able to assess where we can be confident in their use. We find significant progress in recent bias correction methods for enrichment and provide our own software implementation. Our approach can be readily adapted to any pre-existing package
Structured light techniques for 3D surface reconstruction in robotic tasks
Robotic tasks such as navigation and path planning can be greatly enhanced by a vision system capable of providing depth perception from fast and accurate 3D surface reconstruction. Focused on robotic welding tasks we present a comparative analysis of a novel mathematical formulation for 3D surface reconstruction and discuss image processing requirements for reliable detection of patterns in the image. Models are presented for a parallel and angled configurations of light source and image sensor. It is shown that the parallel arrangement requires 35\% fewer arithmetic operations to compute a point cloud in 3D being thus more appropriate for real-time applications. Experiments show that the technique is appropriate to scan a variety of surfaces and, in particular, the intended metallic parts for robotic welding tasks
Multi-membership gene regulation in pathway based microarray analysis
This article is available through the Brunel Open Access Publishing Fund. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.Background: Gene expression analysis has been intensively researched for more than a decade. Recently, there has been elevated interest in the integration of microarray data analysis with other types of biological knowledge in a holistic analytical approach. We propose a methodology that can be facilitated for pathway based microarray data analysis, based on the observation that a substantial proportion of genes present in biochemical pathway databases are members of a number of distinct pathways. Our methodology aims towards establishing the state of individual pathways, by identifying those truly affected by the experimental conditions based on the behaviour of such genes. For that purpose it considers all the pathways in which a gene participates and the general census of gene expression per pathway. Results: We utilise hill climbing, simulated annealing and a genetic algorithm to analyse the consistency of the produced results, through the application of fuzzy adjusted rand indexes and hamming distance. All algorithms produce highly consistent genes to pathways allocations, revealing the contribution of genes to pathway functionality, in agreement with current pathway state visualisation techniques, with the simulated annealing search proving slightly superior in terms of efficiency. Conclusions: We show that the expression values of genes, which are members of a number of biochemical pathways or modules, are the net effect of the contribution of each gene to these biochemical processes. We show that by manipulating the pathway and module contribution of such genes to follow underlying trends we can interpret microarray results centred on the behaviour of these genes.The work was sponsored by the studentship scheme of the School of Information Systems, Computing and Mathematics, Brunel Universit
A Force-Balanced Control Volume Finite Element Method for Multi-Phase Porous Media Flow Modelling
Dr D. Pavlidis would like to acknowledge the support from the following research grants: Innovate UK ‘Octopus’, EPSRC ‘Reactor Core-Structure Re-location Modelling for Severe Nuclear Accidents’) and Horizon 2020 ‘In-Vessel Melt Retention’. Funding for Dr P. Salinas from ExxonMobil is gratefully acknowledged. Dr Z. Xie is supported by EPSRC ‘Multi-Scale Exploration of Multi-phase Physics in Flows’. Part funding for Prof Jackson under the TOTAL Chairs programme at Imperial College is also acknowledged. The authors would also like to acknowledge Mr Y. Debbabi for supplying analytic solutions.Peer reviewedPublisher PD
Properties of synchronization in the systems of non-identical coupled van der Pol and van der Pol - Duffing oscillators. Broadband synchronization
The particular properties of dynamics are discussed for the dissipatively
coupled van der Pol oscillators, non-identical in values of parameters
controlling the Hopf bifurcation. Possibility of a special synchronization
regime in an infinitively long band between oscillation death and quasiperiodic
areas is shown for such system. Features of the bifurcation picture are
discussed for different values of the control parameters and for the case of
additional Duffing-type nonlinearity. Analysis of the abridged equations is
presented.Comment: 19 pages, 9 figure
Improving the convergence behaviour of a fixed-point-iteration solver for multiphase flow in porous media
A new method to admit large Courant numbers in the numerical simulation of multiphase flow is presented. The governing equations are discretized in time using an adaptive θ-method. However, the use of implicit discretizations does not guarantee convergence of the nonlinear solver for large Courant numbers. In this work, a double-fixed point iteration method with backtracking is presented, which improves both convergence and convergence rate. Moreover, acceleration techniques are presented to yield a more robust nonlinear solver with increased effective convergence rate. The new method reduces the computational effort by strengthening the coupling between saturation and velocity, obtaining an efficient backtracking parameter, using a modified version of Anderson's acceleration and adding vanishing artificial diffusion
- …
