5,245 research outputs found

    Adaptive Seeding for Gaussian Mixture Models

    Full text link
    We present new initialization methods for the expectation-maximization algorithm for multivariate Gaussian mixture models. Our methods are adaptions of the well-known KK-means++ initialization and the Gonzalez algorithm. Thereby we aim to close the gap between simple random, e.g. uniform, and complex methods, that crucially depend on the right choice of hyperparameters. Our extensive experiments indicate the usefulness of our methods compared to common techniques and methods, which e.g. apply the original KK-means++ and Gonzalez directly, with respect to artificial as well as real-world data sets.Comment: This is a preprint of a paper that has been accepted for publication in the Proceedings of the 20th Pacific Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2016. The final publication is available at link.springer.com (http://link.springer.com/chapter/10.1007/978-3-319-31750-2 24

    Machine-learning of atomic-scale properties based on physical principles

    Full text link
    We briefly summarize the kernel regression approach, as used recently in materials modelling, to fitting functions, particularly potential energy surfaces, and highlight how the linear algebra framework can be used to both predict and train from linear functionals of the potential energy, such as the total energy and atomic forces. We then give a detailed account of the Smooth Overlap of Atomic Positions (SOAP) representation and kernel, showing how it arises from an abstract representation of smooth atomic densities, and how it is related to several popular density-based representations of atomic structure. We also discuss recent generalisations that allow fine control of correlations between different atomic species, prediction and fitting of tensorial properties, and also how to construct structural kernels---applicable to comparing entire molecules or periodic systems---that go beyond an additive combination of local environments

    Mechanical Systems with Symmetry, Variational Principles, and Integration Algorithms

    Get PDF
    This paper studies variational principles for mechanical systems with symmetry and their applications to integration algorithms. We recall some general features of how to reduce variational principles in the presence of a symmetry group along with general features of integration algorithms for mechanical systems. Then we describe some integration algorithms based directly on variational principles using a discretization technique of Veselov. The general idea for these variational integrators is to directly discretize Hamilton’s principle rather than the equations of motion in a way that preserves the original systems invariants, notably the symplectic form and, via a discrete version of Noether’s theorem, the momentum map. The resulting mechanical integrators are second-order accurate, implicit, symplectic-momentum algorithms. We apply these integrators to the rigid body and the double spherical pendulum to show that the techniques are competitive with existing integrators

    Retinal blood vessels extraction using probabilistic modelling

    Get PDF
    © 2014 Kaba et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.This article has been made available through the Brunel Open Access Publishing Fund.The analysis of retinal blood vessels plays an important role in detecting and treating retinal diseases. In this review, we present an automated method to segment blood vessels of fundus retinal image. The proposed method could be used to support a non-intrusive diagnosis in modern ophthalmology for early detection of retinal diseases, treatment evaluation or clinical study. This study combines the bias correction and an adaptive histogram equalisation to enhance the appearance of the blood vessels. Then the blood vessels are extracted using probabilistic modelling that is optimised by the expectation maximisation algorithm. The method is evaluated on fundus retinal images of STARE and DRIVE datasets. The experimental results are compared with some recently published methods of retinal blood vessels segmentation. The experimental results show that our method achieved the best overall performance and it is comparable to the performance of human experts.The Department of Information Systems, Computing and Mathematics, Brunel University

    Planet Populations as a Function of Stellar Properties

    Full text link
    Exoplanets around different types of stars provide a window into the diverse environments in which planets form. This chapter describes the observed relations between exoplanet populations and stellar properties and how they connect to planet formation in protoplanetary disks. Giant planets occur more frequently around more metal-rich and more massive stars. These findings support the core accretion theory of planet formation, in which the cores of giant planets form more rapidly in more metal-rich and more massive protoplanetary disks. Smaller planets, those with sizes roughly between Earth and Neptune, exhibit different scaling relations with stellar properties. These planets are found around stars with a wide range of metallicities and occur more frequently around lower mass stars. This indicates that planet formation takes place in a wide range of environments, yet it is not clear why planets form more efficiently around low mass stars. Going forward, exoplanet surveys targeting M dwarfs will characterize the exoplanet population around the lowest mass stars. In combination with ongoing stellar characterization, this will help us understand the formation of planets in a large range of environments.Comment: Accepted for Publication in the Handbook of Exoplanet

    MultiMetEval: comparative and multi-objective analysis of genome-scale metabolic models

    Get PDF
    Comparative metabolic modelling is emerging as a novel field, supported by the development of reliable and standardized approaches for constructing genome-scale metabolic models in high throughput. New software solutions are needed to allow efficient comparative analysis of multiple models in the context of multiple cellular objectives. Here, we present the user-friendly software framework Multi-Metabolic Evaluator (MultiMetEval), built upon SurreyFBA, which allows the user to compose collections of metabolic models that together can be subjected to flux balance analysis. Additionally, MultiMetEval implements functionalities for multi-objective analysis by calculating the Pareto front between two cellular objectives. Using a previously generated dataset of 38 actinobacterial genome-scale metabolic models, we show how these approaches can lead to exciting novel insights. Firstly, after incorporating several pathways for the biosynthesis of natural products into each of these models, comparative flux balance analysis predicted that species like Streptomyces that harbour the highest diversity of secondary metabolite biosynthetic gene clusters in their genomes do not necessarily have the metabolic network topology most suitable for compound overproduction. Secondly, multi-objective analysis of biomass production and natural product biosynthesis in these actinobacteria shows that the well-studied occurrence of discrete metabolic switches during the change of cellular objectives is inherent to their metabolic network architecture. Comparative and multi-objective modelling can lead to insights that could not be obtained by normal flux balance analyses. MultiMetEval provides a powerful platform that makes these analyses straightforward for biologists. Sources and binaries of MultiMetEval are freely available from https://github.com/PiotrZakrzewski/MetEv​al/downloads

    Microalgae production in fresh market wastewater and its utilization as a protein substitute in formulated fish feed for oreochromis spp.

    Get PDF
    Rapid growing of human population has led to increasing demand of aquaculture production. Oreochromis niloticus or known as tilapia is one of the most globally cultured freshwater fish due to its great adaptation towards extreme environment. Besides, farming of tilapia not only involves small scales farming for local consumption but also larger scales for international market which contributes to a foreign currency earning. Extensive use of fishmeal as feed for fish and for other animals indirectly caused an increasing depletion of the natural resource and may consequently cause economic and environmental unstable. Microalgae biomass seems to be a promising feedstock in aquaculture industry. It can be used for many purposes such as live food for fish larvae and dried microalgae to substitute protein material in fish feed. The microalgae replacement in fish feed formulation as protein alternative seem potentially beneficial for long term aqua-business sustainability. The present chapter discussed the potential of microalgae as an alternative nutrition in fish feed formulations, specifically Tilapia

    TRY plant trait database - enhanced coverage and open access

    Get PDF
    Plant traits-the morphological, anatomical, physiological, biochemical and phenological characteristics of plants-determine how plants respond to environmental factors, affect other trophic levels, and influence ecosystem properties and their benefits and detriments to people. Plant trait data thus represent the basis for a vast area of research spanning from evolutionary biology, community and functional ecology, to biodiversity conservation, ecosystem and landscape management, restoration, biogeography and earth system modelling. Since its foundation in 2007, the TRY database of plant traits has grown continuously. It now provides unprecedented data coverage under an open access data policy and is the main plant trait database used by the research community worldwide. Increasingly, the TRY database also supports new frontiers of trait-based plant research, including the identification of data gaps and the subsequent mobilization or measurement of new data. To support this development, in this article we evaluate the extent of the trait data compiled in TRY and analyse emerging patterns of data coverage and representativeness. Best species coverage is achieved for categorical traits-almost complete coverage for 'plant growth form'. However, most traits relevant for ecology and vegetation modelling are characterized by continuous intraspecific variation and trait-environmental relationships. These traits have to be measured on individual plants in their respective environment. Despite unprecedented data coverage, we observe a humbling lack of completeness and representativeness of these continuous traits in many aspects. We, therefore, conclude that reducing data gaps and biases in the TRY database remains a key challenge and requires a coordinated approach to data mobilization and trait measurements. This can only be achieved in collaboration with other initiatives

    Retargeted adenoviruses for radiation-guided gene delivery

    Get PDF
    The combination of radiation with radiosensitizing gene delivery or oncolytic viruses promises to provide an advantage that could improve the therapeutic results for glioblastoma. X-rays can induce significant molecular changes in cancer cells. We isolated the GIRLRG peptide that binds to radiation-inducible 78 kDa glucose-regulated protein (GRP78), which is overexpressed on the plasma membranes of irradiated cancer cells and tumor-associated microvascular endothelial cells. The goal of our study was to improve tumor-specific adenovirus-mediated gene delivery by selectively targeting the adenovirus binding to this radiation-inducible protein. We employed an adenoviral fiber replacement approach to conduct a study of the targeting utility of GRP78-binding peptide. We have developed fiber-modified adenoviruses encoding the GRP78-binding peptide inserted into the fiber-fibritin. We have evaluated the reporter gene expression of fiber-modified adenoviruses in vitro using a panel of glioma cells and a human D54MG tumor xenograft model. The obtained results demonstrated that employment of the GRP78-binding peptide resulted in increased gene expression in irradiated tumors following infection with fiber-modified adenoviruses, compared with untreated tumor cells. These studies demonstrate the feasibility of adenoviral retargeting using the GRP78-binding peptide that selectively recognizes tumor cells responding to radiation treatment

    The Hubbard model within the equations of motion approach

    Full text link
    The Hubbard model has a special role in Condensed Matter Theory as it is considered as the simplest Hamiltonian model one can write in order to describe anomalous physical properties of some class of real materials. Unfortunately, this model is not exactly solved except for some limits and therefore one should resort to analytical methods, like the Equations of Motion Approach, or to numerical techniques in order to attain a description of its relevant features in the whole range of physical parameters (interaction, filling and temperature). In this manuscript, the Composite Operator Method, which exploits the above mentioned analytical technique, is presented and systematically applied in order to get information about the behavior of all relevant properties of the model (local, thermodynamic, single- and two- particle ones) in comparison with many other analytical techniques, the above cited known limits and numerical simulations. Within this approach, the Hubbard model is shown to be also capable to describe some anomalous behaviors of the cuprate superconductors.Comment: 232 pages, more than 300 figures, more than 500 reference
    corecore