1,755 research outputs found
Integrative machine learning approach for multi-class SCOP protein fold classification
Classification and prediction of protein structure has been a central research theme in structural bioinformatics. Due to the imbalanced distribution of proteins over multi SCOP classification, most discriminative machine learning suffers the well-known ‘False Positives ’ problem when learning over these types of problems. We have devised eKISS, an ensemble machine learning specifically designed to increase the coverage of positive examples when learning under multiclass imbalanced data sets. We have applied eKISS to classify 25 SCOP folds and show that our learning system improved over classical learning methods
Renormings of
We investigate the best order of smoothness of . We prove in
particular that there exists a -smooth bump function on if
and only if and are both even integers and is a multiple of .Comment: 18 pages; AMS-Te
Multi-class protein fold classification using a new ensemble machine learning approach.
Protein structure classification represents an important process in understanding the associations
between sequence and structure as well as possible functional and evolutionary relationships.
Recent structural genomics initiatives and other high-throughput experiments have populated the
biological databases at a rapid pace. The amount of structural data has made traditional methods
such as manual inspection of the protein structure become impossible. Machine learning has been
widely applied to bioinformatics and has gained a lot of success in this research area. This work
proposes a novel ensemble machine learning method that improves the coverage of the classifiers
under the multi-class imbalanced sample sets by integrating knowledge induced from different base
classifiers, and we illustrate this idea in classifying multi-class SCOP protein fold data. We have
compared our approach with PART and show that our method improves the sensitivity of the
classifier in protein fold classification. Furthermore, we have extended this method to learning over
multiple data types, preserving the independence of their corresponding data sources, and show
that our new approach performs at least as well as the traditional technique over a single joined
data source. These experimental results are encouraging, and can be applied to other bioinformatics
problems similarly characterised by multi-class imbalanced data sets held in multiple data
sources
Non-meanfield deterministic limits in chemical reaction kinetics far from equilibrium
A general mechanism is proposed by which small intrinsic fluctuations in a
system far from equilibrium can result in nearly deterministic dynamical
behaviors which are markedly distinct from those realized in the meanfield
limit. The mechanism is demonstrated for the kinetic Monte-Carlo version of the
Schnakenberg reaction where we identified a scaling limit in which the global
deterministic bifurcation picture is fundamentally altered by fluctuations.
Numerical simulations of the model are found to be in quantitative agreement
with theoretical predictions.Comment: 4 pages, 4 figures (submitted to Phys. Rev. Lett.
Improving Prolog Programs: Refactoring for Prolog
Refactoring is an established technique from the OO-community to restructure
code: it aims at improving software readability, maintainability and
extensibility. Although refactoring is not tied to the OO-paradigm in
particular, its ideas have not been applied to Logic Programming until now.
This paper applies the ideas of refactoring to Prolog programs. A catalogue
is presented listing refactorings classified according to scope. Some of the
refactorings have been adapted from the OO-paradigm, while others have been
specifically designed for Prolog. Also the discrepancy between intended and
operational semantics in Prolog is addressed by some of the refactorings.
In addition, ViPReSS, a semi-automatic refactoring browser, is discussed and
the experience with applying \vipress to a large Prolog legacy system is
reported. Our main conclusion is that refactoring is not only a viable
technique in Prolog but also a rather desirable one.Comment: To appear in ICLP 200
Freezing-induced self-assembly of amphiphilic molecules
The self-assembly of amphiphilic molecules usually takes place in a liquid
phase, near room temperature. Here, using small angle X-ray scattering (SAXS)
experiments performed in real time, we show that freezing of aqueous solutions
of copolymer amphiphilic molecules can induce self-assembly below 0{\deg}C.Comment: 10 pages, 3 figure
Adaptive mesh refinement with spectral accuracy for magnetohydrodynamics in two space dimensions
We examine the effect of accuracy of high-order spectral element methods,
with or without adaptive mesh refinement (AMR), in the context of a classical
configuration of magnetic reconnection in two space dimensions, the so-called
Orszag-Tang vortex made up of a magnetic X-point centered on a stagnation point
of the velocity. A recently developed spectral-element adaptive refinement
incompressible magnetohydrodynamic (MHD) code is applied to simulate this
problem. The MHD solver is explicit, and uses the Elsasser formulation on
high-order elements. It automatically takes advantage of the adaptive grid
mechanics that have been described elsewhere in the fluid context [Rosenberg,
Fournier, Fischer, Pouquet, J. Comp. Phys. 215, 59-80 (2006)]; the code allows
both statically refined and dynamically refined grids. Tests of the algorithm
using analytic solutions are described, and comparisons of the Orszag-Tang
solutions with pseudo-spectral computations are performed. We demonstrate for
moderate Reynolds numbers that the algorithms using both static and refined
grids reproduce the pseudo--spectral solutions quite well. We show that
low-order truncation--even with a comparable number of global degrees of
freedom--fails to correctly model some strong (sup--norm) quantities in this
problem, even though it satisfies adequately the weak (integrated) balance
diagnostics.Comment: 19 pages, 10 figures, 1 table. Submitted to New Journal of Physic
A multidomain spectral method for solving elliptic equations
We present a new solver for coupled nonlinear elliptic partial differential
equations (PDEs). The solver is based on pseudo-spectral collocation with
domain decomposition and can handle one- to three-dimensional problems. It has
three distinct features. First, the combined problem of solving the PDE,
satisfying the boundary conditions, and matching between different subdomains
is cast into one set of equations readily accessible to standard linear and
nonlinear solvers. Second, touching as well as overlapping subdomains are
supported; both rectangular blocks with Chebyshev basis functions as well as
spherical shells with an expansion in spherical harmonics are implemented.
Third, the code is very flexible: The domain decomposition as well as the
distribution of collocation points in each domain can be chosen at run time,
and the solver is easily adaptable to new PDEs. The code has been used to solve
the equations of the initial value problem of general relativity and should be
useful in many other problems. We compare the new method to finite difference
codes and find it superior in both runtime and accuracy, at least for the
smooth problems considered here.Comment: 31 pages, 8 figure
Structural pathway of regulated substrate transfer and threading through an Hsp100 disaggregase
Refolding aggregated proteins is essential in combating cellular proteotoxic stress. Together with Hsp70, Hsp100 chaperones, including Escherichia coli ClpB, form a powerful disaggregation machine that threads aggregated polypeptides through the central pore of tandem adenosine triphosphatase (ATPase) rings. To visualize protein disaggregation, we determined cryo–electron microscopy structures of inactive and substrate-bound ClpB in the presence of adenosine 5′-O-(3-thiotriphosphate), revealing closed AAA+ rings with a pronounced seam. In the substrate-free state, a marked gradient of resolution, likely corresponding to mobility, spans across the AAA+ rings with a dynamic hotspot at the seam. On the seam side, the coiled-coil regulatory domains are locked in a horizontal, inactive orientation. On the opposite side, the regulatory domains are accessible for Hsp70 binding, substrate targeting, and activation. In the presence of the model substrate casein, the polypeptide threads through the entire pore channel and increased nucleotide occupancy correlates with higher ATPase activity. Substrate-induced domain displacements indicate a pathway of regulated substrate transfer from Hsp70 to the ClpB pore, inside which a spiral of loops contacts the substrate. The seam pore loops undergo marked displacements, along with ordering of the regulatory domains. These asymmetric movements suggest a mechanism for ATPase activation and substrate threading during disaggregation
Compositionality, stochasticity and cooperativity in dynamic models of gene regulation
We present an approach for constructing dynamic models for the simulation of
gene regulatory networks from simple computational elements. Each element is
called a ``gene gate'' and defines an input/output-relationship corresponding
to the binding and production of transcription factors. The proposed reaction
kinetics of the gene gates can be mapped onto stochastic processes and the
standard ode-description. While the ode-approach requires fixing the system's
topology before its correct implementation, expressing them in stochastic
pi-calculus leads to a fully compositional scheme: network elements become
autonomous and only the input/output relationships fix their wiring. The
modularity of our approach allows to pass easily from a basic first-level
description to refined models which capture more details of the biological
system. As an illustrative application we present the stochastic repressilator,
an artificial cellular clock, which oscillates readily without any cooperative
effects.Comment: 15 pages, 8 figures. Accepted by the HFSP journal (13/09/07
- …
