2,595 research outputs found
Query Learning with Exponential Query Costs
In query learning, the goal is to identify an unknown object while minimizing
the number of "yes" or "no" questions (queries) posed about that object. A
well-studied algorithm for query learning is known as generalized binary search
(GBS). We show that GBS is a greedy algorithm to optimize the expected number
of queries needed to identify the unknown object. We also generalize GBS in two
ways. First, we consider the case where the cost of querying grows
exponentially in the number of queries and the goal is to minimize the expected
exponential cost. Then, we consider the case where the objects are partitioned
into groups, and the objective is to identify only the group to which the
object belongs. We derive algorithms to address these issues in a common,
information-theoretic framework. In particular, we present an exact formula for
the objective function in each case involving Shannon or Renyi entropy, and
develop a greedy algorithm for minimizing it. Our algorithms are demonstrated
on two applications of query learning, active learning and emergency response.Comment: 15 page
Enhancing the functional content of protein interaction networks
Protein interaction networks are a promising type of data for studying
complex biological systems. However, despite the rich information embedded in
these networks, they face important data quality challenges of noise and
incompleteness that adversely affect the results obtained from their analysis.
Here, we explore the use of the concept of common neighborhood similarity
(CNS), which is a form of local structure in networks, to address these issues.
Although several CNS measures have been proposed in the literature, an
understanding of their relative efficacies for the analysis of interaction
networks has been lacking. We follow the framework of graph transformation to
convert the given interaction network into a transformed network corresponding
to a variety of CNS measures evaluated. The effectiveness of each measure is
then estimated by comparing the quality of protein function predictions
obtained from its corresponding transformed network with those from the
original network. Using a large set of S. cerevisiae interactions, and a set of
136 GO terms, we find that several of the transformed networks produce more
accurate predictions than those obtained from the original network. In
particular, the measure proposed here performs particularly well for
this task. Further investigation reveals that the two major factors
contributing to this improvement are the abilities of CNS measures, especially
, to prune out noisy edges and introduce new links between
functionally related proteins
Theoretical Study of Physisorption of Nucleobases on Boron Nitride Nanotubes: A New Class of Hybrid Nano-Bio Materials
We investigate the adsorption of the nucleic acid bases, adenine (A), guanine
(G), cytosine (C), thymine (T) and uracil (U) on the outer wall of a high
curvature semiconducting single-walled boron nitride nanotube (BNNT) by first
principles density functional theory calculations. The calculated binding
energy shows the order: G>A\approxC\approxT\approxU implying that the
interaction strength of the (high-curvature) BNNT with the nucleobases, G being
an exception, is nearly the same. A higher binding energy for the G-BNNT
conjugate appears to result from a stronger hybridization of the molecular
orbitals of G and BNNT, since the charge transfer involved in the physisorption
process is insignificant. A smaller energy gap predicted for the G-BNNT
conjugate relative to that of the pristine BNNT may be useful in application of
this class of biofunctional materials to the design of the next generation
sensing devices.Comment: 17 pages 6 figure
- …
