2,595 research outputs found

    Query Learning with Exponential Query Costs

    Full text link
    In query learning, the goal is to identify an unknown object while minimizing the number of "yes" or "no" questions (queries) posed about that object. A well-studied algorithm for query learning is known as generalized binary search (GBS). We show that GBS is a greedy algorithm to optimize the expected number of queries needed to identify the unknown object. We also generalize GBS in two ways. First, we consider the case where the cost of querying grows exponentially in the number of queries and the goal is to minimize the expected exponential cost. Then, we consider the case where the objects are partitioned into groups, and the objective is to identify only the group to which the object belongs. We derive algorithms to address these issues in a common, information-theoretic framework. In particular, we present an exact formula for the objective function in each case involving Shannon or Renyi entropy, and develop a greedy algorithm for minimizing it. Our algorithms are demonstrated on two applications of query learning, active learning and emergency response.Comment: 15 page

    Enhancing the functional content of protein interaction networks

    Full text link
    Protein interaction networks are a promising type of data for studying complex biological systems. However, despite the rich information embedded in these networks, they face important data quality challenges of noise and incompleteness that adversely affect the results obtained from their analysis. Here, we explore the use of the concept of common neighborhood similarity (CNS), which is a form of local structure in networks, to address these issues. Although several CNS measures have been proposed in the literature, an understanding of their relative efficacies for the analysis of interaction networks has been lacking. We follow the framework of graph transformation to convert the given interaction network into a transformed network corresponding to a variety of CNS measures evaluated. The effectiveness of each measure is then estimated by comparing the quality of protein function predictions obtained from its corresponding transformed network with those from the original network. Using a large set of S. cerevisiae interactions, and a set of 136 GO terms, we find that several of the transformed networks produce more accurate predictions than those obtained from the original network. In particular, the HC.contHC.cont measure proposed here performs particularly well for this task. Further investigation reveals that the two major factors contributing to this improvement are the abilities of CNS measures, especially HC.contHC.cont, to prune out noisy edges and introduce new links between functionally related proteins

    Theoretical Study of Physisorption of Nucleobases on Boron Nitride Nanotubes: A New Class of Hybrid Nano-Bio Materials

    Full text link
    We investigate the adsorption of the nucleic acid bases, adenine (A), guanine (G), cytosine (C), thymine (T) and uracil (U) on the outer wall of a high curvature semiconducting single-walled boron nitride nanotube (BNNT) by first principles density functional theory calculations. The calculated binding energy shows the order: G>A\approxC\approxT\approxU implying that the interaction strength of the (high-curvature) BNNT with the nucleobases, G being an exception, is nearly the same. A higher binding energy for the G-BNNT conjugate appears to result from a stronger hybridization of the molecular orbitals of G and BNNT, since the charge transfer involved in the physisorption process is insignificant. A smaller energy gap predicted for the G-BNNT conjugate relative to that of the pristine BNNT may be useful in application of this class of biofunctional materials to the design of the next generation sensing devices.Comment: 17 pages 6 figure
    corecore