2,625 research outputs found

    Efficient Inference of Gaussian Process Modulated Renewal Processes with Application to Medical Event Data

    Full text link
    The episodic, irregular and asynchronous nature of medical data render them difficult substrates for standard machine learning algorithms. We would like to abstract away this difficulty for the class of time-stamped categorical variables (or events) by modeling them as a renewal process and inferring a probability density over continuous, longitudinal, nonparametric intensity functions modulating that process. Several methods exist for inferring such a density over intensity functions, but either their constraints and assumptions prevent their use with our potentially bursty event streams, or their time complexity renders their use intractable on our long-duration observations of high-resolution events, or both. In this paper we present a new and efficient method for inferring a distribution over intensity functions that uses direct numeric integration and smooth interpolation over Gaussian processes. We demonstrate that our direct method is up to twice as accurate and two orders of magnitude more efficient than the best existing method (thinning). Importantly, the direct method can infer intensity functions over the full range of bursty to memoryless to regular events, which thinning and many other methods cannot. Finally, we apply the method to clinical event data and demonstrate the face-validity of the abstraction, which is now amenable to standard learning algorithms.Comment: 8 pages, 4 figure

    Application of Wavelet Decomposition to Document Line Segmentation

    Get PDF
    ACM Computing Classification System (1998): I.7, I.7.5.In this paper an approach to document line segmentation is presented. The algorithm is based on a wavelet transform of the horizontal projective profile of the document image. The projective profile is examined as a one-dimensional discrete signal which is decomposed using the pyramidal wavelet algorithm up to a precise scale, where local minima and maxima are discovered. These local extrema, projected into the input signal, correspond to the spacing between document lines and to the pivots of the lines. The method has been tested on a broad set of printed and handwritten documents and proven to be stable and efficient

    Promoting cooperation by preventing exploitation: The role of network structure

    Full text link
    A growing body of empirical evidence indicates that social and cooperative behavior can be affected by cognitive and neurological factors, suggesting the existence of state-based decision-making mechanisms that may have emerged by evolution. Motivated by these observations, we propose a simple mechanism of anonymous network interactions identified as a form of generalized reciprocity - a concept organized around the premise "help anyone if helped by someone", and study its dynamics on random graphs. In the presence of such mechanism, the evolution of cooperation is related to the dynamics of the levels of investments (i.e. probabilities of cooperation) of the individual nodes engaging in interactions. We demonstrate that the propensity for cooperation is determined by a network centrality measure here referred to as neighborhood importance index and discuss relevant implications to natural and artificial systems. To address the robustness of the state-based strategies to an invasion of defectors, we additionally provide an analysis which redefines the results for the case when a fraction of the nodes behave as unconditional defectors.Comment: 11 pages, 5 figure

    Adaptive Document Image Binarization with Application in Processing Astronomical Logbooks

    Get PDF
    ACM Computing Classification System (1998): I.7, I.7.5.Recently, the digitalization of the astronomical scientific heritage has been considered an important task that can facilitate much researches in astronomy. The creation of digital libraries and databases of astronomical photographic plates brings up the problem of digitalization astronomical logbooks, since the data contained in them is crucial for the usage of the plates. An optical character recognition (OCR) system for the handwritten numerical data is needed in order to speed up the process of database creation and extension. In this paper document image binarization is considered since it is a critical stage for the subsequent steps in an OCR software system. A specific method is proposed which outmatches the state-of-the-art techniques in the case of the images of interest.This work has been partially supported by Grant No. DO02-275/2008, Bulgarian NSF, Ministry of Education and Science
    corecore