2,625 research outputs found
Efficient Inference of Gaussian Process Modulated Renewal Processes with Application to Medical Event Data
The episodic, irregular and asynchronous nature of medical data render them
difficult substrates for standard machine learning algorithms. We would like to
abstract away this difficulty for the class of time-stamped categorical
variables (or events) by modeling them as a renewal process and inferring a
probability density over continuous, longitudinal, nonparametric intensity
functions modulating that process. Several methods exist for inferring such a
density over intensity functions, but either their constraints and assumptions
prevent their use with our potentially bursty event streams, or their time
complexity renders their use intractable on our long-duration observations of
high-resolution events, or both. In this paper we present a new and efficient
method for inferring a distribution over intensity functions that uses direct
numeric integration and smooth interpolation over Gaussian processes. We
demonstrate that our direct method is up to twice as accurate and two orders of
magnitude more efficient than the best existing method (thinning). Importantly,
the direct method can infer intensity functions over the full range of bursty
to memoryless to regular events, which thinning and many other methods cannot.
Finally, we apply the method to clinical event data and demonstrate the
face-validity of the abstraction, which is now amenable to standard learning
algorithms.Comment: 8 pages, 4 figure
Application of Wavelet Decomposition to Document Line Segmentation
ACM Computing Classification System (1998): I.7, I.7.5.In this paper an approach to document line segmentation is presented. The algorithm is based on a wavelet transform of the horizontal
projective profile of the document image. The projective profile is examined as a one-dimensional discrete signal which is decomposed using the pyramidal wavelet algorithm up to a precise scale, where local minima and maxima are discovered. These local extrema, projected into the input signal, correspond to the spacing between document lines and to the pivots of the lines. The method has been tested on a broad set of printed and handwritten documents and proven to be stable and efficient
Promoting cooperation by preventing exploitation: The role of network structure
A growing body of empirical evidence indicates that social and cooperative
behavior can be affected by cognitive and neurological factors, suggesting the
existence of state-based decision-making mechanisms that may have emerged by
evolution. Motivated by these observations, we propose a simple mechanism of
anonymous network interactions identified as a form of generalized reciprocity
- a concept organized around the premise "help anyone if helped by someone",
and study its dynamics on random graphs. In the presence of such mechanism, the
evolution of cooperation is related to the dynamics of the levels of
investments (i.e. probabilities of cooperation) of the individual nodes
engaging in interactions. We demonstrate that the propensity for cooperation is
determined by a network centrality measure here referred to as neighborhood
importance index and discuss relevant implications to natural and artificial
systems. To address the robustness of the state-based strategies to an invasion
of defectors, we additionally provide an analysis which redefines the results
for the case when a fraction of the nodes behave as unconditional defectors.Comment: 11 pages, 5 figure
Adaptive Document Image Binarization with Application in Processing Astronomical Logbooks
ACM Computing Classification System (1998): I.7, I.7.5.Recently, the digitalization of the astronomical scientific heritage has been considered an important task that can facilitate much researches in astronomy. The creation of digital libraries and databases of astronomical photographic plates brings up the problem of digitalization astronomical logbooks, since the data contained in them is crucial for the usage of the plates. An optical character recognition (OCR) system for the handwritten numerical data is needed in order to speed up the process of database creation and extension.
In this paper document image binarization is considered since it is a critical stage for the subsequent steps in an OCR software system. A specific method is proposed which outmatches the state-of-the-art techniques in the case of the images of interest.This work has been partially supported by Grant No. DO02-275/2008, Bulgarian NSF,
Ministry of Education and Science
- …
