701 research outputs found

    An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs.

    Get PDF
    Reconstructing full-length transcript isoforms from sequence fragments (such as ESTs) is a major interest and challenge for bioinformatic analysis of pre-mRNA alternative splicing. This problem has been formulated as finding traversals across the splice graph, which is a directed acyclic graph (DAG) representation of gene structure and alternative splicing. In this manuscript we introduce a probabilistic formulation of the isoform reconstruction problem, and provide an expectation-maximization (EM) algorithm for its maximum likelihood solution. Using a series of simulated data and expressed sequences from real human genes, we demonstrate that our EM algorithm can correctly handle various situations of fragmentation and coupling in the input data. Our work establishes a general probabilistic framework for splice graph-based reconstructions of full-length isoforms

    Automaticity in processing spatial-numerical associations: Evidence from a perceptual orientation judgment task of Arabic digits in frames.

    Get PDF
    Human adults are faster to respond to small/large numerals with their left/right hand when they judge the parity of numerals, which is known as the SNARC (spatial-numerical association of response codes) effect. It has been proposed that the size of the SNARC effect depends on response latencies. The current study introduced a perceptual orientation task, where participants were asked to judge the orientation of a digit or a frame surrounding the digit. The present study first confirmed the SNARC effect with native Chinese speakers (Experiment 1) using a parity task, and then examined whether the emergence and size of the SNARC effect depended on the response latencies (Experiments 2, 3, and 4) using a perceptual orientation judgment task. Our results suggested that (a) the automatic processing of response-related numerical-spatial information occurred with Chinese-speaking participants in the parity task; (b) the SNARC effect was also found when the task did not require semantic access; and (c) the size of the effect depended on the processing speed of the task-relevant dimension. Finally, we proposed an underlying mechanism to explain the SNARC effect in the perceptual orientation judgment task

    An exploratory data analysis method to reveal modular latent structures in high-throughput data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Modular structures are ubiquitous across various types of biological networks. The study of network modularity can help reveal regulatory mechanisms in systems biology, evolutionary biology and developmental biology. Identifying putative modular latent structures from high-throughput data using exploratory analysis can help better interpret the data and generate new hypotheses. Unsupervised learning methods designed for global dimension reduction or clustering fall short of identifying modules with factors acting in linear combinations.</p> <p>Results</p> <p>We present an exploratory data analysis method named MLSA (Modular Latent Structure Analysis) to estimate modular latent structures, which can find co-regulative modules that involve non-coexpressive genes.</p> <p>Conclusions</p> <p>Through simulations and real-data analyses, we show that the method can recover modular latent structures effectively. In addition, the method also performed very well on data generated from sparse global latent factor models. The R code is available at <url>http://userwww.service.emory.edu/~tyu8/MLSA/</url>.</p

    Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In the analysis of high-throughput data with a clinical outcome, researchers mostly focus on genes/proteins that show first-order relations with the clinical outcome. While this approach yields biomarkers and biological mechanisms that are easily interpretable, it may miss information that is important to the understanding of disease mechanism and/or treatment response. Here we test the hypothesis that unobserved factors can be mobilized by the living system to coordinate the response to the clinical factors.</p> <p>Results</p> <p>We developed a computational method named Guided Latent Factor Discovery (GLFD) to identify hidden factors that act in combination with the observed clinical factors to control gene modules. In simulation studies, the method recovered masked factors effectively. Using real microarray data, we demonstrate that the method identifies latent factors that are biologically relevant, and extracts more information than analyzing only the first-order response to the clinical outcome.</p> <p>Conclusions</p> <p>Finding latent factors using GLFD brings extra insight into the mechanisms of the disease/drug response. The R code of the method is available at <url>http://userwww.service.emory.edu/~tyu8/GLFD</url>.</p
    corecore