Search CORE

701 research outputs found

An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs.

Author: Kim Joseph
Lee Christopher
Roy Meenakshi
Wu Ying Nian
Xing Yi
Yu Tianwei
Publication venue: eScholarship, University of California
Publication date: 01/01/2006
Field of study

Reconstructing full-length transcript isoforms from sequence fragments (such as ESTs) is a major interest and challenge for bioinformatic analysis of pre-mRNA alternative splicing. This problem has been formulated as finding traversals across the splice graph, which is a directed acyclic graph (DAG) representation of gene structure and alternative splicing. In this manuscript we introduce a probabilistic formulation of the isoform reconstruction problem, and provide an expectation-maximization (EM) algorithm for its maximum likelihood solution. Using a series of simulated data and expressed sequences from real human genes, we demonstrate that our EM algorithm can correctly handle various situations of fragmentation and coupling in the input data. Our work establishes a general probabilistic framework for splice graph-based reconstructions of full-length isoforms

PubMed Central

eScholarship - University of California

Automaticity in processing spatial-numerical associations: Evidence from a perceptual orientation judgment task of Arabic digits in frames.

Author: Chen Chuansheng
Gao Xuefei
Gong Tianwei
Jiang Ting
Li Baichen
Li Xiaomei
Li Zhaojun
Yu Shuyuan
Zhang Meng
Zhang Shudong
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Human adults are faster to respond to small/large numerals with their left/right hand when they judge the parity of numerals, which is known as the SNARC (spatial-numerical association of response codes) effect. It has been proposed that the size of the SNARC effect depends on response latencies. The current study introduced a perceptual orientation task, where participants were asked to judge the orientation of a digit or a frame surrounding the digit. The present study first confirmed the SNARC effect with native Chinese speakers (Experiment 1) using a parity task, and then examined whether the emergence and size of the SNARC effect depended on the response latencies (Experiments 2, 3, and 4) using a perceptual orientation judgment task. Our results suggested that (a) the automatic processing of response-related numerical-spatial information occurred with Chinese-speaking participants in the parity task; (b) the SNARC effect was also found when the task did not require semantic access; and (c) the size of the effect depended on the processing speed of the task-relevant dimension. Finally, we proposed an underlying mechanism to explain the SNARC effect in the perceptual orientation judgment task

Maastricht University Research Portal

Directory of Open Access Journals

Edinburgh Research Explorer

eScholarship - University of California

The Francis Crick Institute

An exploratory data analysis method to reveal modular latent structures in high-throughput data

Author: Yu Tianwei
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Modular structures are ubiquitous across various types of biological networks. The study of network modularity can help reveal regulatory mechanisms in systems biology, evolutionary biology and developmental biology. Identifying putative modular latent structures from high-throughput data using exploratory analysis can help better interpret the data and generate new hypotheses. Unsupervised learning methods designed for global dimension reduction or clustering fall short of identifying modules with factors acting in linear combinations. Results We present an exploratory data analysis method named MLSA (Modular Latent Structure Analysis) to estimate modular latent structures, which can find co-regulative modules that involve non-coexpressive genes. Conclusions Through simulations and real-data analyses, we show that the method can recover modular latent structures effectively. In addition, the method also performed very well on data generated from sparse global latent factor models. The R code is available at <url>http://userwww.service.emory.edu/~tyu8/MLSA/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Improving gene expression data interpretation by finding latent factors that co-regulate gene modules with clinical factors

Author: Bai Yun
Yu Tianwei
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background In the analysis of high-throughput data with a clinical outcome, researchers mostly focus on genes/proteins that show first-order relations with the clinical outcome. While this approach yields biomarkers and biological mechanisms that are easily interpretable, it may miss information that is important to the understanding of disease mechanism and/or treatment response. Here we test the hypothesis that unobserved factors can be mobilized by the living system to coordinate the response to the clinical factors. Results We developed a computational method named Guided Latent Factor Discovery (GLFD) to identify hidden factors that act in combination with the observed clinical factors to control gene modules. In simulation studies, the method recovered masked factors effectively. Using real microarray data, we demonstrate that the method identifies latent factors that are biologically relevant, and extracts more information than analyzing only the first-order response to the clinical outcome. Conclusions Finding latent factors using GLFD brings extra insight into the mechanisms of the disease/drug response. The R code of the method is available at <url>http://userwww.service.emory.edu/~tyu8/GLFD</url>.</p

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Philadelphia College of Osteopathic Medicine: DigitalCommons@PCOM