247 research outputs found
Infectious Disease Ontology
Technological developments have resulted in tremendous increases in the volume and diversity of the data and information that must be processed in the course of biomedical and clinical research and practice. Researchers are at the same time under ever greater pressure to share data and to take steps to ensure that data resources are interoperable. The use of ontologies to annotate data has proven successful in supporting these goals and in providing new possibilities for the automated processing of data and information. In this chapter, we describe different types of vocabulary resources and emphasize those features of formal ontologies that make them most useful for computational applications. We describe current uses of ontologies and discuss future goals for ontology-based computing, focusing on its use in the field of infectious diseases. We review the largest and most widely used vocabulary resources relevant to the study of infectious diseases and conclude with a description of the Infectious Disease Ontology (IDO) suite of interoperable ontology modules that together cover the entire infectious disease domain
Transcriptional profiling of rat hypothalamus response to 2,3,7,8-tetrachlorodibenzo-p-dioxin
In some mammals, halogenated aromatic hydrocarbon (HAH) exposure causes wasting syndrome, defined as significant weight loss associated with lethal outcomes. The most potent HAH in causing wasting is 2,3,7,8-tetrachlorodibenzo-rho-dioxin (TCDD), which exerts its toxic effects through the aryl hydrocarbon receptor (AHR). Since TCDD toxicity is thought to predominantly arise from dysregulation of AHR-transcribed genes, it was hypothesized that wasting syndrome is a result of to TCDD-induced dysregulation of genes involved in regulation of food-intake. As the hypothalamus is the central nervous systems' regulatory center for food-intake and energy balance. Therefore, mRNA abundances in hypothalamic tissue from two rat strains with widely differing sensitivities to TCDD-induced wasting syndrome: TCDD-sensitive Long-Evans rats and TCDD-resistant Han/Wistar rats, 23 h after exposure to TCDD (100 mu g/kg) or corn oil vehicle. TCDD exposure caused minimal transcriptional dysregulation in the hypothalamus, with only 6 genes significantly altered in Long-Evans rats and 15 genes in Han/Wistar rats. Two of the most dysregulated genes were Cyp1a1 and Nqo1, which are induced by TCDD across a wide range of tissues and are considered sensitive markers of TCDD exposure. The minimal response of the hypothalamic transcriptome to a lethal dose of TCDD at an early time-point suggests that the hypothalamus is not the predominant site of initial events leading to hypophagia and associated wasting. TCDD may affect feeding behaviour via events upstream or downstream of the hypothalamus, and further work is required to evaluate this at the level of individual hypothalamic nuclei and subregions. (C) 2014 The Authors. Published by Elsevier Ireland Ltd.Peer reviewe
Systems biology via redescription and ontologies (I): finding phase changes with applications to malaria temporal data
Biological systems are complex and often composed of many subtly interacting components. Furthermore, such systems evolve through time and, as the underlying biology executes its genetic program, the relationships between components change and undergo dynamic reorganization. Characterizing these relationships precisely is a challenging task, but one that must be undertaken if we are to understand these systems in sufficient detail. One set of tools that may prove useful are the formal principles of model building and checking, which could allow the biologist to frame these inherently temporal questions in a sufficiently rigorous framework. In response to these challenges, GOALIE (Gene ontology algorithmic logic and information extractor) was developed and has been successfully employed in the analysis of high throughput biological data (e.g. time-course gene-expression microarray data and neural spike train recordings). The method has applications to a wide variety of temporal data, indeed any data for which there exist ontological descriptions. This paper describes the algorithms behind GOALIE and its use in the study of the Intraerythrocytic Developmental Cycle (IDC) of Plasmodium falciparum, the parasite responsible for a deadly form of chloroquine resistant malaria. We focus in particular on the problem of finding phase changes, times of reorganization of transcriptional control
SpliceMiner: a high-throughput database implementation of the NCBI Evidence Viewer for microarray splice variant analysis
BACKGROUND: There are many fewer genes in the human genome than there are expressed transcripts. Alternative splicing is the reason. Alternatively spliced transcripts are often specific to tissue type, developmental stage, environmental condition, or disease state. Accurate analysis of microarray expression data and design of new arrays for alternative splicing require assessment of probes at the sequence and exon levels. DESCRIPTION: SpliceMiner is a web interface for querying Evidence Viewer Database (EVDB). EVDB is a comprehensive, non-redundant compendium of splice variant data for human genes. We constructed EVDB as a queryable implementation of the NCBI Evidence Viewer (EV). EVDB is based on data obtained from NCBI Entrez Gene and EV. The automated EVDB build process uses only complete coding sequences, which may or may not include partial or complete 5' and 3' UTRs, and filters redundant splice variants. Unlike EV, which supports only one-at-a-time queries, SpliceMiner supports high-throughput batch queries and provides results in an easily parsable format. SpliceMiner maps probes to splice variants, effectively delineating the variants identified by a probe. CONCLUSION: EVDB can be queried by gene symbol, genomic coordinates, or probe sequence via a user-friendly web-based tool we call SpliceMiner (). The EVDB/SpliceMiner combination provides an interface with human splice variant information and, going beyond the very valuable NCBI Evidence Viewer, supports fluent, high-throughput analysis. Integration of EVDB information into microarray analysis and design pipelines has the potential to improve the analysis and bioinformatic interpretation of gene expression data, for both batch and interactive processing. For example, whenever a gene expression value is recognized as important or appears anomalous in a microarray experiment, the interactive mode of SpliceMiner can be used quickly and easily to check for possible splice variant issues
Altered gene expression and DNA damage in peripheral blood cells from Friedreich's ataxia patients: Cellular model of pathology
The neurodegenerative disease Friedreich's ataxia (FRDA) is the most common autosomal-recessively inherited ataxia and is caused by a GAA triplet repeat expansion in the first intron of the frataxin gene. In this disease, transcription of frataxin, a mitochondrial protein involved in iron homeostasis, is impaired, resulting in a significant reduction in mRNA and protein levels. Global gene expression analysis was performed in peripheral blood samples from FRDA patients as compared to controls, which suggested altered expression patterns pertaining to genotoxic stress. We then confirmed the presence of genotoxic DNA damage by using a gene-specific quantitative PCR assay and discovered an increase in both mitochondrial and nuclear DNA damage in the blood of these patients (p<0.0001, respectively). Additionally, frataxin mRNA levels correlated with age of onset of disease and displayed unique sets of gene alterations involved in immune response, oxidative phosphorylation, and protein synthesis. Many of the key pathways observed by transcription profiling were downregulated, and we believe these data suggest that patients with prolonged frataxin deficiency undergo a systemic survival response to chronic genotoxic stress and consequent DNA damage detectable in blood. In conclusion, our results yield insight into the nature and progression of FRDA, as well as possible therapeutic approaches. Furthermore, the identification of potential biomarkers, including the DNA damage found in peripheral blood, may have predictive value in future clinical trials
SpliceCenter: A suite of web-based bioinformatic applications for evaluating the impact of alternative splicing on RT-PCR, RNAi, microarray, and peptide-based studies
<p>Abstract</p> <p>Background</p> <p>Over 60% of protein-coding genes in vertebrates express mRNAs that undergo alternative splicing. The resulting collection of transcript isoforms poses significant challenges for contemporary biological assays. For example, RT-PCR validation of gene expression microarray results may be unsuccessful if the two technologies target different splice variants. Effective use of sequence-based technologies requires knowledge of the specific splice variant(s) that are targeted. In addition, the critical roles of alternative splice forms in biological function and in disease suggest that assay results may be more informative if analyzed in the context of the targeted splice variant.</p> <p>Results</p> <p>A number of contemporary technologies are used for analyzing transcripts or proteins. To enable investigation of the impact of splice variation on the interpretation of data derived from those technologies, we have developed SpliceCenter. SpliceCenter is a suite of user-friendly, web-based applications that includes programs for analysis of RT-PCR primer/probe sets, effectors of RNAi, microarrays, and protein-targeting technologies. Both interactive and high-throughput implementations of the tools are provided. The interactive versions of SpliceCenter tools provide visualizations of a gene's alternative transcripts and probe target positions, enabling the user to identify which splice variants are or are not targeted. The high-throughput batch versions accept user query files and provide results in tabular form. When, for example, we used SpliceCenter's batch siRNA-Check to process the Cancer Genome Anatomy Project's large-scale shRNA library, we found that only 59% of the 50,766 shRNAs in the library target all known splice variants of the target gene, 32% target some but not all, and 9% do not target any currently annotated transcript.</p> <p>Conclusion</p> <p>SpliceCenter <url>http://discover.nci.nih.gov/splicecenter</url> provides unique, user-friendly applications for assessing the impact of transcript variation on the design and interpretation of RT-PCR, RNAi, gene expression microarrays, antibody-based detection, and mass spectrometry proteomics. The tools are intended for use by bench biologists as well as bioinformaticists.</p
TGFβ Signaling Increases Net Acid Extrusion, Proliferation and Invasion in Panc-1 Pancreatic Cancer Cells:SMAD4 Dependence and Link to Merlin/NF2 Signaling
Nonlinear gene cluster analysis with labeling for microarray gene expression data in organ development
RedundancyMiner: De-replication of redundant GO categories in microarray and proteomics analysis
<p>Abstract</p> <p>Background</p> <p>The Gene Ontology (GO) Consortium organizes genes into hierarchical categories based on biological process, molecular function and subcellular localization. Tools such as GoMiner can leverage GO to perform ontological analysis of microarray and proteomics studies, typically generating a list of significant functional categories. Two or more of the categories are often redundant, in the sense that identical or nearly-identical sets of genes map to the categories. The redundancy might typically inflate the report of significant categories by a factor of three-fold, create an illusion of an overly long list of significant categories, and obscure the relevant biological interpretation.</p> <p>Results</p> <p>We now introduce a new resource, RedundancyMiner, that de-replicates the redundant and nearly-redundant GO categories that had been determined by first running GoMiner. The main algorithm of RedundancyMiner, MultiClust, performs a novel form of cluster analysis in which a GO category might belong to several category clusters. Each category cluster follows a "complete linkage" paradigm. The metric is a similarity measure that captures the overlap in gene mapping between pairs of categories.</p> <p>Conclusions</p> <p>RedundancyMiner effectively eliminated redundancies from a set of GO categories. For illustration, we have applied it to the clarification of the results arising from two current studies: (1) assessment of the gene expression profiles obtained by laser capture microdissection (LCM) of serial cryosections of the retina at the site of final optic fissure closure in the mouse embryos at specific embryonic stages, and (2) analysis of a conceptual data set obtained by examining a list of genes deemed to be "kinetochore" genes.</p
Adding a Little Reality to Building Ontologies for Biology
BACKGROUND: Many areas of biology are open to mathematical and computational modelling. The application of discrete, logical formalisms defines the field of biomedical ontologies. Ontologies have been put to many uses in bioinformatics. The most widespread is for description of entities about which data have been collected, allowing integration and analysis across multiple resources. There are now over 60 ontologies in active use, increasingly developed as large, international collaborations. There are, however, many opinions on how ontologies should be authored; that is, what is appropriate for representation. Recently, a common opinion has been the "realist" approach that places restrictions upon the style of modelling considered to be appropriate. METHODOLOGY/PRINCIPAL FINDINGS: Here, we use a number of case studies for describing the results of biological experiments. We investigate the ways in which these could be represented using both realist and non-realist approaches; we consider the limitations and advantages of each of these models. CONCLUSIONS/SIGNIFICANCE: From our analysis, we conclude that while realist principles may enable straight-forward modelling for some topics, there are crucial aspects of science and the phenomena it studies that do not fit into this approach; realism appears to be over-simplistic which, perversely, results in overly complex ontological models. We suggest that it is impossible to avoid compromise in modelling ontology; a clearer understanding of these compromises will better enable appropriate modelling, fulfilling the many needs for discrete mathematical models within computational biology
- …
