368 research outputs found

    SteinerNet: a web server for integrating ‘omic’ data to discover hidden components of response pathways

    Get PDF
    High-throughput technologies including transcriptional profiling, proteomics and reverse genetics screens provide detailed molecular descriptions of cellular responses to perturbations. However, it is difficult to integrate these diverse data to reconstruct biologically meaningful signaling networks. Previously, we have established a framework for integrating transcriptional, proteomic and interactome data by searching for the solution to the prize-collecting Steiner tree problem. Here, we present a web server, SteinerNet, to make this method available in a user-friendly format for a broad range of users with data from any species. At a minimum, a user only needs to provide a set of experimentally detected proteins and/or genes and the server will search for connections among these data from the provided interactomes for yeast, human, mouse, Drosophila melanogaster and Caenorhabditis elegans. More advanced users can upload their own interactome data as well. The server provides interactive visualization of the resulting optimal network and downloadable files detailing the analysis and results. We believe that SteinerNet will be useful for researchers who would like to integrate their high-throughput data for a specific condition or cellular response and to find biologically meaningful pathways. SteinerNet is accessible at http://fraenkel.mit.edu/steinernet.National Institutes of Health (U.S.) (U54-CA112967)National Institutes of Health (U.S.) (R01-GM089903)National Science Foundation (Award Number DB1-0821391)National Institutes of Health (U.S.) (U54-CA112967

    Reconstruction of the temporal signaling network in Salmonella-infected human cells

    Get PDF
    Salmonella enterica is a bacterial pathogen that usually infects its host through food sources. Translocation of the pathogen proteins into the host cells leads to changes in the signaling mechanism either by activating or inhibiting the host proteins. Using high-throughput ‘omic’ technologies, changes in the signaling components can be quantified at different levels; however, experimental hits are usually incomplete to represent the whole signaling system as some driver proteins stay hidden within the experimental data. Given that the bacterial infection modifies the response network of the host, more coherent view of the underlying biological processes and the signaling networks can be obtained by using a network modeling approach based on the reverse engineering principles in which a confident region from the protein interactome is found by inferring hits from the omic experiments. In this work, we have used a published temporal phosphoproteomic dataset of Salmonella-infected human cells and reconstructed the temporal signaling network of the human host by integrating the interactome and the phosphoproteomic datasets. We have combined two well-established network modeling frameworks, the Prize-collecting Steiner Forest (PCSF) approach and the Integer Linear Programming (ILP) based edge inference approach. The resulting network conserves the information on temporality, direction of interactions, while revealing hidden entities in the signaling, such as the SNARE binding, mTOR signaling, immune response, cytoskeleton organization, and apoptosis pathways. Targets of the Salmonella effectors in the host cells such as CDC42, RHOA, 14-3-3δ, Syntaxin family, Oxysterol-binding proteins were included in the reconstructed signaling network although they were not present in the initial phosphoproteomic data. We believe that integrated approaches have a high potential for the identification of clinical targets in infectious diseases, especially in the Salmonella infections

    Network-Based Interpretation of Diverse High-Throughput Datasets through the Omics Integrator Software Package

    Get PDF
    High-throughput, ‘omic’ methods provide sensitive measures of biological responses to perturbations. However, inherent biases in high-throughput assays make it difficult to interpret experiments in which more than one type of data is collected. In this work, we introduce Omics Integrator, a software package that takes a variety of ‘omic’ data as input and identifies putative underlying molecular pathways. The approach applies advanced network optimization algorithms to a network of thousands of molecular interactions to find high-confidence, interpretable subnetworks that best explain the data. These subnetworks connect changes observed in gene expression, protein abundance or other global assays to proteins that may not have been measured in the screens due to inherent bias or noise in measurement. This approach reveals unannotated molecular pathways that would not be detectable by searching pathway databases. Omics Integrator also provides an elegant framework to incorporate not only positive data, but also negative evidence. Incorporating negative evidence allows Omics Integrator to avoid unexpressed genes and avoid being biased toward highly-studied hub proteins, except when they are strongly implicated by the data. The software is comprised of two individual tools, Garnet and Forest, that can be run together or independently to allow a user to perform advanced integration of multiple types of high-throughput data as well as create condition-specific subnetworks of protein interactions that best connect the observed changes in various datasets. It is available at http://fraenkel.mit.edu/omicsintegrator and on GitHub at https://github.com/fraenkel-lab/OmicsIntegrator.National Institutes of Health (U.S.) (grant U54CA112967)National Institutes of Health (U.S.) (grant U01CA184898)National Institutes of Health (U.S.) (grant U54NS091046)National Institutes of Health (U.S.) (grant R01GM089903

    Composite structural motifs of binding sites for delineating biological functions of proteins

    Get PDF
    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.Comment: 34 pages, 7 figure

    Propensity vectors of low-ASA residue pairs in the distinction of protein interactions

    Full text link
    We introduce low-ASA residue pairs as classification features for distinguishing the different types of protein interactions. A low-ASA residue pair is defined as two contact residues each from one chain that have a small solvent accessible surface area (ASA). This notion of residue pairs is novel as it first combines residue pairs with the O-ring theory, an influential proposition stating that the binding hot spots at the interface are often surrounded by a ring of energetically less important residues. As binding hot spots lie in the core of the stability for protein interactions, we believe that low-ASA residue pairs can sharpen the distinction of protein interactions. The main part of our feature vector is 210-dimensional, consisting of all possible low-ASA residue pairs; the value of every feature is determined by a propensity measure. Our classification method is called OringPV, which uses propensity vectors of protein interactions for support vector machine. OringPV is tested on three benchmark datasets for a variety of classification tasks such as the distinction between crystal packing and biological interactions, the distinction between two different types of biological interactions, etc. The evaluation frameworks include within-dataset, crossdataset comparison, and leave-one-out crossvalidation. The results show that low-ASA residue pairs and the propensity vector description of protein interactions are truly strong in the distinction. In particular, many cross-dataset generalization capability tests have achieved excellent recalls and overall accuracies, much outperforming existing benchmark methods. © 2009 Wiley-Liss, Inc

    Genome-Scale Networks Link Neurodegenerative Disease Genes to α-Synuclein through Specific Molecular Pathways

    Get PDF
    Numerous genes and molecular pathways are implicated in neurodegenerative proteinopathies, but their inter-relationships are poorly understood. We systematically mapped molecular pathways underlying the toxicity of alpha-synuclein (α-syn), a protein central to Parkinson's disease. Genome-wide screens in yeast identified 332 genes that impact α-syn toxicity. To “humanize” this molecular network, we developed a computational method, TransposeNet. This integrates a Steiner prize-collecting approach with homology assignment through sequence, structure, and interaction topology. TransposeNet linked α-syn to multiple parkinsonism genes and druggable targets through perturbed protein trafficking and ER quality control as well as mRNA metabolism and translation. A calcium signaling hub linked these processes to perturbed mitochondrial quality control and function, metal ion transport, transcriptional regulation, and signal transduction. Parkinsonism gene interaction profiles spatially opposed in the network (ATP13A2/PARK9 and VPS35/PARK17) were highly distinct, and network relationships for specific genes (LRRK2/PARK8, ATXN2, and EIF4G1/PARK18) were confirmed in patient induced pluripotent stem cell (iPSC)-derived neurons. This cross-species platform connected diverse neurodegenerative genes to proteinopathy through specific mechanisms and may facilitate patient stratification for targeted therapy. Keywords: alpha-synuclein; iPS cell; Parkinson’s disease; stem cell; mRNA translation; RNA-binding protein; LRRK2; VPS35; vesicle trafficking; yeas

    Context dependent isoform specific PI3K inhibition confers drug resistance in hepatocellular carcinoma cells.

    Get PDF
    Background: Targeted therapies for Primary liver cancer (HCC) is limited to the multi-kinase inhibitors, and not fully effective due to the resistance to these agents because of the heterogeneous molecular nature of HCC developed during chronic liver disease stages and cirrhosis. Although combinatorial therapy can increase the efficiency of targeted therapies through synergistic activities, isoform specific effects of the inhibitors are usually ignored. This study concentrated on PI3K/Akt/mTOR pathway and the differential combinatory bioactivities of isoform specific PI3K-alpha inhibitor (PIK-75) or PI3K-beta inhibitor (TGX-221) with Sorafenib dependent on PTEN context

    Quantifying the geographical distribution effect on decreasing aggregated nitrogen oxides intensity in the Chinese electrical generation system

    Get PDF
    Over the past 20 years, the spatial distribution of electrical generation and its relationship to cross-regional power transmission has impacted China's power generation system and significantly affected the total amount of NO x and the aggregated nitrogen oxides intensity (ANI) of the system. An investigation of the driving mechanisms of ANI that considers the unevenness of regional electricity generation will be crucial to future improvements in the NO x efficiency of the electrical generation system in China. In this study, we built a decomposition model for ANI by incorporating the spatial distribution of electrical generation and found that the spatial distribution of electricity generation together with energy-related factors gradually caused decreases in ANI. The efficiency of electricity generation presented the dominant inhibitory effect on ANI, but its effect size has weakened since 2010. In contrast, the fossil fuel structure of thermal power shows an increasingly positive effect on changes in ANI. The primary energy composition only slightly affected changes in ANI. Moreover, the changed geographical distribution of electricity generation is non-negligible and has a positive effect on reduction of the ANI of the Chinese electrical generation system. The transferred amount of local NO x emissions by cross-provincial electricity transmission, however, could cause lead to additional environmental costs for generators. This issue should receive more attention in the future

    Pan-cancer clinical impact of latent drivers from double mutations

    Get PDF
    Here, we discover potential ‘latent driver’ mutations in cancer genomes. Latent drivers have low frequencies and minor observable translational potential. As such, to date they have escaped identification. Their discovery is important, since when paired in cis, latent driver mutations can drive cancer. Our comprehensive statistical analysis of the pan-cancer mutation profiles of ~60,000 tumor sequences from the TCGA and AACR-GENIE cohorts identifies significantly co-occurring potential latent drivers. We observe 155 same gene double mutations of which 140 individual components are cataloged as latent drivers. Evaluation of cell lines and patient-derived xenograft response data to drug treatment indicate that in certain genes double mutations may have a prominent role in increasing oncogenic activity, hence obtaining a better drug response, as in PIK3CA. Taken together, our comprehensive analyses indicate that same-gene double mutations are exceedingly rare phenomena but are a signature for some cancer types, e.g., breast, and lung cancers. The relative rarity of doublets can be explained by the likelihood of strong signals resulting in oncogene-induced senescence, and by doublets consisting of non-identical single residue components populating the background mutational load, thus not identified
    corecore