Search CORE

4,700 research outputs found

Information reuse in comparative genomics

Author: Szklarczyk R.J.
Publication venue: Amsterdam: Vrije Universiteit
Publication date: 01/01/2007
Field of study

Heringa, J. [Promotor]Nekrutenko, A. [Copromotor

VU Research Portal

Tracking repeats using significance and transitivty.

Author: Heringa J.
Szklarczyk R.
Publication venue
Publication date: 01/01/2004
Field of study

transitivity; extreme value distribution Motivation: Internal repeats in coding sequences correspond to structural and functional units of proteins. Moreover, duplication of fragments of coding sequences is known to be a mechanism to facilitate evolution. Identification of repeats is crucial to shed light on the function and structure of proteins, and explain their evolutionary past. The task is difficult because during the course of evolution many repeats diverged beyond recognition. Results: We introduce a new method TRUST, for ab-initio determination of internal repeats in proteins. It provides an improvement in prediction quality as compared to alternative state-of-the-art methods. The increased sensitivity and accuracy of the method is achieved by exploiting the concept of transitivity of alignments. Starting from significant local suboptimal alignments, the application of transitivity allows us to: 1) identify distant repeat homologues for which no alignments were found; 2) gain confidence about consistently well-aligned regions; and 3) recognize and reduce the contribution of nonhomologous repeats. This reassessment step enables us to derive a virtually noise-free profile representing a generalized repeat with high fidelity. We also obtained superior specificity by employing rigid statistical testing for self-sequence and profile-sequence alignments. Assessment was done using a database of repeat annotations based on structural superpositioning. The results show that TRUST is a useful and reliable tool for mining tandem and non-tandem repeats in protein sequence databases, able to predict multiple repeat types with varying intervening segments within a single sequence

CiteSeerX

Crossref

VU Research Portal

Polki i Polacy na rynku pracy : raport z badań ludności w wieku produkcyjnym realizowanych w 2010 r. w ramach projektu "Bilans Kapitału Ludzkiego"

Author: Czarnik Szymon
Keler Karolina
Strzebońska Anna
Szklarczyk Dariusz
Publication venue: Polska Agencja Rozwoju Przedsiębiorczości
Publication date: 01/01/2011
Field of study

The study of people of working age is the key research module of the supply side of labour market in BKL Study. Generally, in all the analyses - both those concerning various forms of employment, occupations pursued and positions sought, and those that focus on education and self-evaluation of competences, what comes very much to the forefront is the segmentation of the labour market by the gender of the employers. In the period from August to November 2010 (the duration of the study), more or less one in two people working were employed on the grounds of an employment contract (umowa o pracę), one in nine conducted his own business, one in fourteen had worked during the previous year on the grounds of commission agreement (umowa zlecenia)or contract for a specific task (umowa o dzieło), and one in 20 was working without a formal contract. The differentiation between the genders is already visible at this stage: as far as in the case of regular employment women account for half of all employed people, among those running their own business outside agriculture and also working without contracts, there are on average two men for each woman

Jagiellonian Univeristy Repository

Knowledge centers as an innovative knowledge transfer mechanism : lesson learned from the program implemented in Lesser Poland

Author: Antosz Patrycja
Drożdżak Zuzanna
Krupnik Seweryn
Strycharz Jan
Szczucka Anna
Szklarczyk Dariusz
Łukasiewicz Karolina
Publication venue
Publication date: 01/01/2015
Field of study

The aim of the paper is to present the lessons learnt from the "SPIN" regional public project. The project was implemented in the region of Lesser Poland. The objective of the project was to increase the intensity of knowledge transfer from universities to enterprises. The goal was achieved by establishing four Centres for Knowledge Transfer at major universities. Each of them was dedicated to a specific domain of knowledge - regional smart specialization - biotechnology, translational medicine, smart grids and energy-saving buildings. The paper discusses the implementation and effects of the project. The most important conclusions stemming from the project concern the fact that the context of the implementation needs to be taken into account during the project as well as the importance of leadership. More attention should also be devoted to the motivation and skills of those involved in the implementation

Jagiellonian Univeristy Repository

STITCH 3: zooming in on protein-chemical interactions

Author: Bork Peer
Franceschini Andrea
Jensen Lars Juhl
Kuhn Michael
Szklarczyk Damian
von Mering Christian
Publication venue
Publication date: 02/08/2017
Field of study

To facilitate the study of interactions between proteins and chemicals, we have created STITCH, an aggregated database of interactions connecting over 300 000 chemicals and 2.6 million proteins from 1133 organisms. Compared to the previous version, the number of chemicals with interactions and the number of high-confidence interactions both increase 4-fold. The database can be accessed interactively through a web interface, displaying interactions in an integrated network view. It is also available for computational studies through downloadable files and an API. As an extension in the current version, we offer the option to switch between two levels of detail, namely whether stereoisomers of a given compound are shown as a merged entity or as separate entities. Separate display of stereoisomers is necessary, for example, for carbohydrates and chiral drugs. Combining the isomers increases the coverage, as interaction databases and publications found through text mining will often refer to compounds without specifying the stereoisomer. The database is accessible at http://stitch.embl.d

RERO DOC Digital Library

Knowledge Graph Completion to Predict Polypharmacy Side Effects

Author: AM Manicone
D Sridhar
D Szklarczyk
DA Fishman
F Cheng
GE Hinton
HG Munshi
M Kuhn
M Zitnik
N. P. Tatonetti
W Zhang
Publication venue
Publication date: 22/10/2018
Field of study

The polypharmacy side effect prediction problem considers cases in which two drugs taken individually do not result in a particular side effect; however, when the two drugs are taken in combination, the side effect manifests. In this work, we demonstrate that multi-relational knowledge graph completion achieves state-of-the-art results on the polypharmacy side effect prediction problem. Empirical results show that our approach is particularly effective when the protein targets of the drugs are well-characterized. In contrast to prior work, our approach provides more interpretable predictions and hypotheses for wet lab validation.Comment: 13th International Conference on Data Integration in the Life Sciences (DILS2018

arXiv.org e-Print Archive

Crossref

Predictive and experimental approaches for elucidating protein–protein interactions and quaternary structures

Author: Bernstein
Duhovny
Ehrenberger
Gao
Günzl
John Nealon
Lee
Liam McGuffin
Limcy Philomina
Phizicky
Shimoda
Szklarczyk
Tang
Publication venue: 'MDPI AG'
Publication date: 01/12/2017
Field of study

The elucidation of protein–protein interactions is vital for determining the function and action of quaternary protein structures. Here, we discuss the difficulty and importance of establishing protein quaternary structure and review in vitro and in silico methods for doing so. Determining the interacting partner proteins of predicted protein structures is very time-consuming when using in vitro methods, this can be somewhat alleviated by use of predictive methods. However, developing reliably accurate predictive tools has proved to be difficult. We review the current state of the art in predictive protein interaction software and discuss the problem of scoring and therefore ranking predictions. Current community-based predictive exercises are discussed in relation to the growth of protein interaction prediction as an area within these exercises. We suggest a fusion of experimental and predictive methods that make use of sparse experimental data to determine higher resolution predicted protein interactions as being necessary to drive forward development

Multidisciplinary Digital Publishing Institute

Central Archive at the University of Reading

Crossref

Directory of Open Access Journals

STITCH 4: integration of protein-chemical interactions with user data

Author: Blicher Thomas H.
Bork Peer
Jensen Lars J.
Kuhn Michael
Pletscher-Frankild Sune
Szklarczyk Damian
von Mering Christian
Publication venue
Publication date: 02/08/2017
Field of study

STITCH is a database of protein-chemical interactions that integrates many sources of experimental and manually curated evidence with text-mining information and interaction predictions. Available at http://stitch.embl.de, the resulting interaction network includes 390 000 chemicals and 3.6 million proteins from 1133 organisms. Compared with the previous version, the number of high-confidence protein-chemical interactions in human has increased by 45%, to 367 000. In this version, we added features for users to upload their own data to STITCH in the form of internal identifiers, chemical structures or quantitative data. For example, a user can now upload a spreadsheet with screening hits to easily check which interactions are already known. To increase the coverage of STITCH, we expanded the text mining to include full-text articles and added a prediction method based on chemical structures. We further changed our scheme for transferring interactions between species to rely on orthology rather than protein similarity. This improves the performance within protein families, where scores are now transferred only to orthologous proteins, but not to paralogous proteins. STITCH can be accessed with a web-interface, an API and downloadable file

RERO DOC Digital Library

Aubergene - a sensitive genome alignment tool.

Author: Arslan
Heger
J. Heringa
Karolchik
Kellis
Miller
Morgenstern
Murphy
Notredame
Park
R. Szklarczyk
Thomas
Vingron
Waterston
Ye
Zhang
Publication venue
Publication date: 01/01/2006
Field of study

Motivation: The accumulation of genome sequences will only accelerate in the coming years. We aim to use this abundance of data to improve the quality of genomic alignments and devise a method which is capable of detecting regions evolving under weak or no evolutionary constraints. Results: We describe a genome alignment program AuberGene, which explores the idea of transitivity of local alignments. Assessment of the program was done based on a 2 Mbp genomic region containing the CFTR gene of 13 species. In this region, we can identify 53% of human sequence sharing common ancestry with mouse, as compared with 44% found using the usual pairwise alignment. Between human and tetraodon 93 orthologous exons are found, as compared with 77 detected by the pairwise human-tetraodon comparison. AuberGene allows the user to (1) identify distant, previously undetected, conserved orthogonal regions such as ORFs or regulatory regions; (2) identify neutrally evolving regions in related species which are often overlooked by other alignment programs; (3) recognize false orthologous genomic regions. The increased sensitivity of the method is not obtained at the cost of reduced specificity. Our results suggest that, over the CFTR region, human shares 10% more sequence with mouse than previously thought (∼50%, instead of 40% found with the pairwise alignment). © 2006 Oxford University Press

Crossref

VU Research Portal

Targeted sequencing identifies genetic polymorphisms of flavin-containing monooxygenase genes contributing to susceptibility of nicotine dependence in European American and African American

Author: Beuten
Bierut
Biesalski
Bloom
Boyle
Carmelli
Dickson
Doll
Doll
Feng
Gabriel
Gerra
Hall
Heatherton
Hecht
Hines
Hinrichs
Hong
Huang
Israni
Kennedy
Kremer
Krueger
Laniado-Laborín
Lee
Lerman
Liu
Mackillop
Mangold
Olfson
Park
Price
Pritchard
Pruim
Purcell
Quail
Ripley
Sabol
Saccone
Shafey
Szklarczyk
Thorgeirsson
Thorgeirsson
Tucker
Tønnesen
Uhl
Vink
Wang
World Health Organization
Xie
Publication venue: Digital Commons@Becker
Publication date: 01/01/2017
Field of study

BACKGROUND: Smoking is a leading cause of preventable death. Early studies based on samples of twins have linked the lifetime smoking practices to genetic predisposition. The flavin‐containing monooxygenase (FMO) protein family consists of a group of enzymes that metabolize drugs and xenobiotics. Both FMO1 and FMO3 were potentially susceptible genes for nicotine metabolism process. METHODS: In this study, we investigated the potential of FMO genes to confer risk of nicotine dependence via deep targeted sequencing in 2,820 study subjects comprising 1,583 nicotine dependents and 1,237 controls from European American and African American. Specifically, we focused on the two genomic segments including FMO1,FMO3, and pseudo gene FMO6P, and aimed to investigate the potential association between FMO genes and nicotine dependence. Both common and low‐frequency/rare variants were analyzed using different algorithms. The potential functional significance of SNPs with association signal was investigated with relevant bioinformatics tools. RESULTS: We identified different clusters of significant common variants in European (with most significant SNP rs6674596, p = .0004, OR = 0.67, MAF_EA = 0.14, FMO1) and African Americans (with the most significant SNP rs6608453, p = .001, OR = 0.64, MAF_AA = 0.1, FMO6P). No significant signals were identified through haplotype‐based analyses. Gene network investigation indicated that both FMO1 and FMO3 have a strong relation with a variety of genes belonging to CYP gene families (with combined score greater than 0.9). Most of the significant variants identified were SNPs located within intron regions or with unknown functional significance, indicating a need for future work to understand the underlying functional significance of these signals. CONCLUSIONS: Our findings indicated significant association between FMO genes and nicotine dependence. Replications of our findings in other ethnic groups were needed in the future. Most of the significant variants identified were SNPs located within intronic regions or with unknown functional significance, indicating a need for future work to understand the underlying functional significance of these signals

Crossref

Digital Commons@Becker

PubMed Central