4,700 research outputs found

    Information reuse in comparative genomics

    Get PDF
    Heringa, J. [Promotor]Nekrutenko, A. [Copromotor

    Tracking repeats using significance and transitivty.

    Get PDF
    transitivity; extreme value distribution Motivation: Internal repeats in coding sequences correspond to structural and functional units of proteins. Moreover, duplication of fragments of coding sequences is known to be a mechanism to facilitate evolution. Identification of repeats is crucial to shed light on the function and structure of proteins, and explain their evolutionary past. The task is difficult because during the course of evolution many repeats diverged beyond recognition. Results: We introduce a new method TRUST, for ab-initio determination of internal repeats in proteins. It provides an improvement in prediction quality as compared to alternative state-of-the-art methods. The increased sensitivity and accuracy of the method is achieved by exploiting the concept of transitivity of alignments. Starting from significant local suboptimal alignments, the application of transitivity allows us to: 1) identify distant repeat homologues for which no alignments were found; 2) gain confidence about consistently well-aligned regions; and 3) recognize and reduce the contribution of nonhomologous repeats. This reassessment step enables us to derive a virtually noise-free profile representing a generalized repeat with high fidelity. We also obtained superior specificity by employing rigid statistical testing for self-sequence and profile-sequence alignments. Assessment was done using a database of repeat annotations based on structural superpositioning. The results show that TRUST is a useful and reliable tool for mining tandem and non-tandem repeats in protein sequence databases, able to predict multiple repeat types with varying intervening segments within a single sequence

    Polki i Polacy na rynku pracy : raport z badań ludności w wieku produkcyjnym realizowanych w 2010 r. w ramach projektu "Bilans Kapitału Ludzkiego"

    Get PDF
    The study of people of working age is the key research module of the supply side of labour market in BKL Study. Generally, in all the analyses - both those concerning various forms of employment, occupations pursued and positions sought, and those that focus on education and self-evaluation of competences, what comes very much to the forefront is the segmentation of the labour market by the gender of the employers. In the period from August to November 2010 (the duration of the study), more or less one in two people working were employed on the grounds of an employment contract (umowa o pracę), one in nine conducted his own business, one in fourteen had worked during the previous year on the grounds of commission agreement (umowa zlecenia)or contract for a specific task (umowa o dzieło), and one in 20 was working without a formal contract. The differentiation between the genders is already visible at this stage: as far as in the case of regular employment women account for half of all employed people, among those running their own business outside agriculture and also working without contracts, there are on average two men for each woman

    Knowledge centers as an innovative knowledge transfer mechanism : lesson learned from the program implemented in Lesser Poland

    Get PDF
    The aim of the paper is to present the lessons learnt from the "SPIN" regional public project. The project was implemented in the region of Lesser Poland. The objective of the project was to increase the intensity of knowledge transfer from universities to enterprises. The goal was achieved by establishing four Centres for Knowledge Transfer at major universities. Each of them was dedicated to a specific domain of knowledge - regional smart specialization - biotechnology, translational medicine, smart grids and energy-saving buildings. The paper discusses the implementation and effects of the project. The most important conclusions stemming from the project concern the fact that the context of the implementation needs to be taken into account during the project as well as the importance of leadership. More attention should also be devoted to the motivation and skills of those involved in the implementation

    STITCH 3: zooming in on protein-chemical interactions

    Get PDF
    To facilitate the study of interactions between proteins and chemicals, we have created STITCH, an aggregated database of interactions connecting over 300 000 chemicals and 2.6 million proteins from 1133 organisms. Compared to the previous version, the number of chemicals with interactions and the number of high-confidence interactions both increase 4-fold. The database can be accessed interactively through a web interface, displaying interactions in an integrated network view. It is also available for computational studies through downloadable files and an API. As an extension in the current version, we offer the option to switch between two levels of detail, namely whether stereoisomers of a given compound are shown as a merged entity or as separate entities. Separate display of stereoisomers is necessary, for example, for carbohydrates and chiral drugs. Combining the isomers increases the coverage, as interaction databases and publications found through text mining will often refer to compounds without specifying the stereoisomer. The database is accessible at http://stitch.embl.d

    Knowledge Graph Completion to Predict Polypharmacy Side Effects

    Full text link
    The polypharmacy side effect prediction problem considers cases in which two drugs taken individually do not result in a particular side effect; however, when the two drugs are taken in combination, the side effect manifests. In this work, we demonstrate that multi-relational knowledge graph completion achieves state-of-the-art results on the polypharmacy side effect prediction problem. Empirical results show that our approach is particularly effective when the protein targets of the drugs are well-characterized. In contrast to prior work, our approach provides more interpretable predictions and hypotheses for wet lab validation.Comment: 13th International Conference on Data Integration in the Life Sciences (DILS2018

    Predictive and experimental approaches for elucidating protein–protein interactions and quaternary structures

    Get PDF
    The elucidation of protein–protein interactions is vital for determining the function and action of quaternary protein structures. Here, we discuss the difficulty and importance of establishing protein quaternary structure and review in vitro and in silico methods for doing so. Determining the interacting partner proteins of predicted protein structures is very time-consuming when using in vitro methods, this can be somewhat alleviated by use of predictive methods. However, developing reliably accurate predictive tools has proved to be difficult. We review the current state of the art in predictive protein interaction software and discuss the problem of scoring and therefore ranking predictions. Current community-based predictive exercises are discussed in relation to the growth of protein interaction prediction as an area within these exercises. We suggest a fusion of experimental and predictive methods that make use of sparse experimental data to determine higher resolution predicted protein interactions as being necessary to drive forward development

    STITCH 4: integration of protein-chemical interactions with user data

    Get PDF
    STITCH is a database of protein-chemical interactions that integrates many sources of experimental and manually curated evidence with text-mining information and interaction predictions. Available at http://stitch.embl.de, the resulting interaction network includes 390 000 chemicals and 3.6 million proteins from 1133 organisms. Compared with the previous version, the number of high-confidence protein-chemical interactions in human has increased by 45%, to 367 000. In this version, we added features for users to upload their own data to STITCH in the form of internal identifiers, chemical structures or quantitative data. For example, a user can now upload a spreadsheet with screening hits to easily check which interactions are already known. To increase the coverage of STITCH, we expanded the text mining to include full-text articles and added a prediction method based on chemical structures. We further changed our scheme for transferring interactions between species to rely on orthology rather than protein similarity. This improves the performance within protein families, where scores are now transferred only to orthologous proteins, but not to paralogous proteins. STITCH can be accessed with a web-interface, an API and downloadable file

    Aubergene - a sensitive genome alignment tool.

    Get PDF
    Motivation: The accumulation of genome sequences will only accelerate in the coming years. We aim to use this abundance of data to improve the quality of genomic alignments and devise a method which is capable of detecting regions evolving under weak or no evolutionary constraints. Results: We describe a genome alignment program AuberGene, which explores the idea of transitivity of local alignments. Assessment of the program was done based on a 2 Mbp genomic region containing the CFTR gene of 13 species. In this region, we can identify 53% of human sequence sharing common ancestry with mouse, as compared with 44% found using the usual pairwise alignment. Between human and tetraodon 93 orthologous exons are found, as compared with 77 detected by the pairwise human-tetraodon comparison. AuberGene allows the user to (1) identify distant, previously undetected, conserved orthogonal regions such as ORFs or regulatory regions; (2) identify neutrally evolving regions in related species which are often overlooked by other alignment programs; (3) recognize false orthologous genomic regions. The increased sensitivity of the method is not obtained at the cost of reduced specificity. Our results suggest that, over the CFTR region, human shares 10% more sequence with mouse than previously thought (∼50%, instead of 40% found with the pairwise alignment). © 2006 Oxford University Press

    Targeted sequencing identifies genetic polymorphisms of flavin-containing monooxygenase genes contributing to susceptibility of nicotine dependence in European American and African American

    Get PDF
    BACKGROUND: Smoking is a leading cause of preventable death. Early studies based on samples of twins have linked the lifetime smoking practices to genetic predisposition. The flavin‐containing monooxygenase (FMO) protein family consists of a group of enzymes that metabolize drugs and xenobiotics. Both FMO1 and FMO3 were potentially susceptible genes for nicotine metabolism process. METHODS: In this study, we investigated the potential of FMO genes to confer risk of nicotine dependence via deep targeted sequencing in 2,820 study subjects comprising 1,583 nicotine dependents and 1,237 controls from European American and African American. Specifically, we focused on the two genomic segments including FMO1,FMO3, and pseudo gene FMO6P, and aimed to investigate the potential association between FMO genes and nicotine dependence. Both common and low‐frequency/rare variants were analyzed using different algorithms. The potential functional significance of SNPs with association signal was investigated with relevant bioinformatics tools. RESULTS: We identified different clusters of significant common variants in European (with most significant SNP rs6674596, p = .0004, OR = 0.67, MAF_EA = 0.14, FMO1) and African Americans (with the most significant SNP rs6608453, p = .001, OR = 0.64, MAF_AA = 0.1, FMO6P). No significant signals were identified through haplotype‐based analyses. Gene network investigation indicated that both FMO1 and FMO3 have a strong relation with a variety of genes belonging to CYP gene families (with combined score greater than 0.9). Most of the significant variants identified were SNPs located within intron regions or with unknown functional significance, indicating a need for future work to understand the underlying functional significance of these signals. CONCLUSIONS: Our findings indicated significant association between FMO genes and nicotine dependence. Replications of our findings in other ethnic groups were needed in the future. Most of the significant variants identified were SNPs located within intronic regions or with unknown functional significance, indicating a need for future work to understand the underlying functional significance of these signals
    corecore