325 research outputs found

    Inter-protein sequence co-evolution predicts known physical interactions in bacterial ribosomes and the trp operon

    Get PDF
    Interaction between proteins is a fundamental mechanism that underlies virtually all biological processes. Many important interactions are conserved across a large variety of species. The need to maintain interaction leads to a high degree of co-evolution between residues in the interface between partner proteins. The inference of protein-protein interaction networks from the rapidly growing sequence databases is one of the most formidable tasks in systems biology today. We propose here a novel approach based on the Direct-Coupling Analysis of the co-evolution between inter-protein residue pairs. We use ribosomal and trp operon proteins as test cases: For the small resp. large ribosomal subunit our approach predicts protein-interaction partners at a true-positive rate of 70% resp. 90% within the first 10 predictions, with areas of 0.69 resp. 0.81 under the ROC curves for all predictions. In the trp operon, it assigns the two largest interaction scores to the only two interactions experimentally known. On the level of residue interactions we show that for both the small and the large ribosomal subunit our approach predicts interacting residues in the system with a true positive rate of 60% and 85% in the first 20 predictions. We use artificial data to show that the performance of our approach depends crucially on the size of the joint multiple sequence alignments and analyze how many sequences would be necessary for a perfect prediction if the sequences were sampled from the same model that we use for prediction. Given the performance of our approach on the test data we speculate that it can be used to detect new interactions, especially in the light of the rapid growth of available sequence data

    Large-scale identification of coevolution signals across homo-oligomeric protein interfaces by Direct Coupling Analysis

    Full text link
    Proteins have evolved to perform diverse cellular functions, from serving as reaction catalysts to coordinating cellular propagation and development. Frequently, proteins do not exert their full potential as monomers but rather undergo concerted interactions as either homo-oligomers or with other proteins as hetero-oligomers. The experimental study of such protein complexes and interactions has been arduous. Theoretical structure prediction methods are an attractive alternative. Here, we investigate homo-oligomeric interfaces by tracing residue coevolution via the global statistical Direct Coupling Analysis (DCA). DCA can accurately infer spatial adjacencies between residues. These adjacencies can be included as constraints in structure-prediction techniques to predict high-resolution models. By taking advantage of the on-going exponential growth of sequence databases, we go significantly beyond anecdotal cases of a few protein families and apply DCA to a systematic large-scale study of nearly 2000 PFAM protein families with sufficient sequence information and structurally resolved homo-oligomeric interfaces. We find that large interfaces are commonly identified by DCA. We further demonstrate that DCA can differentiate between subfamilies of different binding modes within one large PFAM family. Sequence derived contact information for the subfamilies proves sufficient to assemble accurate structural models of the diverse protein-oligomers. Thus, we provide an approach to investigate oligomerization for arbitrary protein families leading to structural models complementary to often difficult experimental methods. Combined with ever more abundant sequential data, we anticipate that this study will be instrumental to allow the structural description of many hetero-protein complexes in the future.Comment: 18 pages, 6 figures, to appears in PNA

    Identification of direct residue contacts in protein-protein interaction by message passing

    Full text link
    Understanding the molecular determinants of specificity in protein-protein interaction is an outstanding challenge of postgenome biology. The availability of large protein databases generated from sequences of hundreds of bacterial genomes enables various statistical approaches to this problem. In this context covariance-based methods have been used to identify correlation between amino acid positions in interacting proteins. However, these methods have an important shortcoming, in that they cannot distinguish between directly and indirectly correlated residues. We developed a method that combines covariance analysis with global inference analysis, adopted from use in statistical physics. Applied to a set of >2,500 representatives of the bacterial two-component signal transduction system, the combination of covariance with global inference successfully and robustly identified residue pairs that are proximal in space without resorting to ad hoc tuning parameters, both for heterointeractions between sensor kinase (SK) and response regulator (RR) proteins and for homointeractions between RR proteins. The spectacular success of this approach illustrates the effectiveness of the global inference approach in identifying direct interaction based on sequence information alone. We expect this method to be applicable soon to interaction surfaces between proteins present in only 1 copy per genome as the number of sequenced genomes continues to expand. Use of this method could significantly increase the potential targets for therapeutic intervention, shed light on the mechanism of protein-protein interaction, and establish the foundation for the accurate prediction of interacting protein partners.Comment: Supplementary information available on http://www.pnas.org/content/106/1/67.abstrac

    Dissecting the Specificity of Protein-Protein Interaction in Bacterial Two-Component Signaling: Orphans and Crosstalks

    Get PDF
    Predictive understanding of the myriads of signal transduction pathways in a cell is an outstanding challenge of systems biology. Such pathways are primarily mediated by specific but transient protein-protein interactions, which are difficult to study experimentally. In this study, we dissect the specificity of protein-protein interactions governing two-component signaling (TCS) systems ubiquitously used in bacteria. Exploiting the large number of sequenced bacterial genomes and an operon structure which packages many pairs of interacting TCS proteins together, we developed a computational approach to extract a molecular interaction code capturing the preferences of a small but critical number of directly interacting residue pairs. This code is found to reflect physical interaction mechanisms, with the strongest signal coming from charged amino acids. It is used to predict the specificity of TCS interaction: Our results compare favorably to most available experimental results, including the prediction of 7 (out of 8 known) interaction partners of orphan signaling proteins in Caulobacter crescentus. Surveying among the available bacterial genomes, our results suggest 15~25% of the TCS proteins could participate in out-of-operon "crosstalks". Additionally, we predict clusters of crosstalking candidates, expanding from the anecdotally known examples in model organisms. The tools and results presented here can be used to guide experimental studies towards a system-level understanding of two-component signaling.Comment: Supplementary information available on http://www.plosone.org/article/info:doi/10.1371/journal.pone.001972

    Inter-residue, inter-protein and inter-family coevolution: bridging the scales

    Full text link
    Interacting proteins coevolve at multiple but interconnected scales, from the residue-residue over the protein-protein up to the family-family level. The recent accumulation of enormous amounts of sequence data allows for the development of novel, data-driven computational approaches. Notably, these approaches can bridge scales within a single statistical framework. While being currently applied mostly to isolated problems on single scales, their immense potential for an evolutionary informed, structural systems biology is steadily emerging.Comment: 12 pages, review pape

    Mechanism of activation and autophosphorylation of a histidine kinase

    Get PDF
    Histidine kinases (HK) are one of the main prokaryotic signaling systems. Two structurally conserved catalytic domains inside the HK enable autokinase, phosphotransfer, and phosphatase activities. Here, we focus on a detailed mechanistic understanding of the functional cycle of the WalK HK by a multi-scale simulation approach, consisting of classical as well as hybrid QM/MM molecular dynamics simulation. Strikingly, a conformational transition induced solely in DHp leads to the correct activated conformation in CA crucial for autophosphorylation. This finding explains how variable sensor domains induce the transition from inactive to active state. The subsequent autophosphorylation inside DHp proceeds via a penta-coordinated transition state to a protonated phosphohistidine intermediate. This intermediate is consequently deprotonated by a suitable nearby base. The reaction energetics are controlled by the final proton acceptor and presence of a magnesium cation. The slow rates of the process result from the high energy barrier of the conformational transition between inactive and active states. The phosphorylation step exhibits a lower barrier and down-the-hill energetics. Thus, our work suggests a detailed mechanistic model for HK autophosphorylation

    Coevolved mutations reveal distinct architectures for two core proteins in the bacterial flagellar motor

    Get PDF
    Switching of bacterial flagellar rotation is caused by large domain movements of the FliG protein triggered by binding of the signal protein CheY to FliM. FliG and FliM form adjacent multi-subunit arrays within the basal body C-ring. The movements alter the interaction of the FliG C-terminal (FliGC) "torque" helix with the stator complexes. Atomic models based on the Salmonella entrovar C-ring electron microscopy reconstruction have implications for switching, but lack consensus on the relative locations of the FliG armadillo (ARM) domains (amino-terminal (FliGN), middle (FliGM) and FliGC) as well as changes during chemotaxis. The generality of the Salmonella model is challenged by the variation in motor morphology and response between species. We studied coevolved residue mutations to determine the unifying elements of switch architecture. Residue interactions, measured by their coevolution, were formalized as a network, guided by structural data. Our measurements reveal a common design with dedicated switch and motor modules. The FliM middle domain (FliMM) has extensive connectivity most simply explained by conserved intra and inter-subunit contacts. In contrast, FliG has patchy, complex architecture. Conserved structural motifs form interacting nodes in the coevolution network that wire FliMM to the FliGC C-terminal, four-helix motor module (C3-6). FliG C3-6 coevolution is organized around the torque helix, differently from other ARM domains. The nodes form separated, surface-proximal patches that are targeted by deleterious mutations as in other allosteric systems. The dominant node is formed by the EHPQ motif at the FliMMFliGM contact interface and adjacent helix residues at a central location within FliGM. The node interacts with nodes in the N-terminal FliGc α-helix triad (ARM-C) and FliGN. ARM-C, separated from C3-6 by the MFVF motif, has poor intra-network connectivity consistent with its variable orientation revealed by structural data. ARM-C could be the convertor element that provides mechanistic and species diversity.JK was supported by Medical Research Council grant U117581331. SK was supported by seed funds from Lahore University of Managment Sciences (LUMS) and the Molecular Biology Consortium

    YycH Regulates the Activity of the Essential YycFG Two-Component System in Bacillus subtilis

    Full text link
    Of the numerous two-component signal transduction systems found in bacteria, only a very few have proven to be essential for cell viability. Among these is the YycF (response regulator)-YycG (histidine kinase) system, which is highly conserved in and specific to the low-G+C content gram-positive bacteria. Given the pathogenic nature of several members of this class of bacteria, the YycF-YycG system has been suggested as a prime antimicrobial target. In an attempt to identify genes involved in regulation of this two-component system, a transposon mutagenesis study was designed to identify suppressors of a temperature-sensitive YycF mutant in Bacillus subtilis. Suppressors could be identified, and the prime target was the yycH gene located adjacent to yycG and within the same operon. A lacZ reporter assay revealed that YycF-regulated gene expression was elevated in a yycH strain, whereas disruption of any of the three downstream genes within the operon, yycI, yycJ, and yycK, showed no such effect. The concentrations of both YycG and YycF, assayed immunologically, remained unchanged between the wild-type and the yycH strain as determined by immunoassay. Alkaline phosphatase fusion studies showed that YycH is located external to the cell membrane, suggesting that it acts in the regulation of the sensor domain of the YycG sensor histidine kinase. The yycH strain showed a characteristic cell wall defect consistent with the previously suggested notion that the YycF-YycG system is involved in regulating cell wall homeostasis and indicating that either up- or down-regulation of YycF activity affects this homeostatic mechanism

    Is PhoR–PhoP partner fidelity strict? PhoR is required for the activation of the pho regulon in Streptomyces coelicolor

    Get PDF
    Two-component regulatory systems play a key role in the cell metabolism adaptation to changing nutritional and environmental conditions. The fidelity between the two cognate proteins of a two-component system is important since it determines whether a specific response regulator integrates the signals transmitted by different sensor kinases. Phosphate regulation in Streptomyces coelicolor is mostly mediated by the PhoR–PhoP two-component system. Previous studies elucidated the mechanisms that control phosphate regulation as well as the genes directly regulated by the response regulator PhoP (pho regulon) in this organism. However, the role of the histidine kinase PhoR in Streptomyces coelicolor had not been unveiled so far. In this work, we report the characterization of a non-polar ΔphoR deletion mutant in S. coelicolor that keeps its native promoter. Induction of the phoRP operon was dependent upon phosphorylation of PhoP, but the ΔphoR mutant expressed phoP at a basal level. RT-PCR and reporter luciferase assays demonstrated that PhoR plays a key role in the activation of the pho regulon in this organism. Our results point towards a strict cognate partner specificity in terms of the phosphorylation of PhoP by PhoR thus corroborating the tight interaction between the two-components of this system

    Purification and Activity Testing of the Full-Length YycFGHI Proteins of Staphylococcus aureus

    Get PDF
    Background: The YycFG two-component regulatory system (TCS) of Staphylococcus aureus represents the only essential TCS that is almost ubiquitously distributed in Gram-positive bacteria with a low G+C-content. YycG (WalK/VicK) is a sensor histidine-kinase and YycF (WalR/VicR) is the cognate response regulator. Both proteins play an important role in the biosynthesis of the cell envelope and mutations in these proteins have been involved in development of vancomycin and daptomycin resistance. Methodology/Principal Findings: Here we present high yield expression and purification of the full-length YycG and YycF proteins as well as of the auxiliary proteins YycH and YycI of Staphylococcus aureus. Activity tests of the YycG kinase and a mutated version, that harbours an Y306N exchange in its cytoplasmic PAS domain, in a detergent-micelle-model and a phosholipid-liposome-model showed kinase activity (autophosphorylation and phosphoryl group transfer to YycF) only in the presence of elevated concentrations of alkali salts. A direct comparison of the activity of the kinases in the liposomemodel indicated a higher activity of the mutated YycG kinase. Further experiments indicated that YycG responds to fluidity changes in its microenvironment. Conclusions/Significance: The combination of high yield expression, purification and activity testing of membrane and membrane-associated proteins provides an excellent experimental basis for further protein-protein interaction studies an
    corecore