325 research outputs found
Inter-protein sequence co-evolution predicts known physical interactions in bacterial ribosomes and the trp operon
Interaction between proteins is a fundamental mechanism that underlies
virtually all biological processes. Many important interactions are conserved
across a large variety of species. The need to maintain interaction leads to a
high degree of co-evolution between residues in the interface between partner
proteins. The inference of protein-protein interaction networks from the
rapidly growing sequence databases is one of the most formidable tasks in
systems biology today. We propose here a novel approach based on the
Direct-Coupling Analysis of the co-evolution between inter-protein residue
pairs. We use ribosomal and trp operon proteins as test cases: For the small
resp. large ribosomal subunit our approach predicts protein-interaction
partners at a true-positive rate of 70% resp. 90% within the first 10
predictions, with areas of 0.69 resp. 0.81 under the ROC curves for all
predictions. In the trp operon, it assigns the two largest interaction scores
to the only two interactions experimentally known. On the level of residue
interactions we show that for both the small and the large ribosomal subunit
our approach predicts interacting residues in the system with a true positive
rate of 60% and 85% in the first 20 predictions. We use artificial data to show
that the performance of our approach depends crucially on the size of the joint
multiple sequence alignments and analyze how many sequences would be necessary
for a perfect prediction if the sequences were sampled from the same model that
we use for prediction. Given the performance of our approach on the test data
we speculate that it can be used to detect new interactions, especially in the
light of the rapid growth of available sequence data
Large-scale identification of coevolution signals across homo-oligomeric protein interfaces by Direct Coupling Analysis
Proteins have evolved to perform diverse cellular functions, from serving as
reaction catalysts to coordinating cellular propagation and development.
Frequently, proteins do not exert their full potential as monomers but rather
undergo concerted interactions as either homo-oligomers or with other proteins
as hetero-oligomers. The experimental study of such protein complexes and
interactions has been arduous. Theoretical structure prediction methods are an
attractive alternative. Here, we investigate homo-oligomeric interfaces by
tracing residue coevolution via the global statistical Direct Coupling Analysis
(DCA). DCA can accurately infer spatial adjacencies between residues. These
adjacencies can be included as constraints in structure-prediction techniques
to predict high-resolution models. By taking advantage of the on-going
exponential growth of sequence databases, we go significantly beyond anecdotal
cases of a few protein families and apply DCA to a systematic large-scale study
of nearly 2000 PFAM protein families with sufficient sequence information and
structurally resolved homo-oligomeric interfaces. We find that large interfaces
are commonly identified by DCA. We further demonstrate that DCA can
differentiate between subfamilies of different binding modes within one large
PFAM family. Sequence derived contact information for the subfamilies proves
sufficient to assemble accurate structural models of the diverse
protein-oligomers. Thus, we provide an approach to investigate oligomerization
for arbitrary protein families leading to structural models complementary to
often difficult experimental methods. Combined with ever more abundant
sequential data, we anticipate that this study will be instrumental to allow
the structural description of many hetero-protein complexes in the future.Comment: 18 pages, 6 figures, to appears in PNA
Identification of direct residue contacts in protein-protein interaction by message passing
Understanding the molecular determinants of specificity in protein-protein
interaction is an outstanding challenge of postgenome biology. The availability
of large protein databases generated from sequences of hundreds of bacterial
genomes enables various statistical approaches to this problem. In this context
covariance-based methods have been used to identify correlation between amino
acid positions in interacting proteins. However, these methods have an
important shortcoming, in that they cannot distinguish between directly and
indirectly correlated residues. We developed a method that combines covariance
analysis with global inference analysis, adopted from use in statistical
physics. Applied to a set of >2,500 representatives of the bacterial
two-component signal transduction system, the combination of covariance with
global inference successfully and robustly identified residue pairs that are
proximal in space without resorting to ad hoc tuning parameters, both for
heterointeractions between sensor kinase (SK) and response regulator (RR)
proteins and for homointeractions between RR proteins. The spectacular success
of this approach illustrates the effectiveness of the global inference approach
in identifying direct interaction based on sequence information alone. We
expect this method to be applicable soon to interaction surfaces between
proteins present in only 1 copy per genome as the number of sequenced genomes
continues to expand. Use of this method could significantly increase the
potential targets for therapeutic intervention, shed light on the mechanism of
protein-protein interaction, and establish the foundation for the accurate
prediction of interacting protein partners.Comment: Supplementary information available on
http://www.pnas.org/content/106/1/67.abstrac
Dissecting the Specificity of Protein-Protein Interaction in Bacterial Two-Component Signaling: Orphans and Crosstalks
Predictive understanding of the myriads of signal transduction pathways in a
cell is an outstanding challenge of systems biology. Such pathways are
primarily mediated by specific but transient protein-protein interactions,
which are difficult to study experimentally. In this study, we dissect the
specificity of protein-protein interactions governing two-component signaling
(TCS) systems ubiquitously used in bacteria. Exploiting the large number of
sequenced bacterial genomes and an operon structure which packages many pairs
of interacting TCS proteins together, we developed a computational approach to
extract a molecular interaction code capturing the preferences of a small but
critical number of directly interacting residue pairs. This code is found to
reflect physical interaction mechanisms, with the strongest signal coming from
charged amino acids. It is used to predict the specificity of TCS interaction:
Our results compare favorably to most available experimental results, including
the prediction of 7 (out of 8 known) interaction partners of orphan signaling
proteins in Caulobacter crescentus. Surveying among the available bacterial
genomes, our results suggest 15~25% of the TCS proteins could participate in
out-of-operon "crosstalks". Additionally, we predict clusters of crosstalking
candidates, expanding from the anecdotally known examples in model organisms.
The tools and results presented here can be used to guide experimental studies
towards a system-level understanding of two-component signaling.Comment: Supplementary information available on
http://www.plosone.org/article/info:doi/10.1371/journal.pone.001972
Inter-residue, inter-protein and inter-family coevolution: bridging the scales
Interacting proteins coevolve at multiple but interconnected scales, from the
residue-residue over the protein-protein up to the family-family level. The
recent accumulation of enormous amounts of sequence data allows for the
development of novel, data-driven computational approaches. Notably, these
approaches can bridge scales within a single statistical framework. While being
currently applied mostly to isolated problems on single scales, their immense
potential for an evolutionary informed, structural systems biology is steadily
emerging.Comment: 12 pages, review pape
Mechanism of activation and autophosphorylation of a histidine kinase
Histidine kinases (HK) are one of the main prokaryotic signaling systems. Two structurally conserved catalytic domains inside the HK enable autokinase, phosphotransfer, and phosphatase activities. Here, we focus on a detailed mechanistic understanding of the functional cycle of the WalK HK by a multi-scale simulation approach, consisting of classical as well as hybrid QM/MM molecular dynamics simulation. Strikingly, a conformational transition induced solely in DHp leads to the correct activated conformation in CA crucial for autophosphorylation. This finding explains how variable sensor domains induce the transition from inactive to active state. The subsequent autophosphorylation inside DHp proceeds via a penta-coordinated transition state to a protonated phosphohistidine intermediate. This intermediate is consequently deprotonated by a suitable nearby base. The reaction energetics are controlled by the final proton acceptor and presence of a magnesium cation. The slow rates of the process result from the high energy barrier of the conformational transition between inactive and active states. The phosphorylation step exhibits a lower barrier and down-the-hill energetics. Thus, our work suggests a detailed mechanistic model for HK autophosphorylation
Coevolved mutations reveal distinct architectures for two core proteins in the bacterial flagellar motor
Switching of bacterial flagellar rotation is caused by large domain movements of the FliG protein triggered by binding of the signal protein CheY to FliM. FliG and FliM form adjacent multi-subunit arrays within the basal body C-ring. The movements alter the interaction of the FliG C-terminal (FliGC) "torque" helix with the stator complexes. Atomic models based on the Salmonella entrovar C-ring electron microscopy reconstruction have implications for switching, but lack consensus on the relative locations of the FliG armadillo (ARM) domains (amino-terminal (FliGN), middle (FliGM) and FliGC) as well as changes during chemotaxis. The generality of the Salmonella model is challenged by the variation in motor morphology and response between species. We studied coevolved residue mutations to determine the unifying elements of switch architecture. Residue interactions, measured by their coevolution, were formalized as a network, guided by structural data. Our measurements reveal a common design with dedicated switch and motor modules. The FliM middle domain (FliMM) has extensive connectivity most simply explained by conserved intra and inter-subunit contacts. In contrast, FliG has patchy, complex architecture. Conserved structural motifs form interacting nodes in the coevolution network that wire FliMM to the FliGC C-terminal, four-helix motor module (C3-6). FliG C3-6 coevolution is organized around the torque helix, differently from other ARM domains. The nodes form separated, surface-proximal patches that are targeted by deleterious mutations as in other allosteric systems. The dominant node is formed by the EHPQ motif at the FliMMFliGM contact interface and adjacent helix residues at a central location within FliGM. The node interacts with nodes in the N-terminal FliGc α-helix triad (ARM-C) and FliGN. ARM-C, separated from C3-6 by the MFVF motif, has poor intra-network connectivity consistent with its variable orientation revealed by structural data. ARM-C could be the convertor element that provides mechanistic and species diversity.JK was supported by Medical Research Council grant U117581331. SK was supported by seed funds from Lahore University of Managment Sciences (LUMS) and the Molecular Biology Consortium
YycH Regulates the Activity of the Essential YycFG Two-Component System in Bacillus subtilis
Of the numerous two-component signal transduction systems found in bacteria, only a very few have proven to be essential for cell viability. Among these is the YycF (response regulator)-YycG (histidine kinase) system, which is highly conserved in and specific to the low-G+C content gram-positive bacteria. Given the pathogenic nature of several members of this class of bacteria, the YycF-YycG system has been suggested as a prime antimicrobial target. In an attempt to identify genes involved in regulation of this two-component system, a transposon mutagenesis study was designed to identify suppressors of a temperature-sensitive YycF mutant in Bacillus subtilis. Suppressors could be identified, and the prime target was the yycH gene located adjacent to yycG and within the same operon. A lacZ reporter assay revealed that YycF-regulated gene expression was elevated in a yycH strain, whereas disruption of any of the three downstream genes within the operon, yycI, yycJ, and yycK, showed no such effect. The concentrations of both YycG and YycF, assayed immunologically, remained unchanged between the wild-type and the yycH strain as determined by immunoassay. Alkaline phosphatase fusion studies showed that YycH is located external to the cell membrane, suggesting that it acts in the regulation of the sensor domain of the YycG sensor histidine kinase. The yycH strain showed a characteristic cell wall defect consistent with the previously suggested notion that the YycF-YycG system is involved in regulating cell wall homeostasis and indicating that either up- or down-regulation of YycF activity affects this homeostatic mechanism
Is PhoR–PhoP partner fidelity strict? PhoR is required for the activation of the pho regulon in Streptomyces coelicolor
Two-component regulatory systems play a key role in the cell metabolism adaptation to changing nutritional and environmental conditions. The fidelity between the two cognate proteins of a two-component system is important since it determines whether a specific response regulator integrates the signals transmitted by different sensor kinases. Phosphate regulation in Streptomyces coelicolor is mostly mediated by the PhoR–PhoP two-component system. Previous studies elucidated the mechanisms that control phosphate regulation as well as the genes directly regulated by the response regulator PhoP (pho regulon) in this organism. However, the role of the histidine kinase PhoR in Streptomyces coelicolor had not been unveiled so far. In this work, we report the characterization of a non-polar ΔphoR deletion mutant in S. coelicolor that keeps its native promoter. Induction of the phoRP operon was dependent upon phosphorylation of PhoP, but the ΔphoR mutant expressed phoP at a basal level. RT-PCR and reporter luciferase assays demonstrated that PhoR plays a key role in the activation of the pho regulon in this organism. Our results point towards a strict cognate partner specificity in terms of the phosphorylation of PhoP by PhoR thus corroborating the tight interaction between the two-components of this system
Purification and Activity Testing of the Full-Length YycFGHI Proteins of Staphylococcus aureus
Background: The YycFG two-component regulatory system (TCS) of Staphylococcus aureus represents the only essential TCS that is almost ubiquitously distributed in Gram-positive bacteria with a low G+C-content. YycG (WalK/VicK) is a sensor histidine-kinase and YycF (WalR/VicR) is the cognate response regulator. Both proteins play an important role in the biosynthesis of the cell envelope and mutations in these proteins have been involved in development of vancomycin and daptomycin resistance. Methodology/Principal Findings: Here we present high yield expression and purification of the full-length YycG and YycF proteins as well as of the auxiliary proteins YycH and YycI of Staphylococcus aureus. Activity tests of the YycG kinase and a mutated version, that harbours an Y306N exchange in its cytoplasmic PAS domain, in a detergent-micelle-model and a phosholipid-liposome-model showed kinase activity (autophosphorylation and phosphoryl group transfer to YycF) only in the presence of elevated concentrations of alkali salts. A direct comparison of the activity of the kinases in the liposomemodel indicated a higher activity of the mutated YycG kinase. Further experiments indicated that YycG responds to fluidity changes in its microenvironment. Conclusions/Significance: The combination of high yield expression, purification and activity testing of membrane and membrane-associated proteins provides an excellent experimental basis for further protein-protein interaction studies an
- …
