203 research outputs found
Anisotropic coarse-grained statistical potentials improve the ability to identify native-like protein structures
We present a new method to extract distance and orientation dependent
potentials between amino acid side chains using a database of protein
structures and the standard Boltzmann device. The importance of orientation
dependent interactions is first established by computing orientational order
parameters for proteins with alpha-helical and beta-sheet architecture.
Extraction of the anisotropic interactions requires defining local reference
frames for each amino acid that uniquely determine the coordinates of the
neighboring residues. Using the local reference frames and histograms of the
radial and angular correlation functions for a standard set of non-homologue
protein structures, we construct the anisotropic pair potentials. The
performance of the orientation dependent potentials was studied using a large
database of decoy proteins. The results demonstrate that the new distance and
orientation dependent residue-residue potentials present a significantly
improved ability to recognize native folds from a set of native and decoy
protein structures.Comment: Submitted to "The Journal of Chemical Physics
Potentials of Mean Force for Protein Structure Prediction Vindicated, Formalized and Generalized
Understanding protein structure is of crucial importance in science, medicine
and biotechnology. For about two decades, knowledge based potentials based on
pairwise distances -- so-called "potentials of mean force" (PMFs) -- have been
center stage in the prediction and design of protein structure and the
simulation of protein folding. However, the validity, scope and limitations of
these potentials are still vigorously debated and disputed, and the optimal
choice of the reference state -- a necessary component of these potentials --
is an unsolved problem. PMFs are loosely justified by analogy to the reversible
work theorem in statistical physics, or by a statistical argument based on a
likelihood function. Both justifications are insightful but leave many
questions unanswered. Here, we show for the first time that PMFs can be seen as
approximations to quantities that do have a rigorous probabilistic
justification: they naturally arise when probability distributions over
different features of proteins need to be combined. We call these quantities
reference ratio distributions deriving from the application of the reference
ratio method. This new view is not only of theoretical relevance, but leads to
many insights that are of direct practical use: the reference state is uniquely
defined and does not require external physical insights; the approach can be
generalized beyond pairwise distances to arbitrary features of protein
structure; and it becomes clear for which purposes the use of these quantities
is justified. We illustrate these insights with two applications, involving the
radius of gyration and hydrogen bonding. In the latter case, we also show how
the reference ratio method can be iteratively applied to sculpt an energy
funnel. Our results considerably increase the understanding and scope of energy
functions derived from known biomolecular structures
Knowledge-based energy functions for computational studies of proteins
This chapter discusses theoretical framework and methods for developing
knowledge-based potential functions essential for protein structure prediction,
protein-protein interaction, and protein sequence design. We discuss in some
details about the Miyazawa-Jernigan contact statistical potential,
distance-dependent statistical potentials, as well as geometric statistical
potentials. We also describe a geometric model for developing both linear and
non-linear potential functions by optimization. Applications of knowledge-based
potential functions in protein-decoy discrimination, in protein-protein
interactions, and in protein design are then described. Several issues of
knowledge-based potential functions are finally discussed.Comment: 57 pages, 6 figures. To be published in a book by Springe
Comprehensive computational analysis of Hmd enzymes and paralogs in methanogenic Archaea
<p>Abstract</p> <p>Background</p> <p>Methanogenesis is the sole means of energy production in methanogenic Archaea. H<sub>2</sub>-forming methylenetetrahydromethanopterin dehydrogenase (Hmd) catalyzes a step in the hydrogenotrophic methanogenesis pathway in class I methanogens. At least one <it>hmd </it>paralog has been identified in nine of the eleven complete genome sequences of class I hydrogenotrophic methanogens. The products of these paralog genes have thus far eluded any detailed functional characterization.</p> <p>Results</p> <p>Here we present a thorough computational analysis of Hmd enzymes and paralogs that includes state of the art phylogenetic inference, structure prediction, and functional site prediction techniques. We determine that the Hmd enzymes are phylogenetically distinct from Hmd paralogs but share a common overall structure. We predict that the active site of the Hmd enzyme is conserved as a functional site in Hmd paralogs and use this observation to propose possible molecular functions of the paralog that are consistent with previous experimental evidence. We also identify an uncharacterized site in the N-terminal domains of both proteins that is predicted by our methods to directly impart function.</p> <p>Conclusion</p> <p>This study contributes to our understanding of the evolutionary history, structural conservation, and functional roles, of the Hmd enzymes and paralogs. The results of our phylogenetic and structural analysis constitute datasets that will aid in the future study of the Hmd protein family. Our functional site predictions generate several testable hypotheses that will guide further experimental characterization of the Hmd paralog. This work also represents a novel approach to protein function prediction in which multiple computational methods are integrated to achieve a detailed characterization of proteins that are not well understood.</p
The evolution and functional repertoire of translation proteins following the origin of life
<p>Abstract</p> <p>Background</p> <p>The RNA world hypothesis posits that the earliest genetic system consisted of informational RNA molecules that directed the synthesis of modestly functional RNA molecules. Further evidence suggests that it was within this RNA-based genetic system that life developed the ability to synthesize proteins by translating genetic code. Here we investigate the early development of the translation system through an evolutionary survey of protein architectures associated with modern translation.</p> <p>Results</p> <p>Our analysis reveals a structural expansion of translation proteins immediately following the RNA world and well before the establishment of the DNA genome. Subsequent functional annotation shows that representatives of the ten most ancestral protein architectures are responsible for all of the core protein functions found in modern translation.</p> <p>Conclusions</p> <p>We propose that this early robust translation system evolved by virtue of a positive feedback cycle in which the system was able to create increasingly complex proteins to further enhance its own function.</p> <p>Reviewers</p> <p>This article was reviewed by Janet Siefert, George Fox, and Antonio Lazcano (nominated by Laura Landweber)</p
Neuer Kopf, alte Ideen? : "Normalisierung" des Front National unter Marine Le Pen
In this article, it is investigated
whether vibrational entropy
(VE) is an important contribution to the free energy of globular proteins
at ambient conditions. VE represents the major configurational-entropy
contribution of these proteins. By definition, it is an average of
the configurational entropies of the protein within single minima
of the energy landscape, weighted by their occupation probabilities.
Its large part originates from thermal motion of flexible torsion
angles giving rise to the finite peak widths observed in torsion angle
distributions. While VE may affect the equilibrium properties of proteins,
it is usually neglected in numerical calculations as its consideration
is difficult. Moreover, it is sometimes believed that all well-packed
conformations of a globular protein have similar VE anyway. Here, we measure explicitly the VE for six different conformations from simulation data of a test protein. Estimates
are obtained using the quasi-harmonic approximation for three coordinate
sets, Cartesian, bond-angle-torsion (BAT), and a new set termed rotamer-degeneracy
lifted BAT coordinates by us. The new set gives improved estimates
as it overcomes a known shortcoming of the quasi-harmonic approximation
caused by multiply populated rotamer states, and it may serve for
VE estimation of macromolecules in a very general context. The obtained
VE values depend considerably on the type of coordinates used. However,
for all coordinate sets we find large entropy differences between
the conformations, of the order of the overall stability of the protein.
This result may have important implications on the choice of free
energy expressions used in software for protein structure prediction,
protein design, and NMR refinement
An Estimate of the Numbers and Density of Low-Energy Structures (or Decoys) in the Conformational Landscape of Proteins
The conformational energy landscape of a protein, as calculated by known potential energy functions, has several minima, and one of these corresponds to its native structure. It is however difficult to comprehensively estimate the actual numbers of low energy structures (or decoys), the relationships between them, and how the numbers scale with the size of the protein.We have developed an algorithm to rapidly and efficiently identify the low energy conformers of oligo peptides by using mutually orthogonal Latin squares to sample the potential energy hyper surface. Using this algorithm, and the ECEPP/3 potential function, we have made an exhaustive enumeration of the low-energy structures of peptides of different lengths, and have extrapolated these results to larger polypeptides.We show that the number of native-like structures for a polypeptide is, in general, an exponential function of its sequence length. The density of these structures in conformational space remains more or less constant and all the increase appears to come from an expansion in the volume of the space. These results are consistent with earlier reports that were based on other models and techniques
Using neural networks and evolutionary information in decoy discrimination for protein tertiary structure prediction
Background: We present a novel method of protein fold decoy discrimination using machine learning, more specifically using neural networks. Here, decoy discrimination is represented as a machine learning problem, where neural networks are used to learn the native-like features of protein structures using a set of positive and negative training examples. A set of native protein structures provides the positive training examples, while negative training examples are simulated decoy structures obtained by reversing the sequences of native structures. Various features are extracted from the training dataset of positive and negative examples and used as inputs to the neural networks.Results: Results have shown that the best performing neural network is the one that uses input information comprising of PSI-BLAST [1] profiles of residue pairs, pairwise distance and the relative solvent accessibilities of the residues. This neural network is the best among all methods tested in discriminating the native structure from a set of decoys for all decoy datasets tested. Conclusion: This method is demonstrated to be viable, and furthermore evolutionary information is successfully used in the neural networks to improve decoy discrimination
LoCo: a novel main chain scoring function for protein structure prediction based on local coordinates
Protein mimetic amyloid inhibitor potently abrogates cancer-associated mutant p53 aggregation and restores tumor suppressor function
Missense mutations in p53 are severely deleterious and occur in over 50% of all human cancers. The majority of these mutations are located in the inherently unstable DNA-binding domain (DBD), many of which destabilize the domain further and expose its aggregation-prone hydrophobic core, prompting self-assembly of mutant p53 into inactive cytosolic amyloid-like aggregates. Screening an oligopyridylamide library, previously shown to inhibit amyloid formation associated with Alzheimer\u2019s disease and type II diabetes, identified a tripyridylamide, ADH-6, that abrogates self-assembly of the aggregation-nucleating subdomain of mutant p53 DBD. Moreover, ADH-6 targets and dissociates mutant p53 aggregates in human cancer cells, which restores p53\u2019s transcriptional activity, leading to cell cycle arrest and apoptosis. Notably, ADH-6 treatment effectively shrinks xenografts harboring mutant p53, while exhibiting no toxicity to healthy tissue, thereby substantially prolonging survival. This study demonstrates the successful application of a bona fide small-molecule amyloid inhibitor as a potent\ua0anticancer agent
- …
