Search CORE

71 research outputs found

Serverification of Molecular Modeling Applications: the Rosetta Online Server that Includes Everyone (ROSIE)

Author: Bonneau Richard
Borgo Benjamin
Chou Fang-Chieh
Conchúir Shane Ó
Das Rhiju
Der Bryan S.
Drew Kevin
Gray Jeffrey J.
Havranek James J.
Kortemme Tanja
Kuhlman Brian
Kuroda Daisuke
Lyskov Sergey
Renfrew P. Douglas
Sripakdeevong Parin
Weitzner Brian D.
Xu Jianqing
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

The Rosetta molecular modeling software package provides experimentally tested and rapidly evolving tools for the 3D structure prediction and high-resolution design of proteins, nucleic acids, and a growing number of non-natural polymers. Despite its free availability to academic users and improving documentation, use of Rosetta has largely remained confined to developers and their immediate collaborators due to the code's difficulty of use, the requirement for large computational resources, and the unavailability of servers for most of the Rosetta applications. Here, we present a unified web framework for Rosetta applications called ROSIE (Rosetta Online Server that Includes Everyone). ROSIE provides (a) a common user interface for Rosetta protocols, (b) a stable application programming interface for developers to add additional protocols, (c) a flexible back-end to allow leveraging of computer cluster resources shared by RosettaCommons member institutions, and (d) centralized administration by the RosettaCommons to ensure continuous maintenance. This paper describes the ROSIE server infrastructure, a step-by-step 'serverification' protocol for use by Rosetta developers, and the deployment of the first nine ROSIE applications by six separate developer teams: Docking, RNA de novo, ERRASER, Antibody, Sequence Tolerance, Supercharge, Beta peptide design, NCBB design, and VIP redesign. As illustrated by the number and diversity of these applications, ROSIE offers a general and speedy paradigm for serverification of Rosetta applications that incurs negligible cost to developers and lowers barriers to Rosetta use for the broader biological community. ROSIE is available at http://rosie.rosettacommons.org

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

Digital Commons@Becker

PubMed Central

Carolina Digital Repository

eScholarship - University of California

Real-Time PyMOL Visualization for Rosetta and PyRosetta

Author: A Leaver-Fay
BR Brooks
Brian D. Weitzner
DC Richardson
Evan H. Baugh
F Frigerioa
GT Johnson
Jeffrey J. Gray
KW Kaufmann
N Guex
R Das
RA Sayle
S Chaudhury
S Cooper
Sergey Lyskov
SI O'Donoghue
SJ Fleishman
T Schwede
Vladimir N. Uversky
W Humphrey
WL DeLano
Y Wang
Publication venue: Public Library of Science
Publication date: 16/08/2011
Field of study

Computational structure prediction and design of proteins and protein-protein complexes have long been inaccessible to those not directly involved in the field. A key missing component has been the ability to visualize the progress of calculations to better understand them. Rosetta is one simulation suite that would benefit from a robust real-time visualization solution. Several tools exist for the sole purpose of visualizing biomolecules; one of the most popular tools, PyMOL (Schrödinger), is a powerful, highly extensible, user friendly, and attractive package. Integrating Rosetta and PyMOL directly has many technical and logistical obstacles inhibiting usage. To circumvent these issues, we developed a novel solution based on transmitting biomolecular structure and energy information via UDP sockets. Rosetta and PyMOL run as separate processes, thereby avoiding many technical obstacles while visualizing information on-demand in real-time. When Rosetta detects changes in the structure of a protein, new coordinates are sent over a UDP network socket to a PyMOL instance running a UDP socket listener. PyMOL then interprets and displays the molecule. This implementation also allows remote execution of Rosetta. When combined with PyRosetta, this visualization solution provides an interactive environment for protein structure prediction and design

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

Better together: Elements of successful scientific software development in a distributed collaborative community

Author: Adolf-Bryfogle Jared
Baker David
Bonneau Richard
Bradley Philip
Bystroff Christopher
Consortium RosettaCommons
Dunbrack Roland
Gray Jeffrey J.
Gront Dominik
Kortemme Tanja
Krys Justyna
Kuhlman Brian
Labonte Jason W.
Leaver-Fay Andrew
Leman Julia Koehler
Lewis Steven M.
Lyskov Sergey
Meiler Jens
Moretti Rocco
Mulligan Vikram Khipple
Renfrew P. Douglas
Schief William
Schueler-Furman Ora
Strauss Charlie E. M.
Watkins Andrew M.
Weitzner Brian D.
Publication venue: Public Library of Science (PLoS)
Publication date: 04/05/2020
Field of study

Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities

Carolina Digital Repository

Benchmarking and Analysis of Protein Docking Performance in Rosetta v3.2

Author: A Leaver-Fay
A Sircar
A Sircar
A Sivasubramanian
A Sivasubramanian
B Efron
B Pierce
B Raveh
Brian D. Weitzner
C Dominguez
C Wang
C Wang
EH Baugh
H Hwang
Hannah Bergman
HM Berman
I Andre
IW Davis
J Mintseris
JA Hermoso
Jeffrey J. Gray
JJ Gray
K Wiehe
KT Simons
M Berrondo
MD Daily
MF Lensink
MF Lensink
Monica Berrondo
Pravin Muthu
R Mendez
R Morales
R Méndez
S Chaudhury
S Chaudhury
S Chaudhury
S Chaudhury
Sidhartha Chaudhury
SJ Fleishman
SJ Fleishman
SR Comeau
Vladimir N. Uversky
WE Bocik
Publication venue: Public Library of Science
Publication date: 02/08/2011
Field of study

RosettaDock has been increasingly used in protein docking and design strategies in order to predict the structure of protein-protein interfaces. Here we test capabilities of RosettaDock 3.2, part of the newly developed Rosetta v3.2 modeling suite, against Docking Benchmark 3.0, and compare it with RosettaDock v2.3, the latest version of the previous Rosetta software package. The benchmark contains a diverse set of 116 docking targets including 22 antibody-antigen complexes, 33 enzyme-inhibitor complexes, and 60 ‘other’ complexes. These targets were further classified by expected docking difficulty into 84 rigid-body targets, 17 medium targets, and 14 difficult targets. We carried out local docking perturbations for each target, using the unbound structures when available, in both RosettaDock v2.3 and v3.2. Overall the performances of RosettaDock v2.3 and v3.2 were similar. RosettaDock v3.2 achieved 56 docking funnels, compared to 49 in v2.3. A breakdown of docking performance by protein complex type shows that RosettaDock v3.2 achieved docking funnels for 63% of antibody-antigen targets, 62% of enzyme-inhibitor targets, and 35% of ‘other’ targets. In terms of docking difficulty, RosettaDock v3.2 achieved funnels for 58% of rigid-body targets, 30% of medium targets, and 14% of difficult targets. For targets that failed, we carry out additional analyses to identify the cause of failure, which showed that binding-induced backbone conformation changes account for a majority of failures. We also present a bootstrap statistical analysis that quantifies the reliability of the stochastic docking results. Finally, we demonstrate the additional functionality available in RosettaDock v3.2 by incorporating small-molecules and non-protein co-factors in docking of a smaller target set. This study marks the most extensive benchmarking of the RosettaDock module to date and establishes a baseline for future research in protein interface modeling and structure prediction

Public Library of Science (PLOS)

Crossref

PubMed Central

I am a United Academic Worker

Author: Brian D. Weitzner
Publication venue: American Association for the Advancement of Science (AAAS)
Publication date: 13/10/2017
Field of study

Crossref

A Framework to Simplify Combined Sampling Strategies in Rosetta.

Author: Brian D Weitzner
Justin R Porter
Oliver F Lange
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2015
Field of study

A core task in computational structural biology is the search of conformational space for low energy configurations of a biological macromolecule. Because conformational space has a very high dimensionality, the most successful search methods integrate some form of prior knowledge into a general sampling algorithm to reduce the effective dimensionality. However, integrating multiple types of constraints can be challenging. To streamline the incorporation of diverse constraints, we developed the Broker: an extension of the Rosetta macromolecular modeling suite that can express a wide range of protocols using constraints by combining small, independent modules, each of which implements a different set of constraints. We demonstrate expressiveness of the Broker through several code vignettes. The framework enables rapid protocol development in both biomolecular design and structural modeling tasks and thus is an important step towards exposing the rich functionality of Rosetta's core libraries to a growing community of users addressing a diverse set of tasks in computational biology

Crossref

Directory of Open Access Journals

Digital Commons@Becker

PubMed Central

The Francis Crick Institute

The Origin of CDR H3 Structural Diversity

Author: Dunbrack Roland L.
Gray Jeffrey J.
Weitzner Brian D.
Publication venue: Elsevier Ltd.
Publication date: 01/02/2015
Field of study

SummaryAntibody complementarity determining region (CDR) H3 loops are critical for adaptive immunological functions. Although the other five CDR loops adopt predictable canonical structures, H3 conformations have proven unclassifiable, other than an unusual C-terminal “kink” present in most antibodies. To determine why the majority of H3 loops are kinked and to learn whether non-antibody proteins have loop structures similar to those of H3, we searched a set of 15,679 high-quality non-antibody structures for regions geometrically similar to the residues immediately surrounding the loop. By incorporating the kink into our search, we identified 1,030 H3-like loops from 632 protein families. Some protein families, including PDZ domains, appear to use the identified region for recognition and binding. Our results suggest that the kink is conserved in the immunoglobulin heavy chain fold because it disrupts the β-strand pairing at the base of the loop. Thus, the kink is a critical driver of the observed structural diversity in CDR H3

Crossref

Elsevier - Publisher Connector

PubMed Central

A computational method for design of connected catalytic networks in proteins

Author: Baker David
Daniel A. Gerard
Hilvert Donald
Kipnis Yakov
Weitzner Brian D.
Publication venue: 'Wiley'
Publication date: 01/12/2019
Field of study

Computational design of new active sites has generally proceeded by geometrically defining interactions between the reaction transition state(s) and surrounding side‐chain functional groups which maximize transition‐state stabilization, and then searching for sites in protein scaffolds where the specified side‐chain–transition‐state interactions can be realized. A limitation of this approach is that the interactions between the side chains themselves are not constrained. An extensive connected hydrogen bond network involving the catalytic residues was observed in a designed retroaldolase following directed evolution. Such connected networks could increase catalytic activity by preorganizing active site residues in catalytically competent orientations, and enabling concerted interactions between side chains during catalysis, for example, proton shuffling. We developed a method for designing active sites in which the catalytic side chains, in addition to making interactions with the transition state, are also involved in extensive hydrogen bond networks. Because of the added constraint of hydrogen‐bond connectivity between the catalytic side chains, to find solutions, a wider range of interactions between these side chains and the transition state must be considered. Our new method starts from a ChemDraw‐like two‐dimensional representation of the transition state with hydrogen‐bond donors, acceptors, and covalent interaction sites indicated, and all placements of side‐chain functional groups that make the indicated interactions with the transition state, and are fully connected in a single hydrogen‐bond network are systematically enumerated. The RosettaMatch method can then be used to identify realizations of these fully‐connected active sites in protein scaffolds. The method generates many fully‐connected active site solutions for a set of model reactions that are promising starting points for the design of fully‐preorganized enzyme catalysts.ISSN:0961-8368ISSN:1469-896

Repository for Publications and Research Data

Crossref

Blind prediction performance of RosettaAntibody 3.0: Grafting, relaxation, kinematic loop modeling, and full CDR optimization

Author: Brian D. Weitzner
Daisuke Kuroda
Jeffrey J. Gray
Jianqing Xu
Nicholas Marze
Publication venue: Wiley
Publication date: 31/03/2014
Field of study

Antibody Modeling Assessment II (AMA-II) provided an opportunity to benchmark RosettaAntibody on a set of eleven unpublished antibody structures. RosettaAntibody produced accurate, physically realistic models, with all framework regions and 42 of the 55 non-H3 CDR loops predicted to under an Ångström. The performance is notable when modeling H3 on a homology framework, where RosettaAntibody produced the best model among all participants for four of the eleven targets, two of which were predicted with sub-Ångström accuracy. To improve RosettaAntibody, we pursued the causes of model errors. The most common limitation was template unavailability, underscoring the need for more antibody structures and/or better de novo loop methods. In some cases, better templates could have been found by considering residues outside of the CDRs. De novo CDR H3 modeling remains challenging at long loop lengths, but constraining the C-terminal end of H3 to a kinked conformation allows near-native conformations to be sampled more frequently. We also found that incorrect V(L)–V(H) orientations caused models with low H3 RMSDs to score poorly, suggesting that correct V(L)–V(H) orientations will improve discrimination between near-native and incorrect conformations. These observations will guide the future development of RosettaAntibody

Crossref

PubMed Central