104 research outputs found
Computational protein design quantifies structural constraints on amino acid covariation.
Amino acid covariation, where the identities of amino acids at different sequence positions are correlated, is a hallmark of naturally occurring proteins. This covariation can arise from multiple factors, including selective pressures for maintaining protein structure, requirements imposed by a specific function, or from phylogenetic sampling bias. Here we employed flexible backbone computational protein design to quantify the extent to which protein structure has constrained amino acid covariation for 40 diverse protein domains. We find significant similarities between the amino acid covariation in alignments of natural protein sequences and sequences optimized for their structures by computational protein design methods. These results indicate that the structural constraints imposed by protein architecture play a dominant role in shaping amino acid covariation and that computational protein design methods can capture these effects. We also find that the similarity between natural and designed covariation is sensitive to the magnitude and mechanism of backbone flexibility used in computational protein design. Our results thus highlight the necessity of including backbone flexibility to correctly model precise details of correlated amino acid changes and give insights into the pressures underlying these correlations
Xist recruits the X chromosome to the nuclear lamina to enable chromosome-wide silencing
The Xist long noncoding RNA orchestrates X chromosome inactivation, a process that entails chromosome-wide silencing and remodeling of the three-dimensional (3D) structure of the X chromosome. Yet, it remains unclear whether these changes in nuclear structure are mediated by Xist and whether they are required for silencing. Here, we show that Xist directly interacts with the Lamin B receptor, an integral component of the nuclear lamina, and that this interaction is required for Xist-mediated silencing by recruiting the inactive X to the nuclear lamina and by doing so enables Xist to spread to actively transcribed genes across the X. Our results demonstrate that lamina recruitment changes the 3D structure of DNA, enabling Xist and its silencing proteins to spread across the X to silence transcription
Structural analysis of the evolution of steroid specificity in the mineralocorticoid and glucocorticoid receptors
BACKGROUND: The glucocorticoid receptor (GR) and mineralocorticoid receptor (MR) evolved from a common ancestor. Still not completely understood is how specificity for glucocorticoids (e.g. cortisol) and mineralocorticoids (e.g. aldosterone) evolved in these receptors. RESULTS: Our analysis of several vertebrate GRs and MRs in the context of 3D structures of human GR and MR indicates that with the exception of skate GR, a cartilaginous fish, there is a deletion in all GRs, at the position corresponding to Ser-949 in human MR. This deletion occurs in a loop before helix 12, which contains the activation function 2 (AF2) domain, which binds coactivator proteins and influences transcriptional activity of steroids. Unexpectedly, we find that His-950 in human MR, which is conserved in the MR in chimpanzee, orangutan and macaque, is glutamine in all teleost and land vertebrate MRs, including New World monkeys and prosimians. CONCLUSION: Evolution of differences in the responses of the GR and MR to corticosteroids involved deletion in the GR of a residue corresponding to Ser-949 in human MR. A mutation corresponding to His-950 in human MR may have been important in physiological changes associated with emergence of Old World monkeys from prosimians
Chapter Four Flexible Backbone Sampling Methods to Model and Design Protein Alternative Conformations
Sampling alternative conformations is key to understanding how proteins work and engineering them for new functions. However, accurately characterizing and modeling protein conformational ensembles remain experimentally and computationally challenging. These challenges must be met before protein conformational heterogeneity can be exploited in protein engineering and design. Here, as a stepping stone, we describe methods to detect alternative conformations in proteins and strategies to model these near-native conformational changes based on backrub-type Monte Carlo moves in Rosetta. We illustrate how Rosetta simulations that apply backrub moves improve modeling of point mutant side-chain conformations, native side-chain conformational heterogeneity, functional conformational changes, tolerated sequence space, protein interaction specificity, and amino acid covariation across protein-protein interfaces. We include relevant Rosetta command lines and RosettaScripts to encourage the application of these types of simulations to other systems. Our work highlights that critical scoring and sampling improvements will be necessary to approximate conformational landscapes. Challenges for the future development of these methods include modeling conformational changes that propagate away from designed mutation sites and modulating backbone flexibility to predictively design functionally important conformational heterogeneity
Long non-coding RNAs: spatial amplifiers that control nuclear structure and gene expression
Over the past decade, it has become clear that mammalian genomes encode thousands of long non-coding RNAs (lncRNAs), many of which are now implicated in diverse biological processes. Recent work studying the molecular mechanisms of several key examples — including Xist, which orchestrates X chromosome inactivation — has provided new insights into how lncRNAs can control cellular functions by acting in the nucleus. Here we discuss emerging mechanistic insights into how lncRNAs can regulate gene expression by coordinating regulatory proteins, localizing to target loci and shaping three-dimensional (3D) nuclear organization. We explore these principles to highlight biological challenges in gene regulation, in which lncRNAs are well-suited to perform roles that cannot be carried out by DNA elements or protein regulators alone, such as acting as spatial amplifiers of regulatory signals in the nucleus
Higher-Order Inter-chromosomal Hubs Shape 3D Genome Organization in the Nucleus
Eukaryotic genomes are packaged into a 3-dimensional structure in the nucleus. Current methods for studying genome-wide structure are based on proximity ligation. However, this approach can fail to detect known structures, such as interactions with nuclear bodies, because these DNA regions can be too far apart to directly ligate. Accordingly, our overall understanding of genome organization remains incomplete. Here, we develop split-pool recognition of interactions by tag extension (SPRITE), a method that enables genome-wide detection of higher-order interactions within the nucleus. Using SPRITE, we recapitulate known structures identified by proximity ligation and identify additional interactions occurring across larger distances, including two hubs of inter-chromosomal interactions that are arranged around the nucleolus and nuclear speckles. We show that a substantial fraction of the genome exhibits preferential organization relative to these nuclear bodies. Our results generate a global model whereby nuclear bodies act as inter-chromosomal hubs that shape the overall packaging of DNA in the nucleus
Amino-acid site variability among natural and designed proteins
Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability
A single-cell method to map higher-order 3D genome organization in thousands of individual cells reveals structural heterogeneity in mouse ES cells
In eukaryotes, the nucleus is organized into a three dimensional structure consisting of both local interactions such as those between enhancers and promoters, and long-range higher-order structures such as nuclear bodies. This organization is central to many aspects of nuclear function, including DNA replication, transcription, and cell cycle progression. Nuclear structure intrinsically occurs within single cells; however, measuring such a broad spectrum of 3D DNA interactions on a genome-wide scale and at the single cell level has been a great challenge. To address this, we developed single-cell split-pool recognition of interactions by tag extension (scSPRITE), a new method that enables measurements of genome-wide maps of 3D DNA structure in thousands of individual nuclei. scSPRITE maximizes the number of DNA contacts detected per cell enabling high-resolution genome structure maps within each cells and is easy-to-use and cost-effective. scSPRITE accurately detects chromosome territories, active and inactive compartments, topologically associating domains (TADs), and higher-order structures within single cells. In addition, scSPRITE measures cell-to-cell heterogeneity in genome structure at different levels of resolution and shows that TADs are dynamic units of genome organization that can vary between different cells within a population. scSPRITE will improve our understanding of nuclear architecture and its relationship to nuclear function within an individual nucleus from complex cell types and tissues containing a diverse population of cells
RNA promotes the formation of spatial compartments in the nucleus
The nucleus is a highly organized arrangement of RNA, DNA, and protein molecules that are compartmentalized within three-dimensional (3D) structures involved in shared functional and regulatory processes. Although RNA has long been proposed to play a global role in organizing nuclear structure, exploring the role of RNA in shaping nuclear structure has remained a challenge because no existing methods can simultaneously measure RNA-RNA, RNA-DNA, and DNA-DNA contacts within 3D structures. To address this, we developed RNA & DNA SPRITE (RD-SPRITE) to comprehensively map the location of all RNAs relative to DNA and other RNAs. Using this approach, we identify many RNAs that are localized near their transcriptional loci (RNA-DNA) together with other diffusible ncRNAs (RNA-RNA) within higher-order DNA structures (DNA-DNA). These RNA-chromatin compartments span three major classes of nuclear functions: RNA processing (including ribosome biogenesis, mRNA splicing, snRNA biogenesis, and histone mRNA processing), heterochromatin assembly, and gene regulation. More generally, we identify hundreds of ncRNAs that form stable nuclear compartments in spatial proximity to their transcriptional loci. We find that dozens of nuclear compartments require RNA to guide protein regulators into these 3D structures, and focusing on several ncRNAs, we show that these ncRNAs specifically regulate heterochromatin assembly and the expression of genes contained within these compartments. Together, our results demonstrate a unique mechanism by which RNA acts to shape nuclear structure by forming high concentration territories immediately upon transcription, binding to diffusible regulators, and guiding them into spatial compartments to regulate a wide range of essential nuclear functions
- …
