Folding the Carpenter's Tape: Boundary Layer Effects
Abstract
The “carpenter’s measuring tape” is a thin spring-steel strip, preformed to a curved cross section of radius R, which is straight when being used for measuring. Under bending moments, it forms a localized hinge, in which the transverse curvature is suppressed, and the longitudinal radius r is approximately equal to R. Rimrott made a simple strain energy analysis of the hinge region for isotropic material, which predicted that r = R. Both experimental observations and finite element computations show that ξ = r/R > 1, where the value of ξ exceeds unity by up to 15%, depending on whether the tape is bent in “equal-sense” or “opposite-sense” curvature; ξ varies linearly with Poisson’s ratio in both cases. We make a minor change to Rimrott’s analysis by introducing a boundary layer, in order to better satisfy the physical conditions at the free edges; this successfully accounts for the observed behavior of the tape.
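As a rough illustration of the simple strain-energy argument mentioned above, the following sketch reconstructs a standard plate-bending version of it. It is not taken from the paper: the flexural rigidity D, the cross-sectional arc width b, and the fold angle θ are assumed symbols, and stretching energy is neglected.

```latex
% Assumed notation (not in the abstract): D = flexural rigidity,
% \nu = Poisson's ratio, b = arc width of the cross section, \theta = fold angle.
% Curvature changes in the hinge: longitudinal 0 -> 1/r, transverse 1/R -> 0.
\begin{align}
  u &= \frac{D}{2}\left(\frac{1}{r^{2}} \pm \frac{2\nu}{rR} + \frac{1}{R^{2}}\right)
     && \text{bending energy per unit area; the sign depends on the sense of bending} \\
  U &= u\,(r\theta)\,b
     = \frac{D\theta b}{2}\left(\frac{1}{r} \pm \frac{2\nu}{R} + \frac{r}{R^{2}}\right)
     && \text{total energy of a hinge of arc length } r\theta \\
  \frac{\partial U}{\partial r}
    &= \frac{D\theta b}{2}\left(-\frac{1}{r^{2}} + \frac{1}{R^{2}}\right) = 0
     \quad\Longrightarrow\quad r = R
\end{align}
```

In this sketch the Poisson term does not depend on r, so the minimum sits at r = R for any ν. That is consistent with the abstract's point that the simple analysis cannot explain the observed dependence of ξ on Poisson's ratio.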
A stitch in time: Efficient computation of genomic DNA melting bubbles
Background: It is of biological interest to make genome-wide predictions of
the locations of DNA melting bubbles using statistical mechanics models.
Computationally, this poses the challenge that a generic search through all
combinations of bubble starts and ends is quadratic.
Results: An efficient algorithm is described, which shows that the time
complexity of the task is O(N log N) rather than quadratic. The algorithm
exploits the fact that bubble lengths may be limited, but without a prior assumption of
a maximal bubble length. No approximations, such as windowing, have been
introduced to reduce the time complexity. More than just finding the bubbles,
the algorithm produces a stitch profile, which is a probabilistic graphical
model of bubbles and helical regions. The algorithm applies a probability peak
finding method based on a hierarchical analysis of the energy barriers in the
Poland-Scheraga model.
Conclusions: Exact and fast computation of genomic stitch profiles is thus
feasible. Sequences of several megabases have been computed, only limited by
computer memory. Possible applications are the genome-wide comparisons of
bubbles with promoters, TSS, viral integration sites, and other melting-related
regions.
Comment: 16 pages, 10 figures
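To make the quadratic cost of the generic search concrete, here is a minimal sketch that only counts candidate (start, end) pairs. It is not the stitch-profile algorithm from the abstract. The function name and the optional `max_len` cap are hypothetical, and the cap is shown purely for contrast, since the paper explicitly avoids assuming a maximal bubble length.

```python
def count_bubble_candidates(n, max_len=None):
    """Count (start, end) pairs with 0 <= start < end <= n.

    n       -- sequence length in base pairs
    max_len -- hypothetical cap on bubble length (None means no cap)
    """
    count = 0
    for start in range(n):
        # Furthest end position allowed for a bubble opening at `start`.
        last_end = n if max_len is None else min(n, start + max_len)
        count += last_end - start
    return count


if __name__ == "__main__":
    # Without a cap the count grows as n * (n + 1) / 2, i.e. quadratically.
    print(count_bubble_candidates(1000))
    # With a fixed cap the count grows only linearly in n.
    print(count_bubble_candidates(1000, max_len=10))
```

Doubling `n` roughly quadruples the uncapped count, which is the scaling the O(N log N) algorithm is described as avoiding.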
Installing hydrolytic activity into a completely de novo protein framework
The design of enzyme-like catalysts tests our understanding of sequence-to-structure/function relationships in proteins. Here we install hydrolytic activity predictably into a completely de novo and thermostable α-helical barrel, which comprises seven helices arranged around an accessible channel. We show that the lumen of the barrel accepts 21 mutations to functional polar residues. The resulting variant, which has cysteine–histidine–glutamic acid triads on each helix, hydrolyses p-nitrophenyl acetate with catalytic efficiencies that match the most-efficient redesigned hydrolases based on natural protein scaffolds. This is the first report of a functional catalytic triad engineered into a de novo protein framework. The flexibility of our system also allows the facile incorporation of unnatural side chains to improve activity and probe the catalytic mechanism. Such a predictable and robust construction of truly de novo biocatalysts holds promise for applications in chemical and biochemical synthesis.
Reliability analysis of the Ahringer Caenorhabditis elegans RNAi feeding library: a guide for genome-wide screens
Abstract
Background: The Ahringer C. elegans RNAi feeding library, prepared by cloning genomic DNA fragments, has been widely used in genome-wide analysis of gene function. However, the library has not been thoroughly validated by direct sequencing, and there are potential errors, including: 1) mis-annotation (a clone with a retired gene name should be remapped to the actual target gene); 2) nonspecific PCR amplification; 3) cross-RNAi; 4) mis-operation, such as sample loading errors.
Results: Here we performed a reliability analysis on the Ahringer C. elegans RNAi feeding library, which contains 16,256 bacterial strains, using a bioinformatics approach. Results demonstrated that most (98.3%) of the bacterial strains in the library are reliable. However, we also found that 2,851 (17.54%) bacterial strains need to be re-annotated even though they are reliable. Most of these bacterial strains are clones carrying retired gene names. In addition, 28 strains are classified as unreliable and 226 strains as marginal because they probably express unrelated double-stranded RNAs (dsRNAs). The accuracy of the prediction was further confirmed by direct sequencing analysis of 496 bacterial strains. Finally, a freely accessible database named CelRNAi (http://biocompute.bmi.ac.cn/CelRNAi/) was developed as a valuable complementary resource for the feeding RNAi library by providing the predicted information on all bacterial strains. Moreover, submission of direct sequencing results or any other annotations for the bacterial strains to the database is allowed, and these will be integrated into the CelRNAi database to improve the accuracy of the library. In addition, we provide five candidate primer sets for each of the unreliable and marginal bacterial strains for users to construct an alternative vector for their own RNAi studies.
Conclusions: Because of the potential unreliability of the Ahringer C. elegans RNAi feeding library, we strongly suggest that users examine the reliability information of the bacterial strains in the CelRNAi database before performing RNAi experiments, as well as during post-RNAi experiment analysis.
G+C content dominates intrinsic nucleosome occupancy
Abstract
Background: The relative preference of nucleosomes to form on individual DNA sequences plays a major role in genome packaging. A wide variety of DNA sequence features are believed to influence nucleosome formation, including periodic dinucleotide signals, poly-A stretches and other short motifs, and sequence properties that influence DNA structure, including base content. It was recently shown by Kaplan et al. that a probabilistic model using composition of all 5-mers within a nucleosome-sized tiling window accurately predicts intrinsic nucleosome occupancy across an entire genome in vitro. However, the model is complicated, and it is not clear which specific DNA sequence properties are most important for intrinsic nucleosome-forming preferences.
Results: We find that a simple linear combination of only 14 simple DNA sequence attributes (G+C content, two transformations of dinucleotide composition, and the frequency of eleven 4-bp sequences) explains nucleosome occupancy in vitro and in vivo in a manner comparable to the Kaplan model. G+C content and frequency of AAAA are the most important features. G+C content is dominant, alone explaining ~50% of the variation in nucleosome occupancy in vitro.
Conclusions: Our findings provide a dramatically simplified means to predict and understand intrinsic nucleosome occupancy. G+C content may dominate because it both reduces frequency of poly-A-like stretches and correlates with many other DNA structural characteristics. Since G+C content is enriched or depleted at many types of features in diverse eukaryotic genomes, our results suggest that variation in nucleotide composition may have a widespread and direct influence on chromatin structure.
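The two features singled out above, G+C content and AAAA frequency, are simple to compute. The sketch below shows one way to do so over sliding windows. It is not the authors' model: the function names are hypothetical, the 147-bp default window is an assumption standing in for "nucleosome-sized", and no regression weights are included because the abstract does not give them.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(base in "GC" for base in seq) / len(seq)


def count_kmer(seq, kmer):
    """Count overlapping occurrences of `kmer` in `seq`."""
    seq, kmer = seq.upper(), kmer.upper()
    k = len(kmer)
    return sum(seq[i:i + k] == kmer for i in range(len(seq) - k + 1))


def windowed_features(seq, window=147, step=1):
    """Return (start, gc_fraction, aaaa_count) for each full window.

    window=147 is an assumed nucleosome-sized window, not a value
    stated in the abstract.
    """
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    features = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        features.append((start, gc_fraction(chunk), count_kmer(chunk, "AAAA")))
    return features


if __name__ == "__main__":
    for row in windowed_features("GGCCAAAAATTGC", window=8, step=2):
        print(row)
```

A linear model of the kind described would combine such per-window features with fitted coefficients; those coefficients would have to come from the paper itself.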
Maintaining and breaking symmetry in homomeric coiled-coil assemblies
Higher order coiled coils with five or more helices can form α-helical barrels. Here the authors show that placing β-branched aliphatic residues along the lumen yields stable and open α-helical barrels, which is of interest for the rational design of functional proteins, whereas the absence of β-branched side chains leads to unusual low-symmetry α-helical bundles.
Beta-Strand Interfaces of Non-Dimeric Protein Oligomers Are Characterized by Scattered Charged Residue Patterns
Protein oligomers are formed either permanently, transiently or even by default. The protein chains are associated through intermolecular interactions constituting the protein interface. The protein interfaces of 40 soluble protein oligomers of stoichiometries above two are investigated using a quantitative and qualitative methodology, which analyzes the x-ray structures of the protein oligomers and considers their interfaces as interaction networks. The protein oligomers of the dataset share the same geometry of interface, made by the association of two individual β-strands (β-interfaces), but are otherwise unrelated. The results show that the β-interfaces are made of two interdigitated interaction networks. One of them involves interactions between main chain atoms (backbone network) while the other involves interactions between side chain and backbone atoms or between only side chain atoms (side chain network). Each one has its own characteristics, which can be associated with a distinct role. The secondary structure of the β-interfaces is implemented through the backbone networks, which are enriched with the hydrophobic amino acids favored in intramolecular β-sheets (MCWIV). The intermolecular specificity is provided by the side chain networks via positioning different types of charged residues at the extremities (arginine) and in the middle (glutamic acid and histidine) of the interface. Such charge distribution helps discriminate between sequences of intermolecular β-strands, of intramolecular β-strands and of β-strands forming β-amyloid fibers. This might open new avenues for drug design and the development of predictive tools. Moreover, the β-strands of the cholera toxin B subunit interface, when produced individually as synthetic peptides, are capable of inhibiting the assembly of the toxin into pentamers. Thus, their sequences contain the features necessary for β-interface formation.
Such β-strands could be considered as ‘assemblons’, independent associating units, by analogy with foldons (independent folding units). Such a property would be extremely valuable in terms of developing drugs that inhibit assembly.
Effective transcription factor binding site prediction using a combination of optimization, a genetic algorithm and discriminant analysis to capture distant interactions
Abstract
Background: Reliable transcription factor binding site (TFBS) prediction methods are essential for computer annotation of large amounts of genome sequence data. However, current methods to predict TFBSs are hampered by the high false-positive rates that occur when only sequence conservation at the core binding sites is considered.
Results: To improve this situation, we have quantified the performance of several Position Weight Matrix (PWM) algorithms, using exhaustive approaches to find their optimal length and position. We applied these approaches to biomedically important TFBSs involved in the regulation of cell growth and proliferation as well as in inflammatory, immune, and antiviral responses (NF-κB, ISGF3, IRF1, STAT1), obesity and lipid metabolism (PPAR, SREBP, HNF4), and the regulation of steroidogenic (SF-1) and cell cycle (E2F) gene expression. We have also gained extra specificity using a method, entitled SiteGA, which takes into account structural interactions within TFBS core and flanking regions, using a genetic algorithm (GA) with a discriminant function of locally positioned dinucleotide (LPD) frequencies.
To ensure higher confidence in our approach, we applied jackknife and bootstrap resampling tests for the comparison; optimized PWMs and SiteGA showed similar recognition performance. Then we applied SiteGA and optimized PWMs (both separately and together) to sequences in the Eukaryotic Promoter Database (EPD). The resulting SiteGA recognition models can now be used to search sequences for binding sites using the web tool, SiteGA.
Analysis of dependencies between close and distant LPDs revealed by SiteGA models has shown that the most significant correlations are between close LPDs, and are generally located in the core (footprint) region. A greater number of less significant correlations are mainly between distant LPDs, which span both core and flanking regions. Applying SiteGA and optimized PWM models together substantially reduced false positives, at least at higher stringencies.
Conclusion: Based on this analysis, SiteGA adds substantial specificity even to optimized PWMs and may be considered for large-scale genome analysis. It adds to the range of techniques available for TFBS prediction, and EPD analysis has led to a list of genes which appear to be regulated by the above TFs.
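For readers unfamiliar with position weight matrices, the following is a minimal sketch of a generic log-odds PWM built from aligned sites and scanned along a sequence. It is not the optimized PWM procedure or SiteGA described above. The function names, the pseudocount, the uniform 0.25 background, the toy sites, and the threshold are all assumptions chosen for illustration.

```python
import math

BASES = "ACGT"


def build_log_odds_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log2-odds PWM from equal-length aligned binding sites.

    Assumes a uniform background frequency, which real genomes rarely have.
    """
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all sites must have the same length")
    pwm = []
    for pos in range(width):
        counts = {base: pseudocount for base in BASES}
        for site in sites:
            counts[site[pos].upper()] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2((counts[b] / total) / background) for b in BASES})
    return pwm


def scan(seq, pwm, threshold):
    """Return (position, score) for every window scoring >= threshold."""
    seq = seq.upper()
    width = len(pwm)
    hits = []
    for start in range(len(seq) - width + 1):
        window = seq[start:start + width]
        if any(base not in BASES for base in window):
            continue  # skip windows containing ambiguous bases such as N
        score = sum(pwm[i][window[i]] for i in range(width))
        if score >= threshold:
            hits.append((start, score))
    return hits


if __name__ == "__main__":
    toy_sites = ["GGAA", "GGAA", "GGAT"]  # made-up sites, not real TFBS data
    pwm = build_log_odds_pwm(toy_sites)
    for position, score in scan("TTGGAATT", pwm, threshold=3.0):
        print(position, round(score, 2))
```

Each position is scored independently here, which is the limitation the abstract attributes to core-only conservation methods. Lowering the threshold admits more windows, which mirrors the trade-off between sensitivity and false positives.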
Unprocessed Viral DNA Could Be the Primary Target of the HIV-1 Integrase Inhibitor Raltegravir
Integration of HIV DNA into the host chromosome requires 3′-processing (3′-P) and strand transfer (ST) reactions catalyzed by the viral integrase (IN). Raltegravir (RAL), commonly used in AIDS therapy, belongs to the family of IN ST inhibitors (INSTIs) acting on IN-viral DNA complexes (intasomes). However, studies show that RAL fails to bind IN alone, and nothing has been reported on the behaviour of RAL toward free viral DNA. Here, we assessed whether free viral DNA could be a primary target for RAL, assuming that the DNA molecule is a receptor for a large number of pharmacological agents. Optical spectroscopy, molecular dynamics and free energy calculations showed that RAL is a tight binder of both processed and unprocessed LTR (long terminal repeat) ends. Complex formation involved mainly van der Waals forces and was enthalpy driven. Dissociation constants (Kds) revealed that RAL affinity for unbound LTRs was stronger than for bound LTRs. Moreover, the Kd value for binding of RAL to LTRs and the IC50 value (half-maximal inhibitory concentration) were in the same range, suggesting that RAL binding to DNA and ST inhibition are correlated events. Accommodation of RAL into terminal base pairs of unprocessed LTR is facilitated by extensive end fraying that lowers the RAL binding energy barrier. RAL binding entails a weak damping of fraying and, correlatively, a weak inhibition of 3′-P. Notably, the calculated structures of RAL bound to free viral DNA resemble those found in RAL-intasome crystals, especially concerning the contacts between the fluorobenzyl group and the conserved 5′-C4pA3-3′ step. We propose that RAL inhibits IN by first binding unprocessed DNA. Like anticancer drug poisons acting on topoisomerases, its interaction with DNA does not alter the cut, but blocks the subsequent joining reaction. We also speculate that INSTIs having viral DNA rather than IN as their main target could induce less resistance.
