Biology's next revolution
The interpretation of recent environmental genomics data exposes the
far-reaching influence of horizontal gene transfer, and is changing our basic
concepts of organism, species and evolution itself.
Comment: Slightly expanded version of invited essay published in Nature. The
most important addition is a complete set of references that could not be
included in the published version due to space limitations and acknowledgment
of the grant that supported our work.
Identification of Birds through DNA Barcodes
Short DNA sequences from a standardized region of the genome provide a DNA barcode for identifying species. Compiling a public library of DNA barcodes linked to named specimens could provide a new master key for identifying species, one whose power will rise with increased taxon coverage and with faster, cheaper sequencing. Recent work suggests that sequence diversity in a 648-bp region of the mitochondrial gene cytochrome c oxidase I (COI) might serve as a DNA barcode for the identification of animal species. This study tested the effectiveness of a COI barcode in discriminating species of birds, one of the largest and best-studied vertebrate groups. We determined COI barcodes for 260 species of North American birds and found that distinguishing species was generally straightforward. Every species had its own distinct COI barcode or barcodes, and the differences between closely related species were, on average, 18 times higher than the differences within species. Our results identified four probable new species of North American birds, suggesting that a global survey will lead to the recognition of many additional bird species. The finding of large COI sequence differences between species, as compared to small differences within them, confirms the effectiveness of COI barcodes for the identification of bird species. This result, together with those from other groups of animals, implies that a standard screening threshold of sequence difference (10× average intraspecific difference) could speed the discovery of new animal species. The growing evidence for the effectiveness of DNA barcodes as a basis for species identification supports an international exercise that has recently begun to assemble a comprehensive library of COI sequences linked to named specimens.
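The screening threshold described in this abstract (10× the average intraspecific difference) can be illustrated with a minimal Python sketch. It assumes pre-aligned sequences of equal length and uses the simple proportion of differing sites (p-distance), which is a stand-in and not necessarily the distance measure used in the study. The function names and the input layout, a list of (species, sequence) pairs, are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Proportion of sites that differ between two aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    return sum(1 for x, y in zip(a, b) if x != y) / len(a)


def screening_threshold(specimens, factor=10.0):
    """Return factor * mean within-species distance.

    specimens is a list of (species_name, aligned_sequence) pairs.
    factor=10.0 mirrors the 10x rule quoted in the abstract.
    """
    intra = [
        p_distance(s1, s2)
        for (sp1, s1), (sp2, s2) in combinations(specimens, 2)
        if sp1 == sp2
    ]
    if not intra:
        raise ValueError("need at least two specimens of one species")
    return factor * sum(intra) / len(intra)


def flag_divergent_pairs(specimens, threshold):
    """Indices of specimen pairs whose distance exceeds the threshold."""
    return [
        (i, j)
        for (i, (_, s_i)), (j, (_, s_j)) in combinations(enumerate(specimens), 2)
        if p_distance(s_i, s_j) > threshold
    ]
```

Pairs flagged this way are only candidates for closer taxonomic study. Real barcode work would also need sequence alignment and a substitution model, both of which are left out here.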
The Genetic Code as a Periodic Table: Algebraic Aspects
The systematics of indices of physico-chemical properties of codons and amino
acids across the genetic code are examined. Using a simple numerical labelling
scheme for nucleic acid bases, data can be fitted as low-order polynomials of
the 6 coordinates in the 64-dimensional codon weight space. The work confirms
and extends recent studies by Siemion of amino acid conformational parameters.
The connections between the present work, and recent studies of the genetic
code structure using dynamical symmetry algebras, are pointed out.
Comment: 26 pages Latex, 10 figures (4 ps, 6 Tex). Refereed version, small
changes to discussion (conclusion unaltered). Minor alterations to format of
figures and tables. To appear in BioSystems.
A complex adaptive systems approach to the kinetic folding of RNA
The kinetic folding of RNA sequences into secondary structures is modeled as
a complex adaptive system, the components of which are possible RNA structural
rearrangements (SRs) and their associated bases and base pairs. RNA bases and
base pairs engage in local stacking interactions that determine the
probabilities (or fitnesses) of possible SRs. Meanwhile, selection operates at
the level of SRs; an autonomous stochastic process periodically (i.e., from one
time step to another) selects a subset of possible SRs for realization based on
the fitnesses of the SRs. Using examples based on selected natural and
synthetic RNAs, the model is shown to qualitatively reproduce characteristic
(nonlinear) RNA folding dynamics such as the attainment by RNAs of alternative
stable states. Possible applications of the model to the analysis of properties
of fitness landscapes, and of the RNA sequence to structure mapping are
discussed.
Comment: 23 pages, 4 figures, 2 tables, to be published in BioSystems (Note:
updated 2 references)
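The selection step in this abstract, in which a stochastic process picks a subset of possible structural rearrangements (SRs) at each time step according to their fitnesses, might be sketched as follows. This is a generic illustration, not the authors' model. It assumes each fitness is already a probability in [0, 1] and that SRs are chosen independently; the function names and the seeded random generator are hypothetical.

```python
import random


def select_rearrangements(fitnesses, rng):
    """Pick the subset of SRs realized in one time step.

    fitnesses maps an SR label to a probability in [0, 1]; each SR is
    realized independently with that probability (an assumption).
    """
    for name, p in fitnesses.items():
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"fitness of {name!r} is not a probability")
    return [name for name, p in fitnesses.items() if rng.random() < p]


def simulate(fitnesses, steps, seed=0):
    """Run the selection step repeatedly with a seeded generator."""
    rng = random.Random(seed)
    return [select_rearrangements(fitnesses, rng) for _ in range(steps)]
```

In a fuller model the fitnesses would be recomputed after every step from the local stacking interactions of the current structure. Here they are held fixed to keep the sketch short.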
Phylogeny of Prokaryotes and Chloroplasts Revealed by a Simple Composition Approach on All Protein Sequences from Complete Genomes Without Sequence Alignment
The complete genomes of living organisms have provided much information on their phylogenetic relationships. Similarly, the complete genomes of chloroplasts have helped to resolve the evolution of this organelle in photosynthetic eukaryotes. In this paper we propose an alternative method of phylogenetic analysis using compositional statistics for all protein sequences from complete genomes. This new method is conceptually simpler than, and computationally as fast as, the one proposed by Qi et al. (2004b) and Chu et al. (2004). The same data sets used in Qi et al. (2004b) and Chu et al. (2004) are analyzed using the new method. Our distance-based phylogenetic tree of the 109 prokaryotes and eukaryotes agrees with the biologists' tree of life based on 16S rRNA comparison in a predominant majority of basic branching and most lower taxa. Our phylogenetic analysis also shows that the chloroplast genomes are separated into two major clades corresponding to chlorophytes s.l. and rhodophytes s.l. The interrelationships among the chloroplasts are largely in agreement with the current understanding of chloroplast evolution.
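The general idea of alignment-free comparison by protein composition can be sketched in Python as below. This is a generic k-mer composition distance, not the specific statistic proposed in the paper. The choice of k = 3, the cosine-based distance, and all function names are assumptions made for illustration.

```python
from collections import Counter
from math import sqrt


def composition_vector(proteins, k=3):
    """Relative frequency of every length-k string over all proteins."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no k-mers found; sequences shorter than k")
    return {kmer: c / total for kmer, c in counts.items()}


def composition_distance(u, v):
    """(1 - cosine similarity) / 2 between two composition vectors."""
    dot = sum(u[kmer] * v[kmer] for kmer in u.keys() & v.keys())
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    return (1.0 - dot / (norm_u * norm_v)) / 2.0


def distance_matrix(genomes, k=3):
    """Pairwise distances for a dict of genome name -> protein list."""
    vectors = {name: composition_vector(p, k) for name, p in genomes.items()}
    names = sorted(vectors)
    return {
        (a, b): composition_distance(vectors[a], vectors[b])
        for a in names
        for b in names
    }
```

A matrix like this could then be passed to a distance-based tree-building method such as neighbor joining, which is not shown here.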
Origin of symbol-using systems: speech, but not sign, without the semantic urge
Natural language, spoken and signed, is a multichannel phenomenon, involving facial and body expression, and voice and visual intonation that is often used in the service of a social urge to communicate meaning. Given that iconicity seems easier and less abstract than making arbitrary connections between sound and meaning, iconicity and gesture have often been invoked in the origin of language alongside the urge to convey meaning. To get a fresh perspective, we critically distinguish the origin of a system capable of evolution from the subsequent evolution that system becomes capable of. Human language arose on a substrate of a system already capable of Darwinian evolution; the genetically supported uniquely human ability to learn a language reflects a key contact point between Darwinian evolution and language. Though implemented in brains generated by DNA symbols coding for protein meaning, the second higher-level symbol-using system of language now operates in a world mostly decoupled from Darwinian evolutionary constraints. Examination of Darwinian evolution of vocal learning in other animals suggests that the initial fixation of a key prerequisite to language into the human genome may actually have required initially side-stepping not only iconicity, but the urge to mean itself. If sign languages came later, they would not have faced this constraint.
Evidence for diversifying selection of genetic regions of encoding putative collagen-like host-adhesive fibers in Pasteuria penetrans
© FEMS 2018. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Pasteuria spp. belong to a group of genetically diverse endospore-forming bacteria (phylum: Firmicutes) that are known to parasitize plant-parasitic nematodes and water fleas (Daphnia spp.). Collagen-like fibres form the nap on the surface of endospores and the genes encoding these sequences have been hypothesised to be involved in the adhesion of the endospores of Pasteuria spp. to their hosts. We report a group of 17 unique collagen-like genes putatively encoded by Pasteuria penetrans (strain: Res148) that formed five different phylogenetic clusters and suggest that collagen-like proteins are an important source of genetic diversity in animal pathogenic Firmicutes including Pasteuria. Additionally, and unexpectedly, we identified a putative collagen-like sequence which had a very different sequence structure to the other collagen-like proteins but was similar to the protein sequences in Megaviruses that are involved in host-parasite interactions. We, therefore, suggest that these diverse endospore surface proteins in Pasteuria are involved in biological functions, such as cellular adhesion; however, they are not of monophyletic origin and were possibly obtained de novo by mutation or possibly through selection acting upon several historic horizontal gene transfer events.
Peer reviewed. Final published version.
Four small puzzles that Rosetta doesn't solve
A complete macromolecule modeling package must be able to solve the simplest
structure prediction problems. Despite recent successes in high resolution
structure modeling and design, the Rosetta software suite fares poorly on
deceptively small protein and RNA puzzles, some as small as four residues. To
illustrate these problems, this manuscript presents extensive Rosetta results
for four well-defined test cases: the 20-residue mini-protein Trp cage, an even
smaller disulfide-stabilized conotoxin, the reactive loop of a serine protease
inhibitor, and a UUCG RNA tetraloop. In contrast to previous Rosetta studies,
several lines of evidence indicate that conformational sampling is not the
major bottleneck in modeling these small systems. Instead, approximations and
omissions in the Rosetta all-atom energy function currently preclude
discriminating experimentally observed conformations from de novo models at
atomic resolution. These molecular "puzzles" should serve as useful model
systems for developers wishing to make foundational improvements to this
powerful modeling suite.
Comment: Published in PLoS One as a manuscript for the RosettaCon 2010 Special
Collection.
