258 research outputs found
A Revised Design for Microarray Experiments to Account for Experimental Noise and Uncertainty of Probe Response
Background
Although microarrays are analysis tools in biomedical research, they are known to yield noisy output that usually requires experimental confirmation. To tackle this problem, many studies have developed rules for optimizing probe design and devised complex statistical tools to analyze the output. However, less emphasis has been placed on systematically identifying the noise component as part of the experimental procedure. One source of noise is the variance in probe binding, which can be assessed by replicating array probes. The second source is poor probe performance, which can be assessed by calibrating the array based on a dilution series of target molecules. Using model experiments for copy number variation and gene expression measurements, we investigate here a revised design for microarray experiments that addresses both of these sources of variance.
Results
Two custom arrays were used to evaluate the revised design: one based on 25 mer probes from an Affymetrix design and the other based on 60 mer probes from an Agilent design. To assess experimental variance in probe binding, all probes were replicated ten times. To assess probe performance, the probes were calibrated using a dilution series of target molecules and the signal response was fitted to an adsorption model. We found that significant variance of the signal could be controlled by averaging across probes and removing probes that are nonresponsive or poorly responsive in the calibration experiment. Taking this into account, one can obtain a more reliable signal with the added option of obtaining absolute rather than relative measurements.
Conclusion
The assessment of technical variance within the experiments, combined with the calibration of probes allows to remove poorly responding probes and yields more reliable signals for the remaining ones. Once an array is properly calibrated, absolute quantification of signals becomes straight forward, alleviating the need for normalization and reference hybridizations
Copy number variants and selective sweeps in natural populations of the house mouse (Mus musculus domesticus)
Copy–number variants (CNVs) may play an important role in early adaptations, potentially facilitating rapid divergence of populations. We describe an approach to study this question by investigating CNVs present in natural populations of mice in the early stages of divergence and their involvement in selective sweeps. We have analyzed individuals from two recently diverged natural populations of the house mouse (Mus musculus domesticus) from Germany and France using custom, high–density, comparative genome hybridization arrays (CGH) that covered almost 164 Mb and 2444 genes. One thousand eight hundred and sixty one of those genes we previously identified as differentially expressed between these populations, while the expression of the remaining genes was invariant. In total, we identified 1868 CNVs across all 10 samples, 200 bp to 600 kb in size and affecting 424 genic regions. Roughly two thirds of all CNVs found were deletions. We found no enrichment of CNVs among the differentially expressed genes between the populations compared to the invariant ones, nor any meaningful correlation between CNVs and gene expression changes. Among the CNV genes, we found cellular component gene ontology categories of the synapse overrepresented among all the 2444 genes tested. To investigate potential adaptive significance of the CNV regions, we selected six that showed large differences in frequency of CNVs between the two populations and analyzed variation in at least two microsatellites surrounding the loci in a sample of 46 unrelated animals from the same populations collected in field trappings. We identified two loci with large differences in microsatellite heterozygosity (Sfi1 and Glo1/Dnahc8 regions) and one locus with low variation across the populations (Cmah), thus suggesting that these genomic regions might have recently undergone selective sweeps. Interestingly, the Glo1 CNV has previously been implicated in anxiety–like behavior in mice, suggesting a differential evolution of a behavioral trai
Physico-chemical foundations underpinning microarray and next-generation sequencing experiments
Hybridization of nucleic acids on solid surfaces is a key process involved in high-throughput technologies such as microarrays and, in some cases, next-generation sequencing (NGS). A physical understanding of the hybridization process helps to determine the accuracy of these technologies. The goal of a widespread research program is to develop reliable transformations between the raw signals reported by the technologies and individual molecular concentrations from an ensemble of nucleic acids. This research has inputs from many areas, from bioinformatics and biostatistics, to theoretical and experimental biochemistry and biophysics, to computer simulations. A group of leading researchers met in Ploen Germany in 2011 to discuss present knowledge and limitations of our physico-chemical understanding of high-throughput nucleic acid technologies. This meeting inspired us to write this summary, which provides an overview of the state-of-the-art approaches based on physico-chemical foundation to modeling of the nucleic acids hybridization process on solid surfaces. In addition, practical application of current knowledge is emphasized
Was the Scanner Calibration Slide used for its intended purpose?
In the article, Scanner calibration revisited, BMC Bioinformatics 2010, 11:361, Dr. Pozhitkov used the Scanner Calibration Slide, a key product of Full Moon BioSystems to generate data in his study of microarray scanner PMT response and proposed a mathematic model for PMT response [1]. In the end, the author concluded that "Full Moon BioSystems calibration slides are inadequate for performing calibration," and recommended "against using these slides." We found these conclusions are seriously flawed and misleading, and his recommendation against using the Scanner Calibration Slide was not properly supported
Scanner calibration revisited
<p>Abstract</p> <p>Background</p> <p>Calibration of a microarray scanner is critical for accurate interpretation of microarray results. Shi et al. (<it>BMC Bioinformatics</it>, 2005, <b>6</b>, Art. No. S11 Suppl. 2.) reported usage of a Full Moon BioSystems slide for calibration. Inspired by the Shi et al. work, we have calibrated microarray scanners in our previous research. We were puzzled however, that most of the signal intensities from a biological sample fell below the sensitivity threshold level determined by the calibration slide. This conundrum led us to re-investigate the quality of calibration provided by the Full Moon BioSystems slide as well as the accuracy of the analysis performed by Shi et al.</p> <p>Methods</p> <p>Signal intensities were recorded on three different microarray scanners at various photomultiplier gain levels using the same calibration slide from Full Moon BioSystems. Data analysis was conducted on raw signal intensities without normalization or transformation of any kind. Weighted least-squares method was used to fit the data.</p> <p>Results</p> <p>We found that initial analysis performed by Shi et al. did not take into account autofluorescence of the Full Moon BioSystems slide, which led to a grossly distorted microarray scanner response. Our analysis revealed that a power-law function, which is explicitly accounting for the slide autofluorescence, perfectly described a relationship between signal intensities and fluorophore quantities.</p> <p>Conclusions</p> <p>Microarray scanners respond in a much less distorted fashion than was reported by Shi et al. Full Moon BioSystems calibration slides are inadequate for performing calibration. We recommend against using these slides.</p
The effects of mismatches on hybridization in DNA microarrays: determination of nearest neighbor parameters
Quantifying interactions in DNA microarrays is of central importance for a
better understanding of their functioning. Hybridization thermodynamics for
nucleic acid strands in aqueous solution can be described by the so-called
nearest-neighbor model, which estimates the hybridization free energy of a
given sequence as a sum of dinucleotide terms. Compared with its solution
counterparts, hybridization in DNA microarrays may be hindered due to the
presence of a solid surface and of a high density of DNA strands. We present
here a study aimed at the determination of hybridization free energies in DNA
microarrays. Experiments are performed on custom Agilent slides. The solution
contains a single oligonucleotide. The microarray contains spots with a perfect
matching complementary sequence and other spots with one or two mismatches: in
total 1006 different probe spots, each replicated 15 times per microarray. The
free energy parameters are directly fitted from microarray data. The
experiments demonstrate a clear correlation between hybridization free energies
in the microarray and in solution. The experiments are fully consistent with
the Langmuir model at low intensities, but show a clear deviation at
intermediate (non-saturating) intensities. These results provide new
interesting insights for the quantification of molecular interactions in DNA
microarrays.Comment: 31 pages, 5 figure
Towards microbiome transplant as a therapy for periodontitis: an exploratory study of periodontitis microbial signature contrasted by oral health, caries and edentulism
published_or_final_versio
An algorithm for the determination and quantification of components of nucleic acid mixtures based on single sequencing reactions
BACKGROUND: Determination and quantification of nucleic acid components in a mixture is usually accomplished by microarray approaches, where the mixtures are hybridized against specific probes. As an alternative, we propose here that a single sequencing reaction from a mixture of nucleic acids holds enough information to potentially distinguish the different components, provided it is known which components can occur in the mixture. RESULTS: We describe an algorithm that is based on a set of linear equations which can be solved when the sequencing profiles of the individual components are known and when the number of sequenced nucleotides is larger than the number of components in the mixture. We have implemented the procedure for one type of sequencing approach, pyrosequencing, which produces a stepwise output of peaks that is particularly suitable for the procedure. As an example we use signature sequences from ribosomal RNA to distinguish and quantify several different species in a mixture. Using simulations, we show that the procedure may also be applicable for dideoxy sequencing on capillary sequencers, requiring only some instrument specific adaptations of protocols and software. CONCLUSION: The parallel sequencing approach described here may become a simple and cheap alternative to microarray experiments which aim at routine re-determination and quantification of known nucleic acid components from environmental samples or tissue samples
Molecular taxonomy. Bioinformatics and practical evaluation
Summary Molecular taxonomy is a field that studies the diversity of organisms based on molecular markers. This work is devoted to develop a methodology of molecular taxonomy of small organisms. The ribosomal RNA (rRNA) is used as a molecular marker since its nucleotide sequence includes stretches of various levels of conservation, which can be used as species, genus and taxa specific regions. The organisms live in complex communities. To discover the composition of these communities, a hybridization assay employing oligonucleotide microarrays is developed to indicate the presence of a certain rRNA, in a sample under investigation. An additional method based on the pyrosequencing process is proposed here. In this case the mixture of rRNA genes is directly sequenced and the proportion of individual sequences is then calculated from the obtained pyrogram. The work comprises two parts: theoretical bioinformatics and practical evaluation. The first part tackles the problem of DNA-RNA duplex stability prediction. As a result, an ad hoc stability function is proposed. An algorithm and a program are developed for the design of oligonucleotides employed in the microarray approach. The kinetics of DNA-RNA duplex dissociation is considered as well. In addition, the formalism of the pyrosequencing approach is elaborated theoretically. The experimental part deals with the issues of oligonucleotide microarray establishment, including fabrication, immobilization, hybridization and scanning. A real-time kinetic setup for observing the RNA-DNA duplex dissociation was developed. The theoretical findings and quality of the oligonucleotide design are practically evaluated. The theory is found to be in a good accordance with experiment. The pyrosequencing approach is tested as well and is demonstrated to have enough power to discover the composition of a complex mixture of rRNA genes
- …
