278 research outputs found

    Direct integration of intensity-level data from Affymetrix and Illumina microarrays improves statistical power for robust reanalysis

    Get PDF
    BACKGROUND: Affymetrix GeneChips and Illumina BeadArrays are the most widely used commercial single channel gene expression microarrays. Public data repositories are an extremely valuable resource, providing array-derived gene expression measurements from many thousands of experiments. Unfortunately many of these studies are underpowered and it is desirable to improve power by combining data from more than one study; we sought to determine whether platform-specific bias precludes direct integration of probe intensity signals for combined reanalysis. RESULTS: Using Affymetrix and Illumina data from the microarray quality control project, from our own clinical samples, and from additional publicly available datasets we evaluated several approaches to directly integrate intensity level expression data from the two platforms. After mapping probe sequences to Ensembl genes we demonstrate that, ComBat and cross platform normalisation (XPN), significantly outperform mean-centering and distance-weighted discrimination (DWD) in terms of minimising inter-platform variance. In particular we observed that DWD, a popular method used in a number of previous studies, removed systematic bias at the expense of genuine biological variability, potentially reducing legitimate biological differences from integrated datasets. CONCLUSION: Normalised and batch-corrected intensity-level data from Affymetrix and Illumina microarrays can be directly combined to generate biologically meaningful results with improved statistical power for robust, integrated reanalysis

    Environmental Implications of Increased Bioenergy Production on Midwest Soil Landscapes [abstract]

    Get PDF
    Only abstract of poster available.Track III: Energy InfrastructurePrairie soil landscapes encompass over 16 million acres in Missouri and surrounding states. Much of this area has been degraded by erosion but is still used for grain production. Erosion has caused variable topsoil depth within fields which in turn has resulted in greater within-field variability of crop yield, magnified the drought-prone nature of these soils, and lowered the overall soil productivity and ecosystem function. In recent years, pressure on these sensitive soils has risen due to higher demand for grain production, in part for ethanol and biodiesel. In some areas, highly erodible fields which were historically managed as CRP and pasture are being turned into grain crop acres. Thus as new and fluctuating feed and bioenergy markets develop, land management practices will also shift, resulting in changes in soil and water quality of watersheds. This presentation will explore the likely environmental implications of different types of bioenergy production on the soil resource. Further, the positive benefits of potential changes in land use will be in explored. For example, one alternative for sensitive soils is production of perennial grass as a feedstock for coal co-burning plants and for potential future use in cellulosic ethanol production. Perennial grass yields are likely to be less variable than grain yields, both year-to-year and within fields, primarily because of greater resistance to drought. Grass production systems also provide environmental services not obtained from annual grain crops. We will also discuss our work on developing ways to target the most appropriate places in the landscape for grain or perennial production so as to enhance ecosystem services and improve soil and water quality

    Tissue of origin determines cancer-associated CpG island promoter hypermethylation patterns

    Get PDF
    ABSTRACT: BACKGROUND: Aberrant CpG island promoter DNA hypermethylation is frequently observed in cancer and is believed to contribute to tumor progression by silencing the expression of tumor suppressor genes. Previously, we observed that promoter hypermethylation in breast cancer reflects cell lineage rather than tumor progression and occurs at genes that are already repressed in a lineage-specific manner. To investigate the generality of our observation we analyzed the methylation profiles of 1,154 cancers from 7 different tissue types. RESULTS: We find that 1,009 genes are prone to hypermethylation in these 7 types of cancer. Nearly half of these genes varied in their susceptibility to hypermethylation between different cancer types. We show that the expression status of hypermethylation prone genes in the originator tissue determines their propensity to become hypermethylated in cancer; specifically, genes that are normally repressed in a tissue are prone to hypermethylation in cancers derived from that tissue. We also show that the promoter regions of hypermethylation-prone genes are depleted of repetitive elements and that DNA sequence around the same promoters is evolutionarily conserved. We propose that these two characteristics reflect tissue-specific gene promoter architecture regulating the expression of these hypermethylation prone genes in normal tissues. CONCLUSIONS: As aberrantly hypermethylated genes are already repressed in pre-cancerous tissue, we suggest that their hypermethylation does not directly contribute to cancer development via silencing. Instead aberrant hypermethylation reflects developmental history and the perturbation of epigenetic mechanisms maintaining these repressed promoters in a hypomethylated state in normal cells.Publisher PDFPeer reviewe

    Evaluation of machine-learning methods for ligand-based virtual screening

    Get PDF
    Machine-learning methods can be used for virtual screening by analysing the structural characteristics of molecules of known (in)activity, and we here discuss the use of kernel discrimination and naive Bayesian classifier (NBC) methods for this purpose. We report a kernel method that allows the processing of molecules represented by binary, integer and real-valued descriptors, and show that it is little different in screening performance from a previously described kernel that had been developed specifically for the analysis of binary fingerprint representations of molecular structure. We then evaluate the performance of an NBC when the training-set contains only a very few active molecules. In such cases, a simpler approach based on group fusion would appear to provide superior screening performance, especially when structurally heterogeneous datasets are to be processed

    Role of inherent soil characteristics in assessing soil health across Missouri

    Get PDF
    Soil health indicator values vary based on parent material, native vegetation, and other soil forming factors; therefore, useful interpretations require consideration of inherent soil characteristics. Our objective was to evaluate the distribution of soil health indicators across soil and climate gradients throughout the state of Missouri through a statewide cover crop cost-share program. Soil samples (0–7 cm) were collected from 5,300 agricultural fields and analyzed for several soil health indicators. Comparisons were made among six regions in the state based on Major Land Resource Area and county boundaries. Results varied for soil organic carbon (C), active C, potentially mineralizable nitrogen, water stable aggregates, and cation exchange capacity by region and corresponded with soil forming factors. Interpretation of soil health indicators must account for regional factors, recognizing that areas with different inherent values have a different potential for soil health

    Association of polymorphisms in HCN4 with mood disorders and obsessive compulsive disorder

    Get PDF
    Hyperpolarization activated cyclic nucleotide-gated (HCN) potassium channels are implicated in the control of neuronal excitability and are expressed widely in the brain. HCN4 is expressed in brain regions relevant to mood and anxiety disorders including specific thalamic nuclei, the basolateral amygdala, and the midbrain dopamine system. We therefore examined the association of HCN4 with a group of mood and anxiety disorders. We genotyped nine tag SNPs in the HCN4 gene using Sequenom iPLEX Gold technology in 285 Caucasian patients with DSM-IV mood disorders and/or obsessive compulsive disorder and 384 Caucasian controls. HCN4 polymorphisms were analyzed using single marker and haplotype-based association methods. Three SNPs showed nominal association in our population (rs12905211, rs3859014, rs498005). SNP rs12905211 maintained significance after Bonferroni correction, with allele T and haplotype CTC overrepresented in cases. These findings suggest HCN4 as a genetic susceptibility factor for mood and anxiety disorders; however, these results will require replication using a larger sample

    Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles

    Get PDF
    Background: Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study.Results: A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%.Conclusion: In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.</p

    Functional Genomic and Proteomic Analysis Reveals Disruption of Myelin-Related Genes and Translation in a Mouse Model of Early Life Neglect

    Get PDF
    Early life neglect is an important public health problem which can lead to lasting psychological dysfunction. Good animal models are necessary to understand the mechanisms responsible for the behavioral and anatomical pathology that results. We recently described a novel model of early life neglect, maternal separation with early weaning (MSEW), that produces behavioral changes in the mouse that persist into adulthood. To begin to understand the mechanism by which MSEW leads to these changes we applied cDNA microarray, next-generation RNA-sequencing (RNA-seq), label-free proteomics, multiple reaction monitoring (MRM) proteomics, and methylation analysis to tissue samples obtained from medial prefrontal cortex to determine the molecular changes induced by MSEW that persist into adulthood. The results show that MSEW leads to dysregulation of markers of mature oligodendrocytes and genes involved in protein translation and other categories, an apparent downward biasing of translation, and methylation changes in the promoter regions of selected dysregulated genes. These findings are likely to prove useful in understanding the mechanism by which early life neglect affects brain structure, cognition, and behavior

    Diffusion MRI of the facial-vestibulocochlear nerve complex: a prospective clinical validation study

    Get PDF
    Objectives: Surgical planning of vestibular schwannoma surgery would benefit greatly from a robust method of delineating the facial-vestibulocochlear nerve complex with respect to the tumour. This study aimed to optimise a multi-shell readout-segmented diffusion-weighted imaging (rs-DWI) protocol and develop a novel post-processing pipeline to delineate the facial-vestibulocochlear complex within the skull base region, evaluating its accuracy intraoperatively using neuronavigation and tracked electrophysiological recordings./ Methods: In a prospective study of five healthy volunteers and five patients who underwent vestibular schwannoma surgery, rs-DWI was performed and colour tissue maps (CTM) and probabilistic tractography of the cranial nerves were generated. In patients, the average symmetric surface distance (ASSD) and 95% Hausdorff distance (HD-95) were calculated with reference to the neuroradiologist-approved facial nerve segmentation. The accuracy of patient results was assessed intraoperatively using neuronavigation and tracked electrophysiological recordings./ Results: Using CTM alone, the facial-vestibulocochlear complex of healthy volunteer subjects was visualised on 9/10 sides. CTM were generated in all 5 patients with vestibular schwannoma enabling the facial nerve to be accurately identified preoperatively. The mean ASSD between the annotators’ two segmentations was 1.11 mm (SD 0.40) and the mean HD-95 was 4.62 mm (SD 1.78). The median distance from the nerve segmentation to a positive stimulation point was 1.21 mm (IQR 0.81–3.27 mm) and 2.03 mm (IQR 0.99–3.84 mm) for the two annotators, respectively./ Conclusions: rs-DWI may be used to acquire dMRI data of the cranial nerves within the posterior fossa./ Clinical relevance statement: Readout-segmented diffusion-weighted imaging and colour tissue mapping provide 1–2 mm spatially accurate imaging of the facial-vestibulocochlear nerve complex, enabling accurate preoperative localisation of the facial nerve. This study evaluated the technique in 5 healthy volunteers and 5 patients with vestibular schwannoma./ Key Points: • Readout-segmented diffusion-weighted imaging (rs-DWI) with colour tissue mapping (CTM) visualised the facial-vestibulocochlear nerve complex on 9/10 sides in 5 healthy volunteer subjects./ • Using rs-DWI and CTM, the facial nerve was visualised in all 5 patients with vestibular schwannoma and within 1.21–2.03 mm of the nerve’s true intraoperative location./ • Reproducible results were obtained on different scanners

    Relative impact of key sources of systematic noise in Affymetrix and Illumina gene-expression microarray experiments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Systematic processing noise, which includes batch effects, is very common in microarray experiments but is often ignored despite its potential to confound or compromise experimental results. Compromised results are most likely when re-analysing or integrating datasets from public repositories due to the different conditions under which each dataset is generated. To better understand the relative noise-contributions of various factors in experimental-design, we assessed several Illumina and Affymetrix datasets for technical variation between replicate hybridisations of Universal Human Reference (UHRR) and individual or pooled breast-tumour RNA.</p> <p>Results</p> <p>A varying degree of systematic noise was observed in each of the datasets, however in all cases the relative amount of variation between standard control RNA replicates was found to be greatest at earlier points in the sample-preparation workflow. For example, 40.6% of the total variation in reported expressions were attributed to replicate extractions, compared to 13.9% due to amplification/labelling and 10.8% between replicate hybridisations. Deliberate probe-wise batch-correction methods were effective in reducing the magnitude of this variation, although the level of improvement was dependent on the sources of noise included in the model. Systematic noise introduced at the chip, run, and experiment levels of a combined Illumina dataset were found to be highly dependant upon the experimental design. Both UHRR and pools of RNA, which were derived from the samples of interest, modelled technical variation well although the pools were significantly better correlated (4% average improvement) and better emulated the effects of systematic noise, over all probes, than the UHRRs. The effect of this noise was not uniform over all probes, with low GC-content probes found to be more vulnerable to batch variation than probes with a higher GC-content.</p> <p>Conclusions</p> <p>The magnitude of systematic processing noise in a microarray experiment is variable across probes and experiments, however it is generally the case that procedures earlier in the sample-preparation workflow are liable to introduce the most noise. Careful experimental design is important to protect against noise, detailed meta-data should always be provided, and diagnostic procedures should be routinely performed prior to downstream analyses for the detection of bias in microarray studies.</p
    corecore