26 research outputs found
Automated Workflow for Preparation of cDNA for Cap Analysis of Gene Expression on a Single Molecule Sequencer
Background: Cap analysis of gene expression (CAGE) is a 59 sequence tag technology to globally determine transcriptional starting sites in the genome and their expression levels and has most recently been adapted to the HeliScope single molecule sequencer. Despite significant simplifications in the CAGE protocol, it has until now been a labour intensive protocol. Methodology: In this study we set out to adapt the protocol to a robotic workflow, which would increase throughput and reduce handling. The automated CAGE cDNA preparation system we present here can prepare 96 ‘HeliScope ready ’ CAGE cDNA libraries in 8 days, as opposed to 6 weeks by a manual operator.We compare the results obtained using the same RNA in manual libraries and across multiple automation batches to assess reproducibility. Conclusions: We show that the sequencing was highly reproducible and comparable to manual libraries with an 8 fold increase in productivity. The automated CAGE cDNA preparation system can prepare 96 CAGE sequencing samples simultaneously. Finally we discuss how the system could be used for CAGE on Illumina/SOLiD platforms, RNA-seq and fulllengt
A Computational Profiling of Changes in Gene Expression and Transcription Factors Induced by vFLIP K13 in Primary Effusion Lymphoma
Infection with Kaposi's sarcoma associated herpesvirus (KSHV) has been linked to the development of primary effusion lymphoma (PEL), a rare lymphoproliferative disorder that is characterized by loss of expression of most B cell markers and effusions in the body cavities. This unique clinical presentation of PEL has been attributed to their distinctive plasmablastic gene expression profile that shows overexpression of genes involved in inflammation, adhesion and invasion. KSHV-encoded latent protein vFLIP K13 has been previously shown to promote the survival and proliferation of PEL cells. In this study, we employed gene array analysis to characterize the effect of K13 on global gene expression in PEL-derived BCBL1 cells, which express negligible K13 endogenously. We demonstrate that K13 upregulates the expression of a number of NF-κB responsive genes involved in cytokine signaling, cell death, adhesion, inflammation and immune response, including two NF-κB subunits involved in the alternate NF-κB pathway, RELB and NFKB2. In contrast, CD19, a B cell marker, was one of the genes downregulated by K13. A comparison with K13-induced genes in human vascular endothelial cells revealed that although there was a considerable overlap among the genes induced by K13 in the two cell types, chemokines genes were preferentially induced in HUVEC with few exceptions, such as RANTES/CCL5, which was induced in both cell types. Functional studies confirmed that K13 activated the RANTES/CCL5 promoter through the NF-κB pathway. Taken collectively, our results suggest that K13 may contribute to the unique gene expression profile, immunophenotype and clinical presentation that are characteristics of KSHV-associated PEL
Differential Gene Expression in the Siphonophore Nanomia bijuga (Cnidaria) Assessed with Multiple Next-Generation Sequencing Workflows
We investigated differential gene expression between functionally specialized feeding polyps and swimming medusae in the siphonophore Nanomia bijuga (Cnidaria) with a hybrid long-read/short-read sequencing strategy. We assembled a set of partial gene reference sequences from long-read data (Roche 454), and generated short-read sequences from replicated tissue samples that were mapped to the references to quantify expression. We collected and compared expression data with three short-read expression workflows that differ in sample preparation, sequencing technology, and mapping tools. These workflows were Illumina mRNA-Seq, which generates sequence reads from random locations along each transcript, and two tag-based approaches, SOLiD SAGE and Helicos DGE, which generate reads from particular tag sites. Differences in expression results across workflows were mostly due to the differential impact of missing data in the partial reference sequences. When all 454-derived gene reference sequences were considered, Illumina mRNA-Seq detected more than twice as many differentially expressed (DE) reference sequences as the tag-based workflows. This discrepancy was largely due to missing tag sites in the partial reference that led to false negatives in the tag-based workflows. When only the subset of reference sequences that unambiguously have tag sites was considered, we found broad congruence across workflows, and they all identified a similar set of DE sequences. Our results are promising in several regards for gene expression studies in non-model organisms. First, we demonstrate that a hybrid long-read/short-read sequencing strategy is an effective way to collect gene expression data when an annotated genome sequence is not available. Second, our replicated sampling indicates that expression profiles are highly consistent across field-collected animals in this case. Third, the impacts of partial reference sequences on the ability to detect DE can be mitigated through workflow choice and deeper reference sequencing
Sequencing technologies and genome sequencing
The high-throughput - next generation sequencing (HT-NGS) technologies are currently the hottest topic in the field of human and animals genomics researches, which can produce over 100 times more data compared to the most sophisticated capillary sequencers based on the Sanger method. With the ongoing developments of high throughput sequencing machines and advancement of modern bioinformatics tools at unprecedented pace, the target goal of sequencing individual genomes of living organism at a cost of $1,000 each is seemed to be realistically feasible in the near future. In the relatively short time frame since 2005, the HT-NGS technologies are revolutionizing the human and animal genome researches by analysis of chromatin immunoprecipitation coupled to DNA microarray (ChIP-chip) or sequencing (ChIP-seq), RNA sequencing (RNA-seq), whole genome genotyping, genome wide structural variation, de novo assembling and re-assembling of genome, mutation detection and carrier screening, detection of inherited disorders and complex human diseases, DNA library preparation, paired ends and genomic captures, sequencing of mitochondrial genome and personal genomics. In this review, we addressed the important features of HT-NGS like, first generation DNA sequencers, birth of HT-NGS, second generation HT-NGS platforms, third generation HT-NGS platforms: including single molecule Heliscope™, SMRT™ and RNAP sequencers, Nanopore, Archon Genomics X PRIZE foundation, comparison of second and third HT-NGS platforms, applications, advances and future perspectives of sequencing technologies on human and animal genome research
Inflammatory Gene Regulatory Networks in Amnion Cells Following Cytokine Stimulation: Translational Systems Approach to Modeling Human Parturition
A majority of the studies examining the molecular regulation of human labor have
been conducted using single gene approaches. While the technology to produce
multi-dimensional datasets is readily available, the means for facile analysis
of such data are limited. The objective of this study was to develop a systems
approach to infer regulatory mechanisms governing global gene expression in
cytokine-challenged cells in vitro, and to apply these methods
to predict gene regulatory networks (GRNs) in intrauterine tissues during term
parturition. To this end, microarray analysis was applied to human amnion
mesenchymal cells (AMCs) stimulated with interleukin-1β, and differentially
expressed transcripts were subjected to hierarchical clustering, temporal
expression profiling, and motif enrichment analysis, from which a GRN was
constructed. These methods were then applied to fetal membrane specimens
collected in the absence or presence of spontaneous term labor. Analysis of
cytokine-responsive genes in AMCs revealed a sterile immune response signature,
with promoters enriched in response elements for several inflammation-associated
transcription factors. In comparison to the fetal membrane dataset, there were
34 genes commonly upregulated, many of which were part of an acute inflammation
gene expression signature. Binding motifs for nuclear factor-κB were
prominent in the gene interaction and regulatory networks for both datasets;
however, we found little evidence to support the utilization of
pathogen-associated molecular pattern (PAMP) signaling. The tissue specimens
were also enriched for transcripts governed by hypoxia-inducible factor. The
approach presented here provides an uncomplicated means to infer global
relationships among gene clusters involved in cellular responses to
labor-associated signals
Comparison of the Equine Reference Sequence with Its Sanger Source Data and New Illumina Reads
Impact of Polymerase Fidelity on Background Error Rates in Next-Generation Sequencing with Unique Molecular Identifiers/Barcodes
Pathogenic variants in CDC45 on the remaining allele in patients with a chromosome 22q11.2 deletion result in a novel autosomal recessive condition
Analysis of Co-Associated Transcription Factors via Ordered Adjacency Differences on Motif Distribution
Transcription factors (TFs) binding to specific DNA sequences or motifs, are elementary to the regulation of transcription. The gene is regulated by a combination of TFs in close proximity. Analysis of co-TFs is an important problem in understanding the mechanism of transcriptional regulation. Recently, ChIP-seq in mapping TF provides a large amount of experimental data to analyze co-TFs. Several studies show that if two TFs are co-associated, the relative distance between TFs exhibits a peak-like distribution. In order to analyze co-TFs, we develop a novel method to evaluate the associated situation between TFs. We design an adjacency score based on ordered differences, which can illustrate co-TF binding affinities for motif analysis. For all candidate motifs, we calculate corresponding adjacency scores, and then list descending-order motifs. From these lists, we can find co-TFs for candidate motifs. On ChIP-seq datasets, our method obtains best AUC results on five datasets, 0.9432 for NMYC, 0.9109 for KLF4, 0.9006 for ZFX, 0.8892 for ESRRB, 0.8920 for E2F1. Our method has great stability on large sample datasets. AUC results of our method on all datasets are above 0.8
