112 research outputs found

    Avoiding the pitfalls of gene set enrichment analysis with SetRank.

    Get PDF
    The purpose of gene set enrichment analysis (GSEA) is to find general trends in the huge lists of genes or proteins generated by many functional genomics techniques and bioinformatics analyses. Here we present SetRank, an advanced GSEA algorithm which is able to eliminate many false positive hits. The key principle of the algorithm is that it discards gene sets that have initially been flagged as significant, if their significance is only due to the overlap with another gene set. The algorithm is explained in detail and its performance is compared to that of other methods using objective benchmarking criteria. Furthermore, we explore how sample source bias can affect the results of a GSEA analysis. The benchmarking results show that SetRank is a highly specific tool for GSEA. Furthermore, we show that the reliability of results can be improved by taking sample source bias into account. SetRank is available as an R package and through an online web interface

    Identification of Novel Androgen-Regulated Pathways and mRNA Isoforms through Genome-Wide Exon-Specific Profiling of the LNCaP Transcriptome

    Get PDF
    Androgens drive the onset and progression of prostate cancer (PCa) by modulating androgen receptor (AR) transcriptional activity. Although several microarray-based studies have identified androgen-regulated genes, here we identify in-parallel global androgen-dependent changes in both gene and alternative mRNA isoform expression by exon-level analyses of the LNCaP transcriptome. While genome-wide gene expression changes correlated well with previously-published studies, we additionally uncovered a subset of 226 novel androgen-regulated genes. Gene expression pathway analysis of this subset revealed gene clusters associated with, and including the tyrosine kinase LYN, as well as components of the mTOR (mammalian target of rapamycin) pathway, which is commonly dysregulated in cancer. We also identified 1279 putative androgen-regulated alternative events, of which 325 (similar to 25%) mapped to known alternative splicing events or alternative first/last exons. We selected 30 androgen-dependent alternative events for RT-PCR validation, including mRNAs derived from genes encoding tumour suppressors and cell cycle regulators. Of seven positively-validating events (similar to 23%), five events involved transcripts derived from alternative promoters of known AR gene targets. In particular, we found a novel androgen-dependent mRNA isoform derived from an alternative internal promoter within the TSC2 tumour suppressor gene, which is predicted to encode a protein lacking an interaction domain required for mTOR inhibition. We confirmed that expression of this alternative TSC2 mRNA isoform was directly regulated by androgens, and chromatin immunoprecipitation indicated recruitment of AR to the alternative promoter region at early timepoints following androgen stimulation, which correlated with expression of alternative transcripts. Together, our data suggest that alternative mRNA isoform expression might mediate the cellular response to androgens, and may have roles in clinical PCa

    TREM-1 links dyslipidemia to inflammation and lipid deposition in atherosclerosis.

    Get PDF
    Triggering receptor expressed on myeloid cells-1 (TREM-1) is a potent amplifier of pro-inflammatory innate immune responses, but its significance in non-infectious diseases remains unclear. Here, we demonstrate that TREM-1 promotes cardiovascular disease by exacerbating atherosclerosis. TREM-1 is expressed in advanced human atheromas and is highly upregulated under dyslipidemic conditions on circulating and on lesion-infiltrating myeloid cells in the Apoe(-/-) mouse model. TREM-1 strongly contributes to high-fat, high-cholesterol diet (HFCD)-induced monocytosis and synergizes with HFCD serum-derived factors to promote pro-inflammatory cytokine responses and foam cell formation of human monocyte/macrophages. Trem1(-/-)Apoe(-/-) mice exhibit substantially attenuated diet-induced atherogenesis. In particular, our results identify skewed monocyte differentiation and enhanced lipid accumulation as novel mechanisms through which TREM-1 can promote atherosclerosis. Collectively, our findings illustrate that dyslipidemia induces TREM-1 surface expression on myeloid cells and subsequently synergizes with TREM-1 to enhance monopoiesis, pro-atherogenic cytokine production and foam cell formation

    Ribonuclease inhibitor 1 regulates erythropoiesis by controlling GATA1 translation.

    Get PDF
    Ribosomal proteins (RP) regulate specific gene expression by selectively translating subsets of mRNAs. Indeed, in Diamond-Blackfan anemia and 5q- syndrome, mutations in RP genes lead to a specific defect in erythroid gene translation and cause anemia. Little is known about the molecular mechanisms of selective mRNA translation and involvement of ribosomal-associated factors in this process. Ribonuclease inhibitor 1 (RNH1) is a ubiquitously expressed protein that binds to and inhibits pancreatic-type ribonucleases. Here, we report that RNH1 binds to ribosomes and regulates erythropoiesis by controlling translation of the erythroid transcription factor GATA1. Rnh1-deficient mice die between embryonic days E8.5 and E10 due to impaired production of mature erythroid cells from progenitor cells. In Rnh1-deficient embryos, mRNA levels of Gata1 are normal, but GATA1 protein levels are decreased. At the molecular level, we found that RNH1 binds to the 40S subunit of ribosomes and facilitates polysome formation on Gata1 mRNA to confer transcript-specific translation. Further, RNH1 knockdown in human CD34+ progenitor cells decreased erythroid differentiation without affecting myelopoiesis. Our results reveal an unsuspected role for RNH1 in the control of GATA1 mRNA translation and erythropoiesis

    Screening synteny blocks in pairwise genome comparisons through integer programming

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>It is difficult to accurately interpret chromosomal correspondences such as true orthology and paralogy due to significant divergence of genomes from a common ancestor. Analyses are particularly problematic among lineages that have repeatedly experienced whole genome duplication (WGD) events. To compare multiple "subgenomes" derived from genome duplications, we need to relax the traditional requirements of "one-to-one" syntenic matchings of genomic regions in order to reflect "one-to-many" or more generally "many-to-many" matchings. However this relaxation may result in the identification of synteny blocks that are derived from ancient shared WGDs that are not of interest. For many downstream analyses, we need to eliminate weak, low scoring alignments from pairwise genome comparisons. Our goal is to objectively select subset of synteny blocks whose total scores are maximized while respecting the duplication history of the genomes in comparison. We call this "quota-based" screening of synteny blocks in order to appropriately fill a quota of syntenic relationships within one genome or between two genomes having WGD events.</p> <p>Results</p> <p>We have formulated the synteny block screening as an optimization problem known as "Binary Integer Programming" (BIP), which is solved using existing linear programming solvers. The computer program QUOTA-ALIGN performs this task by creating a clear objective function that maximizes the compatible set of synteny blocks under given constraints on overlaps and depths (corresponding to the duplication history in respective genomes). Such a procedure is useful for any pairwise synteny alignments, but is most useful in lineages affected by multiple WGDs, like plants or fish lineages. For example, there should be a 1:2 ploidy relationship between genome A and B if genome B had an independent WGD subsequent to the divergence of the two genomes. We show through simulations and real examples using plant genomes in the rosid superorder that the quota-based screening can eliminate ambiguous synteny blocks and focus on specific genomic evolutionary events, like the divergence of lineages (in cross-species comparisons) and the most recent WGD (in self comparisons).</p> <p>Conclusions</p> <p>The QUOTA-ALIGN algorithm screens a set of synteny blocks to retain only those compatible with a user specified ploidy relationship between two genomes. These blocks, in turn, may be used for additional downstream analyses such as identifying true orthologous regions in interspecific comparisons. There are two major contributions of QUOTA-ALIGN: 1) reducing the block screening task to a BIP problem, which is novel; 2) providing an efficient software pipeline starting from all-against-all BLAST to the screened synteny blocks with dot plot visualizations. Python codes and full documentations are publicly available <url>http://github.com/tanghaibao/quota-alignment</url>. QUOTA-ALIGN program is also integrated as a major component in SynMap <url>http://genomevolution.com/CoGe/SynMap.pl</url>, offering easier access to thousands of genomes for non-programmers.</p

    Feedback between p21 and reactive oxygen production is necessary for cell senescence

    Get PDF
    The sustained activation of CDKN1A (p21/Waf1/Cip1) by a DNA damage response induces mitochondrial dysfunction and reactive oxygen species (ROS) production via signalling through CDKN1A-GADD45A-MAPK14- GRB2-TGFBR2-TGFbeta in senescing primary human and mouse cells in vitro and in vivo.Enhanced ROS production in senescing cells generates additional DNA damage. Although this damage is repairable and transient, it elevates the average levels of DNA damage response permanently, thus forming a positive feedback loop.This loop is necessary and sufficient to maintain the stability of growth arrest until a ‘point of no return' is reached during establishment of senescence

    Minimal Absent Words in Prokaryotic and Eukaryotic Genomes

    Get PDF
    Minimal absent words have been computed in genomes of organisms from all domains of life. Here, we explore different sets of minimal absent words in the genomes of 22 organisms (one archaeota, thirteen bacteria and eight eukaryotes). We investigate if the mutational biases that may explain the deficit of the shortest absent words in vertebrates are also pervasive in other absent words, namely in minimal absent words, as well as to other organisms. We find that the compositional biases observed for the shortest absent words in vertebrates are not uniform throughout different sets of minimal absent words. We further investigate the hypothesis of the inheritance of minimal absent words through common ancestry from the similarity in dinucleotide relative abundances of different sets of minimal absent words, and find that this inheritance may be exclusive to vertebrates

    Elusive Origins of the Extra Genes in Aspergillus oryzae

    Get PDF
    The genome sequence of Aspergillus oryzae revealed unexpectedly that this species has approximately 20% more genes than its congeneric species A. nidulans and A. fumigatus. Where did these extra genes come from? Here, we evaluate several possible causes of the elevated gene number. Many gene families are expanded in A. oryzae relative to A. nidulans and A. fumigatus, but we find no evidence of ancient whole-genome duplication or other segmental duplications, either in A. oryzae or in the common ancestor of the genus Aspergillus. We show that the presence of divergent pairs of paralogs is a feature peculiar to A. oryzae and is not shared with A. nidulans or A. fumigatus. In phylogenetic trees that include paralog pairs from A. oryzae, we frequently find that one of the genes in a pair from A. oryzae has the expected orthologous relationship with A. nidulans, A. fumigatus and other species in the subphylum Eurotiomycetes, whereas the other A. oryzae gene falls outside this clade but still within the Ascomycota. We identified 456 such gene pairs in A. oryzae. Further phylogenetic analysis did not however indicate a single consistent evolutionary origin for the divergent members of these pairs. Approximately one-third of them showed phylogenies that are suggestive of horizontal gene transfer (HGT) from Sordariomycete species, and these genes are closer together in the A. oryzae genome than expected by chance, but no unique Sordariomycete donor species was identifiable. The postulated HGTs from Sordariomycetes still leave the majority of extra A. oryzae genes unaccounted for. One possible explanation for our observations is that A. oryzae might have been the recipient of many separate HGT events from diverse donors

    2R and remodeling of vertebrate signal transduction engine

    Get PDF
    <p>Abstract</p> <p><b>Background</b></p> <p>Whole genome duplication (WGD) is a special case of gene duplication, observed rarely in animals, whereby all genes duplicate simultaneously through polyploidisation. Two rounds of WGD (2R-WGD) occurred at the base of vertebrates, giving rise to an enormous wave of genetic novelty, but a systematic analysis of functional consequences of this event has not yet been performed.</p> <p><b>Results</b></p> <p>We show that 2R-WGD affected an overwhelming majority (74%) of signalling genes, in particular developmental pathways involving receptor tyrosine kinases, Wnt and transforming growth factor-β ligands, G protein-coupled receptors and the apoptosis pathway. 2R-retained genes, in contrast to tandem duplicates, were enriched in protein interaction domains and multifunctional signalling modules of Ras and mitogen-activated protein kinase cascades. 2R-WGD had a fundamental impact on the cell-cycle machinery, redefined molecular building blocks of the neuronal synapse, and was formative for vertebrate brains. We investigated 2R-associated nodes in the context of the human signalling network, as well as in an inferred ancestral pre-2R (AP2R) network, and found that hubs (particularly involving negative regulation) were preferentially retained, with high connectivity driving retention. Finally, microarrays and proteomics demonstrated a trend for gradual paralog expression divergence independent of the duplication mechanism, but inferred ancestral expression states suggested preferential subfunctionalisation among 2R-ohnologs (2ROs).</p> <p><b>Conclusions</b></p> <p>The 2R event left an indelible imprint on vertebrate signalling and the cell cycle. We show that 2R-WGD preferentially retained genes are associated with higher organismal complexity (for example, locomotion, nervous system, morphogenesis), while genes associated with basic cellular functions (for example, translation, replication, splicing, recombination; with the notable exception of cell cycle) tended to be excluded. 2R-WGD set the stage for the emergence of key vertebrate functional novelties (such as complex brains, circulatory system, heart, bone, cartilage, musculature and adipose tissue). A full explanation of the impact of 2R on evolution, function and the flow of information in vertebrate signalling networks is likely to have practical consequences for regenerative medicine, stem cell therapies and cancer treatment.</p
    corecore