Search CORE

244 research outputs found

Differential expression analysis for sequence count data

Author: A Agresti
A Mortazavi
AC Cameron
AM Smith
AS Morrissy
B Langmead
C Loader
CI Bliss
DD Licatalosi
G Robertson
GK Smyth
GK Smyth
I Lönnstedt
J Bullard
JC Marioni
JF Lawless
JS Bloom
K Saha
L Wang
L Whitaker
M Kasowski
MD Robinson
MD Robinson
MD Robinson
MD Robinson
P Engström
P McCullagh
RC Gentleman
Simon Anders
SJ Clark
U Nagalakshmi
Wolfgang Huber
Y Benjamini
Publication venue
Publication date: 01/01/2010
Field of study

*Motivation:* High-throughput nucleotide sequencing provides quantitative readouts in assays for RNA expression (RNA-Seq), protein-DNA binding (ChIP-Seq) or cell counting (barcode sequencing). Statistical inference of differential signal in such data requires estimation of their variability throughout the dynamic range. When the number of replicates is small, error modelling is needed to achieve statistical power.

*Results:* We propose an error model that uses the negative binomial distribution, with variance and mean linked by local regression, to model the null distribution of the count data. The method controls type-I error and provides good detection power. 

*Availability:* A free open-source R software package, _DESeq_, is available from the Bioconductor project and from "http://www-huber.embl.de/users/anders/DESeq":http://www-huber.embl.de/users/anders/DESeq

Crossref

Springer

Springer - Publisher Connector

PubMed Central

Institute of Mathematics AS CR, v. v. i.

Nature Precedings

FRA2A is a CGG repeat expansion associated with silencing of AFF3

Author: A Ruiz-Herrera
A Ruiz-Herrera
A Tukun
AJMH Verkerk
AR La Spada
B Winnepenninckx
C Jones
C Ma
C McMurray
CE Pearson
CE Pearson
Chandra Sekhar Reddy Chilamakuri
Christopher E. Pearson
D Kumari
David I. Wilson
David R. FitzPatrick
DD Licatalosi
DD Rudnicki
DS Murthy
E de Graaff
E Steichen-Gersdorf
Edwin Reyniers
Eric Haan
Evelyn Douglas
G Annerén
Geert Vandeweyer
Geoffrey Thompson
Harris Morrison
Hemant Bengani
J Benítez
J Gécz
J Rainger
J Tost
Jacqueline Rainger
JE Parrish
JK Nancarrow
Jozef Gecz
K Debacker
K Debacker
K Gronskov
K Mondal
KE Davies
L Chakrabarti
Liesbeth Rooms
M Kato
M Melko
M Pieretti
M Pieretti
M Wojciechowska
MA Costa Lima
MA Lancaster
Martin S. Taylor
MM Axford
O Britanova
R Illingworth
R Kodzius
R O'Rahilly
R Willemsen
R. Frank Kooy
RI Richards
RI Richards
SJL Knight
SL Nolin
Sofie Metsu
SS Chong
T Sarafidou
T Taki
T Zu
X Liao
Y Gu
Y Trottier
YaW Lin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

Folate-sensitive fragile sites (FSFS) are a rare cytogenetically visible subset of dynamic mutations. Of the eight molecularly characterized FSFS, four are associated with intellectual disability (ID). Cytogenetic expression results from CGG tri-nucleotide-repeat expansion mutation associated with local CpG hypermethylation and transcriptional silencing. The best studied is the FRAXA site in the FMR1 gene, where large expansions cause fragile X syndrome, the most common inherited ID syndrome. Here we studied three families with FRA2A expression at 2q11 associated with a wide spectrum of neurodevelopmental phenotypes. We identified a polymorphic CGG repeat in a conserved, brain-active alternative promoter of the AFF3 gene, an autosomal homolog of the X-linked AFF2/FMR2 gene: Expansion of the AFF2 CGG repeat causes FRAXE ID. We found that FRA2A-expressing individuals have mosaic expansions of the AFF3 CGG repeat in the range of several hundred repeat units. Moreover, bisulfite sequencing and pyrosequencing both suggest AFF3 promoter hypermethylation. cSNP-analysis demonstrates monoallelic expression of the AFF3 gene in FRA2A carriers thus predicting that FRA2A expression results in functional haploinsufficiency for AFF3 at least in a subset of tissues. By whole-mount in situ hybridization the mouse AFF3 ortholog shows strong regional expression in the developing brain, somites and limb buds in 9.5-12.5dpc mouse embryos. Our data suggest that there may be an association between FRA2A and a delay in the acquisition of motor and language skills in the families studied here. However, additional cases are required to firmly establish a causal relationship

Southampton (e-Prints Soton)

Crossref

Adelaide Research & Scholarship

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

The Francis Crick Institute

DGCR8 HITS-CLIP reveals novel functions for the Microprocessor

Author: A Shenoy
A Shiohama
A Shiohama
Agata Stajuda
AM Denli
BN Davis
C Ender
D Tollervey
DD Licatalosi
DD Licatalosi
DG Zisoulis
DP Bartel
E Bernstein
E Bernstein
E Lund
Eduardo Eyras
FV Karginov
G Hutvagner
G Michlewski
Gracjan Michlewski
GS Slater
H Wu
J Han
J Han
J Han
J Konig
J Krol
J Ule
J Winter
Javier F Cáceres
JF Caceres
JM Pawlicki
JR Sanford
K Fenelon
KL Stark
M Faller
M Faller
M Hafner
M Landthaler
M Morlando
Mireya Plass
MM Chong
MS Scott
MT Bohnsack
P Flicek
P Ji
PA Fujita
R Triboulet
R Yi
RI Gregory
RJ Taft
S Guil
S Kadener
Sara Macias
SW Chi
T Kiss
TJ Liang
Y Wang
Y Zeng
YT Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

The Drosha-DGCR8 complex (Microprocessor) is required for microRNA (miRNA) biogenesis. DGCR8 recognizes the RNA substrate, whereas Drosha functions as the endonuclease. High-throughput sequencing and crosslinking immunoprecipitation (HITS-CLIP) was used to identify RNA targets of DGCR8 in human cells. Unexpectedly, miRNAs were not the most abundant targets. DGCR8-bound RNAs also comprised several hundred mRNAs as well as snoRNAs and long non-coding RNAs. We found that the Microprocessor controls the abundance of several mRNAs as well as of MALAT-1. By contrast, DGCR8-mediated cleavage of snoRNAs is independent of Drosha, suggesting the involvement of DGCR8 in cellular complexes with other endonucleases. Interestingly, binding of DGCR8 to cassette exons, acts as a novel mechanism to regulate the relative abundance of alternatively spliced isoforms. Collectively, these data provide new insights in the complex role of DGCR8 in controlling the fate of several classes of RNAs

Crossref

PubMed Central

Copenhagen University Research Information System

Edinburgh Research Explorer

UPF Digital Repository

lincRNAs act in the circuitry controlling pluripotency and differentiation

Author: A Gaspar-Maia
A Marson
A Meissner
A Subramanian
A Visel
A Wutz
AA Avilion
AG Smith
Alexander Meissner
AM Khalil
Anne Bergstrom Lucas
Aviv Regev
B Langmead
BE Bernstein
BK Dey
Bryce W. Carey
C Bock
D Pasini
D Sproul
DA Barbie
David E. Root
DC Zappulla
DD Licatalosi
EE Morrisey
Eric S. Lander
F De Santa
G Hu
G Kunarso
Geneva Young
GK Geiss
Glen Munson
H Jiang
H Niwa
H Niwa
I Chambers
Ido Amit
IG Brons
J Jiang
J Lamb
J Moffat
J Nichols
J Ponjavic
J Ponjavic
J Silva
J Ule
Jennifer K. Grenier
JL Rinn
John L. Rinn
JS Mattick
Julie Donaghey
K Aiba
K Aiba
K Mitsui
K Plath
KD Pruitt
LA Boyer
Laurakay Bruhn
M Dejosez
M Ebisuya
M Guttman
M Guttman
M Huarte
Manuel Garber
MC Tsai
ME Torres-Padilla
Mitchell Guttman
MJ Koziol
MV Koerner
N Ivanova
P Yuan
PA Cloos
QL Ying
R Jaenisch
RI Sherwood
Robert Ach
S Bilodeau
T Brambrink
TG Fazzio
The FANTOM Consortium
TK Kim
UA Ørom
X Chen
X Shen
Xiaoping Yang
Y Katz
Y Nakatake
Y Zhang
YH Yang
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2011
Field of study

Although thousands of large intergenic non-coding RNAs (lincRNAs) have been identified in mammals, few have been functionally characterized, leading to debate about their biological role. To address this, we performed loss-of-function studies on most lincRNAs expressed in mouse embryonic stem (ES) cells and characterized the effects on gene expression. Here we show that knockdown of lincRNAs has major consequences on gene expression patterns, comparable to knockdown of well-known ES cell regulators. Notably, lincRNAs primarily affect gene expression in trans. Knockdown of dozens of lincRNAs causes either exit from the pluripotent state or upregulation of lineage commitment programs. We integrate lincRNAs into the molecular circuitry of ES cells and show that lincRNA genes are regulated by key transcription factors and that lincRNA transcripts bind to multiple chromatin regulatory proteins to affect shared gene expression programs. Together, the results demonstrate that lincRNAs have key roles in the circuitry controlling ES cell state.Broad InstituteHarvard UniversityNational Human Genome Research Institute (U.S.)Merkin Family Foundation for Stem Cell Researc

Genome-wide identification of Ago2 binding sites from mouse embryonic stem cells with and without mature microRNAs

Author: A Grimson
A Marson
A Rosa
A Siepel
AA Caudy
AK Leung
AK Leung
Amanda G Young
Andrew D Bosson
Anthony K L Leung
Arjun Bhutkar
BP Lewis
C Ciaudo
C Melton
C Xiao
CB Nielsen
Cydney B Nielsen
D Baek
D Edbauer
DD Licatalosi
DG Zisoulis
DP Bartel
G Stefani
Grace X Zheng
GS Tan
GW Yeo
HB Houbaviy
I Behm-Ansmant
J Brennecke
J Höck
J Tsang
J Ule
JE Babiarz
JG Doench
JM Calabrese
JM Calabrese
KA O'Donnell
KK Farh
KM Foshay
L Sinkkonen
LP Lim
M Blanchette
M Hafner
M Hafner
M Kertesz
M Yoda
P Landgraf
Phillip A Sharp
R Benetti
RC Friedman
RL Judson
RS Pillai
RW Carthew
S Djuranovic
S Gu
S Sinha
SW Chi
TL Bailey
TL Bailey
V Ambros
WP Kloosterman
WY Choi
X Li
X Xie
Y Tay
Y Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2010
Field of study

MicroRNAs (miRNAs) are 19–22-nucleotide noncoding RNAs that post-transcriptionally regulate mRNA targets. We have identified endogenous miRNA binding sites in mouse embryonic stem cells (mESCs), by performing photo-cross-linking immunoprecipitation using antibodies to Argonaute (Ago2) followed by deep sequencing of RNAs (CLIP-seq). We also performed CLIP-seq in Dicer[superscript −/−] mESCs that lack mature miRNAs, allowing us to define whether the association of Ago2 with the identified sites was miRNA dependent. A significantly enriched motif, GCACUU, was identified only in wild-type mESCs in 3′ untranslated and coding regions. This motif matches the seed of a miRNA family that constitutes ~68% of the mESC miRNA population. Unexpectedly, a G-rich motif was enriched in sequences cross-linked to Ago2 in both the presence and absence of miRNAs. Expression analysis and reporter assays confirmed that the seed-related motif confers miRNA-directed regulation on host mRNAs and that the G-rich motif can modulate this regulation.Leukemia & Lymphoma Society of AmericaUnited States. Public Health Service (Grant R01-GM34277)United States. Public Health Service (Grant R01-CA133404)National Cancer Institute (U.S.) (Grant P01-CA42063)National Cancer Institute (U.S.) Cancer Center Support (Grant P30-CA14051

DSpace@MIT

Crossref

PubMed Central

Cytoplasmic Polyadenylation Element Binding Protein Deficiency Stimulates PTEN and Stat3 mRNA Translation and Induces Hepatic Insulin Resistance

Author: A Mora
AR Morris
B Feve
Bryan O'Sullivan-Murphy
D Burns
Dae Young Jung
DC Barnard
DD Licatalosi
DD Licatalosi
DM Burns
Fumihiko Urano
H Herranz
HJ Kim
Hwi Jin Ko
I Groisman
I Groisman
Ilya M. Alexandrov
J Paris
J Tay
Jason K. Kim
JD Keene
JD Richter
JH Kim
JM Alarcon
Joel D. Richter
K Ueki
L Wu
LE Hake
Maria Ivshina
Mei Xu
MF White
N Liu
P Anderson
R Mendez
Randall Friedline
Rita Bortell
S Nottrott
S Wang
SE Kahn
T Boettger
T Maniatis
W Huang da
W Huang da
Wataru Ogawa
Yen-Tsung Huang
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

The cytoplasmic polyadenylation element binding protein CPEB1 (CPEB) regulates germ cell development, synaptic plasticity, and cellular senescence. A microarray analysis of mRNAs regulated by CPEB unexpectedly showed that several encoded proteins are involved in insulin signaling. An investigation of Cpeb1 knockout mice revealed that the expression of two particular negative regulators of insulin action, PTEN and Stat3, were aberrantly increased. Insulin signaling to Akt was attenuated in livers of CPEB–deficient mice, suggesting that they might be defective in regulating glucose homeostasis. Indeed, when the Cpeb1 knockout mice were fed a high-fat diet, their livers became insulin-resistant. Analysis of HepG2 cells, a human liver cell line, depleted of CPEB demonstrated that this protein directly regulates the translation of PTEN and Stat3 mRNAs. Our results show that CPEB regulated translation is a key process involved in insulin signaling

CiteSeerX

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

eScholarship@UMassChan

The Francis Crick Institute

Predicting RNA-Protein Interactions Using Only Sequence Information

Author: A Barkan
A Martínez-antonio
A Pacheco
AP Gerber
B Blencowe
BA Lewis
C Charon
D Ray
D Ursic
DD Licatalosi
DD Licatalosi
DJ Hogan
Drena Dobbs
E Kaymak
H Hwang
HM Berman
HM Berman
I Breiman
I Sola
J Shen
JC Nacher
JD Keene
JG Lees
JR Sanford
KB Cook
L Pérez-Cano
M Bellucci
M Hafner
M Hafner
M Hall
M Khorshid
M Terribilini
MY Kim
N Mittal
NG Tsvetanova
P Baldi
P Zhou
S Kishore
S Lee
T Wu
T-Y Wang
TE Baroni
TI Lee
Usha K Muppirala
V Pancaldi
V Vapnik
Vasant G Honavar
VP Vidal
X Shao
X-W Chen
Y Wang
Z Li
Z-P Liu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background RNA-protein interactions (RPIs) play important roles in a wide variety of cellular processes, ranging from transcriptional and post-transcriptional regulation of gene expression to host defense against pathogens. High throughput experiments to identify RNA-protein interactions are beginning to provide valuable information about the complexity of RNA-protein interaction networks, but are expensive and time consuming. Hence, there is a need for reliable computational methods for predicting RNA-protein interactions. Results We propose <it>RPISeq</it>, a family of classifiers for predicting <it>R</it>NA-<it>p</it>rotein <it>i</it>nteractions using only <it>seq</it>uence information. Given the sequences of an RNA and a protein as input, <it>RPIseq </it>predicts whether or not the RNA-protein pair interact. The RNA sequence is encoded as a normalized vector of its ribonucleotide 4-mer composition, and the protein sequence is encoded as a normalized vector of its 3-mer composition, based on a 7-letter reduced alphabet representation. Two variants of <it>RPISeq </it>are presented: <it>RPISeq-SVM</it>, which uses a Support Vector Machine (SVM) classifier and <it>RPISeq-RF</it>, which uses a Random Forest classifier. On two non-redundant benchmark datasets extracted from the Protein-RNA Interface Database (PRIDB), <it>RPISeq </it>achieved an AUC (Area Under the Receiver Operating Characteristic (ROC) curve) of 0.96 and 0.92. On a third dataset containing only mRNA-protein interactions, the performance of <it>RPISeq </it>was competitive with that of a published method that requires information regarding many different features (e.g., mRNA half-life, GO annotations) of the putative RNA and protein partners. In addition, <it>RPISeq </it>classifiers trained using the PRIDB data correctly predicted the majority (57-99%) of non-coding RNA-protein interactions in NPInter-derived networks from <it>E. coli, S. cerevisiae, D. melanogaster, M. musculus</it>, and <it>H. sapiens</it>. Conclusions Our experiments with <it>RPISeq </it>demonstrate that RNA-protein interactions can be reliably predicted using only sequence-derived information. <it>RPISeq </it>offers an inexpensive method for computational construction of RNA-protein interaction networks, and should provide useful insights into the function of non-coding RNAs. <it>RPISeq </it>is freely available as a web-based server at <url>http://pridb.gdcb.iastate.edu/RPISeq/.</url></p

Digital Repository @ Iowa State University (ISU)

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Detection and Removal of Biases in the Analysis of Next-Generation Sequencing Reads

Author: A Barski
A Valouev
AC Seila
AP Boyle
AP Fejes
AR Kornblihtt
B Li
C Wang
DD Licatalosi
DD Licatalosi
E Hodges
ET Wang
G Hon
G Kunarso
GA Heap
Gil Ast
GW Muse
H Tilgner
I Listerman
IE Schor
J Li
J Rozowsky
J Zeitlinger
JC Dohm
JC Marioni
JF Degner
JW Brown
KD Hansen
KJ Gaulton
L Laurent
L Zhu
LJ Core
LW Hillier
M de la Mata
M Kircher
MJ Weber
ML Metzker
N Philippe
N Sela
N Spies
P Flicek
P Kolasinska-Zwierz
P Medvedev
Purification Lopez-Garcia
R Andersson
R Lister
R Morin
Ram Oren
S Griffiths-Jones
S Griffiths-Jones
S Nahkuri
S Pepke
S Schwartz
Schraga Schwartz
T Kiss
T Kiss
T Kiss
TH Kim
W Chen
W Filipowicz
Y Gilad
Z Wang
Z Wang
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Since the emergence of next-generation sequencing (NGS) technologies, great effort has been put into the development of tools for analysis of the short reads. In parallel, knowledge is increasing regarding biases inherent in these technologies. Here we discuss four different biases we encountered while analyzing various Illumina datasets. These biases are due to both biological and statistical effects that in particular affect comparisons between different genomic regions. Specifically, we encountered biases pertaining to the distributions of nucleotides across sequencing cycles, to mappability, to contamination of pre-mRNA with mRNA, and to non-uniform hydrolysis of RNA. Most of these biases are not specific to one analyzed dataset, but are present across a variety of datasets and within a variety of genomic contexts. Importantly, some of these biases correlated in a highly significant manner with biological features, including transcript length, gene expression levels, conservation levels, and exon-intron architecture, misleadingly increasing the credibility of results due to them. We also demonstrate the relevance of these biases in the context of analyzing an NGS dataset mapping transcriptionally engaged RNA polymerase II (RNAPII) in the context of exon-intron architecture, and show that elimination of these biases is crucial for avoiding erroneous interpretation of the data. Collectively, our results highlight several important pitfalls, challenges and approaches in the analysis of NGS reads

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Mapping exosome-substrate interactions in vivo by UV cross-linking

Author: AC Tuck
C Delan-Forino
C Delan-Forino
DD Licatalosi
EL Van Nostrand
F Ramírez
J Konig
JJ Tree
M Dodt
M Hafner
P Flicek
S Granneman
S Granneman
S Schneider
S Webb
V Libri
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

International audienceAbstract The RNA exosome complex functions in both the accurate processing and rapid degradation of many classes of RNA in eukaryotes and Archaea. Functional and structural analyses indicate that RNA can either be threaded through the central channel of the exosome or more directly access the active sites of the ribonucleases Rrp44 and Rrp6, but in most cases, it remains unclear how many substrates follow each pathway in vivo. Here we describe the method for using an UV cross-linking technique termed CRAC to generate stringent, transcriptome-wide mapping of exosome–substrate interaction sites in vivo and at base-pair resolution

Crossref

Edinburgh Research Explorer

HAL: Hyper Article en Ligne

Evolutionary Constraint Helps Unmask a Splicing Regulatory Region in BRCA1 Exon 11

Author: Andrew G. L. Douglas
AR Grosso
C Yuli
CA Wilson
Claudia Tammaro
David I. Wilson
DD Licatalosi
Diana Baralle
F Pagani
F Pagani
F Piva
I Paz
J Lee
JR Thompson
JV Chamary
KA Dittmar
L Good
LD Hurst
Ludmila Prokunina-Olsson
M Lu
Michela Raponi
P de la Grange
PD Ryan
R Bachelier
RD Brandão
S Thakur
TI Orban
TI Orban
TI Orban
W Tang
Y Qin
ZE Sauna
Publication venue: Public Library of Science
Publication date: 16/05/2012
Field of study

BACKGROUND: Alternative splicing across exon 11 produces several BRCA1 isoforms. Their proportion varies during the cell cycle, between tissues and in cancer suggesting functional importance of BRCA1 splicing regulation around this exon. Although the regulatory elements driving exon 11 splicing have never been identified, a selective constraint against synonymous substitutions (silent nucleotide variations that do not alter the amino acid residue sequence) in a critical region of BRCA1 exon 11 has been reported to be associated with the necessity to maintain regulatory sequences. METHODOLOGY/PRINCIPAL FINDINGS: Here we have designed a specific minigene to investigate the possibility that this bias in synonymous codon usage reflects the need to preserve the BRCA1 alternative splicing program. We report that in-frame deletions and translationally silent nucleotide substitutions in the critical region affect splicing regulation of BRCA1 exon 11. CONCLUSIONS/SIGNIFICANCE: Using a hybrid minigene approach, we have experimentally validated the hypothesis that the need to maintain correct alternative splicing is a selective pressure against translationally silent sequence variations in the critical region of BRCA1 exon 11. Identification of the trans-acting factors involved in regulating exon 11 alternative splicing will be important in understanding BRCA1-associated tumorigenesis

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute