Search CORE

122 research outputs found

Genome-Wide Profiling of H3K56 Acetylation and Transcription Factor Binding Sites in Human Adipocytes

Author: A Marson
A Subramanian
Amy P. Baumann
B Schwer
C Das
Christopher J. Donahue
CM Conboy
D Dutta
DT Odom
ED Rosen
Ernest Fraenkel
F Liang
GW Swart
H Tilg
K Orford
KD MacIsaac
KD MacIsaac
Kinyui Alice Lo
L He
L Janderova
L Qiao
Lisa S. Hayes
Marc Tjwa
Mark A. Thiede
Mary K. Bauchmann
MI Lefterova
MS Hamza
NJ Butcher
PD Thomas
PW Caton
R Nielsen
RM Cowherd
SD Westerheide
Shelley Ann G. des Etages
SM Rangwala
T Yamauchi
TS Mikkelsen
W Huang da
W Huang da
W Xie
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2010
Field of study

The growing epidemic of obesity and metabolic diseases calls for a better understanding of adipocyte biology. The regulation of transcription in adipocytes is particularly important, as it is a target for several therapeutic approaches. Transcriptional outcomes are influenced by both histone modifications and transcription factor binding. Although the epigenetic states and binding sites of several important transcription factors have been profiled in the mouse 3T3-L1 cell line, such data are lacking in human adipocytes. In this study, we identified H3K56 acetylation sites in human adipocytes derived from mesenchymal stem cells. H3K56 is acetylated by CBP and p300, and deacetylated by SIRT1, all are proteins with important roles in diabetes and insulin signaling. We found that while almost half of the genome shows signs of H3K56 acetylation, the highest level of H3K56 acetylation is associated with transcription factors and proteins in the adipokine signaling and Type II Diabetes pathways. In order to discover the transcription factors that recruit acetyltransferases and deacetylases to sites of H3K56 acetylation, we analyzed DNA sequences near H3K56 acetylated regions and found that the E2F recognition sequence was enriched. Using chromatin immunoprecipitation followed by high-throughput sequencing, we confirmed that genes bound by E2F4, as well as those by HSF-1 and C/EBPα, have higher than expected levels of H3K56 acetylation, and that the transcription factor binding sites and acetylation sites are often adjacent but rarely overlap. We also discovered a significant difference between bound targets of C/EBPα in 3T3-L1 and human adipocytes, highlighting the need to construct species-specific epigenetic and transcription factor binding site maps. This is the first genome-wide profile of H3K56 acetylation, E2F4, C/EBPα and HSF-1 binding in human adipocytes, and will serve as an important resource for better understanding adipocyte transcriptional regulation.Singapore. Agency for Science, Technology and Research (National Science Scholarship )Massachusetts Institute of Technology (Eugene Bell Career Development Chair)National Science Foundation (U.S.) (Award No. DBI-0821391)Pfizer Inc

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

Author: A Ambesi-Impiombato
A Blais
A Eto
A Subramanian
AE Kel
AG Clark
AL Lam
AM McGuire
Anat Reiner
Assif Yitzhaky
B Ren
C Kimura-Yoshida
C Plessy
C Yang
CT Harbison
D Pfeifer
D Wang
DB Allison
E Emberly
E Segal
Eytan Domany
FP Roth
GC Pipes
GC Yuan
GQ Yao
GZ Hertz
H Li
H Lodish
J Zheng
JD Hughes
JL DeRisi
JQ Ling
K Frech
K Quandt
KD MacIsaac
L Amir-Zilberstein
L Elnitski
L Marino-Ramirez
L McCue
M Ashburner
M Kellis
M Milyavsky
MA Nobrega
Mark Koudritsky
MC Frith
ML Howard
ML Whitfield
N Rajewsky
Or Zuk
P Carninci
P Carninci
P Cliften
PM Haverty
PR Buckland
R Elkon
R Liu
R Sharan
Ran Brosh
S Aerts
S Rashi-Elkeles
S Tavazoie
SJ Cooper
SJ Ho Sui
Sui Huang
U Gerland
Varda Rotter
WW Wasserman
X Xie
Y Barash
Y Benjamini
Y Benjamini
Y Tabach
Yossi Buganim
Yuval Tabach
Z Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2007
Field of study

We introduce a novel method to screen the promoters of a set of genes with shared biological function, against a precompiled library of motifs, and find those motifs which are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold, independently for every location window (measured with respect to the TSS), taking into account the location dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high throughput analysis, searching the promoters (from 200bp downstream to 1000bp upstream the TSS), of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location specific search method with conservation between human and mouse, we are able to identify reliably functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast.Comment: 31 pages, including Supplementary Information and figure

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Cancer somatic mutations cluster in a subset of regulatory sites predicted from the ENCODE data

Author: A Mortazavi
A Pohl
A Visel
AP Boyle
C Melton
David R. Westhead
DK Goode
FW Huang
J Ernst
JA Wamstad
JH Friedman
JR Landry
KD MacIsaac
M. S. Vijayabaskar
MB Gerstein
MS Lawrence
N Weinhold
Nisar A. Shar
NJ Fredriksson
PA Futreal
RE Thurman
RS Hansen
S Djebali
SA Forbes
TH Rabbitts
WJ Kent
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: Transcriptional regulation of gene expression is essential for cellular differentiation and function, and defects in the process are associated with cancer. The ENCODE project has mapped potential regulatory sites across the complete genome in many cell types, and these regions have been shown to harbour many of the somatic mutations that occur in cancer cells, suggesting that their effects may drive cancer initiation and development. The ENCODE data suggests a very large number of regulatory sites, and methods are needed to identify those that are most relevant and to connect them to the genes that they control. Methods: Predictive models of gene expression were developed by integrating the ENCODE data for regulation, including transcription factor binding and DNase1 hypersensitivity, with RNA-seq data for gene expression. A penalized regression method was used to identify the most predictive potential regulatory sites for each transcript. Known cancer somatic mutations from the COSMIC database were mapped to potential regulatory sites, and we examined differences in the mapping frequencies associated with sites chosen in regulatory models and other (rejected) sites. The effects of potential confounders, for example replication timing, were considered. Results: Cancer somatic mutations preferentially occupy those regulatory regions chosen in our models as most predictive of gene expression. Conclusion: Our methods have identified a significantly reduced set of regulatory sites that are enriched in cancer somatic mutations and are more predictive of gene expression. This has significance for the mechanistic interpretation of cancer mutations, and the understanding of genetic regulation

Crossref

Springer - Publisher Connector

PubMed Central

White Rose Research Online

The Francis Crick Institute

Network deconvolution as a general method to distinguish direct dependencies in networks

Author: A de la Fuente
A Greenfield
A Hartemink
A Pinna
A Seth
AA Margolin
AC Haury
AJ Butte
BG Giraud
CJ Quinn
CK Hemelrijk
D Altschuh
D di Bernardo
D Jones
D Marbach
D Marbach
Daniel Marbach
DFT Veiga
DJ Reiss
DS Marks
DS Marks
E Neher
F Morcos
J Tang
JJ Faith
KD MacIsaac
L Burger
M Ding
M Ekeberg
M Granovetter
M Weigt
Manolis Kellis
MEJ Newman
MEJ Newman
MEJ Newman
MJ Wainwright
Muriel Médard
N Friedman
N Friedman
N Meinshausen
NE Friedkin
R Bonneau
R De Smet
R Koetter
R Küffner
R Sharan
S Gama-Castro
S Lapedes A
SD Dunn
Soheil Feizi
T Nugent
TA Hopf
U Göbel
VA Huynh-Thu
X Shi
X Song
Z Bar-Joseph
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Recognizing direct relationships between variables connected in a network is a pervasive problem in biological, social and information sciences as correlation-based networks contain numerous indirect relationships. Here we present a general method for inferring direct effects from an observed correlation matrix containing both direct and indirect effects. We formulate the problem as the inverse of network convolution, and introduce an algorithm that removes the combined effect of all indirect paths of arbitrary length in a closed-form solution by exploiting eigen-decomposition and infinite-series sums. We demonstrate the effectiveness of our approach in several network applications: distinguishing direct targets in gene expression regulatory networks; recognizing directly interacting amino-acid residues for protein structure prediction from sequence alignments; and distinguishing strong collaborations in co-authorship social networks using connectivity information alone. In addition to its theoretical impact as a foundational graph theoretic tool, our results suggest network deconvolution is widely applicable for computing direct dependencies in network science across diverse disciplines.National Institutes of Health (U.S.) (grant R01 HG004037)National Institutes of Health (U.S.) (grant HG005639)Swiss National Science Foundation (Fellowship)National Science Foundation (U.S.) (NSF CAREER Award 0644282

SOX2 Co-Occupies Distal Enhancer Elements with Distinct POU Factors in ESCs and NPCs to Specify Cell State

Author: A Gritti
A Marson
A Postigo
A Rada-Iglesias
A Remenyi
A Reményi
A Taguchi
AA Avilion
AE Kiernan
Albert W. Cheng
AM Ashique
B Andersen
C Beard
C Buecker
C Gontan
C Plachez
C Rochette-Egly
CE Campbell
Christopher W. Ng
CKL Ng
CT Ong
CY McLean
D Baas
D Liber
D Michel
DA Kleinjan
DB Gordon
DC Ambrosetti
DC Ambrosetti
DC Williams Jr
DLC Van Den Berg
E Engelen
E Lujan
E Tzatzalos
E Wingender
Ernest Fraenkel
ET Domyan
G Steele-Perkins
GE Zentner
Gregory S. Barsh
H Cui
H Cui
H Kondoh
H Kondoh
H Suh
H Yuan
HC Lai
HP Ostendorff
I Chambers
I Chambers
J Collignon
J Ernst
J Ghislain
J Holmberg
J Jiang
J Kim
J Liang
J Shin
J Wang
J Yang
JA Wamstad
JB Kim
JC Rochet
JL Chew
Joseph A. Wamstad
K Arnold
K Hochedlinger
K Kuhlbrodt
K Mitsui
K Okamoto
K Phillips
K Takahashi
K Takahashi
KD Macisaac
Kevin K. Thai
L Dailey
L das Neves
L Ferraris
LA Boyer
Laurie A. Boyer
LC Andreae
LM Staudt
M Bani-Yaghoub
M Bergsland
M Bulger
M Bylund
M Friedli
M Iwafuchi-Doi
M Nishimoto
M Piper
M Salmon-Divon
M Wernig
MF Cole
MF Rose
MH Sham
Michael A. Lodato
MJ O'Hare
MP Creyghton
MS Sundrud
N Le
N Sugo
N Yasuhara
ND Heintzman
ND Heintzman
OV Taranova
P Grabowski
PJ Blackshear
PL Boutz
QL Ying
R Catena
R Favaro
R Jauch
R Wu
RA Young
RJ McEvilly
Rudolf Jaenisch
S Alcantara
S Bilodeau
S Masui
S Miyagi
S Miyagi
S Okabe
S Reiprich
S Schneider-Maunoury
S Tanaka
S Temple
S Temple
S Verma-Kurvari
T Honda
T Okubo
TI Lee
TK Nowling
TL Bailey
TS Mikkelsen
V Botquin
V Graham
V Tropepe
X Chen
X He
X Qian
Y Buganim
Y Hara
Y Kamachi
Y Kamachi
Y Nakatake
Y Sano
Y Sugitani
YH Loh
Z Jin
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/06/2012
Field of study

SOX2 is a master regulator of both pluripotent embryonic stem cells (ESCs) and multipotent neural progenitor cells (NPCs); however, we currently lack a detailed understanding of how SOX2 controls these distinct stem cell populations. Here we show by genome-wide analysis that, while SOX2 bound to a distinct set of gene promoters in ESCs and NPCs, the majority of regions coincided with unique distal enhancer elements, important cis-acting regulators of tissue-specific gene expression programs. Notably, SOX2 bound the same consensus DNA motif in both cell types, suggesting that additional factors contribute to target specificity. We found that, similar to its association with OCT4 (Pou5f1) in ESCs, the related POU family member BRN2 (Pou3f2) co-occupied a large set of putative distal enhancers with SOX2 in NPCs. Forced expression of BRN2 in ESCs led to functional recruitment of SOX2 to a subset of NPC-specific targets and to precocious differentiation toward a neural-like state. Further analysis of the bound sequences revealed differences in the distances of SOX and POU peaks in the two cell types and identified motifs for additional transcription factors. Together, these data suggest that SOX2 controls a larger network of genes than previously anticipated through binding of distal enhancers and that transitions in POU partner factors may control tissue-specific transcriptional programs. Our findings have important implications for understanding lineage specification and somatic cell reprogramming, where SOX2, OCT4, and BRN2 have been shown to be key factors

CiteSeerX

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

c-REDUCE: Incorporating sequence conservation to detect motifs that correlate with expression

Author: A Stathopoulos
BC Foat
CT Harbison
D Cora
D Das
EM Conlon
Hao Li
HJ Bussemaker
JD Hughes
JD Thompson
Katerina Kechris
KD MacIsaac
KD MacIsaac
LD Ward
M Kellis
M Markstein
M Markstein
MW Gaunt
O Elemento
P Cliften
R Siddharthan
R Wu
S Keles
SJ Ho Sui
T Barrett
T Wang
W Zhong
WJ Kent
WW Wasserman
X Cai
X Li
X Liu
Y Kawahara
Y Liu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Computational methods for characterizing novel transcription factor binding sites search for sequence patterns or "motifs" that appear repeatedly in genomic regions of interest. Correlation-based motif finding strategies are used to identify motifs that correlate with expression data and do not rely on promoter sequences from a pre-determined set of genes. Results In this work, we describe a method for predicting motifs that combines the correlation-based strategy with phylogenetic footprinting, where motifs are identified by evaluating orthologous sequence regions from multiple species. Our method, c-REDUCE, can account for variability at a motif position inferred from evolutionary information. c-REDUCE has been tested on ChIP-chip data for yeast transcription factors and on gene expression data in <it>Drosophila</it>. Conclusion Our results indicate that utilizing sequence conservation information in addition to correlation-based methods improves the identification of known motifs.</p

Crossref

Directory of Open Access Journals

PubMed Central

Linking Proteomic and Transcriptional Data through the Interactome and Epigenome Reveals a Map of Oncogene-induced Signaling

Author: A Ceol
A Gaulton
A Ghazalpour
A Lan
AB Heimberger
Adam Labadorf
AE Kel
B Aranda
B Hanstein
B Langmead
B Mukherjee
B Schwanhäusser
BC Foat
BC Foat
BC Foat
C Kim
C Knox
C Liu
C Ritz
C Stark
C-L Tso
Candace R. Chouinard
CD Andl
CE Pelloski
CM Klinge
CM-E Sauvageot
CS Ross-Innes
CT Harbison
D Guo
D Hanahan
D Hanahan
D Yin
David C. Clarke
DB Ramnarain
Douglas A. Lauffenburger
DP Schunemann
DT Odom
E Cerami
E Eden
E Galanis
E Lee
E Lundberg
E Yeger-Lotem
ER Levin
Ernest Fraenkel
F Markowetz
F Yamoutpour
G Cuellar Partida
G Ling
GC Kabat
GD Bader
GK Smyth
H Dong
H Johnson
H Shao
H-W Lo
HI Robins
HS Huang
I Ljubić
I Thiele
I Ulitsky
IY Eyüpoglu
JM Gil
JR Hesselberth
JS Lewis-Wambi
JV Olsen
KD MacIsaac
KH Emami
KV Lu
L Björnström
L Choy
LJ Zhu
M Bansal
M Lepourcelet
MD Robinson
MJ Clark
MM Feldkamp
MS Carro
MW Pedersen
MW Pedersen
N de la Iglesia
P Flicek
P Hallock
P Pu
P-C Leow
PH Huang
PJ Sabo
Q Li
R Bonavia
R Chen
R Kalluri
R Nishikawa
R Pique-Regi
R Schiff
R Zeineldin
RGW Verhaak
RH Shoemaker
RM Hallett
RM Myers
S Bamford
S Imarisio
S Kerrien
S Razick
S Schinner
S-SC Huang
SA Prigent
Sara J. C. Gosline
Shao-shan Carol Huang
SP Panicker
SZ Usmani
T Nagashima
T Takano
TS Keshava Prasad
V Matys
V Milano
W Couldwell
W Lu
W Wei
W Wick
William Gordon
William Stafford Noble
X Liu
Y Benjamini
Y Narita
Y Ning
Y Wang
Y Zhang
Z Wu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/03/2012
Field of study

Cellular signal transduction generally involves cascades of post-translational protein modifications that rapidly catalyze changes in protein-DNA interactions and gene expression. High-throughput measurements are improving our ability to study each of these stages individually, but do not capture the connections between them. Here we present an approach for building a network of physical links among these data that can be used to prioritize targets for pharmacological intervention. Our method recovers the critical missing links between proteomic and transcriptional data by relating changes in chromatin accessibility to changes in expression and then uses these links to connect proteomic and transcriptome data. We applied our approach to integrate epigenomic, phosphoproteomic and transcriptome changes induced by the variant III mutation of the epidermal growth factor receptor (EGFRvIII) in a cell line model of glioblastoma multiforme (GBM). To test the relevance of the network, we used small molecules to target highly connected nodes implicated by the network model that were not detected by the experimental data in isolation and we found that a large fraction of these agents alter cell viability. Among these are two compounds, ICG-001, targeting CREB binding protein (CREBBP), and PKF118–310, targeting β-catenin (CTNNB1), which have not been tested previously for effectiveness against GBM. At the level of transcriptional regulation, we used chromatin immunoprecipitation sequencing (ChIP-Seq) to experimentally determine the genome-wide binding locations of p300, a transcriptional co-regulator highly connected in the network. Analysis of p300 target genes suggested its role in tumorigenesis. We propose that this general method, in which experimental measurements are used as constraints for building regulatory networks from the interactome while taking into account noise and missing data, should be applicable to a wide range of high-throughput datasets.National Science Foundation (U.S.) (DB1-0821391)National Institutes of Health (U.S.) (Grant U54-CA112967)National Institutes of Health (U.S.) (Grant R01-GM089903)National Institutes of Health (U.S.) (P30-ES002109

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

The value of position-specific priors in motif discovery using MEME

Author: BC Foat
CT Harbison
DC Bauer
E Redhead
F Fang
FA Buske
GD Stormo
GZ Hertz
KD MacIsaac
L Narlikar
L Narlikar
MC Frith
Mikael Bodén
Philip Machanick
R Gordân
R Siddharthan
RC McLeay
S Sinha
Timothy L Bailey
TL Bailey
TL Bailey
TL Bailey
Tom Whitington
V Matys
WH Kruskal
WJ Kent
X Chen
Y Barash
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Position-specific priors have been shown to be a flexible and elegant way to extend the power of Gibbs sampler-based motif discovery algorithms. Information of many types–including sequence conservation, nucleosome positioning, and negative examples–can be converted into a prior over the location of motif sites, which then guides the sequence motif discovery algorithm. This approach has been shown to confer many of the benefits of conservation-based and discriminative motif discovery approaches on Gibbs sampler-based motif discovery methods, but has not previously been studied with methods based on expectation maximization (EM). Results We extend the popular EM-based MEME algorithm to utilize position-specific priors and demonstrate their effectiveness for discovering transcription factor (TF) motifs in yeast and mouse DNA sequences. Utilizing a discriminative, conservation-based prior dramatically improves MEME's ability to discover motifs in 156 yeast TF ChIP-chip datasets, more than doubling the number of datasets where it finds the correct motif. On these datasets, MEME using the prior has a higher success rate than eight other conservation-based motif discovery approaches. We also show that the same type of prior improves the accuracy of motifs discovered by MEME in mouse TF ChIP-seq data, and that the motifs tend to be of slightly higher quality those found by a Gibbs sampling algorithm using the same prior. Conclusions We conclude that using position-specific priors can substantially increase the power of EM-based motif discovery algorithms such as MEME algorithm.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DeBi: Discovering Differentially Expressed Biclusters using a Frequent Itemset Approach

Author: A Ben-Dor
A Prelic
A Rosenwald
A Tanay
AD Basehoar
Akdes Serin
B Andreopoulos
BKH Chia
CT Harbison
D Burdick
DR Ciocca
G Li
GA Grothaus
J Lamb
JA Hartigan
JA Hartigan
JL Jensen
JN Keller
KD MacIsaac
Martin Vingron
R Shamir
RR Sokal
S Barkow
S Bergmann
S Hochreiter
SC Madeira
TM Murali
TR Hughes
XG Ni
Y Cheng
Y Hoshida
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The analysis of massive high throughput data via clustering algorithms is very important for elucidating gene functions in biological systems. However, traditional clustering methods have several drawbacks. Biclustering overcomes these limitations by grouping genes and samples simultaneously. It discovers subsets of genes that are co-expressed in certain samples. Recent studies showed that biclustering has a great potential in detecting marker genes that are associated with certain tissues or diseases. Several biclustering algorithms have been proposed. However, it is still a challenge to find biclusters that are significant based on biological validation measures. Besides that, there is a need for a biclustering algorithm that is capable of analyzing very large datasets in reasonable time. Results Here we present a fast biclustering algorithm called DeBi (Differentially Expressed BIclusters). The algorithm is based on a well known data mining approach called frequent itemset. It discovers maximum size homogeneous biclusters in which each gene is strongly associated with a subset of samples. We evaluate the performance of DeBi on a yeast dataset, on synthetic datasets and on human datasets. Conclusions We demonstrate that the DeBi algorithm provides functionally more coherent gene sets compared to standard clustering or biclustering algorithms using biological validation measures such as Gene Ontology term and Transcription Factor Binding Site enrichment. We show that DeBi is a computationally efficient and powerful tool in analyzing large datasets. The method is also applicable on multiple gene expression datasets coming from different labs or platforms.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

MPG.PuRe

A Bayesian Partition Method for Detecting Pleiotropic and Epistatic eQTL Modules

Author: A Colman-Lerner
A Manichaikul
AC Cervino
AH Enyenihi
BM Bolstad
C Jiang
CJ Geyer
CM Kendziorski
CY Wu
D Mangin
EE Schadt
EE Schadt
EE Schadt
Eric E. Schadt
ES Lander
G Yvert
Gary D. Stormo
J Ronald
J Zhu
JD Storey
JS Liu
Jun S. Liu
Jun Zhu
KD MacIsaac
M Morley
N Yi
PJ Green
RB Brem
RB Brem
RB Brem
SI Lee
TR Hughes
V Emilsson
W Zou
Wei Zhang
Y Chen
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Studies of the relationship between DNA variation and gene expression variation, often referred to as “expression quantitative trait loci (eQTL) mapping”, have been conducted in many species and resulted in many significant findings. Because of the large number of genes and genetic markers in such analyses, it is extremely challenging to discover how a small number of eQTLs interact with each other to affect mRNA expression levels for a set of co-regulated genes. We present a Bayesian method to facilitate the task, in which co-expressed genes mapped to a common set of markers are treated as a module characterized by latent indicator variables. A Markov chain Monte Carlo algorithm is designed to search simultaneously for the module genes and their linked markers. We show by simulations that this method is more powerful for detecting true eQTLs and their target genes than traditional QTL mapping methods. We applied the procedure to a data set consisting of gene expression and genotypes for 112 segregants of S. cerevisiae. Our method identified modules containing genes mapped to previously reported eQTL hot spots, and dissected these large eQTL hot spots into several modules corresponding to possibly different biological functions or primary and secondary responses to regulatory perturbations. In addition, we identified nine modules associated with pairs of eQTLs, of which two have been previously reported. We demonstrated that one of the novel modules containing many daughter-cell expressed genes is regulated by AMN1 and BPH1. In conclusion, the Bayesian partition method which simultaneously considers all traits and all markers is more powerful for detecting both pleiotropic and epistatic effects based on both simulated and empirical data

CiteSeerX

Public Library of Science (PLOS)

Crossref

Harvard University - DASH

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute