Use of partial least squares regression to impute SNP genotypes in Italian Cattle breeds

Author: AJ Chamberlain
APW de Roos
BJ Hayes
BL Browning
C Dimauro
C Hagger
Corrado Dimauro
D Boichard
D Segelke
DP Berry
G Li
G Moser
Gabriele Marras
GCB Schopen
Giustino Gaspa
H Abdi
HA Mulder
HD Daetwyler
I Medugorac
J Chen
JE Pryce
JM Hickey
K Kizilkaya
KA Weigel
Massimo Cellesi
Nicolò PP Macciotta
P Ajmone-Marsan
P Scheet
Paolo Ajmone-Marsan
PM VanRaden
R Dassonneville
Roberto Steri
T Druet
TH Meuwissen
Z Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013

Background: The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low-density single nucleotide polymorphism (SNP) panels, i.e. 3K or 7K, to a high-density panel with 50K SNP. No pedigree information was used. Methods: Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. Results: In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90% and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than that required by Beagle. Using the partial least squares regression method in the multi-breed approach resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. Conclusions: Results of the present work suggest that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available.

CiteSeerX

Springer - Publisher Connector

PubliCatt

CINECA IRIS Institutional research information system UNISS

UnissResearch

Reconstructing Druze population history

Author: A Nebel
A South
A Yardumian
AL Leutenegger
B Charlesworth
B Yunusbayev
BL Browning
D Graur
D Tarkhnishvili
DH Alexander
DM Behar
E Elhaik
E Leitersdorf
G Hellenthal
I Lazaridis
J Hey
J Novembre
J Zidan
J Zlotogora
JA Hodgson
JD Cristofaro
JR Kyllingstad
JT Prchal
JZ Li
LI Shlush
M Haber
M Mezzavilla
M Reidla
M Sprengling
N Cohen
P Klein
P Shen
PR Loh
R Vardi-Saliternik
RL Cann
RM Durbin
S Haddad
S Purcell
TJ Pemberton
U Hodoglugil
V Lipphardt
V Macaulay
WY Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/11/2016

The Druze are an aggregate of communities in the Levant and Near East living almost exclusively in the mountains of Syria, Lebanon and Israel whose ~1000-year-old religion formally opposes mixed marriages and conversions. Despite increasing interest in genetics of the population structure of the Druze, their population history remains unknown. We investigated the genetic relationships between Israeli Druze and both modern and ancient populations. We evaluated our findings in light of three hypotheses purporting to explain Druze history that posit Arabian, Persian or mixed Near Eastern-Levantine roots. The biogeographical analysis localised proto-Druze to the mountainous regions of southeastern Turkey, northern Iraq and southeast Syria and their descendants clustered along a trajectory between these two regions. The mixed Near Eastern-Middle Eastern localisation of the Druze, shown using both modern and ancient DNA data, is distinct from that of neighbouring Syrians, Palestinians and most of the Lebanese, who exhibit a high affinity to the Levant. Druze biogeographic affinity, migration patterns, time of emergence and genetic similarity to Near Eastern populations are highly suggestive of Armenian-Turkish ancestries for the proto-Druze.

Lund University Publications

White Rose Research Online

Genomics of Divergence along a Continuum of Parapatric Population Differentiation

MM received funding from the Max Planck innovation funds for this project. PGDF was supported by a Marie Curie European Reintegration Grant (proposal nr 270891). CE was supported by German Science Foundation grants (DFG, EI 841/4-1 and EI 841/6-1).

OceanRep

Queen Mary Research Online

Bern Open Repository and Information System (BORIS)

MPG.PuRe

The Francis Crick Institute

Genome-wide linkage analysis of 972 bipolar pedigrees using single-nucleotide polymorphisms.

Author: A Kong
A MacLean
A Serretti
AE Baum
AJ Jasinska
AL Price
BK Suarez
BL Browning
C-Y Liu
D Blackwood
D Grozeva
D Koller
D Lambert
D Morris
D Sadovnick
D Zhang
DH Blackwood
DL Pauls
E Green
E S Gershon
EM Wijsman
EN Smith
ER Hauser
ET Cirulli
F J McMahon
F Mathieu
G J Lyon
G Kirov
GR Abecasis
H Coon
H Edenberg
HJ Edenberg
HM Ollila
I Jones
J A Badner
J B Potash
J Fan
J I Nurnberger
J Kelsoe
J Ross
JI Nurnberger
JM Ekholm
JPA Ioannidis
JR Kelsoe
JR O’Connell
KA Frazer
KR Merikangas
KY Liang
L Jones
LJ Adams
LJ Scott
M Ayub
M Gill
M Hamshere
MA Spence
MAR Ferreira
MB McQueen
MS McPeek
N Craddock
P Keck
P P Zandi
P Sklar
PP Zandi
R Remick
R Robinson
R Segurado
S Macgregor
S Purcell
SA Bacanu
T Foroud
T Venken
TC Matise
V L Willour
W Byerley
W McMahon
WH Berrettini
WTCC Consortium
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/12/2011

Because of the high costs associated with ascertainment of families, most linkage studies of Bipolar I disorder (BPI) have used relatively small samples. Moreover, the genetic information content reported in most studies has been less than 0.6. Although microsatellite markers spaced every 10 cM typically extract most of the genetic information content for larger multiplex families, they can be less informative for smaller pedigrees, especially for affected sib pair kindreds. For these reasons we collaborated to pool family resources and carried out higher density genotyping. Approximately 1100 pedigrees of European ancestry were initially selected for study and were genotyped by the Center for Inherited Disease Research using the Illumina Linkage Panel 12 set of 6090 single-nucleotide polymorphisms. Of the ~1100 families, 972 were informative for further analyses, and mean information content was 0.86 after pruning for linkage disequilibrium. The 972 kindreds include 2284 cases of BPI disorder, 498 individuals with bipolar II disorder (BPII) and 702 subjects with recurrent major depression. Three affection status models (ASMs) were considered: ASM1 (BPI and schizoaffective disorder, BP cases (SABP) only), ASM2 (ASM1 cases plus BPII) and ASM3 (ASM2 cases plus recurrent major depression). Both parametric and non-parametric linkage methods were carried out. The strongest findings occurred at 6q21 (non-parametric pairs logarithm of odds (LOD) 3.4 for rs1046943 at 119 cM) and 9q21 (non-parametric pairs LOD 3.4 for rs722642 at 78 cM) using only BPI and schizoaffective (SA), BP cases. Both results met genome-wide significance criteria, although neither was significant after correction for multiple analyses. We also inspected parametric scores for the larger multiplex families to identify possible rare susceptibility loci. In this analysis, we observed 59 parametric LODs of 2 or greater, many of which are likely to be close to maximum possible scores.
Although some linkage findings may be false positives, the results could help prioritize the search for rare variants using whole exome or genome sequencing.

ScholarWorks IU Indianapolis

Durham Research Online

Online Research @ Cardiff

Research Repository UCD

Cold Spring Harbor Laboratory Institutional Repository

Irish Universities

Edinburgh Research Explorer

The geography of recent genetic ancestry across Europe

Author: A Albrechtsen
A Auton
A Gillett
A Gusev
A Keller
A Zeileis
AE Hoerl
AL Price
AM Stuart
B Winney
BL Browning
BM Henn
C Tyler-Smith
CD Huff
Chris Tyler-Smith
CL Epstein
CT O'Dushlaine
DJ Lawson
DLT Rohde
E Jakkula
F Rousset
G McVean
Graham Coop
H Li
J Chang
J Novembre
JA Tennessen
JE Pool
JE Powell
JFC Kingman
JK Gusev Lowe
JN Fenner
K Harris
KA Frazer
KP Donnelly
M Slatkin
MD Brown
MR Nelson
N Patterson
N Takahata
NH Chapman
O Lao
P Menozzi
P Moorjani
P Skoglund
P Soares
Peter Ralph
PF Palamara
R Hudson
RA Fisher
RL Cann
S Carmi
S Giglio
S Gravel
S Purcell
Y Petrov
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 07/05/2013

The recent genealogical history of human populations is a complex mosaic formed by individual migration, large-scale population movements, and other demographic events. Population genomics datasets can provide a window into this recent history, as rare traces of recent shared genetic ancestry are detectable due to long segments of shared genomic material. We make use of genomic data for 2,257 Europeans (the POPRES dataset) to conduct one of the first surveys of recent genealogical ancestry over the past three thousand years at a continental scale. We detected 1.9 million shared genomic segments, and used the lengths of these to infer the distribution of shared ancestors across time and geography. We find that a pair of modern Europeans living in neighboring populations share around 10-50 genetic common ancestors from the last 1500 years, and upwards of 500 genetic ancestors from the previous 1000 years. These numbers drop off exponentially with geographic distance, but since genetic ancestry is rare, individuals from opposite ends of Europe are still expected to share millions of common genealogical ancestors over the last 1000 years. There is substantial regional variation in the number of shared genetic ancestors: especially high numbers of common ancestors between many eastern populations likely date to the Slavic and/or Hunnic expansions, while much lower levels of common ancestry in the Italian and Iberian peninsulas may indicate weaker demographic effects of Germanic expansions into these areas and/or more stably structured populations. Recent shared ancestry in modern Europeans is ubiquitous, and clearly shows the impact of both small-scale migration and large historical events. 
Population genomic datasets have considerable power to uncover recent demographic history, and will allow a much fuller picture of the close genealogical kinship of individuals across the world.
Comment: Full-size figures available from http://www.eve.ucdavis.edu/~plralph/research.html; or HTML version at http://ralphlab.usc.edu/ibd/ibd-paper/ibd-writeup.xhtm

arXiv.org e-Print Archive

The Francis Crick Institute

Breeding histories and selection criteria for oilseed rape in Europe and China identified by genome wide pedigree dissection

Author: A Albrechtsen
AG Sharpe
AJ Klassen
B Chalhoub
BL Browning
C Jiang
D Qiu
EM Blue
F Curk
F Li
F Tajima
G Thomson
H Liu
I Bancroft
J Feng
J Lai
J Wang
J Zhang
JM Smith
JP Lynch
K Liu
L Han
L Li
L Shi
M Kimura
M Nei
M Slatkin
M Yang
MA Lodhi
N Wang
P Librado
PA Quijada
PJ White
R Augustine
S Liu
SI Wright
SR Browning
TC Osborn
W Lukowitz
WE Clarke
X Xu
Y Long
Y Xiao
Y Zhang
YB Fu
Z Cao
Z Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/05/2017

Selection breeding has played a key role in the improvement of seed yield and quality in oilseed rape (Brassica napus L.). We genotyped Tapidor (European), Ningyou7 (Chinese) and their progenitors with the Brassica 60K Illumina Infinium SNP array and mapped a total of 29,347 SNP markers onto the reference genome of Darmor-bzh. Identity by descent (IBD) refers to a haplotype segment of a chromosome inherited from a shared common ancestor. IBDs identified on the C subgenome were larger than those on the A subgenome within both the Tapidor and Ningyou7 pedigrees. IBD number and length were greater in the Ningyou7 pedigree than in the Tapidor pedigree. Seventy-nine QTLs for flowering time, seed quality and root morphology traits were identified in the IBDs of Tapidor and Ningyou7. Many more candidate genes had been selected within the Ningyou7 pedigree than within the Tapidor pedigree. These results highlight differences in the transfer of favorable gene clusters controlling key traits during selection breeding in Europe and China.

Nottingham ePrints

Nottingham eTheses

Repository@Nottingham

Genetic and environmental transactions underlying the association between physical fitness/physical exercise and body composition

Author: A Skytthe
AE Raftery
AJ Stunkard
Aja L. Murray
AL Hasselbalch
B Benyamin
BE Ainsworth
BL Heitmann
CA Hulle van
CM Lindgren
D Singh
DS Falconer
E Turkheimer
F Marlowe
H Akaike
HDI Abarbanel
HJ Schneider
I Janssen
Ingrid de Ruiter
JM McCaffery
K Schoesboe
K Schousboe
K Silventoinen
Kirsten Ohm Kyvik
KM Flegal
M Fogelholm
M Hoed den
M McGue
MB Haslam
MC Neale
ML Browning
N Karnehad
National Institute of Health
Organization for Economic Cooperation and Development
PJ Rathouz
PT Williams
R Kruschitz
RJ Little
RP Shook
S Ahmad
S Purcell
SA French
SE Medland
T Rankinen
Thorkild I. A. Sørensen
TI Sorensen
TO Kilpelainen
V Heywood
W Johnson
Wendy Johnson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 31/10/2014

We examined mean effects and variance moderating effects of measures of physical activity and fitness on six measures of adiposity and their reciprocal effects in a subsample of the population-representative Danish Twin Registry. Consistent with prior studies, higher levels of physical activity suppressed variance in adiposity, but this study provided further insight. Variance suppression appeared to have both genetic and environmental pathways. Some mean effects appeared due to reciprocal influences of environmental circumstances differing among families but not between co-twins, suggesting these reciprocal effects are uniform. Some variance moderating effects also appeared due to biases in individual measures of adiposity, as well as to differences and inaccuracies in measures of physical activity. This suggests a need to avoid reliance on single measures of both physical activity and adiposity in attempting to understand the pathways involved in their linkages, and constraint in interpreting results if only single measures are available. Future research indications include identifying which physical activity-related environmental circumstances have relatively uniform effects on adiposity in everyone, and which should be individually tailored to maximize motivation to continue involvement.

Copenhagen University Research Information System

Edinburgh Research Explorer

Syddansk Universitets Forskerportal

Reconstructing Roma History from Genome-Wide Data

Author: A Gusev
A Gusmão
B Morar
Bela I. Melegh
BL Browning
Bonnie Berger
Béla Melegh
D Altshuler
D Gresham
D Reich
David Reich
DH Alexander
DL Herráez
F Busing
H Pamjav
I Mendizabal
J Fenner
J Li
L Kalaydjieva
M Nelson
M Regueiro
Mark Lipson
Michael Bonin
Michael D. Petraglia
N Patterson
N Rai
Nick Patterson
Olaf Rieß
P Moorjani
Po-Ru Loh
Priya Moorjani
Péter Kisfali
S Purcell
TG Schurr
Ľudevít Kádaši
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/11/2012

The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry, derived from a combination of European and South Asian sources, and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe.
Országos Tudományos Kutatási Alapprogramok (OTKA K 103983); Országos Tudományos Kutatási Alapprogramok (OTKA 73430); National Science Foundation (U.S.) (HOMINID grant 1032255); National Institutes of Health (U.S.) (grant GM100233

arXiv.org e-Print Archive

Public Library of Science (PLOS)

DSpace@MIT

Harvard University - DASH

eScholarship - University of California

The Francis Crick Institute

Rapid haplotype inference for nuclear families

Author: A Kong
AL Williams
AM Andrés
Amy L Williams
BL Browning
BN Howie
David E Housman
David K Gifford
DF Gudbjartsson
ES Lander
G Coop
G Gao
GR Abecasis
J Gayán
J Li
J Marchini
JE Wigginton
JR O'Connell
K Doi
K Markianos
L Kruglyak
M Fishelson
M Fujita
M Stephens
Martin C Rinard
P Scheet
PC Sabeti
S Lin
SR Browning
T Niu
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

Hapi is a new dynamic programming algorithm that ignores uninformative states and state transitions in order to efficiently compute minimum-recombinant and maximum likelihood haplotypes. When applied to a dataset containing 103 families, Hapi performs 3.8 and 320 times faster than state-of-the-art algorithms. Because Hapi infers both minimum-recombinant and maximum likelihood haplotypes and applies to related individuals, the haplotypes it infers are highly accurate over extended genomic distances.
National Institutes of Health (U.S.) (NIH grant 5-T90-DK070069); National Institutes of Health (U.S.) (Grant 5-P01-NS055923); National Science Foundation (U.S.) (Graduate Research Fellowship

CiteSeerX

DSpace@MIT

Springer - Publisher Connector