Search CORE

298 research outputs found

Diagnostic applications of next generation sequencing: working towards quality standards

Author: Adey
Andrea Gehring
Anna Benet-Pagès
Bainbridge
Bentley
Carsten Bergmann
Clark
Clement
Ding
Dohm
Gerlinger
Gregory
Greif
Greif
Hanno Jörn Bolz
Hanns-Georg Klein
Harismendy
Ina Vogl
Jiang
Johansson
Kaimo Hirv
Kalari
Klaus H. Metzeler
Koboldt
Li
Lister
Loman
Mamanova
Manfred Stuhrmann
Mardis
Marius Kuhn
Mertes
Meyerson
Minoche
Nakamura
Philipp A. Greif
Robinson
Rothberg
Saskia Biskup
Sebastian H. Eck
Shendure
Stefan Kotschote
Stratton
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2012
Field of study

Over the past 6 years, next generation sequencing (NGS) has been established as a valuable high-throughput method for research in molecular genetics and has successfully been employed in the identification of rare and common genetic variations. All major NGS technology companies providing commercially available instruments (Roche 454, Illumina, Life Technologies) have recently marketed bench top sequencing instruments with lower throughput and shorter run times, thereby broadening the applications of NGS and opening the technology to the potential use for clinical diagnostics. Although the high expectations regarding the discovery of new diagnostic targets and an overall reduction of cost have been achieved, technological challenges in instrument handling, robustness of the chemistry and data analysis need to be overcome. To facilitate the implementation of NGS as a routine method in molecular diagnostics, consistent quality standards need to be developed. Here the authors give an overview of the current standards in protocols and workflows and discuss possible approaches to define quality criteria for NGS in molecular genetic diagnostics

Crossref

Open Access LMU ( Ludwig-Maximilians-Univ. München)

PuSH

Are sites with multiple single nucleotide variants in cancer genomes a consequence of drivers, hypermutable sites or sequencing errors?

Author: Alexandrov
Benson
Bird
Bulmer
Cooper
Derrien
Eyre-Walker
Flicek
Francioli
Fryxell
Gojobori
Harismendy
Harris
Harris
Hodgkinson
Hodgkinson
Hodgkinson
Huang
Hwang
Johnson
Karolchik
Kong
Lawrence
Liu
Lynch
Makova
Martinocorena
Michaelson
Minoche
Nachman
Nazarian
Nelder
Polak
Quail
Rosenfeld
Schrider
Schuster-Bockler
Smith
Treangen
Woo
Zhuang
Publication venue: 'PeerJ'
Publication date: 30/05/2016
Field of study

Across independent cancer genomes it has been observed that some sites have been recurrently hit by single nucleotide variants (SNVs). Such recurrently hit sites might be either (i) drivers of cancer that are postively selected during oncogenesis, (ii) due to mutation rate variation, or (iii) due to sequencing and assembly errors. We have investigated the cause of recurrently hit sites in a dataset of >3 million SNVs from 507 complete cancer genome sequences. We find evidence that many sites have been hit significantly more often than one would expect by chance, even taking into account the effect of the adjacent nucleotides on the rate of mutation. We find that the density of these recurrently hit sites is higher in non-coding than coding DNA and hence conclude that most of them are unlikely to be drivers. We also find that most of them are found in parts of the genome that are not uniquely mappable and hence are likely to be due to mapping errors. In support of the error hypothesis, we find that recurently hit sites are not randomly distributed across sequences from different laboratories. We fit a model to the data in which the rate of mutation is constant across sites but the rate of error varies. This model suggests that ∼4% of all SNVs are errors in this dataset, but that the rate of error varies by thousands-of-fold between sites

Crossref

Directory of Open Access Journals

PubMed Central

Sussex Research Online

Evaluation of genomic high-throughput sequencing data generated on Illumina HiSeq and Genome Analyzer systems

Author: Dohm Juliane C
Himmelbauer Heinz
Minoche André E
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

ABSTRACT: BACKGROUND: The generation and analysis of high-throughput sequencing data are becoming a major component of many studies in molecular biology and medical research. Illumina's Genome Analyzer (GA) and HiSeq instruments are currently the most widely used sequencing devices. Here, we comprehensively evaluate properties of genomic HiSeq and GAIIx data derived from two plant genomes and one virus, with read lengths of 95 to 150 bases. RESULTS: We provide quantifications and evidence for GC bias, error rates, error sequence context, effects of quality filtering, and the reliability of quality values. By combining different filtering criteria we reduced error rates 7-fold at the expense of discarding 12.5% of alignable bases. While overall error rates are low in HiSeq data we observed regions of accumulated wrong base calls. Only 3% of all error positions accounted for 24.7% of all substitution errors. Analyzing the forward and reverse strands separately revealed error rates of up to 18.7%. Insertions and deletions occurred at very low rates on average but increased to up to 2% in homopolymers. A positive correlation between read coverage and GC content was found depending on the GC content range. CONCLUSIONS: The errors and biases we report have implications for the use and the interpretation of Illumina sequencing data. GAIIx and HiSeq data sets show slightly different error profiles. Quality filtering is essential to minimize downstream analysis artifacts. Supporting previous recommendations, the strand-specificity provides a criterion to distinguish sequencing errors from low abundance polymorphisms

CiteSeerX

Crossref

Springer - Publisher Connector

PubMed Central

MPG.PuRe

Determining the site index of Teak (Tectona grandis L.) plantations in Tabasco, Mexico

Author: Dominguez-Dominguez Marivel
Herrero de Aza Celia
Martinez-Zurimendi Pablo
Minoche Djhon
Publication venue: Facultad de Agronomía e Ingeniería de la Pontificia Universidad Católica de Chile
Publication date: 28/08/2017
Field of study

Forest stand productivity is defined as the quantitative estimation of a specific area’s potential to produce biomass over a determined period of time. The site index has been the predominant method used to evaluate forest stand productivity. Teak is one of the most accepted species within the international timber market due to the physical and aesthetic qualities of this wood. The aim of this study was to determine the site index of teak plantations. The study was conducted in teak plantations of Tabasco. Data were obtained from a network of 10 plantations consisting of 35 plots measured over four successive inventories (2003 to 2006). The data were fitted to five models, of which four were based on proposed finite difference equations and a non-integrated function. The most suitable of the five models was chosen, taking into account the goodness of fit, the residual analysis, and the validation with a data subsample from the plantation. The Sloboda model was finally selected, and the results obtained were compared with the model proposed by Upadhyay. This model proved to be a useful tool, not only in evaluating station quality but also in improving the planning and management of teak plantations in Tabasco

Portal de Revistas UC (Pontificia Universidad Católica de Chile)

Genome sequencing of the extinct Eurasian wild aurochs, Bos primigenius, illuminates the phylogeography and evolution of cattle

Author: A Achilli
A Achilli
A Esteve-Codina
A Gotherstrom
A Seguin-Orlando
A Vaysse
A Winter
AE Minoche
AJ Amaral
Alison Murphy
Amanda J. Lohan
Andrew T. Chamberlain
AV Zimin
B Grisart
B Grisart
BP Lewis
Brendan J. Loftus
C Gamba
C Glaser
Ceiridwen J. Edwards
CG Elsik
Charles Spillane
CJ Edwards
CJ Edwards
CJ Edwards
CJ Rubin
CJ Stevens
CM Leu
CS Troy
D Reich
Daniel G. Bradley
David A. Magee
David E. MacHugh
DE MacHugh
DG Bradley
DM Larkin
DP Toews
E Palkopoulou
E Svensson
EJ McTavish
EY Durand
H Jonsson
H Jonsson
H Li
H Zhang
HD Daetwyler
J Clutton-Brock
J Diamond
J Kantanen
J Lenstra
J Schibler
JA Guerra-Assuncao
JD Vigne
JE Decker
JK Pickrell
JK Pritchard
JS Pedersen
K Prufer
KA Moutou
Kévin Rue-Albrecht
L Orlando
L Perez-Pardal
LK Matukumalli
M Gautier
M Hofreiter
M Li
M Meyer
M Raghavan
M Rasmussen
M Schubert
MA DePristo
MA Greagg
MA Groenen
Mark T. Donoghue
Martin Braud
Matthew D. Teasdale
MJ Montague
N Murakami
N Patterson
NA Rosenberg
O Smith
Paul A. McGettigan
R Bollongino
R Chen
RA Gibbs
RE Green
RE Green
RH Meadow
RR Hudson
RT Loftus
S Bonfiglio
S Bonfiglio
S Bonfiglio
S Guindon
S Koks
S Paabo
S Qanbari
S Sawyer
Shuaishuai Tai
Stephen D E Park
Steven Schroeder
Tad S. Sonstegard
TH Lee
W McLaren
Y Benjamini
Yuan Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Background Domestication of the now-extinct wild aurochs, Bos primigenius, gave rise to the two major domestic extant cattle taxa, B. taurus and B. indicus. While previous genetic studies have shed some light on the evolutionary relationships between European aurochs and modern cattle, important questions remain unanswered, including the phylogenetic status of aurochs, whether gene flow from aurochs into early domestic populations occurred, and which genomic regions were subject to selection processes during and after domestication. Here, we address these questions using whole-genome sequencing data generated from an approximately 6,750-year-old British aurochs bone and genome sequence data from 81 additional cattle plus genome-wide single nucleotide polymorphism data from a diverse panel of 1,225 modern animals. Results Phylogenomic analyses place the aurochs as a distinct outgroup to the domestic B. taurus lineage, supporting the predominant Near Eastern origin of European cattle. Conversely, traditional British and Irish breeds share more genetic variants with this aurochs specimen than other European populations, supporting localized gene flow from aurochs into the ancestors of modern British and Irish cattle, perhaps through purposeful restocking by early herders in Britain. Finally, the functions of genes showing evidence for positive selection in B. taurus are enriched for neurobiology, growth, metabolism and immunobiology, suggesting that these biological processes have been important in the domestication of cattle. Conclusions This work provides important new information regarding the origins and functional evolution of modern cattle, revealing that the interface between early European domestic populations and wild aurochs was significantly more complex than previously thought

Crossref

Springer - Publisher Connector

PubMed Central

Spiral - Imperial College Digital Repository

The University of Manchester - Institutional Repository

University of Galway Research Repository

University of Huddersfield Repository

Recommended from our members

Missed, not missing: Phylogenomic evidence for the existence of Avian FoxP3

The Forkhead box transcription factor FoxP3 is pivotal to the development and function of regulatory T cells (Tregs), which make a major contribution to peripheral tolerance. FoxP3 is believed to perform a regulatory role in all the vertebrate species in which it has been detected. The prevailing view is that FoxP3 is absent in birds and that avian Tregs rely on alternative developmental and suppressive pathways. Prompted by the automated annotation of foxp3 in the ground tit (Parus humilis) genome, we have questioned this assumption. Our analysis of all available avian genomes has revealed that the foxp3 locus is missing, incomplete or of poor quality in the relevant genomic assemblies for nearly all avian species. Nevertheless, in two species, the peregrine falcon (Falco peregrinus) and the saker falcon (F. cherrug), there is compelling evidence for the existence of exons showing synteny with foxp3 in the ground tit. A broader phylogenomic analysis has shown that FoxP3 sequences from these three species are similar to crocodilian sequences, the closest living relatives of birds. In both birds and crocodilians, we have also identified a highly proline-enriched region at the N terminus of FoxP3, a region previously identified only in mammals

Central Archive at the University of Reading

Crossref

Directory of Open Access Journals

PubMed Central

Birkbeck Institutional Research Online

The Francis Crick Institute

Genome Biology / Genome and transcriptome analysis of the Mesoamerican common bean and the role of gene duplications in establishing tissue and temporal specialization of genes

Background: Legumes are the third largest family of angiosperms and the second most important crop class. Legume genomes have been shaped by extensive large-scale gene duplications, including an approximately 58 million year old whole genome duplication shared by most crop legumes. Results: We report the genome and the transcription atlas of coding and non-coding genes of a Mesoamerican genotype of common bean (Phaseolus vulgaris L., BAT93). Using a comprehensive phylogenomics analysis, we assessed the past and recent evolution of common bean, and traced the diversification of patterns of gene expression following duplication. We find that successive rounds of gene duplications in legumes have shaped tissue and developmental expression, leading to increased levels of specialization in larger gene families. We also find that many long non-coding RNAs are preferentially expressed in germ-line-related tissues (pods and seeds), suggesting that they play a significant role in fruit development. Our results also suggest that most bean-specific gene family expansions, including resistance gene clusters, predate the split of the Mesoamerican and Andean gene pools. Conclusions: The genome and transcriptome data herein generated for a Mesoamerican genotype represent a counterpart to the genomic resources already available for the Andean gene pool. Altogether, this information will allow the genetic dissection of the characters involved in the domestication and adaptation of the crop, and their further implementation in breeding strategies for this important crop

Publikationsserver der Universitätsbibliothek Bodenkultur Wien

Population genomics reveals that within-fungus polymorphism is common and maintained in populations of the mycorrhizal fungus Rhizophagus irregularis.

Author: 1000 Genomes Project Consortium
A Colard
AE Minoche
AM Koch
AM Koch
AM Reitzel
B Börstler
B Börstler
BB Larsen
BK Peterson
C Angelard
C Angelard
C Angelard
D Cantu
D Croll
D Laehnemann
D Sanglard
D Scaglione
D Wibberg
E Boon
E Boon
E Paradis
E Tisserant
F Ronquist
Frédéric G Masclaux
G Kuhn
H Kim
H Li
I Ceballos
Ian R Sanders
IR Sanders
J Catchen
J Ropars
JI Hoffman
JK Hane
JS Paul
K Katoh
K Lin
KA Sedzielewska
KJ Emerson
L Munkvold
M Ehinger
M Hijri
M Öpik
Marco Pagni
MGA van der Heijden
MGA van der Heijden
MO Ehinger
N Corradi
N Wang
NA Baird
PA Hohenlohe
Pawel Rosikiewicz
SE Smith
T Jones
T Magoč
Tania Wyss
TL Parchman
V Lange
V Ter-Hovhannisyan
WR Pearson
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Arbuscular mycorrhizal (AM) fungi are symbionts of most plants, increasing plant growth and diversity. The model AM fungus Rhizophagus irregularis (isolate DAOM 197198) exhibits low within-fungus polymorphism. In contrast, another study reported high within-fungus variability. Experiments with other R. irregularis isolates suggest that within-fungus genetic variation can affect the fungal phenotype and plant growth, highlighting the biological importance of such variation. We investigated whether there is evidence of differing levels of within-fungus polymorphism in an R. irregularis population. We genotyped 20 isolates using restriction site-associated DNA sequencing and developed novel approaches for characterizing polymorphism among haploid nuclei. All isolates exhibited higher within-isolate poly-allelic single-nucleotide polymorphism (SNP) densities than DAOM 197198 in repeated and non-repeated sites mapped to the reference genome. Poly-allelic SNPs were independently confirmed. Allele frequencies within isolates deviated from diploids or tetraploids, or that expected for a strict dikaryote. Phylogeny based on poly-allelic sites was robust and mirrored the standard phylogeny. This indicates that within-fungus genetic variation is maintained in AM fungal populations. Our results predict a heterokaryotic state in the population, considerable differences in copy number variation among isolates and divergence among the copies, or aneuploidy in some isolates. The variation may be a combination of all of these hypotheses. Within-isolate genetic variation in R. irregularis leads to large differences in plant growth. Therefore, characterizing genomic variation within AM fungal populations is of major ecological importance

Crossref

UNIL IRIS | Institutional Research Information System

PubMed Central

The efficacy of high-throughput sequencing and target enrichment on charred archaeobotanical remains

Author: A Cooper
A Ginolhac
A Mitra
A Schlumbaum
A Schlumbaum
AE Minoche
AM Mikić
AP Møller
C Gamba
C Trapnell
D Huson
D Nadel
DJ Chalfoun
E Fernández
EK Nitsch
F Gugerli
H Bilgic
H Jónsson
H Li
H Smith
HA Burbano
HN Poinar
HR Oliveira
J Dabney
J Threadgold
J-F Manen
K O´Donoghue
KI Bos
L Kistler
L Kistler
L Orlando
LS Epp
M Kircher
M Knapp
M Schubert
M Schubert
M Schubert
M-T Gansauge
MC Ávila-Arcos
ML Carpenter
N Wales
N Wales
O Smith
P Goloubinoff
P Smýkal
R Schmieder
RG Allaby
RHE Blatter
S Boardman
S Boessenkool
S Pääbo
SA Palmer
SA Palmer
SF Altschul
SJ Salter
SL Bunning
TA Brown
TA Brown
TA Brown
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

The majority of archaeological plant material is preserved in a charred state. Obtaining reliable ancient DNA data from these remains has presented challenges due to high rates of nucleotide damage, short DNA fragment lengths, low endogenous DNA content and the potential for modern contamination. It has been suggested that high-throughput sequencing (HTS) technologies coupled with DNA enrichment techniques may overcome some of these limitations. Here we report the findings of HTS and target enrichment on four important archaeological crops (barley, grape, maize and rice) performed in three different laboratories, presenting the largest HTS assessment of charred archaeobotanical specimens to date. Rigorous analysis of our data-excluding false-positives due to background contamination or incorrect index assignments-indicated a lack of endogenous DNA in nearly all samples, except for one lightly-charred maize cob. Even with target enrichment, this sample failed to yield adequate data required to address fundamental questions in archaeology and biology. We further reanalysed part of an existing dataset on charred plant material, and found all purported endogenous DNA sequences were likely to be spurious. We suggest these technologies are not suitable for use with charred archaeobotanicals and urge great caution when interpreting data obtained by HTS of these remains

Crossref

Copenhagen University Research Information System

PubMed Central

NORA - Norwegian Open Research Archives

White Rose Research Online

Genome sequencing as a first-line genetic test in familial dilated cardiomyopathy

Author: Bagnall RD
Cowley MJ
Dinger ME
Drew AP
Fatkin D
Gayevskiy V
Horvat C
Ingles J
Johnson R
Lundie B
Minoche AE
Morton SU
Seidman CE
Seidman JG
Semsarian C
Statham AL
Woo K
Publication venue: Elsevier
Publication date: 01/03/2019
Field of study

Purpose: We evaluated genome sequencing (GS) as an alternative to multigene panel sequencing (PS) for genetic testing in dilated cardiomyopathy (DCM). Methods: Forty-two patients with familial DCM underwent PS and GS, and detection rates of rare single-nucleotide variants and small insertions/deletions in panel genes were compared. Loss-of-function variants in 406 cardiac-enriched genes were evaluated, and an assessment of structural variation was performed. Results: GS provided broader and more uniform coverage than PS, with high concordance for rare variant detection in panel genes. GS identified all PS-identified pathogenic or likely pathogenic variants as well as two additional likely pathogenic variants: one was missed by PS due to low coverage, the other was a known disease-causing variant in a gene not included on the panel. No loss-of-function variants in the extended gene set met clinical criteria for pathogenicity. One BAG3 structural variant was classified as pathogenic. Conclusion: Our data support the use of GS for genetic testing in DCM, with high variant detection accuracy and a capacity to identify structural variants. GS provides an opportunity to go beyond suites of established disease genes, but the incremental yield of clinically actionable variants is limited by a paucity of genetic and functional evidence for DCM association

UNSWorks