Search CORE

23 research outputs found

Accurate reconstruction of microbial strains from metagenomic sequencing using representative reference genomes

Author: A Sczyrba
ABR McIntyre
B Langmead
BD Ondov
C Quast
C Quince
D Kim
DE Wood
DH Huson
DT Truong
EC Pielou
F Maixner
FM Key
GL Kay
H Marakeby
J Dröge
KT Konstantinidis
NA O’Leary
S Nayfach
S Rasmussen
SF Altschul
TH Ahn
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Exploring the genetic diversity of microbes within the environment through metagenomic sequencing first requires classifying these reads into taxonomic groups. Current methods compare these sequencing data with existing biased and limited reference databases. Several recent evaluation studies demonstrate that current methods either lack sufficient sensitivity for species-level assignments or suffer from false positives, overestimating the number of species in the metagenome. Both are especially problematic for the identification of low-abundance microbial species, e. g. detecting pathogens in ancient metagenomic samples. We present a new method, SPARSE, which improves taxonomic assignments of metagenomic reads. SPARSE balances existing biased reference databases by grouping reference genomes into similarity-based hierarchical clusters, implemented as an efficient incremental data structure. SPARSE assigns reads to these clusters using a probabilistic model, which specifically penalizes non-specific mappings of reads from unknown sources and hence reduces false-positive assignments. Our evaluation on simulated datasets from two recent evaluation studies demonstrated the improved precision of SPARSE in comparison to other methods for species-level classification. In a third simulation, our method successfully differentiated multiple co-existing Escherichia coli strains from the same sample. In real archaeological datasets, SPARSE identified ancient pathogens with ≤0.02% abundance, consistent with published findings that required additional sequencing data. In these datasets, other methods either missed targeted pathogens or reported non-existent ones

Crossref

Warwick Research Archives Portal Repository

University of East Anglia digital repository

Successful amplification of DNA aboard the International Space Station

Author: ABR McIntyre
B Yi
G Sonnenfeld
J Straub
LA Mermel
LG Napolitano
ME Potok
S Tauber
T Zuo
TM Powledge
X Ou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Lightweight Metagenomic Classification via eBWT

Author: A Cox
A Restivo
ABR McIntyre
B Langmead
D Kim
DE Wood
F Louza
F Louza
FA Louza
KH Ng
L Egidi
L Janin
L Yang
M Bauer
M Pedersen
MI Abouelhoda
P Bonizzoni
P Menzel
R Ounit
R Ounit
S Mantaci
S Mantaci
S Mantaci
S Mantaci
S Vinga
W-K Hon
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The development of Next Generation Sequencing has had a major impact on the study of genetic sequences, and in particular, on the advancement of metagenomics, whose aim is to identify the microorganisms that are present in a sample collected directly from the environment. In this paper, we describe a new lightweight alignment-free and assembly-free framework for metagenomic classification that compares each unknown sequence in the sample to a collection of known genomes. We take advantage of the combinatorial properties of an extension of the Burrows-Wheeler transform, and we sequentially scan the required data structures, so that we can analyze unknown sequences of large collections using little internal memory. For the best of our knowledge, this is the first approach that is assembly- and alignment-free, and is not based on k-mers. We show that our experiments confirm the effectiveness of our approach and the high accuracy even in negative control samples. Indeed we only classify 1 short read on 5,726,358 random shuffle reads. Finally, the results are comparable with those achieved by read-mapping classifiers and by k-mer based classifiers

Crossref

Archivio della Ricerca - Università di Pisa

Systematic benchmarking of tools for CpG methylation detection from nanopore sequencing

Author: ABR McIntyre
AC Rand
AH Laszlo
C Grunau
E-A Raiber
F Kader
F Pedregosa
GE Crooks
H Li
I Dunham
J Köster
JP O’Shea
JT Robinson
JT Simpson
K Labun
L Breiman
M Ehrich
MVC Greenberg
P Ni
P-Y Chen
PA Jones
Q Liu
Q Liu
W-S Yong
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Challenges in benchmarking metagenomic profilers

Author: A Milanese
A Sczyrba
ABR McIntyre
AD Kostic
B Buchfink
B Liu
C Martino
D Li
DE Wood
DE Wood
DP Faith
DT Truong
F Chen
FP Breitwieser
GB Gloor
Human Microbiome Project Consortium.
J Aitchison
J Aitchison
J Lu
J Soppa
JE Mendell
K Mavromatis
L McInnes
LJP van der Maaten
M Arumugam
N Mantel
N Segata
P Legendre
P Legendre
P Menzel
R Knight
S Dray
S Lindgreen
S Nurk
S Sunagawa
SH Ye
T Hsu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2021
Field of study

Accurate microbial identification and abundance estimation are crucial for metagenomics analysis. Various methods for classifying metagenomic data and estimating taxonomic profiles, broadly referred to as metagenomic profilers, have been developed. Yet, benchmarking metagenomic profilers remains challenging because some tools are designed to report relative sequence abundance while others report relative taxonomic abundance. Here, we show how misleading conclusions can be drawn by neglecting this distinction between relative abundance types when benchmarking metagenomic profilers. Moreover, we show compelling evidence that interchanging sequence abundance and taxonomic abundance will influence both per-sample summary statistics and cross-sample comparisons. We suggest that the microbiome research community should pay attention to potentially misleading biological conclusions arising from this issue when benchmarking metagenomic profilers, by carefully considering the type of abundance data that was analyzed and interpreted, and clearly stating the strategy used for metagenomic profiling

Crossref

PubMed Central

eScholarship - University of California

KrakenUniq: confident and fast metagenomics classification using unique k-mer counts

Author: A Sobih
ABR McIntyre
AE Darling
B Buchfink
B Buchfink
C Quince
C Zhang
D. N. Baker
Daniel H. Huson
DE Wood
DH Huson
DT Truong
F. P. Breitwieser
GL Rosen
JR Brister
JR Brown
M Thoendel
P Flajolet
PJ Simner
R Ounit
R Ounit
S Mukherjee
S. L. Salzberg
SF Altschul
SJ Salter
SK Ames
SL Salzberg
TA Freitas
TH Dadi
Y Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Accurate detection of m6A RNA modifications in native RNA sequences

Author: ABR McIntyre
AG Torres
B Delatte
B Linder
D Arango
D Dai
D Dominissini
D Dominissini
DR Garalde
EM Novoa
ID Vilfan
IU Haussmann
J Widagdo
K-J Yoon
KD Meyer
L Kan
LP Sarin
LP Vu
M Jain
M Loose
M Safra
MW Keller
N Jonkhout
N Liu
N Liu
PJ Batista
S Schwartz
S Schwartz
S Schwartz
SD Agarwala
T Lence
TM Carlile
V Marchand
X Yang
X Zhao
Y Saletore
Y-L Weng
Z Li
Z-X Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

The epitranscriptomics field has undergone an enormous expansion in the last few years; however, a major limitation is the lack of generic methods to map RNA modifications transcriptome-wide. Here, we show that using direct RNA sequencing, N6-methyladenosine (m6A) RNA modifications can be detected with high accuracy, in the form of systematic errors and decreased base-calling qualities. Specifically, we find that our algorithm, trained with m6A-modified and unmodified synthetic sequences, can predict m6A RNA modifications with ~90% accuracy. We then extend our findings to yeast data sets, finding that our method can identify m6A RNA modifications in vivo with an accuracy of 87%. Moreover, we further validate our method by showing that these 'errors' are typically not observed in yeast ime4-knockout strains, which lack m6A modifications. Our results open avenues to investigate the biological roles of RNA modifications in their native RNA context

Crossref

UNSWorks

UPF Digital Repository

Direct RNA sequencing reveals m6A modifications on adenovirus RNA are necessary for efficient splicing

Author: A Dobin
A Louloupi
A Misra
A Pombo
ABR McIntyre
ABR McIntyre
AR Quinlan
B Linder
B Moss
B Tan
BA Flusberg
BC Poling
BS Zhao
CR Hesser
D Hazra
DG Courtney
DP Depledge
DR Garalde
ED Reyes
EM Kennedy
F Ramírez
G Lichinchi
G Lichinchi
G Zheng
GD Williams
H Hao
H Huang
H Imam
H Li
H Li
H Liu
J Liu
J Mauer
J Russo
J-M Fustin
JE Squires
K Chen
K Tsai
KD Meyer
KD Meyer
KI Zhou
L Cong
M Bartosovic
M Lawrence
MA Garcia-Campos
MK Doma
MT Parker
N Liu
NS Gokhale
P Khandelia
R Winkler
RB Darnell
RM Krug
RM Rubio
S Chen-Kiang
S Heinz
S Ke
S Ke
S Lavi
S Lin
S Schwartz
S Sommer
SD Kasowitz
SE Kane
SM Bresson
TD Wu
TM Carlile
W Xiao
X Li
X Li
X Wang
X Wang
Y Yue
Y Zeng
Y Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Reference reagents could be first step to standardizing microbiome studies

Author: A Apprill
A Klindworth
ABR McIntyre
AE Parada
AG Clooney
BJ Callahan
BJ Callahan
C Quast
CH Coxon
D Kim
DA Soergel
DE Wood
DT Truong
E Bolyen
F Fouhy
Human Microbiome Project C
IA Chen
J Jovel
J Lu
J Qin
JC Dohm
JR Bray
JS Johnson
JT Nearing
JW Arnold
LR Thompson
NA Bokulich
P Menzel
PI Costea
R Flores
R Sinha
R Sinha
S Lindgreen
SH Ye
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Detection of DNA base modifications by deep recurrent neural network on Oxford Nanopore sequencing data

Author: A Meissner
ABR McIntyre
AC Rand
AH Laszlo
AT Muller
BA Flusberg
BM Davis
C Lovkvist
CL Xiao
D Dominissini
D Meyer Kate
EL Greer
F Miura
FR Blattner
GZ Luo
J Beaulaurier
J Schreiber
JT Simpson
L Shi
M Ehrlich
M Jain
MF Paz
NR Cohen
Q Liu
R Kanwal
S Hochreiter
SS Merchant
T Thireou
TA Clark
TA Clark
XJ He
Y Fu
Y Saletore
Z Feng
ZK O'Brown
ZL Wescoe
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref