328 research outputs found
The third international hackathon for applying insights into large-scale genomic composition to use cases in a wide range of organisms
In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNANexus Structural Variation hackathon. The goal of the hackathon was to advance research on structural variants (SVs) by prototyping and iterating on open-source software. This led to nine hackathon projects focused on diverse genomics research interests, including various SV discovery and genotyping methods, SV sequence reconstruction, and clinically relevant structural variation, including SARS-CoV-2 variants. Repositories for the projects that participated in the hackathon are available at https://github.com/collaborativebioinformatics
Waterproof casts for the management of upper limb fractures in children: a systematic review and meta-analysis
AimsUpper limb fractures in children are often managed in casts. Waterproof casts which allow safe immersion in water may be used. These may improve comfort, convenience, and satisfaction when compared with standard casts. The aim of this review was to compare the efficacy, safety, and satisfaction of waterproof casts with standard casts in the management of upper limb fractures in children.MethodsA systematic review of randomized controlled trials (RCTs) was conducted in September 2024. Comprehensive searches of Medline, PubMed, Cochrane CENTRAL, and EMBASE databases were performed. Studies were included which recruited children aged 0 to 18 years with upper limb fractures, which were managed in waterproof or standard casts. Patient-reported outcomes, functional outcomes, and complication rates were assessed.ResultsA total of five studies involving 390 children were included. Those managed with a waterproof cast reported significantly superior satisfaction with regard to comfort (mean difference (MD) 1.92 (95% CI 0.15 to 3.69); p = 0.034), itchiness (MD 0.21 (95% CI 0.00 to 0.43); p = 0.047), and overall child and parent satisfaction (MD 0.53 (95% CI 0.01 to 1.05); p = 0.048). Those managed with a waterproof cast also had significantly improved functional outcomes as measured by the Activities Scale for Kids-Performance (ASK-P) score, with a MD of 16.90 (95% CI 6.87 to 26.93; p = 0.001). There were no significant differences regarding heat or sweatiness, pain, return to recreational activities, unexpected returns for cast maintenance, radiological deformity, or skin problems.ConclusionWaterproof casts seem to provide an alternative to standard casts in the management of upper limb fractures in children. There were improved functional outcomes at the time of removal of the cast, improved comfort, and less itching. Pooling of the studies was limited due to the heterogeneity of the reporting of outcomes and the small sizes of the studies. Neither long-term outcomes, nor economic analysis based on healthcare-related quality of life, are available. A definitive RCT based on a core outcome set is required to confirm the efficacy and investigate the cost-effectiveness of waterproof casts in children
dbVar structural variant cluster set for data analysis and variant comparison
dbVar houses over 3 million submitted structural variants (SSV) from 120 human studies including copy number variations (CNV), insertions, deletions, inversions, translocations, and complex chromosomal rearrangements. Users can submit multiple SSVs to dbVAR that are presumably identical, but were ascertained by different platforms and samples, to calculate whether the variant is rare or common in the population and allow for cross validation. However, because SSV genomic location reporting can vary – including fuzzy locations where the start and/or end points are not precisely known – analysis, comparison, annotation, and reporting of SSVs across studies can be difficult. This project was initiated by the Structural Variant Comparison Group for the purpose of generating a non-redundant set of genomic regions defined by counts of concordance for all human SSVs placed on RefSeq assembly GRCh38 (RefSeq accession GCF_000001405.26). We intend that the availability of these regions, called structural variant clusters (SVCs), will facilitate the analysis, annotation, and exchange of SV data and allow for simplified display in genomic sequence viewers for improved variant interpretation. Sets of SVCs were generated by variant type for each of the 120 studies as well as for a combined set across all studies. Starting from 3.64 million SSVs, 2.5 million and 3.4 million non-redundant SVCs with count \u3e=1 were generated by variant type for each study and across all studies, respectively. In addition, we have developed utilities for annotating, searching, and filtering SVC data in GVF format for computing summary statistics, exporting data for genomic viewers, and annotating the SVC using external data sources
PubRunner: a light-weight framework for updating text mining results
Biomedical text mining promises to assist biologists in quickly navigating the combined knowledge in their domain. This would allow improved understanding of the complex interactions within biological systems and faster hypothesis generation. New biomedical research articles are published daily and text mining tools are only as good as the corpus from which they work. Many text mining tools are underused because their results are static and do not reflect the constantly expanding knowledge in the field. In order for biomedical text mining to become an indispensable tool used by researchers, this problem must be addressed. To this end, we present PubRunner, a framework for regularly running text mining tools on the latest publications. PubRunner is lightweight, simple to use, and can be integrated with an existing text mining tool. The workflow involves downloading the latest abstracts from PubMed, executing a user-defined tool, pushing the resulting data to a public FTP or Zenodo dataset, and publicizing the location of these results on the public PubRunner website. We illustrate the use of this tool by re-running the commonly used word2vec tool on the latest PubMed abstracts to generate up-to-date word vector representations for the biomedical domain. This shows a proof of concept that we hope will encourage text mining developers to build tools that truly will aid biologists in exploring the latest publications
Extending TCGA queries to automatically identify analogous genomic data from dbGaP [version 1; referees: 2 approved, 1 approved with reservations]
Data sharing is critical to advance genomic research by reducing the demand to collect new data by reusing and combining existing data and by promoting reproducible research. The Cancer Genome Atlas (TCGA) is a popular resource for individual-level genotype-phenotype cancer related data. The Database of Genotypes and Phenotypes (dbGaP) contains many datasets similar to those in TCGA. We have created a software pipeline that will allow researchers to discover relevant genomic data from dbGaP, based on matching TCGA metadata. The resulting research provides an easy to use tool to connect these two data sources
NovoGraph: Human genome graph construction from multiple long-read de novo assemblies [version 2; referees: 2 approved]
Genome graphs are emerging as an important novel approach to the analysis of high-throughput human sequencing data. By explicitly representing genetic variants and alternative haplotypes in a mappable data structure, they can enable the improved analysis of structurally variable and hyperpolymorphic regions of the genome. In most existing approaches, graphs are constructed from variant call sets derived from short-read sequencing. As long-read sequencing becomes more cost-effective and enables de novo assembly for increasing numbers of whole genomes, a method for the direct construction of a genome graph from sets of assembled human genomes would be desirable. Such assembly-based genome graphs would encompass the wide spectrum of genetic variation accessible to long-read-based de novo assembly, including large structural variants and divergent haplotypes. Here we present NovoGraph, a method for the construction of a human genome graph directly from a set of de novo assemblies. NovoGraph constructs a genome-wide multiple sequence alignment of all input contigs and creates a graph by merging the input sequences at positions that are both homologous and sequence-identical. NovoGraph outputs resulting graphs in VCF format that can be loaded into third-party genome graph toolkits. To demonstrate NovoGraph, we construct a genome graph with 23,478,835 variant sites and 30,582,795 variant alleles from de novo assemblies of seven ethnically diverse human genomes (AK1, CHM1, CHM13, HG003, HG004, HX1, NA19240). Initial evaluations show that mapping against the constructed graph reduces the average mismatch rate of reads from sample NA12878 by approximately 0.2%, albeit at a slightly increased rate of reads that remain unmapped
Plasma proteomic signature predicts myeloid neoplasm risk
PURPOSE: Clonal hematopoiesis (CH) is thought to be the origin of myeloid neoplasms (MN). Yet, our understanding of the mechanisms driving CH progression to MN and clinical risk prediction of MN remains limited. The human proteome reflects complex interactions between genetic and epigenetic regulation of biological systems. We hypothesized that the plasma proteome might predict MN risk and inform our understanding of the mechanisms promoting MN development.
EXPERIMENTAL DESIGN: We jointly characterized CH and plasma proteomic profiles of 46,237 individuals in the UK Biobank at baseline study entry. During 500,036 person-years of follow-up, 115 individuals developed MN. Cox proportional hazard regression was used to test for an association between plasma protein levels and MN risk.
RESULTS: We identified 115 proteins associated with MN risk, of which 30% (N = 34) were also associated with CH. These were enriched for known regulators of the innate and adaptive immune system. Plasma proteomics improved the prediction of MN risk (AUC = 0.85; P = 5×10-9) beyond clinical factors and CH (AUC = 0.80). In an independent group (N = 381,485), we used inherited polygenic risk scores (PRS) for plasma protein levels to validate the relevance of these proteins toMNdevelopment. PRS analyses suggest that most MN-associated proteins we identified are not directly causally linked toMN risk, but rather represent downstream markers of pathways regulating the progression of CH to MN.
CONCLUSIONS: These data highlight the role of immune cell regulation in the progression of CH to MN and the promise of leveraging multi-omic characterization of CH to improveMN risk stratification. See related commentary by Bhalgat and Taylor, p. 3095
Recommended from our members
3D Simulations of Secondary Electron Generation and Transport in a Diamond Electron Beam Amplifier
The Relativistic Heavy Ion Collider (RHIC) contributes fundamental advances to nuclear physics by colliding a wide range of ions. A novel electron cooling section, which is a key component of the proposed luminosity upgrade for RHIC, requires the acceleration of high-charge electron bunches with low emittance and energy spread. A promising candidate for the electron source is the recently developed concept of a high quantum efficiency photoinjector with a diamond amplifier. To assist in the development of such an electron source, we have implemented algorithms within the VORPAL particle-in-cell framework for modeling secondary electron and hole generation, and for charge transport in diamond. The algorithms include elastic, phonon, and impurity scattering processes over a wide range of charge carrier energies. Results from simulations using the implemented capabilities will be presented and discussed
The role of recent admixture in forming the contemporary West Eurasian genomic landscape
Over the past few years, studies of DNA isolated from human fossils and archaeological remains have generated considerable novel insight into the history of our species. Several landmark papers have described the genomes of ancient humans across West Eurasia, demonstrating the presence of large-scale, dynamic population movements over the last 10,000 years, such that ancestry across present-day populations is likely to be a mixture of several ancient groups [1-7]. While these efforts are bringing the details of West Eurasian prehistory into increasing focus, studies aimed at understanding the processes behind the generation of the current West Eurasian genetic landscape have been limited by the number of populations sampled or have been either too regional or global in their outlook [8-11]. Here, using recently described haplotype-based techniques [11], we present the results of a systematic survey of recent admixture history across Western Eurasia and show that admixture is a universal property across almost all groups. Admixture in all regions except North Western Europe involved the influx of genetic material from outside of West Eurasia, which we date to specific time periods. Within Northern, Western, and Central Europe, admixture tended to occur between local groups during the period 300 to 1200 CE. Comparisons of the genetic profiles of West Eurasians before and after admixture show that population movements within the last 1,500 years are likely to have maintained differentiation among groups. Our analysis provides a timeline of the gene flow events that have generated the contemporary genetic landscape of West Eurasia
Incidence and prevalence of asthma, chronic obstructive pulmonary disease and interstitial lung disease between 2004 and 2023:harmonised analyses of longitudinal cohorts across England, Wales, South-East Scotland and Northern Ireland
Background We describe the epidemiology of asthma, chronic obstructive pulmonary disease (COPD) and interstitial lung disease (ILD) from 2004 to 2023 in England, Wales, Scotland and Northern Ireland (NI) using a harmonised approach. Methods Data from the National Health Service England (NHSE), Clinical Practice Research Datalink Aurum in England, Secure Anonymised Information Linkage Databank in Wales, DataLoch in South-East Scotland and the Honest Broker Service in NI were used. A harmonised approach to COPD, asthma and ILD case definitions, study designs and study populations across the four nations was performed. Age-sex-standardised incidence rates and point prevalence were calculated between 2004 and 2023 depending on data availability. Logistic and negative binomial regression compared incidence and prevalence rates between the start and end of each study period. Linear extrapolation projected incidence rates between 2020 and 2023 to illustrate how observed and projected rates differed. Results Incidence rates were lower in 2019 versus 2005 for asthma (England: incidence rate ratio 0.89, 95% CI 0.88 to 0.90; Wales: 0.66, 0.65 to 0.68; Scotland: 0.67, 0.64 to 0.71; NI: 0.84, 0.81 to 0.86), COPD (England: 0.83, 0.82 to 0.85; Wales: 0.67, 0.65 to 0.69) and higher for ILD (England: 3.27, 3.05 to 3.50; Wales: 1.39, 1.27 to 1.53; Scotland: 1.63, 1.36 to 1.95; NI: 3.03, 2.47 to 3.72). In NHSE, the incidence of asthma was similar in June 2023 versus November 2019, but lower for COPD and higher for ILD. Prevalence of asthma in 2019 in England, Wales, Scotland and NI was 9.7%, 15.9%, 13.2% and 7.0%, respectively, for COPD 4.5%, 5.1%, 4.4% and 3.0%, and for ILD 0.4%, 0.5%, 0.6% and 0.3%. Projected incidence rates were 2.8, 3.4 and 1.8 times lower for asthma, COPD and ILD compared with observed rates at the height of the pandemic. Interpretation Asthma, COPD and ILD affect over 10 million people across the four nations, and a substantial number of diagnoses were missed during the pandemic.</p
- …
