Search CORE

173 research outputs found

TechMiner: Extracting Technologies from Academic Publications

Author: A Bandrowski
C Bizer
C Fellbaum
F Osborne
F Osborne
F Ronzano
K Scanning Douw
P Corbett
R Usbeck
S Peroni
T Groza
W Huang
Publication venue
Publication date: 01/01/2016
Field of study

In recent years we have seen the emergence of a variety of scholarly datasets. Typically these capture ‘standard’ scholarly entities and their connections, such as authors, affiliations, venues, publications, citations, and others. However, as the repositories grow and the technology improves, researchers are adding new entities to these repositories to develop a richer model of the scholarly domain. In this paper, we introduce TechMiner, a new approach, which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, which can exploit the technology dimension to support better retrieval of publications; richer expert search; monitoring the emergence and impact of new technologies, both within and across scientific fields; studying the scholarly dynamics associated with the emergence of new technologies; and others. TechMiner was evaluated on a manually annotated gold standard and the results indicate that it significantly outperforms alternative NLP approaches and that its semantic features improve performance significantly with respect to both recall and precision

Crossref

Online Research @ Cardiff

Open Research Online

A hybrid human and machine resource curation pipeline for the Neuroscience Information Framework

Author: A. E. Bandrowski
Akil
Bug
Gardner
Gupta
H. M. Muller
J. Cachat
J. S. Grethe
L. Marenco
M. E. Martone
Marenco
Muller
P. Ciccarese
P. W. Sternberg
R. Wang
T. Clark
Tenenbaum
V. Astakhov
Y. Li
Publication venue: Oxford University Press
Publication date: 20/03/2012
Field of study

The breadth of information resources available to researchers on the Internet continues to expand, particularly in light of recently implemented data-sharing policies required by funding agencies. However, the nature of dense, multifaceted neuroscience data and the design of contemporary search engine systems makes efficient, reliable and relevant discovery of such information a significant challenge. This challenge is specifically pertinent for online databases, whose dynamic content is ‘hidden’ from search engines. The Neuroscience Information Framework (NIF; http://www.neuinfo.org) was funded by the NIH Blueprint for Neuroscience Research to address the problem of finding and utilizing neuroscience-relevant resources such as software tools, data sets, experimental animals and antibodies across the Internet. From the outset, NIF sought to provide an accounting of available resources, whereas developing technical solutions to finding, accessing and utilizing them. The curators therefore, are tasked with identifying and registering resources, examining data, writing configuration files to index and display data and keeping the contents current. In the initial phases of the project, all aspects of the registration and curation processes were manual. However, as the number of resources grew, manual curation became impractical. This report describes our experiences and successes with developing automated resource discovery and semiautomated type characterization with text-mining scripts that facilitate curation team efforts to discover, integrate and display new content. We also describe the DISCO framework, a suite of automated web services that significantly reduce manual curation efforts to periodically check for resource updates. Lastly, we discuss DOMEO, a semi-automated annotation tool that improves the discovery and curation of resources that are not necessarily website-based (i.e. reagents, software tools). Although the ultimate goal of automation was to reduce the workload of the curators, it has resulted in valuable analytic by-products that address accessibility, use and citation of resources that can now be shared with resource owners and the larger scientific community

Crossref

Harvard University - DASH

PubMed Central

Caltech Authors

Recommended from our members

Data reporting quality and semantic interoperability increase with community-based data elements (CoDEs). Analysis of the open data commons for spinal cord injury (ODC-SCI)

Author: Axtman P. J.
Bandrowski Anita
Bixby John L.
Davis Lex Maliga
Ferguson Adam R.
Fond Kenneth A.
Fouad Karim
Gensel John C.
Grethe Jeffrey S.
Huie J. R.
Lemmon Vance P.
Martone Maryann E.
Sheoran Anushka
Torres-Espin Abel
Vavrek Romana
Visser Ubbo
Publication venue: UKnowledge
Publication date: 01/03/2025
Field of study

Data interoperability is crucial for effectively combining data for scientific inquiry. To facilitate interoperability, data standards such as a common definition of variables are often developed. The Open Data Commons for Spinal Cord Injury (odc-sci.org) has established an initial set of community-based data elements (CoDEs)—a minimal set of variables for sharing—to promote data interoperability in SCI research, aligning with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles. We sought to understand the use of CoDEs by the SCI community to inform current standards adherence and future standards development. We systematically analyzed 39 public datasets in relation to 17 required CoDEs and found variations between reported data and the structure specified by the CoDEs. Overall, we found that the enforcement of data standards improved reporting rates of CoDEs variables. Notably, different variables were found to require different levels of curation to ensure semantic equivalence among datasets. We also uncovered specific reporting habits of researchers such as formatting and naming patterns. A need for different data standards based on the nature of the study (e.g., human study, derivative study) was realized alongside a detailed list of issues that should be addressed when implementing such standards. Among the various approaches to developing data standards, ODC-SCI adopted a semi-formal approach by creating standards that are easy to adopt by the user. Our data-driven evaluation of actual reporting behavior shows that this flexibility can lead to subsequent problems in harmonization. This study serves as a baseline analysis of reporting behaviors for shaping and facilitating data standards

University of Miami: Scholarship@Miami

University of Kentucky

The Resource Identification Initiative: A cultural shift in publishing

Author: Bandrowski Anita
Brush Matthew
Grethe Jeffery S
Haendel Melissa A
Hill Sean
Hof Patrick R
Kennedy David N
Martone Maryann E
Pols Maaike
Resource Identification Initiative Members are listed here: https://www.force11.org/node/4463/members
Tan Serena
Vasilevsky Nicole
Washington Nicole
Zudilova-Seinstra Elena
Publication venue: eScholarship@UMassChan
Publication date: 01/01/2015
Field of study

A central tenet in support of research reproducibility is the ability to uniquely identify research resources, i.e., reagents, tools, and materials that are used to perform experiments. However, current reporting practices for research resources are insufficient to allow humans and algorithms to identify the exact resources that are reported or answer basic questions such as What other studies used resource X? To address this issue, the Resource Identification Initiative was launched as a pilot project to improve the reporting standards for research resources in the methods sections of papers and thereby improve identifiability and reproducibility. The pilot engaged over 25 biomedical journal editors from most major publishers, as well as scientists and funding officials. Authors were asked to include Research Resource Identifiers (RRIDs) in their manuscripts prior to publication for three resource types: antibodies, model organisms, and tools (including software and databases). RRIDs represent accession numbers assigned by an authoritative database, e.g., the model organism databases, for each type of resource. To make it easier for authors to obtain RRIDs, resources were aggregated from the appropriate databases and their RRIDs made available in a central web portal ( www.scicrunch.org/resources). RRIDs meet three key criteria: they are machine readable, free to generate and access, and are consistent across publishers and journals. The pilot was launched in February of 2014 and over 300 papers have appeared that report RRIDs. The number of journals participating has expanded from the original 25 to more than 40. Here, we present an overview of the pilot project and its outcomes to date. We show that authors are generally accurate in performing the task of identifying resources and supportive of the goals of the project. We also show that identifiability of the resources pre- and post-pilot showed a dramatic improvement for all three resource types, suggesting that the project has had a significant impact on reproducibility relating to research resources

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

eScholarship@UMassChan

TechMiner: Extracting Technologies from Academic Publications

Author: A Bandrowski
C Bizer
C Fellbaum
F Osborne
F Osborne
F Ronzano
K Scanning Douw
P Corbett
R Usbeck
S Peroni
T Groza
W Huang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Integration of evidence across human and model organism studies: A meeting report.

Author: Agrawal Arpana
Akbarian Schahram
Baker Erich
Bandrowski Anita
Barr Peter B
Benca-Bachman Chelsie E
Bierut Laura J
Bogue Molly A
Bray Michael
Chen Hao
Chesler Elissa J
Chitre Apurva
Edenberg Howard J
Emerson Jake
Ernst Jason
Fischer Christian
Gaddis Nathan C
Gelernter Joel
Hancock Dana B
Hawrylycz Michael
Jacobson Daniel
Johnson Emma C
Johnson Eric O
Kapoor Manav
Kwon Soo Bin
Lai Dongbing
Lu Qing
Mahaffey Spencer
Martone Maryann
Miles Michael
Nestler Eric J
Palmer Abraham A
Palmer Rohan H C
Parker Clarissa C
Philip Vivek M
Polimanti Renato
Prins Pjotr
Quach Bryan C
Reynolds Timothy
Saba Laura
Sanchez-Roige Sandra
Shen Li
Smith Desmond J
Thorgeirsson Thorgeir
Verma Anurag
Walton David O
Williams Robert W
Won Hyejung
Zhang Shan
Zhou Yuan
Publication venue: The Mouseion at the JAXlibrary
Publication date: 01/04/2021
Field of study

The National Institute on Drug Abuse and Joint Institute for Biological Sciences at the Oak Ridge National Laboratory hosted a meeting attended by a diverse group of scientists with expertise in substance use disorders (SUDs), computational biology, and FAIR (Findability, Accessibility, Interoperability, and Reusability) data sharing. The meeting\u27s objective was to discuss and evaluate better strategies to integrate genetic, epigenetic, and \u27omics data across human and model organisms to achieve deeper mechanistic insight into SUDs. Specific topics were to (a) evaluate the current state of substance use genetics and genomics research and fundamental gaps, (b) identify opportunities and challenges of integration and sharing across species and data types, (c) identify current tools and resources for integration of genetic, epigenetic, and phenotypic data, (d) discuss steps and impediment related to data integration, and (e) outline future steps to support more effective collaboration-particularly between animal model research communities and human genetics and clinical research teams. This review summarizes key facets of this catalytic discussion with a focus on new opportunities and gaps in resources and knowledge on SUDs

The Jackson Laboratory: The Mouseion at the JAXlibrary

IUPUIScholarWorks

eScholarship - University of California

Representation of behaviour change interventions and their evaluation: Development of the Upper Level of the Behaviour Change Intervention Ontology

Author: A Bandrowski
A Wright
B Smith
B Smith
B Smith
E Norris
E Norris
E Norris
F Bonin
J Elliott
J Hastings
K Schulz
M Ashburner
M Courtot
N Matentzoglu
N Noy
P Grenon
R Arp
R West
S Michie
S Michie
S Seppälä
T Hoffmann
W Ceusters
Publication venue: 'F1000 Research Ltd'
Publication date: 10/06/2020
Field of study

Background: Behaviour change interventions (BCI), their contexts and evaluation methods are heterogeneous, making it difficult to synthesise evidence and make recommendations for real-world policy and practice. Ontologies provide a means for addressing this. They represent knowledge formally as entities and relationships using a common language able to cross disciplinary boundaries and topic domains. This paper reports the development of the upper level of the Behaviour Change Intervention Ontology (BCIO), which provides a systematic way to characterise BCIs, their contexts and their evaluations. Methods: Development took place in four steps. (1) Entities and relationships were identified by behavioural and social science experts, based on their knowledge of evidence and theory, and their practical experience of behaviour change interventions and evaluations. (2) The outputs of the first step were critically examined by a wider group of experts, including the study ontology expert and those experienced in annotating relevant literature using the initial ontology entities. The outputs of the second step were tested by (3) feedback from three external international experts in ontologies and (4) application of the prototype upper-level BCIO to annotating published reports; this informed the final development of the upper-level BCIO. Results: The final upper-level BCIO specifies 42 entities, including the BCI scenario, elaborated across 21 entities and 7 relationship types, and the BCI evaluation study comprising 10 entities and 9 relationship types. BCI scenario entities include the behaviour change intervention (content and delivery), outcome behaviour, mechanism of action, and its context, which includes population and setting. These entities have corresponding entities relating to the planning and reporting of interventions and their evaluations. Conclusions: The upper level of the BCIO provides a comprehensive and systematic framework for representing BCIs, their contexts and their evaluations.Wellcome; Marie-Sklodowska-Curie fellowshi

Aberdeen University Research

Crossref

Brunel University Research Archive

A Guide to the Brain Initiative Cell Census Network Data Ecosystem

Author: Aevermann Brian
Allemang David
Ament Seth
Ascoli Giorgio A
Athey Thomas L
Baker Cody
Baker Katherine S
Baker Pamela M
Bandrowski Anita
Banerjee Samik
Bishwakarma Prajal
Bjaalie Jan G
Carr Ambrose
Chen Min
Choudhury Roni
Cool Jonah
Creasy Heather
D\u27Orazi Florence
Degatano Kylee
Dichter Benjamin
Ding Song-Lin
Dolbeare Tim
Dong Hong-Wei
Ecker Joseph R
Fang Rongxin
Fillion-Robin Jean-Christophe
Fliss Timothy P
Gee James
Ghosh Satrajit S
Gillespie Tom
Gillis Jesse
Gouwens Nathan
Halchenko Yaroslav O
Harris Nomi L
Hawrylycz Michael
Haynor David R
Herb Brian R
Hertzano Ronna
Hintiryan Houri
Hof Patrick R
Hood Gregory
Horvath Sam
Huo Bingxing
Jarecka Dorota
Jiang Shengdian
Khajouei Farzaneh
Kiernan Elizabeth A
Kim Yongsoo
Kir Huseyin
Kruse Lauren
Lee Changkyu
Lein Ed
Lelieveldt Boudewijn
Li Yang
Liu Hanqing
Liu Lijuan
Liu Yufeng
Markuhar Anup
Martone Maryann E
Mathews James
Mathews Kaylee L
Mezias Chris
Miller Jeremy A
Miller Michael I
Mitra Partha P
Mollenkopf Tyler
Mufti Shoaib
Mukamel Eran
Mungall Christopher J
Ng Lydia
Orvis Joshua
Osumi-Sutherland David
Peng Hanchuan
Puchades Maja A
Qu Lei
Ray Patrick L
Receveur Joseph P
Regev Aviv
Ren Bing
Ropelewski Alex
Sanchez Raymond
Scheuermann Richard H
Sjoquist Nathan
Staats Brian
Tan Shawn Zheng Kai
Thompson Carol L
Tickle Timothy
Tilgner Hagen
Tward Daniel
van Velthoven Cindy T J
Varghese Merina
Wang Quanxin
Wester Brock
White Owen
Xie Fangming
Xu Hua
Yao Zizhen
Yun Zhixi
Zeng Hongkui
Zhang Guo-Qiang
Zhang Yun Renee
Zheng W Jim
Zingg Brian
Publication venue: DigitalCommons@TMC
Publication date: 01/06/2023
Field of study

Characterizing cellular diversity at different levels of biological organization and across data modalities is a prerequisite to understanding the function of cell types in the brain. Classification of neurons is also essential to manipulate cell types in controlled ways and to understand their variation and vulnerability in brain disorders. The BRAIN Initiative Cell Census Network (BICCN) is an integrated network of data-generating centers, data archives, and data standards developers, with the goal of systematic multimodal brain cell type profiling and characterization. Emphasis of the BICCN is on the whole mouse brain with demonstration of prototype feasibility for human and nonhuman primate (NHP) brains. Here, we provide a guide to the cellular and spatial approaches employed by the BICCN, and to accessing and using these data and extensive resources, including the BRAIN Cell Data Center (BCDC), which serves to manage and integrate data across the ecosystem. We illustrate the power of the BICCN data ecosystem through vignettes highlighting several BICCN analysis and visualization tools. Finally, we present emerging standards that have been developed or adopted toward Findable, Accessible, Interoperable, and Reusable (FAIR) neuroscience. The combined BICCN ecosystem provides a comprehensive resource for the exploration and analysis of cell types in the brain

DigitalCommons@The Texas Medical Center

Integration of evidence across human and model organism studies: A meeting report

The National Institute on Drug Abuse and Joint Institute for Biological Sciences at the Oak Ridge National Laboratory hosted a meeting attended by a diverse group of scientists with expertise in substance use disorders (SUDs), computational biology, and FAIR (Findability, Accessibility, Interoperability, and Reusability) data sharing. The meeting's objective was to discuss and evaluate better strategies to integrate genetic, epigenetic, and 'omics data across human and model organisms to achieve deeper mechanistic insight into SUDs. Specific topics were to (a) evaluate the current state of substance use genetics and genomics research and fundamental gaps, (b) identify opportunities and challenges of integration and sharing across species and data types, (c) identify current tools and resources for integration of genetic, epigenetic, and phenotypic data, (d) discuss steps and impediment related to data integration, and (e) outline future steps to support more effective collaboration-particularly between animal model research communities and human genetics and clinical research teams. This review summarizes key facets of this catalytic discussion with a focus on new opportunities and gaps in resources and knowledge on SUDs

IUPUIScholarWorks