400 research outputs found

    Annotating patient clinical records with syntactic chunks and named entities: the Harvey corpus

    Get PDF
    The free text notes typed by physicians during patient consultations contain valuable information for the study of disease and treatment. These notes are difficult to process by existing natural language analysis tools since they are highly telegraphic (omitting many words), and contain many spelling mistakes, inconsistencies in punctuation, and non-standard word order. To support information extraction and classification tasks over such text, we describe a de-identified corpus of free text notes, a shallow syntactic and named entity annotation scheme for this kind of text, and an approach to training domain specialists with no linguistic background to annotate the text. Finally, we present a statistical chunking system for such clinical text with a stable learning rate and good accuracy, indicating that the manual annotation is consistent and that the annotation scheme is tractable for machine learning

    Gender equality and girls education: Investigating frameworks, disjunctures and meanings of quality education

    Get PDF
    The article draws on qualitative educational research across a diversity of low-income countries to examine the gendered inequalities in education as complex, multi-faceted and situated rather than a series of barriers to be overcome through linear input–output processes focused on isolated dimensions of quality. It argues that frameworks for thinking about educational quality often result in analyses of gender inequalities that are fragmented and incomplete. However, by considering education quality more broadly as a terrain of quality it investigates questions of educational transitions, teacher supply and community participation, and develops understandings of how education is experienced by learners and teachers in their gendered lives and their teaching practices. By taking an approach based on theories of human development the article identifies dynamics of power underpinning gender inequalities in the literature and played out in diverse contexts and influenced by social, cultural and historical contexts. The review and discussion indicate that attaining gender equitable quality education requires recognition and understanding of the ways in which inequalities intersect and interrelate in order to seek out multi-faceted strategies that address not only different dimensions of girls’ and women’s lives, but understand gendered relationships and structurally entrenched inequalities between women and men, girls and boys

    Assessing L2 vocabulary depth with word associates format tests: issues, findings, and suggestions

    Get PDF
    Word Associates Format (WAF) tests are often used to measure second language learners’ vocabulary depth with a focus on their network knowledge. Yet, there were often many variations in the specific forms of the tests and the ways they were used, which tended to have an impact on learners’ response behaviors and, more importantly, the psychometric properties of the tests. This paper reviews the general practices, key issues, and research findings that pertain to WAF tests in four major areas, including the design features of WAF tests, conditions for test administration, scoring methods, and test-taker characteristics. In each area, a set of variables is identified and described with relevant research findings also presented and discussed. Around eight topics, the General Discussion section provides some suggestions and directions for the development of WAF tests and the use of them as research tools in the future. This paper is hoped to help researchers become better aware that the results generated by a WAF test may vary depending on what specific design the test has, how it is administered and scored, and who the learners are, and consequently, make better decisions in their research that involves a WAF test

    Temporal and spatial instability in neutral and adaptive (MHC) genetic variation in marginal salmon populations

    Get PDF
    The role of marginal populations for the long-term maintenance of species’ genetic diversity and evolutionary potential is particularly timely in view of the range shifts caused by climate change. The Centre-Periphery hypothesis predicts that marginal populations should bear reduced genetic diversity and have low evolutionary potential. We analysed temporal stability at neutral microsatellite and adaptive MHC genetic variation over five decades in four marginal Atlantic salmon populations located at the southern limit of the species’ distribution with a complicated demographic history, which includes stocking with foreign and native salmon for at least 2 decades. We found a temporal increase in neutral genetic variation, as well as temporal instability in population structuring, highlighting the importance of temporal analyses in studies that examine the genetic diversity of peripheral populations at the margins of the species’ range, particularly in face of climate change

    A UIMA wrapper for the NCBO annotator

    Get PDF
    Summary: The Unstructured Information Management Architecture (UIMA) framework and web services are emerging as useful tools for integrating biomedical text mining tools. This note describes our work, which wraps the National Center for Biomedical Ontology (NCBO) Annotator—an ontology-based annotation service—to make it available as a component in UIMA workflows
    corecore