49 research outputs found

    An algorithm to reduce the occupational space in gender segregation studies

    Get PDF
    This paper presents an algorithm based on the bootstrap to select an admissible aggregation level, that is, the minimum number of occupational categories that yield a gender segregation value not significantly smaller than that obtained from the large number of occupational categories usually available in any data set. The approach is illustrated using labour force survey data for Spain for the comparison of gender segregation in 1977 and 1992, as well as 1994 and 2000. To measure gender segregation, an additively decomposable segregation index based on the entropy concept is used. Despite a substantial simplification in the size of the occupation space, the decrease in the segregation index is very small and not significant, regardless of the year. Consequently, intertemporal changes in gender segregation can be studied using a greatly reduced classification of occupations that permits an easier interpretation of results.Publicad

    Location, location, location: utilizing pipelines and services to more effectively georeference the world's biodiversity data

    Get PDF
    Abstract Background Increasing the quantity and quality of data is a key goal of biodiversity informatics, leading to increased fitness for use in scientific research and beyond. This goal is impeded by a legacy of geographic locality descriptions associated with biodiversity records that are often heterogeneous and not in a map-ready format. The biodiversity informatics community has developed best practices and tools that provide the means to do retrospective georeferencing (e.g., the BioGeomancer toolkit), a process that converts heterogeneous descriptions into geographic coordinates and a measurement of spatial uncertainty. Even with these methods and tools, data publishers are faced with the immensely time-consuming task of vetting georeferenced localities. Furthermore, it is likely that overlap in georeferencing effort is occurring across data publishers. Solutions are needed that help publishers more effectively georeference their records, verify their quality, and eliminate the duplication of effort across publishers. Results We have developed a tool called BioGeoBIF, which incorporates the high throughput and standardized georeferencing methods of BioGeomancer into a beginning-to-end workflow. Custodians who publish their data to the Global Biodiversity Information Facility (GBIF) can use this system to improve the quantity and quality of their georeferences. BioGeoBIF harvests records directly from the publishers' access points, georeferences the records using the BioGeomancer web-service, and makes results available to data managers for inclusion at the source. Using a web-based, password-protected, group management system for each data publisher, we leave data ownership, management, and vetting responsibilities with the managers and collaborators of each data set. We also minimize the georeferencing task, by combining and storing unique textual localities from all registered data access points, and dynamically linking that information to the password protected record information for each publisher. Conclusion We have developed one of the first examples of services that can help create higher quality data for publishers mediated through the Global Biodiversity Information Facility and its data portal. This service is one step towards solving many problems of data quality in the growing field of biodiversity informatics. We envision future improvements to our service that include faster results returns and inclusion of more georeferencing engines

    The Early Positive Approaches to Support (E-PAtS) study: study protocol for a feasibility cluster randomised controlled trial of a group programme (E-PAtS) for family caregivers of young children with intellectual disability

    Get PDF
    Background: Children with intellectual disability have an IQ < 70, associated deficits in adaptive skills and are at increased risk of having clinically concerning levels of behaviour problems. In addition, parents of children with intellectual disability are likely to report high levels of mental health and other psychological problems. The Early Positive Approaches to Support (E-PAtS) programme for family caregivers of young children (5 years and under) with intellectual and developmental disabilities is a group-based intervention which aims to enhance parental psychosocial wellbeing and service access and support positive development for children. The aim of this study is to assess the feasibility of delivering E-PAtS to family caregivers of children with intellectual disability by community parenting support service provider organisations. The study will inform a potential, definitive RCT of the effectiveness and cost-effectiveness of E-PAtS. Methods: This study is a feasibility cluster randomised controlled trial, with embedded process evaluation. Up to 2 family caregivers will be recruited from 64 families with a child (18 months to 5 years) with intellectual disability at research sites in the UK. Participating families will be allocated to intervention: control on a 1:1 basis; intervention families will be offered the E-PAtS programme immediately, continuing to receive usual practice, and control participants will be offered the opportunity to attend the E-PAtS programme at the end of the follow-up period and will continue to receive usual practice. Data will be collected at baseline, 3 months post-randomisation and 12 months post-randomisation. The primary aim is to assess feasibility via the assessment of: recruitment of service provider organisations; participant recruitment; randomisation; retention; intervention adherence; intervention fidelity and the views of participants, intervention facilitators and service provider organisations regarding intervention delivery and study processes. The secondary aim is preliminary evaluation of a range of established outcome measures for individual family members, subsystem relationships and overall family functioning, plus additional health economic outcomes for inclusion in a future definitive trial. Discussion: The results of this study will inform a potential future definitive trial, to evaluate the effectiveness and cost-effectiveness of the E-PAtS intervention to improve parental psychosocial wellbeing. Such a trial would have significant scientific impact internationally in the intellectual disability field

    Darwin Core: An Evolving Community-Developed Biodiversity Data Standard

    Get PDF
    Biodiversity data derive from myriad sources stored in various formats on many distinct hardware and software platforms. An essential step towards understanding global patterns of biodiversity is to provide a standardized view of these heterogeneous data sources to improve interoperability. Fundamental to this advance are definitions of common terms. This paper describes the evolution and development of Darwin Core, a data standard for publishing and integrating biodiversity information. We focus on the categories of terms that define the standard, differences between simple and relational Darwin Core, how the standard has been implemented, and the community processes that are essential for maintenance and growth of the standard. We present case-study extensions of the Darwin Core into new research communities, including metagenomics and genetic resources. We close by showing how Darwin Core records are integrated to create new knowledge products documenting species distributions and changes due to environmental perturbations

    Comparison of geometric morphometric outline methods in the discrimination of age-related differences in feather shape

    Get PDF
    BACKGROUND: Geometric morphometric methods of capturing information about curves or outlines of organismal structures may be used in conjunction with canonical variates analysis (CVA) to assign specimens to groups or populations based on their shapes. This methodological paper examines approaches to optimizing the classification of specimens based on their outlines. This study examines the performance of four approaches to the mathematical representation of outlines and two different approaches to curve measurement as applied to a collection of feather outlines. A new approach to the dimension reduction necessary to carry out a CVA on this type of outline data with modest sample sizes is also presented, and its performance is compared to two other approaches to dimension reduction. RESULTS: Two semi-landmark-based methods, bending energy alignment and perpendicular projection, are shown to produce roughly equal rates of classification, as do elliptical Fourier methods and the extended eigenshape method of outline measurement. Rates of classification were not highly dependent on the number of points used to represent a curve or the manner in which those points were acquired. The new approach to dimensionality reduction, which utilizes a variable number of principal component (PC) axes, produced higher cross-validation assignment rates than either the standard approach of using a fixed number of PC axes or a partial least squares method. CONCLUSION: Classification of specimens based on feather shape was not highly dependent of the details of the method used to capture shape information. The choice of dimensionality reduction approach was more of a factor, and the cross validation rate of assignment may be optimized using the variable number of PC axes method presented herein

    The taxonomic name resolution service : an online tool for automated standardization of plant names

    Get PDF
    © The Author(s), 2013. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in BMC Bioinformatics 14 (2013): 16, doi:10.1186/1471-2105-14-16.The digitization of biodiversity data is leading to the widespread application of taxon names that are superfluous, ambiguous or incorrect, resulting in mismatched records and inflated species numbers. The ultimate consequences of misspelled names and bad taxonomy are erroneous scientific conclusions and faulty policy decisions. The lack of tools for correcting this ‘names problem’ has become a fundamental obstacle to integrating disparate data sources and advancing the progress of biodiversity science. The TNRS, or Taxonomic Name Resolution Service, is an online application for automated and user-supervised standardization of plant scientific names. The TNRS builds upon and extends existing open-source applications for name parsing and fuzzy matching. Names are standardized against multiple reference taxonomies, including the Missouri Botanical Garden's Tropicos database. Capable of processing thousands of names in a single operation, the TNRS parses and corrects misspelled names and authorities, standardizes variant spellings, and converts nomenclatural synonyms to accepted names. Family names can be included to increase match accuracy and resolve many types of homonyms. Partial matching of higher taxa combined with extraction of annotations, accession numbers and morphospecies allows the TNRS to standardize taxonomy across a broad range of active and legacy datasets. We show how the TNRS can resolve many forms of taxonomic semantic heterogeneity, correct spelling errors and eliminate spurious names. As a result, the TNRS can aid the integration of disparate biological datasets. Although the TNRS was developed to aid in standardizing plant names, its underlying algorithms and design can be extended to all organisms and nomenclatural codes. The TNRS is accessible via a web interface at http://tnrs.iplantcollaborative.org/ webcite and as a RESTful web service and application programming interface. Source code is available at https://github.com/iPlantCollaborativeOpenSource/TNRS/ webcite.BJE was supported by NSF grant DBI 0850373 and TR by CSIRO Marine and Atmospheric Research, Australia,. BB and BJE acknowledge early financial support from Conservation International and TEAM who funded the development of early prototypes of taxonomic name resolution. The iPlant Collaborative (http://www.iplantcollaborative.org) is funded by a grant from the National Science Foundation (#DBI-0735191)

    Tracking a Medically Important Spider: Climate Change, Ecological Niche Modeling, and the Brown Recluse (Loxosceles reclusa)

    Get PDF
    Most spiders use venom to paralyze their prey and are commonly feared for their potential to cause injury to humans. In North America, one species in particular, Loxosceles reclusa (brown recluse spider, Sicariidae), causes the majority of necrotic wounds induced by the Araneae. However, its distributional limitations are poorly understood and, as a result, medical professionals routinely misdiagnose brown recluse bites outside endemic areas, confusing putative spider bites for other serious conditions. To address the issue of brown recluse distribution, we employ ecological niche modeling to investigate the present and future distributional potential of this species. We delineate range boundaries and demonstrate that under future climate change scenarios, the spider's distribution may expand northward, invading previously unaffected regions of the USA. At present, the spider's range is centered in the USA, from Kansas east to Kentucky and from southern Iowa south to Louisiana. Newly influenced areas may include parts of Nebraska, Minnesota, Wisconsin, Michigan, South Dakota, Ohio, and Pennsylvania. These results illustrate a potential negative consequence of climate change on humans and will aid medical professionals in proper bite identification/treatment, potentially reducing bite misdiagnoses

    Pleistocene Climate, Phylogeny, and Climate Envelope Models: An Integrative Approach to Better Understand Species' Response to Climate Change

    Get PDF
    Mean annual temperature reported by the Intergovernmental Panel on Climate Change increases at least 1.1°C to 6.4°C over the next 90 years. In context, a change in climate of 6°C is approximately the difference between the mean annual temperature of the Last Glacial Maximum (LGM) and our current warm interglacial. Species have been responding to changing climate throughout Earth's history and their previous biological responses can inform our expectations for future climate change. Here we synthesize geological evidence in the form of stable oxygen isotopes, general circulation paleoclimate models, species' evolutionary relatedness, and species' geographic distributions. We use the stable oxygen isotope record to develop a series of temporally high-resolution paleoclimate reconstructions spanning the Middle Pleistocene to Recent, which we use to map ancestral climatic envelope reconstructions for North American rattlesnakes. A simple linear interpolation between current climate and a general circulation paleoclimate model of the LGM using stable oxygen isotope ratios provides good estimates of paleoclimate at other time periods. We use geologically informed rates of change derived from these reconstructions to predict magnitudes and rates of change in species' suitable habitat over the next century. Our approach to modeling the past suitable habitat of species is general and can be adopted by others. We use multiple lines of evidence of past climate (isotopes and climate models), phylogenetic topology (to correct the models for long-term changes in the suitable habitat of a species), and the fossil record, however sparse, to cross check the models. Our models indicate the annual rate of displacement in a clade of rattlesnakes over the next century will be 2 to 3 orders of magnitude greater (430-2,420 m/yr) than it has been on average for the past 320 ky (2.3 m/yr)

    A Gap Analysis Methodology for Collecting Crop Genepools: A Case Study with Phaseolus Beans

    Get PDF
    Background The wild relatives of crops represent a major source of valuable traits for crop improvement. These resources are threatened by habitat destruction, land use changes, and other factors, requiring their urgent collection and long-term availability for research and breeding from ex situ collections. We propose a method to identify gaps in ex situ collections (i.e. gap analysis) of crop wild relatives as a means to guide efficient and effective collecting activities. Methodology/Principal Findings The methodology prioritizes among taxa based on a combination of sampling, geographic, and environmental gaps. We apply the gap analysis methodology to wild taxa of the Phaseolus genepool. Of 85 taxa, 48 (56.5%) are assigned high priority for collecting due to lack of, or under-representation, in genebanks, 17 taxa are given medium priority for collecting, 15 low priority, and 5 species are assessed as adequately represented in ex situ collections. Gap “hotspots”, representing priority target areas for collecting, are concentrated in central Mexico, although the narrow endemic nature of a suite of priority species adds a number of specific additional regions to spatial collecting priorities. Conclusions/Significance Results of the gap analysis method mostly align very well with expert opinion of gaps in ex situ collections, with only a few exceptions. A more detailed prioritization of taxa and geographic areas for collection can be achieved by including in the analysis predictive threat factors, such as climate change or habitat destruction, or by adding additional prioritization filters, such as the degree of relatedness to cultivated species (i.e. ease of use in crop breeding). Furthermore, results for multiple crop genepools may be overlaid, which would allow a global analysis of gaps in ex situ collections of the world's plant genetic resource

    Developing Global Maps of the Dominant Anopheles Vectors of Human Malaria

    Get PDF
    Simon Hay and colleagues describe how the Malaria Atlas Project has collated anopheline occurrence data to map the geographic distributions of the dominant mosquito vectors of human malaria
    corecore