Adaptive Sentence Boundary Disambiguation
Labeling of sentence boundaries is a necessary prerequisite for many natural
language processing tasks, including part-of-speech tagging and sentence
alignment. End-of-sentence punctuation marks are ambiguous; to disambiguate
them most systems use brittle, special-purpose regular expression grammars and
exception rules. As an alternative, we have developed an efficient, trainable
algorithm that uses a lexicon with part-of-speech probabilities and a
feed-forward neural network. After training for less than one minute, the
method correctly labels over 98.5% of sentence boundaries in a corpus of over
27,000 sentence-boundary marks. We show the method to be efficient and easily
adaptable to different text genres, including single-case texts.
Comment: This is a LaTeX version of the previously submitted ps file
(formatted as a uuencoded gz-compressed .tar file created by csh script). The
software from the work described in this paper is available by contacting
the authors.
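As an illustration of the idea in the abstract above (a lexicon of part-of-speech probabilities feeding a neural network), the following minimal sketch builds the kind of context feature vector such a classifier might consume. The tag set, the toy `LEXICON`, the window size, and the helper names `pos_vector` and `context_features` are all assumptions made for this example; they are not taken from the paper, and the network itself is omitted.

```python
# Hypothetical toy lexicon: word -> probability of each part-of-speech class.
# The tag set and all numbers are illustrative assumptions, not the paper's data.
TAGS = ("noun", "verb", "abbreviation", "other")

LEXICON = {
    "dr": {"abbreviation": 0.9, "noun": 0.1},
    "smith": {"noun": 1.0},
    "arrived": {"verb": 1.0},
    "he": {"other": 1.0},
    "left": {"verb": 0.7, "other": 0.3},
}


def pos_vector(token):
    """Return the token's POS probabilities in TAGS order (uniform if unknown)."""
    probs = LEXICON.get(token.lower().rstrip("."))
    if probs is None:
        return [1.0 / len(TAGS)] * len(TAGS)
    return [probs.get(tag, 0.0) for tag in TAGS]


def context_features(tokens, index, window=2):
    """Concatenate POS vectors for `window` tokens on each side of tokens[index].

    Positions that fall outside the text contribute all-zero vectors.
    """
    features = []
    for offset in range(-window, window + 1):
        if offset == 0:
            continue  # skip the token carrying the punctuation mark itself
        pos = index + offset
        if 0 <= pos < len(tokens):
            features.extend(pos_vector(tokens[pos]))
        else:
            features.extend([0.0] * len(TAGS))
    return features


tokens = ["Dr.", "Smith", "arrived.", "He", "left."]
# Feature vector for the candidate boundary after "arrived."
features = context_features(tokens, 2)
```

A trained classifier would map each such vector to a boundary or non-boundary label; because the features are probabilities rather than hand-written rules, retraining on a new genre only requires new labeled examples.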
Multi Visualization and Dynamic Query for Effective Exploration of Semantic Data
Semantic formalisms represent content in a uniform way according to ontologies. This enables manipulation and reasoning via automated means (e.g. Semantic Web services), but limits the user’s ability to explore the semantic data from a point of view that originates from knowledge representation motivations. We show how, for user consumption, a visualization of semantic data according to some easily graspable dimensions (e.g. space and time) provides effective sense-making of data. In this paper, we look holistically at the interaction between users and semantic data, and propose multiple visualization strategies and dynamic filters to support the exploration of semantic-rich data.
We discuss a user evaluation and how interaction challenges could be overcome to create an effective user-centred framework for the visualization and manipulation of semantic data. The approach has been implemented and evaluated on a real company archive.
Exploring Large Digital Library Collections Using a Map-Based Visualisation
In this paper we describe a novel approach for exploring large document collections using a map-based visualisation. We use hierarchically structured semantic concepts that are attached to the documents to create a visualisation of the semantic space that resembles a Google Map. The approach is novel in that we exploit the hierarchical structure to enable the approach to scale to large document collections and to create a map where the higher levels of spatial abstraction have semantic meaning. An informal evaluation is carried out to gather subjective feedback from users. Overall results are positive, with users finding the visualisation enticing and easy to use.
The Relationship of Area-Level Sociodemographic Characteristics, Household Composition and Individual-Level Socioeconomic Status on Walking Behavior Among Adults
Understanding the contextual factors associated with why adults walk is important for those interested in increasing walking as a mode of transportation and leisure. This paper investigates the relationships between neighborhood-level sociodemographic context, individual-level sociodemographic characteristics and walking for leisure and transport. Data from two community-based studies of adults (n = 550) were used to determine the association between the Area Sociodemographic Environment (ASDE), calculated from U.S. Census variables, and individual-level SES as potential correlates of walking behavior. Descriptive statistics, mean comparisons and Pearson’s correlation coefficients were used to assess bivariate relationships. Generalized estimating equations were used to model the relationship between ASDE, as quartiles, and walking behavior. Adjusted models suggest adults engage in more minutes of walking for transportation and less walking for leisure in the most disadvantaged compared to the least disadvantaged neighborhoods, but adding individual-level demographics and SES eliminated the significant results. However, when models were stratified for free or reduced cost lunch, of those with children who qualified for free or reduced lunch, those who lived in the wealthiest neighborhoods engaged in 10.7 min less of total walking per day compared to those living in the most challenged neighborhoods (p < 0.001). Strategies to increase walking for transportation or leisure need to take account of individual-level socioeconomic factors in addition to area-level measures.
Multiple Sexual Partners and Condom use among 10 - 19 Year-olds in four Districts in Tanzania: What do we Learn?
Although some studies in Tanzania have addressed the question of sexuality and STIs among adolescents, mostly those aged 15 - 19 years, evidence on how multiple sexual partners influence condom use among 10 - 19 year-olds is limited. This study attempts to bridge this gap by testing the hypothesis that sexual relationships with multiple partners in the age group 10 - 19 years spur condom use during sex in four districts in Tanzania. Secondary analysis was performed using data from the Adolescents Module of the cross-sectional household survey on Maternal, Newborn and Child Health (MNCH) that was done in Kigoma, Kilombero, Rufiji and Ulanga districts, Tanzania in 2008. A total of 612 adolescents resulting from a random sample of 1200 households participated in this study. Pearson Chi-Square was used as a test of association between multiple sexual partners and condom use. A multivariate logistic regression model was fitted to the data to assess the effect of multiple sexual partners on condom use, having adjusted for potential confounding variables. STATA (10) statistical software was used to carry out this process at 5% two-sided significance level. Of the 612 adolescents interviewed, 23.4% reported being sexually active and 42.0% of these reported having had multiple (> 1) sexual partners in the last 12 months. The overall prevalence of condom use among them was 39.2%. The proportion using a condom at the last sexual intercourse was higher among those who knew that they could get a condom if they wanted one than among those who did not. No evidence of association was found between multiple sexual partners and condom use (OR = 0.77, 95% CI = 0.35 - 1.67, P = 0.504).
With younger adolescents (10 - 14 years) being a reference, condom use was associated with age group (15 - 19: OR = 3.69, 95% CI = 1.21 - 11.25, P = 0.022) and district of residence (Kigoma: OR = 7.45, 95% CI = 1.79 - 31.06, P = 0.006; Kilombero: OR = 8.89, 95% CI = 2.91 - 27.21, P < 0.001; Ulanga: OR = 5.88, 95% CI = 2.00 - 17.31, P = 0.001), Rufiji being a reference category. No evidence of association was found between multiple sexual partners and condom use among adolescents in the study area. The large proportion of adolescents who engage in sexual activity without using condoms, even those with multiple partners, perpetuates the risk of transmission of HIV infections in the community. Strategies such as sex education and easing access to and making a friendly environment for condom availability are important to address the risky sexual behaviour among adolescents.
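For readers unfamiliar with the OR and 95% CI figures quoted in the abstract above, the sketch below shows how an unadjusted odds ratio and a Wald confidence interval can be computed from a 2x2 table. This is a simplification and an assumption on our part: the study reports estimates from a multivariate logistic regression fitted in STATA, which adjusts for confounders, so this calculation would not reproduce its numbers. The function name `odds_ratio_ci` and the example counts are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: exposed with outcome,   b: exposed without outcome
    c: unexposed with outcome, d: unexposed without outcome
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only for illustration (not the study's data).
estimate, lower, upper = odds_ratio_ci(20, 40, 36, 47)
print(f"OR = {estimate:.2f}, 95% CI = {lower:.2f} - {upper:.2f}")
```

An interval that contains 1, as in the abstract's result for multiple partners (0.35 - 1.67), indicates no evidence of association at the 5% level.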
Biview learning for human posture segmentation from 3D points cloud
Posture segmentation plays an essential role in human motion analysis. The state-of-the-art method extracts sufficiently high-dimensional features from 3D depth images for each 3D point and learns an efficient body part classifier. However, high-dimensional features are memory-consuming and difficult to handle on large-scale training datasets. In this paper, we propose an efficient two-stage dimension reduction scheme, termed biview learning, to encode two independent views which are depth-difference features (DDF) and relative position features (RPF). Biview learning explores the complementary property of DDF and RPF, and uses two stages to learn a compact yet comprehensive low-dimensional feature space for posture segmentation. In the first stage, discriminative locality alignment (DLA) is applied to the high-dimensional DDF to learn a discriminative low-dimensional representation. In the second stage, canonical correlation analysis (CCA) is used to explore the complementary property of RPF and the dimensionality-reduced DDF. Finally, we train a support vector machine (SVM) over the output of CCA. We carefully validate the effectiveness of DLA and CCA utilized in the two-stage scheme on our 3D human points cloud dataset. Experimental results show that the proposed biview learning scheme significantly outperforms the state-of-the-art method for human posture segmentation. © 2014 Qiao et al.
Acceptability of Condom Promotion and Distribution Among 10-19 Year-Old Adolescents in Mpwapwa and Mbeya Rural Districts, Tanzania.
The HIV/AIDS pandemic remains a leading challenge for global health. Although condoms are acknowledged for their key role in preventing HIV transmission, low and inappropriate use of condoms persists in Tanzania and elsewhere in Africa. This study assesses factors affecting acceptability of condom promotion and distribution among adolescents in Mpwapwa and Mbeya rural districts of Tanzania. Data were collected in 2011 as part of a larger cross-sectional survey on condom use among 10-19 year-olds in Mpwapwa and Mbeya rural districts of Tanzania using a structured questionnaire. Associations between acceptability of condom promotion and distribution and each of the explanatory variables were tested using Chi Square. A multivariate logistic regression model was used to examine independent predictors of the acceptability of condom promotion and distribution using STATA (11) statistical software at 5% significance level. Mean age of the 1,327 adolescent participants (50.5% being males) was 13.5 years (SD = 1.4). Acceptance of condom promotion and distribution was found among 37% (35% in Mpwapwa and 39% in Mbeya rural) of the adolescents. Being sexually active and aged 15-19 was the strongest predictor of the acceptability of condom promotion and distribution (OR = 7.78, 95% CI 4.65-12.99). Other predictors were: not agreeing that a condom is effective in preventing transmissions of STIs including HIV (OR = 0.34, 95% CI 0.20-0.56), being a resident of Mbeya rural district (OR = 1.67, 95% CI 1.28-2.19), feeling comfortable being seen by parents/guardians holding/buying condoms (OR = 2.20, 95% CI 1.40-3.46) and living with a guardian (OR = 1.48, 95% CI 1.08-2.04). Acceptability of condom promotion and distribution among adolescents in Mpwapwa and Mbeya rural is low. The effect of sexual activity on the acceptability of condom promotion and distribution is age-dependent and was the strongest.
Feeling comfortable being seen by parents/guardians buying or holding condoms, perceived ability of condoms to offer protection against HIV/AIDS infections, district of residence and living arrangements also offered significant predictive effect. Knowledge of these factors is vital in designing successful and sustainable condom promotion and distribution programs in Tanzania.
Text Mining the History of Medicine
Historical text archives constitute a rich and diverse source of information, which is becoming increasingly readily accessible, due to large-scale digitisation efforts. However, it can be difficult for researchers to explore and search such large volumes of data in an efficient manner. Text mining (TM) methods can help, through their ability to recognise various types of semantic information automatically, e.g., instances of concepts (places, medical conditions, drugs, etc.), synonyms/variant forms of concepts, and relationships holding between concepts (which drugs are used to treat which medical conditions, etc.). TM analysis allows search systems to incorporate functionality such as automatic suggestions of synonyms of user-entered query terms, exploration of different concepts mentioned within search results or isolation of documents in which concepts are related in specific ways. However, applying TM methods to historical text can be challenging, owing to differences and evolutions in vocabulary, terminology, language structure and style, compared to more modern text. In this article, we present our efforts to overcome the various challenges faced in the semantic analysis of published historical medical text dating back to the mid 19th century. Firstly, we used evidence from diverse historical medical documents from different periods to develop new resources that provide accounts of the multiple, evolving ways in which concepts, their variants and relationships amongst them may be expressed. These resources were employed to support the development of a modular processing pipeline of TM tools for the robust detection of semantic information in historical medical documents with varying characteristics. We applied the pipeline to two large-scale medical document archives covering wide temporal ranges as the basis for the development of a publicly accessible semantically-oriented search system.
The novel resources are available for research purposes, while the processing pipeline and its modules may be used and configured within the Argo TM platform.
Validity and reliability of a modified English version of the physical activity questionnaire for adolescents
BACKGROUND: Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. METHODS: In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach’s alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. RESULTS: Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). CONCLUSIONS: The modified PAQ-A had acceptable internal consistency and test-retest reliability. 
Modified PAQ-A scores displayed weak-to-moderate correlations with objectively measured physical activity, self-reported fitness, and self-efficacy providing evidence of satisfactory criterion and construct validity, respectively. Further testing with more diverse English samples is recommended to provide a more complete assessment of the tool. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13690-016-0115-2) contains supplementary material, which is available to authorized users.
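The internal-consistency statistic reported in the abstract above, Cronbach's alpha, can be computed directly from item-level scores. The sketch below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name `cronbach_alpha` and the toy score matrix are assumptions for illustration and do not come from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a matrix with one row per respondent, one column per item."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of each respondent's summed score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses from four respondents to three items.
toy_scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
]
alpha = cronbach_alpha(toy_scores)
```

Values near 1 indicate that the items vary together; the abstract's α = 0.72 is described by its authors as acceptable.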
