786 research outputs found
Incremental Information Gain Analysis of Input Attribute Impact on RBF-Kernel SVM Spam Detection
The massive increase of spam is posing a very serious threat to email and SMS, which have become an important means of communication. Not only do spams annoy users, but they also become a security threat. Machine learning techniques have been widely used for spam detection. Email spams can be detected through detecting senders’ behaviour, the contents of an email, subject and source address, etc, while SMS spam detection usually is based on the tokens or features of messages due to short content. However, a comprehensive analysis of email/SMS content may provide cures for users to aware of email/SMS spams. We cannot completely depend on automatic tools to identify all spams. In this paper, we propose an analysis approach based on information entropy and incremental learning to see how various features affect the performance of an RBF-based SVM spam detector, so that to increase our awareness of a spam by sensing the features of a spam. The experiments were carried out on the spambase and SMSSpemCollection databases in UCI machine learning repository. The results show that some features have significant impacts on spam detection, of which users should be aware, and there exists a feature space that achieves Pareto efficiency in True Positive Rate and True Negative Rate
Fuzzy min-max neural networks for categorical data: application to missing data imputation
The fuzzy min–max neural network classifier is a supervised learning method. This classifier takes the hybrid neural networks and fuzzy systems approach. All input variables in the network are required to correspond to continuously valued variables, and this can be a significant constraint in many real-world situations where there are not only quantitative but also categorical data. The usual way of dealing with this type of variables is to replace the categorical by numerical values and treat them as if they were continuously valued. But this method, implicitly defines a possibly unsuitable metric for the categories. A number of different procedures have been proposed to tackle the problem. In this article, we present a new method. The procedure extends the fuzzy min–max neural network input to categorical variables by introducing new fuzzy sets, a new operation, and a new architecture. This provides for greater flexibility and wider application. The proposed method is then applied to missing data imputation in voting intention polls. The micro data—the set of the respondents’ individual answers to the questions—of this type of poll are especially suited for evaluating the method since they include a large number of numerical and categorical attributes
A linguistic analysis of lying in negative evaluations: The speech act performance of Chinese learners of Korean
이 논문은 중국인 한국어 학습자와 한국어 화자들 사이의 ‘거짓말’ 화행 양상을 언어학적으로 분석한 연구이다. 여기서 말하는 ‘거짓말’이란 요청, 사과, 거절 등과 같은 화행의 일종으로서 ‘부정적 평가’에 속하며 대화 참여자나 상황을 고려한 소위 ‘선의의 거짓말’을 가리키는 것으로 이해할 수 있을 것이다. 우리는 중국인 한국어 학습자 15명과 한국어 화자 15명을 대상으로 담화완성테스트(DCT)와 부연설명질문지(QFE)를 사용하여 피실험자들의 화행을 분석하였다. 피실험자 자신들의 설명과 한국어교육 전문가 다섯 명의 판정을 종합해 ‘거짓말’ 화행을 가려내고 통계 처리를 바탕으로 다음과 같은 결론에 도달했다. 한국어 화자들이 중국인 한국어 학습자들보다 (선의의) 거짓말을 더 많이 수행하는 것으로 나타났다. 그리고 두 집단 모두 부정적 평가가 사물에 관련된 경우보다 사람에 관련된 경우에 ‘거짓말’ 화행을 더 많이 사용한다. 그러나 화자와 청자 사이의 친소관계(distance)나 상하관계(power)는 거짓말 사용에 직접적 상관 관계를 보여주지 않았다. 이 연구는 지금까지 화행 연구 중에서 상대적으로 연구가 부진했던 부정평가와 ‘거짓말’ 화행에 대한 분석을 시도했다는 점에서 의미가 있다. 또한 한국어 화자와 중국인 한국어 학습자 사이에 보이는 화행 수행의 차이를 문화인식(cultural awareness)의 관점에서 해석해 볼 수 있는 가능성도 열어 주었다
Set-Codes with Small Intersections and Small Discrepancies
We are concerned with the problem of designing large families of subsets over
a common labeled ground set that have small pairwise intersections and the
property that the maximum discrepancy of the label values within each of the
sets is less than or equal to one. Our results, based on transversal designs,
factorizations of packings and Latin rectangles, show that by jointly
constructing the sets and labeling scheme, one can achieve optimal family sizes
for many parameter choices. Probabilistic arguments akin to those used for
pseudorandom generators lead to significantly suboptimal results when compared
to the proposed combinatorial methods. The design problem considered is
motivated by applications in molecular data storage and theoretical computer
science
Chloroplast Distribution in Arabidopsis thaliana (L.) Depends on Light Conditions during Growth
The branded and gendered Brazilian body: Material and symbolic constructions in an overlooked context
Since its first conference in 2006, CCT has been growing in size and reach. Some have noted that CCT has become much more European in the recent years, with North American and European countries taking turns to host the event
Recommended from our members
Re-calibrating DIY: Testing digital participation across dust sensors, fry pans and environmental pollution
An increasing number of low-cost and do-it-yourself (DIY) digital sensors for monitoring air quality are now in circulation. DIY technologies attempt to democratize environmental practices such as air quality sensing that might ordinarily be the domain of expert scientists. But in the process of setting up and using DIY sensors, citizens encounter just as many challenges for ensuring the accuracy of their devices and the validity of their data. In this article, we look specifically at the infrastructures and practices of DIY digital sensing. Through an analysis of urban sensing in London as an environmental media practice, we consider the specific techniques and challenges of calibrating DIY digital sensors for measuring air pollution to ensure the relative accuracy and validity of data. We ask, “How are DIY calibration practices expressive of particular political subjects and environmental relations—and not others?” “How might we re-calibrate DIY as a digital practice and political commitment through engagements with multiple genealogies and counter-genealogies of citizen-led inquiry?”ERC 31334
The Risk Factors Associated with Grip Lock Injuries in Artistic Gymnasts: A Systematic Review
Artistic gymnastics (AG) is a sport that demands grace, strength, and flexibility, leading to a broad spectrum of injuries. The dowel grip (DG) is widely used by gymnasts to securely hold onto the high bar or uneven bars. However, incorrect usage of the DG can result in grip lock (GL) injuries. This systematic review aims to (1) identify studies that have investigated the risk factors related to GL injuries among gymnasts and (2) synthesize the key evidence. A comprehensive electronic search was conducted in the following databases: PubMed, ScienceDirect, Elsevier, SportDiscus, and Google Scholar, covering the period from their inception until November 2022. The data extraction and analysis were independently completed by two investigators. A total of 90 relevant studies were initially identified, out of which seven clinical trials met the eligibility criteria. For the quantitative synthesis, five studies were included. The details extracted from each article include: the sample characteristics (number, gender, age, and health status), the study design, the instrumentation or intervention used, and the final results. Our results revealed that the underlying causes of the risk factors of GL injuries were the irregular checking of the dowel grip and the mating surface of the bar, the tearing of the dowel of the leather strap, and the use of the dowel grip in different competition apparatuses. In addition, GL injuries may occur either as severe forearm fractures or mild injuries. Excessive flexion of the forearm and overpronation of the wrist during rotational movements, such as the swing or backward/forward giant circle, may increase the possibility of GL injury on the high bar. Future studies should focus on GL injury prevention strategy and rehabilitation protocol for GL injuries. Further high-quality research is required to establish the validity of these findings. © 2023 by the authors.European Commission, EC: .02.2.69/0.0/0.0/18_054/0014627Published with the financial support of the European Union, as part of the project entitled Development of capacities and environment for boosting the international, intersectoral, and interdisciplinary cooperation At UWB, project reg. no.CZ.02.2.69/0.0/0.0/18_054/0014627
Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction
© 2020 ACM. Cross-platform account matching plays a significant role in social network analytics, and is beneficial for a wide range of applications. However, existing methods either heavily rely on high-quality user generated content (including user profiles) or suffer from data insufficiency problem if only focusing on network topology, which brings researchers into an insoluble dilemma of model selection. In this paper, to address this problem, we propose a novel framework that considers multi-level graph convolutions on both local network structure and hypergraph structure in a unified manner. The proposed method overcomes data insufficiency problem of existing work and does not necessarily rely on user demographic information. Moreover, to adapt the proposed method to be capable of handling large-scale social networks, we propose a two-phase space reconciliation mechanism to align the embedding spaces in both network partitioning based parallel training and account matching across different social networks. Extensive experiments have been conducted on two large-scale real-life social networks. The experimental results demonstrate that the proposed method outperforms the state-of-the-art models with a big margin
- …
