603 research outputs found

    Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving

    Full text link
    Math Word Problem (MWP) solving presents a challenging task in Natural Language Processing (NLP). This study aims to provide MWP solvers with a more diverse training set, ultimately improving their ability to solve various math problems. We propose several methods for data augmentation by modifying the problem texts and equations, such as synonym replacement, rule-based: question replacement, and rule based: reversing question methodologies over two English MWP datasets. This study extends by introducing a new in-context learning augmentation method, employing the Llama-7b language model. This approach involves instruction-based prompting for rephrasing the math problem texts. Performance evaluations are conducted on 9 baseline models, revealing that augmentation methods outperform baseline models. Moreover, concatenating examples generated by various augmentation methods further improves performance.Comment: Accepted in SN Computer Scienc

    Scalable curriculum learning for artificial neural networks

    Get PDF
    WOS: 000405294100008Learning process of people usually starts with easy samples and goes towards hard ones. Using this method for machine learning is called curriculum learning. Samples are given in an order related to their difficulty level, rather than in random order. The aim of this approach is to create models that have better generalization performance. In existing studies, difficulty levels of the samples were determined by prior knowledge and given to the system. However, this is not a scalable approach for every application. Because of that, such studies were usually carried out in very limited application areas. In this study, a new approach is proposed that automatically generates difficulty levels of the samples from data sets. In this way, it is possible to overcome mentioned constraint in the implementations. Thus, curriculum and anti-curriculum learning methods could be applied on many different application areas. In the experiments where artificial neural networks are used as learners, more successful results were obtained with curriculum and anti- curriculum learning compared with the models where samples were given in random order. After various methods have been tried for determining the difficulty ratings of the samples, this study showed that ensemble learning-based approach is more successful

    Investigating Semi-Supervised Learning Algorithms in Text Datasets

    Full text link
    Using large training datasets enhances the generalization capabilities of neural networks. Semi-supervised learning (SSL) is useful when there are few labeled data and a lot of unlabeled data. SSL methods that use data augmentation are most successful for image datasets. In contrast, texts do not have consistent augmentation methods as images. Consequently, methods that use augmentation are not as effective in text data as they are in image data. In this study, we compared SSL algorithms that do not require augmentation; these are self-training, co-training, tri-training, and tri-training with disagreement. In the experiments, we used 4 different text datasets for different tasks. We examined the algorithms from a variety of perspectives by asking experiment questions and suggested several improvements. Among the algorithms, tri-training with disagreement showed the closest performance to the Oracle; however, performance gap shows that new semi-supervised algorithms or improvements in existing methods are needed.Comment: Innovations in Intelligent Systems and Applications Conference (ASYU

    Enhancing Retrieval-Augmented Generation Accuracy with Dynamic Chunking and Optimized Vector Search

    Get PDF
    Retrieval-Augmented Generation (RAG) architectures depend on the integration of efficient retrieval and ranking mechanisms to enhance response accuracy and relevance. This study investigates a novel approach to improving the response performance of RAG systems, leveraging dynamic chunking for contextual coherence, Sentence-Transformers (all-mpnet-base-v2) for high-quality embeddings, and cross-encoder-based re-ranking for retrieval refinement. Our evaluation utilizes RAGAS metrics to assess key performance metrics, including faithfulness, relevancy, correctness, and context precision. Empirical evaluations highlighted the significant impact of index choice on the performance. Our proposed approach integrates the FAISS HNSW index with re-ranking, resulting in a balanced architecture that improves response fidelity without compromising efficiency. These insights underscore the importance of advanced indexing and retrieval techniques in bridging the gap between large-scale language models and domain-specific information needs. The findings provide a robust framework for future research in optimizing RAG systems, particularly in scenarios requiring high-context preservation and precision

    Machine learning for human-centered and value-sensitive building energy efficiency

    Get PDF
    Enhancing building energy efficiency is one of the best strategies to reduce energy consumption and associated CO2 emissions. Recent studies emphasized the importance of occupant behavior as a key means of enhancing building energy efficiency. However, it is also critical that while we strive to enhance the energy efficiency of buildings through improving occupant behavior, we still pay enough attention to occupant comfort and satisfaction. Towards this goal, this research proposes a data-driven machine-learning-based approach to behavioral building energy efficiency, which could help better understand and predict the impact of occupant behavior on building energy consumption and occupant comfort; and help optimize occupant behavior for both energy saving and occupant comfort. Three types of models were developed and tested – simulation-data-driven, real-data-driven, and hybrid. Accordingly, the research included five primary research tasks. First, the importance levels of energy-related human values (e.g., thermal comfort) to building occupants and their current satisfaction levels with these values were identified, in order to better understand the factors that are associated with higher/lower importance and/or satisfaction levels and identify the potential factors that could help predict occupant comfort. Second, a data sensing and occupant feedback collection plan was developed, in order to capture and monitor the indoor environmental conditions, energy consumption, energy-related occupant behavior, and occupant comfort in real buildings. Third, a set of buildings were simulated, in order to model the energy consumption of different buildings in different contexts – in terms of occupant behavior, building sizes, weather conditions, etc.; and a simulation-data-driven occupant-behavior-sensitive machine learning-based model, which learns from simulation data, was developed for predicting hourly cooling energy consumption. Fourth, a set of real-data-driven occupant-behavior-sensitive machine learning-based models, which learn from real data (data collected from real buildings and real occupants), were developed for predicting hourly cooling and lighting energy consumption and thermal and visual occupant comfort; and a genetic algorithm-based optimization model for determining the optimal occupant behavior that can simultaneously reduce energy consumption and improve occupant comfort was developed. Compared to the simulation-data-driven approach, the real-data-driven approach aims to better capture and model the real-life behavior and comfort of occupants and the real-life energy-consumption patterns of buildings. Although successful in this regard, the resulting models may not generalize well outside of their training range. Fifth, a hybrid, occupant-behavior-sensitive machine learning-based model, which learns from both simulation data and real data, was developed for predicting hourly cooling and lighting energy consumption. The hybrid approach aims to overcome the limitations of both simulation-data-driven and real-data-driven approaches – especially the limited ability to capture occupant behavior and real-life consumption patterns in simulation-data-driven approaches and the limited generalizability of real-data-driven approaches to different cases – by learning from both types of data simultaneously. The experimental results show the potential of the proposed approach. The energy consumption prediction models achieved high prediction performance, and the thermal and visual comfort models were able to accurately represent the individual and group comfort levels. The optimization results showed potential behavioral energy savings in the range of 11% and 22%, with significant improvement in occupant comfort

    LegalTurk Optimized BERT for Multi-Label Text Classification and NER

    Full text link
    The introduction of the Transformer neural network, along with techniques like self-supervised pre-training and transfer learning, has paved the way for advanced models like BERT. Despite BERT's impressive performance, opportunities for further enhancement exist. To our knowledge, most efforts are focusing on improving BERT's performance in English and in general domains, with no study specifically addressing the legal Turkish domain. Our study is primarily dedicated to enhancing the BERT model within the legal Turkish domain through modifications in the pre-training phase. In this work, we introduce our innovative modified pre-training approach by combining diverse masking strategies. In the fine-tuning task, we focus on two essential downstream tasks in the legal domain: name entity recognition and multi-label text classification. To evaluate our modified pre-training approach, we fine-tuned all customized models alongside the original BERT models to compare their performance. Our modified approach demonstrated significant improvements in both NER and multi-label text classification tasks compared to the original BERT model. Finally, to showcase the impact of our proposed models, we trained our best models with different corpus sizes and compared them with BERTurk models. The experimental results demonstrate that our innovative approach, despite being pre-trained on a smaller corpus, competes with BERTurk

    Perceptions about the sexuality of women with fibromyalgia syndrome: a phenomenological study

    Get PDF
    Aims: The aim of this study was to explore and understand the perceptions and experiences of women with fibromyalgia syndrome regarding their sexuality. Background: Fibromyalgia syndrome is a chronic pathology, which compromises a woman’s physical, mental and emotional health. Although concerns related to sexuality are commonly reported, research has tended to focus on the physical symptoms. Design: An interpretive qualitative research methodology using Gadamer’s philosophical hermeneutics was carried out. Methods: This qualitative study explores the sexuality of women with fibromyalgia syndrome. A focus group and semi-structured interviews were conducted with 13 women with fibromyalgia syndrome. Data were collected between April - June 2014. Participants were recruited until findings reached saturation. Findings: Three themes define the perception of sexuality for these women: (i) Physical impact: don’t touch, don’t look; (ii) Sexuality and identity: fighting against their loss; (iii) Impact on the relationship: sexuality as a way of connecting the couple. Conclusion: Despite limitations, sexuality is important for the identity and quality of life of women with fibromyalgia syndrome. Together with the physical symptomology, guilt, fear and a lack of understanding compromise the coping process. Women need the support of their partner, their socio-family environment and health professionals. Nurses can aid the successful adjustment to sexual problems related to fibromyalgia syndrome

    Lingual ortodonti

    Get PDF
    Currently, aesthetics of the anterior teeth is a significant issue in general dentistry and the most frequently cited reason for patients seeking orthodontic treatment. For many years, lingual orthodontics was perceived as complex and problematic treatment procedures and therefore not widely used internationally. However, during the last decade, the percentage of patients treated with lingual orthodontics has increased, appliance systems have renewed, and the technique has developed to such an extent that in some cases, it is easier, quicker, and more accurate than traditional buccal orthodontics. The aim of this review is to evaluate and generally consider the usage of lingual orthodontics which commonly preferred by patients in routine orthodontic procedures and look through current applications. Additionally, special considerations regarding the contemporary diagnosis and treatment planning in lingual orthodontics are presented in this study. ÖZET Günümüzde, ön dişlerin estetik olarak dizilimi ve görünümü genel diş hekimliğinde önemli bir konu olmuş ve bu önem hastalar için lingual ortodonti tercihinde birinci sebep halini almıştır. Lingual ortodonti yıllardır karmaşık ve uygulanması zor tedavi yöntemlerinden kabul edilmiş ve uluslar arası alanda yaygın bir kullanım alanı bulamamıştır. Ancak son dönemlerde, bu yöntem aracılığıyla tedavi edilen hasta yüzdesi artmış, vakalarda uygulanan aygıtlar yenilenmiş, tedaviler kolaylaştırılmış, hızlandırılmış ve her geçen gün geleneksel ortodontik tedavilerle elde edilen başarılara ulaşılmaya başlanmıştır. Bu derlemenin amacı, ortodontide her geçen gün daha fazla hastanın tercih ettiği bu uygulamayı genel olarak değerlendirmek ve güncel uygulamalara genel bir bakış yapmaktır. Ayrıca lingual ortodontik uygulamalardaki güncel muayene ve tedavi planlamalarından, karşılaşılan özel durumlardan da bu çalışmada bahsedilecektir. Anahtar Kelimeler: Lingual ortodonti, güncel uygulamala

    Single dominant left coronary artery: An autopsy case report with review of literature

    Get PDF
    Coronary artery anomalous course is rare, reported incidence is approximately 0.3–1.3% of patients undergoing coronary angiography and approximately 1% of routine autopsy examinations. A single coronary artery is an unusual congenital anomaly where only one coronary artery arises from the aortic trunk from a single coronary ostium, supplying the entire heart. We describe here a rare case with an unusual dominant left circumflex artery and absent right coronary artery
    corecore