249 research outputs found
Steganography integration into a low-bit rate speech codec
Low bit-rate speech codecs have been widely used in audio communications like VoIP and mobile communications, so that steganography in low bit-rate audio streams would have broad applications in practice. In this paper, the authors propose a new algorithm for steganography in low bit-rate VoIP audio streams by integrating information hiding into the process of speech encoding. The proposed algorithm performs data embedding while pitch period prediction is conducted during low bit-rate speech encoding, thus maintaining synchronization between information hiding and speech encoding. The steganography algorithm can achieve high quality of speech and prevent detection of steganalysis, but also has great compatibility with a standard low bit-rate speech codec without causing further delay by data embedding and extraction. Testing shows, with the proposed algorithm, the data embedding rate of the secret message can attain 4 bits / frame (133.3 bits / second)
Effective Distillation of Table-based Reasoning Ability from LLMs
Large Language Models (LLMs) have demonstrated remarkable performance across
a wide range of natural language processing tasks. However, their remarkable
parameter size and their impressive high requirement of computing resources
pose challenges for their practical deployment. Recent research has revealed
that specific capabilities of LLMs, such as numerical reasoning, can be
transferred to smaller models through distillation. Some studies explore the
potential of leveraging LLMs to perform table-based reasoning. Nevertheless,
prior to our work, there has been no investigation into the prospect of
specialising table reasoning skills in smaller models specifically tailored for
table-to-text generation tasks. In this paper, we propose a novel table-based
reasoning distillation, with the aim of distilling distilling LLMs into
tailored, smaller models specifically designed for table-based reasoning task.
Experimental results have shown that a 0.22 billion parameter model
(Flan-T5-base) fine-tuned using distilled data, not only achieves a significant
improvement compared to traditionally fine-tuned baselines but also surpasses
specific LLMs like gpt-3.5-turbo on the scientific table-to-text generation
dataset (SciGen). The code and data are released in
https://github.com/Bernard-Yang/TableDistill
Effective Distillation of Table-based Reasoning Ability from LLMs
Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of natural language processing tasks. However, their enormous parameter size and extremely high requirements for compute power pose challenges for their practical deployment. Recent research has revealed that specific capabilities of LLMs, such as numerical reasoning, can be transferred to smaller models through distillation. Some studies explore the potential of leveraging LLMs to perform table-based reasoning. However, there has been no prior work focusing on table reasoning skills in smaller models specifically tailored for scientific table-to-text generation tasks. In this paper, we propose a novel table-based reasoning distillation approach, with the aim of distilling LLMs into tailored smaller models. Our experimental results have shown that a 220 million parameter model (Flan-T5-base) fine-tuned using distilled data, not only achieves a significant improvement compared to traditionally fine-tuned baselines, but also surpasses specific LLMs on a scientific table-to-text generation dataset. Our code is available at https://github.com/Bernard-Yang/DistillTableCoT
Posterior urethral hamartoma with hypospadias in a child: a case report and literature review
BackgroundHamartoma is a mass formed by the proliferation and disorder of two or more kinds of cells inherent in normal organs or anatomical parts, which can occur in any part of the body. The most common hamartoma are kidney hamartoma, spleen hamartoma, liver hamartoma, and lung hamartoma. Urethral hamartoma is extremely rare in clinical practice.Case reportCombined with literature review, the diagnosis and treatment process of a child with posterior urethral hamartoma and hypospadias in our hospital were analyzed. The patient was cured after surgical treatment, the lesion was completely removed, the appearance was satisfactory, and there was no recurrence, urethral stricture, urethral fistula, and other complications. The pathological results of this case support the histological diagnosis of hamartoma, which provides reference for the clinical diagnosis and treatment of congenital malformation and tumor of urogenital in children.ConclusionWhen a child has posterior urethral hamartoma, the symptoms may not be very typical, and it is often combined with urethral malformation. Therefore, it is necessary to perform careful physical examination combined with pathological examination to be able to make an accurate diagnosis. Under normal circumstances, the prognosis of urethral hamartoma is good. However, more cases are needed to be observed for verification, and a long-term effective follow-up after surgery is needed
BioMNER: A Dataset for Biomedical Method Entity Recognition
Named entity recognition (NER) stands as a fundamental and pivotal task
within the realm of Natural Language Processing. Particularly within the domain
of Biomedical Method NER, this task presents notable challenges, stemming from
the continual influx of domain-specific terminologies in scholarly literature.
Current research in Biomedical Method (BioMethod) NER suffers from a scarcity
of resources, primarily attributed to the intricate nature of methodological
concepts, which necessitate a profound understanding for precise delineation.
In this study, we propose a novel dataset for biomedical method entity
recognition, employing an automated BioMethod entity recognition and
information retrieval system to assist human annotation. Furthermore, we
comprehensively explore a range of conventional and contemporary open-domain
NER methodologies, including the utilization of cutting-edge large-scale
language models (LLMs) customised to our dataset. Our empirical findings reveal
that the large parameter counts of language models surprisingly inhibit the
effective assimilation of entity extraction patterns pertaining to biomedical
methods. Remarkably, the approach, leveraging the modestly sized ALBERT model
(only 11MB), in conjunction with conditional random fields (CRF), achieves
state-of-the-art (SOTA) performance
ChatKBQA: A Generate-then-Retrieve Framework for Knowledge Base Question Answering with Fine-tuned Large Language Models
Knowledge Base Question Answering (KBQA) aims to derive answers to natural
language questions over large-scale knowledge bases (KBs), which are generally
divided into two research components: knowledge retrieval and semantic parsing.
However, three core challenges remain, including inefficient knowledge
retrieval, retrieval errors adversely affecting semantic parsing, and the
complexity of previous KBQA methods. In the era of large language models
(LLMs), we introduce ChatKBQA, a novel generate-then-retrieve KBQA framework
built on fine-tuning open-source LLMs such as Llama-2, ChatGLM2 and Baichuan2.
ChatKBQA proposes generating the logical form with fine-tuned LLMs first, then
retrieving and replacing entities and relations through an unsupervised
retrieval method, which improves both generation and retrieval more
straightforwardly. Experimental results reveal that ChatKBQA achieves new
state-of-the-art performance on standard KBQA datasets, WebQSP, and
ComplexWebQuestions (CWQ). This work also provides a new paradigm for combining
LLMs with knowledge graphs (KGs) for interpretable and knowledge-required
question answering. Our code is publicly available.Comment: Preprin
Study on induction hardening performance of 34CrNi3MoA steel crankshaft
The evolution of the temperature field, microstructure field, and residual stress field of a 34CrNi3MoA steel marine diesel engine crankshaft during medium-frequency induction hardening was studied based on an electromagnetic-thermal-transformation-stress coupled numerical model, which considers the effect of internal stress induced by transformation induced plasticity on residual stress. Using the equal conversion rate method, the austenitizing region of the crankshaft was determined during the induction heating stage. In the quenching stage, the parameters of the phase transformation model are derived from the continuous heating expansion curve and the continuous cooling transformation curve, and the phase transformation kinetics equation is used to analyze the phase transformation process of the crankshaft. The results indicate that extending the heating time can enhance the uniformity of the surface temperature of the crankshaft and the thickness of the hardened layer. The simulation results are validated by measurements of hardened layer, hardness and residual stress, and the simulation results are in good agreement with the experimental results
A nomogram for predicting cancer-specific survival and overall survival in elderly patients with nonmetastatic renal cell carcinoma
BackgroundRenal cell carcinoma (RCC) is a common malignant tumor in the elderly, with an increasing trend in recent years. We aimed to construct a nomogram of cancer-specific survival (CSS) and overall survival (OS) in elderly patients with nonmetastatic renal cell carcinoma (nmRCC).MethodsClinicopathological information was downloaded from the Surveillance, Epidemiology, and End Results (SEER) program in elderly patients with nmRCC from 2010 to 2015. All patients were randomly assigned to a training cohort (70%) or a validation cohort (30%). Univariate and multivariate Cox regression analyses were used to identify independent risk factors for patient outcomes in the training cohort. A nomogram was constructed based on these independent risk factors to predict the 1-, 3-, and 5-year CSS and OS in elderly patients with nmRCC. We used a range of methods to validate the accuracy and reliability of the model, including the calibration curve, consistency index (C-index), and the area under the receiver operating curve (AUC). Decision curve analysis (DCA) was used to test the clinical utility of the model.ResultsA total of 12,116 patients were enrolled in the study. Patients were randomly assigned to the training cohort (N = 8,514) and validation cohort (N = 3,602). In the training cohort, univariate and multivariate Cox regression analysis showed that age, marriage, tumor histological type, histological tumor grade, TN stage, tumor size, and surgery are independent risk factors for prognosis. A nomogram was constructed based on independent risk factors to predict CSS and OS at 1-, 3-, and 5- years in elderly patients with nmRCC. The C-index of the training and validation cohorts in CSS were 0.826 and 0.831; in OS, they were 0.733 and 0.734, respectively. The AUC results of the training and validation cohort were similar to the C-index. The calibration curve indicated that the observed value is highly consistent with the predicted value, meaning the model has good accuracy. DCA results suggest that the clinical significance of the nomogram is better than that of traditional TNM staging.ConclusionsWe built a nomogram prediction model to predict the 1-, 3- and 5-year CSS and OS of elderly nmRCC patients. This model has good accuracy and discrimination and can help doctors and patients make clinical decisions and active monitoring
DOTA: A Dynamically-Operated Photonic Tensor Core for Energy-Efficient Transformer Accelerator
The wide adoption and significant computing resource consumption of
attention-based Transformers, e.g., Vision Transformer and large language
models, have driven the demands for efficient hardware accelerators. While
electronic accelerators have been commonly used, there is a growing interest in
exploring photonics as an alternative technology due to its high energy
efficiency and ultra-fast processing speed. Optical neural networks (ONNs) have
demonstrated promising results for convolutional neural network (CNN) workloads
that only require weight-static linear operations. However, they fail to
efficiently support Transformer architectures with attention operations due to
the lack of ability to process dynamic full-range tensor multiplication. In
this work, we propose a customized high-performance and energy-efficient
photonic Transformer accelerator, DOTA. To overcome the fundamental limitation
of existing ONNs, we introduce a novel photonic tensor core, consisting of a
crossbar array of interference-based optical vector dot-product engines, that
supports highly-parallel, dynamic, and full-range matrix-matrix multiplication.
Our comprehensive evaluation demonstrates that DOTA achieves a >4x energy and a
>10x latency reduction compared to prior photonic accelerators, and delivers
over 20x energy reduction and 2 to 3 orders of magnitude lower latency compared
to the electronic Transformer accelerator. Our work highlights the immense
potential of photonic computing for efficient hardware accelerators,
particularly for advanced machine learning workloads.Comment: The short version is accepted by Next-Gen AI System Workshop at MLSys
202
- …
