407 research outputs found
Effective Image Tampering Localization via Semantic Segmentation Network
With the widespread use of powerful image editing tools, image tampering
becomes easy and realistic. Existing image forensic methods still face
challenges of low accuracy and robustness. Note that the tampered regions are
typically semantic objects, in this letter we propose an effective image
tampering localization scheme based on deep semantic segmentation network.
ConvNeXt network is used as an encoder to learn better feature representation.
The multi-scale features are then fused by Upernet decoder for achieving better
locating capability. Combined loss and effective data augmentation are adopted
to ensure effective model training. Extensive experimental results confirm that
localization performance of our proposed scheme outperforms other
state-of-the-art ones
MSDRP: A Deep Learning Model Based on Multisource Data for Predicting Drug Response
Motivation: Cancer heterogeneity drastically affects cancer therapeutic outcomes. Predicting drug response in vitro is expected to help formulate personalized therapy regimens. In recent years, several computational models based on machine learning and deep learning have been proposed to predict drug response in vitro. However, most of these methods capture drug features based on a single drug description (e.g. drug structure), without considering the relationships between drugs and biological entities (e.g. target, diseases, and side effects). Moreover, most of these methods collect features separately for drugs and cell lines but fail to consider the pairwise interactions between drugs and cell lines.
Results: In this paper, we propose a deep learning framework, named MSDRP for drug response prediction. MSDRP uses an interaction module to capture interactions between drugs and cell lines, and integrates multiple associations/interactions between drugs and biological entities through similarity network fusion algorithms, outperforming some state-of-the-art models in all performance measures for all experiments. The experimental results of de novo test and independent test demonstrate the excellent performance of our model for new drugs. Furthermore, several case studies illustrate the rationality for using feature vectors derived from drug similarity matrices from multisource data to represent drugs and the interpretability of our model
Bring Adaptive Binding Prototypes to Generalized Referring Expression Segmentation
Referring Expression Segmentation (RES) has attracted rising attention,
aiming to identify and segment objects based on natural language expressions.
While substantial progress has been made in RES, the emergence of Generalized
Referring Expression Segmentation (GRES) introduces new challenges by allowing
expressions to describe multiple objects or lack specific object references.
Existing RES methods, usually rely on sophisticated encoder-decoder and feature
fusion modules, and are difficult to generate class prototypes that match each
instance individually when confronted with the complex referent and binary
labels of GRES. In this paper, reevaluating the differences between RES and
GRES, we propose a novel Model with Adaptive Binding Prototypes (MABP) that
adaptively binds queries to object features in the corresponding region. It
enables different query vectors to match instances of different categories or
different parts of the same instance, significantly expanding the decoder's
flexibility, dispersing global pressure across all queries, and easing the
demands on the encoder. Experimental results demonstrate that MABP
significantly outperforms state-of-the-art methods in all three splits on
gRefCOCO dataset. Meanwhile, MABP also surpasses state-of-the-art methods on
RefCOCO+ and G-Ref datasets, and achieves very competitive results on RefCOCO.
Code is available at https://github.com/buptLwz/MABPComment: 11 pages,7 figure
Three-dimensional subsurface defect shape reconstruction and visualisation by pulsed thermography
Defects detected by most thermographic inspection are represented in the form of 2D image, which might limit the understanding of where the defects initiate and how they grow over time. This paper introduces a novel technique to rapidly estimate the defect depth and thickness simultaneously based on one single-side inspection. For the first time, defects are reconstructed and visualised in the form of a 3D image using cost-effective and rapid pulsed thermography technology. The feasibility and effectiveness of the proposed solution is demonstrated through inspecting a composite specimen and a steel specimen with semi-closed airgaps. For the composite specimen, this technique can deliver comparatively low averaged percentage error of the estimated total 3D defect volume of less than 10%
Identifying the Serious Clinical Outcomes of Adverse Reactions to Drugs by a Multi-Task Deep Learning Framework
Adverse Drug Reactions (ADRs) have a direct impact on human health. As continuous pharmacovigilance and drug monitoring prove to be costly and time-consuming, computational methods have emerged as promising alternatives. However, most existing computational methods primarily focus on predicting whether or not the drug is associated with an adverse reaction and do not consider the core issue of drug benefit-risk assessment-whether the treatment outcome is serious when adverse drug reactions occur. To this end, we categorize serious clinical outcomes caused by adverse reactions to drugs into seven distinct classes and present a deep learning framework, so-called GCAP, for predicting the seriousness of clinical outcomes of adverse reactions to drugs. GCAP has two tasks: one is to predict whether adverse reactions to drugs cause serious clinical outcomes, and the other is to infer the corresponding classes of serious clinical outcomes. Experimental results demonstrate that our method is a powerful and robust framework with high extendibility. GCAP can serve as a useful tool to successfully address the challenge of predicting the seriousness of clinical outcomes stemming from adverse reactions to drugs
Low-complexity full-field ultrafast nonlinear dynamics prediction by a convolutional feature separation modeling method
The modeling and prediction of the ultrafast nonlinear dynamics in the
optical fiber are essential for the studies of laser design, experimental
optimization, and other fundamental applications. The traditional propagation
modeling method based on the nonlinear Schr\"odinger equation (NLSE) has long
been regarded as extremely time-consuming, especially for designing and
optimizing experiments. The recurrent neural network (RNN) has been implemented
as an accurate intensity prediction tool with reduced complexity and good
generalization capability. However, the complexity of long grid input points
and the flexibility of neural network structure should be further optimized for
broader applications. Here, we propose a convolutional feature separation
modeling method to predict full-field ultrafast nonlinear dynamics with low
complexity and high flexibility, where the linear effects are firstly modeled
by NLSE-derived methods, then a convolutional deep learning method is
implemented for nonlinearity modeling. With this method, the temporal relevance
of nonlinear effects is substantially shortened, and the parameters and scale
of neural networks can be greatly reduced. The running time achieves a 94%
reduction versus NLSE and an 87% reduction versus RNN without accuracy
deterioration. In addition, the input pulse conditions, including grid point
numbers, durations, peak powers, and propagation distance, can be flexibly
changed during the predicting process. The results represent a remarkable
improvement in the ultrafast nonlinear dynamics prediction and this work also
provides novel perspectives of the feature separation modeling method for
quickly and flexibly studying the nonlinear characteristics in other fields.Comment: 15 pages,9 figure
A clustering approach to detect faults with multi-component degradations in aircraft fuel systems
Accurate fault diagnosis and prognosis can significantly increase the safety and reliability of engineering systems and also reduce the maintenance costs. There is very limited relative research reported on the fault diagnosis of a complex system with multi-component degradation. The Complex Systems (CS) problem, which features multiple components simultaneously and nonlinearly interacting with each other and corresponding environment on multiple levels, has become an essential challenge in system engineering. In CS, even a single component degradation could cause misidentification of the fault severity level and lead to serious consequences. This paper introduces a new test rig to simulate multi-component degradations of the aircraft fuel system. A data analysis approach based on machine learning classification of both the time and frequency domain features is then proposed to detect and identify the fault severity level of CS with multi-component degradation. Results show that a) the fault can be sensitively detected with an accuracy > 99%; b) the severity of fault can be identified with an accuracy of 100%
Learning Domain-Aware Detection Head with Prompt Tuning
Domain adaptive object detection (DAOD) aims to generalize detectors trained
on an annotated source domain to an unlabelled target domain. However, existing
methods focus on reducing the domain bias of the detection backbone by
inferring a discriminative visual encoder, while ignoring the domain bias in
the detection head. Inspired by the high generalization of vision-language
models (VLMs), applying a VLM as the robust detection backbone following a
domain-aware detection head is a reasonable way to learn the discriminative
detector for each domain, rather than reducing the domain bias in traditional
methods. To achieve the above issue, we thus propose a novel DAOD framework
named Domain-Aware detection head with Prompt tuning (DA-Pro), which applies
the learnable domain-adaptive prompt to generate the dynamic detection head for
each domain. Formally, the domain-adaptive prompt consists of the
domain-invariant tokens, domain-specific tokens, and the domain-related textual
description along with the class label. Furthermore, two constraints between
the source and target domains are applied to ensure that the domain-adaptive
prompt can capture the domains-shared and domain-specific knowledge. A prompt
ensemble strategy is also proposed to reduce the effect of prompt disturbance.
Comprehensive experiments over multiple cross-domain adaptation tasks
demonstrate that using the domain-adaptive prompt can produce an effectively
domain-related detection head for boosting domain-adaptive object detection
Balancing Logit Variation for Long-tailed Semantic Segmentation
Semantic segmentation usually suffers from a long-tail data distribution. Due
to the imbalanced number of samples across categories, the features of those
tail classes may get squeezed into a narrow area in the feature space. Towards
a balanced feature distribution, we introduce category-wise variation into the
network predictions in the training phase such that an instance is no longer
projected to a feature point, but a small region instead. Such a perturbation
is highly dependent on the category scale, which appears as assigning smaller
variation to head classes and larger variation to tail classes. In this way, we
manage to close the gap between the feature areas of different categories,
resulting in a more balanced representation. It is noteworthy that the
introduced variation is discarded at the inference stage to facilitate a
confident prediction. Although with an embarrassingly simple implementation,
our method manifests itself in strong generalizability to various datasets and
task settings. Extensive experiments suggest that our plug-in design lends
itself well to a range of state-of-the-art approaches and boosts the
performance on top of them
- …
