Search CORE

611 research outputs found

Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection

Author: Ding Changxing
Huang Shaoli
Li Zijian
Zhong Xubin
Publication venue
Publication date: 12/07/2022
Field of study

Human-Object Interaction (HOI) detection is a core task for high-level image understanding. Recently, Detection Transformer (DETR)-based HOI detectors have become popular due to their superior performance and efficient structure. However, these approaches typically adopt fixed HOI queries for all testing images, which is vulnerable to the location change of objects in one specific image. Accordingly, in this paper, we propose to enhance DETR's robustness by mining hard-positive queries, which are forced to make correct predictions using partial visual cues. First, we explicitly compose hard-positive queries according to the ground-truth (GT) position of labeled human-object pairs for each training image. Specifically, we shift the GT bounding boxes of each labeled human-object pair so that the shifted boxes cover only a certain portion of the GT ones. We encode the coordinates of the shifted boxes for each labeled human-object pair into an HOI query. Second, we implicitly construct another set of hard-positive queries by masking the top scores in cross-attention maps of the decoder layers. The masked attention maps then only cover partial important cues for HOI predictions. Finally, an alternate strategy is proposed that efficiently combines both types of hard queries. In each iteration, both DETR's learnable queries and one selected type of hard-positive queries are adopted for loss computation. Experimental results show that our proposed approach can be widely applied to existing DETR-based HOI detectors. Moreover, we consistently achieve state-of-the-art performance on three benchmarks: HICO-DET, V-COCO, and HOI-A. Code is available at https://github.com/MuchHair/HQM.Comment: Accepted by ECCV202

arXiv.org e-Print Archive

Magnetic resonance imaging features of alveolar soft part sarcoma: report of 14 cases

Author: Xubin Li
Zhaoxiang Ye
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

A convexity approach to dynamic output feedback robust MPC for LPV systems with bounded disturbances

Author: Ding Baocang
Li Zhiwu
Ping Xubin
Raissi Tarek
Yang Sen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/02/2019
Field of study

International audienceA convexity approach to dynamic output feedback robust model predictive control (OFRMPC) is proposed for linear parameter varying (LPV) systems with bounded disturbances. At each sampling time, the model parameters and disturbances are assumed to be unknown but bounded within pre-specified convex sets. Robust stability conditions on the augmented closed-loop system are derived using the techniques of robust positively invariant (RPI) set and the S-procedure. A convexity method reformulates the non-convex bilinear matrix inequalities (BMIs) problem as a convex optimization one such that the on-line computational burden is significantly reduced. The on-line optimized dynamic output feedback controller parameters steer the augmented states to converge within RPI sets and recursive feasibility of the optimization problem is guaranteed. Furthermore, bounds of the estimation error set are refreshed by updating the shape matrix of the future ellipsoidal estimation error set. The dynamic OFRMPC approach guarantees that the disturbance-free augmented closed-loop system (without consideration of disturbances) converges to the origin. In addition, when the system is subject to bounded disturbances, the augmented closed-loop system converges to a neighborhood of the origin. Two simulation examples are given to verify the effectiveness of the approach

Crossref

HAL Portal Cnam (French National Conservatory for Arts and Crafts)

HAL: Hyper Article en Ligne

Long Short-Term Planning for Conversational Recommendation Systems

Author: Li Xian
Li Xubin
Nguyen Cam-Tu
Shi Hongguang
Wang Yunfei
Zhang Yeqin
Publication venue
Publication date: 23/10/2023
Field of study

In Conversational Recommendation Systems (CRS), the central question is how the conversational agent can naturally ask for user preferences and provide suitable recommendations. Existing works mainly follow the hierarchical architecture, where a higher policy decides whether to invoke the conversation module (to ask questions) or the recommendation module (to make recommendations). This architecture prevents these two components from fully interacting with each other. In contrast, this paper proposes a novel architecture, the long short-term feedback architecture, to connect these two essential components in CRS. Specifically, the recommendation predicts the long-term recommendation target based on the conversational context and the user history. Driven by the targeted recommendation, the conversational model predicts the next topic or attribute to verify if the user preference matches the target. The balance feedback loop continues until the short-term planner output matches the long-term planner output, that is when the system should make the recommendation.Comment: 14 pages, 3 figures. Accepted by ICONIP 202

arXiv.org e-Print Archive

Exhaustive Exploitation of Nature-inspired Computation for Cancer Screening in an Ensemble Manner

Author: Li Xiangtao
Ma Zhiqing
Wang Xubin
Wang Yunhe
Wong Ka-Chun
Publication venue
Publication date: 06/04/2024
Field of study

Accurate screening of cancer types is crucial for effective cancer detection and precise treatment selection. However, the association between gene expression profiles and tumors is often limited to a small number of biomarker genes. While computational methods using nature-inspired algorithms have shown promise in selecting predictive genes, existing techniques are limited by inefficient search and poor generalization across diverse datasets. This study presents a framework termed Evolutionary Optimized Diverse Ensemble Learning (EODE) to improve ensemble learning for cancer classification from gene expression data. The EODE methodology combines an intelligent grey wolf optimization algorithm for selective feature space reduction, guided random injection modeling for ensemble diversity enhancement, and subset model optimization for synergistic classifier combinations. Extensive experiments were conducted across 35 gene expression benchmark datasets encompassing varied cancer types. Results demonstrated that EODE obtained significantly improved screening accuracy over individual and conventionally aggregated models. The integrated optimization of advanced feature selection, directed specialized modeling, and cooperative classifier ensembles helps address key challenges in current nature-inspired approaches. This provides an effective framework for robust and generalized ensemble learning with gene expression biomarkers. Specifically, we have opened EODE source code on Github at https://github.com/wangxb96/EODE

arXiv.org e-Print Archive

Constructing Media-based Enterprise Networks for Stock Market Risk Analysis

Author: Dong Shengxin
Duan Jiang
Li Qing
Luo Xubin
Wang Qinyu
Xing Rong
Publication venue: AIS Electronic Library (AISeL)
Publication date: 04/12/2017
Field of study

Stock comovement analysis is essential to understand the mechanism of stock markets. Previous studies focus on the comovement from the perspectives of fundamentals or preferences of investors. In this article, we propose a framework to explore the comovements of stocks in terms of their relationships in Web media. This is achieved by constructing media-based enterprise networks in terms of the co-exposure in news reports of stocks and mutual attentions among them. Our experiments based on CSI 300 listed firms show the significant comovements of stocks brought out by their behaviors in Web media. Furthermore, utilizing media based enterprise networks can help us identify the most influential firms which can stir up the stock markets

AIS Electronic Library (AISeL)

Graph Transformer for Recommendation

Author: Huang Chao
Li Chaoliu
Ren Xubin
Xia Lianghao
Xu Yong
Ye Yaowen
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 04/06/2023
Field of study

This paper presents a novel approach to representation learning in recommender systems by integrating generative self-supervised learning with graph transformer architecture. We highlight the importance of high-quality data augmentation with relevant self-supervised pretext tasks for improving performance. Towards this end, we propose a new approach that automates the self-supervision augmentation process through a rationale-aware generative SSL that distills informative user-item interaction patterns. The proposed recommender with Graph TransFormer (GFormer) that offers parameterized collaborative rationale discovery for selective augmentation while preserving global-aware user-item relationships. In GFormer, we allow the rationale-aware SSL to inspire graph collaborative filtering with task-adaptive invariant rationalization in graph transformer. The experimental results reveal that our GFormer has the capability to consistently improve the performance over baselines on different datasets. Several in-depth experiments further investigate the invariant rationale-aware augmentation from various aspects. The source code for this work is publicly available at: https://github.com/HKUDS/GFormer.Comment: Accepted by SIGIR'202

arXiv.org e-Print Archive