953 research outputs found
Evolution of anthropogenic air pollutant emissions in Guangdong Province, China, from 2006 to 2015
Guangdong Province (GD), one of the most prosperous and populous regions in China, still experiences haze events and growing ozone pollution in spite of the substantial air-quality improvement in recent years. Integrated control of fine particulate matter (PM2.5) and ozone in GD calls for a systematic review of historical emissions. In this study, emission trends, spatial variations, source-contribution variations, and reduction potentials of sulfur dioxide (SO2), nitrogen oxides (NO), PM2.5, inhalable particles (PM10), carbon monoxide (CO), ammonia (NH3), and volatile organic compounds (VOCs) in GD from 2006 to 2015 were first examined using a dynamic methodology, taking into account economic development, technology penetration, and emission controls. The relative change rates of anthropogenic emissions in GD during 2006-2015 are -48% for SO2, -0.5% for NO, -16% for PM2.5, -22% for PM10, 13% for CO, 3% for NH3, and 13% for VOCs. The declines of SO2, NO, PM2.5, and PM10 emissions in the whole province mainly resulted from the stringent emission control in the Pearl River delta (PRD) region, where most previous control measures were focused, especially on power plants (SO2 and NO), industrial combustion (SO2, PM2.5, PM10), on-road mobile sources (NO), and dust sources (PM2.5 and PM10). Emissions from other areas (non-PRD, NPRD), nevertheless, remain relatively stable due to the lax control measures and rapidly growing energy consumption. In addition, emission leaks of SO2 and NO from industries are observed from PRD to NPRD in 2010 and 2011. As a result, emissions in NPRD are increasingly important in GD, particularly those from industrial combustion. The contribution of NPRD to the total SO2 emissions in GD, for example, increased from 27% in 2006 to 48% in 2015. On-road mobile sources and solvent use are the two key sources that should receive more effective control measures in GD. Current control-driven emission reductions from on-road mobile sources are neutralized by the substantial growth of the vehicle population, while VOC emissions in GD steadily increase due to the growth of solvent use and the absence of effective control measures. Besides, future work could focus on power plants and industrial combustion in GD and industrial process sources in NPRD, which still have large emission reduction potentials. The historical emission inventory developed in this study not only helps to understand the emission evolution in GD, but also provides robust data to quantify the impact of emission and meteorology variations on air quality and unveil the primary cause of significant air-quality change in GD in the recent decade
RAD: A Comprehensive Dataset for Benchmarking the Robustness of Image Anomaly Detection
Robustness against noisy imaging is crucial for practical image anomaly
detection systems. This study introduces a Robust Anomaly Detection (RAD)
dataset with free views, uneven illuminations, and blurry collections to
systematically evaluate the robustness of current anomaly detection methods.
Specifically, RAD aims to identify foreign objects on working platforms as
anomalies. The collection process incorporates various sources of imaging
noise, such as viewpoint changes, uneven illuminations, and blurry collections,
to replicate real-world inspection scenarios. Subsequently, we assess and
analyze 11 state-of-the-art unsupervised and zero-shot methods on RAD. Our
findings indicate that: 1) Variations in viewpoint, illumination, and blurring
affect anomaly detection methods to varying degrees; 2) Methods relying on
memory banks and assisted by synthetic anomalies demonstrate stronger
robustness; 3) Effectively leveraging the general knowledge of foundational
models is a promising avenue for enhancing the robustness of anomaly detection
methods. The dataset is available at https://github.com/hustCYQ/RAD-dataset.Comment: 6 pages, 5 figure
PrefixMol: Target- and Chemistry-aware Molecule Design via Prefix Embedding
Is there a unified model for generating molecules considering different
conditions, such as binding pockets and chemical properties? Although
target-aware generative models have made significant advances in drug design,
they do not consider chemistry conditions and cannot guarantee the desired
chemical properties. Unfortunately, merging the target-aware and chemical-aware
models into a unified model to meet customized requirements may lead to the
problem of negative transfer. Inspired by the success of multi-task learning in
the NLP area, we use prefix embeddings to provide a novel generative model that
considers both the targeted pocket's circumstances and a variety of chemical
properties. All conditional information is represented as learnable features,
which the generative model subsequently employs as a contextual prompt.
Experiments show that our model exhibits good controllability in both single
and multi-conditional molecular generation. The controllability enables us to
outperform previous structure-based drug design methods. More interestingly, we
open up the attention mechanism and reveal coupling relationships between
conditions, providing guidance for multi-conditional molecule generation
Aesthetics and Empowerment
Aesthetics in design are known to be empowering in terms of various perspectives in one’s life. This paper aims to discuss innovative approaches, methodologies, and technologies that can further enhance the role of aesthetics in empowering individuals in the digital age. By delving into the relationship between aesthetics and empowerment, we seek to foster discussions and collaborations among researchers, practitioners, designers, artists, and industry professionals. It is worth building up a platform for sharing insights, exchanging ideas, and exploring new and emerging trends in this exciting area
Reasoning on Efficient Knowledge Paths:Knowledge Graph Guides Large Language Model for Domain Question Answering
Large language models (LLMs), such as GPT3.5, GPT4 and LLAMA2 perform
surprisingly well and outperform human experts on many tasks. However, in many
domain-specific evaluations, these LLMs often suffer from hallucination
problems due to insufficient training of relevant corpus. Furthermore,
fine-tuning large models may face problems such as the LLMs are not open source
or the construction of high-quality domain instruction is difficult. Therefore,
structured knowledge databases such as knowledge graph can better provide
domain background knowledge for LLMs and make full use of the reasoning and
analysis capabilities of LLMs. In some previous works, LLM was called multiple
times to determine whether the current triplet was suitable for inclusion in
the subgraph when retrieving subgraphs through a question. Especially for the
question that require a multi-hop reasoning path, frequent calls to LLM will
consume a lot of computing power. Moreover, when choosing the reasoning path,
LLM will be called once for each step, and if one of the steps is selected
incorrectly, it will lead to the accumulation of errors in the following steps.
In this paper, we integrated and optimized a pipeline for selecting reasoning
paths from KG based on LLM, which can reduce the dependency on LLM. In
addition, we propose a simple and effective subgraph retrieval method based on
chain of thought (CoT) and page rank which can returns the paths most likely to
contain the answer. We conduct experiments on three datasets: GenMedGPT-5k
[14], WebQuestions [2], and CMCQA [21]. Finally, RoK can demonstrate that using
fewer LLM calls can achieve the same results as previous SOTAs models
2nd Place Winning Solution for the CVPR2023 Visual Anomaly and Novelty Detection Challenge: Multimodal Prompting for Data-centric Anomaly Detection
This technical report introduces the winning solution of the team Segment Any
Anomaly for the CVPR2023 Visual Anomaly and Novelty Detection (VAND) challenge.
Going beyond uni-modal prompt, e.g., language prompt, we present a novel
framework, i.e., Segment Any Anomaly + (SAA), for zero-shot anomaly
segmentation with multi-modal prompts for the regularization of cascaded modern
foundation models. Inspired by the great zero-shot generalization ability of
foundation models like Segment Anything, we first explore their assembly (SAA)
to leverage diverse multi-modal prior knowledge for anomaly localization.
Subsequently, we further introduce multimodal prompts (SAA) derived from
domain expert knowledge and target image context to enable the non-parameter
adaptation of foundation models to anomaly segmentation. The proposed SAA
model achieves state-of-the-art performance on several anomaly segmentation
benchmarks, including VisA and MVTec-AD, in the zero-shot setting. We will
release the code of our winning solution for the CVPR2023 VAN.Comment: The first two author contribute equally. CVPR workshop challenge
report. arXiv admin note: substantial text overlap with arXiv:2305.1072
FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries
Intelligence is key to advancing integrated circuit (IC) fabrication. Recent
breakthroughs in Large Multimodal Models (LMMs) have unlocked unparalleled
abilities in understanding images and text, fostering intelligent fabrication.
Leveraging the power of LMMs, we introduce FabGPT, a customized IC fabrication
large multimodal model for wafer defect knowledge query. FabGPT manifests
expertise in conducting defect detection in Scanning Electron Microscope (SEM)
images, performing root cause analysis, and providing expert question-answering
(Q&A) on fabrication processes. FabGPT matches enhanced multimodal features to
automatically detect minute defects under complex wafer backgrounds and reduce
the subjectivity of manual threshold settings. Besides, the proposed modulation
module and interactive corpus training strategy embed wafer defect knowledge
into the pre-trained model, effectively balancing Q&A queries related to defect
knowledge and original knowledge and mitigating the modality bias issues.
Experiments on in-house fab data (SEM-WaD) show that our FabGPT achieves
significant performance improvement in wafer defect detection and knowledge
querying
Multi-Modal Gaze Following in Conversational Scenarios
Gaze following estimates gaze targets of in-scene person by understanding
human behavior and scene information. Existing methods usually analyze scene
images for gaze following. However, compared with visual images, audio also
provides crucial cues for determining human behavior.This suggests that we can
further improve gaze following considering audio cues. In this paper, we
explore gaze following tasks in conversational scenarios. We propose a novel
multi-modal gaze following framework based on our observation ``audiences tend
to focus on the speaker''. We first leverage the correlation between audio and
lips, and classify speakers and listeners in a scene. We then use the identity
information to enhance scene images and propose a gaze candidate estimation
network. The network estimates gaze candidates from enhanced scene images and
we use MLP to match subjects with candidates as classification tasks. Existing
gaze following datasets focus on visual images while ignore audios.To evaluate
our method, we collect a conversational dataset, VideoGazeSpeech (VGS), which
is the first gaze following dataset including images and audio. Our method
significantly outperforms existing methods in VGS datasets. The visualization
result also prove the advantage of audio cues in gaze following tasks. Our work
will inspire more researches in multi-modal gaze following estimation
Brain structural covariance networks in obsessive-compulsive disorder: a graph analysis from the ENIGMA Consortium.
Brain structural covariance networks reflect covariation in morphology of different brain areas and are thought to reflect common trajectories in brain development and maturation. Large-scale investigation of structural covariance networks in obsessive-compulsive disorder (OCD) may provide clues to the pathophysiology of this neurodevelopmental disorder. Using T1-weighted MRI scans acquired from 1616 individuals with OCD and 1463 healthy controls across 37 datasets participating in the ENIGMA-OCD Working Group, we calculated intra-individual brain structural covariance networks (using the bilaterally-averaged values of 33 cortical surface areas, 33 cortical thickness values, and six subcortical volumes), in which edge weights were proportional to the similarity between two brain morphological features in terms of deviation from healthy controls (i.e. z-score transformed). Global networks were characterized using measures of network segregation (clustering and modularity), network integration (global efficiency), and their balance (small-worldness), and their community membership was assessed. Hub profiling of regional networks was undertaken using measures of betweenness, closeness, and eigenvector centrality. Individually calculated network measures were integrated across the 37 datasets using a meta-analytical approach. These network measures were summated across the network density range of K = 0.10-0.25 per participant, and were integrated across the 37 datasets using a meta-analytical approach. Compared with healthy controls, at a global level, the structural covariance networks of OCD showed lower clustering (P < 0.0001), lower modularity (P < 0.0001), and lower small-worldness (P = 0.017). Detection of community membership emphasized lower network segregation in OCD compared to healthy controls. At the regional level, there were lower (rank-transformed) centrality values in OCD for volume of caudate nucleus and thalamus, and surface area of paracentral cortex, indicative of altered distribution of brain hubs. Centrality of cingulate and orbito-frontal as well as other brain areas was associated with OCD illness duration, suggesting greater involvement of these brain areas with illness chronicity. In summary, the findings of this study, the largest brain structural covariance study of OCD to date, point to a less segregated organization of structural covariance networks in OCD, and reorganization of brain hubs. The segregation findings suggest a possible signature of altered brain morphometry in OCD, while the hub findings point to OCD-related alterations in trajectories of brain development and maturation, particularly in cingulate and orbitofrontal regions
Code Reviewer Recommendation Based on a Hypergraph with Multiplex Relationships
Code review is an essential component of software development, playing a
vital role in ensuring a comprehensive check of code changes. However, the
continuous influx of pull requests and the limited pool of available reviewer
candidates pose a significant challenge to the review process, making the task
of assigning suitable reviewers to each review request increasingly difficult.
To tackle this issue, we present MIRRec, a novel code reviewer recommendation
method that leverages a hypergraph with multiplex relationships. MIRRec encodes
high-order correlations that go beyond traditional pairwise connections using
degree-free hyperedges among pull requests and developers. This way, it can
capture high-order implicit connectivity and identify potential reviewers. To
validate the effectiveness of MIRRec, we conducted experiments using a dataset
comprising 48,374 pull requests from ten popular open-source software projects
hosted on GitHub. The experiment results demonstrate that MIRRec, especially
without PR-Review Commenters relationship, outperforms existing stateof-the-art
code reviewer recommendation methods in terms of ACC and MRR, highlighting its
significance in improving the code review process.Comment: The 31st IEEE International Conference on Software Analysis,
Evolution, and Reengineering (SANER
- …
