122 research outputs found
Low-mass dark matter search results from full exposure of PandaX-I experiment
We report the results of a weakly-interacting massive particle (WIMP) dark
matter search using the full 80.1\;live-day exposure of the first stage of the
PandaX experiment (PandaX-I) located in the China Jin-Ping Underground
Laboratory. The PandaX-I detector has been optimized for detecting low-mass
WIMPs, achieving a photon detection efficiency of 9.6\%. With a fiducial liquid
xenon target mass of 54.0\,kg, no significant excess event were found above the
expected background. A profile likelihood analysis confirms our earlier finding
that the PandaX-I data disfavor all positive low-mass WIMP signals reported in
the literature under standard assumptions. A stringent bound on the low mass
WIMP is set at WIMP mass below 10\,GeV/c, demonstrating that liquid xenon
detectors can be competitive for low-mass WIMP searches.Comment: v3 as accepted by PRD. Minor update in the text in response to
referee comments. Separating Fig. 11(a) and (b) into Fig. 11 and Fig. 12.
Legend tweak in Fig. 9(b) and 9(c) as suggested by referee, as well as a
missing legend for CRESST-II legend in Fig. 12 (now Fig. 13). Same version as
submitted to PR
Distribution-Aware Data Expansion with Diffusion Models
The scale and quality of a dataset significantly impact the performance of
deep models. However, acquiring large-scale annotated datasets is both a costly
and time-consuming endeavor. To address this challenge, dataset expansion
technologies aim to automatically augment datasets, unlocking the full
potential of deep models. Current data expansion techniques include image
transformation and image synthesis methods. Transformation-based methods
introduce only local variations, leading to limited diversity. In contrast,
synthesis-based methods generate entirely new content, greatly enhancing
informativeness. However, existing synthesis methods carry the risk of
distribution deviations, potentially degrading model performance with
out-of-distribution samples. In this paper, we propose DistDiff, a
training-free data expansion framework based on the distribution-aware
diffusion model. DistDiff constructs hierarchical prototypes to approximate the
real data distribution, optimizing latent data points within diffusion models
with hierarchical energy guidance. We demonstrate its capability to generate
distribution-consistent samples, significantly improving data expansion tasks.
DistDiff consistently enhances accuracy across a diverse range of datasets
compared to models trained solely on original data. Furthermore, our approach
consistently outperforms existing synthesis-based techniques and demonstrates
compatibility with widely adopted transformation-based augmentation methods.
Additionally, the expanded dataset exhibits robustness across various
architectural frameworks. Our code is available at
https://github.com/haoweiz23/DistDiffComment: Project: https://github.com/haoweiz23/DistDif
Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy
Tabular data optimization methods aim to automatically find an optimal
feature transformation process that generates high-value features and improves
the performance of downstream machine learning tasks. Current frameworks for
automated feature transformation rely on iterative sequence generation tasks,
optimizing decision strategies through performance feedback from downstream
tasks. However, these approaches fail to effectively utilize historical
decision-making experiences and overlook potential relationships among
generated features, thus limiting the depth of knowledge extraction. Moreover,
the granularity of the decision-making process lacks dynamic backtracking
capabilities for individual features, leading to insufficient adaptability when
encountering inefficient pathways, adversely affecting overall robustness and
exploration efficiency. To address the limitations observed in current
automatic feature engineering frameworks, we introduce a novel method that
utilizes a feature-state transformation graph to effectively preserve the
entire feature transformation journey, where each node represents a specific
transformation state. During exploration, three cascading agents iteratively
select nodes and idea mathematical operations to generate new transformation
states. This strategy leverages the inherent properties of the graph structure,
allowing for the preservation and reuse of valuable transformations. It also
enables backtracking capabilities through graph pruning techniques, which can
rectify inefficient transformation paths. To validate the efficacy and
flexibility of our approach, we conducted comprehensive experiments and
detailed case studies, demonstrating superior performance in diverse scenarios.Comment: 17 page
Spectrum resource allocation for high-throughput satellite communications based on behavior cloning
In high-throughput multi-beam satellite systems, the dimensionality of the spectrum resource allocation problem increased drastically with the number of satellite beams and service users, which caused an exponential rise in the complexity of the solution. To address the challenge, a two-stage algorithm that combined behavior cloning (BC) with deep reinforcement learning (DRL) was proposed. In the first stage, the strategy network was pretrained using existing decision data from satellite operation through behavior cloning, which mimicked expert behavior to reduce blind exploration and accelerate algorithm convergence. In the second stage, the strategy network was further optimized using the proximal policy optimization (PPO), and a convolutional block attention module (CBAM) was employed to better extract the user traffic features, thereby enhancing overall algorithm performance. Simulation results demonstrate that the proposed algorithm outperforms the benchmark algorithms in terms of convergence speed and algorithm stability, and also delivers superior performance in system delay, average system satisfaction, and spectrum efficiency
Metabolomics Analysis of Colostrum and Mature Milk from Saanen Goats
The differences in metabolites and related metabolism pathways in colostrum and mature milk from Saanen goats at different lactation stages were explored by untargeted metabolomics based on ultra-high performance liquid chromatography-quadrupole electrostatic field orbitrap mass spectrometry (UPLC-QE-orbitrap-MS). The results showed that a total of 118 differential metabolites were found between colostrum and mature milk, among which 62 had higher relative contents in colostrum than in mature milk and 56 had lower relative contents in colostrum than in mature milk. These metabolites were mainly lipids, amino acids, and nucleosides. Nine key metabolic pathways most associated with these metabolites were selected, which jointly regulated the lactation process of Saanen goats, and the citric acid cycle could act as a bridge connecting other metabolic pathways. The number of differential metabolites involved in these metabolic pathways was 12. The differential metabolites with relatively high contents in colostrum were taurine, hypotaurine, taurocholic acid, L-phenylalanine, L-tyrosine, succinic acid, isocitrate, D-maltose, α-lactose, 4-hydroxyphenylpyruvate and glycine. The differential metabolite with relatively high contents in mature milk was N1-methyl-4-pyridone-3-carboxamide. They could be used as potential marker metabolites in the colostrum and mature milk of Saanen goats. Metabolomics technology can also be used for identifying differential metabolites in milk from other dairy species at different lactation stages
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model
characterized by economical training and efficient inference. It comprises 236B
total parameters, of which 21B are activated for each token, and supports a
context length of 128K tokens. DeepSeek-V2 adopts innovative architectures
including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees
efficient inference through significantly compressing the Key-Value (KV) cache
into a latent vector, while DeepSeekMoE enables training strong models at an
economical cost through sparse computation. Compared with DeepSeek 67B,
DeepSeek-V2 achieves significantly stronger performance, and meanwhile saves
42.5% of training costs, reduces the KV cache by 93.3%, and boosts the maximum
generation throughput to 5.76 times. We pretrain DeepSeek-V2 on a high-quality
and multi-source corpus consisting of 8.1T tokens, and further perform
Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) to fully unlock
its potential. Evaluation results show that, even with only 21B activated
parameters, DeepSeek-V2 and its chat versions still achieve top-tier
performance among open-source models
Global burden and strength of evidence for 88 risk factors in 204 countries and 811 subnational locations, 1990–2021: a systematic analysis for the Global Burden of Disease Study 2021
Background: Understanding the health consequences associated with exposure to risk factors is necessary to inform public health policy and practice. To systematically quantify the contributions of risk factor exposures to specific health outcomes, the Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) 2021 aims to provide comprehensive estimates of exposure levels, relative health risks, and attributable burden of disease for 88 risk factors in 204 countries and territories and 811 subnational locations, from 1990 to 2021. Methods: The GBD 2021 risk factor analysis used data from 54 561 total distinct sources to produce epidemiological estimates for 88 risk factors and their associated health outcomes for a total of 631 risk–outcome pairs. Pairs were included on the basis of data-driven determination of a risk–outcome association. Age-sex-location-year-specific estimates were generated at global, regional, and national levels. Our approach followed the comparative risk assessment framework predicated on a causal web of hierarchically organised, potentially combinative, modifiable risks. Relative risks (RRs) of a given outcome occurring as a function of risk factor exposure were estimated separately for each risk–outcome pair, and summary exposure values (SEVs), representing risk-weighted exposure prevalence, and theoretical minimum risk exposure levels (TMRELs) were estimated for each risk factor. These estimates were used to calculate the population attributable fraction (PAF; ie, the proportional change in health risk that would occur if exposure to a risk factor were reduced to the TMREL). The product of PAFs and disease burden associated with a given outcome, measured in disability-adjusted life-years (DALYs), yielded measures of attributable burden (ie, the proportion of total disease burden attributable to a particular risk factor or combination of risk factors). Adjustments for mediation were applied to account for relationships involving risk factors that act indirectly on outcomes via intermediate risks. Attributable burden estimates were stratified by Socio-demographic Index (SDI) quintile and presented as counts, age-standardised rates, and rankings. To complement estimates of RR and attributable burden, newly developed burden of proof risk function (BPRF) methods were applied to yield supplementary, conservative interpretations of risk–outcome associations based on the consistency of underlying evidence, accounting for unexplained heterogeneity between input data from different studies. Estimates reported represent the mean value across 500 draws from the estimate's distribution, with 95% uncertainty intervals (UIs) calculated as the 2·5th and 97·5th percentile values across the draws. Findings: Among the specific risk factors analysed for this study, particulate matter air pollution was the leading contributor to the global disease burden in 2021, contributing 8·0% (95% UI 6·7–9·4) of total DALYs, followed by high systolic blood pressure (SBP; 7·8% [6·4–9·2]), smoking (5·7% [4·7–6·8]), low birthweight and short gestation (5·6% [4·8–6·3]), and high fasting plasma glucose (FPG; 5·4% [4·8–6·0]). For younger demographics (ie, those aged 0–4 years and 5–14 years), risks such as low birthweight and short gestation and unsafe water, sanitation, and handwashing (WaSH) were among the leading risk factors, while for older age groups, metabolic risks such as high SBP, high body-mass index (BMI), high FPG, and high LDL cholesterol had a greater impact. From 2000 to 2021, there was an observable shift in global health challenges, marked by a decline in the number of all-age DALYs broadly attributable to behavioural risks (decrease of 20·7% [13·9–27·7]) and environmental and occupational risks (decrease of 22·0% [15·5–28·8]), coupled with a 49·4% (42·3–56·9) increase in DALYs attributable to metabolic risks, all reflecting ageing populations and changing lifestyles on a global scale. Age-standardised global DALY rates attributable to high BMI and high FPG rose considerably (15·7% [9·9–21·7] for high BMI and 7·9% [3·3–12·9] for high FPG) over this period, with exposure to these risks increasing annually at rates of 1·8% (1·6–1·9) for high BMI and 1·3% (1·1–1·5) for high FPG. By contrast, the global risk-attributable burden and exposure to many other risk factors declined, notably for risks such as child growth failure and unsafe water source, with age-standardised attributable DALYs decreasing by 71·5% (64·4–78·8) for child growth failure and 66·3% (60·2–72·0) for unsafe water source. We separated risk factors into three groups according to trajectory over time: those with a decreasing attributable burden, due largely to declining risk exposure (eg, diet high in trans-fat and household air pollution) but also to proportionally smaller child and youth populations (eg, child and maternal malnutrition); those for which the burden increased moderately in spite of declining risk exposure, due largely to population ageing (eg, smoking); and those for which the burden increased considerably due to both increasing risk exposure and population ageing (eg, ambient particulate matter air pollution, high BMI, high FPG, and high SBP). Interpretation: Substantial progress has been made in reducing the global disease burden attributable to a range of risk factors, particularly those related to maternal and child health, WaSH, and household air pollution. Maintaining efforts to minimise the impact of these risk factors, especially in low SDI locations, is necessary to sustain progress. Successes in moderating the smoking-related burden by reducing risk exposure highlight the need to advance policies that reduce exposure to other leading risk factors such as ambient particulate matter air pollution and high SBP. Troubling increases in high FPG, high BMI, and other risk factors related to obesity and metabolic syndrome indicate an urgent need to identify and implement interventions
- …
