622 research outputs found
A Transmissive X-ray Polarimeter Design For Hard X-ray Focusing Telescopes
The X-ray Timing and Polarization (XTP) is a mission concept for a future
space borne X-ray observatory and is currently selected for early phase study.
We present a new design of X-ray polarimeter based on the time projection gas
chamber. The polarimeter, placed above the focal plane, has an additional rear
window that allows hard X-rays to penetrate (a transmission of nearly 80% at 6
keV) through it and reach the detector on the focal plane. Such a design is to
compensate the low detection efficiency of gas detectors, at a low cost of
sensitivity, and can maximize the science return of multilayer hard X-ray
telescopes without the risk of moving focal plane instruments. The sensitivity
in terms of minimum detectable polarization, based on current instrument
configuration, is expected to be 3% for a 1mCrab source given an observing time
of 10^5 s. We present preliminary test results, including photoelectron tracks
and modulation curves, using a test chamber and polarized X-ray sources in the
lab
In-Orbit Instrument Performance Study and Calibration for POLAR Polarization Measurements
POLAR is a compact space-borne detector designed to perform reliable
measurements of the polarization for transient sources like Gamma-Ray Bursts in
the energy range 50-500keV. The instrument works based on the Compton
Scattering principle with the plastic scintillators as the main detection
material along with the multi-anode photomultiplier tube. POLAR has been
launched successfully onboard the Chinese space laboratory TG-2 on 15th
September, 2016. In order to reliably reconstruct the polarization information
a highly detailed understanding of the instrument is required for both data
analysis and Monte Carlo studies. For this purpose a full study of the in-orbit
performance was performed in order to obtain the instrument calibration
parameters such as noise, pedestal, gain nonlinearity of the electronics,
threshold, crosstalk and gain, as well as the effect of temperature on the
above parameters. Furthermore the relationship between gain and high voltage of
the multi-anode photomultiplier tube has been studied and the errors on all
measurement values are presented. Finally the typical systematic error on
polarization measurements of Gamma-Ray Bursts due to the measurement error of
the calibration parameters are estimated using Monte Carlo simulations.Comment: 43 pages, 30 figures, 1 table; Preprint accepted by NIM
Influence of the Earth on the background and the sensitivity of the GRM and ECLAIRs instruments aboard the Chinese-French mission SVOM
SVOM (Space-based multi-band astronomical Variable Object Monitor) is a
future Chinese-French satellite mission which is dedicated to Gamma-Ray Burst
(GRB) studies. Its anti-solar pointing strategy makes the Earth cross the field
of view of its payload every orbit. In this paper, we present the variations of
the gamma-ray background of the two high energy instruments aboard SVOM, the
Gamma-Ray Monitor (GRM) and ECLAIRs, as a function of the Earth position. We
conclude with an estimate of the Earth influence on their sensitivity and their
GRB detection capability.Comment: 24 pages, 15 figures, accepted for publication in Experimental
Astronom
Efficient and Economic Large Language Model Inference with Attention Offloading
Transformer-based large language models (LLMs) exhibit impressive performance
in generative tasks but introduce significant challenges in real-world serving
due to inefficient use of the expensive, computation-optimized accelerators.
This mismatch arises from the autoregressive nature of LLMs, where the
generation phase comprises operators with varying resource demands.
Specifically, the attention operator is memory-intensive, exhibiting a memory
access pattern that clashes with the strengths of modern accelerators,
especially as context length increases. To enhance the efficiency and
cost-effectiveness of LLM serving, we introduce the concept of attention
offloading. This approach leverages a collection of cheap, memory-optimized
devices for the attention operator while still utilizing high-end accelerators
for other parts of the model. This heterogeneous setup ensures that each
component is tailored to its specific workload, maximizing overall performance
and cost efficiency. Our comprehensive analysis and experiments confirm the
viability of splitting the attention computation over multiple devices. Also,
the communication bandwidth required between heterogeneous devices proves to be
manageable with prevalent networking technologies. To further validate our
theory, we develop Lamina, an LLM inference system that incorporates attention
offloading. Experimental results indicate that Lamina can provide 1.48x-12.1x
higher estimated throughput per dollar than homogeneous solutions
- …
