76 research outputs found
Profiling of OCR'ed Historical Texts Revisited
In the absence of ground truth it is not possible to automatically determine
the exact spectrum and occurrences of OCR errors in an OCR'ed text. Yet, for
interactive postcorrection of OCR'ed historical printings it is extremely
useful to have a statistical profile available that provides an estimate of
error classes with associated frequencies, and that points to conjectured
errors and suspicious tokens. The method introduced in Reffle (2013) computes
such a profile, combining lexica, pattern sets and advanced matching techniques
in a specialized Expectation Maximization (EM) procedure. Here we improve this
method in three respects: First, the method in Reffle (2013) is not adaptive:
user feedback obtained by actual postcorrection steps cannot be used to compute
refined profiles. We introduce a variant of the method that is open for
adaptivity, taking correction steps of the user into account. This leads to
higher precision with respect to recognition of erroneous OCR tokens. Second,
during postcorrection often new historical patterns are found. We show that
adding new historical patterns to the linguistic background resources leads to
a second kind of improvement, enabling even higher precision by telling
historical spellings apart from OCR errors. Third, the method in Reffle (2013)
does not make any active use of tokens that cannot be interpreted in the
underlying channel model. We show that adding these uninterpretable tokens to
the set of conjectured errors leads to a significant improvement of the recall
for error detection, at the same time improving precision
Direct observation of Levy flight of holes in bulk n-InP
We study the photoluminescence spectra excited at an edge side of n-InP slabs
and observed from the broadside. In a moderately doped sample the intensity
drops off as a power-law function of the distance from the excitation - up to
several millimeters - with no change in the spectral shape.The hole
distribution is described by a stationary Levy-flight process over more than
two orders of magnitude in both the distance and hole concentration. For
heavily-doped samples, the power law is truncated by free-carrier absorption.
Our experiments are near-perfectly described by the Biberman-Holstein transport
equation with parameters found from independent optical experiments.Comment: 4 pages, 3 figure
Effects of the stellar wind on X-ray spectra of Cygnus X-3
We study X-ray spectra of Cyg X-3 from BeppoSAX, taking into account
absorption and emission in the strong stellar wind of its companion. We find
the intrinsic X-ray spectra are well modelled by disc blackbody emission, its
upscattering by hot electrons with a hybrid distribution, and by Compton
reflection. These spectra are strongly modified by absorption and reprocessing
in the stellar wind, which we model using the photoionization code cloudy. The
form of the observed spectra implies the wind is composed of two phases. A hot
tenuous plasma containing most of the wind mass is required to account for the
observed features of very strongly ionized Fe. Small dense cool clumps filling
<0.01 of the volume are required to absorb the soft X-ray excess, which is
emitted by the hot phase but not present in the data. The total mass-loss rate
is found to be (0.6--1.6) x 10^-5 solar masses per year. We also discuss the
feasibility of the continuum model dominated by Compton reflection, which we
find to best describe our data. The intrinsic luminosities of our models
suggest that the compact object is a black hole.Comment: MNRAS, in pres
Central Stars of Planetary Nebulae in the Large Magellanic Cloud: A Far-UV Spectroscopic Analysis
We observed seven central stars of planetary nebulae (CSPN) in the Large
Magellanic Cloud (LMC) with the Far Ultraviolet Spectroscopic Explorer (FUSE),
and performed a model-based analysis of these spectra in conjunction with
Hubble Space Telescope (HST) spectra in the UV and optical range to determine
the stellar and nebular parameters. Most of the objects show wind features, and
they have effective temperatures ranging from 38 to 60 kK with mass-loss rates
of ~= 5x10^-8 Msun/yr. Five of the objects have typical LMC abundances. One
object (SMP LMC 61) is a [WC4] star, and we fit its spectra with He/C/O-rich
abundances typical of the [WC] class, and find its atmosphere to be
iron-deficient. Most objects have very hot (T ~> 2000 K) molecular hydrogen in
their nebulae, which may indicate a shocked environment. One of these (SMP LMC
62) also displays OVI 1032-38 nebular emission lines, rarely observed in PN.Comment: 53 pages, 15 figures (11 color). Accepted for publication in Ap
L\'evy flights of photons in hot atomic vapours
Properties of random and fluctuating systems are often studied through the
use of Gaussian distributions. However, in a number of situations, rare events
have drastic consequences, which can not be explained by Gaussian statistics.
Considerable efforts have thus been devoted to the study of non Gaussian
fluctuations such as L\'evy statistics, generalizing the standard description
of random walks. Unfortunately only macroscopic signatures, obtained by
averaging over many random steps, are usually observed in physical systems. We
present experimental results investigating the elementary process of anomalous
diffusion of photons in hot atomic vapours. We measure the step size
distribution of the random walk and show that it follows a power law
characteristic of L\'evy flights.Comment: This final version is identical to the one published in Nature
Physic
Far-UV Spectroscopic Analyses of Four Central Stars of Planetary Nebulae
We analyze the Far-UV/UV spectra of four central stars of planetary nebulae
with strong wind features -- NGC 2371, Abell 78, IC 4776 and NGC 1535, and
derive their photospheric and wind parameters by modeling high-resolution FUSE
(Far-Ultraviolet Spectroscopic Explorer) data in the Far-UV and HST-STIS and
IUE data in the UV with spherical non-LTE line-blanketed model atmospheres.
Abell 78 is a hydrogen-deficient transitional [WR]-PG 1159 object, and we find
NGC 2371 to be in the same stage, both migrating from the constant-luminosity
phase to the white dwarf cooling sequence with Teff ~= 120 kK, Mdot ~= 5x10^-8
Msun/yr. NGC 1535 is a ``hydrogen-rich'' O(H) CSPN, and the exact nature of IC
4776 is ambiguous, although it appears to be helium burning. Both objects lie
on the constant-luminosity branch of post-AGB evolution and have Teff ~= 65 kK,
Mdot ~= 1x10^-8 Msun/yr. Thus, both the H-rich and H-deficient channels of PN
evolution are represented in our sample. We also investigate the effects of
including higher ionization stages of iron (up to FeX) in the model atmosphere
calculations of these hot objects (usually neglected in previous analyses), and
find iron to be a useful diagnostic of the stellar parameters in some cases.
The Far-UV spectra of all four objects show evidence of hot (T ~ 300 K)
molecular hydrogen in their circumstellar environments.Comment: 38 pages, 8 figures (6 color). Accepted for publication in Ap
Atmospheric NLTE-Models for the Spectroscopic Analysis of Blue Stars with Winds. II. Line-Blanketed Models
We present new or improved methods for calculating NLTE, line-blanketed model
atmospheres for hot stars with winds (spectral types A to O), with particular
emphasis on a fast performance. These methods have been implemented into a
previous, more simple version of the model atmosphere code FASTWIND
(Santolaya-Rey et al.1997) and allow to spectroscopically analyze rather large
samples of massive stars in a reasonable time-scale, using state-of-the-art
physics.
We describe our (partly approximate) approach to solve the equations of
statistical equilibrium for those elements which are primarily responsible for
line-blocking and blanketing, as well as an approximate treatment of the
line-blocking itself, which is based on a simple statistical approach using
suitable means for line opacities and emissivities. Furthermore, we comment on
our implementation of a consistent temperature structure.
In the second part, we concentrate on a detailed comparison with results from
those two codes which have been used in alternative spectroscopical
investigations, namely CMFGEN (Hillier & Miller 1998) and WM-Basic (Pauldrach
et al. 2001). All three codes predict almost identical temperature structures
and fluxes for lambda > 400 A, whereas at lower wavelengths a number of
discrepancies are found. Optical H/He lines as synthesized by FASTWIND are
compared with results from CMFGEN, obtaining a remarkable coincidence, except
for the HeI singlets in the temperature range between 36,000 to 41,000 K for
dwarfs and between 31,000 to 35,000 K for supergiants, where CMFGEN predicts
much weaker lines. Consequences due to these discrepancies are discussed.Comment: 30 pages incl. 20 figures, accepted by A&
Optimizing the Training of Models for Automated Post-Correction of Arbitrary OCR-ed Historical Texts
Systems for post-correction of OCR-results for historical texts are based on statistical correction models obtained by supervised learning. For training, suitable collections of ground truth materials are needed. In this paper we investigate the dependency of the power of automated OCR post-correction on the form of ground truth data and other training settings used for the computation of a post-correction model. The post-correction system A-PoCoTo considered here is based on a profiler service that computes a statistical profile for an OCR-ed input text. We also look in detail at the influence of the profiler resources and other settings selected for training and evaluation. As a practical result of several fine-tuning steps, a general post-correction model is achieved where experiments for a large and heterogeneous collection of OCR-ed historical texts show a consistent improvement of base OCR accuracy. The results presented are meant to provide insights for libraries that want to apply OCR post-correction to a larger spectrum of distinct OCR-ed historical printings and ask for "representative" results
Mass-loss rates of Very Massive Stars
We discuss the basic physics of hot-star winds and we provide mass-loss rates
for (very) massive stars. Whilst the emphasis is on theoretical concepts and
line-force modelling, we also discuss the current state of observations and
empirical modelling, and address the issue of wind clumping.Comment: 36 pages, 15 figures, Book Chapter in "Very Massive Stars in the
Local Universe", Springer, Ed. Jorick S. Vin
- …
