Profiling of OCR'ed Historical Texts Revisited
In the absence of ground truth it is not possible to automatically determine
the exact spectrum and occurrences of OCR errors in an OCR'ed text. Yet, for
interactive postcorrection of OCR'ed historical printings it is extremely
useful to have a statistical profile available that provides an estimate of
error classes with associated frequencies, and that points to conjectured
errors and suspicious tokens. The method introduced in Reffle (2013) computes
such a profile, combining lexica, pattern sets and advanced matching techniques
in a specialized Expectation Maximization (EM) procedure. Here we improve this
method in three respects: First, the method in Reffle (2013) is not adaptive:
user feedback obtained by actual postcorrection steps cannot be used to compute
refined profiles. We introduce a variant of the method that is open for
adaptivity, taking correction steps of the user into account. This leads to
higher precision with respect to recognition of erroneous OCR tokens. Second,
during postcorrection often new historical patterns are found. We show that
adding new historical patterns to the linguistic background resources leads to
a second kind of improvement, enabling even higher precision by telling
historical spellings apart from OCR errors. Third, the method in Reffle (2013)
does not make any active use of tokens that cannot be interpreted in the
underlying channel model. We show that adding these uninterpretable tokens to
the set of conjectured errors leads to a significant improvement of the recall
for error detection, at the same time improving precision.
Direct observation of Levy flight of holes in bulk n-InP
We study the photoluminescence spectra excited at an edge side of n-InP slabs
and observed from the broadside. In a moderately doped sample the intensity
drops off as a power-law function of the distance from the excitation - up to
several millimeters - with no change in the spectral shape. The hole
distribution is described by a stationary Levy-flight process over more than
two orders of magnitude in both the distance and hole concentration. For
heavily-doped samples, the power law is truncated by free-carrier absorption.
Our experiments are near-perfectly described by the Biberman-Holstein transport
equation with parameters found from independent optical experiments.
Comment: 4 pages, 3 figure
Effects of the stellar wind on X-ray spectra of Cygnus X-3
We study X-ray spectra of Cyg X-3 from BeppoSAX, taking into account
absorption and emission in the strong stellar wind of its companion. We find
the intrinsic X-ray spectra are well modelled by disc blackbody emission, its
upscattering by hot electrons with a hybrid distribution, and by Compton
reflection. These spectra are strongly modified by absorption and reprocessing
in the stellar wind, which we model using the photoionization code cloudy. The
form of the observed spectra implies the wind is composed of two phases. A hot
tenuous plasma containing most of the wind mass is required to account for the
observed features of very strongly ionized Fe. Small dense cool clumps filling
<0.01 of the volume are required to absorb the soft X-ray excess, which is
emitted by the hot phase but not present in the data. The total mass-loss rate
is found to be (0.6--1.6) x 10^-5 solar masses per year. We also discuss the
feasibility of the continuum model dominated by Compton reflection, which we
find to best describe our data. The intrinsic luminosities of our models
suggest that the compact object is a black hole.
Comment: MNRAS, in pres
Far-UV Spectroscopic Analyses of Four Central Stars of Planetary Nebulae
We analyze the Far-UV/UV spectra of four central stars of planetary nebulae
with strong wind features -- NGC 2371, Abell 78, IC 4776 and NGC 1535, and
derive their photospheric and wind parameters by modeling high-resolution FUSE
(Far-Ultraviolet Spectroscopic Explorer) data in the Far-UV and HST-STIS and
IUE data in the UV with spherical non-LTE line-blanketed model atmospheres.
Abell 78 is a hydrogen-deficient transitional [WR]-PG 1159 object, and we find
NGC 2371 to be in the same stage, both migrating from the constant-luminosity
phase to the white dwarf cooling sequence with Teff ~= 120 kK, Mdot ~= 5x10^-8
Msun/yr. NGC 1535 is a ``hydrogen-rich'' O(H) CSPN, and the exact nature of IC
4776 is ambiguous, although it appears to be helium burning. Both objects lie
on the constant-luminosity branch of post-AGB evolution and have Teff ~= 65 kK,
Mdot ~= 1x10^-8 Msun/yr. Thus, both the H-rich and H-deficient channels of PN
evolution are represented in our sample. We also investigate the effects of
including higher ionization stages of iron (up to FeX) in the model atmosphere
calculations of these hot objects (usually neglected in previous analyses), and
find iron to be a useful diagnostic of the stellar parameters in some cases.
The Far-UV spectra of all four objects show evidence of hot (T ~ 300 K)
molecular hydrogen in their circumstellar environments.
Comment: 38 pages, 8 figures (6 color). Accepted for publication in Ap
Atmospheric NLTE-Models for the Spectroscopic Analysis of Blue Stars with Winds. II. Line-Blanketed Models
We present new or improved methods for calculating NLTE, line-blanketed model
atmospheres for hot stars with winds (spectral types A to O), with particular
emphasis on fast performance. These methods have been implemented in a
previous, simpler version of the model atmosphere code FASTWIND
(Santolaya-Rey et al. 1997) and allow the spectroscopic analysis of rather large
samples of massive stars on a reasonable time-scale, using state-of-the-art
physics.
We describe our (partly approximate) approach to solving the equations of
statistical equilibrium for those elements which are primarily responsible for
line-blocking and blanketing, as well as an approximate treatment of the
line-blocking itself, which is based on a simple statistical approach using
suitable means for line opacities and emissivities. Furthermore, we comment on
our implementation of a consistent temperature structure.
In the second part, we concentrate on a detailed comparison with results from
the two codes that have been used in alternative spectroscopic
investigations, namely CMFGEN (Hillier & Miller 1998) and WM-Basic (Pauldrach
et al. 2001). All three codes predict almost identical temperature structures
and fluxes for lambda > 400 A, whereas at lower wavelengths a number of
discrepancies are found. Optical H/He lines as synthesized by FASTWIND are
compared with results from CMFGEN, showing remarkable agreement, except
for the HeI singlets in the temperature range between 36,000 and 41,000 K for
dwarfs and between 31,000 and 35,000 K for supergiants, where CMFGEN predicts
much weaker lines. The consequences of these discrepancies are discussed.
Comment: 30 pages incl. 20 figures, accepted by A&
Optimizing the Training of Models for Automated Post-Correction of Arbitrary OCR-ed Historical Texts
Systems for post-correction of OCR results for historical texts are based on statistical correction models obtained by supervised learning. For training, suitable collections of ground truth materials are needed. In this paper we investigate the dependency of the power of automated OCR post-correction on the form of ground truth data and other training settings used for the computation of a post-correction model. The post-correction system A-PoCoTo considered here is based on a profiler service that computes a statistical profile for an OCR-ed input text. We also look in detail at the influence of the profiler resources and other settings selected for training and evaluation. As a practical result of several fine-tuning steps, a general post-correction model is achieved where experiments for a large and heterogeneous collection of OCR-ed historical texts show a consistent improvement of base OCR accuracy. The results presented are meant to provide insights for libraries that want to apply OCR post-correction to a larger spectrum of distinct OCR-ed historical printings and ask for "representative" results.
Mass-loss rates of Very Massive Stars
We discuss the basic physics of hot-star winds and we provide mass-loss rates
for (very) massive stars. Whilst the emphasis is on theoretical concepts and
line-force modelling, we also discuss the current state of observations and
empirical modelling, and address the issue of wind clumping.
Comment: 36 pages, 15 figures, Book Chapter in "Very Massive Stars in the
Local Universe", Springer, Ed. Jorick S. Vin
Chandra spectroscopy of the hot star beta Crucis and the discovery of a pre-main-sequence companion
In order to test the O star wind-shock scenario for X-ray production in less
luminous stars with weaker winds, we made a pointed 74 ks observation of the
nearby early B giant, beta Cru (B0.5 III), with the Chandra HETGS. We find that
the X-ray spectrum is quite soft, with a dominant thermal component near 3
million K, and that the emission lines are resolved but quite narrow, with
half-widths of 150 km/s. The forbidden-to-intercombination line ratios of Ne IX
and Mg XI indicate that the hot plasma is distributed in the wind, rather than
confined near the photosphere. It is difficult to understand the X-ray data in
the context of the standard wind-shock paradigm for OB stars, primarily because
of the narrow lines, but also because of the high X-ray production efficiency.
A scenario in which the bulk of the outer wind is shock heated is broadly
consistent with the data, but not very well motivated theoretically. It is
possible that magnetic channeling could explain the X-ray properties, although
no field has been detected on beta Cru. We detected periodic variability in the
hard (hnu > 1 keV) X-rays, modulated on the known optical period of 4.58 hours,
which is the period of the primary beta Cep pulsation mode for this star. We
also have detected, for the first time, an apparent companion to beta Cru at a
projected separation of 4 arcsec. This companion was likely never seen in
optical images because of the presumed very high contrast between it and beta
Cru in the optical. However, the brightness contrast in the X-ray is only 3:1,
which is consistent with the companion being an X-ray active low-mass
pre-main-sequence star. The companion's X-ray spectrum is relatively hard and
variable, as would be expected from a post T Tauri star.
Comment: Accepted for publication in MNRAS; 19 pages, 15 figures, some in
color; version with higher-resolution figures available at
http://astro.swarthmore.edu/~cohen/papers/bcru_mnras2008.pd
