Search CORE

74 research outputs found

Profiling of OCR'ed Historical Texts Revisited

Author: Fink Florian
Schulz Klaus-U.
Springmann Uwe
Publication venue
Publication date: 19/01/2017
Field of study

In the absence of ground truth it is not possible to automatically determine the exact spectrum and occurrences of OCR errors in an OCR'ed text. Yet, for interactive postcorrection of OCR'ed historical printings it is extremely useful to have a statistical profile available that provides an estimate of error classes with associated frequencies, and that points to conjectured errors and suspicious tokens. The method introduced in Reffle (2013) computes such a profile, combining lexica, pattern sets and advanced matching techniques in a specialized Expectation Maximization (EM) procedure. Here we improve this method in three respects: First, the method in Reffle (2013) is not adaptive: user feedback obtained by actual postcorrection steps cannot be used to compute refined profiles. We introduce a variant of the method that is open for adaptivity, taking correction steps of the user into account. This leads to higher precision with respect to recognition of erroneous OCR tokens. Second, during postcorrection often new historical patterns are found. We show that adding new historical patterns to the linguistic background resources leads to a second kind of improvement, enabling even higher precision by telling historical spellings apart from OCR errors. Third, the method in Reffle (2013) does not make any active use of tokens that cannot be interpreted in the underlying channel model. We show that adding these uninterpretable tokens to the set of conjectured errors leads to a significant improvement of the recall for error detection, at the same time improving precision

arXiv.org e-Print Archive

Crossref

Direct observation of Levy flight of holes in bulk n-InP

Author: A. F. Molisch
Arsen Subashiev
E. F. Schubert
G. Rybicki
K. Seeger
L. M. Biberman
L. M. Biberman
Oleg Semyonov
P. Lévy
S. Luryi
Serge Luryi
U. Springmann
V. F. Gantmakher
V. V. Ivanov
Zhichao Chen
Publication venue: 'American Physical Society (APS)'
Publication date: 22/05/2012
Field of study

We study the photoluminescence spectra excited at an edge side of n-InP slabs and observed from the broadside. In a moderately doped sample the intensity drops off as a power-law function of the distance from the excitation - up to several millimeters - with no change in the spectral shape.The hole distribution is described by a stationary Levy-flight process over more than two orders of magnitude in both the distance and hole concentration. For heavily-doped samples, the power law is truncated by free-carrier absorption. Our experiments are near-perfectly described by the Biberman-Holstein transport equation with parameters found from independent optical experiments.Comment: 4 pages, 3 figure

arXiv.org e-Print Archive

Crossref

Effects of the stellar wind on X-ray spectra of Cygnus X-3

Author: Andrzej A. Zdziarski
Anna Szostek
Bevington P. R.
Hamann W. R.
Hamann W. R.
Hillier D. J.
Jaroszynski M.
Kitamoto S.
Langer N.
Lauque R.
Magdziarz P.
Matt G.
Mitra A.
Morris P. W.
Nakamura H.
Nandra K.
Nugis T.
Predehl P.
Shakura N. I.
Springmann U.
Tanaka Y.
van Kerkwijk M. H.
van Kerkwijk M. H.
Wright A. E.
Zycki P. T.
Publication venue: 'Wiley'
Publication date: 04/02/2008
Field of study

We study X-ray spectra of Cyg X-3 from BeppoSAX, taking into account absorption and emission in the strong stellar wind of its companion. We find the intrinsic X-ray spectra are well modelled by disc blackbody emission, its upscattering by hot electrons with a hybrid distribution, and by Compton reflection. These spectra are strongly modified by absorption and reprocessing in the stellar wind, which we model using the photoionization code cloudy. The form of the observed spectra implies the wind is composed of two phases. A hot tenuous plasma containing most of the wind mass is required to account for the observed features of very strongly ionized Fe. Small dense cool clumps filling <0.01 of the volume are required to absorb the soft X-ray excess, which is emitted by the hot phase but not present in the data. The total mass-loss rate is found to be (0.6--1.6) x 10^-5 solar masses per year. We also discuss the feasibility of the continuum model dominated by Compton reflection, which we find to best describe our data. The intrinsic luminosities of our models suggest that the compact object is a black hole.Comment: MNRAS, in pres

arXiv.org e-Print Archive

Crossref

Far-UV Spectroscopic Analyses of Four Central Stars of Planetary Nebulae

Author: Becker S. R.
Bianchi L.
Cahn J. H.
Gesicki K.
Gorny S. K.
Hamann W.-R.
Hamann W.-R.
Hummer D. G.
J. E. Herald
Kaler J. B.
Koesterke L.
Kudritzki R.-P.
L. Bianchi
Medina S.
Mendez R. H.
Napiwotzki R.
Nussbaumer H.
Nussbaumer H.
Parthasarathy M.
Phillips J. P.
Pradhan A. K.
Sabbadin F.
Sabbadin F.
Schmutz W.
Springmann U.
Stanghellini L.
Tylenda R.
Tylenda R.
Weinberger R.
Werner K.
Publication venue: 'University of Chicago Press'
Publication date: 01/01/2004
Field of study

We analyze the Far-UV/UV spectra of four central stars of planetary nebulae with strong wind features -- NGC 2371, Abell 78, IC 4776 and NGC 1535, and derive their photospheric and wind parameters by modeling high-resolution FUSE (Far-Ultraviolet Spectroscopic Explorer) data in the Far-UV and HST-STIS and IUE data in the UV with spherical non-LTE line-blanketed model atmospheres. Abell 78 is a hydrogen-deficient transitional [WR]-PG 1159 object, and we find NGC 2371 to be in the same stage, both migrating from the constant-luminosity phase to the white dwarf cooling sequence with Teff ~= 120 kK, Mdot ~= 5x10^-8 Msun/yr. NGC 1535 is a ``hydrogen-rich'' O(H) CSPN, and the exact nature of IC 4776 is ambiguous, although it appears to be helium burning. Both objects lie on the constant-luminosity branch of post-AGB evolution and have Teff ~= 65 kK, Mdot ~= 1x10^-8 Msun/yr. Thus, both the H-rich and H-deficient channels of PN evolution are represented in our sample. We also investigate the effects of including higher ionization stages of iron (up to FeX) in the model atmosphere calculations of these hot objects (usually neglected in previous analyses), and find iron to be a useful diagnostic of the stellar parameters in some cases. The Far-UV spectra of all four objects show evidence of hot (T ~ 300 K) molecular hydrogen in their circumstellar environments.Comment: 38 pages, 8 figures (6 color). Accepted for publication in Ap

arXiv.org e-Print Archive

CiteSeerX

Crossref

Atmospheric NLTE-Models for the Spectroscopic Analysis of Blue Stars with Winds. II. Line-Blanketed Models

Author: A. Jokuthy
Abbott
Abbott
Aufdenberg
Bianchi
Bouret
Castor
Crowther
de Koter
de Koter
Drew
Drew
Feldmeier
Fullerton
Gabler
Garcia
Grevesse
Gräfener
Hamann
Hauschildt
Herrero
Hillier
Hillier
Hillier
Hubeny
Hummer
J. Puls
Kramer
Kubát
Kudritzki
Lenorzer
Lucy
Lucy
M. A. Urbaneja
M. R. Mokiem
Martins
Martins
Massey
Mazzali
Mazzali
Mihalas
Oskinova
Pauldrach
Pauldrach
Pauldrach
Przybilla
Puls
Puls
Puls
Puls
R. Venero
Repolust
Rybicki
Santolaya-Rey
Schaerer
Schaerer
Schaerer
Schmutz
Seaton
T. Repolust
Taresch
Trundle
U. Springmann
Urbaneja
Waldenfels
Wehrse
Publication venue: 'EDP Sciences'
Publication date: 01/01/2005
Field of study

We present new or improved methods for calculating NLTE, line-blanketed model atmospheres for hot stars with winds (spectral types A to O), with particular emphasis on a fast performance. These methods have been implemented into a previous, more simple version of the model atmosphere code FASTWIND (Santolaya-Rey et al.1997) and allow to spectroscopically analyze rather large samples of massive stars in a reasonable time-scale, using state-of-the-art physics. We describe our (partly approximate) approach to solve the equations of statistical equilibrium for those elements which are primarily responsible for line-blocking and blanketing, as well as an approximate treatment of the line-blocking itself, which is based on a simple statistical approach using suitable means for line opacities and emissivities. Furthermore, we comment on our implementation of a consistent temperature structure. In the second part, we concentrate on a detailed comparison with results from those two codes which have been used in alternative spectroscopical investigations, namely CMFGEN (Hillier & Miller 1998) and WM-Basic (Pauldrach et al. 2001). All three codes predict almost identical temperature structures and fluxes for lambda > 400 A, whereas at lower wavelengths a number of discrepancies are found. Optical H/He lines as synthesized by FASTWIND are compared with results from CMFGEN, obtaining a remarkable coincidence, except for the HeI singlets in the temperature range between 36,000 to 41,000 K for dwarfs and between 31,000 to 35,000 K for supergiants, where CMFGEN predicts much weaker lines. Consequences due to these discrepancies are discussed.Comment: 30 pages incl. 20 figures, accepted by A&

arXiv.org e-Print Archive

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

EDP Sciences OAI-PMH repository (1.2.0)

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Optimizing the Training of Models for Automated Post-Correction of Arbitrary OCR-ed Historical Texts

Author: Englmeier Tobias
Fink Florian
Schulz Klaus U.
Springmann Uwe
Publication venue: German Society for Computational Linguistics and Language Technology (GSCL)
Publication date: 03/12/2022
Field of study

Systems for post-correction of OCR-results for historical texts are based on statistical correction models obtained by supervised learning. For training, suitable collections of ground truth materials are needed. In this paper we investigate the dependency of the power of automated OCR post-correction on the form of ground truth data and other training settings used for the computation of a post-correction model. The post-correction system A-PoCoTo considered here is based on a profiler service that computes a statistical profile for an OCR-ed input text. We also look in detail at the influence of the profiler resources and other settings selected for training and evaluation. As a practical result of several fine-tuning steps, a general post-correction model is achieved where experiments for a large and heterogeneous collection of OCR-ed historical texts show a consistent improvement of base OCR accuracy. The results presented are meant to provide insights for libraries that want to apply OCR post-correction to a larger spectrum of distinct OCR-ed historical printings and ask for "representative" results

Crossref

Journal for Language Technology and Computational Linguistics (JLCL)

Mass-loss rates of Very Massive Stars

Author: A. D. Code
A. E. Wright
A. I. MacFadyen
A. J. Marle van
A. W. A. Pauldrach
A. W. A. Pauldrach
A. W. Fullerton
A.W.A. Pauldrach
B. Davies
B. Davies
B. Surlan
C. A. Iglesias
C. Leitherer
C. Trundle
C. V. Rodrigues
D. C. Abbott
D. C. Abbott
D. H. Cohen
D. J. Hillier
D. J. Hillier
D. Schaerer
E. A. Milne
E. Anders
F. Martins
G. Gräfener
G. Gräfener
G. Gräfener
G. Meynet
G. Meynet
H. Belkus
I. Baraffe
I. Brott
J. Castor
J. H. Groh
J. H. Groh
J. J. Eldridge
J. Krticka
J. O. Sundqvist
J. Puls
J. Puls
J. Puls
J. Puls
J. Puls
J. S. Vink
J. S. Vink
J. S. Vink
J. S. Vink
J.-P. Zahn
K. G. Gayley
K. G. Gayley
L. B. Lucy
L. B. Lucy
L. B. Lucy
L. M. Oskinova
L. Muijres
L. Muijres
L. Muijres
L. R. Yungelson
M. Cantiello
M. Limongi
M. R. Mokiem
N. J. Shaviv
N. Langer
N. Panagia
N. Smith
N. Smith
N. Smith
N. Yusof
P. E. Müller
P. E. Müller
P. S. Conti
Q.-K. Li
R. H. D. Townsend
R. Kotak
R. M. Humphreys
R.-P. Kudritzki
S. P. Owocki
S. P. Owocki
S. P. Owocki
S.-C. Bouret
T. J. Harries
T. Repolust
U. Springmann
W. Glatzel
W.-R. Hamann
W.-R. Hamann
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/06/2014
Field of study

We discuss the basic physics of hot-star winds and we provide mass-loss rates for (very) massive stars. Whilst the emphasis is on theoretical concepts and line-force modelling, we also discuss the current state of observations and empirical modelling, and address the issue of wind clumping.Comment: 36 pages, 15 figures, Book Chapter in "Very Massive Stars in the Local Universe", Springer, Ed. Jorick S. Vin

arXiv.org e-Print Archive

Crossref

Chandra spectroscopy of the hot star beta Crucis and the discovery of a pre-main-sequence companion

In order to test the O star wind-shock scenario for X-ray production in less luminous stars with weaker winds, we made a pointed 74 ks observation of the nearby early B giant, beta Cru (B0.5 III), with the Chandra HETGS. We find that the X-ray spectrum is quite soft, with a dominant thermal component near 3 million K, and that the emission lines are resolved but quite narrow, with half-widths of 150 km/s. The forbidden-to-intercombination line ratios of Ne IX and Mg XI indicate that the hot plasma is distributed in the wind, rather than confined near the photosphere. It is difficult to understand the X-ray data in the context of the standard wind-shock paradigm for OB stars, primarily because of the narrow lines, but also because of the high X-ray production efficiency. A scenario in which the bulk of the outer wind is shock heated is broadly consistent with the data, but not very well motivated theoretically. It is possible that magnetic channeling could explain the X-ray properties, although no field has been detected on beta Cru. We detected periodic variability in the hard (hnu > 1 keV) X-rays, modulated on the known optical period of 4.58 hours, which is the period of the primary beta Cep pulsation mode for this star. We also have detected, for the first time, an apparent companion to beta Cru at a projected separation of 4 arcsec. This companion was likely never seen in optical images because of the presumed very high contrast between it and beta Cru in the optical. However, the brightness contrast in the X-ray is only 3:1, which is consistent with the companion being an X-ray active low-mass pre-main-sequence star. The companion's X-ray spectrum is relatively hard and variable, as would be expected from a post T Tauri star.Comment: Accepted for publication in MNRAS; 19 pages, 15 figures, some in color; version with higher-resolution figures available at http://astro.swarthmore.edu/~cohen/papers/bcru_mnras2008.pd

arXiv.org e-Print Archive

CiteSeerX

Crossref