9,727 research outputs found
VoxCeleb2: Deep Speaker Recognition
The objective of this paper is speaker recognition under noisy and
unconstrained conditions.
We make two key contributions. First, we introduce a very large-scale
audio-visual speaker recognition dataset collected from open-source media.
Using a fully automated pipeline, we curate VoxCeleb2 which contains over a
million utterances from over 6,000 speakers. This is several times larger than
any publicly available speaker recognition dataset.
Second, we develop and compare Convolutional Neural Network (CNN) models and
training strategies that can effectively recognise identities from voice under
various conditions. The models trained on the VoxCeleb2 dataset surpass the
performance of previous works on a benchmark dataset by a significant margin.Comment: To appear in Interspeech 2018. The audio-visual dataset can be
downloaded from http://www.robots.ox.ac.uk/~vgg/data/voxceleb2 .
1806.05622v2: minor fixes; 5 page
Disentangled Speech Embeddings using Cross-modal Self-supervision
The objective of this paper is to learn representations of speaker identity
without access to manually annotated data. To do so, we develop a
self-supervised learning objective that exploits the natural cross-modal
synchrony between faces and audio in video. The key idea behind our approach is
to tease apart--without annotation--the representations of linguistic content
and speaker identity. We construct a two-stream architecture which: (1) shares
low-level features common to both representations; and (2) provides a natural
mechanism for explicitly disentangling these factors, offering the potential
for greater generalisation to novel combinations of content and identity and
ultimately producing speaker identity representations that are more robust. We
train our method on a large-scale audio-visual dataset of talking heads `in the
wild', and demonstrate its efficacy by evaluating the learned speaker
representations for standard speaker recognition performance.Comment: ICASSP 2020. The first three authors contributed equally to this wor
The 125 GeV Higgs and Electroweak Phase Transition Model Classes
Recently, the ATLAS and CMS detectors have discovered a bosonic particle
which, to a reasonable degree of statistical uncertainty, fits the profile of
the Standard Model Higgs. One obvious implication is that models which predict
a significant departure from Standard Model phenomenology, such as large exotic
(e.g., invisible) Higgs decay or mixing with a hidden sector scalar, are
already ruled out. This observation threatens the viability of electroweak
baryogenesis, which favors, for example, a lighter Higgs and a Higgs coupled to
or mixed with light scalars. To assess the broad impact of these constraints,
we propose a scheme for classifying models of the electroweak phase transition
and impose constraints on a class-by-class basis. We find that models, such as
the MSSM, which rely on thermal loop effects are severely constrained by the
measurement of a 125 GeV Higgs. Models which rely on tree-level effects from a
light singlet are also restricted by invisible decay and mixing constraints.
Moreover, we find that the parametric region favored by electroweak
baryogenesis often coincides with an enhanced symmetry point with a distinctive
phenomenological character. In particular, enhancements arising through an
approximate continuous symmetry are phenomenologically disfavored, in contrast
with enhancements from discrete symmetries. We also comment on the excess of
diphoton events observed by ATLAS and CMS. We note that although Higgs portal
models can accommodate both enhanced diphoton decay and a strongly first order
electroweak phase transition, the former favors a negative Higgs portal
coupling whereas the latter favors a positive one, and therefore these two
constraints are at tension with one another.Comment: 35 pages, 7 figure
A fast direct numerical simulation method for characterising hydraulic roughness
We describe a fast direct numerical simulation (DNS) method that promises to
directly characterise the hydraulic roughness of any given rough surface, from
the hydraulically smooth to the fully rough regime. The method circumvents the
unfavourable computational cost associated with simulating high-Reynolds-number
flows by employing minimal-span channels (Jimenez & Moin 1991).
Proof-of-concept simulations demonstrate that flows in minimal-span channels
are sufficient for capturing the downward velocity shift, that is, the Hama
roughness function, predicted by flows in full-span channels. We consider two
sets of simulations, first with modelled roughness imposed by body forces, and
second with explicit roughness described by roughness-conforming grids. Owing
to the minimal cost, we are able to conduct DNSs with increasing roughness
Reynolds numbers while maintaining a fixed blockage ratio, as is typical in
full-scale applications. The present method promises a practical, fast and
accurate tool for characterising hydraulic resistance directly from
profilometry data of rough surfaces.Comment: Published in the Journal of Fluid Mechanic
The circumgalactic medium in Lyman-alpha: a new constraint on galactic outflow models
Galactic outflows are critical to our understanding of galaxy formation and
evolution. However the details of the underlying feedback process remain
unclear. We compare Ly observations of the circumgalactic medium (CGM)
of Lyman Break Galaxies (LBGs) with mock observations of their simulated CGM.
We use cosmological hydrodynamical `zoom-in' simulations of an LBG which
contains strong, momentum-driven galactic outflows. Simulation snapshots at
and are used, corresponding to the available observational
data. The simulation is post-processed with the radiative transfer code
\textsc{crash} to account for the impact of ionising photons on hydrogen gas
surrounding the simulated LBG. We generate mock absorption line maps for
comparison with data derived from observed close galaxy-galaxy pairs. We
perform calculations of Ly photons scattering through the CGM with our
newly developed Monte-Carlo code \textsc{slaf}, and compare to observations of
diffuse Ly halos around LBGs. Our fiducial galactic outflow model comes
closer to reproducing currently observed characteristics of the CGM in
Ly than a reference inefficient feedback model used for comparison.
Nevertheless, our fiducial model still struggles to reproduce the observed data
of the inner CGM (at impact parameter kpc). Our results suggest that
galactic outflows affect Ly absorption and emission around galaxies
mostly at impact parameters kpc, while cold accretion flows dominate at
larger distances. We discuss the implications of this result, and underline the
potential constraining power of CGM observations - in emission and absorption -
on galactic outflow models.Comment: 14 pages, 12 figure
A Possible Resolution of the Black Hole Information Puzzle
The problem of information loss is considered under the assumption that the
process of black hole evaporation terminates in the decay of the black hole
interior into a baby universe. We show that such theories can be decomposed
into superselection sectors labeled by eigenvalues of the third-quantized baby
universe field operator, and that scattering is unitary within each
superselection sector. This result relies crucially on the quantum-mechanical
variability of the decay time. It is further argued that the decay rate in the
black hole rest frame is necessarily proportional to , where
is the total entropy produced during the evaporation process,
entailing a very long-lived remnant.Comment: 15 pages, 3 uuencoded figures. Revised version contains some
notational simplification
- …
