Search CORE

13,680 research outputs found

Practical Open-Loop Optimistic Planning

Author: D Silver
D Silver
D Silver
J-F Hren
L Buşoniu
O Cappé
R Bellman
R Coulom
Publication venue
Publication date: 09/04/2019
Field of study

We consider the problem of online planning in a Markov Decision Process when given only access to a generative model, restricted to open-loop policies - i.e. sequences of actions - and under budget constraint. In this setting, the Open-Loop Optimistic Planning (OLOP) algorithm enjoys good theoretical guarantees but is overly conservative in practice, as we show in numerical experiments. We propose a modified version of the algorithm with tighter upper-confidence bounds, KLOLOP, that leads to better practical performances while retaining the sample complexity bound. Finally, we propose an efficient implementation that significantly improves the time complexity of both algorithms

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Cathodoluminescence of nanocrystalline Y2O3:Eu3+ with various Eu3+ concentrations

Author: den Engelsen D
Harris P
Ireland T
Silver J
Publication venue: 'The Electrochemical Society'
Publication date: 03/11/2014
Field of study

© The Author(s) 2014. Published by ECS. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY, http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse of the work in any medium, provided the original work is properly cited.This article has been made available through the Brunel Open Access Publishing Fund.Herein a study on the preparation and cathodoluminescence of monosized spherical nanoparticles of Y2O3:Eu3+ having a Eu3+ concentration that varies between 0.01 and 10% is described. The luminous efficiency and decay time have been determined at low a current density, whereas cathodoluminescence-microscopy has been carried out at high current density, the latter led to substantial saturation of certain spectral transitions. A novel theory is presented to evaluate the critical distance for energy transfer from Eu3+ ions in S6 to Eu3+ ions in C2 sites. It was found that Y2O3:Eu3+ with 1–2% Eu3+ has the highest luminous efficiency of 16lm/w at 15keV electron energy. Decay times of the emission from 5D0 (C2) and 5D1 (C2) and 5D0 (S6) levels were determined. The difference in decay time from the 5D0 (C2) and 5D1 (C2) levels largely explained the observed phenomena in the cathodoluminescence-micrographs recorded with our field emission scanning electron microscope

Crossref

Brunel University Research Archive

Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

Author: D Silver
D Silver
G Chowdhary
G Konidaris
J Hostetler
Levente Kocsis
M Jun
P Auer
RS Sutton
TG Dietterich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA light weight arm and, on a pick and delivery task with a Pioneer robot

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Optimization of design of space experiments from the standpoint of data processing Semiannual report, 1 Oct. 1967 - 31 Mar. 1968

Author: Algazi V. R.
Sakrison D. J.
Silver S.
Publication venue
Publication date
Field of study

Design and construction work on spacecraft array processor for onboard processing of experimental dat

NASA Technical Reports Server

Spaceflight performance of several types of silicon solar cells on the LIPS 3 satellite

Author: Silver J.
Warfield D.
Publication venue
Publication date
Field of study

Results from exposure of several types of Solarex silicon cells to a space environment for nearly two years on the LIPS 3 satellite are presented. Experiments include standard thickness (10 mil) cells with and without back surface fields, and ultrathin (2 mil) cells also with and without back surface fields. A comparison between a widely used coverslide adhesive, DC 93-500 and a potential alternate is also presented. The major findings from the data are that the 2 mil cells without a back surface field show the smallest normalized short circuit current degradation and that the 10 mil back surface field cells show the greatest absolute power output for the radiation exposures and temperatures encountered. The new encapsulant (McGhan Nusil CV-2500) exhibits a degradation comparable to DC 93-500. A comparison is made with each of the cell types in this experiment with expectations based on JPL Radiation Handbook data

NASA Technical Reports Server

Cathodoluminescence studies of phosphors in a scanning electron microscope

Author: Den engelsen D
Fern G
Harris P
Ireland T
Silver J
Publication venue: 'IOP Publishing'
Publication date: 17/06/2015
Field of study

Cathodoluminescence studies are reported of phosphors in a field emission scanning electron microscope (FESEM). A number of phosphor materials have been studied and exhibited a pronounced comet-like structure at high scan rates, because the particle continued to emit light after the beam had moved onto subsequent pixels. Image analysis has been used to study the loss of brightness along the tail and hence to determine the decay time of the materials. This technique provides a simple and convenient way to study the decay times of individual particles

Brunel University Research Archive

Cathodoluminescence of Double Layers of Phosphor Particles

Author: den Engelsen D
Harris P
Ireland T
Silver J
Publication venue: 'The Electrochemical Society'
Publication date: 01/01/2014
Field of study

This article has been made available through the Brunel Open Access Publishing Fund.We present radiance measurements of particle layers of ZnO:Zn, Y2O3:Eu and Y2O2S:Eu bombarded with electrons at anode voltages between 1 and 15 kV. The layers described in this work refer to single component layers, double layers and two component mixtures. The phosphor layers are deposited on ITO-coated glass slides by settling; the efficiency of the cathodoluminescence is determined by summing the radiances and luminances in the reflected and transmitted modes respectively. The efficiency of a double layer of Y2O3:Eu on top of ZnO:Zn at high electron energy is significantly larger than the efficiency of a corresponding layer in which the two components are mixed. This result is interpreted in terms of the penetration-model, which predicts a larger efficiency for a high-voltage phosphor on top of a low-voltage phosphor. When a layer of the low-voltage phosphor ZnO:Zn is on top of the high-voltage phosphor Y2O3:Eu, we also observe a higher efficiency than that of the corresponding layer with both components mixed. In this case the efficiency increases due to suppression of charging in the Y2O3:Eu layer. Double layers of ZnO:Zn and Y2O2S:Eu did not show enhanced efficiency, because the size of the Y2O2S:Eu particles was too large to evoke the penetration effect. © The Author(s) 2014. Published by ECS

Crossref

Brunel University Research Archive

Control of the finite size corrections in exact diagonalization studies

Author: C. Gros
Claudius Gros
D. Poilblanc
E. Dagotto
J. Jaklic
N. Furukawa
R. N. Silver
Publication venue: 'American Physical Society (APS)'
Publication date: 23/11/1995
Field of study

We study the possibility of controlling the finite size corrections in exact diagonalization studies quantitatively. We consider the one- and two dimensional Hubbard model. We show that the finite-size corrections can be be reduced systematically by a grand-canonical integration over boundary conditions. We find, in general, an improvement of one order of magnitude with respect to studies with periodic boundary conditions only. We present results for ground-state properties of the 2D Hubbard model and an evaluation of the specific heat for the 1D and 2D Hubbard model.Comment: Phys. Rev. B (Brief Report), in pres

arXiv.org e-Print Archive

Crossref

Assessing the Potential of Classical Q-learning in General Game Playing

Author: CB Browne
CJCH Watkins
CP Robert
D Silver
D Silver
H Wang
J Hu
J Méhat
M Genesereth
M Genesereth
M Świechowski
RS Sutton
V Mnih
Publication venue
Publication date: 14/10/2018
Field of study

After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee

\&

Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex)\footnote{source code: https://github.com/wh1992v/ggp-rl}, to allow comparison to Banerjee et al.. We find that Q-learning converges to a high win rate in GGP. For the

\epsilon

-greedy strategy, we propose a first enhancement, the dynamic

\epsilon

algorithm. In addition, inspired by (Gelly

\&

Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games.Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594

arXiv.org e-Print Archive

Crossref

Leiden University Scholary Publications

Fe XVII X-ray Line Ratios for Accurate Astrophysical Plasma Diagnostics

Author: Aggarwal
Audard
Beiersdorfer
Bevington
Bhatia
Brickhouse
Brown
Brown
Cowan
Currell
Donnelly
Doron
Drake
E. Silver
G.-X. Chen
Gillaspy
Gillaspy
Griffin
Gu
Gu
Huenemoerder
J. D. Gillaspy
J. M. Laming
J. M. Pomeroy
J. N. Tan
L. Tedesco
Laming
Loch
Marrs
Mohan
N. Brickhouse
Osten
Parkinson
Shirai
Silver
Silver
Smith
T. Lin
Tan
Publication venue: 'IOP Publishing'
Publication date: 14/06/2011
Field of study

New laboratory measurements using an Electron Beam Ion Trap (EBIT) and an x-ray microcalorimeter are presented for the n=3 to n=2 Fe XVII emission lines in the 15 {\AA} to 17 {\AA} range, along with new theoretical predictions for a variety of electron energy distributions. This work improves upon our earlier work on these lines by providing measurements at more electron impact energies (seven values from 846 to 1185 eV), performing an in situ determination of the x-ray window transmission, taking steps to minimize the ion impurity concentrations, correcting the electron energies for space charge shifts, and estimating the residual electron energy uncertainties. The results for the 3C/3D and 3s/3C line ratios are generally in agreement with the closest theory to within 10%, and in agreement with previous measurements from an independent group to within 20%. Better consistency between the two experimental groups is obtained at the lowest electron energies by using theory to interpolate, taking into account the significantly different electron energy distributions. Evidence for resonance collision effects in the spectra is discussed. Renormalized values for the absolute cross sections of the 3C and 3D lines are obtained by combining previously published results, and shown to be in agreement with the predictions of converged R-matrix theory. This work establishes consistency between results from independent laboratories and improves the reliability of these lines for astrophysical diagnostics. Factors that should be taken into account for accurate diagnostics are discussed, including electron energy distribution, polarization, absorption/scattering, and line blends.Comment: 29 pages, including 7 figure

arXiv.org e-Print Archive

Crossref