Practical Open-Loop Optimistic Planning
We consider the problem of online planning in a Markov Decision Process when
given only access to a generative model, restricted to open-loop policies -
i.e. sequences of actions - and under a budget constraint. In this setting, the
Open-Loop Optimistic Planning (OLOP) algorithm enjoys good theoretical
guarantees but is overly conservative in practice, as we show in numerical
experiments. We propose a modified version of the algorithm with tighter
upper-confidence bounds, KLOLOP, that leads to better practical performance
while retaining the sample complexity bound. Finally, we propose an efficient
implementation that significantly improves the time complexity of both
algorithms.
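The abstract does not spell out the confidence bounds themselves. As a minimal sketch of why a Kullback-Leibler bound can be tighter than a Hoeffding-type one, the code below compares the two for an empirical mean of rewards in [0, 1]. The Bernoulli form of the divergence, the function names and the `threshold` parameter are assumptions made for illustration, not the paper's definitions.

```python
import math

def kl_bernoulli(p, q):
    """KL divergence between Bernoulli(p) and Bernoulli(q)."""
    eps = 1e-12  # clip to avoid log(0)
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

def hoeffding_ucb(mean, n, threshold):
    """Hoeffding-type bound: mean + sqrt(threshold / (2 n)), clipped to 1."""
    return min(1.0, mean + math.sqrt(threshold / (2 * n)))

def kl_ucb(mean, n, threshold, iters=50):
    """Largest q >= mean with n * KL(mean, q) <= threshold, found by bisection."""
    lo, hi = mean, 1.0
    for _ in range(iters):
        mid = (lo + hi) / 2
        if n * kl_bernoulli(mean, mid) <= threshold:
            lo = mid  # mid still satisfies the constraint
        else:
            hi = mid
    return lo

# Hypothetical numbers: empirical mean 0.1 after 20 samples.
mean, n, threshold = 0.1, 20, math.log(100)
print(hoeffding_ucb(mean, n, threshold), kl_ucb(mean, n, threshold))
```

By Pinsker's inequality, KL(p, q) >= 2 (p - q)^2, so the KL-based bound never exceeds the Hoeffding-type bound for the same threshold; the gap is largest when the empirical mean is close to 0 or 1.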
Cathodoluminescence of nanocrystalline Y2O3:Eu3+ with various Eu3+ concentrations
© The Author(s) 2014. Published by ECS. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 License (CC BY, http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse of the work in any medium, provided the original work is properly cited. This article has been made available through the Brunel Open Access Publishing Fund. Herein a study on the preparation and cathodoluminescence of monosized spherical nanoparticles of Y2O3:Eu3+ having a Eu3+ concentration that varies between 0.01 and 10% is described. The luminous efficiency and decay time have been determined at a low current density, whereas cathodoluminescence microscopy has been carried out at high current density; the latter led to substantial saturation of certain spectral transitions. A novel theory is presented to evaluate the critical distance for energy transfer from Eu3+ ions in S6 to Eu3+ ions in C2 sites. It was found that Y2O3:Eu3+ with 1–2% Eu3+ has the highest luminous efficiency of 16 lm/W at 15 keV electron energy. Decay times of the emission from the 5D0 (C2), 5D1 (C2) and 5D0 (S6) levels were determined. The difference in decay time between the 5D0 (C2) and 5D1 (C2) levels largely explained the observed phenomena in the cathodoluminescence micrographs recorded with our field emission scanning electron microscope.
Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation
Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA lightweight arm and on a pick-and-delivery task with a Pioneer robot.
Optimization of design of space experiments from the standpoint of data processing Semiannual report, 1 Oct. 1967 - 31 Mar. 1968
Design and construction work on a spacecraft array processor for onboard processing of experimental data.
Spaceflight performance of several types of silicon solar cells on the LIPS 3 satellite
Results from exposure of several types of Solarex silicon cells to a space environment for nearly two years on the LIPS 3 satellite are presented. Experiments include standard thickness (10 mil) cells with and without back surface fields, and ultrathin (2 mil) cells also with and without back surface fields. A comparison between a widely used coverslide adhesive, DC 93-500, and a potential alternate is also presented. The major findings from the data are that the 2 mil cells without a back surface field show the smallest normalized short circuit current degradation and that the 10 mil back surface field cells show the greatest absolute power output for the radiation exposures and temperatures encountered. The new encapsulant (McGhan Nusil CV-2500) exhibits a degradation comparable to DC 93-500. Each of the cell types in this experiment is compared with expectations based on JPL Radiation Handbook data.
Cathodoluminescence studies of phosphors in a scanning electron microscope
Cathodoluminescence studies are reported of phosphors in a field emission scanning electron microscope (FESEM). A number of phosphor materials have been studied and exhibited a pronounced comet-like structure at high scan rates, because the particle continued to emit light after the beam had moved onto subsequent pixels. Image analysis has been used to study the loss of brightness along the tail and hence to determine the decay time of the materials. This technique provides a simple and convenient way to study the decay times of individual particles.
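As a rough illustration of how a decay time could be read off a comet tail, the sketch below fits a single exponential to brightness values sampled along the tail. The single-exponential model, the known per-pixel dwell time, the function name and the synthetic data are all assumptions for illustration; the abstract does not describe the authors' image-analysis procedure in this detail.

```python
import math

def decay_time_from_tail(brightness, dwell_time):
    """Estimate tau in I(t) = I0 * exp(-t / tau) by a least-squares
    line fit of ln(brightness) against time.

    brightness[i] is the tail brightness i pixels behind the particle;
    dwell_time is the beam dwell time per pixel, in seconds.
    """
    times = [i * dwell_time for i in range(len(brightness))]
    logs = [math.log(b) for b in brightness]
    n = len(times)
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    slope = (sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
             / sum((t - t_mean) ** 2 for t in times))
    return -1.0 / slope  # slope of ln I versus t is -1 / tau

# Synthetic, noise-free tail (hypothetical values, not measured data).
dwell = 1e-6      # 1 microsecond per pixel
tau_true = 5e-6   # 5 microsecond decay time
tail = [100.0 * math.exp(-i * dwell / tau_true) for i in range(20)]
tau_est = decay_time_from_tail(tail, dwell)
```

With real micrographs the background level would need to be subtracted before taking logarithms, and noisy pixels near the end of the tail would typically be excluded from the fit.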
Cathodoluminescence of Double Layers of Phosphor Particles
This article has been made available through the Brunel Open Access Publishing Fund. We present radiance measurements of particle layers of ZnO:Zn, Y2O3:Eu and Y2O2S:Eu bombarded with electrons at anode voltages between 1 and 15 kV. The layers described in this work refer to single component layers, double layers and two component mixtures. The phosphor layers are deposited on ITO-coated glass slides by settling; the efficiency of the cathodoluminescence is determined by summing the radiances and luminances in the reflected and transmitted modes respectively. The efficiency of a double layer of Y2O3:Eu on top of ZnO:Zn at high electron energy is significantly larger than the efficiency of a corresponding layer in which the two components are mixed. This result is interpreted in terms of the penetration-model, which predicts a larger efficiency for a high-voltage phosphor on top of a low-voltage phosphor. When a layer of the low-voltage phosphor ZnO:Zn is on top of the high-voltage phosphor Y2O3:Eu, we also observe a higher efficiency than that of the corresponding layer with both components mixed. In this case the efficiency increases due to suppression of charging in the Y2O3:Eu layer. Double layers of ZnO:Zn and Y2O2S:Eu did not show enhanced efficiency, because the size of the Y2O2S:Eu particles was too large to evoke the penetration effect.
© The Author(s) 2014. Published by ECS
Control of the finite size corrections in exact diagonalization studies
We study the possibility of controlling the finite size corrections in exact
diagonalization studies quantitatively. We consider the one- and
two-dimensional Hubbard model. We show that the finite-size corrections can be
reduced systematically by a grand-canonical integration over boundary
conditions. We find, in general, an improvement of one order of magnitude with
respect to studies with periodic boundary conditions only. We present results
for ground-state properties of the 2D Hubbard model and an evaluation of the
specific heat for the 1D and 2D Hubbard model. Comment: Phys. Rev. B (Brief Report), in press
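The idea of integrating over boundary conditions at fixed chemical potential can be illustrated in the non-interacting limit, where no diagonalization is needed. The sketch below is only that: a spinless tight-binding ring (U = 0) at half filling, with a boundary twist theta that shifts the allowed momenta. The function names, the twist grid and the choice of model are assumptions for illustration and do not reproduce the paper's Hubbard-model calculations.

```python
import math

def energy_per_site(L, theta, mu=0.0, t=1.0):
    """Grand-canonical ground-state energy per site of a spinless
    tight-binding ring with L sites and boundary twist theta:
    every single-particle level below mu is filled."""
    total = 0.0
    for n in range(L):
        k = (2 * math.pi * n + theta) / L   # twist shifts the momenta
        eps = -2.0 * t * math.cos(k)
        if eps < mu:
            total += eps
    return total / L

def twist_averaged_energy(L, n_twists=200, mu=0.0, t=1.0):
    """Average over twists theta in [0, 2*pi) on a midpoint grid."""
    thetas = [2 * math.pi * (j + 0.5) / n_twists for j in range(n_twists)]
    return sum(energy_per_site(L, th, mu, t) for th in thetas) / n_twists

L = 6
exact = -2.0 / math.pi                      # thermodynamic limit at mu = 0
pbc_error = abs(energy_per_site(L, 0.0) - exact)
avg_error = abs(twist_averaged_energy(L) - exact)
print(pbc_error, avg_error)
```

For free particles the twist average samples the whole Brillouin zone, so the finite-size error essentially vanishes in this toy case. With interactions the improvement is not exact, and the abstract quotes roughly one order of magnitude.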
Assessing the Potential of Classical Q-learning in General Game Playing
After the recent groundbreaking results of AlphaGo and AlphaZero, we have
seen strong interest in deep reinforcement learning and artificial general
intelligence (AGI) in game playing. However, deep learning is
resource-intensive and the theory is not yet well developed. For small games,
simple classical table-based Q-learning might still be the algorithm of choice.
General Game Playing (GGP) provides a good testbed for reinforcement learning
to research AGI. Q-learning is one of the canonical reinforcement learning
methods, and has been used by (Banerjee & Stone, IJCAI 2007) in GGP. In this
paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe,
Connect Four, Hex) (source code: https://github.com/wh1992v/ggp-rl), to
allow comparison to Banerjee et al. We find that Q-learning converges to a
high win rate in GGP. For the ε-greedy strategy, we propose a first
enhancement, the dynamic ε algorithm. In addition, inspired by (Gelly &
Silver, ICML 2007) we combine online search (Monte Carlo Search) to
enhance offline learning, and propose QM-learning for GGP. Both enhancements
improve the performance of classical Q-learning. In this work, GGP allows us to
show, if augmented by appropriate enhancements, that classical table-based
Q-learning can perform well in small games. Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594
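For readers unfamiliar with table-based Q-learning, a minimal single-agent sketch follows. It shows the standard update rule together with an exploration rate that shrinks over training. The linear schedule in `epsilon_at`, the parameter values and all function names are hypothetical; the abstract does not specify the form of the authors' dynamic exploration scheme, and this is not their implementation.

```python
import random
from collections import defaultdict

def epsilon_at(episode, total_episodes, start=0.5, end=0.0):
    """Hypothetical schedule: exploration rate decays linearly
    from start to end over the course of training."""
    frac = min(1.0, episode / max(1, total_episodes))
    return start + (end - start) * frac

def choose_action(q, state, actions, epsilon, rng=random):
    """Random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])

def q_update(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """Standard tabular Q-learning update; a terminal next state
    (no next actions) contributes a future value of 0."""
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + gamma * best_next
    q[(state, action)] += alpha * (target - q[(state, action)])

# One update on a toy transition that ends the episode with reward 1.
q = defaultdict(float)
q_update(q, "s0", "a", 1.0, "s1", [])
greedy = choose_action(q, "s0", ["a", "b"], epsilon=0.0)
```

In a two-player game the next state seen by the learner is the position after the opponent's reply, which this sketch leaves to the caller.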
Fe XVII X-ray Line Ratios for Accurate Astrophysical Plasma Diagnostics
New laboratory measurements using an Electron Beam Ion Trap (EBIT) and an
x-ray microcalorimeter are presented for the n=3 to n=2 Fe XVII emission lines
in the 15 Å to 17 Å range, along with new theoretical predictions for a
variety of electron energy distributions. This work improves upon our earlier
work on these lines by providing measurements at more electron impact energies
(seven values from 846 to 1185 eV), performing an in situ determination of the
x-ray window transmission, taking steps to minimize the ion impurity
concentrations, correcting the electron energies for space charge shifts, and
estimating the residual electron energy uncertainties. The results for the
3C/3D and 3s/3C line ratios are generally in agreement with the closest theory
to within 10%, and in agreement with previous measurements from an independent
group to within 20%. Better consistency between the two experimental groups is
obtained at the lowest electron energies by using theory to interpolate, taking
into account the significantly different electron energy distributions.
Evidence for resonance collision effects in the spectra is discussed.
Renormalized values for the absolute cross sections of the 3C and 3D lines are
obtained by combining previously published results, and shown to be in
agreement with the predictions of converged R-matrix theory. This work
establishes consistency between results from independent laboratories and
improves the reliability of these lines for astrophysical diagnostics. Factors
that should be taken into account for accurate diagnostics are discussed,
including electron energy distribution, polarization, absorption/scattering,
and line blends. Comment: 29 pages, including 7 figures
