2,796 research outputs found
Reinforcement learning or active inference?
This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain
Nano-Architecture of nitrogen-doped graphene films synthesized from a solid CN source
New synthesis routes to tailor graphene properties by controlling the concentration and chemical configuration of dopants show great promise. Herein we report the direct reproducible synthesis of 2-3% nitrogen-doped ‘few-layer’ graphene from a solid state nitrogen carbide a-C:N source synthesized by femtosecond pulsed laser ablation. Analytical investigations, including synchrotron facilities, made it possible to identify the configuration and chemistry of the nitrogen-doped graphene films. Auger mapping successfully quantified the 2D distribution of the number of graphene layers over the surface, and hence offers a new original way to probe the architecture of graphene sheets. The films mainly consist in a Bernal ABA stacking three-layer architecture, with a layer number distribution ranging from 2 to 6. Nitrogen doping affects the charge carrier distribution but has no significant effects on the number of lattice defects or disorders, compared to undoped graphene synthetized in similar conditions. Pyridinic, quaternary and pyrrolic nitrogen are the dominant chemical configurations, pyridinic N being preponderant at the scale of the film architecture. This work opens highly promising perspectives for the development of self-organized nitrogen-doped graphene materials, as synthetized from solid carbon nitride, with various functionalities, and for the characterization of 2D materials using a significant new methodology
Measurement of the inclusive and dijet cross-sections of b-jets in pp collisions at sqrt(s) = 7 TeV with the ATLAS detector
The inclusive and dijet production cross-sections have been measured for jets
containing b-hadrons (b-jets) in proton-proton collisions at a centre-of-mass
energy of sqrt(s) = 7 TeV, using the ATLAS detector at the LHC. The
measurements use data corresponding to an integrated luminosity of 34 pb^-1.
The b-jets are identified using either a lifetime-based method, where secondary
decay vertices of b-hadrons in jets are reconstructed using information from
the tracking detectors, or a muon-based method where the presence of a muon is
used to identify semileptonic decays of b-hadrons inside jets. The inclusive
b-jet cross-section is measured as a function of transverse momentum in the
range 20 < pT < 400 GeV and rapidity in the range |y| < 2.1. The bbbar-dijet
cross-section is measured as a function of the dijet invariant mass in the
range 110 < m_jj < 760 GeV, the azimuthal angle difference between the two jets
and the angular variable chi in two dijet mass regions. The results are
compared with next-to-leading-order QCD predictions. Good agreement is observed
between the measured cross-sections and the predictions obtained using POWHEG +
Pythia. MC@NLO + Herwig shows good agreement with the measured bbbar-dijet
cross-section. However, it does not reproduce the measured inclusive
cross-section well, particularly for central b-jets with large transverse
momenta.Comment: 10 pages plus author list (21 pages total), 8 figures, 1 table, final
version published in European Physical Journal
Role of Dopamine D2 Receptors in Human Reinforcement Learning
Influential neurocomputational models emphasize dopamine (DA) as an electrophysiological and neurochemical correlate of reinforcement learning. However, evidence of a specific causal role of DA receptors in learning has been less forthcoming, especially in humans. Here we combine, in a between-subjects design, administration of a high dose of the selective DA D2/3-receptor antagonist sulpiride with genetic analysis of the DA D2 receptor in a behavioral study of reinforcement learning in a sample of 78 healthy male volunteers. In contrast to predictions of prevailing models emphasizing DA's pivotal role in learning via prediction errors, we found that sulpiride did not disrupt learning, but rather induced profound impairments in choice performance. The disruption was selective for stimuli indicating reward, while loss avoidance performance was unaffected. Effects were driven by volunteers with higher serum levels of the drug, and in those with genetically-determined lower density of striatal DA D2 receptors. This is the clearest demonstration to date for a causal modulatory role of the DA D2 receptor in choice performance that might be distinct from learning. Our findings challenge current reward prediction error models of reinforcement learning, and suggest that classical animal models emphasizing a role of postsynaptic DA D2 receptors in motivational aspects of reinforcement learning may apply to humans as well.Neuropsychopharmacology accepted article peview online, 09 April 2014; doi:10.1038/npp.2014.84
Search for direct pair production of the top squark in all-hadronic final states in proton-proton collisions at s√=8 TeV with the ATLAS detector
The results of a search for direct pair production of the scalar partner to the top quark using an integrated luminosity of 20.1fb−1 of proton–proton collision data at √s = 8 TeV recorded with the ATLAS detector at the LHC are reported. The top squark is assumed to decay via t˜→tχ˜01 or t˜→ bχ˜±1 →bW(∗)χ˜01 , where χ˜01 (χ˜±1 ) denotes the lightest neutralino (chargino) in supersymmetric models. The search targets a fully-hadronic final state in events with four or more jets and large missing transverse momentum. No significant excess over the Standard Model background prediction is observed, and exclusion limits are reported in terms of the top squark and neutralino masses and as a function of the branching fraction of t˜ → tχ˜01 . For a branching fraction of 100%, top squark masses in the range 270–645 GeV are excluded for χ˜01 masses below 30 GeV. For a branching fraction of 50% to either t˜ → tχ˜01 or t˜ → bχ˜±1 , and assuming the χ˜±1 mass to be twice the χ˜01 mass, top squark masses in the range 250–550 GeV are excluded for χ˜01 masses below 60 GeV
Observation of associated near-side and away-side long-range correlations in √sNN=5.02 TeV proton-lead collisions with the ATLAS detector
Two-particle correlations in relative azimuthal angle (Δϕ) and pseudorapidity (Δη) are measured in √sNN=5.02 TeV p+Pb collisions using the ATLAS detector at the LHC. The measurements are performed using approximately 1 μb-1 of data as a function of transverse momentum (pT) and the transverse energy (ΣETPb) summed over 3.1<η<4.9 in the direction of the Pb beam. The correlation function, constructed from charged particles, exhibits a long-range (2<|Δη|<5) “near-side” (Δϕ∼0) correlation that grows rapidly with increasing ΣETPb. A long-range “away-side” (Δϕ∼π) correlation, obtained by subtracting the expected contributions from recoiling dijets and other sources estimated using events with small ΣETPb, is found to match the near-side correlation in magnitude, shape (in Δη and Δϕ) and ΣETPb dependence. The resultant Δϕ correlation is approximately symmetric about π/2, and is consistent with a dominant cos2Δϕ modulation for all ΣETPb ranges and particle pT
Search for new phenomena in final states with an energetic jet and large missing transverse momentum in pp collisions at √ s = 8 TeV with the ATLAS detector
Results of a search for new phenomena in final states with an energetic jet and large missing transverse momentum are reported. The search uses 20.3 fb−1 of √ s = 8 TeV data collected in 2012 with the ATLAS detector at the LHC. Events are required to have at least one jet with pT > 120 GeV and no leptons. Nine signal regions are considered with increasing missing transverse momentum requirements between Emiss T > 150 GeV and Emiss T > 700 GeV. Good agreement is observed between the number of events in data and Standard Model expectations. The results are translated into exclusion limits on models with either large extra spatial dimensions, pair production of weakly interacting dark matter candidates, or production of very light gravitinos in a gauge-mediated supersymmetric model. In addition, limits on the production of an invisibly decaying Higgs-like boson leading to similar topologies in the final state are presente
Recommended from our members
PS18kh: A New Tidal Disruption Event with a Non-axisymmetric Accretion Disk
We present the discovery of PS18kh, a tidal disruption event discovered at the center of SDSS J075654.53+341543.6 (d ≃ 322 Mpc) by the Pan-STARRS Survey for Transients. Our data set includes pre-discovery survey data from Pan-STARRS, the All-sky Automated Survey for Supernovae, and the Asteroid Terrestrial-impact Last Alert System as well as high-cadence, multiwavelength follow-up data from ground-based telescopes and Swift, spanning from 56 days before peak light until 75 days after. The optical/UV emission from PS18kh is well-fit as a blackbody with temperatures ranging from T ≃ 12,000 K to T ≃ 25,000 K and it peaked at a luminosity of L ≃ 8.8 × 10 erg s . PS18kh radiated E = (3.45 ± 0.22) × 10 erg over the period of observation, with (1.42 ± 0.20) × 10 erg being released during the rise to peak. Spectra of PS18kh show a changing, boxy/double-peaked Hα emission feature, which becomes more prominent over time. We use models of non-axisymmetric accretion disks to describe the profile of the Hα line and its evolution. We find that at early times the high accretion rate leads the disk to emit a wind which modifies the shape of the line profile and makes it bell-shaped. At late times, the wind becomes optically thin, allowing the non-axisymmetric perturbations to show up in the line profile. The line-emitting portion of the disk extends from r ∼ 60r to an outer radius of r ∼ 1400r and the perturbations can be represented either as an eccentricity in the outer rings of the disk or as a spiral arm in the inner disk. 43 -1 50 50 in g out
Aging Skin: Nourishing from Out-In. Lessons from Wound Healing
Skin lesion therapy, peculiarly in the elderly, cannot be isolated from understanding that the skin is an important organ consisting of different tissues. Furthermore, dermis health is fundamental for epidermis
integrity, and so adequate nourishment is mandatory in maintaining skin integrity. The dermis nourishes the epidermis, and a healthy epidermis protects the dermis from the environment, so nourishing the dermis
through the epidermal barrier is a technical problem yet to be resolved. This is also a consequence of the laws and regulations restricting cosmetics, which cannot have properties that pass the epidermal layer.
There is higher investment in cosmetics than in the pharmaceutical industry dealing with skin therapies, because the costs of drug registration are enormous and the field is unprofitable. Still, wound healing may
be seen as an opportunity to “feed” the dermis directly. It could also verify whether providing substrates could promote efficient healing and test optimal skin integrity maintenance, if not skin rejuvenation, in an
ever aging population
Targeted apoptosis in ovarian cancer cells through mitochondrial dysfunction in response to Sambucus nigra agglutinin
Ovarian carcinoma (OC) patients encounter the severe challenge of clinical management owing to lack of screening measures, chemoresistance and finally dearth of non-toxic therapeutics. Cancer cells deploy various defense strategies to sustain the tumor microenvironment, among which deregulated apoptosis remains a versatile promoter of cancer progression. Although recent research has focused on identifying agents capable of inducing apoptosis in cancer cells, yet molecules efficiently breaching their
survival advantage are yet to be classified. Here we identify lectin, Sambucus nigra agglutinin (SNA) to exhibit selectivity towards identifying OC by virtue of its specific recognition of α-2, 6-linked sialic acids. Superficial binding of SNA to the OC cells confirm
the hyper-sialylated status of the disease. Further, SNA activates the signaling pathways of AKT and ERK1/2, which eventually promotes de-phosphorylation of dynamin-related protein-1 (Drp-1). Upon its translocation to the mitochondrial fission loci Drp-1 mediates the central role of switch in the mitochondrial phenotype to attain fragmented morphology. We confirmed mitochondrial
outer membrane permeabilization resulting in ROS generation and cytochrome-c release into the cytosol. SNA response resulted in an allied shift of the bioenergetics profile from Warburg phenotype to elevated mitochondrial oxidative phosphorylation, altogether highlighting the involvement of mitochondrial dysfunction in restraining cancer progression. Inability to replenish the SNA-induced energy crunch of the proliferating cancer cells on the event of perturbed respiratory outcome resulted in cell cycle
arrest before G2/M phase. Our findings position SNA at a crucial juncture where it proves to be a promising candidate for impeding progression of OC. Altogether we unveil the novel aspect of identifying natural molecules harboring the inherent capability of targeting mitochondrial structural dynamics, to hold the future for developing non-toxic therapeutics for treating OC
- …
