Essays about: "policy iteration"

Showing results 1 - 5 of 13 essays containing the words policy iteration.

  1. Risk-Averse Multi-Armed Bandit Problem with Multiple Plays

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Siri Dahlgren; Nicholas Marriott; [2023-10-23]
    Keywords : MAB; Gittins; Markovian bandit; risk-aversion; policy iteration; multiple plays;

    Abstract : This study aims to construct an efficient heuristic, referred to as RA, for a risk-averse Markovian multi-armed bandit problem (MAB) with multiple plays. RA incorporates risk-aversion and multiple plays by modifying the Gittins index strategy.

  2. Tackling Non-Stationarity in Reinforcement Learning via Latent Representation : An application to Intraday Foreign Exchange Trading

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Adriano Mundo; [2023]
    Keywords : Reinforcement Learning; Latent Representation; VAE; Non-Stationary; FQI; FX Trading;

    Abstract : Reinforcement Learning has applications in various domains, but it typically assumes a stationary process. When this assumption does not hold, performance may be sub-optimal.

  3. RDF vocabulary : Translation of policies with RDF

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Sergio Garcia Bernabeu; Lukas Bergdahl; [2023]
    Keywords : RDF; RDF vocabulary; Security; ODRL; Dublin core; Policy; RDF translation;

    Abstract : Throughout this thesis, we have worked on translating policies into RDF formats and testing RDF vocabularies. Our goal is to create policies that can be applied to future industries within a circular economy. While Onto-Deside is the primary source of motivation for this work, we do not focus on it in this thesis.

  4. Changing the Stories We Live By: Revolutionizing the North American Model of Wildlife Conservation Through Transformative Conservation

    University essay from Uppsala universitet/Institutionen för geovetenskaper

    Author : Tess Marie Burroughs; [2022]
    Keywords : Sustainable Development; Wildlife Conservation; Biodiversity Loss; Critical Discourse Analysis; Transformative Conservation;

    Abstract : As biodiversity continues to diminish worldwide, an interrogation of long-standing conservation discourse is needed to formulate a new conservation rhetoric that confronts the socio-ecological complexities of the world and reorients the relationship between humans and nature. Using ecologically sensitive critical discourse analysis, this research investigates the dominant ideologies perpetuated within an iteration of mainstream American wildlife discourse and explores opportunities for transformative conservation alternatives.

  5. Offline Reinforcement Learning for Remote Electrical Tilt Optimization : An application of Conservative Q-Learning

    University essay from KTH/Matematik (Avd.)

    Author : Marcus Kastengren; [2021]
    Keywords : Remote Electrical Tilt; Antenna Tilt Optimization; Reinforcement Learning; Offline Reinforcement Learning; Conservative Q-Learning;

    Abstract : In telecom networks, adjusting the tilt of antennas in an optimal manner, so-called remote electrical tilt (RET) optimization, is a method to ensure quality of service (QoS) for network users. Tilt adjustments made during operations in real-world networks are usually executed through a suboptimal policy, and a significant amount of data is collected during the execution of such a policy.