Essays about: "policy iteration"
Showing results 1-5 of 13 essays containing the words policy iteration.
-
1. Risk-Averse Multi-Armed Bandit Problem with Multiple Plays
University essay from Göteborgs universitet/Institutionen för data- och informationsteknik
Abstract: This study aims to construct an efficient heuristic, referred to as RA, for a risk-averse Markovian multi-armed bandit (MAB) problem with multiple plays. RA incorporates risk aversion and multiple plays by modifying the Gittins index strategy.
-
2. Tackling Non-Stationarity in Reinforcement Learning via Latent Representation : An application to Intraday Foreign Exchange Trading
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)
Abstract: Reinforcement Learning has applications in various domains, but the typical assumption is of a stationary process. Hence, when this hypothesis does not hold, performance may be sub-optimal.
-
3. RDF vocabulary : Translation of policies with RDF
University essay from Linköpings universitet/Institutionen för datavetenskap
Abstract: Throughout this thesis, we have worked on translating policies into RDF formats and testing RDF vocabularies. Our goal is to create policies that can be applied to future industries within a circular economy. While Onto-Deside is the primary source of motivation for this work, we do not focus on it in this thesis.
-
4. Changing the Stories We Live By: Revolutionizing the North American Model of Wildlife Conservation Through Transformative Conservation
University essay from Uppsala universitet/Institutionen för geovetenskaper
Abstract: As biodiversity continues to diminish worldwide, an interrogation of long-standing conservation discourse is needed to reformulate a new conservation rhetoric that confronts the socio-ecological complexities of the world and reorients the relationship between humans and nature. Using ecologically sensitive critical discourse analysis, this research investigates the dominant ideologies perpetuated within an iteration of mainstream American wildlife discourse and explores opportunities for transformative conservation alternatives.
-
5. Offline Reinforcement Learning for Remote Electrical Tilt Optimization : An application of Conservative Q-Learning
University essay from KTH/Matematik (Avd.)
Abstract: In telecom networks, adjusting the tilt of antennas in an optimal manner, the so-called remote electrical tilt (RET) optimization, is a method to ensure quality of service (QoS) for network users. Tilt adjustments made during operations in real-world networks are usually executed through a suboptimal policy, and a significant amount of data is collected during the execution of such a policy.