Essays about: "Markov decision process"

Showing results 31-35 of 47 essays containing the words Markov decision process.

  1. 31. Learning comparison: Reinforcement Learning vs Inverse Reinforcement Learning : How well does inverse reinforcement learning perform in simple markov decision processes in comparison to reinforcement learning?

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Pablo Izquierdo Ayala; [2019]

    Abstract : This research project elaborates a qualitative comparison between two different learning approaches, Reinforcement Learning (RL) and Inverse Reinforcement Learning (IRL) over the Gridworld Markov Decision Process. The interest focus will be set on the second learning paradigm, IRL, as it is considered to be relatively new and little work has been developed in this field of study. READ MORE

  2. 32. Transfer of reinforcement learning for a robotic skill

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Dulce Adriana Gómez Rosal; [2018]
    Keywords : Transfer learning; Reinforcement learning; Simulation; Robotics;

    Abstract : In this work, we develop the transfer learning (TL) of reinforcement learning (RL) for the robotic skill of throwing a ball into a basket, from a computer simulated environment to a real-world implementation. Whereas learning of the same skill has been previously explored by using a Programming by Demonstration approach directly on the real-world robot, for our work, the model-based RL algorithm PILCO was employed as an alternative as it provides the robot with no previous knowledge or hints, i. READ MORE

  3. 33. Deep Reinforcement Learning in Real-time Bidding

    University essay from Lunds universitet/Matematik LTH

    Author : Oskar Stigland; [2018]
    Keywords : Machine learning; reinforcement learning; markov decision process; neural network; deep Q-network; real-time bidding; online display advertisement; Mathematics and Statistics;

    Abstract : Real-time bidding is getting increasingly popular for buying and selling online display advertisement. This has spurred a research interest into how to design optimal bidding algorithms, with advances during the last two to three years focusing heavily on reinforcement learning. READ MORE

  4. 34. Learning Operational Goals for Propulsion System Using Reinforcement Learning

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Johan Lewenhaupt; [2018]

    Abstract : This degree project, conducted at ABB, aims to analyze and solve different situations that a crew on board a vessel might face by controlling its propulsion system. The propulsion system is viewed as static, transition-deterministic, as well as stochastic when measuring data. READ MORE

  5. 35. Probabilistic Least-violating Control Strategy Synthesis with Safety Rules

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Ludvig Janiuk; Johan Sjölén; [2018]

    Abstract : We consider the problem of automatic control strategy synthesis for discrete models of robotic systems, where the goal is to travel from some region to another while obeying a given set of safety rules in an environment with uncertain properties. This is a probabilistic extension of the work by Jana Tumová et al. READ MORE