Essays about: "Markov decision processes"

Showing result 1 - 5 of 12 essays containing the words Markov decision processes.

  1. 1. A Bandit Approach to Indirect Inference

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Erik Ildring; Felix Steinberger Eriksson; [2023]
    Keywords : ;

    Abstract : We present a novel approach to the family of parameter estimation methods known asindirect inference (II), using results from bandit optimization, a sub-field of reinforcementlearning concerned with stateless Markov decision processes (MDPs). First, we present theproblem of indirect inference and show how it may be cast into the general framework ofMDPs. READ MORE

  2. 2. Deep Reinforcement Learning and Simulation for the Optimization of Production Systems

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Siyuan Chen; [2022]
    Keywords : ;

    Abstract : The main objective of this master thesis project is to use the deep reinforcement learning (DRL) and simulation method for optimization of production systems. In this project, the Deep Q-learning Networks (DQN) algorithm is first used to optimize seven decision variables in Averill Law’s production system to find the best profit, with 99. READ MORE

  3. 3. Autonomous UAV Path Planning using RSS signals in Search and Rescue Operations

    University essay from Linköpings universitet/Reglerteknik

    Author : Axel Anhammer; Hugo Lundeberg; [2022]
    Keywords : UAV; DQN; Deep Q Network; particle filter; point mass filter; MDP; POMDP; Markov decision process; partially observable Markov decision process;

    Abstract : Unmanned aerial vehicles (UAVs) have emerged as a promising technology in search and rescue operations (SAR). UAVs have the ability to provide more timely localization, thus decreasing the crucial duration of SAR operations. READ MORE

  4. 4. Policy-based Reinforcement learning control for window opening and closing in an office building

    University essay from Högskolan Dalarna/Mikrodataanalys

    Author : Gokul Kaisaravalli Bhojraj; Yeswanth Surya Achyut Markonda; [2020]
    Keywords : Markov decision processes; Policy-based Reinforcement learning; Value-based Reinforcement learning; Q-learning; REINFORCE; policy gradient; window control; indoor comfort level;

    Abstract : The level of indoor comfort can highly be influenced by window opening and closing behavior of the occupant in an office building. It will not only affect the comfort level but also affects the energy consumption, if not properly managed. This occupant behavior is not easy to predict and control in conventional way. READ MORE

  5. 5. Constructing a Context-aware Recommender System with Web Sessions

    University essay from

    Author : Albin Bramstång; Yanling Jin; [2019-07-03T13:44:02Z 2019-07-03T13:44:02Z 2015]
    Keywords : Informations- och kommunikationsteknik; Data- och informationsvetenskap; Information Communication Technology; Computer and Information Science;

    Abstract : During the last decade, the importance of recommender systems has been increasing to the point that the success of many well-known service providers depends on these technologies. Recommender systems can assist people in their decision making process by anticipating preferences. READ MORE