Essays about: "Markov Decision Process"

Showing results 11 - 15 of 47 essays containing the words Markov Decision Process.

  1. 11. Autonomous UAV Path Planning using RSS signals in Search and Rescue Operations

    University essay from Linköpings universitet/Reglerteknik

    Author : Axel Anhammer; Hugo Lundeberg; [2022]
    Keywords : UAV; DQN; Deep Q Network; particle filter; point mass filter; MDP; POMDP; Markov decision process; partially observable Markov decision process;

    Abstract : Unmanned aerial vehicles (UAVs) have emerged as a promising technology in search and rescue (SAR) operations. UAVs can provide more timely localization, thus shortening the critical duration of SAR operations.

  2. 12. Offline Reinforcement Learning for Optimization of Therapy Towards a Clinical Endpoint

    University essay from KTH/Medicinteknik och hälsosystem

    Author : Simon Jenner; [2022]
    Keywords : Offline; Reinforcement learning; Double Deep Q-Network; Cognitive behavior therapy; Digital therapeutics; Optimization; Förstärkningsinlärning; Dubbelt djupt Q-nätverk; Kognitiv beteendeterapi; Digital terapeutika; Optimering;

    Abstract : Improvements in data acquisition and computer-heavy methods in recent years have paved the way for completely digital healthcare solutions. Digital therapeutics (DTx) are such solutions and are often provided as mobile applications that must undergo clinical trials.

  3. 13. Graph Bandits : Multi-Armed Bandits with Locality Constraints

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Kasper Johansson; [2022]
    Keywords : Multi-armed bandits; locality constraints; reinforcement learning; Flerarmade banditer; lokala restriktioner; förstärkningsinlärning;

    Abstract : Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in a wealth of domains, including recommendation systems, dynamic pricing, and investment management. On the one hand, the current MAB literature largely seems to focus on the setting where each arm is available to play at each time step, and ignores how agents move between the arms.

  4. 14. A Study on Data-driven Methods for Selection and Evaluation of Beam Subsets in 5G NR

    University essay from Lunds universitet/Institutionen för elektro- och informationsteknik

    Author : Nic Ekman; Ilias Theodoros Skordas; [2022]
    Keywords : 5G; NR; telecom; telecommunications; ML; machine learning; algorithms; beam; widebeam; propagation; beamforming; subset; RAN; radio; radio environment; ericsson; Technology and Engineering;

    Abstract : 5G New Radio is the next generation of mobile networks and it comes with promises of ultra-high speeds, ultra-high reliability and ultra-low latency. This has posed a challenge for the engineers entrusted with the task of finding solutions which could fulfil the specification, and as a result, some promising areas have received increased attention in recent years.

  5. 15. Risk Averse Path Planning Using Lipschitz Approximated Wasserstein Distributionally Robust Deep Q-Learning

    University essay from Lunds universitet/Institutionen för reglerteknik

    Author : Cem Alptürk; [2022]
    Keywords : Technology and Engineering;

    Abstract : We investigate the problem of risk-averse robot path planning from the perspectives of deep reinforcement learning and distributionally robust optimization. Our problem formulation involves modelling the robot as a stochastic linear dynamical system, assuming that a collection of process noise samples is available.