Essays about: "thesis on reward systems"

Showing results 1 - 5 of 34 essays containing the words thesis on reward systems.

  1. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space

    University essay from KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Andreu Matoses Gimenez; [2023]
    Keywords : Control theory; Multi-agent systems; Distributed systems; Formation control; Collision avoidance; Reinforcement learning;

    Abstract : In recent decades, significant theoretical advances have been made in the field of distributed multi-agent control theory. Among the most common problems that can be modelled as multi-agent systems are the so-called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation.

  2. Deep Reinforcement Learning and Simulation for the Optimization of Production Systems

    University essay from Uppsala universitet/Institutionen för informationsteknologi

    Author : Siyuan Chen; [2022]

    Abstract : The main objective of this master's thesis project is to use deep reinforcement learning (DRL) and simulation to optimize production systems. In this project, the Deep Q-Network (DQN) algorithm is first used to optimize seven decision variables in Averill Law’s production system to find the best profit, with 99.

  3. Investigating Multi-Objective Reinforcement Learning for Combinatorial Optimization and Scheduling Problems : Feature Identification for multi-objective Reinforcement Learning models

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Rikard Fridsén Skogsberg; [2022]
    Keywords : Multi-Objective Reinforcement Learning; Radio Resource Scheduling; Deep Q-Networks; Single-policy; Multi-policy; Scalarization;

    Abstract : Reinforcement Learning (RL) has in recent years become a core method for sequential decision making in complex dynamical systems, being of great interest to support improvements in scheduling problems. This could prove important to areas in the newer generation of cellular networks.

  4. Graph Bandits : Multi-Armed Bandits with Locality Constraints

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Kasper Johansson; [2022]
    Keywords : Multi-armed bandits; locality constraints; reinforcement learning;

    Abstract : Multi-armed bandits (MABs) have been studied extensively in the literature and have applications in a wealth of domains, including recommendation systems, dynamic pricing, and investment management. On the one hand, the current MAB literature largely seems to focus on the setting where each arm is available to play at each time step, and ignores how agents move between the arms.

  5. How do players experience a gacha game depending on their perspective as a starting or a veteran player? : A case study of Genshin Impact

    University essay from Uppsala universitet/Institutionen för speldesign

    Author : Dominykas Jėčius; Alexander Frestadius; [2022]
    Keywords : Computer games; gacha; game systems; reward systems; player experience; dark patterns; locked content;

    Abstract : The purpose of this thesis is to explore and examine how gameplay experiences differ in the gacha game Genshin Impact (miHoYo, 2020). In particular, there is a focus on the economic systems of the game along with the general play experience.