Essays about: "reward system"

Showing result 21 - 25 of 141 essays containing the words reward system.

  1. 21. Improving a Reinforcement Learning Algorithm for Resource Scheduling

    University essay from Lunds universitet/Institutionen för reglerteknik

    Author : Elin Wilson Andersson; Johan Håkansson; [2022]
    Keywords : Technology and Engineering;

    Abstract : This thesis aims to further investigate the viability of using reinforcement learning, specifically Q-learning, to schedule shared resources on the Ericsson Many-Core Architecture (EMCA). This was first explored by Patrik Trulsson in his master thesis Dynamic Scheduling of Shared Resources using Reinforcement Learning (2021). READ MORE

  2. 22. Benchmarking Deep Reinforcement Learning on Continuous Control Tasks : AComparison of Neural Network Architectures and Environment Designs

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Daniel Sahlin; [2022]
    Keywords : Deep learning; Reinforcement learning; Reward functions; Neural networks; Furuta pendulum; Djupinlärning; Förstärkningsinlärning; Belöningsfunktioner; Neurala nätverk; Furuta-pendel;

    Abstract : Deep Reinforcement Learning (RL) has received much attention in recent years. This thesis investigates how reward functions, environment termination conditions, Neural Network (NN) architectures, and the type of the deep RL algorithm aect the performance for continuous control tasks. READ MORE

  3. 23. Deep Reinforcement Learning for Mapless Mobile Robot Navigation

    University essay from Luleå tekniska universitet/Institutionen för system- och rymdteknik

    Author : Ameer Hamza; [2022]
    Keywords : Mobile Robot; DRL; Deep Learning; Navigation; Mapless;

    Abstract : Navigation is the fundamental capability of mobile robots which allows them to move fromone point to another without any human interference. The autonomous operation of theserobots is depended on reliable, robust, and intelligent navigation system. READ MORE

  4. 24. Risk Averse Path Planning Using Lipschitz Approximated Wasserstein Distributionally Robust Deep Q-Learning

    University essay from Lunds universitet/Institutionen för reglerteknik

    Author : Cem Alptürk; [2022]
    Keywords : Technology and Engineering;

    Abstract : We investigate the problem of risk averse robot path planning using the deep reinforcement learning and distributionally robust optimization perspectives. Our problem formulation involves modelling the robot as a stochastic linear dynamical system, assuming that a collection of process noise samples is available. READ MORE

  5. 25. Platoon Coordination of Electric Trucks at a Charging Station

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Elin Björklund; Ebba Lindstedt; [2022]
    Keywords : Platooning; Electric trucks; Truck coordination; E-platooning; Platoon matching; Integer linear programming;

    Abstract : Electric trucks and platooning technology are expected to be part of the transportation system in the near future. Therefore, it is important to develop platoon coordination strategies and study the potential of platooning for when trucks are electric. READ MORE