Essays about: "Actor-critic"
Showing result 1 - 5 of 20 essays containing the word Actor-critic.
-
1. Scalable Reinforcement Learning for Linear-Quadratic Control of Networks
University essay from Lunds universitet/Institutionen för reglerteknikAbstract : Distributed optimal control is known to be challenging and can become intractable even for linear-quadratic regulator problems. In this work, we study a special class of such problems where distributed state feedback controllers can give near-optimal performance. READ MORE
-
2. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space
University essay from KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : In the last decades, significant theoretical advances have been made on the field of distributed mulit-agent control theory. One of the most common systems that can be modelled as multi-agent systems are the so called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation. READ MORE
-
3. Intelligent autoscaling in Kubernetes : the impact of container performance indicators in model-free DRL methods
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : A key challenge in the field of cloud computing is to automatically scale software containers in a way that accurately matches the demand for the services they run. To manage such components, container orchestrator tools such as Kubernetes are employed, and in the past few years, researchers have attempted to optimise its autoscaling mechanism with different approaches. READ MORE
-
4. Deep Reinforcement Learning Approach to Portfolio Optimization
University essay from Lunds universitet/Nationalekonomiska institutionenAbstract : This paper evaluates whether a deep reinforcement learning (DRL) approach can be implemented, on the Swedish stock market, to optimize a portfolio. The objective is to create and train two DRL algorithms that can construct portfolios that will be benchmarked against the market portfolio, tracking OMXS30, and the two conventional methods, the naive portfolio, and minimum variance portfolio. READ MORE
-
5. Spiking Reinforcement Learning for Robust Robot Control Under Varying Operating Conditions
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : Over the last few years, deep reinforcement learning (RL) has gained increasing popularity for its successful application to a variety of complex control and decision-making tasks. As the demand for deep RL algorithms deployed in challenging real-world environments grows, their robustness towards uncertainty, disturbances and perturbations of the environment becomes more and more important. READ MORE