Essays about: "policy gradient"

Showing results 1-5 of 44 essays containing the words policy gradient.

  1. Deep reinforcement learning for automated building climate control

    University essay from Linköpings universitet/Institutionen för datavetenskap

    Author : Erik Snällfot; Martin Hörnberg; [2024]
    Keywords : Machine Learning; Reinforcement Learning; Deep Learning; Deep Reinforcement Learning; Building Control; Control System;

    Abstract : The building sector is the single largest contributor to greenhouse gas emissions, making it a natural focal point for reducing energy consumption. More efficient use of energy is also becoming increasingly important for property managers as global energy prices are skyrocketing.

  2. S-MARL: An Algorithm for Single-To-Multi-Agent Reinforcement Learning : Case Study: Formula 1 Race Strategies

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Marinaro Davide; [2023]
    Keywords : Reinforcement Learning; Single-to-Multi-Agent; Learning Stability; Exploration-Exploitation trade-off; Race Strategy Optimization;

    Abstract : A Multi-Agent System is a group of autonomous, intelligent, interacting agents that share an environment, which they observe through sensors and act upon with actuators. The behaviors of these agents can either be defined up front by programmers or learned by trial and error using Reinforcement Learning.

  3. Uncontrolled intersection coordination of the autonomous vehicle based on multi-agent reinforcement learning.

    University essay from Malmö universitet/Fakulteten för teknik och samhälle (TS)

    Author : Isaac Arnold McSey; [2023]
    Keywords : Autonomous Vehicles (AVs); Road Safety; Fuel Efficiency; Business Dynamics; Intersections; Human-Driven Vehicles (HDVs); Pedestrians; Multi-Agent Reinforcement Learning (MARL); Multi-Agent Deep Deterministic Policy Gradient (MADDPG); Algorithmic Interactions; Uncontrolled Intersections; Global Insights; Safety Improvements; Comfort Improvements; Learning Process; Global Experiences; Complex Environments; Passenger Comfort; Navigation;

    Abstract : This study explores the application of multi-agent reinforcement learning (MARL) to enhance the decision-making, safety, and passenger comfort of Autonomous Vehicles (AVs) at uncontrolled intersections. The research aims to assess the potential of MARL in modeling multiple agents interacting within a shared environment, reflecting real-world situations where AVs interact with multiple actors.

  4. Scalable Reinforcement Learning for Linear-Quadratic Control of Networks

    University essay from Lunds universitet/Institutionen för reglerteknik

    Author : Johan Olsson; [2023]
    Keywords : Technology and Engineering;

    Abstract : Distributed optimal control is known to be challenging and can become intractable even for linear-quadratic regulator problems. In this work, we study a special class of such problems where distributed state feedback controllers can give near-optimal performance.

  5. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space

    University essay from KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Andreu Matoses Gimenez; [2023]
    Keywords : Control theory; Multi-agent systems; Distributed systems; Formation control; Collision avoidance; Reinforcement learning;

    Abstract : In recent decades, significant theoretical advances have been made in the field of distributed multi-agent control theory. Among the most common problems that can be modelled as multi-agent systems are the so-called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation.