Essays about: "Reward functions"

Showing results 1 - 5 of 34 essays containing the words Reward functions.

  1. Decreasing Training Time of Reinforcement Learning Agents for Remote Tilt Optimization using a Surrogate Neural Network Approximator

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jiaming Huang; [2023]

    Abstract : One possible application of reinforcement learning in the telecommunication field is antenna tilt optimization. However, one of the key challenges is that training reinforcement learning agents is often time-consuming when handcrafted simulators serve as the environments that provide information to the agents.

  2. Påverkas anknytningsbeteende hos hund av ägarens personlighet och/eller vuxna anknytningsstil? [Is attachment behavior in dogs affected by the owner's personality and/or adult attachment style?]

    University essay from SLU/Dept. of Clinical Sciences

    Author : Andres Ellexelius; [2023]
    Keywords : dog-human interaction; nursing care; adult attachment style; personality; Big Five; ASQ; attachment style; dog behavior; anthrozoology;

    Abstract : The dog can form a bond with its owner that is as unique as it is important. Through evolutionary adaptations and selective breeding, several changes have occurred that have altered the dog's morphology, attachment system, and reward system. One result of this is the dog's ability to form attachment bonds with its owner.

  3. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space

    University essay from KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Andreu Matoses Gimenez; [2023]
    Keywords : Control theory; Multi-agent systems; Distributed systems; Formation control; Collision avoidance; Reinforcement learning;

    Abstract : In recent decades, significant theoretical advances have been made in the field of distributed multi-agent control theory. Among the most common problems that can be modelled as multi-agent systems are the so-called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation.

  4. Improving Behavior Trees that Use Reinforcement Learning with Control Barrier Functions : Modular, Learned, and Converging Control through Constraining a Learning Agent to Uphold Previously Achieved Sub Goals

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Jannik Wagner; [2023]
    Keywords : Behavior Trees; Reinforcement Learning; Control Barrier Functions; Robotics; Artificial Intelligence;

    Abstract : This thesis investigates combining learning action nodes in behavior trees with control barrier functions based on the extended active constraint conditions of the nodes, and whether the approach improves performance, in terms of training time and policy quality, compared to a purely learning-based approach. Behavior trees combine several behaviors, called action nodes, into one behavior by switching between them based on the current state.

  5. A Generic Model of Motivation in Artificial Animals Based on Reinforcement Learning

    University essay from Göteborgs universitet/Institutionen för data- och informationsteknik

    Author : Birger Kleve; Pietro Ferrari; [2022-05-06]
    Keywords : reinforcement; learning; reward; shaping; animat; homeostasis; ecosystem; motivation;

    Abstract : This thesis is part of a broader research project at Chalmers University of Technology focused on ecosystem simulations using reinforcement learning artificial animals, called animats. The scope of this project is to provide animats with a reward signal that should ultimately drive their learning towards adaptation to their environment.