Essays about: "Reward functions"
Showing result 1 - 5 of 34 essays containing the words Reward functions.
-
1. Decreasing Training Time of Reinforcement Learning Agents for Remote Tilt Optimization using a Surrogate Neural Network Approximator
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : One possible application of reinforcement learning in the telecommunication field is antenna tilt optimization. However, one of key challenges we face is that the use of handcrafted simulators as environments to provide information for agents is often time-consuming regarding training reinforcement learning agents. READ MORE
-
2. Påverkas anknytningsbeteende hos hund av ägarens personlighet och/eller vuxna anknytningsstil?
University essay from SLU/Dept. of Clinical SciencesAbstract : Hunden kan skapa ett band med sin djurägare som är lika unikt som viktigt. Genom evolutionära anpassningar och selektiv avel och har flera förändringar skett som förändrat hundens morfologi, anknytningssystem och belöningssystem. Ett resultat av detta är bland annat hundens förmåga att forma anknytningsband till sin djurägare. READ MORE
-
3. Scalable Reinforcement Learning for Formation Control with Collision Avoidance : Localized policy gradient algorithm with continuous state and action space
University essay from KTH/Skolan för teknikvetenskap (SCI); KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : In the last decades, significant theoretical advances have been made on the field of distributed mulit-agent control theory. One of the most common systems that can be modelled as multi-agent systems are the so called formation control problems, in which a network of mobile agents is controlled to move towards a desired final formation. READ MORE
-
4. Improving Behavior Trees that Use Reinforcement Learning with Control Barrier Functions : Modular, Learned, and Converging Control through Constraining a Learning Agent to Uphold Previously Achieved Sub Goals
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : This thesis investigates combining learning action nodes in behavior trees with control barrier functions based on the extended active constraint conditions of the nodes and whether the approach improves the performance, in terms of training time and policy quality, compared to a purely learning-based approach. Behavior trees combine several behaviors, called action nodes, into one behavior by switching between them based on the current state. READ MORE
-
5. A Generic Model of Motivation in Artificial Animals Based on Reinforcement Learning
University essay from Göteborgs universitet/Institutionen för data- och informationsteknikAbstract : This thesis is a part of a broader research project at Chalmers University of Technology focused on ecosystems’ simulations using reinforcement learning artificial animals, called animats. The scope of this project is to provide animats with a reward signal which should ultimately drive animats’ learning towards adaptation of their environment. READ MORE