Essays about: "estimeringsträning"

Found 1 essay containing the word estimeringsträning.

1. Stuck state avoidance through PID estimation training of Q-learning agent
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)
Author : Johan Moritz; Albin Winkelmann; [2019]
Keywords : Q-learning; QL; PID; wheeled inverted pendulum; WIP; reinforcement learning; estimation training; self-balancing robot; estimeringsträning;

Abstract : Reinforcement learning is conceptually based on an agent learning through interaction with its environment. This trial-and-error learning method makes the process prone to situations in which the agent is stuck in a dead-end, from which it cannot keep learning.
