Essays about: "estimeringsträning"

Found 1 essay containing the word estimeringsträning.

  1. 1. Stuck state avoidance through PID estimation training of Q-learning agent

    University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)

    Author : Johan Moritz; Albin Winkelmann; [2019]
    Keywords : Q-learning; QL; PID; wheeled inverted pendulum; WIP; reinforcement learning; estimation training; Q-learning; QL; PID; självbalanserande robot; reinforcement learning; estimeringsträning;

    Abstract : Reinforcement learning is conceptually based on an agent learning through interaction with its environment. This trial-and-error learning method makes the process prone to situations in which the agent is stuck in a dead-end, from which it cannot keep learning. READ MORE