Essays about: "estimeringsträning"
Found 1 essay containing the word estimeringsträning.
-
1. Stuck state avoidance through PID estimation training of Q-learning agent
University essay from KTH/Skolan för elektroteknik och datavetenskap (EECS)Abstract : Reinforcement learning is conceptually based on an agent learning through interaction with its environment. This trial-and-error learning method makes the process prone to situations in which the agent is stuck in a dead-end, from which it cannot keep learning. READ MORE
Result pages:
1