1. Reinforcement Learning – Intelligent Weighting of Monte Carlo and Temporal Differences
University essay from Lunds universitet/Institutionen för reglerteknik
Abstract: In reinforcement learning, the updating of the value functions determines how information spreads across the state/state-action space, which condenses the value-based control policy. It is important that information propagates across the value domain effectively.