Reinforcement Learning: Frozen Lake Toy Example
Completion: March 2023
Using Q-learning with decaying exploration, the agent learns over many episodes the optimal path to the goal while avoiding holes in the ice.
The exploration and learning-rate parameters are tuned to maximize the agent's success rate.
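
The approach described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: it assumes a hand-rolled, non-slippery 4x4 grid in place of a library environment, and the names `GRID`, `step`, and `train`, along with every hyperparameter value (`alpha`, `gamma`, `eps_decay`, and so on), are hypothetical choices made for the example.

```python
import random

# Assumed 4x4 layout: S = start, F = frozen, H = hole, G = goal.
GRID = ["SFFF", "FHFH", "FFFH", "HFFG"]
SIZE = 4
MOVES = [(0, -1), (1, 0), (0, 1), (-1, 0)]  # left, down, right, up as (d_row, d_col)


def step(state, action):
    """Deterministic (non-slippery) move; returns (next_state, reward, done)."""
    row, col = divmod(state, SIZE)
    d_row, d_col = MOVES[action]
    row = min(max(row + d_row, 0), SIZE - 1)  # walls keep the agent on the grid
    col = min(max(col + d_col, 0), SIZE - 1)
    tile = GRID[row][col]
    return row * SIZE + col, float(tile == "G"), tile in "HG"


def train(episodes=5000, alpha=0.1, gamma=0.95, eps_start=1.0,
          eps_min=0.01, eps_decay=0.999, max_steps=100, seed=0):
    """Tabular Q-learning with an epsilon-greedy policy whose epsilon decays per episode."""
    rng = random.Random(seed)
    q = [[0.0] * 4 for _ in range(SIZE * SIZE)]
    wins = []  # 1 if the episode reached the goal, else 0
    eps = eps_start
    for _ in range(episodes):
        state, reward = 0, 0.0
        for _ in range(max_steps):
            if rng.random() < eps:
                action = rng.randrange(4)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a in range(4) if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Q-learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a').
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
        wins.append(int(reward == 1.0))
        eps = max(eps_min, eps * eps_decay)  # decay exploration after each episode
    return q, wins


q_table, wins = train()
print("success rate over the last 500 episodes:", sum(wins[-500:]) / 500)
```

Changing `alpha`, `eps_decay`, or `eps_min` in the call to `train` and comparing the printed success rate is one simple way to explore how the learning rate and exploration schedule affect the outcome.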
Tags: Machine Learning Python Columbia