Reinforcement Learning-Frozen Lakes Toy Example

Completion: March 2023

Using Q-Learning and Decaying exploration, after many episodes, the man learns the optimal path to reach the goal while avoiding holes in the ice.
By adjusting the exploration and learning-rate parameters, the success rate of the program is maximized.

Tags: Machine Learning Python Columbia

How to contact me


Email
cjd2186@columbia.edu