Question: 2- Implement an exploring reinforcement learning agent that uses direct utility estimation. Make two versions-one with a tabular representation and one using the function approximator

2- Implement an exploring reinforcement learning agent that uses direct utility estimation. Make two versions-one with a tabular representation and one using the function approximator U^(x,y)=0+1x+2y. Compare their performance in the environments: A 1010 world with no obstacles and a+1 reward at (5,5)
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
