Question: Gridworld - Q Learning Create a 5 5 grid world An agent to move around Four possible actions Have a goal state. Reward a Goal

Gridworld - Q Learning
Create a 55 grid world
An agent to move around
Four possible actions
Have a goal state.
Reward a Goal =5 and Another
terminal state =-5
Elsewhere Reward =0
Any action that takes you outside
boundary, Reward =-1
Run 100,000 episodes
Keep a random no. seed
Plot the converged policy and value function for this grid world.
Do it for =0.1,0.5 and 0.9, take epsilon =0.1.
For gamma =0.9, plot the no. of steps to reach the goal across
episodes for epsilon =0.1,0.3 and 0.5.
For all the above, keep the learning rate alpha =0.1.
 Gridworld - Q Learning Create a 55 grid world An agent

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!