Question:

Another problem type excellent for reinforcement learning is the so-called grid world. We present a simple 4 x 4 grid world in Figure 10.26. The two greyed corners are the desired terminal states for the agent. From all other states, the agent can move up, down, left, or right. The agent cannot move off the grid: attempting to do so leaves the state unchanged. The reward for all transitions, except those to the terminal states, is −1. Work through a sequence of grids that produce a solution based on the temporal difference algorithm presented in Section 10.7.2. See Chapter 13 for more discussion of the grid world problem.

Figure 10.26 An example of a 4 x 4 grid world, adapted from Sutton and ... (figure labels: actions; r = -1 on all transitions)
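One way to approach the exercise is to let a program produce the sequence of grids. The Python sketch below is not the textbook's solution. It assumes the standard tabular TD(0) update, V(s) ← V(s) + α[r + γV(s') − V(s)], which may differ in notation from the algorithm in Section 10.7.2. It also assumes a uniformly random policy, a step size α = 0.1, no discounting (γ = 1), and a reward of 0 for transitions into a terminal state. The question text does not give that last value, and the figure label reads "r = -1 on all transitions", so it is exposed as the parameter terminal_reward. The names step, td0, and show are hypothetical helpers introduced for illustration. Starting from a grid of zeros, the first update for a move between two non-terminal states is 0 + 0.1 × (−1 + 0 − 0) = −0.1.

```python
import random

N = 4                                   # grid is N x N
TERMINALS = {(0, 0), (N - 1, N - 1)}    # the two greyed corners
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action, terminal_reward=0.0):
    """Return (next_state, reward). Off-grid moves leave the state unchanged."""
    dr, dc = ACTIONS[action]
    r, c = state[0] + dr, state[1] + dc
    if not (0 <= r < N and 0 <= c < N):
        r, c = state
    # Assumption: entering a terminal state yields terminal_reward, else -1.
    reward = terminal_reward if (r, c) in TERMINALS else -1.0
    return (r, c), reward


def td0(episodes, alpha=0.1, gamma=1.0, terminal_reward=0.0, seed=0):
    """Estimate state values under a uniformly random policy with TD(0)."""
    rng = random.Random(seed)
    states = [(r, c) for r in range(N) for c in range(N)]
    values = {s: 0.0 for s in states}   # terminal values stay at 0
    starts = [s for s in states if s not in TERMINALS]
    for _ in range(episodes):
        s = rng.choice(starts)
        while s not in TERMINALS:
            a = rng.choice(list(ACTIONS))
            s2, reward = step(s, a, terminal_reward)
            # TD(0): move V(s) toward the one-step target r + gamma * V(s').
            values[s] += alpha * (reward + gamma * values[s2] - values[s])
            s = s2
    return values


def show(values):
    """Print the value estimates laid out as the 4 x 4 grid."""
    for r in range(N):
        print(" ".join(f"{values[(r, c)]:7.2f}" for c in range(N)))


if __name__ == "__main__":
    # Same seed each time, so each grid continues the run shown before it.
    for n in (1, 10, 100, 1000):
        print(f"after {n} episodes")
        show(td0(n))
```

Calling td0 with terminal_reward=-1.0 follows the figure label instead of the question text. In either case the estimates should become more negative with distance from the greyed corners, so moving toward the neighbouring state with the highest value gives a route to a terminal state.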

 


Step by Step Solution

Certainly. Let's explore the grid world problem and work through a sequence of grids using the tempora...
