Compute the true utility function and the best linear approximation in x and y (as in Equation (21.9)) for the following environments:
a. A l0 x 10 world with a single +1 terminal state at (10, 10).
b. As in (a), but add a —1 terminal state at (10, 1).
c. As in (b), but add obstacles in 10 randomly selected squares.
d. As in (b), but place a wall stretching from (5, 2) to (5, 9).
e. As in (a), but with the terminal state at (5, 5).
The actions are deterministic moves in the four directions. In each case, compare the results using three-dimensional plots. For each environment, propose additional features (besides x and y) that would improve the approximation and show the results.

  • CreatedFebruary 14, 2011
  • Files Included
Post your question