Compute the true utility function and the best linear
Compute the true utility function and the best linear approximation in x and y (as in Equation (21.9)) for the following environments:
a. A l0 x 10 world with a single +1 terminal state at (10, 10).
b. As in (a), but add a —1 terminal state at (10, 1).
c. As in (b), but add obstacles in 10 randomly selected squares.
d. As in (b), but place a wall stretching from (5, 2) to (5, 9).
e. As in (a), but with the terminal state at (5, 5).
The actions are deterministic moves in the four directions. In each case, compare the results using three-dimensional plots. For each environment, propose additional features (besides x and y) that would improve the approximation and show the results.
Membership TRY NOW
  • Access to 800,000+ Textbook Solutions
  • Ask any question from 24/7 available
    Tutors
  • Live Video Consultation with Tutors
  • 50,000+ Answers by Tutors
OR
Relevant Tutors available to help