Question: Exercise 11.11 In SARSA with linear function approximators, if you use linear regression to minimize r + Qw(s, a) Qw(s, a), you get a
Exercise 11.11 In SARSA with linear function approximators, if you use linear regression to minimize r + γQw(s, a) − Qw(s, a), you get a different result than we have here. Explain what you get and why what is described in the text may be preferable (or not).
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
