Question: 4. Instead of having < 1, we can have = 1 but with a negative reward of c for all intermediate (nongoal) states.
4. Instead of having γ < 1, we can have γ = 1 but with a negative reward of −c for all intermediate (nongoal) states. What is the difference?
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
