Question: (b) Use amortization (specifically, the potential method) to obtain an upper bound on the best possible average reward. If it helps to think in terms

(b) Use amortization (specifically, the potential method) to obtain an upper bound on the best possible average reward. If it helps to think in terms of cost instead, pretend that your friend is playing this game and you must pay their reward. In this context, you want to find an upper bound on the average cost per move. For amortization, as
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
