Question: Consider an (N + 1) (N + 1) (N + 1) cubic gridworld. Luckily, all the cells are empty there are no
Consider an (N + 1) × (N + 1) × (N + 1) cubic gridworld. Luckily, all the cells are empty – there are no walls within the cube. For each cell, there is an action for each adjacent facing open cell (no corner movement), as well as an action stay. The actions all move into the corresponding cell with probability p but stay with probability 1 − p. Stay always stays. The reward is always zero except when you enter the goal cell at (N, N, N), in which case it is 1 and the game then ends. The discount is 0 < γ < 1.
a. How many iterations k of value iteration will there be before Vk(0, 0, 0) becomes nonzero? If this will never happen, write never.
b. If and when Vk(0, 0, 0) first becomes non-zero, what will it become? If this will never happen, write never.
c. What is V∗ (0, 0, 0)? If it is undefined, write undefined.
Step by Step Solution
3.48 Rating (164 Votes )
There are 3 Steps involved in it
a 3N At V 0 the value of the goal is correct At V 1 all cells next to the goal are nonzero at V 2 al... View full answer
Get step-by-step solutions from verified subject matter experts
