Question: Reward function Question: Implement the reward function described in the setup. Specifically, given a - dimensional vector 'avg', return - dimensional vector 'rew' such that
Reward function Question: Implement the reward function described in the setup. Specifically, given a - dimensional vector 'avg', return - dimensional vector 'rew' such that []=[]+[] where [](0,) where is the identity matrix of size
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
