Question: Coding Problem ( * ) Create a Python function named mdp _ policy _ enumerate to enumerate all the possible stationary policies in a MDP
Coding Problem
Create a Python function named mdppolicyenumerate to enumerate all the possible stationary policies
in a MDP The detailed requirements are listed as follows.
Two inputs: set of states; set of actions.
Output: a list that contains all the policies.
Test the function using the states and actions from the machine repair example.
Packages and functions you may need to use click will lead you to the webpages:
itertools.product
Note: adding comment in each line of code will receive one extra point given that code runs correctly.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
