Question: I am having trouble with this assignment, The program we are using is MatLab and honestly I am not to sure where to even start

I am having trouble with this assignment, The program we are usingI am having trouble with this assignment, The program we are using is MatLab and honestly I am not to sure where to even start

- Correct initialization proper n*n Q-matrix, R matrix or vector, etc. according to your implementation ): 3 points - Correct transition function or matrix to get the next state given the current state and the action: 3 points - Correct function or code block for choosing a random and valid action, or similar 3 points - Implement episode iterations, calculate q value and update q matrix correctly: 6 points - Return the correct path of reaching the goal state given Q matrix : 5 points (this means you need to create a concrete gridworld using your implementation and find the solution) Ps: you can set learning rate alpha equal to 1 so as to use the simplest form of equation : Q(s.a)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!