Question: Implement Q-learning on the Taxi-v3 environment in the OpenAI Gym framework
Assignment
The goal of this assignment is to implement the Q-learning method on the Taxi-v3 environment in the OpenAI Gym framework.
Your task in this environment is to pick up the passenger at one location and drop him off at another. There are four possible locations, labeled by different letters (R, G, Y, and B). In the example given below, you are expected to pick him up at Y and drop him off at G. You receive +20 points for a successful dropoff and lose 1 point for every timestep it takes. There is also a 10-point penalty for illegal pickup and dropoff actions.
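To make the reward structure concrete, here is a small sketch that computes the undiscounted return of one episode. The values +20, -1, and -10 are taken from the standard Taxi-v3 description and are assumptions if your version differs. The function name `episode_return` is hypothetical, and the sketch assumes the successful dropoff step earns +20 in place of the usual -1 step penalty.

```python
# Reward values assumed from the standard Taxi-v3 description.
STEP_REWARD = -1        # every ordinary timestep
DROPOFF_REWARD = 20     # successful dropoff (assumed to replace the step penalty)
ILLEGAL_PENALTY = -10   # illegal pickup or dropoff (replaces the step penalty)


def episode_return(total_steps, illegal_actions, delivered=True):
    """Undiscounted sum of rewards for one episode (hypothetical helper)."""
    final_step = 1 if delivered else 0
    ordinary_steps = total_steps - illegal_actions - final_step
    if ordinary_steps < 0:
        raise ValueError("more special steps than total steps")
    total = ordinary_steps * STEP_REWARD + illegal_actions * ILLEGAL_PENALTY
    if delivered:
        total += DROPOFF_REWARD
    return total


print(episode_return(12, 0))   # 11 * (-1) + 20 = 9
print(episode_return(15, 2))   # 12 * (-1) + 2 * (-10) + 20 = -12
```

Under these assumptions, a shorter route and fewer illegal actions both raise the return, which is the quantity the learning agent tries to maximize.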
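Note that the dynamics of the model are assumed to be unknown, so the agent must learn from sampled interaction with the environment rather than from a known transition model.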
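Because the dynamics are unknown, a model-free method such as tabular Q-learning fits the task: it updates an action-value table from observed transitions only. The following is a minimal sketch, not a definitive solution. The function names, the hyperparameter values (`alpha`, `gamma`, `epsilon`, `episodes`, `max_steps`), and the use of the Gymnasium package (the maintained fork of OpenAI Gym, with `reset()` returning `(obs, info)` and `step()` returning a 5-tuple) are all assumptions for illustration.

```python
import random
from collections import defaultdict


def q_learning(env, n_actions, episodes=5000, alpha=0.1, gamma=0.99,
               epsilon=0.1, max_steps=200, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy.

    Assumes env.reset() -> (state, info) and
    env.step(a) -> (next_state, reward, terminated, truncated, info).
    """
    rng = random.Random(seed)
    # Q-table: state -> list of action values, initialised to zero.
    q = defaultdict(lambda: [0.0] * n_actions)

    for _ in range(episodes):
        state, _info = env.reset()
        for _ in range(max_steps):
            # Explore with probability epsilon, otherwise act greedily.
            if rng.random() < epsilon:
                action = rng.randrange(n_actions)
            else:
                action = max(range(n_actions), key=lambda a: q[state][a])

            next_state, reward, terminated, truncated, _info = env.step(action)

            # Q-learning target: bootstrap from the best next action,
            # except when the episode has truly terminated.
            if terminated:
                target = reward
            else:
                target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])

            state = next_state
            if terminated or truncated:
                break
    return q


def greedy_action(q, state):
    """Best action under the learned table (hypothetical helper)."""
    values = q[state]
    return max(range(len(values)), key=lambda a: values[a])


def train_on_taxi(episodes=5000):
    """Assumption: the gymnasium package is installed and provides Taxi-v3."""
    import gymnasium as gym
    env = gym.make("Taxi-v3")
    return q_learning(env, n_actions=int(env.action_space.n), episodes=episodes)
```

Calling `train_on_taxi()` would return the learned table, and `greedy_action(q, state)` then gives the action to take in a given state. The hyperparameters above are starting points only and may need tuning, for example by decaying `epsilon` over time.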
