Assignment
The goal of this assignment is to implement the Q-learning method on the Taxi-v3 environment in the OpenAI Gym framework.
Your task in this environment is to pick up the passenger at one location and drop him off at another. Each is at one of 4 possible locations (labeled by different letters). In the example given below, you are expected to pick him up at Y and drop him off at G. You receive +20 points for a successful drop-off and lose 1 point for every timestep it takes. There is also a 10-point penalty for illegal pick-up and drop-off actions.
Note that the dynamics of the model are assumed to be unknown, so the agent must learn from sampled transitions only.
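
Because the dynamics are unknown, a tabular Q-learning agent is one natural fit: it only needs sampled (state, action, reward, next state) transitions. The sketch below is a minimal illustration, not a reference solution. The function names (`epsilon_greedy`, `q_update`, `train`) and all hyperparameter values (`alpha`, `gamma`, `epsilon`, `episodes`, `max_steps`) are assumptions chosen for illustration; the assignment does not specify them. The loop accepts any environment with Gym-style `reset()` and `step()` methods and tries to handle both the older 4-tuple and the newer 5-tuple return of `step()`.

```python
import random


def epsilon_greedy(q_row, epsilon, rng=random):
    """Pick a random action with probability epsilon, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    best = max(q_row)
    # Break ties at random so early all-zero rows do not bias exploration.
    return rng.choice([a for a, q in enumerate(q_row) if q == best])


def q_update(q, state, action, reward, next_state, done, alpha, gamma):
    """One tabular Q-learning step: Q(s,a) += alpha * (target - Q(s,a))."""
    # No bootstrapping from the next state once the episode has terminated.
    target = reward if done else reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])


def train(env, n_states, n_actions, episodes=5000, alpha=0.1,
          gamma=0.99, epsilon=0.1, max_steps=200, seed=0):
    """Learn a Q-table from interaction only (hyperparameters are assumed)."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(episodes):
        out = env.reset()
        # Newer Gym/Gymnasium versions return (observation, info).
        state = out[0] if isinstance(out, tuple) else out
        for _ in range(max_steps):
            action = epsilon_greedy(q[state], epsilon, rng)
            result = env.step(action)
            if len(result) == 5:
                # (obs, reward, terminated, truncated, info)
                next_state, reward, terminated, truncated, _ = result
            else:
                # Older API: (obs, reward, done, info); done is treated
                # as termination, which is an approximation at time limits.
                next_state, reward, terminated, _ = result
                truncated = False
            q_update(q, state, action, reward, next_state,
                     terminated, alpha, gamma)
            state = next_state
            if terminated or truncated:
                break
    return q
```

With Gym installed, this could be called as `train(env, env.observation_space.n, env.action_space.n)` where `env = gym.make("Taxi-v3")`. Taxi-v3 has 500 discrete states and 6 actions, so the table stays small. A greedy policy is then read off as the highest-valued action in each row of the returned table.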
