Question: Your Task ( part 2 ) : Now that you've implemented q - learning for one task, you will move to the mountain car task.

Your Task (part 2):
Now that you've implemented q-learning for one task, you will move to the mountain car task.
Instead of 2 actions (left, right), this task has three (left, null, right). The task also has different
state variables (only 2)
?x : the location of the robot is the left, -.45 is approximately the valley, 0.6 is the
rightmost part of the board, 0.5 is the location of the flag)
? xdot: the velocity of the robot (this can go from -0.07 to 0.07)
This will require you to change the number of bins for state descritization as well as the
alpha and gamma values. Additionally, you need to implement the exploration vs
exploitation part for this problem as well.
Once your model is trained it will be saved the Q-table as 'car.npy' file. Make sure to that you
don't change this file name.
 Your Task (part 2): Now that you've implemented q-learning for one

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!