Question: Then in the while loop insert steps = steps + 1 and then change return route to return route, steps 7. The Q-learning process for


Then in the while loop insert steps = steps + 1 and then change return route to return route, steps 7. The Q-learning process for loop in get_optimal_route() performs 1000 iterations. Do you think these many iterations are required? Try with 50. Try with 200. Explain what happens and why you think it is happening. Remember to set it back to 1000 iterations before you work on the next questions
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
