What happens if the temporal difference algorithm of Problem 13 plays tic-tac-toe against itself? Data from problem

Question:

What happens if the temporal difference algorithm of Problem 13 plays tic-tac-toe against itself?

Data from problem 13

Consider the tic-tac-toe example of Section 10.7.2. Implement the temporal difference learning algorithm in the language of your choice. If you designed the algorithm to take into account problem symmetries, what do you expect to happen? How might this limit your solution?

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Question Posted: