Question: Maze Navigation Reinforcement Learning Java, C++, C, Python For the maze, assume bottom left cell is at (1, 1) and the format for the coordinates

Maze Navigation Reinforcement Learning Java, C++, C, Python

For the maze, assume bottom left cell is at (1, 1) and the format for the coordinates is (, )

b. 10 X 10 world with no obstacles and reward +1 at (5, 5)

Write a program that prompts the user via a start menu to first select the RL algorithm (1) Direct utility estimation, (2) Adaptive Dynamic Programming, (3)Temporal Difference. Your program should select an appropriate number of trials or epochs to learn the utilities and/or model. When the algorithm finishes, your program should again prompt the user to input a start state (two integer coordinates separated by a space, with check for input being a valid state inside environment, not obstacle). From this start state, your agent should navigate until it reaches a terminal state (correct operation should reach the +1 terminal state). Your program should then printout the coordinates of the states the agent navigated through until it reached the terminal state.

Remember to return the program to the start menu after each run, and add an exit option to the start menu, so that the program can be tested multiple times.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Calculate Pearson's correlation coefficient () between the variables Weight and Head in the babyanth.complete data frame using the following formula. Include the correlation value you calculated in a...

Jones & Bartlett Learning, LLC. NOT FOR RESALE OR DISTRIBUTION CHAPTER Hot Spot Analysis 10 LEARNING OBJECTIVES C A R R Provide a working definition of a \"hot spot.\" , Be able to explain different...

I am working on this homework, and I don't even know where to start. The program is written in C++ Use a 2D array to represent a maze. Start with a 10x10 array of char. The array of char can hold...

Generate a random two-dimensional square maze whose size is specified by the user; and Read in a maze from a given text file (more about this later). Once the program has generated a random maze or...

(JAVA - DATA STRUCTURES) PLEASE, DO NOT AVOID MY QUESTION. THIS IS THE THIRD TIME I POST THIS QUESTION AND NOBODY WANTS TO HELP ME. The code that you write for this assignment will build on top of...

Please solve the following question using Haskell Problem Description In this project, you will write a simple Haskell program to solve the following problem: You are given a maze of h x w cells and...

A discrete sequence {xn} can be converted into a continuous representation x(t) = ts X n= (t n ts) xn, where ts is the sampling period. (a) State two characteristic properties of Dirac's function. [2...

CHA P TER 9 Understanding Software: A Primer for Managers 1. INTRODUCTION L E A R N I N G O B J E C T I V E S 1. Recognize the importance of software and its implications for the rm and strategic...

Java Ex. 1. a. Table class. Open either Eclipse or NetBeans. Create a project called lab2 and a class called Table inside it. Attributes. Add an instance attribute to this class called maze as a 2D...

Experiment 3 Acceleration Due To Gravity Questions How do you compute an object's velocity from its position? How do you measure the acceleration of a mass as it falls under the influence of gravity?...

Make the following entries from Credit Card and Debit Card Sales. a) A sale of $2,000 was made + HST 13%. The customer paid with his Bank VISA Card. The Service fee is 4%. (11 Marks) b) A sale of...

A survey conducted by JCB asked 250 whether or not they have shopped on the new Shopping Mall. HAVE NEVER GENDER HAVE SHOPPED TAL SHOPPED Male 20 70 90 Female 130 30 160 Total 150 100 250 The...

14. Under what conditions must an employer accrue a liability for the cost of compensated absences?

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

What are Measures in OLAP Cubes?

How do OLAP Databases provide for Drilling Down into data?

How are OLAP Cubes different from Production Relational Databases?