CAP 6629: Reinforcement Learning, Spring 2024
Course Project 2
Submission: Two files (one report in .pdf and one code file in .ipynb)
Please follow the project report guidelines and submit the report with setup, results and
analysis.
In the previous project you may have noticed that, with a large grid-world maze, it takes a long
time for the agent to learn a value table. One way to address this challenge is to use a neural
network to approximate the value function. Two options are provided below, and you may
choose either one to implement.
1. Based on your results in the previous project, build a neural network or deep
neural network to approximate the Q or V table you obtained.
2. Design another, more complex grid-world example and develop the QNN or deep
QNN method for it.
Either way, you use a neural network to generate the Q or V values that guide the agent to the goal.
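As a purely illustrative sketch (not the required solution), the network can be as simple as a small feed-forward model that maps an encoded grid position to one Q-value per action. The 5x5 grid size, one-hot state encoding, layer width, and four-action set below are assumptions you would replace with your own design.

```python
# Minimal Q-network sketch: one-hot grid cell in, one Q-value per action out.
# Grid size, action set, and layer width are placeholder assumptions.
import torch
import torch.nn as nn

N_STATES = 25      # e.g., a 5x5 grid world (assumption)
N_ACTIONS = 4      # up, down, left, right (assumption)

class QNetwork(nn.Module):
    def __init__(self, n_states: int = N_STATES, n_actions: int = N_ACTIONS):
        super().__init__()
        # One hidden layer is enough for a small maze; add layers for a "deep QNN".
        self.net = nn.Sequential(
            nn.Linear(n_states, 64),
            nn.ReLU(),
            nn.Linear(64, n_actions),
        )

    def forward(self, state_onehot: torch.Tensor) -> torch.Tensor:
        # Returns one Q-value per action for the given state encoding.
        return self.net(state_onehot)
```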
Report requirements:
Maze Description: Design your own grid world example and describe it at the beginning
of the report.
Problem Formulation: Define your states, actions, and rewards.
Q Network Design: Design and implement your Q network.
Pseudo Code: Provide the pseudo code in the report.
Results and Discussion: Show the convergence of the mean squared error objective
function and the weight trajectories (an illustrative training-loop sketch appears after this list).
References: Cite all your references here.
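Below is a minimal sketch of how option 1 could be trained and instrumented, assuming a tabular Q-table already learned in the previous project. The random stand-in table, one-hot state encoding, optimizer, learning rate, and epoch count are placeholder assumptions; the point is that mse_history and weight_history are the quantities the report asks you to plot.

```python
# Sketch: fit a small network to a previously learned Q-table and record the
# MSE curve and a few weight trajectories. All settings are assumptions.
import torch
import torch.nn as nn

N_STATES, N_ACTIONS = 25, 4                        # 5x5 maze, 4 moves (assumptions)
q_table = torch.rand(N_STATES, N_ACTIONS)          # stand-in for your learned Q-table
states = torch.eye(N_STATES)                       # one-hot encoding of every cell

model = nn.Sequential(nn.Linear(N_STATES, 64), nn.ReLU(), nn.Linear(64, N_ACTIONS))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.MSELoss()

mse_history, weight_history = [], []
for epoch in range(500):
    pred = model(states)                           # predicted Q-values for all states
    loss = loss_fn(pred, q_table)                  # mean squared error objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    mse_history.append(loss.item())                # data for the convergence plot
    # Snapshot a few first-layer weights for the weight-trajectory plot.
    weight_history.append(model[0].weight.detach().flatten()[:5].clone())
```

Plotting mse_history against the epoch index gives the convergence curve, and stacking the entries of weight_history gives the weight trajectories requested in the Results and Discussion section.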