Question: Suppose that states for a certain environment are represented as three - element tuples of floating point values. Assume also that agents in the environment
Suppose that states for a certain environment are represented as threeelement tuples of floating point values. Assume also that agents in the environment has three actions Final Layer Weights
Suppose that a state is provided to the network. After the second layer is processed, the output of the nodes A B C and D are as shown below:
: :::
Use the information provided to calculate
available from any state, and that the actions are encoded as integers and The neural network used to approximate the actionvalue function in a DQN for the
environment is shown in the figure below. A description of the network is as follows:
The three values representing a state provided as inputs to the network.
The network has hidden layers, each with nodes, and each using a ReLU activation function.
The output layer has three nodes, one for each action.
The final layer does not use an activation function, and simply outputs the weighted sums of the inputs.
Notice that in the diagram above, the input nodes for the final layer are denoted by letters A D and the output nodes are denoted by letters E G The weight associated with
each connection between pairs of nodes in the final layer are provided in the table below.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
