This question provides a simple two-layer neural network with a hidden layer of three neurons an...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
This question provides a simple two-layer neural network with a hidden layer of three neurons an output layer of one neuron, and two input neurons, In both the hidden layer and the output layer, the sigmoid function is considered as an activation function. X= [x1, x2] is a vector that the network receives as input. A parameter vector called W (¹)= [w1,w2, w3, w4, w5, w6] connects the input layer to the hidden layer. A parameter vector W(2) [w7, w8, w9] is the parameter that connects the hidden layer to the output layer (examples shown in the figure below). X1 Input W₁ W7 W₂ Output (a) Write the mathematical expression for the output of the hidden layer, including the bias term. (10%) (b) Write the mathematical expression for the output of the output layer, including the bias term. (10 %) (c) The true label for this input is y = 0. The cost function used in this network is the mean squared error (MSE). Write the mathematical expression for the cost for this input. (10%) (d) Use backpropagation to calculate the gradient descent of the cost with respect to the weights connecting the input layer to the hidden layer [w1,w2, w3, w4, w5, w6] and the weights connecting the hidden layer to the output layer [w7, w8, w9]. (10%) This question provides a simple two-layer neural network with a hidden layer of three neurons an output layer of one neuron, and two input neurons, In both the hidden layer and the output layer, the sigmoid function is considered as an activation function. X= [x1, x2] is a vector that the network receives as input. A parameter vector called W (¹)= [w1,w2, w3, w4, w5, w6] connects the input layer to the hidden layer. A parameter vector W(2) [w7, w8, w9] is the parameter that connects the hidden layer to the output layer (examples shown in the figure below). X1 Input W₁ W7 W₂ Output (a) Write the mathematical expression for the output of the hidden layer, including the bias term. (10%) (b) Write the mathematical expression for the output of the output layer, including the bias term. (10 %) (c) The true label for this input is y = 0. The cost function used in this network is the mean squared error (MSE). Write the mathematical expression for the cost for this input. (10%) (d) Use backpropagation to calculate the gradient descent of the cost with respect to the weights connecting the input layer to the hidden layer [w1,w2, w3, w4, w5, w6] and the weights connecting the hidden layer to the output layer [w7, w8, w9]. (10%)
Expert Answer:
Answer rating: 100% (QA)
a Output of the hidden layer h W1X b1 where h is a vector of size 3 representing the output of the hidden layer neurons is the sigmoid function z 1 1 expz W1 is the weight matrix of size 3x2 connectin... View the full answer
Related Book For
Applied Statistics And Probability For Engineers
ISBN: 9781118539712
6th Edition
Authors: Douglas C. Montgomery, George C. Runger
Posted Date:
Students also viewed these programming questions
-
How did Vulcan Energy generate revenue for the company prior to its acquisition of the renewable energy plant?
-
What do you understand about perception? Do you think that perception is an interesting topic? Why is this perception important to you? Will this new information change your perception? What...
-
The average annual salary for all U.S. teachers is $47,750. Assume that the distribution is normal and the standard deviation is $5680. Find the probability that a randomly selected teacher earns a....
-
Who most exemplifies the virtue of courage-the person who finds it difficult to be brave or the person who finds it easy to be courageous?
-
The graph of y = (x 2 ) x has two horizontal tangent lines. Find equations for both of them.
-
State which one of the following is correct: (a) \(d G=-S d T+V d P\) (b) \(d U=T d S+P d V\) (c) \(d H=T d S-V d P\) (d) \(d A=-S d T+P d V\).
-
Data collected on the yearly registrations for a Six Sigma seminar at the Quality College are shown in the following table: (a) Develop a 3-year moving average to forecast registrations from year 4...
-
Given a ring R, we can consider the unit group R*. For each of the following rings R, find the structure of the group R*. (a) R=Z/18Z (b) R = Z[a] (where = 2 + i)
-
Read the case study and answer the question below with a one page response. What does a SWOT analysis reveal about the overall attractiveness of Under Armours situation? Founded in 1996 by former...
-
In a vertical jump shown below, there is a velocity versus time graph. a ) ? Where is the person jumping in the air? How do you know this? b ) ? If the time between each tick in the graph is 0 . 1...
-
Sketch the graph of each equation in Problems 3-30. \(y=-2 x^{2}+3\)
-
Sketch the graph of each equation in Problems 3-30. \(y=9-x^{2}\)
-
Graph the solution of each system given in Problems 5-18. \(\left\{\begin{array}{l}x-y \geq 0 \\ y \leq 0\end{array} ight.\)
-
Lets assume that human height is a polygenic trait determined by three genes: A, B, and C. For each gene, there is a tall allele and a short allele. Well use T and S to indicate these. Seven people...
-
Graph the first-degree inequalities in two unknowns in Problems 13-48. \(x+4 y>-12\)
-
After two years study, you finally graduate and start a job as Junior accountant at XYZ Inc. Your manager is responsible for the nationwide distribution of men grooming sets. Because of the new...
-
An annual report of The Campbell Soup Company reported on its income statement $2.4 million as equity in earnings of affiliates. Journalize the entry that Campbell would have made to record this...
-
Consider the unlisted telephone number data in Exercise 10-101. Find 95% CIs on the difference in the proportions of unlisted telephone numbers for Phoenix and Scottsdale residents using both...
-
The nine measurements that follow are furnace temperatures recorded on successive batches in a semiconductor manufacturing process (units are F): 953, 950, 948, 955, 951, 949, 957, 954, 955. (a)...
-
An experiment was run to determine whether four specific firing temperatures affect the density of a certain type of brick. The experiment led to the following data. (a) Does the firing temperature...
-
Express the vibration of a machine given by \(x(t)=-3.0 \sin 5 t-2.0 \cos 5 t\) in the form \(x(t)=A \cos (5 t+\phi)\).
-
An exponential function is expressed as \(x(t)=A e^{-\alpha t}\) with the values of \(x(t)\) known at \(t=1\) and \(t=2\) as \(x(1)=0.752985\) and \(x(2)=0.226795\), respectively. Determine the...
-
If the motion of a machine is described as \(8 \sin (5 t+1)=A \sin 5 t+B \cos 5 t\), determine the values of \(A\) and \(B\).
Study smarter with the SolutionInn App