Consider the following neural network...
Question:
Transcribed Image Text:
Consider the following neural network:

[Figure not reproduced: an input neuron \(a_0\) and neurons \(a_1\), \(a_2\), \(a_3\), \(a_4\), joined by seven weighted connections. The index labels on the weights are lost in the transcription.]

This is not a standard neural network, as its neurons are not fully connected. The connections are shown in the figure above. The neural network does not contain bias in the forward propagation, so that \(a_i = \sum_j w_i^j z_j\) and \(z_i = f_i(a_i)\) for \(i = 1, 2, 3, 4\). In the network, \(z_0 = a_0\) (an input neuron), \(f_3(x) = \mathrm{relu}(x)\), and \(f_1(x) = f_2(x) = f_4(x) = \mathrm{sigmoid}(x)\). Here \(\mathrm{relu}(x)\) is the rectified linear unit transfer function, defined as \(\mathrm{relu}(x) = \max\{0, x\}\).

The cost function \(J(w)\) is defined in terms of the network output \(z_4\) and the label \(y\) [the exact expression is garbled in the transcription; only "(Z y)" survives]. We consider a single data sample \(x = 1.0\) and the corresponding label \(y = 0.1\). Use the training data to develop the neural network model and solve it with the gradient descent algorithm, using the initial setting \(w[0] = [0.3, 0, 0.5, 0.4, 1.0, 0.8, 0]\) (seven weights, listed in the order given in the original question; their individual indices are lost in the transcription) and learning rate \(\eta = 0.01\).

a) Write a function \(F\) to simulate the neural network. [5 marks]

b) Compute the values of the three requested weights at iteration 1, that is, after the first iteration (the original names three entries of \(w[1]\), one of them with index 3; the remaining indices are lost in the transcription). [15 marks]
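As a rough illustration of what parts a) and b) ask for, the sketch below simulates a bias-free, partially connected network and takes one gradient descent step. Several things in it are assumptions rather than facts from the question: the connectivity in `EDGES` is hypothetical (the figure is not available), the pairing of the seven initial values with those edges is hypothetical, and the cost is assumed to be the squared error \(\frac{1}{2}(z_4 - y)^2\). The gradient is approximated with central finite differences instead of hand-derived backpropagation, so the code does not depend on the assumed wiring being correct.

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def relu(x):
    return max(0.0, x)

# Transfer functions as stated in the question: f3 is ReLU, f1, f2, f4 are sigmoid.
ACTIVATIONS = {1: sigmoid, 2: sigmoid, 3: relu, 4: sigmoid}

# HYPOTHETICAL connectivity (source neuron, destination neuron).
# The real wiring is in the missing figure; replace this list to match it.
EDGES = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3), (2, 4), (3, 4)]

# Initial values from the question; their pairing with EDGES is an ASSUMPTION.
W0 = [0.3, 0.0, 0.5, 0.4, 1.0, 0.8, 0.0]

def F(w, x, edges=EDGES):
    """Forward pass without bias: a_i = sum_j w * z_j, z_i = f_i(a_i)."""
    z = {0: x}  # z_0 = a_0 is the input neuron
    for i in (1, 2, 3, 4):
        a_i = sum(w_k * z[src] for w_k, (src, dst) in zip(w, edges) if dst == i)
        z[i] = ACTIVATIONS[i](a_i)
    return z[4]

def cost(w, x, y):
    # ASSUMPTION: squared error with a 1/2 factor; the original formula is garbled.
    return 0.5 * (F(w, x) - y) ** 2

def numerical_gradient(w, x, y, h=1e-6):
    """Central finite-difference approximation of dJ/dw_k for each weight."""
    grad = []
    for k in range(len(w)):
        w_plus = list(w)
        w_minus = list(w)
        w_plus[k] += h
        w_minus[k] -= h
        grad.append((cost(w_plus, x, y) - cost(w_minus, x, y)) / (2 * h))
    return grad

def gradient_descent_step(w, x, y, eta=0.01):
    """One update: w[t+1] = w[t] - eta * grad J(w[t])."""
    g = numerical_gradient(w, x, y)
    return [w_k - eta * g_k for w_k, g_k in zip(w, g)]

# One iteration on the single training sample x = 1.0, y = 0.1.
w1 = gradient_descent_step(W0, x=1.0, y=0.1)
```

With the real connectivity substituted into `EDGES`, the entries of `w1` would be the weights after the first iteration. An exact answer to part b) would normally use the chain rule through \(z_4\), \(a_4\) and the upstream neurons; the finite-difference gradient here is only a convenient check on such a derivation.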