Assume we are training a neural network with two inputs and for the OR gate. The...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Assume we are training a neural network with two inputs and for the OR gate. The table below represents the input and expected output for the OR gate: i 1 2 3 4 1 2 0 0 0 1 0 1 output y 0 1 1 1 We will create a neural network with two inputs, one output, and no hidden layer. The activation function will be the sigmoid function S(x) = 1 1+ e-z' x1 X2 W W2 = 0.4 and = -0.2, and initial bias b = 0.1. Begin with the initial weights w = The observed output is calculated as = S(s) where s = wx1 +wx2 + b. If we were to implement this in a practical example, we know that the output is expected as either 0 or 1, so we could simply round to the nearest integer afterwards. (a) For each of the four inputs (1, 2) in the table above, determine the observed output for the initial weights and bias. (b) Using the mean squared error function error = i=1 (i - y) calculate the error observed in part (a). (c) Using the chain rule, calculate the following derivatives derror wi Jerror w derror b After the 100 epochs, what are the values s and for each input? What are the values of w, W2, b? (e) Did the observed output adjust properly to mimic the expected output? Note: For those familiar with neural networks, this training simulation has quite a few issues with it. For instance, we are overfitting the data since our actual data is the training data. This is due to the small data amounts. Another issue is we are not randomizing the order of the inputs when training. All of this (and more!) will be discussed in your future ML & AI courses. Assume we are training a neural network with two inputs and for the OR gate. The table below represents the input and expected output for the OR gate: i 1 2 3 4 1 2 0 0 0 1 0 1 output y 0 1 1 1 We will create a neural network with two inputs, one output, and no hidden layer. The activation function will be the sigmoid function S(x) = 1 1+ e-z' x1 X2 W W2 = 0.4 and = -0.2, and initial bias b = 0.1. Begin with the initial weights w = The observed output is calculated as = S(s) where s = wx1 +wx2 + b. If we were to implement this in a practical example, we know that the output is expected as either 0 or 1, so we could simply round to the nearest integer afterwards. (a) For each of the four inputs (1, 2) in the table above, determine the observed output for the initial weights and bias. (b) Using the mean squared error function error = i=1 (i - y) calculate the error observed in part (a). (c) Using the chain rule, calculate the following derivatives derror wi Jerror w derror b After the 100 epochs, what are the values s and for each input? What are the values of w, W2, b? (e) Did the observed output adjust properly to mimic the expected output? Note: For those familiar with neural networks, this training simulation has quite a few issues with it. For instance, we are overfitting the data since our actual data is the training data. This is due to the small data amounts. Another issue is we are not randomizing the order of the inputs when training. All of this (and more!) will be discussed in your future ML & AI courses.
Expert Answer:
Answer rating: 100% (QA)
Solutions Step 1 This is the complete answer to this with a very detailed explanation This is the complete answer for part A in this we need to calculate the observed output for each of the four input... View the full answer
Related Book For
Business Statistics In Practice
ISBN: 9780073401836
6th Edition
Authors: Bruce Bowerman, Richard O'Connell
Posted Date:
Students also viewed these mechanical engineering questions
-
Note: All ML code must be explained clearly (INJAVAXX)and should be free of needless complexity. 2 CST.2016.1.3 2 Foundations of Computer Science Please help. (2c) (a) A prime number sieve is an...
-
I need it in JAVAx Objects: Electronic health records (EHRs) in a nationwide service. Policy: The owner (patient) may read from its own EHR. A qualified and employed doctor may read and write the EHR...
-
The file CigaretteTax contains the state cigarette tax ($) for each state as of January 1, 2013. a. Construct an ordered array. b. Plot a percentage histogram. c. What conclusions can you reach about...
-
Write the augmented matrix for the system of linear equations. 1. 2. 3. 4. 5. 6. [2x y = 7 |x + y = 2 5x + 2y = 13 - 3x + y = -24 - 24
-
Exercise 3.58 describes a study in which college students found it unpleasant to sit alone and think. The same article describes a second study in which college students appear to prefer receiving an...
-
While you're at the library, select two other articles from an area in which you are interested and write a brief description of the sample and how it was selected from the population. Be sure to...
-
1. Your notebook computers hard drive recently crashed, and you decide to take it to a local repair technician to have it fixed. In this relationship, a. you are the agent. b. the technician is the...
-
A local Dunkin Donuts franchise must buy a new piece ofequipment in 5 years that will cost $88,000. The company is settingup a sinking fund to finance the purchase. What will the quarterlydeposit be...
-
As of December 31, Grapefruit Company had $2,086 of raw materials inventory. At the beginning of the year, there was $3,206 of materials on hand. During the year, the company purchased $288,172 of...
-
Cullumber Company completes and transfers out 15,360 units and has 2,560 units of ending work in process that are 25% complete as to conversion costs. Materials are entered at the beginning of the...
-
a. Journalize the entries to record (1) The declaration of the dividend, capitalizing an amount equal to market value, and (2) The issuance of the stock certificates. b. Determine the following...
-
Record the following transactions on the books of Ivanhoe Co. (Credit account titles are automatically indented when amount is entered Do not indent manually.) a. On July 1, Ivanhoe Co. sold...
-
?Journalize Loondance ' s July transactions. ( You do not need to record the cost of goods sold or inventory entry or ?entries. ) ( Record debits ?first, then credits. Exclude explanations from any...
-
People management is complex and involves a wide range of skills. What are the most important skills to work on right now, to become a more effective manager? How can someone apply "learn today,...
-
Consider the model where y = x + u, t= 1,.., T U =put-1+E, t=...,0,1,2,..., with lpl <1, and (e) is a sequence of i.i.d. disturbances, with E(e) = 0, Var(e) = o, vt. a) Explain how the above linear...
-
The Smiths buy a house. They borrow 80 percent of the purchase price from the local ABC Savings and Loan. Before they make their first payment, ABC transfers the right to receive mortgage payments to...
-
Consider the display panel situation in Exercise 11.3, and let A, B, and C (represent the mean times to stabilize the emergency condition when using display panels A, B, and C, respectively. Figure...
-
On January 4. 2000, the Gallup Organization released the results of a poll dealing with the likelihood of computer-related Y2K problems and the possibility of terrorist attacks during the New Year's...
-
Discuss how we use residual plots to check the regression assumptions for a multiple regression model.
-
What are the pros and cons of virtual teams? What are some of the potential legal issues an organization can encounter with virtual teams?
-
What are the five stages of team development? Describe each stage and how those stages may appear as behaviors in a health care setting.
-
What are the differences among an onsite team, a virtual team, a task force, and a committee? What are some of the potential differences in dynamics between people in these different groups?
Study smarter with the SolutionInn App