1. An exponential loss function f(w) is defined as f(w) e-2(w-1), w ...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
1. An exponential loss function f(w) is defined as f(w) e-2(w-1), w <1 { ew-1, w1 a) Is f(w) convex? Why? Hint: Graph the function. b) Is f(w) differentiable everywhere? If not, where not? c) The "differential set" of (w) is the set of subgradients v af (w) for which f(u) f(w) + (u - w)Tv. Find the differential set for f(w) as a function of w. 2. We are trying to predict whether a certain chemical reaction will take place as a function of our experimental conditions: temperature, pressure, concentration of catalyst, and several other factors. For each experiment i = 1,..., m we record the experimental conditions in the vector x; R" and the outcome in the scalar b; {1,1} (+1 if the reaction occurred and -1 if it did not). We will train our linear classifier to minimize hinge loss. Namely, we solve: m minimize (1-bxw)+ w i=1 where (u)+ =max(0, u) is the hinge loss operator a) Derive a gradient descent method for solving this problem. Explicitly give the computations required at each step. Note: you may ignore points where the function is non-differentiable. b) Explain what happens to the algorithm if you land at a wk that classifies all the points perfectly, and by a substantial margin. 3. You have four training samples y = 1, x1 = , Y2 = 2, X2 = , Y3 -2 -2 -1, x3 = and y4 = -2,x4 = 0 Use cyclic stochastic gradient descent to find the first two updates for the LASSO problem miny Xw|+2||W||1 assuming a step size of T six updates. = 1 and w(0) = 0. Also indicate the data used for the first 1. An exponential loss function f(w) is defined as f(w) e-2(w-1), w <1 { ew-1, w1 a) Is f(w) convex? Why? Hint: Graph the function. b) Is f(w) differentiable everywhere? If not, where not? c) The "differential set" of (w) is the set of subgradients v af (w) for which f(u) f(w) + (u - w)Tv. Find the differential set for f(w) as a function of w. 2. We are trying to predict whether a certain chemical reaction will take place as a function of our experimental conditions: temperature, pressure, concentration of catalyst, and several other factors. For each experiment i = 1,..., m we record the experimental conditions in the vector x; R" and the outcome in the scalar b; {1,1} (+1 if the reaction occurred and -1 if it did not). We will train our linear classifier to minimize hinge loss. Namely, we solve: m minimize (1-bxw)+ w i=1 where (u)+ =max(0, u) is the hinge loss operator a) Derive a gradient descent method for solving this problem. Explicitly give the computations required at each step. Note: you may ignore points where the function is non-differentiable. b) Explain what happens to the algorithm if you land at a wk that classifies all the points perfectly, and by a substantial margin. 3. You have four training samples y = 1, x1 = , Y2 = 2, X2 = , Y3 -2 -2 -1, x3 = and y4 = -2,x4 = 0 Use cyclic stochastic gradient descent to find the first two updates for the LASSO problem miny Xw|+2||W||1 assuming a step size of T six updates. = 1 and w(0) = 0. Also indicate the data used for the first
Expert Answer:
Related Book For
Income Tax Fundamentals 2013
ISBN: 9781285586618
31st Edition
Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill
Posted Date:
Students also viewed these mechanical engineering questions
-
"internet radios" for streaming audio, and personal video recorders and players. Describe design and evaluation processes that could be used by a start-up company to improve the usability of such...
-
Let A, B be sets. Define: (a) the Cartesian product (A B) (b) the set of relations R between A and B (c) the identity relation A on the set A [3 marks] Suppose S, T are relations between A and B, and...
-
Josie's new job has a yearly salary of $42,000. Her salary will increase by $4000 each year thereafter. If Josie works for at this job for 30 years how much will she have been paid total over the 30...
-
The precise mechanism of ammonia toxicity to the brain is not known. Speculate on a possible mechanism, based on possible effects of ammonia on levels of key intermediates in energy generation.
-
What are the different terms used to describe banks during the bank collection process?
-
BlackBerry Ltd. has a target current ratio of 2.0 but has experienced some difficulties financing its expanding sales in the past few months. At present, the firm has current assets of $750,000 and a...
-
1. What problems such as lawsuits, reputation, and public image would GarageTek face if they closed failing franchises? 2. Is shuttering the failed franchises the right move for Shuman? What are his...
-
1. In the given reaction, XYZ3 2. 3. 4. 5. X+Y+3Z If one mole of each of X and Y with 0.05 mol of Z gives compound XYZ3. (Given: Atomic masses of X, Y and Z are 10, 20 and 30 amu, respectively.) The...
-
Identifytwo (2) additional needs or barriers to participation related to each of the five areas stated; Behavioural or psychological disorders Child at risk of harm Family circumstances and needs,...
-
In October, Penny Bear, Treasury Manager for the Winter Den Company plans on selling $1,000,000 of 3-month day T-bills in December. At this time, the 3-month day T-bill discount rate is 2.70%,...
-
Compliance requirements for Plan and prepare to perform construction calculations to determine carpentry material requirements,
-
1. A student sees a newspaper ad for an apartment that has 1330 square feet (ft) of floor space. How many square meters of area are there? Answer: 124 m 2 2.Bicyclists in the Tour de France reach...
-
Sean, age 37, wants to maximize his 2020 Roth IRA contribution and purchase as much Beyond Meat stock as possible. If BYND stock is trading at 10/share, and Sean is able to contribute the maximum in...
-
A female employee wants to take time off because she is pregnant. State facts that would entitle her to FMLA and facts that would entitle her to disability leave under the Pregnancy Discrimination...
-
Requirements Chocolate Arts Inc. needs your assistance in documenting its business processes for the payroll cycle for paying employees. The following is narrative. Every Friday after calculating...
-
Read Managing Talent: Can Yahoo Still Attract Tech Workers? at the end of chapter five and answer the following three questions: a. How could Yahoo strengthen its internal recruiting? How would these...
-
In the simple quantity theory of money, what will lead to an increase in aggregate demand? In monetarism, what will lead to an increase in aggregate demand?
-
Which of the following is a cash outflow? (a) Proceeds from borrowing. (b) Repayments of debt principal. (c) Payment for taxes. (d) Both (b) and (c).
-
Which of the following is not a cash inflow? (a) Proceeds from borrowing. (b) Returns on interest-earning assets. (c) Payment of dividends. (d) Returns on equity securities.
-
How would the sale of a building be classified? (a) Operating outflow. (b) Operating inflow. (c) Investing inflow. (d) Financing inflow.
Study smarter with the SolutionInn App