Question: Hi! I really need help on this machine learning questions. Any fast help would be appreciated, on any of the parts. We derived a cost

Hi! I really need help on this machine learning

Hi! I really need help on this machine learning questions.

Any fast help would be appreciated, on any of the parts.

We derived a cost function for regression problems by assuming that sample points and their labels arise from the following process, and applying maximum likelihood estimation (MLE). Sample points come from an unknown distribution, X, D. Labels y, are the sum of a deterministic function g plus random noise: Vi, y = g(x) + 6, where - N(0,0). For this problem, we will assume that & N0.07) that is, the variance of of the noise is different for each sample point and we will examine how our cost function changes as a result. We assume that (magically) we know the value of each of You are given an nxd design matrix X, an t-vector y of labels, such that the label y, of sample point X is generated as described above, and a list of the noise variancesc (a) [8 pts Apply MLE to derive the optimization problem that will yield the maximum likelihood estimate of the distribution parameter g. (Note: g is a function, but we can still trcat it as the parameter of an optimization problem.) Express your cost function as a summation of loss functions (where you decide what the loss function is), one per sample point. (b) [4 pts) We decide to do linear regression, so we parameterize g(x) as g(x) = w. X., where w is a d-vector of weights Write an equivalent optimization problem where your optimization variable is w and the cost function is a function of X. y, w, and the variances Find a way to express your cost function in matrix notation, with no summations. This may entail defining a new matrix.) (e) [4 pts) Write the solution to your optimization problem as the solution of a linear system of equations. (Again, in matrix notation, with no summations.) (d) 12 pts Does your solution resemble that of a similar method you know? What is its name? (e) 12 pts Compure your solution to the case in which we assume that every sample point has the same noise distribution In simple terms, how does the amount of noise affect the optimization, and why does this secm like the intuntively night thing to do? Answer in 3 sentences or fewer. We derived a cost function for regression problems by assuming that sample points and their labels arise from the following process, and applying maximum likelihood estimation (MLE). Sample points come from an unknown distribution, X, D. Labels y, are the sum of a deterministic function g plus random noise: Vi, y = g(x) + 6, where - N(0,0). For this problem, we will assume that & N0.07) that is, the variance of of the noise is different for each sample point and we will examine how our cost function changes as a result. We assume that (magically) we know the value of each of You are given an nxd design matrix X, an t-vector y of labels, such that the label y, of sample point X is generated as described above, and a list of the noise variancesc (a) [8 pts Apply MLE to derive the optimization problem that will yield the maximum likelihood estimate of the distribution parameter g. (Note: g is a function, but we can still trcat it as the parameter of an optimization problem.) Express your cost function as a summation of loss functions (where you decide what the loss function is), one per sample point. (b) [4 pts) We decide to do linear regression, so we parameterize g(x) as g(x) = w. X., where w is a d-vector of weights Write an equivalent optimization problem where your optimization variable is w and the cost function is a function of X. y, w, and the variances Find a way to express your cost function in matrix notation, with no summations. This may entail defining a new matrix.) (e) [4 pts) Write the solution to your optimization problem as the solution of a linear system of equations. (Again, in matrix notation, with no summations.) (d) 12 pts Does your solution resemble that of a similar method you know? What is its name? (e) 12 pts Compure your solution to the case in which we assume that every sample point has the same noise distribution In simple terms, how does the amount of noise affect the optimization, and why does this secm like the intuntively night thing to do? Answer in 3 sentences or fewer

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related General Management Questions!

hi i really need help with this project. its about 60%+ complete i just cant find the rest of the numbers. its due tomorrow, id really appreciate help. TS2761 I SEE THE LIGHT Thandi, when you are...

Read the above passage and then answer short questions Summarize and elaborate the research method of this article in concise language Application Research Based on Machine Learning in Network...

. Requred: Question: make 8 slide of power point presentation on the above case study Rapid advances in machine learning (ML) that is a or click-through conversions are giving way to more machine's...

Chapter 2 User-Centered Systems Design: A Brief History Abstract The intention of this book is to help you think about design from a user-centered perspective. Our aim is to help you understand what...

Please i need 8 slide power point presentation on this article please . Required: Question: Make 8 slide presentation of powerpoint Leading With Next-Generation Key Performance Indicators 4 Executive...

see case to answer question only you don't need no other reference. Case Overview Founded by Jeff Bezos, online giant Amazon.com, Inc. (Amazon), was incorporated in the state of Washington in July,...

Instructions: Read UCB case study and answer the following questions: a) Discuss TWO primary roles of De Prins as a Chief Information Officer of UCB b) Based on the answers in a) identify the level...

(a) In SystemVerilog, what is the difference between: (i) The ternary operator ? and if...then...else statements? [2 marks] (ii) always_ff and always_comb? [2 marks] (iii) Blocking, non-blocking and...

Do you expect robots to have a bigger impact inside or outside of factories in the next 15 years, and what implications does your answer have for the kinds of strategies that will gain competitive...

Describe 1 of the ways that this discussion around AI impacts people at work.Be specific in your connection to the podcast. TRANSCRIPT OF PODCAST: Working Well with AI episode 2: AI and...

Bill & Ben manage the Flowerpot Men, a company that makes and sells flower pots. Bill manages the sales department, and Ben manages the production department. The sales budget for this year is to...

R-Kraine Inc. is considering acquiring an existing project (with financial backing from the government). The project is expected to have another 8 (full) years of economic life. The project's...

Questionl ( 2 5 Marks ) Morris - Meyer Mining Compary must install $ 1 . 5 miltion of new machinery in its Nevada mine. It can obtain a bark loan for 1 0 0 percent of the required amount....

As the manager of Smith Construction, you need to make a decision on the number of homes to build in a new residential area where you are the only builder. Unfortunately, you must build the homes...