Question: pogramming Problem: [44 points] In this problem, we write a program to find the coefficients for a linear regression model for the dataset provided (data2.txt).

pogramming Problem: [44 points] In this problem, we write a program to find the coefficients for a linear regression model for the dataset provided (data2.txt). Assume a linear model: y = Wo + wi*x. You need to 1) Plot the data (i.e., X-axis for 1st column, y-axis for 2nd column), and use Python to implement the following methods to find the coefficients: 2) Normal equation, and 3) Gradient Descent using batch AND stochastic modes respectively: a) Determine an appropriate termination condition (e.g., when cost function is less than a threshold, and/or after a given number of iterations). b) Print the cost function vs. iterations for each mode; compare and discuss batch and stochastic modes in terms of the accuracy and the speed of convergence. c) Choose a best learning rate. For example, you can plot cost function vs. learning rate to determine the best learning rate. Please implement the algorithms by yoursef and do NOT use the fit() function of the library. pogramming Problem: [44 points] In this problem, we write a program to

3. [44 points] Write a program to find the coefficients for a linear regression model for the dataset provided (data2.txt). Assume a linear model: y=w0+w1x. You need to 1) Plot the data (i.e., x-axis for the 1st column, y-axis for the 2nd column), and use Python to implement the following methods to find the coefficients: 2) Normal equation, and 3) Gradient Descent using batch AND stochastic modes respectively: a) Split dataset into 80% for training and 20% for testing. b) Plot MSE vs. iteration of each mode for both training set and testing set (i.e., batch training and testing; stochastic - training and testing). Compare batch and stochastic modes (with discussion) in terms of accuracy (of testing set) and speed of convergence (You need to determine an appropriate termination condition, e.g., when cost function is less than a threshold, and/or after a given number of iterations.) c) Plot MSE of the testing set vs. learning rate (using 0.001,0.002,0.003,0.004,0.005, 0.006,0.007,0.008,0.009,0.01) and determine the best learning rate. Please implement the algorithms by yourself and do NOT use the fit() function of the library. Applied Machine Learning - CPE 695 @2023 1

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

I need answer for Question 3 4. [44 points] Write a program to find the coefficients for a linear regression model for the dataset provided (data2.txt). Assume a linear model: y=w0+w1x. You need to...

Curve-fitting Project - Linear Model (due at the end of Week 5) Instructions For this assignment, collect data exhibiting a relatively linear trend, find the line of best fit, plot the data and the...

Day Date Weekday Daily Demand Weekend weekday Average daily demand weekdays 1 4/25/2016 Mon 297 0 sun 453.25 mon 2 4/26/2016 Tue 293 0 mon 301.38 tue CheckWed the formula 3 4/27/2016 327 0 tue 315.13...

Bank of Delaware The National Bank of Delaware County (NBDC), with 12 locations in New York, is an active lender for customers looking to purchase a home, finance a small business, or open a new...

please help with this problem!!!! BU 5710 Fall 2019 Homework 2 1. Page 100, Question 4.3 (a-d) 2. Pages 100-101, Question 4.5 (a-d) 3. Baseball Franchise Values The value of a sports franchise is...

Philadelphia Modern housing areas often seek to obtain, or at least convey, low rates of crimes. Homeowners and families like to live in safe areas and presumably are willing to pay a premium for the...

(a) In SystemVerilog, what is the difference between: (i) The ternary operator ? and if...then...else statements? [2 marks] (ii) always_ff and always_comb? [2 marks] (iii) Blocking, non-blocking and...

Using the article Comparing Crime Rates between Undocumented Immigrants, Legal Immigrants, and Native-born US citizens in Texas, write a 2-3 page-paper (double-spaced, 1-inch margins, times roman...

ECO 578 Fall 2015 Page 1 of 14 Name: _______________________ ID: _______________________ Online Final Exam There are 4 parts: Part A: True/ False (1-20) Part B: Select the correct answer for the...

Unit 3 Problem Set NAME: Elements of Statistics--FHSU Virtual College--Summer 2015 REMEMBER, these are assessed preparatory problems related to the content of Unit 3. The Unit 3 Exam will consist of...

What is the managerial succession process? How important are the internal and external managerial labor markets to this process.

Pillow Company is purchasing an 80% interest in the common stock of Sleep Company. Sleeps balance sheet amounts at book and fair values are as follows: Use valuation analysis schedules to determine...

Frederik speaks fluent Spanish, and English is his second language. When communicating with his colleagues, he uses words that are normal in his language, but they consider them inappropriate. What...

record in appropriate special journals below uction or distribution Principles of Accounting 1) A company uses four special journals: purchases, sales, cash r following purchase and cash payments...

2. Dominos Pizza was interested in determining whether a new employee could learn how to make a pizza using a computer-based training method (CD-ROM). The CD-ROM application addresses the proper...

6. How might you estimate the benefits of a training program designed to teach employees how to use the World Wide Web to monitor stock prices?

4. What are results outcomes? Why do you think most organizations dont use results outcomes for evaluating their training programs?