Question: Problem 2.1. While we can formalize the likelihood function, there is no closed-form expression for the coefficients $\beta_0, \beta_1$ maximizing the above log-likelihood in Problem 1. Hence, we will use an iterative algorithm to solve for the coefficients. We can see that

$$\max_{\beta_0,\beta_1} \left( -\sum_{i=1}^{m} \ln\left(1 + e^{-y_i(\beta_0 + \beta_1 x_i)}\right) \right) = \min_{\beta_0,\beta_1} \sum_{i=1}^{m} \ln\left(1 + e^{-y_i(\beta_0 + \beta_1 x_i)}\right)$$

We will describe our loss function as $L = \frac{1}{m} \sum_{i=1}^{m} \ln\left(1 + e^{-y_i(\beta_0 + \beta_1 x_i)}\right)$. Our objective is to iteratively decrease this loss as we keep computing the optimal coefficients. Here $x_i \in \mathbb{R}$.

In this problem we will be working with real image data, where the goal is to classify whether the image is a 0 or a 1 using logistic regression. The input $X \in \mathbb{R}^{m \times d}$ is a matrix with dimensions $[m \times d]$, where a single data point $x_i \in \mathbb{R}^d$ with $d = 784$. The labels matrix is $Y \in \mathbb{R}^m$, where each label $y_i \in \{0, 1\}$.

- Load the data into memory and visualize one input as an image for each of label 0 and label 1. (The data should be reshaped back to $[28 \times 28]$ to be able to visualize it.)
- The data is between 0 and 255. Normalize the data to $[0, 1]$.
- Set $y_i = 1$ for images labeled 0 and $y_i = -1$ for images labeled 1. Split the data randomly into train and test with a ratio of 80:20. Why is random splitting better than sequential splitting in our case?
- Initialize the coefficients using a univariate normal (Gaussian) distribution with mean 0 and variance 1. (Remember that the coefficients are a vector $[\beta_0, \beta_1, \ldots, \beta_d]$, where $d$ is the dimension of the input.)
- Compute the loss using the above-mentioned loss $L$. (The loss can be written as $L = \frac{1}{m} \sum_{i=1}^{m} \ln\left(1 + e^{-y_i\left(\beta_0 + \sum_{j=0}^{d-1} \beta_{j+1} x_i^{(j)}\right)}\right)$, where $(i, j)$ index the $i$-th data point, $i \in \{1, 2, \ldots, m\}$, and the $j$-th dimension of the data point $x_i$, $j \in \{0, \ldots, d-1\}$.)
- To minimize the loss function, a widely known algorithm is to move in the direction opposite to the gradients of the loss function. (It is helpful to write the coefficients $[\beta_1, \ldots, \beta_d]$ as a vector $\beta$, and $\beta_0$ as a scalar. Now $\beta \in \mathbb{R}^d$ and $\beta_0 \in \mathbb{R}$.) We can write the gradients of the loss function as a matrix operation:

$$\frac{\partial L}{\partial \beta} = -\frac{1}{m} \sum_{i=1}^{m} \frac{e^{-y_i(\beta_0 + \beta^T x_i)}}{1 + e^{-y_i(\beta_0 + \beta^T x_i)}} \, y_i x_i$$

$$\frac{\partial L}{\partial \beta_0} = -\frac{1}{m} \sum_{i=1}^{m} \frac{e^{-y_i(\beta_0 + \beta^T x_i)}}{1 + e^{-y_i(\beta_0 + \beta^T x_i)}} \, y_i$$

Write a function to compute the gradients. (A vectorized sketch is given after the code template below.)
- Update the parameters as $\beta = \beta - 0.05 \cdot \frac{\partial L}{\partial \beta}$ and $\beta_0 = \beta_0 - 0.05 \cdot \frac{\partial L}{\partial \beta_0}$. (Gradient updates should be computed based on the train set.)
- Repeat the process for 50 iterations and report the loss after the 50th epoch.
- Plot the loss for each iteration for the train and test sets.
- Logistic regression is a classification problem. We classify as $+1$ if $P(Y = 1 \mid X) \ge 0.5$. Derive the classification rule for the threshold 0.5. (Not a programming question; a derivation sketch follows the code template below.)
- For the classification rule derived, compute the accuracy on the test set for each iteration and plot the accuracy.

The final code should be along this format:

import numpy as np
from matplotlib import pyplot as plt

def compute_loss(data, labels, B, B_0):
    return logloss

def compute_gradients(data, labels, B, B_0):
    return dB, dB_0

if __name__ == '__main__':
    x = np.load(data)
    y = np.load(label)
    ## Split the data into train and test
    x_train, y_train, x_test, y_test = ...  # split_data: 80:20 random split
    B = np.random.randn(1, x.shape[1])
    B_0 = np.random.randn(1)
    lr = 0.05
    for _ in range(50):
        ## Compute loss
        loss = compute_loss(x_train, y_train, B, B_0)
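For reference, here is a short sketch of the classification-rule derivation requested above. It assumes the standard logistic model $P(Y = 1 \mid x) = \frac{1}{1 + e^{-(\beta_0 + \beta^T x)}}$, which is consistent with the loss defined above:

$$P(Y = 1 \mid x) = \frac{1}{1 + e^{-(\beta_0 + \beta^T x)}} \ge \frac{1}{2} \iff 1 + e^{-(\beta_0 + \beta^T x)} \le 2 \iff e^{-(\beta_0 + \beta^T x)} \le 1 \iff \beta_0 + \beta^T x \ge 0$$

So the threshold-0.5 rule reduces to a sign test on the linear score: predict $\hat{y} = +1$ when $\beta_0 + \beta^T x \ge 0$, and $\hat{y} = -1$ otherwise.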
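Below is a minimal vectorized sketch that fills in the template, implementing the loss and gradient equations stated above. The file names mnist_x.npy and mnist_y.npy are assumptions (the assignment does not fix them), and this is one possible implementation, not the official solution.

import numpy as np
from matplotlib import pyplot as plt

def compute_loss(data, labels, B, B_0):
    # L = (1/m) * sum_i ln(1 + exp(-y_i * (B_0 + B . x_i)))
    margins = labels * (data @ B.ravel() + B_0)            # shape (m,)
    return np.logaddexp(0.0, -margins).mean()              # numerically stable ln(1 + e^{-margin})

def compute_gradients(data, labels, B, B_0):
    # dL/dB = -(1/m) sum_i [e^{-m_i} / (1 + e^{-m_i})] y_i x_i, with margin m_i;
    # note e^{-m_i} / (1 + e^{-m_i}) = 1 / (1 + e^{m_i})
    margins = labels * (data @ B.ravel() + B_0)
    w = labels / (1.0 + np.exp(margins))                   # y_i / (1 + e^{m_i}), shape (m,)
    dB = -(data * w[:, None]).mean(axis=0)[None, :]        # shape (1, d)
    dB_0 = -w.mean(keepdims=True)                          # shape (1,)
    return dB, dB_0

if __name__ == '__main__':
    x = np.load('mnist_x.npy').astype(np.float64) / 255.0  # hypothetical file name; normalize to [0, 1]
    y = np.load('mnist_y.npy')                             # hypothetical file name
    y = np.where(y == 0, 1.0, -1.0)                        # label 0 -> +1, label 1 -> -1

    # Random (not sequential) 80:20 split, in case the file is ordered by class.
    rng = np.random.default_rng(0)
    perm = rng.permutation(x.shape[0])
    cut = int(0.8 * x.shape[0])
    x_train, y_train = x[perm[:cut]], y[perm[:cut]]
    x_test, y_test = x[perm[cut:]], y[perm[cut:]]

    B = np.random.randn(1, x.shape[1])
    B_0 = np.random.randn(1)
    lr = 0.05

    train_loss, test_loss, test_acc = [], [], []
    for _ in range(50):
        train_loss.append(compute_loss(x_train, y_train, B, B_0))
        test_loss.append(compute_loss(x_test, y_test, B, B_0))
        # Classify as +1 when B_0 + B . x >= 0 (the rule derived above).
        preds = np.where(x_test @ B.ravel() + B_0 >= 0, 1.0, -1.0)
        test_acc.append(np.mean(preds == y_test))
        dB, dB_0 = compute_gradients(x_train, y_train, B, B_0)
        B = B - lr * dB
        B_0 = B_0 - lr * dB_0

    print('train loss after 50 iterations:', train_loss[-1])
    plt.plot(train_loss, label='train loss')
    plt.plot(test_loss, label='test loss')
    plt.plot(test_acc, label='test accuracy')
    plt.xlabel('iteration')
    plt.legend()
    plt.show()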
