Question: Using Python implement the follo Question 1 We now need to identify the principal components. For this we need to compute a covariance matrix and

Using Python implement the follo

Question 1 We now need to identify the principal components. For this we need to compute a covariance matrix and then determine its eigenvalues and eigenvectors. Sort these in ascending order and pick the first l largest eigenvalues and their corresponding eigenvectors. These become the principal components. There are two ways to determine principal components. Let's investigate both ways.

a. Use the Numpy package to determine the covariance matrix, and work out its eigenvalues and eigenvectors.

b. Alternatively, for l>1, the covariance matrix is given by the l eigenvectors of XT X corresponding to the largest eigenvalues. We can derive the principal components via singular value decomposition of XT X.

c. Is there any relationship between the features in X ? What does it mean when features are highly correlated?

Task 3: Reduce the data

Question 1 Sort the eigenvalues and their corresponding eigvenvectors in descending order.

Question 2 Select the first l principal components, i.e., select the first l eigenvectors and concatenate all them to form a matrix V with one eigenvector per column: V=[v(1),...,v(l)).

Question 3 Transform the original dataset from n dimensions to l dimensions with the following matrix multiplication XV.

Question 4 Reconstruct the original data from the reduced data set using V . What do you observe in terms of the norm?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Instructions: Please interpret your analysis results using concise and clear language and focusing on interesting findings. Remember to include your Python codes and only necessary Python outputs. 1....

Show me the steps to solve The state - crime data contain the name and the following variables for 5 0 states: > Abbr ( descriptor ) > Unemploy, Police, InSchool ( numeric X variables ) > Murder,...

Please use minitab to help explain. Five variables are used to monitor a chemical process. They are X, percentage impurities, Xz temperature C, X nitrogen content (ppm), X4 oxygen content (ppm), and...

MATHEMATICS FOR MACHINE LEARNING Marc Peter Deisenroth A. Aldo Faisal Cheng Soon Ong Contents Foreword 1 Part I Mathematical Foundations 9 1 Introduction and Motivation 11 1.1 Finding Words for...

3 [50 pts] Dimensionality reduction For this question, you will use the Cloud Dataset from the UCI ML repository: https://archive.ics.uci.edu/ml/datasets/Cloud . Read about it to get familiar with...

Dataset You will work with the colon.csv file, which contains gene data from each patient. The dataset includes various gene expression measurements ( features ) and a label indicating the stage...

You will work with the Thyloid.csv file, which contains gene data from each patient. The dataset includes various gene expression measurements ( features ) and a label indicating the stage...

Read the information in all photos and answer the questions using what you read. 2.3 Students treated as customers Gruber et al. (2010), in their research paper, treated students as customers in the...

Use Matlab or Python programming language for the implementation. Matlab is preferred. Thanks! Write a script that implements the following methods. You can use built-in functions as needed....

3.2 Identifying FR level of importance using factor analysis The level of importance of the FRs is determined in this study by the application of factor analysis (principal component technique). The...

Which compound would you expect to have the greater - Hovalue, 1,2-pentadiene or 1,4 pentadiene?

Use the data for GlenTel Inc. in Exercise 14-16 to present a multiple-step income statement for 2014. You need not complete the earnings per share calculations.

When a firm faces hard rationing, Group of answer choices all positive net present value projects will be accepted. there will be no available funds for capital expenditures. the firm will finance...

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

Was there an effort to involve the appropriate people?

18. If you have power, then people will dislike and fear you.

15. On very controversial issues, it often is best to delay or avoid your involvement as long as possible.