Question: Data Mining tep 3: perform Principal component analysis (PCA) to generate a number of Principal Components (PCs) capturing 85% of total data variance. a) Plot

Data Mining

Data Mining tep 3: perform Principal component analysis (PCA) to generate anumber of Principal Components (PCs) capturing 85% of total data variance. a)

tep 3: perform Principal component analysis (PCA) to generate a number of Principal Components (PCs) capturing 85% of total data variance. a) Plot percentage of variances of each Principal Components(PC) in a decreasing order (5 points) b) How many components do you need to capture >85% total data variance? (5 points) c) Plot the generated (new) top P PC variables you selected (5 points) d) Compute the covarion matrix of the P PC variables (this should be P-by-P matrix), and compute the total variance of PCs (sum of diagonal elements of covariance matrix ), compare this value with the total variance of data in Step2_b, what \% variance kept in PCs (5 points), e) compute correlation (Pearson's correlation) between new variable PC1 and new variable PC2 (5 points), f) Plot the PC coefficients (or projection direction) of N components you selected (5 points)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!