In class we used the Courant-Fischer theorem to prove that the best low-rank approximation to any...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
In class we used the Courant-Fischer theorem to prove that the best low-rank approximation to any matrix X Rnxd is given by XVV where Vk Rdxk contains the top k eigenvectors of XX (i.e., the top k singular vectors of X). Here you will prove this from scratch, using just the basic properties of projection matrices and eigenvectors. 1. (2 points) Let X E Rnxd be any matrix and B Rnxd be any rank-k matrix with SVD B = WSZT for orthonormal W Rnk, Z Rdxk, and diagonal S Rkxk. Prove that ||X - B|| = ||XZZ B||/ + ||X XZZT||2. Hint: Use the Pythagorean theorem. 2. (2 points) Use part (1) to show that if B = || ||X-M|| then we have XZZ = B. arg min M:rank (M)=k 3. (2 points) Using a similar argument as above, one can show that if B is an optimal rank-k approximation of X then WWTX = B. Use this and part (2) to show that: XZ = WS and WTX = SZT. 4. (2 points) Use part (3) to show that if B is an optimal rank-k approximation of X then XXZ = ZS and use this to argue that each column of Z is an eigenvector of XTX. 5. (2 points) Complete the proof, showing that the best low-rank approximation of X is given ows by XVV where V contains the top k eigenvectors of XTX. Go to Settings to activate In class we used the Courant-Fischer theorem to prove that the best low-rank approximation to any matrix X Rnxd is given by XVV where Vk Rdxk contains the top k eigenvectors of XX (i.e., the top k singular vectors of X). Here you will prove this from scratch, using just the basic properties of projection matrices and eigenvectors. 1. (2 points) Let X E Rnxd be any matrix and B Rnxd be any rank-k matrix with SVD B = WSZT for orthonormal W Rnk, Z Rdxk, and diagonal S Rkxk. Prove that ||X - B|| = ||XZZ B||/ + ||X XZZT||2. Hint: Use the Pythagorean theorem. 2. (2 points) Use part (1) to show that if B = || ||X-M|| then we have XZZ = B. arg min M:rank (M)=k 3. (2 points) Using a similar argument as above, one can show that if B is an optimal rank-k approximation of X then WWTX = B. Use this and part (2) to show that: XZ = WS and WTX = SZT. 4. (2 points) Use part (3) to show that if B is an optimal rank-k approximation of X then XXZ = ZS and use this to argue that each column of Z is an eigenvector of XTX. 5. (2 points) Complete the proof, showing that the best low-rank approximation of X is given ows by XVV where V contains the top k eigenvectors of XTX. Go to Settings to activate
Expert Answer:
Related Book For
Posted Date:
Students also viewed these databases questions
-
02: You are to select a site for a new facility where there are seven potential site locations. The key attributes/criteria (A, B, C, and D) for the facility and their weights of importance...
-
QUIZ... Let D be a poset and let f : D D be a monotone function. (i) Give the definition of the least pre-fixed point, fix (f), of f. Show that fix (f) is a fixed point of f. [5 marks] (ii) Show that...
-
Questions for discussion: read the material in Exhibit 3.3 below and answer the three questions presented at the end of the exhibit. EXHIBIT 3.3 WHAT IS THE MOST DESIRABLE LEVEL OF POLLUTION?...
-
What is the present value today of an ordinary annuity cash flow of $3,000 per year for forty years at an interest rate of 10.0% per year if the first cash flow is six years from today?
-
On January 1, 2013, Sweetwater Furniture Company leased office space under a 21-year operating lease agreement. The contract calls for annual rent payments on December 31 of each year. The payments...
-
Again consider the 400 parts in Table 2.2. From this table, \[ P(D \mid F)=\frac{P(D \cap F)}{P(F)}=\frac{10}{400} / \frac{40}{400}=\frac{10}{40} \] Note that in this example all four of the...
-
Indicate whether a firm exhibiting each of the following characteristics would more likely be centralized (C) or decentralized (D). a. Large number of employees who telecommute b. Slow growth rate c....
-
3. Assume the one switch input (SW1) and the two LEDs (LED1, LED2). Both LEDs should be initially off. After each press and release of SW1, change the LED state (LED1/LED2) in the sequence: ON/ON,...
-
In early 2016, Doc and Lyn McGee formed the McGee Cake Company. The company produced a full line of cakes, and its specialties included chess cake,* lemon pound cake, and double-iced, double...
-
1. (Supplemental Exercise - Required) For a recent season, several variables were recorded for 125 professional golfers. We are interested in using multiple regression to predict Earnings ($) from...
-
Explain the utility of breakeven analysis, balanced scorecards, and discounted cash flow techniques in evaluating and tracking performance of an organization. provide also the reference you used.
-
A particle moves along the curve of intersection of the surfaces x + y = 16 and z = x+y. Find a vector function r(t) for the particle's path and use it to find the normal and tangential components of...
-
A ticket to the school dance is $6 and usually 250 students attend. The dance committee knows that for every $1 increase in the price of a ticket, 25 fewer students attend the dance. What ticket...
-
3. A hiker is walking on a mountain with height above the z = 0 plane given by z = f(x, y) = 8-xy. The positive x-axis points east and the positive y-axis points north, and the hiker starts from the...
-
1. The agreement for leasing a new car with a purchasing price of $21 000 includes a down payment of $1500 and a monthly payment of $325.00 over four years. At the end of the four years, you could...
-
EatAway Enterprises sells food at the Calgary International Airport. Please refer below to some of their Profitability ratios for the 2020 and 2019 financial years. 2020 2019 EatAway Enterprises'...
-
The text defined intrinsic value as the value of an asset given a hypothetically complete understanding of the assets investment characteristics. Discuss why hypothetically is included in the...
-
For any two closed curves lying on a cylinder whose central axis is the z-axis (Figure 21). C2 C1 FIGURE 21
-
Define g(t) = 21/(t1) for t 0. Answer the following questions, using a plot if necessary. (a) Can g(1) be defined so that g(t) is continuous at t = 1? (b) How should g(1) be defined so that g(t) is...
-
In Exercises, use a computer algebra system to compute S10, S100, S500, and S1000 for the series. Do these values suggest convergence to the given value? -3 1 =2-3-4 4-5-6+6-7-8 8.9-10
-
One might naively think that a collection \(\left(X_{t}, t \in \mathbb{R}^{+} ight)\)of independent r.v's may be chosen "measurably," i.e., with the map \[\left(\mathbb{R}^{+} \times \Omega,...
-
Chi-squared Law. A noncentral chi-squared law \(\chi^{2}(\delta, \alpha)\) with \(\delta\) degrees of freedom and noncentrality parameter \(\alpha\) has the density \[\begin{aligned}f(x ; \delta,...
-
If \(X\) is a positive random variable, prove that its negative moments are given by, for \(r>0\) : (a) \(\quad \mathbb{E}\left(X^{-r} ight)=\frac{1}{\Gamma(r)} \int_{0}^{\infty} t^{r-1}...
Study smarter with the SolutionInn App