Question: Consider the application of EM to learn the parameters for the network in Figure (a), given the true parameters in Equation (20.7). a. Explain why

Consider the application of EM to learn the parameters for the network in Figure (a), given the true parameters in Equation (20.7).

a. Explain why the EM algorithm would not work if there were just two attributes in the model rather than three.

b. Show the calculations for the first iteration of EM starting from Equation (20.8).

c. What happens if we start with all the parameters set to the same value p?

d. Write out an expression for the log likelihood of the tabulated candy data in terms of the parameters, calculate the partial derivatives with respect to each parameter, and investigate the nature of the fixed point reached in part(c)

Pih, Id) P(A, I d) 0.8 P(h, I d) P(h, I d)

Pih, Id) P(A, I d) 0.8 P(h, I d) P(h, I d) Pih, I d) 0.6 0.9 0.8 0.7 0.4 0.6 0.2 0.5 0.4 2 10 6. 8. 10 Number of samples in d Number of samples in d (b) (a) Posterior probability of hypothesis Probability that next candy is I me

Step by Step Solution

3.40 Rating (162 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

a With three attributes there are seven parameters in the model and the empirical data give frequencies for 2 8 classes which supply 7 independent numbers since the 8 frequencies have to sum to the total sample size Thus the problem is neither under nor overconstrained With two attributes there are five parameters in the model and the empirical data give frequencies for 22 4 classes which supply 3 independent num ... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Document Format (1 attachment)

Word file Icon

21-C-S-A-I (286).docx

120 KBs Word File

Students Have Also Explored These Related Artificial Intelligence Questions!