Question: This is homework from a data science course. Please help.

HW3.pdf, due Oct 15, 10 pm

(1) (20 points) The figure below shows three possible feature tests for the root node of a decision tree that predicts spam e-mail messages (based on the example discussed in class).
(a) Calculate the expected information gain for each feature test (please show your formula).
(b) Which feature test will be selected by the Decision Tree Learning algorithm?

[Figure: three candidate feature tests, each with Yes/No branches. Legible labels: "Subject", "Has", "Sender in Contact", "free money". The class counts at each branch did not survive extraction.]

(2) (20 points) Construct a regular expression as the value for the parameter token_pattern so that CountVectorizer can extract hashtags, Twitter user names (e.g., @realDonaldTrump), and words from tweets as tokens. Explain how you construct the regular expression.
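As a starting point for both parts, here is a minimal Python sketch, not a worked solution. The first half computes information gain as the parent node's entropy minus the size-weighted entropy of the child nodes. The counts passed in are hypothetical, because the figure's real counts are not legible in this copy. The second half tries one possible token pattern, an optional leading `#` or `@` followed by word characters. The names `entropy`, `information_gain`, and `TOKEN_PATTERN` are illustrative, and the pattern is an assumption about what the assignment expects.

```python
import math
import re


def entropy(pos, neg):
    """Binary entropy, in bits, of a node holding pos/neg example counts."""
    total = pos + neg
    result = 0.0
    for count in (pos, neg):
        if count:  # treat 0 * log2(0) as 0
            p = count / total
            result -= p * math.log2(p)
    return result


def information_gain(parent, children):
    """Gain = H(parent) - sum over branches of (n_k / n) * H(child_k).

    parent is a (spam, not_spam) count pair; children is a list of such
    pairs, one per branch of the feature test.
    """
    n = sum(parent)
    remainder = sum((p + q) / n * entropy(p, q) for p, q in children)
    return entropy(*parent) - remainder


# Hypothetical counts only: the real counts from the figure are unknown.
gain = information_gain((5, 5), [(4, 1), (1, 4)])
print(f"gain = {gain:.4f} bits")

# Assumed pattern: optional '#' or '@' prefix, then one or more word characters.
TOKEN_PATTERN = r"[#@]?\w+"
tokens = re.findall(TOKEN_PATTERN, "@realDonaldTrump uses #DataScience")
print(tokens)
```

For part (1), the feature test with the largest gain is the one the learning algorithm would pick. For part (2), the same string could be passed as `CountVectorizer(token_pattern=TOKEN_PATTERN)`. Note that CountVectorizer lowercases text by default, and that its default pattern requires at least two word characters per token, which this sketch does not.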
