Question: This is the homework from the data science. Please help HW3.pdf Due Oct 15, 10 pm Name (1) (20 points) The figure below shows three
This is the homework from the data science. Please help 
HW3.pdf Due Oct 15, 10 pm Name (1) (20 points) The figure below shows three possible feature test for the root node of a decision tree to predict spam e-mail messages (based on the example discussed in class). (a) Calculate the expected information gain for each feature test (please show your formula) (b) Which feature test will be selected by the Decision Tree Learning algorithm? Subjecl Has Sender in Contact free money Yes Nei Ye Na Yes No (2) (20 points) Construct a regular expression as the value for the parameter token pattern so that CountVectorizer can extract hashtags, twitter user names (e.g., @realDonalTrumpl, and words from tweets as tokens. Explain how you construct the regular expression
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
