Question: Hello, I would like some help in solving these questions. Thank you in advance Rebecca works at Croissants4Ever, a internet startup that ships freshly-baked croissants
Hello, I would like some help in solving these questions. Thank you in advance

Rebecca works at Croissants4Ever, a internet startup that ships freshly-baked croissants to customers in various geographic regions in Illinois and surrounding states. Motivated by the surge of internet sales due to the pandemic, the startup is looking to expand into new geographic regions. Rebecca is tasked with building a model which can identify geographic regions in which Croissants/ Ever will be profitable. Rebecca compiled a training dataset (see table below) comprised of eleven geographic regions that are currently served by the company. avg. income pop. density ## competitors profitable low rural N yes low suburban no low suburban yes low urban no med suburban no med suburban yes med urban no HNONNON med urban yes high suburban yes high rural no high urban yes (a) Considering "profitable" as the target variable, which of the attributes would you select as the root in a decision tree that is constructed using the information gain impurity measure? (b) Use the Gini index impurity measure and construct the full decision tree for this data set. (c) Consider the following set of points as your test data. avg. income pop. density # competitors profitable low suburban yes med suburban yes med suburban yes med suburban OOHHOOK yes low rural no med suburban no high suburban yes med urban yes What is the accuracy of your decision tree built in part (b) on the test data
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
