Question: ask expert The Virtuous Cycle of Data Mining 3 7 Defining the Inputs The data mining techniques described in this book automate the central core

ask expert The Virtuous Cycle of Data Mining
37
Defining the Inputs
The data mining techniques described in this book automate the central core of the model building process. Given a collection of input data fields, and a target field (in this case, purchase of the new product) they can find patterns and rules that explain the target in terms of the inputs. For data mining to succeed, there must be some relationship between the input variables and the target.
In practice, this means that it often takes much more time and effort to identify, locate, and prepare input data than it does to create and run the models, especially since data mining tools make it so easy to create models. It is impossible to do a good job of selecting input variables without knowledge of the business problem being addressed. This is true even when using data mining tools that claim the ability to accept all the data and figure out automatically which fields are important. Information that knowledgeable people in the industry expect to be important is often not represented in raw input data in a way data mining tools can recognize.
The wireless phone company understood the importance of selecting the right input data. Experts from several different functional areas including marketing, sales, and customer support met together with outside data mining consultants to brainstorm about the best way to make use of available data. There were three data sources available:
A marketing customer information file
A call detail database
A demographic database
The call detail database was the largest of the three by far. It contained a record for each call made or received by every customer in the target market. The marketing database contained summarized customer data on usage, tenure, product history, price plans, and payment history. The third database contained purchased demographic and lifestyle data about the customers.
Derived Inputs
As a result of the brainstorming meetings and preliminary analysis, several summary and descriptive fields were added to the customer data to be used as input to the predictive model:
Minutes of use
Number of incoming calls
Frequency of calls
Sphere of influence
Voice mail user flag
ask expert The Virtuous Cycle of Data Mining 3 7

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!