Data mining is the process of using sophisticated statistical techniques to extract and analyze data from large

Question:

Data mining is the process of using sophisticated statistical techniques to extract and analyze data from large databases to discern patterns and trends that were not previously known. The patterns we find are correlations, but not necessarily causation.
Every time we see a link between the occurrence of one event or action with another, we generally assume that one event or action caused the other. But simply linking one occurrence with another does not always prove that the result has been caused by the other.
For example, such data mining might suggest the following relationship between arcade revenue and computer science doctorates awarded in the United States, with a correlation of 0.985.

Total revenue generated by arcades
correlates with
Computer science doctorates awarded in the United States
Correlation: .% (r = .)

2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 $2 billion 2000 degrees 1500 degrees $1.5 billion 1000 degrees 500 deg


a. Is this simply a correlation? Or is there a possibility that computer science doctorates affect total revenue generated by arcades or vice versa? Explain your answer.

Number of people who drowned by falling into a pool
correlates with
Films Nicolas Cage appeared in
Correlation: .% (r = .)

1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 140 drownings L 16 films 120 drownings 4 films 100 drownings 2 fi


b. Is this simply a correlation? Or is there a possibility that the number of people who drowned was affected by the films Nicolas Cage appeared in? Explain your answer.

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Related Book For  book-img-for-question

Accounting Information Systems

ISBN: 978-1260153156

2nd edition

Authors: Vernon Richardson, Chengyee Chang, Rod Smith

Question Posted: