4. A local retailer has a database that stores 10,000 transactions from last summer. After analyzing the data, a data science team has identified the following statistics:

{battery} appears in 6,000 transactions.
{sunscreen} appears in 5,000 transactions.
{sandals} appears in 4,000 transactions.
{bowls} appears in 2,000 transactions.
{battery, sunscreen} appears in 1,500 transactions.
{battery, sandals} appears in 1,000 transactions.
{battery, bowls} appears in 250 transactions.
{battery, sunscreen, sandals} appears in 600 transactions.

Answer the following questions:

a. What are the support values of the preceding itemsets?
b. Assuming the minimum support is 0.05, which itemsets are considered frequent?
c. What are the confidence values of {battery} → {sunscreen} and {battery, sunscreen} → {sandals}? Which of the two rules is more interesting?
d. List all the candidate rules that can be formed from the statistics. Which rules are considered interesting at the minimum confidence 0.25? Out of these interesting rules, which rule is considered the most useful (that is, least coincidental)?
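The arithmetic behind parts a through d can be sketched in a few lines of Python. This is a minimal illustration, not an official solution: the names `counts`, `support`, `candidate_rules`, `interesting`, and `best` are made up for this example, and it assumes that "least coincidental" is meant to be judged by lift (confidence divided by the support of the consequent). Rules whose antecedent count is not among the given statistics, such as {sunscreen, sandals} → {battery}, are skipped because their confidence cannot be computed from the data provided.

```python
from itertools import combinations

N = 10_000  # total number of transactions

# Transaction counts taken directly from the problem statement.
counts = {
    frozenset({"battery"}): 6000,
    frozenset({"sunscreen"}): 5000,
    frozenset({"sandals"}): 4000,
    frozenset({"bowls"}): 2000,
    frozenset({"battery", "sunscreen"}): 1500,
    frozenset({"battery", "sandals"}): 1000,
    frozenset({"battery", "bowls"}): 250,
    frozenset({"battery", "sunscreen", "sandals"}): 600,
}

def support(itemset):
    """Fraction of all transactions that contain the itemset."""
    return counts[frozenset(itemset)] / N

# (a) support of every listed itemset
supports = {itemset: support(itemset) for itemset in counts}

# (b) frequent itemsets at minimum support 0.05
MIN_SUPPORT = 0.05
frequent = [s for s, v in supports.items() if v >= MIN_SUPPORT]

def candidate_rules():
    """Build every rule X -> Y whose confidence can be computed from the given counts."""
    rules = []
    for itemset, count in counts.items():
        if len(itemset) < 2:
            continue
        for r in range(1, len(itemset)):
            for lhs in combinations(sorted(itemset), r):
                lhs = frozenset(lhs)
                rhs = itemset - lhs
                if lhs not in counts:
                    continue  # antecedent count was not given
                conf = count / counts[lhs]
                # Lift needs the consequent's support; None if it was not given.
                lift = conf / support(rhs) if rhs in counts else None
                rules.append((lhs, rhs, conf, lift))
    return rules

rules = candidate_rules()

# (c) and (d) rules meeting minimum confidence 0.25
MIN_CONFIDENCE = 0.25
interesting = [rule for rule in rules if rule[2] >= MIN_CONFIDENCE]

# (d) most useful rule, assuming lift is the ranking measure
best = max(interesting, key=lambda rule: rule[3])

for lhs, rhs, conf, lift in interesting:
    print(sorted(lhs), "->", sorted(rhs), f"confidence={conf:.3f}", f"lift={lift:.3f}")
```

Under these assumptions, {battery, bowls} (support 0.025) is the only itemset below the 0.05 threshold. For part c, {battery} → {sunscreen} has confidence 1,500 / 6,000 = 0.25 and {battery, sunscreen} → {sandals} has confidence 600 / 1,500 = 0.4, so the second rule is the more interesting of the two. Among the rules that reach confidence 0.25, {battery, sandals} → {sunscreen} ranks first with confidence 0.6 and lift 1.2, and it is the only one with lift above 1.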
