Question:

Step 1: Clean the data by
* removing rows whose StockCode or Invoice values contain non-digit characters
* removing rows whose Price values are less than 10
* removing rows whose country value is not one of "United Kingdom", "Italy", "France", "Germany", "Norway", "Finland", "Austria", "Belgium", "European Community", "Cyprus", "Greece", "Iceland", "Malta", "Netherlands", "Portugal", "Spain", "Sweden", or "Switzerland"
* removing rows whose quantity values are negative
* trimming the description values using the string strip function
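
The cleaning rules in Step 1 can be sketched in plain Python as follows. This is a minimal, library-free sketch rather than the original solution. It assumes each row is a dict with the keys Invoice, StockCode, Description, Quantity, Price, and Country; those key names and the sample rows are made up for illustration. It also assumes "non-digit" means any character outside 0-9.

```python
import re

# Countries kept by the question's third rule.
ALLOWED_COUNTRIES = {
    "United Kingdom", "Italy", "France", "Germany", "Norway", "Finland",
    "Austria", "Belgium", "European Community", "Cyprus", "Greece",
    "Iceland", "Malta", "Netherlands", "Portugal", "Spain", "Sweden",
    "Switzerland",
}

DIGITS_ONLY = re.compile(r"[0-9]+")


def clean_rows(rows):
    """Return the rows that pass the Step 1 filters, with descriptions trimmed."""
    cleaned = []
    for row in rows:
        # Drop rows whose Invoice or StockCode contains a non-digit character.
        if not DIGITS_ONLY.fullmatch(str(row["Invoice"])):
            continue
        if not DIGITS_ONLY.fullmatch(str(row["StockCode"])):
            continue
        # Drop rows priced below 10.
        if float(row["Price"]) < 10:
            continue
        # Drop rows from countries outside the allowed list.
        if row["Country"] not in ALLOWED_COUNTRIES:
            continue
        # Drop rows with a negative quantity (zero is kept).
        if int(row["Quantity"]) < 0:
            continue
        # Keep a copy of the row with the description trimmed via str.strip.
        cleaned.append({**row, "Description": str(row["Description"]).strip()})
    return cleaned


# Made-up rows: only the first one survives every filter.
base = {"Invoice": "1001", "StockCode": "20001", "Description": "  WIDGET ",
        "Quantity": 2, "Price": 12.5, "Country": "France"}
sample = [
    base,
    {**base, "Invoice": "C1002"},      # non-digit character in Invoice
    {**base, "StockCode": "20001A"},   # non-digit character in StockCode
    {**base, "Price": 9.99},           # price below 10
    {**base, "Country": "Japan"},      # country not in the allowed list
    {**base, "Quantity": -1},          # negative quantity
]
cleaned = clean_rows(sample)
```

The same five filters map onto boolean masks if the data is held in a DataFrame instead of a list of dicts.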
Step 2: Find the frequent itemsets with min_support = 0.01
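
To make Step 2 concrete, here is a small level-wise (Apriori-style) search written in plain Python. It is a sketch under stated assumptions, not a library implementation. A basket is taken to be the set of items sharing one Invoice value. The item is taken to be the trimmed Description, which the question does not specify. Support is the fraction of baskets that contain every item of an itemset. The helper names and the toy baskets are hypothetical.

```python
from itertools import combinations


def build_baskets(rows, item_key="Description"):
    """Group rows into one set of items per invoice (key names are assumed)."""
    baskets = {}
    for row in rows:
        baskets.setdefault(row["Invoice"], set()).add(row[item_key])
    return list(baskets.values())


def frequent_itemsets(baskets, min_support=0.01):
    """Return {frozenset(items): support} for itemsets meeting min_support."""
    baskets = [frozenset(b) for b in baskets]
    n = len(baskets)
    if n == 0:
        return {}

    def support(itemset):
        # Fraction of baskets that contain every item of the itemset.
        return sum(itemset <= basket for basket in baskets) / n

    # Level 1: frequent single items.
    singles = {frozenset([item]) for basket in baskets for item in basket}
    current = {s: support(s) for s in singles}
    current = {s: v for s, v in current.items() if v >= min_support}

    result = {}
    k = 1
    while current:
        result.update(current)
        k += 1
        # Join step: unite frequent (k-1)-itemsets that differ in one item.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        next_level = {c: support(c) for c in candidates}
        current = {c: v for c, v in next_level.items() if v >= min_support}
    return result


# Toy baskets with a high threshold so the result is easy to check by hand.
toy = [{"A", "B"}, {"A", "B", "C"}, {"A"}, {"B", "C"}]
found = frequent_itemsets(toy, min_support=0.5)
```

On the toy baskets, {A, C} appears in only one of four baskets, so it falls below the 0.5 threshold. That also rules out {A, B, C} at the prune step without counting it.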
Step 3: Find the association rules with confidence greater than 10%. Among them, which rule(s) have the highest lift?
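
Step 3 can be sketched the same way. For a rule A -> C, confidence is support(A and C together) / support(A), and lift is support(A and C together) / (support(A) * support(C)). The plain-Python sketch below assumes its input is a dict mapping each frequent itemset (as a frozenset) to its support, with every subset of a frequent itemset also present. The function names and the toy supports are hypothetical.

```python
import math
from itertools import combinations


def rules_from_supports(supports, min_confidence=0.10):
    """List rules whose confidence is strictly greater than min_confidence."""
    rules = []
    for itemset, supp in supports.items():
        if len(itemset) < 2:
            continue
        # Every non-empty proper subset of the itemset can be an antecedent.
        for size in range(1, len(itemset)):
            for combo in combinations(itemset, size):
                antecedent = frozenset(combo)
                consequent = itemset - antecedent
                confidence = supp / supports[antecedent]
                if confidence > min_confidence:
                    lift = supp / (supports[antecedent] * supports[consequent])
                    rules.append({
                        "antecedent": antecedent,
                        "consequent": consequent,
                        "support": supp,
                        "confidence": confidence,
                        "lift": lift,
                    })
    return rules


def highest_lift(rules):
    """Return every rule tied for the maximum lift."""
    if not rules:
        return []
    best = max(rule["lift"] for rule in rules)
    return [rule for rule in rules if math.isclose(rule["lift"], best)]


# Toy supports, chosen so the arithmetic can be checked by hand.
toy_supports = {
    frozenset({"A"}): 0.75,
    frozenset({"B"}): 0.75,
    frozenset({"C"}): 0.5,
    frozenset({"A", "B"}): 0.5,
    frozenset({"B", "C"}): 0.5,
}
rules = rules_from_supports(toy_supports, min_confidence=0.10)
top = highest_lift(rules)
```

Lift is symmetric in antecedent and consequent, so A -> C and C -> A always share the same lift. If both directions pass the confidence filter, the top rules come in pairs, which is why the question asks for "rule(s)".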
