Question: Consider the traffic accident data set shown in Table 7.1. (a) Show a binarized version of the data set. (b) What is the maximum width
(a) Show a binarized version of the data set.
(b) What is the maximum width of each transaction in the binarized data?
(c) Assuming that support threshold is 30%, how many candidate and frequent itemsets will be generated?
Table 7.2. Traffic accident data set
.png)
(d) Create a data set that contains only the following asymmetric binary attributes: (Weather = Bad, Driver's condition = Alcohol-impaired, Traffic violation = Yes, Seat Belt = No, Crash Severity = Major). For Traffic violation, only None has a value of 0. The rest of the attribute values are assigned to 1. Assuming that support threshold is 30%, how many candidate and frequent itemsets will be generated?
(e) Compare the number of candidate and frequent itemsets generated in parts (c) and (d).
011001000001 el Y 0 1 1 1 0 1 1 1 0 0 1 1 el-i 0 0 0 1 0 0 0 1 1 0 0 1-0 0 0 0 1 0 0 1 0 1 0 0 0001001000001 010000101000 100100000010 011110010101 100001101010 010010100101 101101011010
Step by Step Solution
3.53 Rating (170 Votes )
There are 3 Steps involved in it
a See Table 72 b 5 c The number of candidate itemsets from siz... View full answer
Get step-by-step solutions from verified subject matter experts
Document Format (1 attachment)
908-M-S-D-A (8656).docx
120 KBs Word File
