Question: Please help me with this project. Need to select quality data set from given link. Can't understand which dataset would be good for the project.

Please help me with this project. Need to select quality data set from given link. Can't understand which dataset would be good for the project.


CST 8390 Project Part 1- Project Report Need to select a data set from the following online portals ( a csv le, not an ARFF file): htt s: data.texas. ov htt s: o en.ottawa.ca When to select dataset, make sure that there are at least 10 relevant attributes (including the ones that you can extract or create) and 100 instances in it. Need to clean the data, remove outliers and then run the algorithms ( using kNN, Clustering, Decision Tree, Clustering kMeans, Association, and Regression) on the data. For numeric attributes, can calculate min, max, average, standard deviation, and co-variance/ correlation. If the attributes are nominal, then can calculate the frequency of each label. If the attributes are of mixed (numeric and nominal) type, then can convert numeric to nominal using lters or using some other meaningful translations (for example, convert numbers to ranges like \"low\Should frame a question that want to answer by your analysis. This question should be written on the bottom of the cover page. This question cannot be easily answered using Excel (which means question should be dependent on more than 3 factors). Need to have 5 main sections , Data collection (with the source link), Preprocessing, Data Analysis, Results, and Conclusion. Report should have a cover page (with names of both students and student numbers), table of contents, tables, pictures etc., introduction, conclusion and references
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
