Search and download datasets from the internet by topic Retail ( for example: kaggle
Question:
Search and download datasets from the internet by topic Retailfor example: kaggle datasets and so on Then determine what problems can be raised based on the dataset.
Carry out the Big Data Life Cycle process by answering and explaining questionsfollowing questions:
a Big Data Generation: Where did the data source in the dataset come from? include the link
b Data Aggregation & Data Preprocessing, which includes:
Data Integration: What data is combined?
Data Cleaning: How do you carry out the data cleaning processinappropriate data such as errors, inconsistencies, redundancies, and so on?
Data Reduction: Does your dataset need a data volume reduction process?
Data Transformation: Does your dataset need to be processed to change the data format into a suitable form for further analysis?
c Big Data Analytics: Select one of the analysis techniques used below, then select from that technique can be one or more jenistype used:
Quantitative analysis Nominal dataordinal datainterval dataratio data
Qualitative analysis Content analysisnarrative analysisdiscourse analysisframework analysisgrounded theory
Statistical analysis AB testingcorrelationregression
d Visualizing Big Data: Choose a visualization technique and then visualize the dataset after cycle asd c is done.
e Give a conclusion in the form of insight gained from discussing the topic andor data so that the problems raised at the beginning are answered in conclusion.