Question: BUAN 6 2 0 Data Science Foundations Designed by: Dr . Roy Jafari Programing Assignmnet # 6 Fill in your name first: - Name: -
BUAN Data Science Foundations
Designed by: Dr Roy Jafari
Programing Assignmnet #
Fill in your name first:
Name:
Redlands Email ID:
In : import pandas as pd
import matplotlib.pyplot as plt
import numpy as
import seaborn as sns
Chapter Excercise
In this exercise, we would like to use the dataset Cereals.csv This dataset contains rows of information about different cereal products. We would like to perform clustering analysis on this dataset, first using Kmeans and then using PCA. Perform the following steps.
In : cerealdf pdreadcsvCerealscsv
cerealdfhead
tablename,type,calories,protein,fat,sodium,fiber,carbo,sugars,potass,vitamins,shelf,weight,cups,ratingran,ran,QAllBran,KAllBranwithExtraFiber,KAlmondDelight,NaN,
a Impute a central tendency of the attribute for all the missing values.
In :
b What central tendency did you choose and why?
In :
c Why did we impute using the central tendency? why not other methods? Answer by commenting on how the data will b e used next clustering
name,mfrtype,calories,protein,fat,sodium,fiber,carbo,sugars,potass,vitamins,shelf,weight,cups,rating
Bran,NC
NaturalBran,QC
AllBran,KC
AllBranwithExtraFiber,KC
AlmondDelight,RC
AppleCinnamonCheerios,GC
AppleJacks,KC
BasicGC
BranChex,RC
BranFlakes,PC
Cap'n'Crunch,QC
Cheerios,GC
CinnamonToastCrunch,GC
Clusters,GC
CocoaPuffs,GC
CornChex,RC
CornFlakes,KC
CornPops,KC
CountChocula,GC
Cracklin'OatBran,KC
CreamofWheatQuickNH
Crispix,KC
CrispyWheat&Raisins,GC
DoubleChex,RC
FrootLoops,KC
FrostedFlakes,KC
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
