Question: I have data set named air traffic passenger statistics which is a csv file of 1 5 0 0 6 rows and 1 7 columns
I have data set named air traffic passenger statistics which is a csv file of rows and columns of which all columns are
Index integer ex
Activity Period integer ex
Operating Airline String ex ATA Airlines
Operating Airline IATA Code String exTZ
Published Airline String exATA Airlines
Published Airline IATA Code String ex TZ
GEO Summary String ex Domestic
GEO Region String ex US
Activity Type Code String exDeplaned
Price Category Code String ex Low fare
Terminal string integer ex Terminal
Boarding Area char exB
Passenger Count integer ex
Adjusted Activity Type Code string integer ex ThruTransit
Adjusted Passenger Count integer ex
Year integer ex
Month string exjuly
Now I want to perform the clustering on the dataset, you can choose any different clustering methodalgorithm which is the best latest and provide the best result. Firstly, select any feature which would be best and appropriate and justify it using mathematical expression of evaluation that why you had chosen that amongst all the columns. secondly you have to find the cluster using the elbow method or whichever you feel the best is to taken for finding the cluster size dont take cluster size take more than that plot that in a welldefined labelled graph. Then perform the clustering on different algorithms, plot the scatter plot showing the formation of cluster, data points, centroid. Create a highlevel graph with proper labelling and then lastly find the Silhouette Score, CalinskiHarabasz index, and DaviesBouldin index score. Create a table showing which is the best amongst algo having row as algorithm name and column as score name also explain justify why that particular algorithm is the best.
Tip: You can access the above whole dataset from Kaggle by named Air Traffic Passenger statistics A New Look at an Old Problem
Please perform the highlevel clustering for all algorithms, as the clustering code which are available on the internet is simple, I have already implemented that but I need more in that more enhanced version of all the algorithms which shows more better clustering and output.
Note : please do not copy paste the online code or AIGPT code. Write your own logic code and enhance the all different algorithms.
Note : Provide all the explanation, all the code, table and its output.
Step by Step Solution
There are 3 Steps involved in it
1 Expert Approved Answer
Step: 1 Unlock
Question Has Been Solved by an Expert!
Get step-by-step solutions from verified subject matter experts
Step: 2 Unlock
Step: 3 Unlock
