Question: using python code solve the question The ads24x 7 is a Digital Marketing company which has now got seed funding of $10 Million. They are

 using python code solve the question The ads24x 7 is a

using python code solve the question

Digital Marketing company which has now got seed funding of $10 Million.

The ads24x 7 is a Digital Marketing company which has now got seed funding of $10 Million. They are expanding their wings in Marketing Analytics. They collected data from their Marketing Intelligence team and now wants you (their newly appointed data analyst) to segment type of ads based on the features provided. Use Clustering procedure to segment ads into homogeneous groups. The following three features are commonly used in digital marketing: CPM = (Total Campaign Spend / Number of Impressions) * 1,000. Note that thie Total Campaign Spend refers to the 'Spend' Column in the dataset and the Number of Impressions refers to the 'Impressions' Column in the dataset. CPC = Total Cost (spend) / Number of Clicks. Note that the Total Cost (spend) refers to the 'Spend' Column in the dataset and the Number of Clicks refers to the 'Clicks' Column in the dataset. CTR= Total Measured Clicks / Total Measured Ad Impressions x 100. Note that the Total Measured Clicks refers to the 'Clicks' Column in the dataset and the Total Measured Ad Impressions refers to the 'Impressions' Column in the dataset. The Data Dictionary and the detailed description of the formulas for CPM, CPC and CTR are given in the sheet 2 of the Clustering Clean ads_data Excel File. Perform the following in given order: - Read the data and perform basic analysis such as printing a few rows (head and tail), info, data summary, null values duplicate values, etc. - Treat missing values in CPC, CTR and CPM using the formula given. You may refer to the Bank_KMeans Solution File to understand the coding behind treating the missing values using a specific formula. You have to basically create an user defined function and then call the function for imputing. - Check if there are any outliers. - Do you think treating outliers is necessary for K-Means clustering? Based on your judgement decide whether to The Data Dictionary and the detailed description of the formulas for CPM, CPC and CTR are given in the sheet 2 the Clustering Clean ads_data Excel File. Perform the following in given order: - Read the data and perform basic analysis such as printing a few rows (head and tail), info, data summary, null values duplicate values, etc. - Treat missing values in CPC, CTR and CPM using the formula given. You may refer to the Bank_KMeans Solution File to understand the coding behind treating the missing values using a specific formula. You have to basically create an user defined function and then call the function for imputing. - Check if there are any outliers. - Do you think treating outliers is necessary for K-Means clustering? Based on your judgement decide whether to treat outliers and if yes, which method to employ. (As an analyst your judgement may be different from another analyst). - Perform z-score scaling and discuss how it affects the speed of the algorithm. - Perform clustering and do the following: - Perform Hierarchical by constructing a Dendrogram using WARD and Euclidean distance. - Make Elbow plot (up to n=10 ) and identify optimum number of clusters for k-means algorithm. - Print silhouette scores for up to 10 clusters and identify optimum number of clusters. - Profile the ads based on optimum number of clusters using silhouette score and your domain understanding [Hint: Group the data by clusters and take sum or mean to identify trends in clicks, spend, revenue, CPM, CTR, \& CPC based on Device Type. Make bar plots.] - Conclude the project by providing summary of your learnings

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!