Question: This problem uses the dataset gameData.xlsx that contains real - time strategy game data from 3 0 0 players. The data contains the strategy of

This problem uses the dataset gameData.xlsx that contains real-time strategy game data from 300 players. The data contains the strategy of the players at the midpoint in the game. The column Soldier reports the number of lethal units a player has to attack an opponent, and the column Worker reports the number of workers to harvest economic resources. Larger numbers of soldiers suggest a strategy focused on security and larger number of workers suggest a strategy focused on economy.
Use this data for the following.
a)[10pt] Visualize the data with a scatter plot. Label the axes properly. Visually determine how many clusters you should look for in a clustering problem. (There is no single right answer here, but you need to justify your decision)
b)[20pt] Formulate a k-means clustering problem to identify the cluster centers for the number of clusters you picked in part a). Report screenshots of your setup from Excel/Python and describe your formulae.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!