Question: USING R STUDIO Cleaning data You were hired as an intern in a social media think tank. As your first assignment, you were given Tik_Tok_data.csv

USING R STUDIO

Cleaning data

You were hired as an intern in a social media think tank. As your first assignment, you were given Tik_Tok_data.csv data set. Your supervisor needs to produce some scatter plots to give an insight as to what affects the popularity of some videos. Unfortunately, the data was scraped in a human readable format and not a machine readable format. Thats where you come in. On your resume you mentioned taking Data Analysis class and mastering R. Use your regex skills to clean the data set. Make sure that video duration is measured in seconds. Run a regression with views as the dependent variable and followers, likes, comments, shares and duration as the independent variables. Interpret your results. If the video has Share as a value for the number of shares, it means that the video got no shares. If instead of the view count number, it says Participating in this. . . , that means that the video went private. Make sure to comment the code you used to clean the data and write a note explaining what decisions you made when dealing with missing/ambiguous data. Calculate the total number of views, likes and comments gained by each user. Produce a publication-quality table that has all of the users, ordered by the total number of views. Please use the tidyverse.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!