Question: I need help cleaning a dataset

I need help cleaning a dataset. Please provide the code.

It can be downloaded from here: https://www.kaggle.com/tmdb/tmdb-movie-metadata/data

What I have done so far is below. Don't mind the imports; I will use the rest once I have a clean dataset.

from datetime import timedelta, date
import datetime
import numpy as np
import pandas as pd
import string
import re
import csv
import requests

# data from https://www.kaggle.com/tmdb/tmdb-movie-metadata/data
df_movies = pd.read_csv('tmdb_5000_movies.csv', delimiter=',', header=0, skipinitialspace=True)

# Drop the unneeded columns in one call. Each column is listed once:
# dropping 'id' a second time would raise a KeyError.
df_movies.drop(
    columns=['homepage', 'popularity', 'overview', 'status', 'tagline',
             'vote_average', 'vote_count', 'id'],
    inplace=True,
)

df_movies.head()

I want the 'genres' column to contain only the genre names (Action, Adventure, and so on). The same goes for 'production_company', 'production_country' and 'spoken_language'.
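One possible way to do this is sketched below. It assumes each of these cells holds a JSON string containing a list of objects with a "name" key, and that the CSV columns are actually named `genres`, `production_companies`, `production_countries` and `spoken_languages` (check `df_movies.columns` to confirm). The helper names `extract_names` and `flatten_json_columns` and the sample rows are made up for illustration.

```python
import json

import pandas as pd

# Assumed column names; adjust to whatever df_movies.columns reports.
JSON_COLUMNS = ["genres", "production_companies",
                "production_countries", "spoken_languages"]


def extract_names(cell):
    """Return the "name" values from a JSON list string, joined by ', '."""
    if not isinstance(cell, str):
        return ""  # missing value such as NaN
    try:
        items = json.loads(cell)
    except json.JSONDecodeError:
        return ""  # cell is not valid JSON
    if not isinstance(items, list):
        return ""
    return ", ".join(
        item["name"] for item in items
        if isinstance(item, dict) and "name" in item
    )


def flatten_json_columns(df, columns=JSON_COLUMNS):
    """Return a copy of df with each listed column reduced to plain names."""
    out = df.copy()
    for col in columns:
        if col in out.columns:  # skip columns this frame does not have
            out[col] = out[col].apply(extract_names)
    return out


# Made-up rows in the assumed cell format, only to show the behaviour.
sample = pd.DataFrame({
    "genres": [
        '[{"id": 28, "name": "Action"}, {"id": 12, "name": "Adventure"}]',
        "[]",
    ],
    "spoken_languages": [
        '[{"iso_639_1": "en", "name": "English"}]',
        '[{"iso_639_1": "fr", "name": "French"}]',
    ],
})
print(flatten_json_columns(sample))
```

On the real data this would be called as `df_movies = flatten_json_columns(df_movies)`. Joining the names into one string keeps each cell readable; keeping them as Python lists is an equally reasonable choice if you plan to filter on individual values later.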

Then I need you to remove all rows where 'spoken_language' is not English or en, create a separate column titled 'release_year' with just the year of the movie's release, and sort the data by 'release_year' and then 'revenue'.
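A minimal sketch of these three steps follows. It makes several assumptions: the language column is called `spoken_languages` and already holds plain comma-separated names or codes, the release date lives in a `release_date` column, a row counts as English if English is any one of its languages, and the sort is ascending. The helper names and sample rows are hypothetical.

```python
import pandas as pd


def keep_english(df, column="spoken_languages"):
    """Keep rows whose language cell lists 'English' or the code 'en'."""
    def mentions_english(cell):
        names = [part.strip().lower() for part in str(cell).split(",")]
        return "english" in names or "en" in names

    return df[df[column].fillna("").apply(mentions_english)]


def add_release_year(df, date_column="release_date"):
    """Return a copy of df with an integer 'release_year' column."""
    out = df.copy()
    dates = pd.to_datetime(out[date_column], errors="coerce")
    # Nullable Int64 keeps rows with a missing date as <NA>, not a float.
    out["release_year"] = dates.dt.year.astype("Int64")
    return out


# Made-up rows, only to show the behaviour.
sample = pd.DataFrame({
    "title": ["A", "B", "C", "D"],
    "spoken_languages": ["English", "French", "English, Spanish", "English"],
    "release_date": ["2009-12-10", "2010-01-01", "2009-05-01", None],
    "revenue": [300, 50, 100, 10],
})

cleaned = (
    add_release_year(keep_english(sample))
    .sort_values(["release_year", "revenue"])  # ascending; missing years last
    .reset_index(drop=True)
)
print(cleaned)
```

If only films spoken exclusively in English should stay, the check inside `keep_english` would need to compare the whole list instead. Pass `ascending=False` to `sort_values` for newest or highest-revenue first.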

Thanks!
