Question: This assignment is to load data from CSV, populate them in a SQLite database, and run in-database analytics with SQL. Data Source The data is
This assignment is to load data from CSV, populate them in a SQLite database, and run in-database analytics with SQL.
Data Source
The data is given in the CSV format (UCI_Credit_Card.csv) available to download from canvas) and the source and descriptions of the data in the CSV file is from the following site.
https://www.kaggle.com/uciml/default-of-credit-card-clients-dataset
According to the site:
This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to September 2005.
Write Python codeto do the following.
- Focus on the Marriage column of the datasheet.
- Count the total number of singles (Marriage=2 is single, marriage=3 (others) counted towards single).
- Count the total number of married (Marriage =1 is married)
- Count the total number of the column for default.payment.next.month for single (2 and 3) and for married (1).
- Divide the two numbers obtained in step 4 with number from step 2 and step 3. For example:
- Total number of single default.payment.next.month/total number of singles
- Total number of married default.payment.next.month/total number of married
- What does the two numbers tell you from step 5?
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
