Question: Module 1: What is Data Analytics? Project Proposal Instructions For your final project and presentation in this course, you will need to propose a dataset

Module 1: What is Data Analytics?

Project Proposal Instructions

For your final project and presentation in this course, you will need to propose a dataset to analyze. This proposal has four parts, in which you will choose your dataset, provide the background for that dataset including why you chose it and from where, provide information on the data itself, and provide an image from your imported data.

This assignment is due by 11:59 CT on Sunday.

Deliverables:

A word document containing writing, tables, and images from Part 1, Part 2, Part 3 and Part 4. Your chosen dataset

PART 1: CHOOSING A DATASET

For the project in this course you will need to choose a dataset to analyze. This dataset will need to meet the following requirements:

  • has at least 5 fields (at least three should be numeric)
  • has at least 100 rows
  • data is labeled

Recommended sites to find a dataset:

  • Kaggle
  • City of Chicago Data Portal

PART 2: DATASET BACKGROUND

Write a summary describing the topic of the dataset. Include: why you chose that dataset, where it came from, and what kind of problem you could solve with it.

PART 3: DATASET INFO

Provide information on the dimensions of the dataset. Include information on the fields and their data types. Be sure to state if they are continuous/numeric or categorical?

PART 4: IMPORT YOUR DATASET

Provide an image of the results of importing your dataset and using the head() function for printing.

Table of Contents Module 2: Getting Your Dataset Ready - Data Wrangling Module 2: Project Data Wrangling

Module 2: Project Data Wrangling

Previous Next

Instructions

For this portion of the project, you will examine your dataset for incorrect data. Any incorrect data should be removed, corrected, or imputed. Follow these steps:

  • Remove irrelevant data. If you are unsure if it is irrelevant, then keep it.
  • Remove duplicate records that are repeated.
  • Make sure numbers are interpreted as numerical data types.
  • Fix typos.
  • Standardize.
  • Investigate outliers.
  • Check and manage missing values.
  • Format and normalize data if needed.
  • Change categorical values into numbers if needed.

Once you have completed this, you will need to provide a Word document summarizing the pre-processing steps performed on your dataset.

Module 3: Project Exploratory Analysis

Previous Next

Instructions

In this assignment, you will perform an exploratory analysis that will allow you to get a feel for the data and start exploring potential relationships. This may include:

  • Descriptive statistics
  • Histograms
  • Bar charts
  • Heat maps
  • Line graphs
  • Box plots
  • Frequency tables

Once your analysis is complete, you will need to provide a Word document showing and describing the results of your exploratory analysis.

  1. Using your chosen dataset, reevaluate the heat map from the last module.
  2. Consider ways to perform a visual check to see if there is a relationship between fields.
  3. With this insight, develop a model using either linear regression or multiple linear regression.
  4. Report the intercepts, slope, model accuracy, output to predicted comparison, and a scatterplot with line portraying the model.

Once you complete these steps, you will need to provide a Word document showing and explaining the results of your model development.

After finishing Proposal create a final report of 5-6 pages

Use Python, Jupyter and show the visuals of the data analysis with introduction, conclusion

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!