Task 1: Load and clean the data 1. [5 pts] Load the data set as a...
Question:
Task 1: Load and clean the data
1. [5 pts] Load the data set as a DataFrame in Pandas. It is available at URL https://raw.githubusercontent.com/asukul/DS201/master/datasets/Lemonade2016-2.csv or https://lidicky.name/DS201/Lemonade2016-2.csv
2. Clean the data
   • [10 pts] Find and remove any duplicate rows in the data.
   • [10 pts] Find and resolve any missing values in the data, either by interpolating missing values that are in an obvious sequence or by filling each missing entry with the average value for its column (rounded to the nearest whole number).

Task 2: Add Derived Columns
(Note: a derived column is a new column whose values depend on other columns' values.)
1. [5 pts] Add a column named sales to the DataFrame, which calculates the total sales count of Lemon and Orange.
2. [5 pts] Add a column named Revenue to the DataFrame, which calculates the sales revenue by multiplying the total sales count by the price.
3. [5 pts] Calculate the total revenue for the summer, and make a note of this value.

Task 3: Create Charts for Data Exploration
Create the following 6 plots.
1. [10 pts] Create a line chart that shows Date and Revenue.
2. [10 pts] Create a scatter plot that shows Leaflets on the X-axis and sales on the Y-axis. Add axis titles to show which is Leaflets and which is sales, and add the title "Leaflets and Sales plot" to the plot.
3. [10 pts] Create a histogram that shows Revenue distributed into 10 bins. Give it the title "Revenue Histograms of Lemonade Sales" and note whether the distribution of this data is normal, left-skewed, or right-skewed.
4. [20 pts] Create two box-and-whisker plots in one figure.
   • The first one for Location (one box for Park, one for Beach) showing Total Sales values. Give it the title "All temperatures".
   • The second one for Location (one box for Park, one for Beach) showing Total Sales values, but include ONLY the days where the Temperature was at least 75. Give it the title "Temperatures >= 75".
   • Place them side by side with a common y-axis label. Label the y-axis as sales. Label the entire figure as "Sales and Location".
   • Each box must be labeled so it is clear which box is for Park and which is for Beach.
5. [10 pts] Create a seaborn pair plot (http://seaborn.pydata.org/generated/seaborn.pairplot.html) to show the relationship of the following columns: Revenue, Temperature, and Leaflets, grouped by Location.
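The two-panel box plot in Task 3, item 4 has the most layout requirements, so a minimal sketch of one way to arrange it may help. This is an illustration rather than the expected solution: the rows are made up, and the column names Location, Temperature, and sales are assumptions taken from the task wording. The helper `sales_by_location` is hypothetical.

```python
import matplotlib.pyplot as plt
import pandas as pd

# Made-up rows for illustration only; real values would come from the
# cleaned data set with the derived sales column already added.
df = pd.DataFrame({
    "Location": ["Park", "Park", "Park", "Beach", "Beach", "Beach"],
    "Temperature": [70, 76, 80, 72, 78, 82],
    "sales": [150, 170, 190, 160, 200, 230],
})

LOCATIONS = ["Park", "Beach"]


def sales_by_location(frame):
    """Return one list of sales values per location, in LOCATIONS order."""
    return [frame.loc[frame["Location"] == loc, "sales"].tolist()
            for loc in LOCATIONS]


# sharey=True gives both panels a common y-axis.
fig, (ax_all, ax_warm) = plt.subplots(1, 2, sharey=True, figsize=(8, 4))

# Left panel: every day in the data.
ax_all.boxplot(sales_by_location(df))
ax_all.set_xticks([1, 2])
ax_all.set_xticklabels(LOCATIONS)
ax_all.set_title("All temperatures")
ax_all.set_ylabel("sales")

# Right panel: only the days with Temperature of at least 75.
warm = df[df["Temperature"] >= 75]
ax_warm.boxplot(sales_by_location(warm))
ax_warm.set_xticks([1, 2])
ax_warm.set_xticklabels(LOCATIONS)
ax_warm.set_title("Temperatures >= 75")

fig.suptitle("Sales and Location")
```

In a notebook the figure renders at the end of the cell; in a script, `plt.show()` or `fig.savefig(...)` would be needed to see it.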
Expert Answer:
Task 1: Load and clean the data
import pandas as pd
# Load the data
url = "https://raw.githubusercontent.com/asukul/DS201/master/datasets/Lemonade2016-2.csv"
# Alternative ...
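Since the posted answer is cut off, here is a minimal sketch of how Tasks 1 and 2 could be approached. It is not the original answer. The column names (Lemon, Orange, Price, Leaflets, and so on) are assumptions based on the task wording, the helpers `clean` and `add_derived` are hypothetical, and the demo rows are made up so the example runs without network access. Only the mean-fill option is shown; interpolating values in an obvious sequence (such as a missing date) is not covered.

```python
import pandas as pd

# URL copied from the question; pass it to pd.read_csv to load the real data.
URL = "https://raw.githubusercontent.com/asukul/DS201/master/datasets/Lemonade2016-2.csv"


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop duplicate rows, then fill missing numeric values with the
    column mean rounded to the nearest whole number."""
    out = df.drop_duplicates().reset_index(drop=True)
    for col in out.select_dtypes(include="number").columns:
        if out[col].isna().any():
            out[col] = out[col].fillna(round(out[col].mean()))
    return out


def add_derived(df: pd.DataFrame) -> pd.DataFrame:
    """Add the sales and Revenue columns described in Task 2."""
    out = df.copy()
    out["sales"] = out["Lemon"] + out["Orange"]      # assumed column names
    out["Revenue"] = out["sales"] * out["Price"]
    return out


# Made-up rows for illustration: one exact duplicate and one missing value.
demo = pd.DataFrame({
    "Date": ["7/1/2016", "7/1/2016", "7/2/2016", "7/3/2016"],
    "Location": ["Park", "Park", "Park", "Beach"],
    "Lemon": [97, 97, 98, 110],
    "Orange": [67, 67, 67, 77],
    "Temperature": [70, 70, 72, 71],
    "Leaflets": [90.0, 90.0, None, 104.0],
    "Price": [0.25, 0.25, 0.25, 0.25],
})

prepared = add_derived(clean(demo))
total_revenue = prepared["Revenue"].sum()
print(prepared)
print("Total revenue:", total_revenue)
```

For the real data set, the same two helpers would be applied to `pd.read_csv(URL)` instead of `demo`. Missing values in non-numeric columns are left untouched by this sketch and would need separate handling.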