Question: Could mostly use help on step 4 and 5 but if could show all the work for the other steps would be very helpful as

Could mostly use help on step 4 and 5 but if could show all the work for the other steps would be very helpful as well.

In this lab, you need to read in a dataset and work on that (in a dataframe). Then, we will explore the distribution within the dataset.

Step 1: Create a function (named readStates) to read a CSV file into R

You need to read a URL, not a local file to your computer.

The file is a dataset on state populations (within the United States).

The URL is:

http://www2.census.gov/programs-surveys/popest/tables/2010-2011/state/totals/nst-est2011-01.csv (Links to an external site.)Links to an external site.

Step 2: Clean the dataframe

Note the issues that need to be fixed (removing columns, removing rows, changing column names).

Within your function, make sure there are 51 rows (one per state + the district of Columbia). Make sure there are only 5 columns with the columns having the following names (stateName, Census, Estimates, Pop2010, Pop2011).

Make sure the last four columns are numbers (i.e. not strings).

Step 3: Store and explore the dataset

Store the dataset into a dataframe, called dfStates.

Test your dataframe by calculating the mean for the 2011 data, by doing:

mean(dfStates$Pop2011)

***you should get an answer of 6,109,645

Step 4: Find the state with the highest population

Based on the 2011 data, what is the population of the state with the highest population? What is the name of that state?

Sort the data, in increasing order, based on the 2011 data.

Step 5: Explore the distribution of the states

Write a function (function name: "Distribution") that takes two parameters. The first is a vector and the second is a number. For example, Distribution <- function(vector, number). This step is just a setup for the following instruction.

The function will return the percentage of elements within the vector that is less than the number (i.e. cumulative distribution below the value provided). For example, (1) only keep the elements within the vector that are less than the number, and store the number of eligible elements into the variable "count": count <- length(vector[vector

Test the function with the vector dfStates$Pop2011, and the mean of dfStates$Pop2011. *** you should get 0.6666667 as a result

There are many ways to write this function (described in point 10) so please try to write multiple versions of this function which do you think is best?

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

can someone help with the blue reader project, please? I have the journal entries I need help with journal ledger and trial balance so I can I do the financial statements. thanks can someone help me...

Blue Raider Adventure Park ACTG 2110 - Principles of Accounting Accounting Cycle Packet Serial Problem Covering Chapters 2-4 (Spring 2020) INTRODUCTION: Matt Lapinski, an MTSU student, consistently...

Each of these steps is explained in detail in Chapters 1-4 of our textbook. Step 1, Analyze transactions, is covered in Chapter 1. While this step is very important, it is a thought process, the...

i need help on how to list these for a general ledger, general journal, and following questions please! Already made an income statement, statement of owners equity, and adjusted trial balance. Blue...

v Lab Assignment 2: Making Exact Change at Sub Assignment Due Monday by 11:59pm Points 25 Submitting a filc upload In this assignment you will practice practical control structures. If you can store...

I have completed the General Journal. I just need help doing the General Ledger. August 1 Matt opens a business checking account in the name of BRAP and makes an initial deposit of $5,000 Matt had...

how the cash here become 1135? because if you check my trial balance the cash is 8810. and i can I get help in step 8,9? and I provide u my journal entry this is the steps 8,9 From the information...

A. Lab # BSBA BIS245A-3 B. Lab 3 of 7: Database Design Using Visio and Based on Data Requirements and Business Rules C. Lab Overview--Scenario/Summary COs: 2. Given a situation containing entities,...

Problem Solving and Decision-Making Introduction The Board of Directors of Bright Road Health Care System is considering which electronic health record (EHR) system to use and how to implement the...

APPLICATION FOR HEALTHCARE QUANTITATIVE RESEARCH ASSIGNMENT INSTRUCTIONS OVERVIEW The assignment will provide practical knowledge in determining cost containment is the predominant challenge for...

1-9 The length of the mercury column in a certain mercury-in-glass thermometer is 5.00 cm when the thermometer is in contact with water at its triple point. Consider the length of the mercury column...

Can an education expense incurred by one taxpayer be deductible whereas the same expense incurred by another taxpayer is not deductible? Explain.

Banks manage liquidity risk mainly through the following way except Question 1 Select one: a . establishing diverse sources of funds and issuing longer - term securities b . lending in money markets...

Question 6 Process costing is applicable to production operations that Select one: a. utilize several processes, departments, or work cells in a series. b. do not assign overhead costs to operations....

1. I dream of being so good at what I do that my expert advice will be sought continually.

2. Why?

1. Where do these biases come from?