Question: # Function 1 : Create a function called readStates: #Step 1 : Create a function ( named readStates ) to read a CSV file into

# Function

1

: Create a function called "readStates":

#Step

1

: Create a function

(

named readStates

)

to read a CSV file into R: within the Function

1

1 .

You need to read a URL, not a local file to your computer.

2 .

The file is a dataset on state populations

(

within the United States

) .

#Step

2

: Clean the dataframe: within Function

1

3 .

Note the issues that need to be fixed

(

removing columns, removing rows, changing column names

) .

4 .

Within your function, make sure there are

51

rows

(

one per state

+

the district of Columbia

) .

Make sure there are only

5

columns with the columns having the following names

(

stateName

,

Census, Estimates, Pop

2010,

Pop

2011) .

5 .

Make sure the last four columns are numbers

(

.

.

not strings

) .

#Step

3

: Store and explore the dataset: outside of Function

1

6 .

Store the dataset into a dataframe, called dfStates.

# When you run the following, it should print a clean dataframe. Please include the output of "dfStates" in the compiled file by running dfStates as below.

dfStates

< -

readStates

(

urlToRead

)

dfStates

7 .

Test your dataframe by calculating the mean for the

2011

data, by doing

(

include your output

)

mean

(

dfStates$Pop

2011)

# You should get an answer of

6, 109, 645

#Step

4

: Find the state with the highest population: outside the Function

1

8 .

Based on the

2011

data, what is the population of the state with the highest population? What is the name of that state, and what is the value of the population?

9 .

Sort the data, in increasing order, based on the

2011

data.

# Function

2

: Create a function called "Distribution"

#Step

5

: Explore the distribution of the states: You need to create a new function called "Distribution"

10 .

You will write a function to calculate percentage of states that have population that is lower than the average.

The function

(

function name: "Distribution"

)

takes two parameters. The first is a vector and the second is a number. For example, Distribution

< -

function

(

vector

,

number

) .

# The function will return the percentage of elements within the vector that is less than the number

(

.

.

cumulative distribution below the value provided

) .

(1)

Think about this: You only keep the elements within the vector that are less than the number, and store the number of eligible elements into the variable "count". Populate XXXX to complete this line of code:

count

< -

length

(

vector

[

XXXX

])

(2)

Then, you will calculate the percentage and return the results. Populate XXXX to complete this line of code:

return

((

count

/

XXXX

) * 100)

(3)

Test the function with the vector

dfStates$Pop

2011,

and the mean of

dfStates$Pop

2011 . * * *

you should get

66.66667

as a result.

table with row headers in column A and column headers in rows

3

through

4 . (

leading dots indicate sub

-

parts

)

Table

1 .

Annual Estimates of the Population for the United States, Regions, States, and Puerto Rico:

April

1, 2010

to July

1, 2011

Geographic Area

1 -

Apr

- 10

Population Estimates

(

as of July

1)

Census Estimates Base

2010 2011

United States ######## ######## ######## ######## Northeast ######## ######## ######## ######## Midwest ######## ######## ######## ######## South ######## ######## ######## ######## West ######## ######## ######## ########

.

Alabama

4, 779, 736 4, 779, 735 4, 785, 401 4, 802, 740 .

Alaska

710, 231 710, 231 714, 146 722, 718 .

Arizona

6, 392, 017 6, 392, 013 6, 413, 158 6, 482, 505 .

Arkansas

2, 915, 918 2, 915, 921 2, 921, 588 2, 937, 979 .

California ######## ######## ######## ########

.

Colorado

5, 029, 196 5, 029, 196 5, 047, 692 5, 116, 796 .

Connecticut

3, 574, 097 3, 574, 097 3, 575, 498 3, 580, 709 .

Delaware

897, 934 897, 934 899, 792 907, 135 .

District of Columbia

601, 723 601, 723 604, 912 617, 996 .

Florida ######## ######## ######## ########

.

Georgia

9, 687, 653 9, 687, 660 9, 712, 157 9, 815, 210 .

Hawaii

1, 360, 301 1, 360, 301 1, 363, 359 1, 374, 810 .

Idaho

1, 567, 582 1, 567, 582 1, 571, 102 1, 584, 985 .

Illinois ######## ######## ######## ########

.

Indiana

6, 483, 802 6, 483, 800 6, 490, 622 6, 516, 922 .

Iowa

3, 046, 355 3, 046, 350 3, 050, 202 3, 062, 309 .

Kansas

2, 853, 118 2, 853, 118 2, 859, 143 2, 871, 238 .

Kentucky

4, 339, 367 4, 339, 362 4, 347, 223 4, 369, 356 .

Louisiana

4, 533, 372 4, 533, 372 4, 545, 343 4, 574, 836 .

Maine

1, 328, 361 1, 328, 361 1, 327, 379 1, 328, 188 .

Maryland

5, 773, 552 5, 773, 552 5, 785, 681 5, 828, 289 .

Massachusetts

6, 547, 629 6, 547, 629 6, 555, 466 6, 587, 536 .

Michigan

9, 883, 640 9, 883, 635 9, 877, 143 9, 876, 187 .

Minnesota

5, 303, 925 5, 303, 925 5, 310, 658 5, 344, 861 .

Mississippi

2, 967, 297 2, 967, 297 2, 970, 072 2, 978, 512 .

Missouri

5, 988, 927 5, 988, 927 5, 995, 715 6, 010, 688 .

Montana

989, 415 989, 415 990, 958 998, 199 .

Nebraska

1, 826, 341 1, 826, 341 1, 830, 141 1, 842, 641 .

Nevada

2, 700, 551 2, 700, 551 2, 704, 283 2, 723, 322 .

New Hampshire

1, 316, 470 1, 316, 472 1, 316, 807 1, 318, 194 .

New Jersey

8, 791, 894 8, 791, 894 8, 799, 593 8, 821, 155 .

New Mexico

2, 059, 179 2, 059, 180 2, 065, 913 2, 082, 224 .

New York ######## ######## ######## ########

.

North Carolina

9, 535, 483 9, 535, 475 9, 560, 234 9, 656, 401 .

North Dakota

672, 591 672, 591 674, 629 683, 932 .

Ohio ######## ######

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

Unit 5 JavaScript - Dom Event and Dom Elements - Just Need JavaScript code - NEED HELP ON STEPS 1 THROUGH 16 PLEASE . (Note: I have added the handout below as a point of reference. ) CIS 198:...

Unit 5 JavaScript - Dom Event and Dom Elements - Just Need JavaScript code - Help Please. CIS 198: JavaScript Learning Unit 5: Activity Lab Activity 1. Create a folder called Quiz. 2. Within the...

Lab 4: Cleaning/Munging Dataframes # Function 1: Create function called "readStates": #Step 1: Create function (named readStates) to read a CSV file into R:within the Function 1 # Q1 . You need to...

1. In the file function-declaration.js comment out past code and follow the video to create the function calculateTax and return a value: Use the value 0.10 as the tax rate, i.e. 10% OST function...

Could mostly use help on step 4 and 5 but if could show all the work for the other steps would be very helpful as well. In this lab, you need to read in a dataset and work on that (in a dataframe)....

CIS 162 Project 4 Baby Names Database Due Date at the start of class on November 14, 2017 Before Starting the Project Read sections 8.1 - 8.5 about ArrayLists Read this entire project description...

someone answered the A1, B1, i need the other remaining parts from 3 to 5 3 of 5 sem, store each in a point variable and print them all. Again, this main() is also for unit testing. B. establish the...

The Language is Java. Please post code for the following steps 17-23. Thank You in advance. Scenario - OraclProduction OraclProduction Ltd are specialists in producing production line manufacturing...

PLEASE DO NOT SPAM THE QUESTION I WILL AUTOMATICALLY DISLIKE IT (THANKS) Build a script (Python) that reads the saved csv called RiskResults.csv .This requires using a matrix-like data structure...

Please provide help in Python! Lab 7 involves some problem solving so you can choose to work with one partner if you like. If you work with one partner, make sure both partners work together so both...

1 (a) Study Fig. 6, which shows a cross section of an area affected by an earthquake. fault plates seismic waves Fig. 6 (1) Tick () the one statement in the table below which is the correct...

A solar cell measures 10 inches by 10 inches, and it has 4 non-overlapping regions that generate electricity. Each of these regions is 4 inches by 4 inches in size (there is a border between the...

Recognizing cost of goods sold in the same period as the related sale of goods is example of:

The first production department of Stone Incorporated reports the following for April. Units Direct Materials Conversion Percent Complete Percent Complete Beginning work in process inventory 60,000 60