Question: This assignment has two parts: processing 2 csv files related to City Bike ridership for the same month in different years and comparing the two

This assignment has two parts: processing

2

csv files related to City Bike ridership for the

same month in different years and comparing the two files.

Part

1

Input: No input from the user required. The input file names to use in the script are NYC

-

CitiBike

-

Mar

17 .

csv

and NYC

-

CitiBike

-

Mar

23 .

csv

Please note:

file name has to be without its complete file path. For example: c:

/

stevens

/

624 /

NYC

-

CitiBike

-

Mar

17 .

csv is NOT right. If you change platform or move the file to another directory, the script

wouldn

t work. Be sure your script and your file are in the same directory.

No data structure management library

(

like pandas

)

should be used for this assignment.

The two files use a slightly different structure. In the Mar

17

one, user type is defined as "User Type",

and this can be "Subscriber"

(

meaning users with a subscription

)

or "Customer"

(

meaning

occasional users

) .

In the Mar

23

one, user type is defined as

"

member

_

casual" and this can be

"member"

(

meaning users with a subscription

)

or "casual"

(

meaning occasional users

) .

Output: Skip a line

/

print a blank line, then print per each file:

o The file has n lines

(

1

will be for Mar

17,

2

for Mar

23) .

Of those n:

Mar

17

: x

1

of them have "Customer" as usertype, y

1

have "Subscriber" as usertype. "Customer"

are z

1 %

of the total

Mar

23

: x

2

of them have "casual" as member

_

casual, y

2

have "member" as member

_

casual.

"casual" are z

2 %

of the total

Procedure:

1 .

Open the files.

2 .

Loop into the files.

3 .

For each file:

.

Count the number of lines

-

Note: skip the header. You will get

2

number: n

1

for Mar

17

and

2

for Mar

23 .

.

Count of the number of lines with:

(

Mar

17)

"Customer" as usertype

(

this is x

1)

(

Mar

23)

"casual" as member

_

casual

(

this is x

2) .

.

Count of the number of lines with:

" (

Mar

17) "

Subscriber" as usertype

(

this is y

1)

(

Mar

23)

"member" as member

_

casual

(

this is y

2) .

.

Calculate the z

1 %

of "Customer" in Mar

17 -

Calculate the z

2 %

of "casual" in Mar

23 .

.

Print: The file has n lines. Of those x are occasional users, y have a subscription. Occasional

users are z

%

of the total.

Part

2

Input: No input from the user required. Use the data from Part

1

Output: Comparison of the data in the

2

files. Considering the files are related to different periods, we want

to compare them to optimize the bike availability. In particular, because one file is related to data before the

pandemic while the other is after, we want to evaluate the impact of the pandemic on the ridership.

Procedure:

1 .

Check IF n

1 >

2

IF n

1 >

2,

print: The Mar

17

riders are more than the Mar

23

else, print: The Mar

23

riders are more or equal than the Mar

17

2 .

Check IF z

1 >

2

IF z

1 >

2,

print: Before the pandemic there were more occasional users than after the

pandemic

else, print: After the pandemic there were more or equal occasional users than before the

pandemic

3 .

Write a

1

page interpretation. The interpretation would be a narrative describing

/

explaining in plain

English the results of your Python script.

Submit the

3

parts as a single

.

py file via Canvas and the interpretation in a separate doc

/

pdf file

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Assignment Specification Description: This assignment has two parts: processing 2 csv files related to City Bike ridership for the same month in different years and comparing the two files. Part 1 :...

Assignment 5: Hash Table implementation and concordance There are three parts to this assignment. In the first two parts, you will complete the implementation of a hash map and a concordance program....

There are three parts to this assignment. In the rst two parts, you will complete the implementation of a hash map and a concordance program. In the third part, you will answer a number of questions...

Please read all the instruction, there should be at least 4 functions including the main function. This is the third time I am posting this question please help me. Lab lesson 9 has two parts. Part 2...

C++ at least 4 functions including the main function. Lab lesson 9 has two parts. Part 2 be making use of functions, pass by reference, and files. Part 2 is worth 65 points (55 points for passing the...

Lab lesson 9 has two parts. Part 2 be making use of functions, pass by reference, and files. Part 2 is worth 65 points (55 points for passing the tests and 10 for your code). Failure to meet the...

Please read all the instructions below. And please cose in C++, thank you. Lab lesson 9 has two parts. Part 2 be making use of functions, pass by reference, and files. Part 2 is worth 65 points (55...

I need to know how to solve and the answer of # 7 please help me ACC-330 Spring 2015 Case Assignment - in Two Parts Part I is worth 5% and Part II is worth 10% towards your final grade (a total of...

Find the type, transform to normal form, and solve, (Show the details of your work.) xuxy yuyy = 0

Chance Company had two operating divisions, one manufacturing farm equipment and the other office supplies. Both divisions are considered separate components as defined by generally accepted...

The country best known for providing financial assistance to young adults to help them move into their own places is: Germany Finland Sweden Norway

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

How do Data Types perform data validation?

How does Referential Integrity work?

What are Dimensional Relational Databases designed to hold primarily?