Question: Define functions with the following names: process _ word ( ) , process _ line ( ) , process _ file ( ) , find

Define functions with the following names: process

_

word

(),

process

_

line

(),

process

_

file

(),

find

_

unique

(),

find

_

frequency

(),

most

_

common

(),

remove

_

stop

(),

count

_

_

length

(),

and count

_

_

first

() .

Descriptions of each of these functions are provided below.

process

_

word

()

This function should accept a single parameter named word. This parameter is expected to contain a string representing

a word. The function should remove any punctuation from the string and convert it to lowercase. This can be done by

performing the following steps.

Store the string

' . ? ?, "' () *_

0123456789'

in a variable named remove. This string contains all of the

characters to be removed from the beginning and end of word, if they are present.

Use the strip

()

method for strings to remove punctuation and digits from the beginning and end of word.

Pass the method the string remove. Store the stripped string in a variable.

Use the replace

()

method on the string created in Step

1

to replace any single quote characters

(

likely

representing apostrophes

)

with an empty string. That is

,

replace occurrences of

"' "

with

" " .

Store the result.

Use the lower

()

method on the string created in Step

2

to convert it to lower case. Store the result.

The function should return the string created in Step

3 .

process

_

line

()

This function should accept a single parameter named line. This parameter is expected to contain a string representing

a line of text read from a file. The function should perform the following processing steps to the line:

Use the replace

()

method to replace any dash characters

" - "

with spaces, storing the result in a variable.

Apply the split

()

method to the string created in Step

1

to create a list of individual words contained within

the string. Store the resulting list in a variable named words.

Loop over the elements of words. Apply the process

_

word

()

function to each string in this list. It is possible

for the resulting processed word to be an empty string. If the processed word is not empty

(

in other words, if it

has a length greater than

0),

then store it in a list named processed

_

words.

The function should return the list processed

_

words.

process

_

file

()

This function should accept a single parameter named path. This parameter is expected to contain a string representing

the relative location of a text file. The function will create and return a list of processed words contained in the file by

performing the following tasks.

Use with and open

()

to open the file whose location is stored in path. Use readlines

()

to read the

contents of the file into a list. Each string in this list will represent an entire line of text from the file.

Create an empty list named words.

Loop over the list created in Step

1 .

Apply the process

_

line

()

function to each string in this list. The list of

words returned by process

_

line

()

should be concatenated to the end of the list words. The combined list

should be stored back into words. Recall that you can concatenate two lists using the

+

operator.

The function should return the list words.

find

_

unique

()

This function should accept a single parameter named words. This parameter is expected to contain a list of strings

representing words. The function should create a list that contains exactly one copy of any string that appears in words.

Create an empty list to store the unique words.

Loop over the elements of words. If a particular element has not already been added to the list of unique

words, then append it to that list. Do nothing if the element has already been added to the unique list.

The function should return the list of unique words.

find

_

frequency

()

This function should accept a single parameter named words. This parameter is expected to contain a list of strings

representing words. The function should create a dictionary recording the number of times each individual word appears

in words. Each dictionary key should be a string representing a word, and each value should be a count representing the

number of times that string appeared in words.

Create an empty dictionary named freq

_

dict to store the counts.

Loop over the elements of words. If a particular element has already been added to freq

_

dict as a key then

increment the value associated with that key. If the element does not appear as a key in freq

_

dict, then add

it as a key with a value of

1 .

The function should return the dictionary freq

_

dict.

Define functions with the following names: process_word (), process_line(), process_file(), find_unique(),

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Using Python I can't figure it out the rest of the problem. What am I doing wrong on the one I'm getting errors on? These are the processes I created in the script. This is what I'm supposed to do in...

I need help with Python. These are the functions created in a script. I don't need help with the functions. I need help with the problems that are done after the functions are created. I created the...

Define functions with the following names: process _ word ( ) , process _ line ( ) , process _ file ( ) , find _ unique ( ) , find _ frequency ( ) , most _ common ( ) , remove _ stop ( ) , count _ by...

Instructions for the Script File Define functions with the following names: process _ word ( ) , process _ line ( ) , process _ file ( ) , find _ unique ( ) , find _ frequency ( ) , most _ common ( )...

Instructions for the Script File Define functions with the following names: load _ puzzle ( ) , display _ puzzle ( ) , get _ next ( ) , get _ options ( ) , copy _ puzzle ( ) , and solve ( ) ....

I can't figure these python problems out. I posted the functions below to solve the problems. I ONLY NEED THE PROBLEMS SOLVED, NOT THE FUNCTIONS...

the bottom of the main loop (after getting user input), increment the current player. Then, if the number is too high, reset it to 0. Before printing whose turn it is, print the board using one of...

please use Python to do this thank you very much RSS Overview Many websites have content that is updated on an unpredictable schedule. News sites, such as Google News, are a good example of this. One...

Assignment 5: Hash Table implementation and concordance There are three parts to this assignment. In the first two parts, you will complete the implementation of a hash map and a concordance program....

What is a native XML database?

The 1HNMR spectrum of D-glucose in D2O exhibits two high-frequency (low-field) doublets. What is responsible for these doublets?

The speed of sound in a material is given as 1750 m/s. Is the material likely to be a solid, liquid or gas?

On December 31, 2015, Dow Steel Corporation had 690,000 shares of common stock and 39,000 shares of 9%, noncumulative, nonconvertible preferred stock issued and outstanding. Dow issued a 5% common...