Question: Part 2: Implement a Basic File/Text Processing Python Program

Part 2: Implement a Basic File/Text Processing Python Program. You will use Visual Studio Code to create a Python program that reads three (3) files, combines them together as a single piece of text, and then saves the combined text into a single new file. You will obtain the three source text files from the instructor's GitHub repository.
Before the file is saved, all text should be converted to lowercase and the following "stop words" removed from the text: "a", "for", "how", and "the".
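The cleaning step described here could be sketched roughly as follows. This is only a minimal illustration, not the instructor's reference solution: the names STOP_WORDS and clean_text are hypothetical, and it assumes words are separated by whitespace, so a stop word with punctuation attached (such as "the,") is left in place and line breaks are collapsed into single spaces.

```python
# Hypothetical helper: lowercase the text and drop the four stop words.
# Assumption: words are whitespace-separated; punctuation is not stripped.
STOP_WORDS = {"a", "for", "how", "the"}


def clean_text(text):
    """Return text in lowercase with whole-word stop words removed."""
    words = text.lower().split()
    kept = [word for word in words if word not in STOP_WORDS]
    return " ".join(kept)


print(clean_text("How the program reads A file for the index"))
# program reads file index
```

Matching whole words rather than calling str.replace avoids damaging words that merely contain a stop word, for example "format" or "theory".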
"Stop words" are words in text that are often removed when going through the process of "indexing" documents. The indexing process attempts to determine what the subject matter of the document is, so what matters are the "terms" in the text: the nouns and verbs.

Create a directory structure for the project. Use the command line to create a directory named "inet4031project" inside your user's "Documents" directory. Navigate to the "inet4031project" directory using the command line. To change directories on the command line, use the "cd" command: cd inet4031project

Once you are in the "inet4031project" directory, clone the following GitHub repository: (I have the link for the repo that needs to be cloned; let me know if I should add it, but I think you get the idea.) This repository contains the text files the Python program will read and process. Inside the repo should be three files: D1.txt, D2.txt, D3.txt.

Continued: Use the command line to create another directory named "sourcefiles". Copy the D1, D2, and D3 text files from the "inet4031documentrepo" directory to the "sourcefiles" directory using a single "copy" command that copies all the files at once. Your program should output the processed file that contains the combined text and save it in another directory named "processedfiles". This directory should be located in the "inet4031project" directory. Create the "processedfiles" directory using the command line.

Part 3 is what needs to be done. Part 2 has already been done, so it is posted only for reference. Here is Part 3:

From the command line, start the JupyterLab Notebook server on your laptop and create a new notebook (which will be stored as an .ipynb file). Recreate the code written in Part 2 in your new notebook. Save the notebook (.ipynb file) in the "assignment2" directory created in Part 2. The notebook code should store the combined text file in a directory under the "inet4031project" directory named "processedfiles2". Remember to use the command line to create this directory.

OK, so what I need done is Part 3: specifically, recreating the code from Part 2 in a notebook so that it does what Part 3 describes above. Please provide the text of the correct Python code to accomplish these tasks. Thank you.
If you have any questions, please ask. Again, I have the link for cloning the repo, so let me know if you want it, although you shouldn't need it to complete this: you can create your own directory with the same name, inet4031documentrepo, containing D1.txt, D2.txt, D3.txt, etc.

Step by Step Solution
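
One way the Part 3 notebook code might look is sketched below. It is a hedged example rather than the original Part 2 program, which is not shown in the question. The function name process_files, the UTF-8 encoding, and the choice to join the three files with a newline are all assumptions. It also assumes the "sourcefiles" and "processedfiles2" directories already exist, since the assignment asks for them to be created from the command line.

```python
from pathlib import Path

# The four stop words named in the assignment.
STOP_WORDS = {"a", "for", "how", "the"}


def process_files(source_dir, output_path, names=("D1.txt", "D2.txt", "D3.txt")):
    """Combine the named files, lowercase the text, drop stop words, and save it.

    Hypothetical helper: assumes UTF-8 text files and that the directory
    holding output_path already exists.
    """
    source_dir = Path(source_dir)
    output_path = Path(output_path)

    # Read each file in order; join with a newline so the last word of one
    # file does not run into the first word of the next.
    combined = "\n".join(
        (source_dir / name).read_text(encoding="utf-8") for name in names
    )

    # Work line by line so the original line breaks are preserved.
    cleaned_lines = []
    for line in combined.lower().splitlines():
        kept = [word for word in line.split() if word not in STOP_WORDS]
        cleaned_lines.append(" ".join(kept))

    result = "\n".join(cleaned_lines)
    output_path.write_text(result, encoding="utf-8")
    return result
```

In a notebook cell, the function could then be called with paths built from the project directory, for example process_files(project / "sourcefiles", project / "processedfiles2" / "combined.txt") where project = Path.home() / "Documents" / "inet4031project". The output file name combined.txt is a placeholder, since the assignment does not specify one.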
