Can I get help fixing, completing my python code Our program will open and read the urls contained in the file, and it will report back on the subset of urls that contain a reference to the specified topic I have included comments sources txt file http web archive org web 20180307004551 https foothill eduews http web archive org web 20151030182314 https www deanza eduews http web archive org web 20151030182406 http blogs sjsu eduewsroom http web archive org web 20151030182501 http ews stanford edu http invalidurlurlcs21a com http web archive org web 20151030182547 http ews berkeley edu http web archive org web 20151030182644 http www scu edu scunews http web archive org web 20151030172714 http ews ucsc edu http web archive org web 20151030183138 http www news ucsb edu http web archive org web 20151030183532 http ucsdnews ucsd edu http www deanza edu counseling documents Substitution 20Petition pdf EXAMPLE output artsummary txt file Source url http web archive org web 20151030182314 https www deanza eduews Euphrat Museum of Art Chain link fence art installation explores civil liberties issues Euphrat Museum of Art exhibition features two student projects Source url http web archive org web 20151030183138 http www news ucsb edu Recent acquisitions by the Art, Design Architecture Museum explore narratives of art and architecture art Test case 1 python aggregator py sources txt art The following error messages should be generated Error opening url http invalidurlurlcs21a com Error decoding url http www deanza edu counseling documents Substitution 20Petition pdf 'utf 8' codec can't decode byte 0xc4 in position 10 invalid continuation byte The output file (artsummary txt) should match the file artsummary txt Make sure you pick up references to Art and art and make sure you do NOT pick up the reference to arts Make sure you pick up the reference to Art when it is followed by punctuation as in Recent acquisitions by the Art, Design ImpLement a simple general purpose aggregator Usage aggregator py filename topic filename input file that contains a list of the online sources (urls) topic topic to be researched and reported on import urllib request import urllib error import re import sys Enter your function definitions here def getfiletopic) Check for correct number of arguments 3 arguments name, filename, topic print( number of arguments d len (sys argv)) for arg in sys argv print( command line argument s arg ) getfiletopic) def main) read command ine, flename, topic filenamesys argv 1 read file, line by line with open (filename) as f contentf readlines() for line in lines for each line, pull contents from web using urllib check contents for match to topic if match, write contents to file if name ' main ' main() ImpLement a simple general purpose aggregator Usage aggregator py filename topic filename input file that contains a list of the online sources (urls) topic topic to be researched and reported on import urllib request import urllib error import re import sys Enter your function definitions here def getfiletopic) Check for correct number of arguments 3 arguments name, filename, topic print( number of arguments d len (sys argv)) for arg in sys argv print( command line argument s arg ) getfiletopic) def main) read command ine, flename, topic filenamesys argv 1 read file, line by line with open (filename) as f contentf readlines() for line in lines for each line, pull contents from web using urllib check contents for match to topic if match, write contents to file if name ' main ' main()

The Answer is in the image, click to view ...

Question: Can I get help fixing, completing my python code? Our program will open and read the urls contained in the file, and it will report

Can I get help fixing, completing my python code?

Our program will open and read the urls contained in the file, and it will report back on the subset of urls that contain a reference to the specified topic.

I have included comments.

Can I get help fixing, completing my python code? Our program will

#------------------------------------------

sources.txt file:

http://web.archive.org/web/20180307004551/https://foothill.eduews/ http://web.archive.org/web/20151030182314/https://www.deanza.eduews/ http://web.archive.org/web/20151030182406/http://blogs.sjsu.eduewsroom/ http://web.archive.org/web/20151030182501/http:/ews.stanford.edu/ http://invalidurlurlcs21a.com/ http://web.archive.org/web/20151030182547/http:/ews.berkeley.edu/ http://web.archive.org/web/20151030182644/http://www.scu.edu/scunews/ http://web.archive.org/web/20151030172714/http:/ews.ucsc.edu/ http://web.archive.org/web/20151030183138/http://www.news.ucsb.edu/ http://web.archive.org/web/20151030183532/http://ucsdnews.ucsd.edu/ http://www.deanza.edu/counseling/documents/Substitution%20Petition.pdf

#------------------------------

EXAMPLE output

artsummary.txt file:

Source url:

http://web.archive.org/web/20151030182314/https://www.deanza.eduews/

Euphrat Museum of Art

Chain link fence art installation explores civil liberties issues

Euphrat Museum of Art exhibition features two student projects

Source url:

http://web.archive.org/web/20151030183138/http://www.news.ucsb.edu/

Recent acquisitions by the Art, Design & Architecture Museum explore

narratives of art and architecture

art

--------------

Test case 1:

python aggregator.py sources.txt art

The following error messages should be generated:

Error opening url: http://invalidurlurlcs21a.com/

Error decoding url: http://www.deanza.edu/counseling/documents/Substitution%20Petition.pdf 'utf-8' codec can't decode byte 0xc4 in position 10: invalid continuation byte

The output file (artsummary.txt) should match the file artsummary.txt open and read the urls contained in the file, and it will .

Make sure you pick up references to Art and art and make sure you do NOT pick up the reference to arts.

Make sure you pick up the reference to Art when it is followed by punctuation as in: Recent acquisitions by the Art, Design...

ImpLement a simple general purpose aggregator Usage: aggregator.py filename topic filename: input file that contains a list of the online sources (urls). topic: topic to be researched and reported on import urllib.request import urllib.error import re import sys # Enter your function definitions here def getfiletopic): # Check for correct number of arguments #3 arguments: name, filename, topic print( "number of arguments %d" len (sys.argv)) for arg in sys.argv: print( "command line argument: %s" % arg ) getfiletopic) def main): # read command ine, flename, topic filenamesys.argv[-1] # read file, line by line with open (filename) as f: contentf.readlines() for line in lines: # for each line, pull contents from web using urllib # check contents for match to topic # if match, write contents to file if-name?== ' main-' : main() ImpLement a simple general purpose aggregator Usage: aggregator.py filename topic filename: input file that contains a list of the online sources (urls). topic: topic to be researched and reported on import urllib.request import urllib.error import re import sys # Enter your function definitions here def getfiletopic): # Check for correct number of arguments #3 arguments: name, filename, topic print( "number of arguments %d" len (sys.argv)) for arg in sys.argv: print( "command line argument: %s" % arg ) getfiletopic) def main): # read command ine, flename, topic filenamesys.argv[-1] # read file, line by line with open (filename) as f: contentf.readlines() for line in lines: # for each line, pull contents from web using urllib # check contents for match to topic # if match, write contents to file if-name?== ' main-' : main()

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Your mission in this assignment is to write a simple text-based adventure game in the tradition of Will Crowthers pioneering Adventure program of the early 1970s. In games of this sort, the player...

-----Lab9-start.py# Open and read health data file one line at a time # Columns are # disease,increase,location,number,population,year file = open("health-no-head-sample.csv", "r") # Process each...

I need help with a python program. All the instructions are going to be provided in order to be able to do this program. Note: It is important to mention that we have just covered while loop, for...

I need help with a python program. All the instructions are going to be provided in order to be able to create this program. Note: It is important to mention that we have just covered while loop, for...

Regex, urllib Python Project Files: https://drive.google.com/drive/folders/1n_B7qjez_fGbf6xOq9841v1NdbxyNqVN?usp=sharing ------------------------ Your task is to implement a simplified general...

Python Project Files: https://drive.google.com/drive/folders/1n_B7qjez_fGbf6xOq9841v1NdbxyNqVN?usp=sharing Your task is to implement a simplified general purpose aggregator. An aggregator is a...

Python Project Files: https://drive.google.com/drive/folders/1n_B7qjez_fGbf6xOq9841v1NdbxyNqVN?usp=sharing ------------------------------- Your task is to implement a simplified general purpose...

l. Introduction In this programming assignment, you will create a front end that is, a scanner and a parserfor the intermediate representation, ILOC, that will be used as input in the next two...

this is a python program please can anyone help me thank you Introduction In problem set 5, you will build a program to monitor news feeds over the Internet. Your program will filter the news,...

I have to create a program in C and I can't figure it out. The program has to read a source file. Please help. /******************************************************************** PROJECT: Glossary...

Mr Jones and Mr King are two neighbouring farmers and the only two users of a dirt track connecting their farms to the main road. The quality of this dirt track will affect both farmers' travel times...

Blackstone Inc. manufactures western boots and saddles. The company is considering replacing an outmoded leather processing machine with a new, more efficient model. The old machine was purchased for...

Two coins, one-rupee and two-rupee coins, are tossed once. Find the sample space.

Seved Help 14 Wisconsin Snowmobile Corp. is considering a switch to level production Cost efficiencies would occur under level production, and aftertax costs would decline by $31,500, but inventory...

2. How does Dare to Differentiate relate to succession planning? What role does assessment play in differentiate between employees?

1. One of Bill Conatys tips for developing leadership is Be Inclusive. Based on what was discussed in Chapters 7, 8, 9, and 10, what does Be Inclusive mean to you?

3. How can employees and their managers determine whether they are interested and qualified for leadership positions? Source: D. Brady, Secrets of an HR Superstar, Business Week (April 9, 2007)....