Question: Python 3: Develop a crawler that collects the email addresses in the visited web pages. You can use function emails() from Problem 11.22 (below) to

Python 3: Develop a crawler that collects the email addresses in the visited web pages. You can use function emails() from Problem 11.22 (below) to find email addresses in a web page. Design it so that the crawler only follows links hosted on the same host as the starting web page.

The emails() function mentioned above is this:

def emails(page):

email_addresses = re.findall(r'[\w\.-]+@[\w\.-]+', page)

print(email_addresses)

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!