Question: In java Description: In this lab, you will gain some experience with file I/O, text parsing, and URL connections. You will build an application that

In java

Description:

In this lab, you will gain some experience with file I/O, text parsing, and URL connections. You will build an application that provides users with a guided web browsing capability that searches files for user-specified keywords.

A typical search engine reads many files off the web and saves information about them in a database that is used to answer the search queries posed by users. However, your application will not do any prefetching of data. Instead, it will search files in response to user requests, as described in the specification given below.

Specification:

User enters the specific URL such as http://www.bbc.com/ and search word to search for on the command line.

The program opens an URLConnection for the given URL. Your program should parse the file in order to display the following information:

The number of occurrences of the user-specified word in the HTML file

The URLs for all the links to other HTML files that are given in the user-selected file (things of the form href="xxxxx"), along with the number of occurances of he keyword in each. To do this, open a URL connectin for each of HTML links and parse the file, counting the number of times the keywird ouccurs. You have to display all the URLs that were parsed, sorted by the number of occurrences of the keyword, in decreasing order, omitting files that don't contain the keyword at all. For each URL, displau the URL for the file, followed by the number of occurrences of the keyword in parentheses.

After each search, use fileOutputStream to save the result of the application to a file called "searchdata.www", overwriting the data from the previous search.

Assumptions:

All of the actual URL files will end with the .html suffix. However, the link names may not show the suffix explictly. If the link ends with a /, append the string index.html before processing. If a link does not end with a / and also does not end with .html, append the string /index.html before processing.

If you are currently looking at a page whose URL is http://www.example.com/abc/nonsense.html, then the path http://www.example.com/abc/ is considered to be currently directory URL.

If a link does not begin with http:// then it is a relative link, meaning that you should prefix it with the current directory URL before processing. For more information regarding absoulte/relative link, please refer to http://www.scriptingok.com/tutorial/HTML-links-2

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

I have to create a program in C and I can't figure it out. The program has to read a source file. Please help. /******************************************************************** PROJECT: Glossary...

In this write up, the following should be covered: What is the primary issue facing the firm? What are the secondary issues facing the firm? What is the most appropriate action for the firm? What...

Please help with this. Text below BCS 101 - Programming Concepts and Problem Solving This course will provide an introduction to programming logic and problem solving techniques using different...

Can someone please help me make this run? I don't understand why it is not //AssignReadString public class AssignReadString { } import java.awt.image.BufferedImageFilter; import...

Can someone help make this java file run? The code is probable the main issue I am having //AssignReadString public class AssignReadString { } import java.awt.image.BufferedImageFilter; import...

PROJECT SCOPE [Instructions for what to include in this section: Define the scope of work that will be undertaken to provide the deliverable(s) mentioned in the Project Charter (PC). Craft this...

Predictive text entry systems are familiar on touch screens and mobile phones. This question asks you to consider how the same principles might be used in a programming editor for creating Java code....

Java protests instead of sending messages as message.Characterize, in a programming language documentation of your choice, a recursive drop parser that will foster the hypothetical sentence structure...

5-407-753 MARK JEFFERY Air France Internet Marketing: Optimizing Google, Yahoo!, MSN, and Kayak Sponsored Search Rob Griffin, senior vice president and U.S. director of search for Media Contacts, a...

Navigation in SAP Systems Introduction to Navigation in SAP solutions on the basis of SAP ERP Product SAP ERP 6.08 Global Bike Level Beginner Focus Navigation Authors Babett Koch Stefan Weidner...

You purchase a bond with an invoice price of $1,027. The bond has a coupon rate of 6.8 percent, and there are four months to the next semiannual coupon date. What is the clean price of the bond?

(a) Show that a ring has only one zero element. [Hint: If there were more than one, how many solutions would the equation OR+ x = 0Rhave?] (b) Show that a ring Rwith identity has only one identity...

Concerning incremental project cash flow, which of these is a cost one would bever count as an expense of the project? Intial investment, financial costs, operating expenses of the prject, taxes paid

(Thousands of Dollars) Assets Cash Accounts receivable (net) Inventory Dec. 31, 2016 Dec. 31, 2015 $18,300 41,000 39,500 98,800 52,600 15,600 167,000 $18,000 36,000 43,700 97,700 50,500 13,800...

7. Remind employees about the process periodically, such as once a year, so that they know it is operational and effective.

6. Notify employees of the change. Announce the change to an online handbook in a way that ensures all employees will know about the new format. Consider a mandatory sign-on within the transition...

3. Research the various ADR options to determine which ones are the best fit for the organization. For example, peer review is most successful when there is a high level of trust within the workforce.