Question: The following code counts how often different words occur in a text file. In this program, bstmap has an interface and behavior identical to a std::map.
char toLC (char c)
{
    return (c >= 'A' && c <= 'Z') ? c - 'A' + 'a' : c;
}

void countWordsInFile (istream& inFile, bstmap& counts)
{
    string word;
    while (inFile >> word) {
        // strip away any non-alphabetic characters
        string::size_type n = word.find_first_not_of (wordCharacters);
        if (n != string::npos)
            word = word.substr (0, n);
        if (word.length() > 0) {
            // if there's anything left, lower-case and count it
            transform (word.begin(), word.end(), word.begin(), toLC);
            /** ... increment the appropriate counter in counts ... **/
            bstmap::iterator pos = counts.find (word);
            if (pos == counts.end())
                // this is the 1st time we've seen this word
                counts.insert (bstmap::value_type(word, 1));
            else
                // we've seen this word before
                ++((*pos).second);
        }
    }
}

Let W denote the number of words in the document file being processed. Many of these words, in a typical document, will be duplicates. Let D denote the number of distinct (non-duplicate) words, D <= W. A study of several large text files has suggested that, for sufficiently large W, authors will use an average vocabulary of D = c*log(W) distinct words, where c is some constant. Assume the length of any individual word is bounded by some constant (e.g., no single word is longer than "antidisestablishmentarianism").

What is the average-case complexity of countWordsInFile?
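Since bstmap's interface is stated to match std::map's, the counting idiom above can be sketched as a self-contained program using std::map directly. This is an illustrative sketch, not the original program: the function name countWords, the use of a string stream in place of a file, and the inline alphabet string (standing in for wordCharacters, which the original defines elsewhere) are all assumptions made for the example.

```cpp
#include <algorithm>
#include <cctype>
#include <map>
#include <sstream>
#include <string>

// Hypothetical stand-in for countWordsInFile: reads words from any
// istream and tallies them in a std::map (same interface as bstmap).
std::map<std::string, int> countWords(std::istream& in) {
    std::map<std::string, int> counts;
    std::string word;
    while (in >> word) {
        // keep only the leading alphabetic prefix (stands in for
        // the original's find_first_not_of(wordCharacters) step)
        std::string::size_type n = word.find_first_not_of(
            "abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ");
        if (n != std::string::npos)
            word = word.substr(0, n);
        // lower-case in place, as toLC does in the original
        std::transform(word.begin(), word.end(), word.begin(),
                       [](unsigned char c) { return std::tolower(c); });
        if (!word.empty()) {
            // one search of a tree holding at most D distinct words
            std::map<std::string, int>::iterator pos = counts.find(word);
            if (pos == counts.end())
                counts.insert(std::map<std::string, int>::value_type(word, 1));
            else
                ++pos->second;   // seen before: bump the counter
        }
    }
    return counts;
}
```

Each of the W words triggers one find (and possibly one insert) in a tree that never holds more than D entries, which is exactly the per-word cost the complexity question asks you to account for.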
