Question: Declare a struct TokenFreq that consists of two data members: (1) string value ; and (2) int freq; Obviously, an object of this struct will

Declare a struct TokenFreq that consists of two data members: (1) string value; and (2) int freq; Obviously, an object of this struct will be used to store a specific token and its frequency. For example, the following object word stores the token "dream" and its frequency 100:

TokenFreq word;

word.value="dream";

word.freq=100;

Remember to declare this struct at the beginning of your program and outside any function. A good place would be right after the "using namespace std;" line. This way, all the functions in your program will be able to use this struct to declare variables.

Implement the function vector getTokenFreq( string inFile_name); This function reads the specified input file line by line, identifies all the unique tokens in the file and the frequency of each token. It stores all the identified (token, freq) pairs in a vector and returns this vector to the calling function. Don't forget to close the file before exiting the function. In this homework, these tokens are case insensitive. For example, "Hello" and "hello" are considered to be the same token.

Implement the selection sort algorithm to sort a vector in ascending order of token frequency. The pseudo code of the selection algorithm can be found at http://www.algolist.net/Algorithms/Sorting/Selection_sort You can also watch an animation of the sorting process at http://visualgo.net/sorting -->under "select". This function has the following prototype:

void selectionSort( vector & tokFreqVector ); This function receives a vector of TokenFreq objects by reference and applies the selections sort algorithm to sort this vector in increasing order of token frequencies.

Implement the insertion sort algorithm to sort a vector in descending order of token frequency. The pseudo code of the selection algorithm can be found at http://www.algolist.net/Algorithms/Sorting/Insertion_sort Use the same link above to watch an animation of this algorithm. This function has the following prototype:

void insertionSort( vector & tokFreqVector );

Implement the void writeToFile( vector &tokFreqV, string outFileName); function. This function receives a vector of TokenFreq objects and writes each token and its frequency on a separate line in the specified output file.

Implement the int main() function to contain the following features: (1) asks the enduser of your program to specify the name of the input file, (2) ) call the getTokenFreq() to identify each unique token and its frequency, (3) call your selection sort and insertion sort functions to sort the vector of TokenFreq objects assembled in (2); and (4) call the WriteToFile() function to print out the sorted vectors in two separate files, one in ascending order and the other in descending order.

Example input and outputs:

Assume that your input file contains the following paragraph: "And no, I'm not a walking C++ dictionary. I do not keep every technical detail in my head at all times. If I did that, I would be a much poorer programmer. I do keep the main points straight in my head most of the time, and I do know where to find the details when I need them. by Bjarne Stroustrup"

After having called the getTokenFreq() function, you should identify the following list of (token, freq) pairs and store them in a vector (note that the order might be different from yours): {'no,': 1, 'and': 1, 'walking': 1, 'be': 1, 'dictionary.': 1, 'Bjarne': 1, 'all': 1, 'need': 1, 'Stroustrup': 1, 'at': 1, 'times.': 1, 'in': 2, 'programmer.': 1, 'where': 1, 'find': 1, 'that,': 1, 'would': 1, 'when': 1, 'detail': 1, 'time,': 1, 'to': 1, 'much': 1, 'details': 1, 'main': 1, 'do': 3, 'head': 2, 'I': 6, 'C++': 1, 'poorer': 1, 'most': 1, 'every': 1, 'a': 2, 'not': 2, "I'm": 1, 'by': 1, 'And': 1, 'did': 1, 'of': 1, 'straight': 1, 'know': 1, 'keep': 2, 'technical': 1, 'points': 1, 'them.': 1, 'the': 3, 'my': 2, 'If': 1}

After having called the selectionSort() function, the sorted vector of token-freq pairs will contain the following information (again, the tokens of the same frequency might appear in different order from yours) : [('no,', 1), ('and', 1), ('walking', 1), ('be', 1), ('dictionary.', 1), ('Bjarne', 1), ('all', 1), ('need', 1), ('Stroustrup', 1), ('at', 1), ('times.', 1), ('programmer.', 1), ('where', 1), ('find', 1), ('that,', 1), ('would', 1), ('when', 1), ('detail', 1), ('time,', 1), ('to', 1), ('much', 1), ('details', 1), ('main', 1), ('C++', 1), ('poorer', 1), ('most', 1), ('every', 1), ("I'm", 1), ('by', 1), ('And', 1), ('did', 1), ('of', 1), ('straight', 1), ('know', 1), ('technical', 1), ('points', 1), ('them.', 1), ('If', 1), ('in', 2), ('head', 2), ('a', 2), ('not', 2), ('keep', 2), ('my', 2), ('do', 3), ('the', 3), ('I', 6)]

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

My current code does not account for multiple white spaces and needs to reduce anytime there is more than one white space back to one. (Currently when reading a string with two white spaces in a row...

Can anyone help me write this C++ program ? The main goal of this program is to count and sort all the unique tokens in a file by applying the following knowledge points: reading and writing a text...

4.22 Coding lab: 2-dimensional arrays, a string tokenizer, insertion sort, and selection sort Important, please read! Make sure you use the specified struct name and function prototypes, as they will...

(NOTE: THE PROGRAM SHOULD BE WRITTEN IN C++) Important, please read! Make sure you use the specified struct name and function prototypes, as they will be referred to as such in the unit tests. Please...

THIS IS NEEDED IN C++ 2-dimensional arrays, a string tokenizer, insertion sort, and selection sort Important, please read! Make sure you use the specified struct name and function prototypes, as they...

The main objectives of this lab include: set up a 2d array (or matrix) with proper initial values using vector of vectors given a string, implement a tokenizer to identify all the unique tokens...

Make sure you use the specified struct name and function prototypes, as they will be referred to as such in the unit tests. Please feel free to introduce additional subroutines (i.e., functions) to...

C++ Question: we need to get the words/speech from .txt file Steve Jobs delivered a touching and inspiring speech at Stanford's 2005 commencement. The transcript of this speech is attached at the end...

1. Implement the following function to create a matrix of dimensionality numRows x numCols , where matrix starts with an initial size of 0. Furthermore, initialize the value at matrix[i][j] to the...

Determine the root sensitivity for the dominant roots of the design for Problem P7.18 for the gain K = 4/ and the pole s = -2.

What are the six explicit functions of the Fed?

Suppose a seven - year, $ 1 , 0 0 0 bond with a coupon rate of 7 . 6 % and semiannual coupons is trading with a yield to maturity of 6 . 2 9 % . a . Is this bond currently trading at a discount, at...

Hello! I need help with these 3 quick questions. Thank you in advance! 1) Which of the following needs to be adjusted, if a restatement of a prior period financial statement is necessary? Group of...

3. In terms of your career, do you think you are or will mostly be a. fixed in place? b. always moving? _______

What did they do? What did they say?

How did you feel about the change? Were you an early convert to the new way of operating or one of the last to give in?