Question:

1. Project Definition

You are expected to write a C++ console application that reads and counts the unique words used in documents (articles, chapters, and books about Applied Sciences, Mathematics, and Information Science published from 1900 to 2021) and finds the top 10 most frequent words used in these documents.

The Articles Dataset.txt file contains all the metadata of the documents. The unigramCount field contains all unique words and their number of occurrences for each document. There are 1500 publications recorded in the txt file. Find the total frequency of every unigram across all publications and print the top 10 most frequent words in these documents.

Here is an example entry for a document:

{"creator":["Romain Allais","Julie Gobert"], "datePublished":"2018-05-30", "docType":"article", "doi":"10.1051/mattech/2018010", "id":"ark://27927/phz1Ohn2bh3", "isPartOf":"Mat\u00e9riaux & Techniques", "issueNumber":"5-6", "language":["eng"], "outputFormat":["unigram","bigram","trigram"], "pageCount":7, "pagination":"pp. null-null", "provider":"portico", "publicationYear":2018, "publisher":"EDP Sciences", "sequence":3.0, "tdmCategory":["Applied Sciences - Engineering"], "title":"Environmental assessment of PSS", "url":"http://doi.org/10.1051/mattech/2018010", "volumeNumber":"105", "WordCount":4446, "unigramCount":{"others":1,"air":1,"networks,":1,"conventional":1,"IEEE":1}}

Your program must pull out the unigram counts for each document and store them in a suitable data structure.

1. Run your code in Release mode with full optimization enabled to get the result quickly. (Your code must run in Release mode without crashing or any other problem.)
2. Test your code in Visual Studio (any version is OK). All projects will be run in Visual Studio for evaluation. Make sure your project has no compiler-dependent problems.
3. A struct/class definition for a word will be useful for storing the word and its count together in the data structure you implement.
