Question: Preface This lab is based on Learn By Doing 1 3 . 1 0 Exercise 1 and a famous nifty assignment on authorship detection. In

Preface

This lab is based on Learn By Doing

13.10

Exercise

1

and a famous

nifty assignment

on authorship detection. In this part of the lab, you will create and store an array of AuthorStats structs on the heap, and use them to find the author who most likely wrote the mystery file.

Purpose

This lab is an opportunity for you to practice your skills in

Using header files

Creating an array of Structs on the heap

Writing a simple optimization algorithm

file processing

You will write this lab building on lab

1 .

Data

authorStats.txt Download authorStats.txt

Minimize File Preview

Jane Austen

4.41553119311 0.0563451817574 0.02229943808

Lewis Carroll

4.22709528497 0.111591342227 0.0537026953444

Charles Dickens

4.34760725241 0.0803220950584 0.0390662700499

Agatha Christie

4.40212537354 0.103719383127 0.0534892315963

Brothers Grimm

3.96868608302 0.0529378997714 0.0208217283571

Tasks

Part

1

Create a copy of your lab

1

project and call it lab

3 .

If you haven

t downloaded all five mystery files and the author stats file, please download them now. Create a

.

h file, and in that file create a struct to store author stats including first and last name, average word length, type token ratio, and hapax legomena ratio. Cut and past the define, include, and using statements from lab

1 .

cpp into the new

.

h file. Add a method header for a method with the following signature:

int loadAuthorStats

(

AuthorStats

* *

authorInfo, int & numAuthors

)

;

Include the new

.

h file in your

.

cpp file.

Part

2

Create an array of pointers to pointers of AuthorStats objects, an integer to track the number of authors

(100

max

),

and write the line of code to call load Author Stats. Write the load AuthorStats function. This function should allocate space on the heap for the authorstats, and copy from the file authorstats.txt to the array of structures on the heap. The method should require approximately

30

lines of code. You are encouraged to use struct syntax including object

- >

property instead of

(*

object

) .

property.

Part

3

Your goal is to find the author from your author stats list with the stats most similar to the stats of the mystery file. In this algorithm, you will be looking for a similarity that is the minimal difference. Because the numbers for averageWordLength are quite a bit bigger than the other stats, we will be weighting the numbers so that all can contribute to the similarity score. You will need to take the absolute value of the difference of each author

s stats and the stats of the text. You will weight the averageWordLength by multiplying it by

11,

the typeTokenRatio by multiplying it by

33,

and the HapaxLegoMana ratio by multiplying it by

50 .

For mystery

1,

with Jane Austen, the similarity score should be

2.39 (((4.41 - 4.24) * 11) + ((. 056 - . 046) * 33) + ((. 022 - . 018) * 50)) .

You should print a sentence of the form

This text was most likely written by

.

You do not need to print the similarity score. This should be about

20

lines of code. You are STRONGLY encouraged to deallocate memory that you have allocated, but I am NOT going to double check with the debug flags for this lab.

(

Estimated to be about

10

lines of code.

)

You are also STRONGLY encouraged to run the lab with all

5

mystery texts

-

which texts lead to correct

/

incorrect author identifications? Again, I will NOT deduct for not running with all

5

texts. You must be able to run for mystery

1

and one other file of your choice.

Submission and Criteria

Create a Zip file with your

.

,

your

.

cpp

,

and a READ

_

.

txt that lists the fileNames of the mystery files you tested, the predicted author by your algorithm, and the actual author. Submit your

.

zip file to Canvas.

Attendance and submission is worth

1 %

of your final grade. Each part is worth

1 %

of your final grade.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Programming Questions!

"DISCUSS THE EFFECTS OF COVID 19 ON LABOR MARKETS AND THE ECONOMY" Provide an introduction and the background of your study, and clearly state what your research question or objective is. What real...

See page 129- 137 on attachment for more details there are five steps to the project. Step 1: Create the loan amortization schedule for the property. Step 2: Create the depreciation schedule. Step 3:...

KINGS OWN INSTITUTE* Success in Higher Education ICT106 DATA COMMUNICATIONS AND NETWORKS T223 Page 1 of 18 AUSTRALIAN INSTITUTE OF BUSINESS AND MANAGEMENT PTY LTD ABN: 72 132 629 979 CRICOS 03171A...

\f \f11TH EDITION STRATEGIC MANAGEMENT THEORY 11TH EDITION Strategic Management THEORY Charles W. L. Hill University of Washington - Foster School of Business Gareth R. Jones Melissa A. Schilling New...

2006 National Institute of Standards and Technology Technology Administration Department of Commerce Baldrige National Quality Program Arroyo Fresco Community Health Center Case Study 2006 National...

Table of Contents Introduction. Hypothesis. Methods ..5 148 194714) Results.. Table I Western Governor Township Race by Family History of Heart Disease. Table 3 Analysis of Variance Difference in...

Hands-on Exercise Ex3: Database Transactions 1 Connect to your MySQL Container Make sure your device is connected to the campus network by being on campus connected to the wired/wifi or if remote by...

%% Lab 2 - Your Name - MAT 275 Lab %% Example code % Example 1 % NOTE: Delete examples before submission. A = [1 0; 0 -1] A = [1, 0; 0, -1] % NOTE: The two matrices above are the same. We can...

1 DE MATH 127 : Calculus 1 for the Sciences Instructor: B. Forrest INDEPENDENT TERM PROJECT Weight: 10% Due: RECEIVED at the Centre for Extended Learning (CEL) Office or UW Campus Drop Box before...

1:26 . LTE 44 9 Physics lab data 10 vx, Object #1 Run # 1 9 8 7 6 x-Velocity, Object # 1 (m/'s) 5 4 W Linear mt + b m = -0.331 + 0.20 N b = 2.10 + 0.13 r = -0.320 1 O -2 -1 0 1 2 3 4 5 6 7 8 Time (s...

Product A Product B Estimated sales (units) 5,000 2,000 Selling price per unit $ 8.00 $ 15.00 Purchase price per unit $ 2.00 $ 6.00 Freight in per unit $ 1.00 $ 1.00 The total production overhead is...

In ordinary circumstances, when the corporate veil has not been pierced, a shareholder may be liable for: a portion of corporate fines for environmental violations. a portion of the settlement in a...

Velasquez Corporation has a debt - equity ratio of . 7 5 . The company is considering a new plant that will cost $ 5 0 million to build. When the company issues new equity, it incurs a flotation cost...

5 (20 minutes) Sanchev Company runs two convenient stores, one in Vancouver and one in Surrey Operating income for each store in 2021 is detailed below. Revenues Expenses Cost of goods sold Vancouver...