Question: the table: 2. Indexing Models and Term Weighting Consider the following document-term table containing raw term frequencies. Answer the following questions, and in each case

 the table: 2. Indexing Models and Term Weighting Consider the following

the table:

document-term table containing raw term frequencies. Answer the following questions, and in

2. Indexing Models and Term Weighting Consider the following document-term table containing raw term frequencies. Answer the following questions, and in each case give the formulas you used to perform the necessary computations. Note: you should not do these computations manually. You may use a spreadsheet program such as Microsoft Excel, or you can considering writing your own program do the computations. In either case, include your spreadsheet or code in your assignment submissions. Download the table below as an Excel Spreadsheet] Terml Term2 Term3 Term4 Terms Termo Term7 Term8 0 0 2 1 @ 300 3 4 0 0 Hin DOC1 0 3 1 DOC2 53 DOC3 3 0 4 DOC4 1 DOC5 @ DOC6 2 DOCZ 250 DOC8 3 3 0 DOC9 0 0 3 DOC10 1 0 5 2 WNANO ++ oHo 3 0 1 2 0 0 3 3 0 0 0 24 0 2. a. Compute the new weights for all the terms in document DOC4 using the if x idf approach. b. Compute the new weights for all the terms in documents DOC4 using the signal-to-noise ratio approach. c. Using the Keyword Discrimination approach, determine if Term4 is a good index term or not (by computing it's discriminant). To compute average similarities use Cosine similarity as your similarity measure. Show your work. A I C Term2 D Term3 E Term4 F Term5 G Term6 H Term7 Term8 2 B Term1 0 5 3 1 0 0 0 8 4 0 4 0 DOC1 DOC2 DOC3 DOC4 DOC5 DOC6 DOC7 DOC8 DOC9 DOC10 0 2 mmomno 10 3 0 1 11 12

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!