Doc-id house for sale in Geelong Melbourne 39 11 32 22 22 19 19 3 15...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Doc-id house for sale in Geelong Melbourne 39 11 32 22 22 19 19 3 15 19 20 1 3 12 20 14 1 1 2 3 4 16 21 13 4 21 9 13 (houses OR for OR sale OR in OR Geelong OR Melbourne) (houses AND for AND sale AND in AND Geelong OR Melbourne) Suppose these are issued to a search engine that uses the ranked Boolean retrieval model. Assume, for simplicity, only four documents in the collection (with document ids 1-4). Answer the following questions. The above table gives the number of times each query-term occurs in each document. (i) Compute the document scores and the ranking associated with the query (houses OR for OR sale OR in OR Geelong OR Melbourne). (ii) How is the ranking produced probably sub-optimal and why does this happen? (iii) Compute the document scores and the ranking associated with the query (houses AND for AND sale AND in AND Geelong OR Melbourne). (iv) How is the ranking produced probably sub-optimal and why does this happen? (v) How would you extend the Boolean retrieval model to handle AND NOT constraints (e.g., houses AND NOT Geelong)? Your proposed solution should give a higher score to documents that contain fewer occurrences of the term to the right of the AND NOT (e.g., Geelong). Please be as mathematical as possible. In other words, saying: "I would reduce the score for documents that contain the word to the right of AND NOT." is too vague. (vi) Using the index, what would be the Boolean retrieval model scores given to documents 1-4 by your proposed scoring method for the query "houses AND NOT Geelong"? Doc-id house for sale in Geelong Melbourne 39 11 32 22 22 19 19 3 15 19 20 1 3 12 20 14 1 1 2 3 4 16 21 13 4 21 9 13 (houses OR for OR sale OR in OR Geelong OR Melbourne) (houses AND for AND sale AND in AND Geelong OR Melbourne) Suppose these are issued to a search engine that uses the ranked Boolean retrieval model. Assume, for simplicity, only four documents in the collection (with document ids 1-4). Answer the following questions. The above table gives the number of times each query-term occurs in each document. (i) Compute the document scores and the ranking associated with the query (houses OR for OR sale OR in OR Geelong OR Melbourne). (ii) How is the ranking produced probably sub-optimal and why does this happen? (iii) Compute the document scores and the ranking associated with the query (houses AND for AND sale AND in AND Geelong OR Melbourne). (iv) How is the ranking produced probably sub-optimal and why does this happen? (v) How would you extend the Boolean retrieval model to handle AND NOT constraints (e.g., houses AND NOT Geelong)? Your proposed solution should give a higher score to documents that contain fewer occurrences of the term to the right of the AND NOT (e.g., Geelong). Please be as mathematical as possible. In other words, saying: "I would reduce the score for documents that contain the word to the right of AND NOT." is too vague. (vi) Using the index, what would be the Boolean retrieval model scores given to documents 1-4 by your proposed scoring method for the query "houses AND NOT Geelong"?
Expert Answer:
Answer rating: 100% (QA)
i The document scores are Doc 1 1 1 1 0 1 1 6 Doc 2 1 1 0 1 0 0 3 Doc 3 1 0 1 1 1 0 5 Doc 4 0 1 1 0 ... View the full answer
Related Book For
Federal Taxation 2016 Comprehensive
ISBN: 9780134104379
29th edition
Authors: Thomas R. Pope, Timothy J. Rupert, Kenneth E. Anderson
Posted Date:
Students also viewed these accounting questions
-
Lisa has a $25,000 basis in her partnership interest before receiving a current distribution of $4,000 cash and land with a $30,000 FMV and a $14,000 basis to the partnership. Assume that any...
-
LMN Inc. has granted permission to another party to use LMNs mark in any manner it chooses. What are the dangers of such a permission, and what type of permission has been granted?
-
Nick sells live Christmas trees each year beginning in late November. He needs to place an order for the Douglas fir variety in early fall from the tree farm. Nick is deciding whether to order 100,...
-
You are given two planes in parametric form, x1 x2 1 x3 where x1, x2, 3, , 2, 1,42 R. Let I be the line of intersection of II and II2. a. Find vectors n and no that are normals to II and II 2 must...
-
Consider two brothers, Eddy and Larry, who, despite growing up in the same household, have grown quite different personalities. A: Eddy is known to his friends as steady Eddy he likes predictability...
-
Boyne University offers an extensive continuing education program in many cities throughout the state. For the convenience of its faculty and administrative staff and to save costs, the university...
-
Conduct a competitive analysis by collecting information on the product specifications of one of the DAA products from each of the companies identified in Exercise 26.1. Specifically, the data should...
-
Using the information provided in Exercise 2.8, prepare a statement of owners equity for the month of September and a balance sheet for Perez Investment Services as of September 30, 2016. In Exercise...
-
Shankar Company uses a perpetual system to record inventory transactions. The company purchases inventory on account on February 2 for $35,000 and then sells this inventory on account on March 17 for...
-
What Are Empirical/Research Articles? Describe the resource assigned to you including two examples of how this resource will support your successful completion of the capstone project?
-
9. Let x...,x) be linearly independent solutions of x' = P(t)x, where P is continuous on %3D
-
Use the substitution method to solve the system of equations. 2y+4x=18 2y-3x=4
-
From the trial balance and the additional information provided, record the adjusting entries in the journal and prepare an interim statement of profit or loss for three months ending July 31st, 2023....
-
Selma operates a contractor's supply store. She maintains her books using the cash method. At the end of 2023, her accountant computes her accrual basis income that is used on her tax return. For...
-
The pH scale for acidity is defined by pH=-log[H+]. Where H+ is the concentration of hydrogen ions measured in moles per liter. 1. Calculate the concentration of hydrogen ions in moles per liter for...
-
Carla Vista Ltd. provides a defined contribution pension plan for its employees. Currently, the company has 47 full-time and 52 part- time employees. The pension plan requires the company to make an...
-
There ar 3 obstacles to decision making: Confirmation Bias/ Counterproductive Heuristics/ Overconfidence. Please construct a scenario either personal or fictitious in which these obstacles might pose...
-
Making use of the tables of atomic masses, find the velocity with which the products of the reaction B10 (n, ) Li7 come apart; the reaction proceeds via interaction of very slow neutrons with...
-
Woodland Corporation reports the following financial accounting results and other depreciation information for the current year: Sales revenue .. $ 2,000,000 Plus: Interest income on municipal bonds...
-
Taxpayers who deduct an expense one year but recover it the next year are required to include the recovered amount in gross income. The tax benefit rule provides relief if the original deduction did...
-
White Corporation has 100 shares of stock outstanding. Ann owns 40 of these shares, and unrelated individuals own the remaining 60 shares. White redeems 30 of Anns shares for $30,000. In the year of...
-
You are an analyst following a medium-sized country that has a fixed exchange rate of 5 dinars per U.S. dollar (with a band of 2 percent above and below this par value). During March of last year you...
-
Assume instead that the spot exchange rate between the dollar and Swiss franc is a fixed or pegged rate within a narrow band around a central rate. For each change shown in Problem 9, assume that...
-
It is best for a country never to borrow from foreign lenders. Do you agree or disagree? Why?
Study smarter with the SolutionInn App