Question: 2. Problem solving: Positional indexes (3 Points! 1 Point for solving each sub-problem) Consider the following documents: Doc 1: today you are you, that is

2. Problem solving: Positional indexes (3 Points! 1 Point for solving each sub-problem) Consider the following documents: Doc 1: today you are you, that is truer than true. Doc 2: you have brains in your head. you have feet in your shoes. you can steer yourself any direction you choose, you are on your own, and you know what you know. and you are the one who'll decide where to go. 1). Positional indexes are very useful to search against documents. Let's build positional indexes based on these documents using the format DocID: ; ..., for example, the positional indexes for the words "are" and "those" are as follows. are: 1: ; 2: today: 1: No need to consider any type of normalization of tokens, in addition, the punctuation marks should be stripped off from words and ignored when you count tokens. Now please show the positional indexes for the words "you", "head" and "feet". you: head: feet: 2). A phrase query "word1 word2" retrieves documents where word1 is immediately followed by word2, A /k query "word1 /k word2" (k is a positive integer) retrieves documents where word1 occurs within k words of word2 on either side. For example, k=1 demands that word1 be adjacent to word2, but word1 may come either before or after word2. For the following queries, return all the docs and corresponding positions (phrase starting positions) for which the query conditions are met. If no document meets the criteria, return none. "you are you" "head feet" "you /2 you" 3). Let's say we want to find documents in which "you /2 you, and the two words "you" and "you" are in the same sentence. This condition only applies to document 1. How would you modify the positional index to support queries that demand the terms to be in the same sentence? You can assume that the parsing step is able to identify the sentences in a document. Please write down an example of the modified postings list for the words "you". you: 2. Problem solving: Positional indexes (3 Points! 1 Point for solving each sub-problem) Consider the following documents: Doc 1: today you are you, that is truer than true. Doc 2: you have brains in your head. you have feet in your shoes. you can steer yourself any direction you choose, you are on your own, and you know what you know. and you are the one who'll decide where to go. 1). Positional indexes are very useful to search against documents. Let's build positional indexes based on these documents using the format DocID: ; ..., for example, the positional indexes for the words "are" and "those" are as follows. are: 1: ; 2: today: 1: No need to consider any type of normalization of tokens, in addition, the punctuation marks should be stripped off from words and ignored when you count tokens. Now please show the positional indexes for the words "you", "head" and "feet". you: head: feet: 2). A phrase query "word1 word2" retrieves documents where word1 is immediately followed by word2, A /k query "word1 /k word2" (k is a positive integer) retrieves documents where word1 occurs within k words of word2 on either side. For example, k=1 demands that word1 be adjacent to word2, but word1 may come either before or after word2. For the following queries, return all the docs and corresponding positions (phrase starting positions) for which the query conditions are met. If no document meets the criteria, return none. "you are you" "head feet" "you /2 you" 3). Let's say we want to find documents in which "you /2 you, and the two words "you" and "you" are in the same sentence. This condition only applies to document 1. How would you modify the positional index to support queries that demand the terms to be in the same sentence? You can assume that the parsing step is able to identify the sentences in a document. Please write down an example of the modified postings list for the words "you". you

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

2. Problem solving: Positional indexes (3 Points! 1 Point for solving each sub-problem) Consider the following documents: Doc 1: Today you are you. That is truer than true. Doc 2: Be who you are. Say...

Distributed Systems Einstein has established that there is no universal time. For earth-based computer systems discuss how events might be assiganed a time stamp which is reasonably close to...

National Business Institute of Australia 20 Clark Rd. Ivanhoe Victoria 3079 www.nbi.com.au 03 9499 7872 FNSORG601A Negotiate to achieve goals and manage disputes Student Workbook melbourne . sydney ....

Solving Two-stage Robust Optimization Problems by A Constraint-and-Column Generation Method Bo Zeng Department of Industrial and Management Systems Engineering University of South Florida, Email:...

Case Summary Read the Discussion Assignment 1-1 on p.24 of the text Winning and Longevity. Select a health care entity to focus on, this could be a clinic or hospital of your choosing. Apply the case...

This paper should include 3-5 pages of content with an additional cover and reference page. This is a total of 5-7 pages. Please be aware that a properly formatted page will include approximately 350...

Page 562 Writing Proposals and Progress Reports Chapter Outline Writing Proposals Proposal Questions Proposal Style Proposals for Class Research Projects Proposals for Action Sales Proposals Business...

DAVID DOESN'T DELEGATE Overcoming an Individual's Immunity to Change AS ANY EXPERIENCED MANAGER will tell us, being an effective delegator is crucial to using everyone's time, skills, and knowledge...

MATHEMATICIANS RISE TO A CHALLENGE ne of the theorems we teach in eighth grade is a + b= *, where c is the length of the hypotenuse of a right triangle in Euclidean space, and a and b are the lengths...

If 12.39 g of Urea (CN_(2)OH_(4)) are produced when 8.87 g of Ammonia react completely with Carbon dioxide gas, what is the percent yield for this reaction? 2NH_(3)(g) + CO_(2)(g) CN_(2)OH_(4)(s) +...

The Bekele Company was incorporated on April 1, 20X0. Bekele had 10 holders of common stock. Rosa Bekele, the president and chief executive officer, held 51% of the shares. The company rented space...

XYZ Sensor Corp. ended the year carrying $13,845,000 worth of inventory. Had they sold their entire inventory at their current prices as noted below, how many more dollars of contribution margin...

If the portfolio standard deviation is relatively large, then the risk in investing in the portfolio is generally Blank _ _ _ _ _ _ . Multiple choice question. low moderate high

Compared with half a century ago, adoption has become _ _ _ _ _ _ _ _ _ common, but it is more open and acceptabl e , so we probably discuss it _ _ _ _ _ _ _ . fill in the blanks more or much less or...

6. Employees need access to career information sources (including advisors and positions available).

3. Employees are encouraged to take active roles in career management.

2. Discuss the protean career and how it differs from the traditional career.