Question: Consider ranking of documents using a Language Modeling-based IR algorithm.

Consider ranking of documents using a Language Modeling-based IR algorithm:

P(q | M_{d}) = \prod_{\text{distinct term } t \text{ in } q} P(t | M_{d})^{tf_{t,q}}

where tf_{t,q} is the term frequency: the number of occurrences of t in query q.

We estimate the parameters P(t | M_{d}) using the Maximum Likelihood Estimate (MLE) as:

P(t | M_{d}) = \frac{tf_{t,d}}{|d|}

where |d| is the length of document d and tf_{t,d} is the term frequency: the number of occurrences of t in document d.
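As a minimal sketch of the MLE estimate, the snippet below computes tf_{t,d} / |d| for document d_{2} from this question. The helper names `tokenize` and `mle_prob` are hypothetical, and the tokenizer (lowercase, strip ASCII punctuation, split on whitespace) is an assumption, since the question does not say how text is tokenized.

```python
from collections import Counter
import string


def tokenize(text):
    """Assumed tokenizer: lowercase, drop ASCII punctuation, split on whitespace."""
    cleaned = text.lower().translate(str.maketrans("", "", string.punctuation))
    return cleaned.split()


def mle_prob(term, tokens):
    """MLE estimate P(t | M_d) = tf_{t,d} / |d| for a tokenized document."""
    return Counter(tokens)[term] / len(tokens)


d2 = tokenize("epistemological considerations such as what is being measured.")

print(mle_prob("epistemological", d2))  # 0.125, i.e. 1 occurrence in 8 tokens
print(mle_prob("design", d2))           # 0.0, the term never occurs in d2
```

The second result is the zero-probability case: any query containing "design" would get an unsmoothed likelihood of 0 for d_{2}, whatever its other terms.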

To avoid problems with zero probabilities, we smooth the estimates. First, we define:

P(t | M_{c}) = \frac{cf_{t}}{T}

where M_{c} is the collection model, cf_{t} is the number of occurrences of t in the collection, and T = \sum_{t} cf_{t} is the total number of tokens in the collection.

We use P(t | M_{c}) to smooth P(t | d) using Jelinek-Mercer smoothing as:

P(t | d) = \lambda P(t | M_{d}) + (1 - \lambda) P(t | M_{c})
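The interpolation can be sketched as a single function. This assumes the common convention in which \lambda weights the document model and (1 - \lambda) weights the collection model. The function name `jelinek_mercer` and the input values are hypothetical and not taken from the question.

```python
def jelinek_mercer(p_doc, p_coll, lam):
    """Linear interpolation: lam * P(t | M_d) + (1 - lam) * P(t | M_c).

    Assumes lam is the weight on the document model.
    """
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must lie in [0, 1]")
    return lam * p_doc + (1.0 - lam) * p_coll


# Illustrative values only: a term absent from the document (p_doc = 0)
# still receives probability mass from the collection model.
print(jelinek_mercer(0.0, 0.1, 0.5))  # 0.05
```

As long as \lambda < 1 and the term occurs somewhere in the collection, the smoothed estimate is strictly positive.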

Consider the following documents (d_{1} and d_{2}) and query (q):

d_{1} = epistemological considerations should also address learning design.

d_{2} = epistemological considerations such as what is being measured.

q: epistemological design

Rank d_{1} and d_{2} with respect to q using Jelinek-Mercer smoothing. Use \lambda = 0.75.
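One way to check the ranking numerically is the sketch below. It is not the hidden expert solution, and it rests on three assumptions: the collection consists of d_{1} and d_{2} only, tokens are obtained by lowercasing, stripping punctuation and splitting on whitespace, and \lambda = 0.75 is the weight on the document model. The names `tokenize` and `score` are hypothetical.

```python
from collections import Counter
import string

LAMBDA = 0.75  # assumed to be the weight on the document model


def tokenize(text):
    """Assumed tokenizer: lowercase, drop ASCII punctuation, split on whitespace."""
    cleaned = text.lower().translate(str.maketrans("", "", string.punctuation))
    return cleaned.split()


def score(query, doc, collection, lam=LAMBDA):
    """Query likelihood P(q | d) with Jelinek-Mercer smoothing."""
    tf_q, tf_d, cf = Counter(query), Counter(doc), Counter(collection)
    result = 1.0
    for term, count in tf_q.items():          # distinct query terms
        p_doc = tf_d[term] / len(doc)         # P(t | M_d) = tf_{t,d} / |d|
        p_coll = cf[term] / len(collection)   # P(t | M_c) = cf_t / T
        result *= (lam * p_doc + (1 - lam) * p_coll) ** count
    return result


docs = {
    "d1": tokenize("epistemological considerations should also address learning design."),
    "d2": tokenize("epistemological considerations such as what is being measured."),
}
query = tokenize("epistemological design")

# Assumption: the collection is just these two documents (T = 7 + 8 = 15 tokens).
collection = [tok for tokens in docs.values() for tok in tokens]

scores = {name: score(query, tokens, collection) for name, tokens in docs.items()}
ranking = sorted(scores, key=scores.get, reverse=True)
for name in ranking:
    print(name, round(scores[name], 6))
```

Under these assumptions the scores are 767/44100 (about 0.0174) for d_{1} and 61/28800 (about 0.0021) for d_{2}, so d_{1} ranks above d_{2}. The gap comes from "design", which occurs in d_{1} but reaches d_{2} only through the collection model.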

