Question: Task 2 : Design a Jelinek - Mercer based Language Model ( JM _ LM ) that ranks documents in each data collection using the

Task

2

: Design a Jelinek

-

Mercer based Language Model

(

_

)

that ranks documents in

each data collection using the corresponding topic

(

query

)

for all

50

data collections.

Inputs:

50

long queries

(

topics

)

in the

50

Queries.txt and the corresponding

50

data collections

(

Data

_

101,

Data

_

102, . . .,

Data

_

150) .

Output:

50

ranked document files

(

.

.,

for Query R

107,

the output file name is

_

_

107

Ranking.dat

)

for all

50

data collections and save them in the folder

RankingOutputs

.

For each long query

(

topic

)

,

you need to use the following equation to calculate a conditional

probability for each document D in the corresponding data collection

(

dataset

)

3

where is the number of times query word qi occurs in document D

, |

|

is the number of

word occurrences in D

,

is the number of times query word qi occurs in the data collection

Data

_

, |

Data

_

|

is the total number of word occurrences in data collection Data

_

,

and

parameter

\

lambda

= 0.4 .

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock blur-text-image

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Databases Questions!

Task 2 : Design a Jelinek - Mercer based Language Model ( JM _ LM ) that ranks documents in each data collection using the corresponding topic ( query ) for all 5 0 data collections. Inputs: 5 0 long...

Task 1 : Design a BM 2 5 - based IR model ( BM 2 5 ) that ranks documents in each data collection using the corresponding topic ( query ) for all 5 0 data collections. Inputs: 5 0 long queries (...

Task 3 . Based on the knowledge you gained from this unit, design a pseudo - relevance model ( My _ PRM ) to rank documents in each data collection using the corresponding topic ( query ) for all 5 0...

I have to create a program in C and I can't figure it out. The program has to read a source file. Please help. /******************************************************************** PROJECT: Glossary...

INTERNATIONAL REVIEW OF L AW C OMPUTERS & TECHNOLOGY , VOLUME 11, N UMBER 2, P AGES 251-261, 1997 The Data Mart: A New Approach to Data Warehousing PAM ELA PIPE Introduction Vendors have recently...

Identify digital transformation theories mentioned in the article SMR JOURNAL OF SERVICE B 20280 MANAGEMENT RESEARCH SMR . Journal of Service Management Research . Issue 02/2018 EDITORS Digital...

MATHEMATICS FOR MACHINE LEARNING Marc Peter Deisenroth A. Aldo Faisal Cheng Soon Ong Contents Foreword 1 Part I Mathematical Foundations 9 1 Introduction and Motivation 11 1.1 Finding Words for...

I have attached 2 business research. Write a 700- to 1,050-word paper in which you practice identifying the critical first stage of developing any research study: State the purpose of the business...

There are two problems due this week (each worth 35 points) as follows. Problem 1.6 (page 20) In comprehensive paragraphs, answerrequirements a to e. You will have 5 paragraphs total of four to five...

Hi, I have an Assignment for my Finance Subject. I have attached the necessary documentation here for you to view including the Lecture slides of all the Topics covered for this assignment. Please...

On November 24, 2008, 26 passengers on Tom Paris Airlines Flight No. 901 were injured upon landing when the plane skidded off the runway. Personal injury suits for damages totaling $5,000,000 were...

How would you restate the statement of financial position of a business?

We can estimate a stock's value by Multiple Choice using the book value of the total stockholder equity section. using the book value of the total assets divided by fhe number of shares outstanding....

CT Corp Comprehensive Question Canadian Tire Corporation, Limited (Canadian Tire) is a family of companies that includes a retail segment and a financial services division, among others. The retail...

3. Explain the relationship between history, power, and intercultural communication.

8. Explain the contact hypothesis.

7. Identify four antecedents that influence intercultural contact.