Accession Number:

ADA459304

Title:

Probabilistic Structured Query Methods

Descriptive Note:

Technical rept.

Corporate Author:

MARYLAND UNIV COLLEGE PARK LANGUAGE AND MEDIA PROCESSING LAB

Personal Author(s):

Report Date:

2003-02-01

Pagination or Media Count:

9.0

Abstract:

Structured methods for query term replacement rely on separate estimates of term frequency and document frequency to compute the weight for each query term. This paper reviews prior work on structured query techniques and introduces three new variants that leverage estimates of replacement probabilities. Statistically significant improvements in retrieval effectiveness are demonstrated for cross-language retrieval and for retrieval based on optical character recognition when replacement probabilities are used to estimate both term frequency and document frequency.

Subject Categories:

  • Information Science
  • Statistics and Probability

Distribution Statement:

APPROVED FOR PUBLIC RELEASE