Accession Number:

ADA440321

Title:

Statistical Language Modeling for Information Retrieval

Descriptive Note:

Corporate Author:

MASSACHUSETTS UNIV AMHERST CENTER FOR INTELLIGENT INFORMATION RETRIEVAL

Personal Author(s):

Report Date:

2005-01-01

Pagination or Media Count:

42.0

Abstract:

This chapter reviews research and applications in statistical language modeling for information retrieval IR that has emerged within the past several years as a new probabilistic framework for describing information retrieval processes. Generally speaking, statistical language modeling, or more simply, language modeling LM, refers to the task of estimating a probability distribution that captures statistical regularities of natural language use. Applied to information retrieval, language modeling refers to the problem of estimating the likelihood that a query and a document could have been generated by the same language model, given the language model of the document and with or without a language model of the query.

Subject Categories:

  • Information Science
  • Statistics and Probability

Distribution Statement:

APPROVED FOR PUBLIC RELEASE