Accession Number:

ADA470701

Title:

Using Information Extraction to Improve Document Retrieval

Descriptive Note:

Research paper

Corporate Author:

SRI INTERNATIONAL MENLO PARK CA

Report Date:

1998-01-09

Pagination or Media Count:

12

Abstract:

The authors describe an approach to applying a particular kind of Natural Language Processing (NLP) system to the TREC routing task in Information Retrieval (IR). Rather than attempting to use NLP techniques in indexing documents in a corpus, they adapted an information extraction (IE) system to act as a post-filter on the output of an IR system. The IE system was configured to score each of the top 2000 documents as determined by an IR system and, on the basis of that score, to rerank those 2000 documents. One aim was to improve precision on routing tasks. Another was to make it easier to write IE grammars for multiple topics.
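
The post-filter arrangement described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' system. The function names (`rerank_top_k`, `toy_ie_score`), the keyword patterns, and the choice to order purely by the IE score with the original IR order as tie-breaker are all assumptions made for illustration. The abstract states only that the top 2000 documents are scored and reranked on the basis of that score.

```python
from typing import Callable, List, Tuple

Doc = Tuple[str, str]  # (doc_id, text); a hypothetical document representation


def rerank_top_k(
    ir_ranked: List[Doc],
    ie_score: Callable[[Doc], float],
    k: int = 2000,
) -> List[Doc]:
    """Rerank the first k IR results by an IE-derived score.

    Documents below rank k are left in their original IR order.
    """
    head, tail = ir_ranked[:k], ir_ranked[k:]
    # sorted() is stable, even with reverse=True, so documents with
    # equal IE scores keep their relative IR order.
    reranked = sorted(head, key=ie_score, reverse=True)
    return reranked + tail


def toy_ie_score(doc: Doc) -> float:
    """Stand-in scorer (assumption): counts topic keyword occurrences.

    A real IE system would match grammar patterns for the topic instead.
    """
    patterns = ("acquired", "merger")
    text = doc[1].lower()
    return float(sum(text.count(p) for p in patterns))


if __name__ == "__main__":
    ir_output = [
        ("d1", "Weather report"),
        ("d2", "Company A acquired Company B in a merger"),
        ("d3", "The merger was approved"),
        ("d4", "acquired acquired acquired"),
    ]
    # k=3 for the demo: only the first three documents are rescored.
    for doc_id, _ in rerank_top_k(ir_output, toy_ie_score, k=3):
        print(doc_id)
```

Because the IE component only reorders a fixed candidate set, it cannot retrieve documents the IR system missed. This is consistent with the stated aim of improving precision.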

Subject Categories:

  • Linguistics
  • Computer Programming and Software
  • Cybernetics

Distribution Statement:

APPROVED FOR PUBLIC RELEASE