Using Information Extraction to Improve Document Retrieval
SRI INTERNATIONAL MENLO PARK CA
The authors describe an approach to applying a particular kind of Natural Language Processing (NLP) system to the TREC routing task in Information Retrieval (IR). Rather than attempting to use NLP techniques to index the documents in a corpus, they adapted an information extraction (IE) system to act as a post-filter on the output of an IR system. The IE system was configured to score each of the top 2000 documents returned by the IR system and to rerank those 2000 documents on the basis of that score. One aim was to improve precision on routing tasks. Another was to make it easier to write IE grammars for multiple topics.
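The post-filter arrangement described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `rerank_with_ie`, the `ie_score` callable, and the tie-breaking rule (fall back to the original IR rank) are all assumptions made for illustration, since the abstract does not say how the IE score was computed or whether it was combined with the IR score.

```python
from typing import Callable, List, Sequence


def rerank_with_ie(
    ir_ranked: Sequence[str],
    ie_score: Callable[[str], float],
    top_k: int = 2000,
) -> List[str]:
    """Rerank the top_k IR results by a hypothetical IE-derived score.

    ir_ranked: documents (or document ids) in the order the IR system returned them.
    ie_score:  assumed stand-in for the IE system; higher means more relevant.
    """
    head = list(ir_ranked[:top_k])   # only these are scored by the IE post-filter
    tail = list(ir_ranked[top_k:])   # anything beyond top_k keeps its IR order

    # Pair each document with its IE score and its original IR rank.
    scored = [(ie_score(doc), rank, doc) for rank, doc in enumerate(head)]

    # Sort by IE score (descending); ties fall back to IR rank (an assumption).
    scored.sort(key=lambda item: (-item[0], item[1]))

    return [doc for _, _, doc in scored] + tail


# Toy usage with made-up scores; top_k is shrunk to 3 to keep the example small.
toy_scores = {"d1": 0.2, "d2": 0.9, "d3": 0.2, "d4": 0.99}
reranked = rerank_with_ie(["d1", "d2", "d3", "d4"], lambda d: toy_scores[d], top_k=3)
print(reranked)
```

Because the IE system only sees what the IR system has already retrieved, a post-filter of this shape can change the order of the top-ranked documents, and so precision, but cannot bring in documents the IR system missed.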
- Computer Programming and Software