Accession Number:

ADA456310

Title:

IBM's Statistical Question Answering System - TREC-11

Descriptive Note:

Corporate Author:

IBM THOMAS J WATSON RESEARCH CENTER YORKTOWN HEIGHTS NY

Personal Author(s):

Report Date:

2006-01-01

Pagination or Media Count:

9.0

Abstract:

In this paper, we document our efforts to extend our statistical question answering system for TREC-11. We incorporated a web search feature, and novel extensions of statistical machine translation as well as extracting lexical patterns for exact answers from a supervised corpus. Without modification to our base set of thirty-one categories, we were able to achieve a confidence weighted score of 0.455 and an accuracy of 29. We improved our model on selecting exact answers by insisting on exact answers in the training corpus and this resulted in a 7 gain on TREC-11 but a much larger gain of 46 on TREC-10.

Subject Categories:

  • Information Science
  • Cybernetics

Distribution Statement:

APPROVED FOR PUBLIC RELEASE