Accession Number:

ADA517740

Title:

POSTECH at TREC 2009 Blog Track: Top Stories Identification

Descriptive Note:

Conference paper

Corporate Author:

POHANG UNIV OF SCIENCE AND TECHNOLOGY (SOUTH KOREA) DEPT OF ELECTRONIC AND ELECTRICAL ENGINEERING

Report Date:

2009-11-01

Pagination or Media Count:

8.0

Abstract:

This paper describes our participation in the TREC 2009 Blog Track. Our system consists of the query likelihood component and the news headline prior component, based on the language model framework. For the query likelihood, we propose several approaches to estimate the query language model and the news headline language model. We also suggest two approaches to choose the 10 supporting relevant posts Feed-Based Selection and Cluster-Based Selection. Furthermore, we propose two criteria to estimate the news headline prior for a given day. Experimental results show that using the prior significantly improves the performance of the task.

Subject Categories:

  • Information Science
  • Computer Programming and Software
  • Test Facilities, Equipment and Methods
  • Linguistics

Distribution Statement:

APPROVED FOR PUBLIC RELEASE