Accession Number:

ADA512700

Title:

Document and Query Expansion Models for Blog Distillation

Descriptive Note:

Conference paper

Corporate Author:

CARNEGIE-MELLON UNIV PITTSBURGH PA LANGUAGE TECHNOLOGIES INST

Report Date:

2008-11-01

Pagination or Media Count:

7

Abstract:

This paper presents the CMU submission to the 2008 TREC blog distillation track. As in last year's experiments, we evaluate different retrieval models and apply a query expansion method that leverages the link structure in Wikipedia. We also explore using a corpus that combines several different representations of the documents, drawing on both the feed XML and the permalink HTML, and conduct initial experiments with spam filtering.

Subject Categories:

  • Information Science

Distribution Statement:

APPROVED FOR PUBLIC RELEASE