Document and Query Expansion Models for Blog Distillation
CARNEGIE-MELLON UNIV PITTSBURGH PA LANGUAGE TECHNOLOGIES INST
This paper presents the CMU submission to the 2008 TREC blog distillation track. As in last year's experiments, we evaluate different retrieval models and apply a query expansion method that leverages the link structure of Wikipedia. We also explore a corpus that combines several representations of each document, drawing on both the feed XML and the permalink HTML, and run initial experiments with spam filtering.
- Information Science