RMIT University at TREC 2009: Web Track
Abstract:
RMIT participated in both tasks of the 2009 Web Track. The Web Track was composed of two tasks: a traditional ad hoc retrieval task, and a new diversity task in which participants attempted to retrieve documents covering a range of subtopics for each query. Subtopics were not provided with the queries. For the ad hoc task, our experiments used two well-known similarity measures, Okapi BM25 and language modeling with Dirichlet smoothing. For the diversity task, we attempted to improve the diversity of query results by minimising the number of documents returned from any single domain. All runs were generated over the Category B subset of the ClueWeb collection, using a customised version of the Zettair search engine that was adapted to index and search a collection of this scale.
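As an illustration of the domain-based idea only, the following minimal sketch reorders a ranked list so that each host contributes a limited number of results near the top. It is not the code used for the submitted runs: the function name `cap_per_domain`, the parameter `max_per_domain`, the use of the URL host as the "domain", and the choice to move surplus results to the end of the list rather than discard them are all assumptions made for this example.

```python
from collections import defaultdict
from urllib.parse import urlparse


def cap_per_domain(ranked_urls, max_per_domain=1):
    """Reorder a ranked URL list so each host appears at most
    `max_per_domain` times before any surplus results.

    Hypothetical helper: the host name stands in for "domain", and
    surplus results are demoted (kept in original order) rather than dropped.
    """
    seen = defaultdict(int)   # host -> number of results already kept
    kept, overflow = [], []
    for url in ranked_urls:
        host = (urlparse(url).hostname or "").lower()
        if seen[host] < max_per_domain:
            seen[host] += 1
            kept.append(url)
        else:
            overflow.append(url)
    # Surplus results follow the diversified prefix, still in rank order.
    return kept + overflow


ranked = [
    "http://a.example/1",
    "http://a.example/2",
    "http://b.example/1",
    "http://a.example/3",
    "http://c.example/1",
]
reranked = cap_per_domain(ranked, max_per_domain=1)
```

With a cap of one, the first three positions of `reranked` come from three different hosts, and the two remaining `a.example` pages follow in their original relative order.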