ICTNET at Web Track TREC2014
CHINESE ACADEMY OF SCIENCES BEIJING INST OF COMPUTING TECHNOLOGY
The ad hoc task in TREC investigates the performance of systems that search a static set of documents using previously unseen topics. This year, the ClueWeb12 dataset is used. The overall goal of the risk-sensitive task is to explore algorithms and evaluation methods for systems that try to jointly maximize an average effectiveness measure across queries while minimizing effectiveness losses with respect to a provided baseline. Two baselines from different IR systems are supplied this year in order to understand the nature of the risk-reward tradeoffs achievable by a system that can adapt to different baselines. The rest of this paper is organized as follows. In Section 2, we discuss the processing of ClueWeb12, derived data, and external resources. In Section 3, we introduce the BM25 model with term proximity, the diversification method, and the results fusion strategy. We report experimental results and the corresponding re-ranking strategy in Section 4. Finally, we conclude our work in Section 5.
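The risk-reward tradeoff described above can be illustrated with a small sketch. The abstract does not state which measure is used, so the following assumes a utility of the form commonly used for risk-sensitive evaluation: per-query gains over the baseline count once, while per-query losses are weighted by (1 + alpha). The function name, the parameter names, and the sample scores are hypothetical and serve only as an illustration.

```python
def risk_sensitive_utility(system_scores, baseline_scores, alpha=1.0):
    """Average per-query gain over a baseline, with losses penalized.

    Assumed form (not taken from the abstract): a query where the system
    beats the baseline contributes its gain; a query where it falls short
    contributes its (negative) difference multiplied by (1 + alpha).
    With alpha = 0 this reduces to the plain average difference.
    """
    if not system_scores or len(system_scores) != len(baseline_scores):
        raise ValueError("score lists must be non-empty and of equal length")

    total = 0.0
    for sys_score, base_score in zip(system_scores, baseline_scores):
        delta = sys_score - base_score
        # Wins count once; losses are amplified by the risk-aversion weight.
        total += delta if delta >= 0 else (1.0 + alpha) * delta
    return total / len(system_scores)


# Hypothetical per-query effectiveness scores for three topics.
system = [0.5, 0.2, 0.4]
baseline = [0.3, 0.4, 0.4]

neutral = risk_sensitive_utility(system, baseline, alpha=0.0)
averse = risk_sensitive_utility(system, baseline, alpha=1.0)
print(neutral, averse)
```

With alpha set to 0 the one win and the one loss in this sample cancel out, whereas a positive alpha makes the same run score below the baseline, which is the behavior a risk-averse system tries to avoid.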
- Information Science