Accession Number:

ADA618656

Title:

ICTNET at Web Track TREC2014

Descriptive Note:

Conference paper

Corporate Author:

CHINESE ACADEMY OF SCIENCES BEIJING INST OF COMPUTING TECHNOLOGY

Report Date:

2014-11-01

Pagination or Media Count:

4

Abstract:

An ad hoc task in TREC investigates the performance of systems that search a static set of documents using previously unseen topics. This year, the ClueWeb12 dataset is used. The overall goal of the risk-sensitive task is to explore algorithms and evaluation methods for systems that try to jointly maximize an average effectiveness measure across queries while minimizing effectiveness losses with respect to a provided baseline. Two baselines from different IR systems are supplied this year in order to understand the nature of the risk-reward tradeoffs achievable by a system that can adapt to different baselines. The rest of this paper is organized as follows. In Section 2, we discuss the processing of ClueWeb12, derived data, and external resources. In Section 3, the BM25 model with term proximity, the diversification method, and the result fusion strategy are introduced. We report experimental results and the corresponding re-ranking strategy in Section 4. Finally, we conclude our work in Section 5.
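
The risk-sensitive objective described in the abstract can be illustrated with a minimal sketch. It assumes a commonly used formulation in which per-query gains over the baseline count at face value and per-query losses are weighted by (1 + alpha). The function name, the alpha parameter, and the example scores are illustrative assumptions, not details taken from this record or confirmed as the paper's own method.

```python
def risk_sensitive_utility(run_scores, baseline_scores, alpha=1.0):
    """Mean per-query gain over a baseline, with losses weighted by (1 + alpha).

    run_scores and baseline_scores map a query id to an effectiveness score
    (any per-query measure). alpha is an assumed risk-aversion parameter:
    alpha = 0 reduces to the plain mean difference from the baseline.
    """
    if run_scores.keys() != baseline_scores.keys():
        raise ValueError("run and baseline must cover the same queries")
    if not run_scores:
        raise ValueError("at least one query is required")

    total = 0.0
    for query_id, score in run_scores.items():
        delta = score - baseline_scores[query_id]
        # Wins count at face value; losses are amplified by (1 + alpha).
        total += delta if delta >= 0 else (1.0 + alpha) * delta
    return total / len(run_scores)


# Hypothetical scores: one query improves by 0.25, one degrades by 0.25.
run = {"q1": 0.5, "q2": 0.25}
baseline = {"q1": 0.25, "q2": 0.5}
neutral = risk_sensitive_utility(run, baseline, alpha=0.0)
averse = risk_sensitive_utility(run, baseline, alpha=1.0)
```

With alpha = 0 the win and the loss cancel. A larger alpha makes the same run score below the baseline, which is the tradeoff a risk-sensitive system has to manage.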

Subject Categories:

  • Information Science

Distribution Statement:

APPROVED FOR PUBLIC RELEASE