Accession Number:



Informedia at TRECVID2014: MED and MER, Semantic Indexing, Surveillance Event Detection

Descriptive Note:

Conference Paper

Corporate Author:

Carnegie Mellon University Pittsburgh United States

Report Date:


Pagination or Media Count:



We report on our results in the TRECVID 2011 Multimedia Event Detection MED and Semantic Indexing SIN tasks. Generally, both of these tasks consist of three main steps extracting features, training detectors and fusing. In the feature extraction part, we extracted many low-level features, high-level features and text features. We used the Spatial-Pyramid Matching technique to represent the low-level visual local features, such as SIFT and MoSIFT, which describe the location information of feature points. In the detector training part, besides the traditional SVM, we proposed a Sequential Boosting SVM classifier to deal with the large-scale unbalanced classification problem. In the fusion part, to take the advantages from different features, we tried three different fusion methods early fusion, late fusion and double fusion. Double fusion is a combination of early fusion and late fusion. The experimental results demonstrated that double fusion is consistently better than or at least comparable to early fusion and late fusion.

Subject Categories:

  • Information Science
  • Cybernetics

Distribution Statement: