Style-aware Mid-level Representation for Discovering Visual Connections in Space and Time (Open Access)
Carnegie Mellon University Pittsburgh United States
Pagination or Media Count:
We present a weakly-supervised visual data mining approach that discovers connections between recurring mid-level visual elements in historic temporal and geographic spatial image collections, and attempts to capture the underlying visual style. In contrast to existing discovery methods that mine for patterns that remain visually consistent throughout the dataset, our goal is to discover visual elements whose appearance changes due to change in time or location i.e., exhibit consistent stylistic variations across the label space date or geo-location. To discover these elements, we first identify groups of patches that are style-sensitive. We then incrementally build correspondences to find the same element across the entire dataset. Finally, we train style-aware regressors that model each elements range of stylistic differences. We apply our approach to date and geo-location prediction and show substantial improvement over several baselines that do not model visual style. We also demonstrate the methods effectiveness on the related task of fine-grained classification.