Accession Number : AD1033823


Title :   Speaker Linking and Applications using Non-Parametric Hashing Methods


Descriptive Note : Technical Report


Corporate Author : MIT Lincoln Laboratory Lexington United States


Personal Author(s) : Campbell,William M ; Sturim,Douglas E


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/1033823.pdf


Report Date : 08 Sep 2016


Pagination or Media Count : 5


Abstract : Large unstructured audio data sets have become ubiquitous and present a challenge for organization and search. One logical approach for structuring data is to find common speakers and link occurrences across different recordings. Prior approaches to this problem have focused on basic methodology for the linking task. In this paper, we introduce a novel trainable nonparametric hashing method for indexing large speaker recording data sets. This approach leads to tunable computational complexity methods for speaker linking. We focus on a scalable clustering method based on hashingcanopy-clustering. We apply this method to a large corpus of speaker recordings, demonstrate performance tradeoffs, and compare to other hashing methods.


Descriptors :   clustering , speech recognition , algorithms , hash tables , computations , pattern recognition


Subject Categories : Voice Communications


Distribution Statement : APPROVED FOR PUBLIC RELEASE