Accession Number : AD1034768


Title :   Analysis of Factors Affecting System Performance in the ASpIRE Challenge


Descriptive Note : Technical Report


Corporate Author : MIT Lincoln Laboratory Lexington United States


Personal Author(s) : Melot,Jennifer T ; Malyska,Nicolas ; Ray,Jessica M ; Shen,Wade


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/1034768.pdf


Report Date : 13 Dec 2015


Pagination or Media Count : 6


Abstract : This paper presents an analysis of factors affecting system performance in the ASpIRE (Automatic Speech recognition In Reverberant Environments) challenge. In particular, overall word error rate (WER) of the solver systems is analyzed as a function of room, distance between talker and microphone, and microphone type. We also analyze speech activity detection performance of the solver systems and investigate its relationship to WER. The primary goal of the paper is to provide insight into the factors affecting system performance in the ASpIRE evaluation set across many systems given annotations and metadata that are not available to the solvers. This analysis will inform the design of future challenges and provide insight into the efficacy of current solutions addressing noisy reverberant speech in mismatched conditions.


Descriptors :   automated speech recognition , microphones , voice communications , metadata , analysis of variance , training , attenuation


Subject Categories : Voice Communications


Distribution Statement : APPROVED FOR PUBLIC RELEASE