DID YOU KNOW? DTIC has over 3.5 million final reports on DoD funded research, development, test, and evaluation activities available to our registered users. Click
HERE to register or log in.
Accession Number:
AD1038283
Title:
Multilingual Data Selection for Low Resource Speech Recognition
Descriptive Note:
Conference Paper
Corporate Author:
IBM THOMAS J WATSON RESEARCH CENTER YORKTOWN HEIGHTS NY YORKTOWN HEIGHTS United States
Report Date:
2016-09-12
Pagination or Media Count:
5.0
Abstract:
Feature representations extracted from deep neural network-based multilingual frontends provide significant improvements to speech recognition systems in low resource settings. To effectively train these frontends, we introduce a data selection technique that discovers language groups from an available set of training languages. This data selection method reduces the required amount of training data and training time by approximately 40, with minimal performance degradation. We present speech recognition results on 7 very limited language pack VLLP languages from the second option period of the IARPA Babel program using multilingual features trained on up to 10 languages. The proposed multilingual features provide up to 15 relative improvement over baseline acoustic features on the VLLP languages.
Distribution Statement:
APPROVED FOR PUBLIC RELEASE