Accession Number : AD1001124

Title :   Deep Neural Network Based Supervised Speech Segregation Generalizes to Novel Noises through Large-scale Training

Descriptive Note : Technical Report

Corporate Author : Ohio State University Columbus

Personal Author(s) : Wang,Yuxuan ; Chen,Jitong ; Wang,DeLiang

Full Text :

Report Date : 01 Jan 2015

Pagination or Media Count : 8

Abstract : Deep neural network (DNN) based supervised speech segregation has been successful in improving human speech intelligibility in noise, especially when DNN is trained and tested on the same noise type. A simple and effective way for improving generalization is to train with multiple noises. This letter demonstrates that by training with a large number of different noises, the objective intelligibility results of DNN based supervised speech segregation on novel noises can match or even outperform those on trained noises. This demonstration has an important implication that improving human speech intelligibility in unknown noisy environments is potentially achievable.

Descriptors :   artificial neural networks , intelligibility , noise , SPEECH ANALYSIS , training

Distribution Statement : APPROVED FOR PUBLIC RELEASE