Accession Number : AD1001124


Title :   Deep Neural Network Based Supervised Speech Segregation Generalizes to Novel Noises through Large-scale Training


Descriptive Note : Technical Report


Corporate Author : Ohio State University Columbus


Personal Author(s) : Wang,Yuxuan ; Chen,Jitong ; Wang,DeLiang


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/1001124.pdf


Report Date : 01 Jan 2015


Pagination or Media Count : 8


Abstract : Deep neural network (DNN) based supervised speech segregation has been successful in improving human speech intelligibility in noise, especially when DNN is trained and tested on the same noise type. A simple and effective way for improving generalization is to train with multiple noises. This letter demonstrates that by training with a large number of different noises, the objective intelligibility results of DNN based supervised speech segregation on novel noises can match or even outperform those on trained noises. This demonstration has an important implication that improving human speech intelligibility in unknown noisy environments is potentially achievable.


Descriptors :   artificial neural networks , intelligibility , noise , SPEECH ANALYSIS , training


Distribution Statement : APPROVED FOR PUBLIC RELEASE