Conversational Telephone Speech Corpus Collection for the NIST Speaker Recognition Evaluation 2004
NATIONAL INST OF STANDARDS AND TECHNOLOGY GAITHERSBURG MD
Pagination or Media Count:
This paper discusses some of the factors that should be considered when designing a speech corpus collection to be used for text-independent speaker recognition evaluation. The factors include telephone handset type, telephone transmission type, language, and non-telephone microphone type. The paper describes the design of the new corpus collection being undertaken by the Linguistic Data Consortium LDC to support the 2004 and subsequent NIST speech recognition evaluations. Some preliminary information on the resulting 2004 evaluation test set is offered.
- Voice Communications