Accession Number:

ADA151320

Title:

Speech Analysis/Synthesis Based on Perception.

Descriptive Note:

Technical rept.,

Corporate Author:

MASSACHUSETTS INST OF TECH LEXINGTON LINCOLN LAB

Personal Author(s):

Report Date:

1984-11-05

Pagination or Media Count:

272

Abstract:

This dissertation describes a speech system that has been developed from a combination of physiological and psychoacoustic results. The system contains a nonuniform FilterDetector bank. A new relationship between FilterDetectors and the Short-Time Fourier Transform magnitude is derived, and a generalized version of the Short-Time Fourier Transform magnitude is used to implement the analysis system. The new relationship is also applied to a discussion of channel vocoders, spectrograms, the sliding Discrete Fourier Transform, average power spectrum estimation, and nonuniform bandwidth analysis. Next, a new synthesis approach is used to reconstruct signals from the magnitude data produced by the nonuniform analysis. Apart from an overall sign factor, the analysis-synthesis system achieves exact reconstruction in the absence of data modification. The ability of the system to reconstruct signals from modified data is also demonstrated. Suggestions for further research, including data reduction and automatic speech recognition applications, are given. Keywords include Auditory modeling, Short-time Fourier transform, Magnitude-only reconstruction, Power spectrum estimation, Perception, Filter banks, Speech recognition, Spectrograms, and Vocoders.
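
Illustrative note (not part of the original record): the abstract does not state the relationship it derives between FilterDetectors and the Short-Time Fourier Transform magnitude. The sketch below shows only the standard textbook filter-bank view, which may differ from the report's result: a band-pass filter followed by a magnitude detector gives the same values as the Short-Time Fourier Transform magnitude at that filter's center frequency. The function names, the Hamming window, the window length, and the test frequency are all assumptions chosen for illustration.

```python
import numpy as np

def stft_magnitude(x, w, omega):
    """|X(n, omega)|, with X(n, omega) = sum_m x[m] * w[n - m] * exp(-1j * omega * m)."""
    m = np.arange(len(x))
    demodulated = x * np.exp(-1j * omega * m)   # shift the band at omega down to DC
    return np.abs(np.convolve(demodulated, w))  # low-pass with the window, then detect

def filter_detector(x, w, omega):
    """Magnitude at the output of the band-pass filter h[k] = w[k] * exp(1j * omega * k)."""
    k = np.arange(len(w))
    h = w * np.exp(1j * omega * k)              # window modulated up to omega
    return np.abs(np.convolve(x, h))            # band-pass filter, then detect

# Hypothetical test signal and analysis parameters (assumptions, not from the report).
rng = np.random.default_rng(0)
x = rng.standard_normal(256)
w = np.hamming(32)
omega = 2 * np.pi * 0.1                         # radians per sample

a = stft_magnitude(x, w, omega)
b = filter_detector(x, w, omega)
max_diff = float(np.max(np.abs(a - b)))         # differs only by floating-point rounding
```

The two outputs agree because the band-pass output equals exp(1j * omega * n) times X(n, omega), and that unit-magnitude factor vanishes once the magnitude is taken. A nonuniform bank, as described in the abstract, would use a different window length for each channel; that extension is not shown here.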

Subject Categories:

  • Psychology
  • Anatomy and Physiology
  • Acoustic Detection and Detectors
  • Voice Communications

Distribution Statement:

APPROVED FOR PUBLIC RELEASE