Trainable Videorealistic Speech Animation
Abstract:
I describe how to create, using machine learning techniques, a generative, videorealistic speech animation module. A human subject is first recorded with a video camera as he/she utters a predetermined speech corpus. After the corpus is processed automatically, a visual speech module is learned from the data that is capable of synthesizing the human subject's mouth uttering entirely novel utterances that were not recorded in the original video. The synthesized utterance is re-composited onto a background sequence which contains natural head and eye movement. The final output is videorealistic in the sense that it looks like a video camera recording of the subject. At run time, the input to the system can be either real audio sequences or synthetic audio produced by a text-to-speech system, as long as it has been phonetically aligned.
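The pipeline described above (record a corpus, learn a visual speech module, synthesize mouth configurations for a novel phonetically aligned utterance, and composite them onto a background sequence) can be sketched at a high level as follows. This is an illustrative outline only: the function names, data structures, and the trivial phoneme-to-mouth lookup are assumptions standing in for the learned module, not the system's actual implementation.

```python
from dataclasses import dataclass

@dataclass
class Utterance:
    """A phonetically aligned input (from real audio or a TTS system)."""
    phonemes: list

def learn_visual_speech_module(corpus):
    """Stand-in for the learning step: index recorded mouth images by phoneme.
    (The real system learns a generative model; a lookup table is a placeholder.)"""
    return {frame["phoneme"]: frame["mouth_image"] for frame in corpus}

def synthesize_mouth_sequence(module, utterance):
    """Map each phoneme of a novel utterance to a learned mouth configuration."""
    return [module[p] for p in utterance.phonemes if p in module]

def composite(mouth_frames, background_frames):
    """Overlay synthesized mouths onto background frames that carry
    natural head and eye movement."""
    return [{"background": bg, "mouth": m}
            for bg, m in zip(background_frames, mouth_frames)]

# Toy recorded corpus: two frames, each pairing a phoneme with a mouth image.
corpus = [{"phoneme": "AA", "mouth_image": "open"},
          {"phoneme": "M", "mouth_image": "closed"}]

module = learn_visual_speech_module(corpus)
frames = synthesize_mouth_sequence(module, Utterance(phonemes=["M", "AA", "M"]))
video = composite(frames, ["bg0", "bg1", "bg2"])
print([f["mouth"] for f in video])  # → ['closed', 'open', 'closed']
```

The utterance synthesized here ("M", "AA", "M") never appears in the toy corpus as a sequence, mirroring the paper's point that the learned module generalizes to utterances not present in the original recording.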