AES E-Library

Phoneme Recognition for 3D Modeled Digital Character Talking Emulation

The current paper focuses on the design and implementation of a phoneme recognition algorithm that is used to extract the appropriate parameters in order to drive a 3d graphics facial expression and animation procedure. This is used to emulate speech generation to 3d modeled digital characters. At the first development step, LPC, STFT analysis, wavelets, cepstrum and pattern recognition techniques were tested for phoneme recognition and speaker classification. Then, 3d graphics facial expressions and phonemes were related in a library. A client/server application that processes speech, combines library data via morphing techniques and generates a digital character, virtually speaking according to the given speech, was finally designed. Possible applications include cartoon dubbing and web based virtual teleconference.

 

Author (s):
Affiliation: (See document for exact affiliation information.)
AES Convention: Paper Number:
Publication Date:
Session subject:

DOI:


Click to purchase paper as a non-member or login as an AES member. If your company or school subscribes to the E-Library then switch to the institutional version. If you are not an AES member Join the AES. If you need to check your member status, login to the Member Portal.

Type:
16938
Choose your country of residence from this list: