Method and system for aligning natural and synthetic video to speech synthesis

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 7844463
SERIAL NO

12193397

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

According to MPEG-4's TTS architecture, facial animation can be driven by two streams simultaneously—text and Facial Animation Parameters. A Text-To-Speech converter drives the mouth shapes of the face. An encoder sends Facial Animation Parameters to the face. The text input can include codes, or bookmarks, transmitted to the Text-to-Speech converter, which are placed between and inside words. The bookmarks carry an encoder time stamp. Due to the nature of text-to-speech conversion, the encoder time stamp does not relate to real-world time, and should be interpreted as a counter. The Facial Animation Parameter stream carries the same encoder time stamp found in the bookmark of the text. The system reads the bookmark and provides the encoder time stamp and a real-time time stamp. The facial animation system associates the correct facial animation parameter with the real-time time stamp using the encoder time stamp of the bookmark as a reference.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NUANCE COMMUNICATIONS INC1 WAYSIDE ROAD BURLINGTON MA 01803

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Basso, Andrea North Long Beach, US 219 4835
Beutnagel, Mark Charles High Bridge, US 32 484
Ostermann, Joern Red Bank, US 79 1671

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation