Neural network acoustic and visual speech recognition system training method and apparatus

United States of America Patent

PATENT NO 5621858
SERIAL NO 08137318

Abstract

The apparatus for the recognition of speech includes an acoustic preprocessor, a visual preprocessor, and a speech classifier that operates on the acoustic and visual preprocessed data. The acoustic preprocessor comprises a log mel spectrum analyzer that produces an equal mel bandwidth log power spectrum. The visual processor detects the motion of a set of fiducial markers on the speaker's face and extracts a set of normalized distance vectors describing lip and mouth movement. The speech classifier uses a multilevel time-delay neural network operating on the preprocessed acoustic and visual data to form an output probability distribution that indicates the probability of each candidate utterance having been spoken, based on the acoustic and visual data. The training system includes the speech recognition apparatus and a control processor with an associated memory. Noisy acoustic input training data together with visual data is used to generate acoustic and visual feature training vectors for processing by the speech classifier. A control computer adjusts the synaptic weights of the speech classifier based upon the noisy input training data and exemplar output vectors for producing a robustly trained classifier based on the analogous visual counterpart of the Lombard effect.
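The preprocessing and output stages named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the mel formula, the marker names, the choice of reference distance for normalization, and the use of a softmax to form the output distribution are all assumptions made for illustration, and every function name below is hypothetical.

```python
import math

def hz_to_mel(f_hz):
    # A common mel formula; an assumption, since the abstract does not specify one.
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_band_edges(f_low_hz, f_high_hz, n_bands):
    """Return n_bands + 1 edge frequencies (Hz) for bands of equal mel bandwidth."""
    m_low, m_high = hz_to_mel(f_low_hz), hz_to_mel(f_high_hz)
    step = (m_high - m_low) / n_bands
    return [mel_to_hz(m_low + i * step) for i in range(n_bands + 1)]

def log_band_power(power_spectrum, bin_hz, edges_hz, floor=1e-10):
    """Sum the power of the FFT bins falling in each band, then take the log."""
    out = []
    for lo, hi in zip(edges_hz[:-1], edges_hz[1:]):
        total = sum(p for k, p in enumerate(power_spectrum) if lo <= k * bin_hz < hi)
        out.append(math.log(max(total, floor)))  # floor avoids log(0)
    return out

def normalized_distances(markers, pairs, reference_pair):
    """Distances between marker pairs, divided by a reference distance.

    `markers` maps hypothetical marker names to (x, y) positions.
    """
    def dist(a, b):
        (xa, ya), (xb, yb) = markers[a], markers[b]
        return math.hypot(xb - xa, yb - ya)
    ref = dist(*reference_pair)
    return [dist(a, b) / ref for a, b in pairs]

def softmax(scores):
    """Turn classifier scores into a probability distribution over utterances."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]
```

Dividing by a reference distance makes the visual features insensitive to how far the speaker is from the camera, which is one plausible reading of "normalized distance vectors". The time-delay neural network itself is omitted here; `softmax` only stands in for its final layer.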

Patent Owner(s)

Patent Owner         Address
RICOH COMPANY LTD    3-6 NAKAMAGOME 1-CHOME OHTA-KU TOKYO 143-8555
RICOH CORPORATION    2882 SAND HILL SUITE 115 MENLO PARK CA 94025

Inventor(s)

Inventor Name       Address              # of Filed Patents    Total Citations
Stork, David G      Stanford, CA         80                    3812
Wolff, Gregory J    Mountain View, CA    87                    6870
