US Patent No: 5,621,859

Number of patents in Portfolio can not be more than 2000

Single tree method for grammar directed, very large vocabulary speech recognizer

Stats

ATTORNEY / AGENT: (SPONSORED)
 

Importance

Loading Importance Indicators... loading....

Abstract

The invention provides a method of large vocabulary speech recognition that employs a single tree-structured phonetic hidden Markov model (HMM) at each frame of a time-synchronous process. A grammar probability is utilized upon recognition of each phoneme of a word, before recognition of the entire word is complete. Thus, grammar probabilities are exploited as early as possible during recognition of a word. At each frame of the recognition process, a grammar probability is determined for the transition from the most likely preceding grammar state to a set of words that share at least one common phoneme. The grammar probability is combined with accumulating phonetic evidence to provide a measure of the likelihood that a state in the HMM will lead to the word most likely to have been spoken. In a preferred embodiment, phonetic context information is exploited, even before the complete context of a phoneme is known. Instead of an exact triphone model, wherein the phonemes previous and subsequent to a phoneme are considered, a composite triphone model is used that exploits partial phonetic context information to provide a phonetic model that is more accurate than aphonetic model that ignores context. In another preferred embodiment, the single phonetic tree method is used as the forward pass of a forward/backward recognition process, wherein the backward pass employs a recognition process other than the single phonetic tree method.

Loading the Abstract Image... loading....

First Claim

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
GOOGLE INC.MOUNTAIN VIEW, CA6665
GTE SERVICES CORPORATIONCAMBRIDGE, MA24
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.)CAMBRIDGE, MA19

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Nguyen, Long Stoneham, MA 49 555
Schwartz, Richard M Sudbury, MA 17 719

Cited Art

Patent Info (Count) # Cites Year
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (2)
4,741,036 Determination of phone weights for markov models in a speech recognition system 34 1985
4,748,670 Apparatus and method for determining a likely word sequence from labels generated by an acoustic processor 37 1985
 
KABUSHIKI KAISHA TOSHIBA (1)
5,457,768 Speech recognition apparatus using syntactic and semantic analysis 65 1992
 
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (1)
5,349,645 Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches 37 1991
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
5,241,619 Word dependent N-best search method 73 1991
 
TEXAS INSTRUMENTS INCORPORATED (1)
4,984,178 Chart parser for stochastic unification grammar 64 1989
 
XEROX CORPORATION (1)
5,075,896 Character and phoneme recognition based on probability clustering 82 1989

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
MICROSOFT CORPORATION (20)
5,913,193 Method and system of runtime acoustic unit selection for speech synthesis 93 1996
5,937,384 Method and system for speech recognition using continuous density hidden Markov models 60 1996
6,904,402 System and iterative method for lexicon, segmentation and language model joint optimization 43 2000
7,451,075 Compressed speech lexicon and method and apparatus for creating and accessing the speech lexicon 1 2000
7,139,709 Middleware layer between speech related applications and engines 2 2000
6,957,184 Context free grammar engine for speech recognition system 7 2000
6,961,694 Method and apparatus for reducing latency in speech-based applications 16 2001
6,931,376 Speech-related event notification system 5 2001
7,529,671 Block synchronous decoding 0 2003
7,089,189 Speech-related event notification system 6 2004
7,162,425 Speech-related event notification system 1 2004
7,177,807 Middleware layer between speech related applications and engines 1 2004
7,177,813 Middleware layer between speech related applications and engines 0 2004
7,155,392 Context free grammar engine for speech recognition system 2 2005
7,206,742 Context free grammar engine for speech recognition system 2 2005
8,209,175 Uncertainty interval content sensing within communications 0 2006
7,379,874 Middleware layer between speech related applications and engines 12 2006
8,135,590 Position-dependent phonetic models for reliable pronunciation identification 1 2007
8,060,360 Word-dependent transition models in HMM based word alignment for statistical machine translation 1 2007
8,355,917 Position-dependent phonetic models for reliable pronunciation identification 0 2012
 
APPLE INC. (11)
6,154,722 Method and apparatus for a speech recognition system language model that integrates a finite state grammar probability and an N-gram probability 39 1997
6,064,960 Method and apparatus for improved duration modeling of phonemes 17 1997
6,374,217 Fast update implementation for efficient latent semantic language modeling 18 1999
6,366,884 Method and apparatus for improved duration modeling of phonemes 6 1999
6,477,488 Method for dynamic context scope selection in hybrid n-gram+LSA language modeling 4 2000
6,697,779 Combined dual spectral and temporal alignment method for user authentication by voice 9 2000
6,553,344 Method and apparatus for improved duration modeling of phonemes 3 2002
6,778,952 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 5 2002
6,785,652 Method and apparatus for improved duration modeling of phonemes 3 2002
7,191,118 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 1 2004
7,720,673 Method for dynamic context scope selection in hybrid N-GRAM+LSA language modeling 0 2007
 
SONY ELECTRONICS INC. (4)
6,006,186 Method and apparatus for a parameter sharing speech recognition system 19 1997
6,173,258 Method for reducing noise distortions in a speech recognition system 24 1998
6,768,979 Apparatus and method for noise attenuation in a speech recognition system 25 1999
7,139,708 System and method for speech recognition using an enhanced phone set 5 1999
 
U.S. PHILIPS CORPORATION (4)
5,873,061 Method for constructing a model of a new word for addition to a word model database of a speech recognition system 27 1996
5,995,930 Method and apparatus for recognizing spoken words in a speech signal by organizing the vocabulary in the form of a tree 10 1996
6,081,779 Language model adaptation for automatic speech recognition 12 1998
6,182,026 Method and device for translating a source text into a target using modeling and dynamic programming 19 1998
 
AT&T CORP. (3)
6,233,544 Method and apparatus for language translation 60 1996
6,574,597 Fully expanded context-dependent networks for speech recognition 74 2000
7,440,893 Automated dialog method with first and second thresholds for adapted dialog strategy 7 2005
 
BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY (3)
5,819,222 Task-constrained connected speech recognition of propagation of tokens only if valid propagation path is present 5 1995
5,848,388 Speech recognition with sequence parsing, rejection and pause detection options 31 1995
5,905,971 Automatic speech recognition 8 1996
 
CANON KABUSHIKI KAISHA (3)
5,812,975 State transition model design method and voice recognition method and apparatus using same 50 1996
6,226,610 DP Pattern matching which determines current path propagation using the amount of path overlap to the subsequent time point 26 1999
7,565,290 Speech recognition method and apparatus 0 2005
 
SONY ONLINE ENTERTAINMENT LLC (3)
8,050,924 System for generating and selecting names 0 2005
7,912,716 Generating words and names using N-grams of phonemes 0 2005
8,359,200 Generating profiles of words 0 2011
 
CASTELL SOFTWARE LIMITED LIABILITY COMPANY (2)
7,085,717 Scoring and re-scoring dynamic time warping of speech 4 2002
6,983,246 Dynamic time warping using frequency distributed distance measures 2 2002
 
GREAT NORTHERN RESEARCH, LLC (2)
8,165,886 Speech interface system and method for control and interaction with applications on a computing system 13 2008
8,219,407 Method for processing the output of a speech recognizer 0 2008
 
INTEL CORPORATION (2)
7,346,495 Method and system for building a domain specific statistical language model from rule based grammar specifications 6 2000
6,980,954 Search method based on single triphone tree for large vocabulary continuous speech recognizer 5 2000
 
LUCENT TECHNOLOGIES INC. (2)
5,832,430 Devices and methods for speech recognition of vocabulary words with simultaneous detection and verification 55 1995
5,870,706 Method and apparatus for an improved language recognition system 59 1996
 
NUANCE COMMUNICATIONS, INC. (2)
5,963,905 Method and apparatus for improving acoustic fast match speed using a cache for phone probabilities 3 1997
7,143,035 Methods and apparatus for generating dialog state conditioned language models 5 2002
 
ORACLE INTERNATIONAL CORPORATION (2)
8,396,859 Subject matter context search engine 0 2004
8,190,985 Frame-slot architecture for data conversion 0 2009
 
VERIZON CORPORATE SERVICES GROUP INC. (2)
7,092,888 Unsupervised training in natural language call routing 46 2002
7,478,043 Estimation of speech spectral parameters in the presence of noise 1 2003
 
ASAHI KASEI KOGYO KABUSHIKI KAISHA (1)
7,272,561 Speech recognition device and speech recognition method 4 2001
 
AT&T INTELLECTUAL PROPERTY II, L.P. (1)
7,487,088 Method and system for predicting understanding errors in a task classification system 24 2006
 
CISCO TECHNOLOGY, INC. (1)
6,230,128 Path link passing speech recognition with vocabulary node being capable of simultaneously processing plural path links 7 1995
 
DENSO CORPORATION (1)
7,818,171 Speech recognition apparatus and speech recognition program 0 2007
 
FONDAZIONE BRUNO KESSLER (1)
5,765,133 System for building a language model network for speech recognition 24 1996
 
GENERAL INSTRUMENT CORPORATION (1)
8,316,302 Method and apparatus for annotating video content with metadata generated using speech recognition technology 0 2007
 
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. (1)
5,819,220 Web triggered word set boosting for speech interfaces to the world wide web 169 1996
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (1)
7,398,274 Mention-synchronous entity tracking system and method for chaining mentions 0 2004
 
LONGSAND LIMITED (1)
5,983,180 Recognition of sequential data using finite state sequence models organized in a tree structure 33 1998
 
LOQUENDO S.P.A. (1)
7,827,031 Method for accelerating the execution of speech recognition neural networks and the related speech recognition device 0 2003
 
MASSACHUSETTS INSTITUTE OF TECHNOLOGY (1)
6,317,716 Automatic cueing of speech 23 1998
 
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (1)
7,013,273 Speech recognition based captioning system 14 2001
 
MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC. (1)
7,171,358 Compression of language model structures and word identifiers for automated speech recognition systems 0 2003
 
MOTOROLA MOBILITY LLC (1)
6,182,038 Context dependent phoneme networks for encoding speech information 46 1997
 
NIPPON TELEGRAPH AND TELEPHONE CORPORATION (1)
5,835,890 Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon 34 1997
 
NOKIA CORPORATION (1)
7,319,960 Speech recognition method and system 2 2001
 
NOVAURIS TECHNOLOGIES LTD (1)
7,403,941 System, method and technique for searching structured databases 2 2005
 
POPKIN FAMILY ASSETS, L.L.C. (1)
6,397,179 Search optimization system and method for continuous speech recognition 38 1998
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
6,052,682 Method of and apparatus for recognizing and labeling instances of name classes in textual environments 18 1997
 
RENESAS ELECTRONICS CORPORATION (1)
6,112,173 Pattern recognition device using tree structure data 3 1998
 
ROCKSTAR BIDCO, LP (1)
6,092,045 Method and apparatus for speech recognition 36 1998
 
SCANSOFT, INC. (1)
6,275,802 Search algorithm for large vocabulary speech recognition 5 1999
 
SONY CORPORATION (1)
5,787,395 Word and pattern recognition through overlapping hierarchical tree defined by relational features 9 1996
 
US PHILIPS ELECTRONICS (1)
7,006,971 Recognition of a speech utterance available in spelled form 5 2000
 
VICTOR COMPANY OF JAPAN, LTD. (1)
5,799,277 Acoustic model generating method for speech recognition 12 1995
 
VOLT DELTA RESOURCES LLC (1)
5,987,414 Method and apparatus for selecting a vocabulary sub-set from a speech recognition dictionary for use in real time automated directory assistance 28 1996