US Patent No: 5,839,106

Number of patents in Portfolio can not be more than 2000

Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model

1 Status Updates

Stats

ATTORNEY / AGENT: (SPONSORED)
 

Importance

Loading Importance Indicators... loading....

Abstract

Methods and apparatus for performing large-vocabulary speech recognition employing an integrated syntactic and semantic statistical language model. In an exemplary embodiment, a stochastic language model is developed using a hybrid paradigm in which latent semantic analysis is combined with, and subordinated to, a conventional n-gram paradigm. The hybrid paradigm provides an estimate of the likelihood that a particular word, chosen from an underlying vocabulary will occur given a prevailing contextual history. The estimate is computed as a conditional probability that a word will occur given an "integrated" history combining an n-word, syntactic-type history with a semantic-type history based on a much larger contextual framework. Thus, the exemplary embodiment seamlessly blends local language structures with global usage patterns to provide, in a single language model, the proficiency of a short-horizon, syntactic model with the large-span effectiveness of semantic analysis.

Loading the Abstract Image... loading....

First Claim

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
APPLE INC.CUPERTINO, CA7542

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Bellegarda, Jerome R Los Gatos, CA 65 729

Cited Art

Patent Info (Count) # Cites Year
 
APPLE INC. (1)
5,384,892 Dynamic language model for speech recognition 174 1992
 
CISCO TECHNOLOGY, INC. (1)
5,502,774 Automatic recognition of a consistent message using multiple complimentary sources of information 111 1994

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
AT&T CORP. (18)
6,044,337 Selection of superwords based on criteria relevant to both speech recognition and understanding 66 1997
6,021,384 Automatic generation of superwords 82 1997
6,317,707 Automatic clustering of tokens from a corpus for grammar acquisition 18 1998
6,415,248 Method for building linguistic models from a corpus 14 1999
7,085,720 Method for task classification using morphemes 15 2000
7,158,935 Method and system for predicting problematic situations in a automated dialog 20 2000
6,941,266 Method and system for predicting problematic dialog situations in a task classification system 44 2000
7,003,459 Method and system for predicting understanding errors in automated dialog systems 22 2001
6,751,591 Method and system for predicting understanding errors in a task classification system 48 2001
6,751,584 Automatic clustering of tokens from a corpus for grammar acquisition 3 2001
7,286,984 Method and system for automatically detecting morphemes in a task classification system using lattices 9 2002
7,149,687 Method of active learning for automatic speech recognition 17 2002
7,356,462 Automatic clustering of tokens from a corpus for grammar acquisition 0 2003
7,139,698 System and method for generating morphemes 3 2003
7,127,395 Method and system for predicting understanding errors in a task classification system 21 2004
7,472,060 Automated dialog system and method 22 2005
7,440,893 Automated dialog method with first and second thresholds for adapted dialog strategy 7 2005
7,440,897 Method and system for automatically detecting morphemes in a task classification system using lattices 5 2006
 
PHOENIX SOLUTIONS, INC. (16)
7,725,307 Query engine for processing voice based queries including semantic decoding 24 2003
7,555,431 Method for processing speech using dynamic grammars 25 2004
7,729,904 Partial speech processing device and method for use in distributed systems 18 2004
7,702,508 System and method for natural language processing of query answers 19 2004
7,657,424 System and method for processing sentence based queries 18 2004
7,624,007 System and method for natural language processing of sentence based queries 23 2004
7,831,426 Network based interactive speech recognition system 18 2006
7,647,225 Adjustable resource based speech recognition system 16 2006
8,352,277 Method of interacting through speech with a web-connected server 0 2007
7,725,320 Internet based speech recognition system with dynamic grammars 15 2007
7,698,131 Speech recognition system for client devices having differing computing capabilities 13 2007
7,912,702 Statistical language model trained with semantic variants 13 2007
7,873,519 Natural language speech lattice containing semantic variants 20 2007
7,672,841 Method for processing speech data for a distributed recognition system 15 2008
8,229,734 Semantic decoding of user queries 0 2008
7,725,321 Speech based query system using semantic decoding 16 2008
 
AT&T INTELLECTUAL PROPERTY II, L.P. (9)
8,392,188 Method and system for building a phonotactic model for domain independent speech recognition 0 2001
8,433,558 Methods and systems for natural language understanding using human knowledge and collected data 0 2005
7,529,667 Automated dialog system and method 6 2005
7,487,088 Method and system for predicting understanding errors in a task classification system 24 2006
7,957,970 Method and system for predicting problematic situations in automated dialog 1 2006
7,620,548 Method and system for automatic detecting morphemes in a task classification system using lattices 3 2007
7,966,174 Automatic clustering of tokens from a corpus for grammar acquisition 0 2008
8,010,361 Method and system for automatically detecting morphemes in a task classification system using lattices 3 2008
8,200,491 Method and system for automatically detecting morphemes in a task classification system using lattices 0 2011
 
NUANCE COMMUNICATIONS, INC. (8)
6,167,377 Speech recognition language models 42 1997
6,052,657 Text segmentation and identification of topic using language models 56 1997
6,996,519 Method and apparatus for performing relational speech recognition 5 2001
8,036,893 Method and system for identifying and correcting accent-induced speech recognition difficulties 3 2004
7,640,159 System and method of speech recognition for non-native speakers of a language 2 2004
7,308,404 Method and apparatus for speech recognition using a dynamic vocabulary 6 2004
7,533,020 Method and apparatus for performing relational speech recognition 2 2005
8,285,546 Method and system for identifying and correcting accent-induced speech recognition difficulties 0 2011
 
APPLE INC. (6)
6,374,217 Fast update implementation for efficient latent semantic language modeling 18 1999
6,477,488 Method for dynamic context scope selection in hybrid n-gram+LSA language modeling 4 2000
6,697,779 Combined dual spectral and temporal alignment method for user authentication by voice 9 2000
6,778,952 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 5 2002
7,191,118 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 1 2004
7,720,673 Method for dynamic context scope selection in hybrid N-GRAM+LSA language modeling 0 2007
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (5)
6,577,999 Method and apparatus for intelligently managing multiple pronunciations for a speech recognition vocabulary 8 1999
6,385,579 Methods and apparatus for forming compound words for use in a continuous speech recognition system 19 1999
6,529,902 Method and system for off-line detection of textual topical changes and topic identification via likelihood based methods for improved language modeling 31 1999
7,644,057 System and method for electronic communication management 4 2004
7,752,159 System and method for classifying text 4 2007
 
RAYTHEON BBN TECHNOLOGIES CORP. (5)
7,401,023 Systems and methods for providing automated directory assistance using transcripts 3 2000
7,447,636 System and methods for using transcripts to train an automated directory assistance service 3 2005
7,890,539 Semantic matching using predicate-argument structure 5 2007
8,131,536 Extraction-empowered machine translation 1 2007
8,260,817 Semantic matching using predicate-argument structure 0 2011
 
ECOLLEGE.COM (2)
6,871,043 Variable types of sensory interaction for an on-line educational system 8 2002
6,965,752 On-line educational system having an electronic notebook feature 5 2003
 
INTEL CORPORATION (2)
7,346,495 Method and system for building a domain specific statistical language model from rule based grammar specifications 6 2000
7,275,033 Method and system for using rule-based knowledge to build a class-based domain specific statistical language model 7 2000
 
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (2)
6,182,039 Method and apparatus using probabilistic language model based on confusable sets for speech recognition 56 1998
6,233,561 Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue 43 1999
 
MULTIMODAL TECHNOLOGIES, LLC (2)
7,584,103 Automated extraction of semantic content and generation of a structured document from speech 7 2004
8,321,199 Verification of extracted data 2010
 
OPTICAL RESEARCH PARTNERS LLC (2)
6,904,405 Message recognition using shared language model 15 2002
8,204,737 Message recognition using shared language model 1 2005
 
BELLSOUTH INTELLECTUAL PROPERTY CORPORATION (1)
6,751,595 Multi-stage large vocabulary speech recognition system and method 7 2001
 
INTELLIGENT AUTOMATION, INC. (1)
7,062,220 Automated, computer-based reading tutoring systems and methods 12 2001
 
KONINKLIJKE PHILIPS ELECTRONICS N.V. (1)
7,424,428 Automatic dialog system with database language model 4 2002
 
MICROSOFT CORPORATION (1)
7,844,449 Scalable probabilistic latent semantic analysis 0 2006
 
NIPPON TELEGRAPH AND TELEPHONE CORPORATION (1)
6,173,261 Grammar fragment acquisition using syntactic and semantic clustering 100 1998
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
6,609,087 Fact recognition system 15 1999
 
RAMP, INC. (1)
8,280,719 Methods and systems relating to information extraction 0 2006
 
SIEMENS AKTIENGESELLSCHAFT (1)
6,640,207 Method and configuration for forming classes for a language model based on linguistic classes 11 2001
 
SOPHIA SEARCH LIMITED (1)
7,747,593 Computer aided document retrieval 9 2004
 
UBS AG (1)
7,236,931 Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems 9 2003
 
OTHER [CHECK PATENT PROFILE FOR ASSIGNMENT INFORMATION] (1)
6,601,055 Explanation generation system for a diagnosis support tool employing an inference system 46 1999