US Patent No: 5,839,106

Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model

Abstract

Methods and apparatus for performing large-vocabulary speech recognition employing an integrated syntactic and semantic statistical language model. In an exemplary embodiment, a stochastic language model is developed using a hybrid paradigm in which latent semantic analysis is combined with, and subordinated to, a conventional n-gram paradigm. The hybrid paradigm provides an estimate of the likelihood that a particular word, chosen from an underlying vocabulary, will occur given a prevailing contextual history. The estimate is computed as a conditional probability that a word will occur given an 'integrated' history combining an n-word, syntactic-type history with a semantic-type history based on a much larger contextual framework. Thus, the exemplary embodiment seamlessly blends local language structures with global usage patterns to provide, in a single language model, the proficiency of a short-horizon, syntactic model with the large-span effectiveness of semantic analysis.
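The integrated estimate described in the abstract can be illustrated with a toy sketch. This is not the patent's claimed formula: it assumes, for illustration only, that the local n-gram probability of each candidate word is multiplied by a non-negative semantic score and the products are renormalized over the vocabulary. The function name `integrate`, the two-word vocabulary, and all numeric values are hypothetical.

```python
from typing import Dict


def integrate(ngram_prob: Dict[str, float],
              semantic_score: Dict[str, float]) -> Dict[str, float]:
    """Blend a local n-gram distribution with a global semantic score.

    ngram_prob[w]     : P(w | previous n-1 words), toy values (assumption)
    semantic_score[w] : non-negative fit of w to the wider context,
                        e.g. derived from a latent semantic space (assumption)
    """
    # Weight each word's short-horizon probability by its long-span fit.
    unnormalized = {w: ngram_prob[w] * semantic_score[w] for w in ngram_prob}
    total = sum(unnormalized.values())
    if total == 0.0:
        # No semantic evidence at all: fall back to the n-gram model alone.
        return dict(ngram_prob)
    # Renormalize so the result is again a distribution over the vocabulary.
    return {w: value / total for w, value in unnormalized.items()}


# The n-gram history alone cannot separate the two candidates...
ngram = {"stock": 0.5, "soup": 0.5}
# ...but a wider, finance-themed context favors "stock".
semantic = {"stock": 0.9, "soup": 0.1}

combined = integrate(ngram, semantic)
print(combined)
```

In this sketch the semantic score only reweights candidates that the n-gram model already proposes, which is one simple way to read "subordinated to" a conventional n-gram paradigm.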

Patent Owner(s)

Patent Owner Address Total Patents
APPLE INC. CUPERTINO, CA 10526

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Bellegarda, Jerome R Los Gatos, CA 78 1094

Cited Art Landscape

Patent Info (Count) # Cites Year
 
APPLE INC. (1)
5,384,892 Dynamic language model for speech recognition 244 1992
 
CISCO TECHNOLOGY, INC. (1)
5,502,774 Automatic recognition of a consistent message using multiple complimentary sources of information 116 1994

Forward Cite Landscape

Patent Info (Count) # Cites Year
 
APPLE INC. (36)
6,374,217 Fast update implementation for efficient latent semantic language modeling 21 1999
6,477,488 Method for dynamic context scope selection in hybrid n-gram+LSA language modeling 37 2000
6,697,779 Combined dual spectral and temporal alignment method for user authentication by voice 10 2000
6,778,952 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 35 2002
7,191,118 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 1 2004
8,677,377 Method and apparatus for building an intelligent automated assistant 0 2006
7,720,673 Method for dynamic context scope selection in hybrid N-GRAM+LSA language modeling 0 2007
8,645,137 Fast, language-independent method for user authentication by voice 0 2007
8,620,662 Context-aware unit selection 1 2007
8,768,702 Multi-tiered voice feedback in an electronic device 0 2008
8,712,776 Systems and methods for selective text to speech synthesis 0 2008
8,583,418 Systems and methods of detecting language and natural language strings for text to speech synthesis 1 2008
8,676,904 Electronic devices with voice command and contextual data processing capabilities 0 2008
8,614,431 Automated response to and sensing of user activity in portable devices 0 2009
8,682,649 Sentiment prediction from textual data 0 2009
8,600,743 Noise profile determination for voice-related feature 0 2010
8,682,667 User profiling for selecting user specific voice input processing information 0 2010
8,713,021 Unsupervised document clustering using latent semantic density analysis 0 2010
8,719,006 Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis 0 2010
8,719,014 Electronic device with text error correction based on voice recognition data 0 2010
8,781,836 Hearing assistance system for providing consistent human speech 0 2011
8,812,294 Translating phrases from one language into another using an order-based set of declarative rules 0 2011
8,706,472 Method for disambiguating multiple readings in language conversion 0 2011
8,762,156 Speech recognition repair using contextual information 0 2011
8,688,446 Providing text input using speech data and non-speech data 0 2011
8,775,442 Semantic search using a single-source semantic model 0 2012
8,762,469 Electronic devices with voice command and contextual data processing capabilities 0 2012
8,713,119 Electronic devices with voice command and contextual data processing capabilities 0 2012
8,670,985 Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts 0 2012
8,799,000 Disambiguation based on active input elicitation by intelligent automated assistant 0 2012
8,706,503 Intent deduction based on previous user interactions with voice assistant 0 2012
8,670,979 Active input elicitation by intelligent automated assistant 0 2012
8,660,849 Prioritizing selection criteria by automated assistant 0 2012
8,718,047 Text to speech conversion of text messages from mobile communication devices 0 2012
8,751,238 Systems and methods for determining the language to use for speech generated by a text to speech engine 0 2013
8,731,942 Maintaining context information between user interactions with a voice assistant 0 2013
 
NUANCE COMMUNICATIONS, INC. (25)
6,167,377 Speech recognition language models 64 1997
6,052,657 Text segmentation and identification of topic using language models 75 1997
6,996,519 Method and apparatus for performing relational speech recognition 6 2001
7,725,307 Query engine for processing voice based queries including semantic decoding 67 2003
7,555,431 Method for processing speech using dynamic grammars 72 2004
8,036,893 Method and system for identifying and correcting accent-induced speech recognition difficulties 7 2004
7,640,159 System and method of speech recognition for non-native speakers of a language 4 2004
7,308,404 Method and apparatus for speech recognition using a dynamic vocabulary 12 2004
7,729,904 Partial speech processing device and method for use in distributed systems 61 2004
7,702,508 System and method for natural language processing of query answers 62 2004
7,657,424 System and method for processing sentence based queries 63 2004
7,624,007 System and method for natural language processing of sentence based queries 65 2004
7,533,020 Method and apparatus for performing relational speech recognition 5 2005
7,831,426 Network based interactive speech recognition system 65 2006
7,647,225 Adjustable resource based speech recognition system 54 2006
8,352,277 Method of interacting through speech with a web-connected server 4 2007
7,725,320 Internet based speech recognition system with dynamic grammars 56 2007
7,698,131 Speech recognition system for client devices having differing computing capabilities 51 2007
8,762,152 Speech recognition system interactive agent 0 2007
7,912,702 Statistical language model trained with semantic variants 52 2007
7,873,519 Natural language speech lattice containing semantic variants 68 2007
7,672,841 Method for processing speech data for a distributed recognition system 52 2008
8,229,734 Semantic decoding of user queries 1 2008
7,725,321 Speech based query system using semantic decoding 54 2008
8,285,546 Method and system for identifying and correcting accent-induced speech recognition difficulties 0 2011
 
AT&T CORP. (18)
6,044,337 Selection of superwords based on criteria relevant to both speech recognition and understanding 73 1997
6,021,384 Automatic generation of superwords 98 1997
6,317,707 Automatic clustering of tokens from a corpus for grammar acquisition 50 1998
6,415,248 Method for building linguistic models from a corpus 15 1999
7,085,720 Method for task classification using morphemes 17 2000
7,158,935 Method and system for predicting problematic situations in a automated dialog 23 2000
6,941,266 Method and system for predicting problematic dialog situations in a task classification system 51 2000
7,003,459 Method and system for predicting understanding errors in automated dialog systems 25 2001
6,751,591 Method and system for predicting understanding errors in a task classification system 68 2001
6,751,584 Automatic clustering of tokens from a corpus for grammar acquisition 4 2001
7,286,984 Method and system for automatically detecting morphemes in a task classification system using lattices 10 2002
7,149,687 Method of active learning for automatic speech recognition 20 2002
7,356,462 Automatic clustering of tokens from a corpus for grammar acquisition 0 2003
7,139,698 System and method for generating morphemes 4 2003
7,127,395 Method and system for predicting understanding errors in a task classification system 22 2004
7,472,060 Automated dialog system and method 31 2005
7,440,893 Automated dialog method with first and second thresholds for adapted dialog strategy 7 2005
7,440,897 Method and system for automatically detecting morphemes in a task classification system using lattices 6 2006
 
AT&T INTELLECTUAL PROPERTY II, L.P. (11)
8,392,188 Method and system for building a phonotactic model for domain independent speech recognition 1 2001
8,433,558 Methods and systems for natural language understanding using human knowledge and collected data 0 2005
7,529,667 Automated dialog system and method 6 2005
7,487,088 Method and system for predicting understanding errors in a task classification system 33 2006
7,957,970 Method and system for predicting problematic situations in automated dialog 2 2006
7,620,548 Method and system for automatic detecting morphemes in a task classification system using lattices 4 2007
7,966,174 Automatic clustering of tokens from a corpus for grammar acquisition 0 2008
8,010,361 Method and system for automatically detecting morphemes in a task classification system using lattices 4 2008
8,200,491 Method and system for automatically detecting morphemes in a task classification system using lattices 0 2011
8,612,212 Method and system for automatically detecting morphemes in a task classification system using lattices 1 2013
8,798,990 Methods and systems for natural language understanding using human knowledge and collected data 0 2013
 
RAYTHEON BBN TECHNOLOGIES CORP. (6)
7,401,023 Systems and methods for providing automated directory assistance using transcripts 9 2000
7,447,636 System and methods for using transcripts to train an automated directory assistance service 6 2005
7,890,539 Semantic matching using predicate-argument structure 7 2007
8,131,536 Extraction-empowered machine translation 2 2007
8,595,222 Methods and systems for representing, using and displaying time-varying information on the semantic web 0 2008
8,260,817 Semantic matching using predicate-argument structure 1 2011
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (5)
6,577,999 Method and apparatus for intelligently managing multiple pronunciations for a speech recognition vocabulary 8 1999
6,385,579 Methods and apparatus for forming compound words for use in a continuous speech recognition system 23 1999
6,529,902 Method and system for off-line detection of textual topical changes and topic identification via likelihood based methods for improved language modeling 38 1999
7,644,057 System and method for electronic communication management 8 2004
7,752,159 System and method for classifying text 9 2007
 
MULTIMODAL TECHNOLOGIES, LLC (3)
7,584,103 Automated extraction of semantic content and generation of a structured document from speech 15 2004
8,560,314 Applying service levels to transcripts 0 2007
8,321,199 Verification of extracted data 2010
 
ECOLLEGE.COM (2)
6,871,043 Variable types of sensory interaction for an on-line educational system 11 2002
6,965,752 On-line educational system having an electronic notebook feature 8 2003
 
INTEL CORPORATION (2)
7,346,495 Method and system for building a domain specific statistical language model from rule based grammar specifications 6 2000
7,275,033 Method and system for using rule-based knowledge to build a class-based domain specific statistical language model 9 2000
 
NIPPON TELEGRAPH AND TELEPHONE CORPORATION (2)
6,173,261 Grammar fragment acquisition using syntactic and semantic clustering 141 1998
8,666,744 Grammar fragment acquisition using syntactic and semantic clustering 0 2000
 
OPTICAL RESEARCH PARTNERS LLC (2)
6,904,405 Message recognition using shared language model 22 2002
8,204,737 Message recognition using shared language model 2 2005
 
PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA (2)
6,182,039 Method and apparatus using probabilistic language model based on confusable sets for speech recognition 66 1998
6,233,561 Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue 58 1999
 
BELLSOUTH INTELLECTUAL PROPERTY CORPORATION (1)
6,751,595 Multi-stage large vocabulary speech recognition system and method 9 2001
 
INTELLIGENT AUTOMATION, INC. (1)
7,062,220 Automated, computer-based reading tutoring systems and methods 17 2001
 
KONINKLIJKE PHILIPS ELECTRONICS N.V. (1)
7,424,428 Automatic dialog system with database language model 5 2002
 
MICROSOFT CORPORATION (1)
7,844,449 Scalable probabilistic latent semantic analysis 0 2006
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
6,609,087 Fact recognition system 28 1999
 
Ramp, Inc. (1)
8,280,719 Methods and systems relating to information extraction 3 2006
 
RESOLVITY, INC. (1)
8,682,660 Method and system for post-processing speech recognition results 0 2009
 
SIEMENS AKTIENGESELLSCHAFT (1)
6,640,207 Method and configuration for forming classes for a language model based on linguistic classes 12 2001
 
SOPHIA SEARCH LIMITED (1)
7,747,593 Computer aided document retrieval 14 2004
 
UBS AG, STAMFORD BRANCH (1)
7,236,931 Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems 17 2003
 
Other [Check patent profile for assignment information] (1)
6,601,055 Explanation generation system for a diagnosis support tool employing an inference system 57 1999