US Patent No: 5,839,106

Number of patents in Portfolio can not be more than 2000

Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Methods and apparatus for performing large-vocabulary speech recognition employing an integrated syntactic and semantic statistical language model. In an exemplary embodiment, a stochastic language model is developed using a hybrid paradigm in which latent semantic analysis is combined with, and subordinated to, a conventional n-gram paradigm. The hybrid paradigm provides an estimate of the likelihood that a particular word, chosen from an underlying vocabulary will occur given a prevailing contextual history. The estimate is computed as a conditional probability that a word will occur given an 'integrated' history combining an n-word, syntactic-type history with a semantic-type history based on a much larger contextual framework. Thus, the exemplary embodiment seamlessly blends local language structures with global usage patterns to provide, in a single language model, the proficiency of a short-horizon, syntactic model with the large-span effectiveness of semantic analysis.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
APPLE INC.CUPERTINO, CA12700

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Bellegarda, Jerome R Los Gatos, CA 87 1310

Cited Art Landscape

Patent Info (Count) # Cites Year
 
APPLE INC. (1)
* 5,384,892 Dynamic language model for speech recognition 279 1992
 
CISCO TECHNOLOGY, INC. (1)
* 5,502,774 Automatic recognition of a consistent message using multiple complimentary sources of information 123 1994
* Cited By Examiner

Patent Citation Ranking

Forward Cite Landscape

Patent Info (Count) # Cites Year
 
Other [Check patent profile for assignment information] (1)
* 6,601,055 Explanation generation system for a diagnosis support tool employing an inference system 61 1999
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
* 6,609,087 Fact recognition system 37 1999
 
Ramp, Inc. (1)
8,280,719 Methods and systems relating to information extraction 3 2006
 
NUANCE COMMUNICATIONS, INC. (25)
* 6,167,377 Speech recognition language models 76 1997
* 6,052,657 Text segmentation and identification of topic using language models 83 1997
* 6,996,519 Method and apparatus for performing relational speech recognition 6 2001
7,725,307 Query engine for processing voice based queries including semantic decoding 90 2003
7,555,431 Method for processing speech using dynamic grammars 93 2004
* 8,036,893 Method and system for identifying and correcting accent-induced speech recognition difficulties 9 2004
* 7,640,159 System and method of speech recognition for non-native speakers of a language 6 2004
7,308,404 Method and apparatus for speech recognition using a dynamic vocabulary 21 2004
7,729,904 Partial speech processing device and method for use in distributed systems 81 2004
7,702,508 System and method for natural language processing of query answers 73 2004
7,657,424 System and method for processing sentence based queries 80 2004
7,624,007 System and method for natural language processing of sentence based queries 76 2004
7,533,020 Method and apparatus for performing relational speech recognition 5 2005
7,831,426 Network based interactive speech recognition system 82 2006
7,647,225 Adjustable resource based speech recognition system 75 2006
8,352,277 Method of interacting through speech with a web-connected server 7 2007
7,725,320 Internet based speech recognition system with dynamic grammars 76 2007
7,698,131 Speech recognition system for client devices having differing computing capabilities 66 2007
8,762,152 Speech recognition system interactive agent 0 2007
7,912,702 Statistical language model trained with semantic variants 62 2007
7,873,519 Natural language speech lattice containing semantic variants 91 2007
7,672,841 Method for processing speech data for a distributed recognition system 64 2008
8,229,734 Semantic decoding of user queries 1 2008
7,725,321 Speech based query system using semantic decoding 65 2008
8,285,546 Method and system for identifying and correcting accent-induced speech recognition difficulties 4 2011
 
OPTICAL RESEARCH PARTNERS LLC (3)
6,904,405 Message recognition using shared language model 27 2002
* 8,204,737 Message recognition using shared language model 3 2005
* 2005/0171,783 Message recognition using shared language model 11 2005
 
INTELLIGENT AUTOMATION, INC. (1)
7,062,220 Automated, computer-based reading tutoring systems and methods 17 2001
 
MULTIMODAL TECHNOLOGIES, LLC (3)
7,584,103 Automated extraction of semantic content and generation of a structured document from speech 17 2004
8,560,314 Applying service levels to transcripts 0 2007
8,321,199 Verification of extracted data 2010
 
APPLE INC. (45)
6,374,217 Fast update implementation for efficient latent semantic language modeling 27 1999
6,477,488 Method for dynamic context scope selection in hybrid n-gram+LSA language modeling 47 2000
6,697,779 Combined dual spectral and temporal alignment method for user authentication by voice 10 2000
* 6,778,952 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 45 2002
* 7,191,118 Method for dynamic context scope selection in hybrid N-gram+LSA language modeling 5 2004
8,677,377 Method and apparatus for building an intelligent automated assistant 1 2006
* 7,720,673 Method for dynamic context scope selection in hybrid N-GRAM+LSA language modeling 0 2007
8,977,255 Method and system for operating a multi-function portable electronic device using voice-activation 0 2007
8,645,137 Fast, language-independent method for user authentication by voice 0 2007
8,620,662 Context-aware unit selection 2 2007
8,996,376 Intelligent text-to-speech conversion 0 2008
8,768,702 Multi-tiered voice feedback in an electronic device 0 2008
8,898,568 Audio user interface 0 2008
8,712,776 Systems and methods for selective text to speech synthesis 0 2008
8,583,418 Systems and methods of detecting language and natural language strings for text to speech synthesis 2 2008
8,676,904 Electronic devices with voice command and contextual data processing capabilities 1 2008
8,862,252 Audio user interface for displayless electronic device 0 2009
8,614,431 Automated response to and sensing of user activity in portable devices 1 2009
8,682,649 Sentiment prediction from textual data 4 2009
8,600,743 Noise profile determination for voice-related feature 0 2010
8,682,667 User profiling for selecting user specific voice input processing information 1 2010
8,713,021 Unsupervised document clustering using latent semantic density analysis 0 2010
8,719,006 Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis 0 2010
8,719,014 Electronic device with text error correction based on voice recognition data 0 2010
8,781,836 Hearing assistance system for providing consistent human speech 0 2011
8,812,294 Translating phrases from one language into another using an order-based set of declarative rules 0 2011
8,706,472 Method for disambiguating multiple readings in language conversion 1 2011
8,762,156 Speech recognition repair using contextual information 2 2011
8,688,446 Providing text input using speech data and non-speech data 0 2011
8,775,442 Semantic search using a single-source semantic model 0 2012
8,762,469 Electronic devices with voice command and contextual data processing capabilities 0 2012
8,713,119 Electronic devices with voice command and contextual data processing capabilities 0 2012
8,670,985 Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts 0 2012
8,935,167 Exemplar-based latent perceptual modeling for automatic speech recognition 0 2012
8,942,986 Determining user intent based on ontologies of domains 0 2012
8,903,716 Personalized vocabulary for digital assistant 0 2012
8,892,446 Service orchestration for intelligent automated assistant 0 2012
8,799,000 Disambiguation based on active input elicitation by intelligent automated assistant 1 2012
8,706,503 Intent deduction based on previous user interactions with voice assistant 1 2012
8,670,979 Active input elicitation by intelligent automated assistant 1 2012
8,660,849 Prioritizing selection criteria by automated assistant 2 2012
8,718,047 Text to speech conversion of text messages from mobile communication devices 0 2012
8,751,238 Systems and methods for determining the language to use for speech generated by a text to speech engine 0 2013
8,930,191 Paraphrasing of user requests and results by automated digital assistant 0 2013
8,731,942 Maintaining context information between user interactions with a voice assistant 1 2013
 
INTERACTIONS LLC (2)
* 7,149,687 Method of active learning for automatic speech recognition 22 2002
8,990,084 Method of active learning for automatic speech recognition 0 2014
 
RESOLVITY, INC. (1)
* 8,682,660 Method and system for post-processing speech recognition results 0 2009
 
MICROSOFT TECHNOLOGY LICENSING, LLC (1)
* 7,844,449 Scalable probabilistic latent semantic analysis 1 2006
 
THE TRUSTEES OF THE STEVENS INSTITUTE OF TECHNOLOGY (1)
* 2012/0254,333 AUTOMATED DETECTION OF DECEPTION IN SHORT AND MULTILINGUAL ELECTRONIC MESSAGES 16 2012
 
RAYTHEON BBN TECHNOLOGIES CORP. (6)
7,401,023 Systems and methods for providing automated directory assistance using transcripts 11 2000
7,447,636 System and methods for using transcripts to train an automated directory assistance service 10 2005
7,890,539 Semantic matching using predicate-argument structure 7 2007
8,131,536 Extraction-empowered machine translation 2 2007
8,595,222 Methods and systems for representing, using and displaying time-varying information on the semantic web 0 2008
8,260,817 Semantic matching using predicate-argument structure 1 2011
 
UBS AG, STAMFORD BRANCH (1)
* 7,236,931 Systems and methods for automatic acoustic speaker adaptation in computer-assisted transcription systems 27 2003
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (5)
* 6,577,999 Method and apparatus for intelligently managing multiple pronunciations for a speech recognition vocabulary 8 1999
* 6,385,579 Methods and apparatus for forming compound words for use in a continuous speech recognition system 24 1999
* 6,529,902 Method and system for off-line detection of textual topical changes and topic identification via likelihood based methods for improved language modeling 41 1999
* 7,644,057 System and method for electronic communication management 8 2004
* 7,752,159 System and method for classifying text 9 2007
 
KONINKLIJKE PHILIPS ELECTRONICS N.V. (1)
* 7,424,428 Automatic dialog system with database language model 6 2002
 
SOPHIA SEARCH LIMITED (1)
* 7,747,593 Computer aided document retrieval 17 2004
 
ECOLLEGE.COM (2)
6,871,043 Variable types of sensory interaction for an on-line educational system 11 2002
6,965,752 On-line educational system having an electronic notebook feature 8 2003
 
BELLSOUTH INTELLECTUAL PROPERTY CORPORATION (1)
6,751,595 Multi-stage large vocabulary speech recognition system and method 13 2001
 
SIEMENS AKTIENGESELLSCHAFT (1)
* 6,640,207 Method and configuration for forming classes for a language model based on linguistic classes 12 2001
 
MMODAL IP LLC (1)
8,959,102 Structured searching of dynamic structured document corpuses 0 2011
 
NEWVALUEXCHANGE GLOBAL AI LLP (1)
8,977,584 Apparatuses, methods and systems for a digital conversation management platform 0 2011
 
AT&T INTELLECTUAL PROPERTY II, L.P. (10)
8,392,188 Method and system for building a phonotactic model for domain independent speech recognition 1 2001
8,433,558 Methods and systems for natural language understanding using human knowledge and collected data 1 2005
7,957,970 Method and system for predicting problematic situations in automated dialog 3 2006
7,620,548 Method and system for automatic detecting morphemes in a task classification system using lattices 5 2007
7,966,174 Automatic clustering of tokens from a corpus for grammar acquisition 0 2008
8,010,361 Method and system for automatically detecting morphemes in a task classification system using lattices 5 2008
8,200,491 Method and system for automatically detecting morphemes in a task classification system using lattices 0 2011
8,612,212 Method and system for automatically detecting morphemes in a task classification system using lattices 1 2013
8,798,990 Methods and systems for natural language understanding using human knowledge and collected data 0 2013
8,909,529 Method and system for automatically detecting morphemes in a task classification system using lattices 0 2013
 
INTEL CORPORATION (2)
* 7,346,495 Method and system for building a domain specific statistical language model from rule based grammar specifications 6 2000
* 7,275,033 Method and system for using rule-based knowledge to build a class-based domain specific statistical language model 10 2000
 
NIPPON TELEGRAPH AND TELEPHONE CORPORATION (2)
* 6,173,261 Grammar fragment acquisition using syntactic and semantic clustering 160 1998
* 8,666,744 Grammar fragment acquisition using syntactic and semantic clustering 0 2000
 
AT&T ALEX HOLDINGS, LLC (8)
6,941,266 Method and system for predicting problematic dialog situations in a task classification system 56 2000
7,003,459 Method and system for predicting understanding errors in automated dialog systems 26 2001
6,751,591 Method and system for predicting understanding errors in a task classification system 74 2001
7,127,395 Method and system for predicting understanding errors in a task classification system 25 2004
7,529,667 Automated dialog system and method 6 2005
7,472,060 Automated dialog system and method 37 2005
7,440,893 Automated dialog method with first and second thresholds for adapted dialog strategy 7 2005
7,487,088 Method and system for predicting understanding errors in a task classification system 39 2006
 
PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA (2)
* 6,182,039 Method and apparatus using probabilistic language model based on confusable sets for speech recognition 69 1998
* 6,233,561 Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue 66 1999
 
AT&T CORP. (11)
* 6,044,337 Selection of superwords based on criteria relevant to both speech recognition and understanding 75 1997
* 6,021,384 Automatic generation of superwords 106 1997
* 6,317,707 Automatic clustering of tokens from a corpus for grammar acquisition 62 1998
* 6,415,248 Method for building linguistic models from a corpus 16 1999
7,085,720 Method for task classification using morphemes 19 2000
7,158,935 Method and system for predicting problematic situations in a automated dialog 25 2000
* 6,751,584 Automatic clustering of tokens from a corpus for grammar acquisition 7 2001
7,286,984 Method and system for automatically detecting morphemes in a task classification system using lattices 12 2002
7,356,462 Automatic clustering of tokens from a corpus for grammar acquisition 0 2003
7,139,698 System and method for generating morphemes 4 2003
7,440,897 Method and system for automatically detecting morphemes in a task classification system using lattices 7 2006
* Cited By Examiner