Building scalable N-gram language models using maximum likelihood maximum entropy N-gram models

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5467425
SERIAL NO

08023543

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

The present invention is an n-gram language modeler which significantly reduces the memory storage requirement and convergence time for language modelling systems and methods. The present invention aligns each n-gram with one of 'n' number of non-intersecting classes. A count is determined for each n-gram representing the number of times each n-gram occurred in the training data. The n-grams are separated into classes and complement counts are determined. Using these counts and complement counts factors are determined, one factor for each class, using an iterative scaling algorithm. The language model probability, i.e., the probability that a word occurs given the occurrence of the previous two words, is determined using these factors.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NUANCE COMMUNICATIONS INC1 WAYSIDE ROAD BURLINGTON MA 01803

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Lau, Raymond Cambridge, MA 97 2291
Rosenfeld, Ronald Pittsburgh, PA 2 300
Roukos, Salim Scarsdale, NY 16 1258

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation