Word disambiguation apparatus and methods

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5541836
SERIAL NO

07814850

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Apparatus and methods for determining whether a word/sense pair is proper for a context. Wide contexts (100 words) are employed for both training and testing, and testing is done by adding the weights of vocabulary words from the context. The weights are determined by Bayesian techniques which interpolate between the probability of occurrence of a vocabulary word in a conditional sample of the training text and the probability of its occurrence in the entire training text. A further improvement in testing takes advantage of the fact that a word is generally used in only a single sense in a single discourse. Also disclosed are automated training techniques including training on bilingual bodies of text and training using categories from Roget's Thesaurus.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
THE CHASE MANHATTAN BANK AS COLLATERAL AGENTP O BOX 2558 HOUSTON TX 77252

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Church, Kenneth W Chatham, NJ 31 1252
Gale, William A Maplewood, NJ 1 184
Yarowsky, David E Summit, NJ 3 213

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation