Contextual tagger utilizing deterministic finite state transducer

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5610812
SERIAL NO

08264981

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A system for assigning part-of-speech tags to English text includes an improved contextual tagger which utilizes a deterministic finite state transducer to improve tagging speed such that large documents can have its sentences accurately tagged as to parts of speech to permit fast grammar checking, spell checking, information retrieval, text indexing and optical character recognition. The subject system performs by first acquiring a set of rules by examining a training corpus of tagged text. Then, these rules are transformed into a deterministic finite-state transducer through the utilization of non-deterministic transducers, a composer and a determiniser. In order to tag an input sentence, the sentence is initially tagged by first assigning each word in the sentence with its most likely part of speech tag regardless of the surrounding words in the sentences. The deterministic finite-state transducer is then applied on the resulting sequence of part of speech tags using the surrounding words and obtains the final part of speech tags. The Subject System requires an amount of time to compute the part-of-speech tags which is proportional to the number of words in the input sentence and which is independent of the number of rules it has applied.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

  • BINARY SERVICES LIMITED LIABILITY COMPANY

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Roche, Emmanuel Boston, MA 44 2770
Schabes, Yves Boston, MA 35 2769

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation