Learning automatic data extraction system

United States of America Patent

PATENT NO 6662190
APP PUB NO 20020138491A1
SERIAL NO 09812425

Abstract

An improvement to an automatic data extractor gives it the capability to discover new values that are not recognized by its vocabulary and to add them both to the record being formed and to the vocabulary, thus accumulating new vocabulary through use. The extractor gleans new values by deducing them from the structure of the text data and learns them by adding them to its vocabulary. The data extractor determines the structure of the data in much the same way as prior-art data extractors, but a discovery process is then used to identify a series of field lists, preferably using at least one field parser and a field grader. The results of the grader are returned to an attribute mapper, which identifies the position in the field list for each of the attributes. The content of each field, if not already added to the record and associated with the correct attribute by the recognizer, can then be associated with an attribute by its position in the field list and written to the record as the value for that attribute. Furthermore, a learner adds that field to the vocabulary list if it is not already present in the vocabulary.
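
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the delimiter-based field parsers, the scoring rule in the grader, the majority vote in the attribute mapper, and every name used (DELIMITERS, parse, recognize, grade, map_attributes, extract_and_learn) are assumptions made for illustration only.

```python
from collections import Counter

# Hypothetical field parsers: each candidate delimiter acts as one parser.
DELIMITERS = [",", "|", ";"]


def parse(line, delimiter):
    """Field parser: split one line of text into a field list."""
    return [field.strip() for field in line.split(delimiter)]


def recognize(value, vocabulary):
    """Recognizer: return the attribute whose vocabulary holds the value."""
    for attribute, known_values in vocabulary.items():
        if value.lower() in known_values:
            return attribute
    return None


def grade(field_lists, vocabulary):
    """Field grader (assumed rule): reject parses whose field lists differ
    in length, otherwise count the fields the recognizer already knows."""
    if len({len(fields) for fields in field_lists}) != 1:
        return -1
    return sum(
        recognize(value, vocabulary) is not None
        for fields in field_lists
        for value in fields
    )


def map_attributes(field_lists, vocabulary):
    """Attribute mapper (assumed rule): for each position, pick the attribute
    recognized most often at that position across all field lists."""
    mapping = {}
    for position in range(len(field_lists[0])):
        votes = Counter(
            recognize(fields[position], vocabulary) for fields in field_lists
        )
        votes.pop(None, None)  # ignore unrecognized values
        if votes:
            mapping[position] = votes.most_common(1)[0][0]
    return mapping


def extract_and_learn(lines, vocabulary):
    """Build records and add newly discovered values to the vocabulary."""
    if not lines:
        return []
    candidates = [[parse(line, d) for line in lines] for d in DELIMITERS]
    best = max(candidates, key=lambda field_lists: grade(field_lists, vocabulary))
    mapping = map_attributes(best, vocabulary)
    records = []
    for fields in best:
        record = {}
        for position, attribute in mapping.items():
            value = fields[position]
            record[attribute] = value  # value assigned by position
            vocabulary[attribute].add(value.lower())  # learner step
        records.append(record)
    return records
```

Under these assumptions, calling extract_and_learn on the lines "red | small", "teal | large" and "blue | medium" with a vocabulary that knows only the colors "red" and "blue" and the sizes "small" and "large" assigns "teal" to color and "medium" to size by position, and adds both values to the vocabulary for later runs.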

Patent Owner(s)

  • TAMIRAS PER PTE. LTD., LLC

Inventor(s)

Inventor Name      Address             # of Filed Patents   Total Citations
Bax, Eric T        Pasadena, CA        9                    153
Pellico, Julian    Agoura Hills, CA    1                    8
