SYSTEM AND METHOD FOR ENTITY EXTRACTION FROM SEMI-STRUCTURED TEXT DOCUMENTS

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20170300565A1
SERIAL NO

15098856

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method for extracting entities from a text document includes, for at least a section of a text document, providing a first set of entities extracted from the at least a section, clustering at least a subset of the extracted entities in the first set into clusters, based on locations of the entities in the document. Complete ones of the clusters of entities are identified. Patterns for extracting new entities are learned based on the complete clusters. New entities are extracted from incomplete clusters based on the learned patterns.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
XEROX CORPORATIONSTAMFORD, CT13750

International Classification(s)

Inventor(s)

  • No Inventor to display

Cited Art Landscape

  • No Cited Art to Display

Patent Citation Ranking

Forward Cite Landscape

  • No Forward Cites to Display