SYSTEM AND METHOD FOR ENTITY EXTRACTION FROM SEMI-STRUCTURED TEXT DOCUMENTS

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20170300565A1
SERIAL NO

15098856

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method for extracting entities from a text document includes, for at least a section of a text document, providing a first set of entities extracted from the at least a section, clustering at least a subset of the extracted entities in the first set into clusters, based on locations of the entities in the document. Complete ones of the clusters of entities are identified. Patterns for extracting new entities are learned based on the complete clusters. New entities are extracted from incomplete clusters based on the learned patterns.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
XEROX CORPORATIONSTAMFORD, CT13415

International Classification(s)

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Calapodescu, Ioan Grenoble, FR 5 9
Guerin, Nicolas Notre-Dame-de Mesage, FR 18 390
Jacques, Fanchon Meylan, FR 1 0

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation