Method for extracting, interpreting and standardizing tabular data from unstructured documents

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 7590647
APP PUB NO 20060288268A1
SERIAL NO

11140340

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A system, method, and computer program for automatically identifying, parsing, and interpreting tabular data from unstructured documents stored in various formats such as ASCII text, Unicode text, HTML, PDF text, and PDF image format is provided. A set of table identification, parsing/tokenizing, and interpreting/mapping rules are developed with grammar descriptors. These rules are then applied to a set of documents to identify a table, parse the content of the table, and interpret the parsed content, if required, thereby standardizing the tabular data.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
GENPACT USA INC521 FIFTH AVENUE 14TH FLOOR NEW YORK NY 10175

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Alam, Rummana Walpole , US 4 295
Bharadwaj, Srinivasan Sharon , US 4 295
Kothiwale, Mahantesh Mansfield , US 10 296
Srinivasan, Venkatesan Weston , US 8 411

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation