Cell identification in table analysis

United States of America Patent

PATENT NO 6006240
SERIAL NO 08828847

Abstract

The present invention handles fully-lined, semi-lined and line-less tables by identifying the cells and cell separators during page recomposition, as part of optical character recognition. It does so iteratively: word boxes are merged into cells, separators are found, and cells bounded by the same separators are merged, and these steps are repeated until the correct cell structure is found. With this method, rows are estimated, close words are merged into cells, columns are then estimated, cells within columns are merged, columns are re-estimated, cells in the same row and column are merged into bigger cells, and then rows and cells are merged according to the detection of various table styles. The method handles large, complex tables with multiple lines of symbols per cell, whether the table is lined, semi-lined or line-less.
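As a rough illustration of two of the steps named in the abstract (merging close words into cells, then estimating columns), here is a minimal Python sketch. It is not the patented procedure: the `Box` type, the `max_gap` threshold, the vertical-overlap test for "same row", and the gap-based column estimate are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box of a word or cell (hypothetical type)."""
    left: float
    top: float
    right: float
    bottom: float


def union(a: Box, b: Box) -> Box:
    """Smallest box that contains both a and b."""
    return Box(min(a.left, b.left), min(a.top, b.top),
               max(a.right, b.right), max(a.bottom, b.bottom))


def same_row(a: Box, b: Box) -> bool:
    """Assumed row test: the two boxes overlap vertically."""
    return min(a.bottom, b.bottom) - max(a.top, b.top) > 0


def horizontal_gap(a: Box, b: Box) -> float:
    """Horizontal distance between boxes (negative if they overlap)."""
    return max(a.left, b.left) - min(a.right, b.right)


def merge_close_words(boxes: List[Box], max_gap: float) -> List[Box]:
    """Repeatedly merge boxes in the same row that are at most max_gap apart,
    until a full pass finds nothing left to merge."""
    cells = list(boxes)
    changed = True
    while changed:
        changed = False
        for i in range(len(cells)):
            for j in range(i + 1, len(cells)):
                a, b = cells[i], cells[j]
                if same_row(a, b) and horizontal_gap(a, b) <= max_gap:
                    rest = [c for k, c in enumerate(cells) if k not in (i, j)]
                    cells = rest + [union(a, b)]
                    changed = True
                    break
            if changed:
                break
    return sorted(cells, key=lambda c: (c.top, c.left))


def column_separators(cells: List[Box]) -> List[float]:
    """Assumed column estimate: midpoints of horizontal gaps that no cell spans."""
    if not cells:
        return []
    intervals = sorted((c.left, c.right) for c in cells)
    separators = []
    reach = intervals[0][1]  # rightmost edge covered so far
    for left, right in intervals[1:]:
        if left > reach:
            separators.append((reach + left) / 2)
        reach = max(reach, right)
    return separators


# Two rows of words; the first two words of row one are close together.
words = [Box(0, 0, 10, 5), Box(12, 0, 20, 5), Box(50, 0, 60, 5),
         Box(0, 10, 8, 15), Box(52, 10, 58, 15)]
cells = merge_close_words(words, max_gap=3)
separators = column_separators(cells)
```

In a fuller treatment, the cells would be re-merged within the estimated columns and the columns re-estimated, repeating until the structure stops changing, as the abstract describes.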

Patent Owner(s)

Patent Owner        Address
XEROX CORPORATION   201 MERRITT 7 NORWALK CT 06851-1056

Inventor(s)

Inventor Name      Address        # of filed Patents   Total Citations
Handley, John C    Fairport, NY   80                   1457
