Method and apparatus for detecting pagination constructs including a header and a footer in legacy documents

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 7937653
APP PUB NO 20060156226A1
SERIAL NO

11032817

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method for identifying header/footer content of a document, in order to sequence text fragments comprising recognizable text blocks as derived from the document. The textual variability of lines comprised of text blocks, including the different kinds of text blocks within the line is analyzed for assessment of textual variability. Header/footer zones are defined by textual content having a low textual variability. An alternative embodiment identifies pagination constructs by comparing selected text-boxes for similarity and proximity and clustering the text boxes satisfying a predetermined similarity value, wherein the clustered text boxes are deemed to comprise pagination constructs.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
XEROX CORPORATION45 GLOVER AVENUE P O BOX 4505 NORWALK CT 06856-4505

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Déjean, Hervé Grenoble, FR 24 277
Meunier, Jean-Luc St. Nazaire les Eymes, FR 62 1831

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation