Techniques for inducing high quality structural templates for electronic documents

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 8046681
APP PUB NO 20080072140A1
SERIAL NO

11945749

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Techniques are disclosed herein to automatically learn a template that describes a common structure present in documents in a training set. The structure of the template is compared to the structure of the documents (or at least a part of each document) in the training set, one-by-one, and generalized in response to differences between the template and the document to which the template is currently being compared. If the structure of any particular document is considered too dissimilar from the structure of the template, then the template is not modified. Various generalization operators are added to the template to generalize the template. One such generalization operator is an “OR”, which indicates that only one of “n” sub-trees below the “OR” operator in the template is allowed at the corresponding position in a document.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
R2 SOLUTIONS LLC6136 FRISCO SQUARE BLVD SUITE 400 FRISCO TX 75034

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Madaan, Amit Uttar Pradesh, IN 9 107
Mehta, Rupesh R Maharashtra, IN 8 205
Vydiswaran, V G Vinod Maharashtra, IN 3 32

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation