Method for automatic wrapper repair

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 7035841
SERIAL NO

10277662

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method for repairing a wrapper associated with an information source, includes defining a classifier, based on content features of extracted and labeled information using the wrapper, using the classifier to extract content information from the file according to a set of classifier extraction rules; analyzing the extracted content information according to the content features and assigning a label to any extracted content information which satisfies the label's rules; and defining a repaired wrapper as the classifier and those labels in the set which have been assigned to extracted content information. Additional content information and labels can be extracted by iteratively creating a classifier based on both content features and structure features of extracted strings.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
XEROX CORPORATION201 MERRITT 7 P O BOX 4505 NORWALK CT 06851-1056

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Chidlovskii, Boris Meylan, FR 73 3408

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation