Detecting duplicate and near-duplicate files

United States of America Patent

Patent No.: 6658423
Serial No.: 09768947

Abstract

Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints matches.
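The three steps in the abstract can be sketched in Python. This is only an illustrative reading of the abstract, not the claimed implementation. The helper names (`extract_parts`, `fingerprints`, `near_duplicates`), the choice of whitespace-separated words as "parts", the use of SHA-1 for both list assignment and fingerprinting, the default of four lists, and the pairing of each fingerprint with its list index are all assumptions made for the example.

```python
import hashlib

NUM_LISTS = 4  # assumption: the abstract only says "a predetermined number"


def _stable_hash(text: str) -> int:
    """Deterministic 64-bit hash (assumption: SHA-1; any stable hash would do)."""
    return int.from_bytes(hashlib.sha1(text.encode("utf-8")).digest()[:8], "big")


def extract_parts(document: str) -> list[str]:
    """Step (i): extract parts. Here a part is a lowercased word (assumption)."""
    return document.lower().split()


def fingerprints(document: str, num_lists: int = NUM_LISTS) -> set[tuple[int, int]]:
    """Steps (ii) and (iii): distribute parts over lists, then hash each populated list."""
    lists: list[list[str]] = [[] for _ in range(num_lists)]
    for part in extract_parts(document):
        # Step (ii): each part goes to exactly one list, chosen by its hash.
        lists[_stable_hash(part) % num_lists].append(part)

    result = set()
    for index, parts in enumerate(lists):
        if parts:  # Step (iii): only populated lists produce a fingerprint.
            result.add((index, _stable_hash("\x00".join(parts))))
    return result


def near_duplicates(a: str, b: str, num_lists: int = NUM_LISTS) -> bool:
    """Two documents are near-duplicates if they share at least one fingerprint."""
    return not fingerprints(a, num_lists).isdisjoint(fingerprints(b, num_lists))
```

Under these assumptions, a small edit to a document changes only the lists that the edited words hash into, so the fingerprints of the untouched lists still match. With a single list, the scheme reduces to exact-duplicate detection over the extracted parts.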

Patent Owner(s)

Patent Owner   Address
GOOGLE LLC     1600 AMPHITHEATRE PARKWAY MOUNTAIN VIEW CA 94043

Inventor(s)

Inventor Name         Address          # of Filed Patents   Total Citations
Henzinger, Monika H   Menlo Park, CA   47                   2920
Pugh, William         Kensington, MD   27                   1324
