Detecting duplicate and near-duplicate files

United States of America Patent

PATENT NO 6658423
SERIAL NO 09768947

Abstract

Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints matches.
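
The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; several choices here are assumptions made only for illustration: the "parts" are lowercase words, each word goes to exactly one list chosen by hashing the word, SHA-1 serves as the fingerprint function, and fingerprints are compared list by list (position i against position i). The names `fingerprints` and `near_duplicates` are hypothetical.

```python
import hashlib
import re


def _hash64(text: str) -> int:
    """Stable 64-bit hash taken from a SHA-1 digest (an assumed choice)."""
    digest = hashlib.sha1(text.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def fingerprints(document: str, num_lists: int = 4) -> list:
    """Return one fingerprint per list; None marks an unpopulated list."""
    # (i) Extract parts. Assumption: parts are lowercase words.
    parts = re.findall(r"\w+", document.lower())

    # (ii) Assign each part to one of a fixed number of lists.
    # Assumption: the list is chosen by hashing the part itself.
    lists = [[] for _ in range(num_lists)]
    for part in parts:
        lists[_hash64(part) % num_lists].append(part)

    # (iii) Generate a fingerprint from each populated list.
    return [_hash64(" ".join(lst)) if lst else None for lst in lists]


def near_duplicates(doc_a: str, doc_b: str, num_lists: int = 4) -> bool:
    """True if any fingerprint of doc_a matches the corresponding one of doc_b."""
    fps_a = fingerprints(doc_a, num_lists)
    fps_b = fingerprints(doc_b, num_lists)
    # Assumption: compare position by position; empty lists never match.
    return any(a is not None and a == b for a, b in zip(fps_a, fps_b))
```

Under these assumptions, replacing a single word changes at most two lists (the one the old word belonged to and the one the new word lands in), so the remaining fingerprints are unchanged. That is why a small edit can still leave at least one matching fingerprint, provided some untouched list is populated.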

Patent Owner(s)

  • GOOGLE LLC

Inventor(s)

Inventor Name          Address           # of Filed Patents   Total Citations
Henzinger, Monika H    Menlo Park, CA    47                   2857
Pugh, William          Kensington, MD    27                   1281
