Method and apparatus for detecting and summarizing document similarity within large document sets

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 6240409
SERIAL NO

09127105

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method and apparatus are disclosed for comparing an input or query file to a set of files to detect similarities and formatting the output comparison data are described. An input query file that can be segmented into multiple query file substrings is received. A query file substring is selected and used to search a storage area containing multiple ordered file substrings that were taken from previously analyzed files. If the selected query file substring matches any of the multiple ordered file substrings, match data relating to the match between the selected query file substring and the matching ordered file substring is stored in a temporary file. The matching ordered file substring and another ordered file substring are joined if the matching ordered file substring and the second ordered file substring are in a particular sequence and if the selected query file substring and a second query file substring are in the same particular sequence. If the matching ordered file substring and the second query file substring match, a coalesced matching ordered substring and a coalesced query file substring are formed that can be used to format output comparison data.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
REGENTS OF THE UNIVERSITY OF CALIFORNIA THE300 LAKESIDE DRIVE 22ND FLOOR OAKLAND CA 94612 UNITED STATES OF AMERICA

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Aiken, Alexander San Mateo, CA 3 424

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation