Method and system for mining generalized sequential patterns in a large database

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5742811
SERIAL NO

08541665

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method and apparatus are disclosed for mining generalized sequential patterns from a large database of data sequences, taking into account user specified constraints on the time-gap between adjacent elements of the patterns, sliding time-window, and taxonomies over data items. The invention first identifies the items with at least a minimum support, i.e., those contained in more than a minimum number of data sequences. The items are used as a seed set to generate candidate sequences. Next, the support of the candidate sequences are counted. The invention then identifies those candidate sequences that are frequent, i.e., those with a support above the minimum support. The frequent candidate sequences are entered into the set of sequential patterns, and are used to generate the next group of candidate sequences. Preferably, the candidate sequences are generated by joining previously found frequent candidate sequences, and candidate sequences having a contiguous subsequence without minimum support are discarded. In addition, the invention includes a hash-tree data structure for storing the candidate sequences and memory management techniques for performance improvement.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
INTERNATIONAL BUSINESS MACHINES CORPORATIONNEW ORCHARD ROAD ARMONK NY 10504

International Classification(s)

  • No Non-US Classification to display

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Agrawal, Rakesh San Jose, CA 265 9553
Srikant, Ramakrishnan San Jose, CA 44 1749

Cited Art Landscape

Load Citation

Patent Citation Ranking

  • No Patent Citation Ranking to display

Forward Cite Landscape

Load Citation