Method and apparatus for reducing the computational requirements of K-means data clustering

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5983224
SERIAL NO

08962470

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

The present invention is directed to an improved data clustering method and apparatus for use in data mining operations. The present invention determines the pattern vectors of a k-d tree structure which are closest to a given prototype cluster by pruning prototypes through geometrical constraints, before a k-means process is applied to the prototypes. For each sub-branch in the k-d tree, a candidate set of prototypes is formed from the parent of a child node. The minimum and maximum distances from any point in the child node to any prototype in the candidate set is determined. The smallest of the maximum distances found is compared to the minimum distances of each prototype in the candidate set. Those prototypes with a minimum distance greater than the smallest of the maximum distances are pruned or eliminated. Pruning the number of remote prototypes reduces the number of distance calculations for the k-means process, significantly reducing the overall computation time.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
HITACHI AMERICA LTD2535 AUGUSTINE DRIVE SANTA CLARA CA 95054

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Alsabti, Khaled Gainesville, FL 2 174
Ranka, Sanjay Gainesville, FL 20 753
Singh, Vineet San Jose, CA 22 774

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation