Computer-Implemented System And Method For Clustering Documents Based On Scored Concepts

United States of America Patent

SERIAL NO

14148686

Abstract

A computer-implemented system and method for clustering documents based on scored concepts is provided. A set of documents is obtained and concepts are extracted from the documents. A score is calculated for each concept. The score is determined as a function of a summation of a frequency of occurrence, a concept weight, a structural weight, and a corpus weight. The documents in the set are clustered based on the scores. A vector is formed for each document based on the concepts in that document and the scores associated with those concepts. A similarity is determined between each document and each of the other documents based on the formed vectors. Documents that are sufficiently distinct from the other documents are identified as seed documents for separate document clusters. Each remaining document is grouped into the cluster most similar to that document.
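
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions the abstract does not specify: the score is taken to be a plain sum of the four named terms, similarity is taken to be cosine similarity, "sufficiently distinct" is modeled as a greedy check against a hypothetical `threshold` parameter, and the function names `score_concepts`, `cosine`, and `cluster` are illustrative only.

```python
import math
from collections import Counter


def score_concepts(concepts, concept_weight, structural_weight, corpus_weight):
    """Score each concept found in one document.

    Assumption: the score is a plain sum of the concept's frequency and its
    three weights. The abstract names the four terms but not the exact formula.
    """
    freq = Counter(concepts)
    return {
        c: f
        + concept_weight.get(c, 0.0)
        + structural_weight.get(c, 0.0)
        + corpus_weight.get(c, 0.0)
        for c, f in freq.items()
    }


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts (assumed metric)."""
    dot = sum(w * v.get(c, 0.0) for c, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(vectors, threshold=0.5):
    """Pick seed documents, then attach every other document to its closest seed.

    Assumption: a document becomes a seed when its similarity to every seed
    chosen so far is below `threshold` (a hypothetical parameter).
    """
    seeds = []
    for i, vec in enumerate(vectors):
        if all(cosine(vec, vectors[s]) < threshold for s in seeds):
            seeds.append(i)

    clusters = {s: [s] for s in seeds}
    for i, vec in enumerate(vectors):
        if i in clusters:
            continue  # seeds already belong to their own cluster
        best = max(seeds, key=lambda s: cosine(vec, vectors[s]))
        clusters[best].append(i)
    return list(clusters.values())


# Toy corpus: each document is already reduced to its extracted concepts.
docs = [
    ["contract", "contract", "payment"],
    ["contract", "payment"],
    ["soccer", "goal", "goal"],
    ["soccer", "goal"],
]
# With no weights supplied, each score equals the concept's frequency.
vectors = [score_concepts(d, {}, {}, {}) for d in docs]
clusters = cluster(vectors)
print(clusters)  # [[0, 1], [2, 3]]
```

In this toy run the two contract documents and the two soccer documents share no concepts, so documents 0 and 2 become seeds and the others join the seed they most resemble. A different choice of threshold or seed-selection order would generally give different clusters.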

Patent Owner(s)

Patent Owner             Address
NUIX NORTH AMERICA INC   13755 SUNRISE VALLEY DR HERNDON VA 20171

Inventor(s)

Inventor Name        Address       # of Filed Patents   Total Citations
Evans, Lynne Marie   Renton, US    33                   399
Kawai, Kenji         Seattle, US   181                  3153
