System and method for analysis and clustering of documents for search engine

Number of patents in Portfolio can not be more than 2000

United States of America Patent

APP PUB NO 20020065857A1
SERIAL NO

09920732

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A system and method for searching documents in a data source and more particularly, to a system and method for analyzing and clustering of documents for a search engine. The system and method includes analyzing and processing documents to secure the infrastructure and standards for optimal document processing. By incorporating Computational Intelligence (CI) and statistical methods, the document information is analyzed and clustered using novel techniques for knowledge extraction. A comprehensive dictionary is built based on the keywords identified by the these techniques from the entire text of the document. The text is parsed for keywords or the number of its occurrences and the context in which the word appears in the documents. The whole document is identified by the knowledge that is represented in its contents. Based on such knowledge extracted from all the documents, the documents are clustered into meaningful groups in a catalog tree. The results of document analysis and clustering information are stored in a database.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NUTECH SOLUTIONS INC8401 UNIVERSITY EXECUTIVE PARK SUITE 102 CHARLOTTE NC 28262

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Jankowski, Andrzej Warsaw, PL 15 384
Michalewicz, Zbigniew Charlotte, NC 4 305

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation