Document information retrieval using global word co-occurrence patterns

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5675819
SERIAL NO

08260575

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A method and apparatus accesses relevant documents based on a query. A thesaurus of word vectors is formed for the words in the corpus of documents. The word vectors represent global lexical co-occurrence patterns and relationships between word neighbors. Document vectors, which are formed from the combination of word vectors, are in the same multi-dimensional space as the word vectors. A singular value decomposition is used to reduce the dimensionality of the document vectors. A query vector is formed from the combination of word vectors associated with the words in the query. The query vector and document vectors are compared to determine the relevant documents. The query vector can be divided into several factor clusters to form factor vectors. The factor vectors are then compared to the document vectors to determine the ranking of the documents within the factor cluster.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
TECHNOLOGY LICENSING CORPORATION20520 PROSPECT ROAD SUITE 200 SARATOGA CA 95070

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Schuetze, Hinrich Stanford, CA 18 3964

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation