Method and apparatus for automatically identifying keywords within a document

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 6470307
SERIAL NO

08880392

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A trainable method of extracting keywords of one or more words is disclosed. According to the method, every word within a document that is not a stop word is stemmed and evaluated and receives a score. The scoring is performed based on a plurality of parameters which are adjusted through training prior to use of the method for keyword extraction. Each word having a high score is then replaced by a word phrase that is delimited by punctuation or stop words. The word phrase is selected from word phrases having the stemmed word therein. Repeated keywords are removed. The keywords are expanded and capitalisation is determined. The resulting list forms extracted keywords.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
NATIONAL RESEARCH COUNCIL OF CANANDA1500 MONTREAL ROAD OTTAWA ONTARIO K1A O

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Turney, Peter D Gloucester, CA 1 200

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation