
US Patent No: 5,371,807
Number of patents in Portfolio can not be more than 2000
Method and apparatus for text classification
Stats
-
Dec 6, 1994
Issued date -
Mar 20, 1992
filing date -
07/855,378
serial no -
Expired
status
Importance
Abstract
A text classification system and method that can be used by an application for classifying natural language text input into a computer system having a domain specific knowledge base that includes a knowledge base having a plurality of categories. The text classification system classifies input natural language input text by first parsing the natural language input text into a first list of recognized keywords. This list is then used to deduce further facts from the natural language input text which are then compiled into a second list. Next, a numeric similarity score for each one of the plurality of categories in the knowledge base is calculated which indicates how similar one of the plurality of categories is to the natural language input text. A dynamic threshold is then applied to determine which ones of the plurality of categories are most similar to the recognized keywords of the natural language input text. A third list is compiled of the ones of the plurality of categories determined to be most similar to the recognized keywords. An optional rule base can be utilized to further refine the determination of which ones of the plurality of categories are most similar to the recognized keywords of the natural language input text. Also, an optional learning capability can be added to improve the accuracy of the text classification system.
First Claim
Related Publications
International Classification(s)
- [Classification Symbol]
- [Patents Count]
Cited Art
| Patent Info | (Count) | # Cites | Year |
|---|---|---|---|
|
|
|||
| 4,674,065 System for detecting and correcting contextual errors in a text processing system | 111 | 1985 | |
| 5,146,406 Computer method for identifying predicate-argument structures in natural language text | 62 | 1989 | |
|
|
|||
| 5,128,865 Method for determining the semantic relatedness of lexical items in a text | 77 | 1990 | |
|
|
|||
| 4,876,731 Neural network model in pattern recognition using probabilistic contextual information | 74 | 1988 | |
|
|
|||
| 4,682,365 System and method for preparing a recognition dictionary | 34 | 1985 | |
|
|
|||
| 5,050,218 Apparatus for recognizing address appearing on mail article | 22 | 1991 | |
|
|
|||
| 4,754,489 Means for resolving ambiguities in text based upon character context | 58 | 1985 | |
|
|
|||
| 5,083,268 System and method for parsing natural language by unifying lexical features of words | 53 | 1990 | |
|
|
|||
| 5,056,021 Method and apparatus for abstracting concepts from natural language | 131 | 1989 | |