Bayes rule based and decision tree hybrid classifier

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 6182058
SERIAL NO

08810217

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

The present invention provides a hybrid classifier, called the NB-Tree classifier, for classifying a set of records. According to the present invention, the NB-Tree classifier includes a Decision-Tree structure having zero or more decision-nodes and one or more leaf-nodes. At each decision-node, a test is performed based on one or more attributes. At each leaf-node, a classifier based on Bayes Rule classifies the records. Furthermore, the present invention provides a method for inducing the NB-Tree classifier from a set of labeled instances. To induce the NB-Tree classifier, a utility C.sub.1 of a Bayes classifier at a root-node is first estimated. Next, a utility D.sub.1 of a split into a plurality of child-nodes with a Bayes classifier at the child-nodes is estimated. The utility of a split is the weighted sum of the utility of the child-nodes, where the weight given to a child-node is proportional to the number of instances that go down that child-node. Next, it is determined if C.sub.1 is higher than D.sub.1. If C.sub.1 is higher than D.sub.1, the root-node is transformed into a leaf-node with a Bayes classifier. If C.sub.1 is not higher than D.sub.1, the root-node is transformed into a decision-node, and the instances are partitioned into a plurality of child-nodes. The method then recursively performs the previous steps for each child-node as if it is a root-node. The present invention approximates whether a generalization accuracy for a Naive-Bayes classifier at each leaf-node is higher than a single Naive-Bayes classifier at the decision-node. According to one embodiment of the present invention, to avoid splits with little value, a split is defined to be significant if the relative (not absolute) reduction in error is greater than 5% and there are at least 30 instances in the node.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
RPX CORPORATIONFOUR EMBARCADERO SUITE 4000 SAN FRANCISCO CA 94111

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Kohavi, Ron Mountain View, CA 18 839

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation