System and method for improving feature selection for a spam filtering model

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 8417783
SERIAL NO

11444731

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A system and method for removing ineffective features from a spam feature set. In particular, in one embodiment of the invention, the an entropy value is calculated for the feature set based on the effectiveness of the feature set at differentiating between ham and spam. Features are then removed one at a time and the entropy is recalculated. Features which increase the overall entropy are removed and features which decrease the overall entropy are retained. In another embodiment of the invention, the value of certain type of time consuming features (e.g., rules) is determined based on both the information gain associated with the features and the time consumed implementing the features. Those features which have relatively low information gain and which consume a significant amount of time to implement are removed from the feature set.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
PROOFPOINT INCC/O PROOFPOINT INC IP LEGAL DEPARTMENT 925 W MAUDE AVE SUNNYVALE CA 94085

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Lewis, Steve San Jose, US 9 127
Myers, John Gardiner Santa Clara, US 4 22
Sharma, Vipul Sunnyvale, US 18 999

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation