Text categorization with knowledge transfer from heterogeneous datasets

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 8103671
APP PUB NO 20090171956A1
SERIAL NO

12249809

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

The present invention provides a method for incorporating features from heterogeneous auxiliary datasets into input text data for use in classification. Heterogeneous auxiliary datasets, such as labeled datasets and unlabeled datasets, are accessed after receiving input text data. Features are extracted from each of the heterogeneous auxiliary datasets. The features are combined with the input text data to generate a set of features which may potentially be used to classify the input text data. Classification features are then extracted from the set of features and used to classify the input text data. In one embodiment, the classification features are extracted by calculating a mutual information value associated with each feature in the set of features and identifying features having a mutual information value exceeding a threshold value.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

  • HONDA MOTOR CO., LTD.

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Gupta, Rakesh Mountain View, US 60 1478
Ratinov, Lev Raymond, US 4 51

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation