Augmenting a training set for document categorization

United States of America Patent

PATENT NO 7457801
APP PUB NO 20070112753A1
SERIAL NO 11273714

Abstract

A method and system for augmenting a training set used to train a classifier of documents is provided. The augmentation system augments a training set with training data derived from features of documents based on a document hierarchy. The training data of the initial training set may be derived from the root documents of the hierarchies of documents. The augmentation system generates additional training data that includes an aggregate feature that represents the overall characteristics of a hierarchy of documents, rather than just the root document. After the training data is generated, the augmentation system augments the initial training set with the newly generated training data.
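
The abstract's idea can be illustrated with a minimal sketch. It is not the patented implementation: the `Doc` class, the bag-of-words `features` function, and the choice of averaging term counts as the aggregate feature are all assumptions made for illustration, since the abstract does not specify how features are extracted or combined.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Doc:
    """A node in a document hierarchy (hypothetical structure)."""
    text: str
    children: list = field(default_factory=list)


def features(doc):
    """Bag-of-words term counts for one document (assumed feature type)."""
    return Counter(doc.text.lower().split())


def walk(doc):
    """Yield the document and all of its descendants."""
    yield doc
    for child in doc.children:
        yield from walk(child)


def aggregate_features(root):
    """Average term counts over every document in the hierarchy.

    Averaging is an assumption; the abstract only says the aggregate
    represents the overall characteristics of the hierarchy.
    """
    docs = list(walk(root))
    total = Counter()
    for d in docs:
        total.update(features(d))
    return {term: count / len(docs) for term, count in total.items()}


def augment(labeled_roots):
    """Return the initial training set plus one aggregate entry per hierarchy."""
    # Initial training data: features of the root documents only.
    initial = [(dict(features(root)), label) for root, label in labeled_roots]
    # Additional training data: aggregate features of each whole hierarchy.
    extra = [(aggregate_features(root), label) for root, label in labeled_roots]
    return initial + extra


# Example: one hierarchy with a root page and two child pages.
site = Doc("sports news", [Doc("football scores"), Doc("tennis scores")])
training_set = augment([(site, "sports")])
```

Here `training_set` holds two entries for the label `"sports"`: one built from the root document alone and one built from the whole hierarchy, so a classifier trained on it also sees terms that appear only in descendant documents.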

Patent Owner(s)

Patent Owner | Address
MICROSOFT TECHNOLOGY LICENSING LLC | ONE MICROSOFT WAY REDMOND WA 98052

Inventor(s)

Inventor Name | Address | # of Filed Patents | Total Citations
Liu, Tie-Yan | Beijing, CN | 87 | 1466
Ma, Wei-Ying | Beijing, CN | 219 | 9185
