US Patent No: 6,546,389

Number of patents in Portfolio can not be more than 2000

Method and system for building a decision-tree classifier from privacy-preserving data

Stats

ATTORNEY / AGENT: (SPONSORED)
 

Importance

Loading Importance Indicators... loading....

Abstract

A system and method for mining data while preserving a user's privacy includes perturbing user-related information at the user's computer and sending the perturbed data to a Web site. At the Web site, perturbed data from many users is aggregated, and from the distribution of the perturbed data, the distribution of the original data is reconstructed, although individual records cannot be reconstructed. Based on the reconstructed distribution, a decision tree classification model or a Naive Bayes classification model is developed, with the model then being provided back to the users, who can use the model on their individual data to generate classifications that are then sent back to the Web site such that the Web site can display a page appropriately configured for the user's classification. Or, the classification model need not be provided to users, but the Web site can use the model to, e.g., send search results and a ranking model to a user, with the ranking model being used at the user computer to rank the search results based on the user's individual classification data.

Loading the Abstract Image... loading....

First Claim

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
INTERNATIONAL BUSINESS MACHINES CORPORATIONARMONK, NY68180

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Agrawal, Rakesh San Jose, CA 253 4612
Srikant, Ramakrishnan San Jose, CA 41 939

Cited Art

Patent Info (Count) # Cites Year
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (5)
5,787,274 Data mining method and system for generating a decision tree classifier for data records based on a minimum description length (MDL) and presorting of records 76 1995
5,870,735 Method and system for generating a decision-tree classifier in parallel in a multi-processor system 45 1996
5,799,311 Method and system for generating a decision-tree classifier independent of system memory size 59 1996
6,230,151 Parallel classification for data mining in a shared-memory multiprocessor system 35 1998
6,138,115 Method and system for generating a decision-tree classifier in parallel in a multi-processor system 39 1999
 
AT&T CORP. (1)
6,055,510 Method for performing targeted marketing over a large computer network 199 1997
 
LUCENT TECHNOLOGIES INC. (1)
6,247,016 Decision tree classifier with integrated building and pruning phases 29 1998

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
GOOGLE INC. (11)
7,231,399 Ranking documents based on large data sets 32 2003
7,222,127 Large scale machine learning systems and methods 16 2003
7,716,225 Ranking documents based on user behavior and/or feature data 16 2004
7,870,147 Query revision using known highly-ranked queries 7 2005
7,769,763 Large scale machine learning systems and methods 3 2007
7,743,050 Model generation for ranking documents based on large data sets 4 2007
8,140,524 Estimating confidence for query revision models 0 2008
8,117,209 Ranking documents based on user behavior and/or feature data 0 2010
8,195,674 Large scale machine learning systems and methods 0 2010
8,375,049 Query revision using known highly-ranked queries 0 2010
8,364,618 Large scale machine learning systems and methods 0 2012
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (7)
6,931,403 System and architecture for privacy-preserving data mining 7 2000
6,694,303 Method and system for building a Naive Bayes classifier from privacy-preserving data 8 2000
6,687,691 Method and system for reconstructing original distributions from randomized numeric data 4 2000
6,871,201 Method for building space-splitting decision tree 12 2001
7,810,142 Auditing compliance with a hippocratic database 0 2005
7,853,545 Preserving privacy of one-dimensional data streams using dynamic correlations 1 2007
7,840,516 Preserving privacy of one-dimensional data streams by perturbing data with noise and using dynamic autocorrelation 1 2007
 
MICROSOFT CORPORATION (2)
8,392,380 Load-balancing and scaling for analytics data 0 2009
8,082,247 Best-bet recommendations 1 2009
 
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. (1)
6,738,950 Method and system for dynamic generation of web site content for specific user communities from a single content base 15 2000
 
INTEL CORPORATION (1)
7,644,049 Decision forest based classifier for determining predictive importance in real-time data analysis 1 2004
 
ORACLE INTERNATIONAL CORPORATION (1)
8,280,915 Binning predictors using per-predictor trees and MDL pruning 0 2006
 
SIEMENS MEDICAL SOLUTIONS USA, INC. (1)
8,250,013 System and method for privacy preserving predictive models for lung cancer survival analysis 0 2009

Maintenance Fees

Fee Large entity fee small entity fee micro entity fee due date
11.5 Year Payment $7400.00 $3700.00 $1850.00 Oct 8, 2014
Fee Large entity fee small entity fee micro entity fee
Surcharge - 11.5 year - Late payment within 6 months $160.00 $80.00 $40.00
Surcharge after expiration - Late payment is unavoidable $700.00 $350.00 $175.00
Surcharge after expiration - Late payment is unintentional $1,640.00 $820.00 $410.00