US Patent No: 6,192,360

Methods and apparatus for classifying text and for building a text classifier

Abstract

A text classifier and building the text classifier by determining appropriate parameters for the text classifier.

Patent Owner(s)

Patent Owner Address Total Patents
MICROSOFT CORPORATION REDMOND, WA 24565

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Dumais, Susan T Kirkland, WA 97 1969
Heckerman, David Bellevue, WA 6 287
Horvitz, Eric Kirkland, WA 111 3695
Platt, John Carlton Bellevue, WA 5 323
Sahami, Mehran Redwood City, CA 37 961

Cited Art

Patent Info (Count) # Cites Year
 
ICAD, INC. (1)
6,115,488 Method and system for combining automated detections from digital mammograms with observed detections of a human interpreter 32 1999
 
MICROSOFT CORPORATION (1)
6,115,708 Method for refining the initial conditions for clustering with applications to small and large database clustering 46 1998
 
MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC. (1)
6,115,052 System for reconstructing the 3-dimensional motions of a human figure from a monocularly-viewed image sequence 121 1998

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
MICROSOFT CORPORATION (56)
6,622,160 Methods for routing items for communications based on a measure of criticality 24 1999
6,816,847 Computerized aesthetic judgment of images 15 1999
6,502,082 Modality fusion for object tracking with training system and method 62 1999
6,728,690 Classification system trainer employing maximum margin back-propagation with probabilistic outputs 42 1999
6,697,769 Method and apparatus for fast machine training 13 2000
6,898,737 Automatic classification of event data 11 2001
7,298,903 Method and system for separating text and drawings in digital ink 3 2001
7,392,472 Layout analysis 7 2002
7,263,227 Activity detector 5 2002
7,164,797 Clustering 1 2002
8,046,832 Spam detector with challenges 6 2002
7,120,297 Segmented layered image system 25 2002
7,110,596 System and method facilitating document image compression utilizing a mask 4 2002
7,016,884 Probability estimate for K-nearest neighbor 8 2002
7,266,559 Method and apparatus for adapting a search classifier based on user queries 7 2002
8,335,683 System for using statistical classifiers for spoken language understanding 0 2003
7,249,162 Adaptive junk message filtering system 40 2003
7,219,148 Feedback loop for spam prevention 67 2003
7,483,947 Message rendering for identification of content features 7 2003
8,166,392 Method for automatically assigning priorities to documents and messages 0 2003
7,272,853 Origination/destination features and lists for spam prevention 68 2003
7,711,779 Prevention of outgoing spam 12 2003
7,519,668 Obfuscation of spam filter 4 2003
7,337,181 Methods for routing items for communications based on a measure of criticality 11 2003
8,214,438 (More) advanced spam detection features 0 2004
7,444,384 Integration of a computer-based message priority system with mobile electronic devices 12 2004
7,233,954 Methods for routing items for communications based on a measure of criticality 4 2004
7,464,264 Training filters for detecting spasm based on IP addresses and text-related features 25 2004
7,499,588 Low resolution OCR for camera acquired documents 6 2004
7,409,708 Advanced URL and IP features 24 2004
7,664,819 Incremental anti-spam lookup and update service 4 2004
7,904,517 Challenge response systems 3 2004
7,660,865 Spam filtering with probabilistic secure hashes 3 2004
7,698,339 Method and system for summarizing a document 2 2004
7,464,093 Methods for routing items for communications based on a measure of criticality 1 2005
7,930,353 Trees of classifiers for detecting email spam 5 2005
7,376,275 Clustering 5 2005
8,065,370 Proofs to filter spam 1 2005
7,397,952 "Don't care" pixel interpolation 3 2005
7,451,123 Probability estimate for K-nearest neighbor 0 2005
7,333,965 Classifying text in a code editor using multiple classifiers 1 2006
7,512,274 Block retouching 2 2006
7,764,834 System and method facilitating document image compression utilizing a mask 2 2006
7,376,266 Segmented layered image system 9 2006
8,224,905 Spam filtration utilizing sender activity data 1 2006
8,065,307 Parsing, analysis and scoring of document content 2 2006
7,665,131 Origination/destination features and lists for spam prevention 6 2007
8,364,617 Resilient classification of data 0 2007
7,873,583 Combining resilient classifiers 1 2007
7,558,832 Feedback loop for spam prevention 2 2007
7,386,171 Activity detector 0 2007
8,392,816 Page classifier engine 0 2007
8,250,469 Document layout extraction 0 2007
7,853,599 Feature selection for ranking 0 2008
8,250,159 Message rendering for identification of content features 0 2009
8,140,567 Measuring entity extraction complexity 0 2010
 
EVRI INC. (10)
6,862,710 Internet navigation using soft hyperlinks 90 2000
6,510,406 Inverse inference engine for high performance web search 189 2000
6,757,646 Extended functionality for an inverse inference engine based web search 75 2001
7,283,951 Method and system for enhanced data searching 16 2001
7,051,017 Inverse inference engine for high performance web search 35 2002
7,398,201 Method and system for enhanced data searching 18 2003
7,269,598 Extended functionality for an inverse inference engine based web search 16 2004
7,526,425 Method and system for extending keyword searching to syntactically and semantically annotated data 24 2004
8,131,540 Method and system for extending keyword searching to syntactically and semantically annotated data 2 2009
7,953,593 Method and system for extending keyword searching to syntactically and semantically annotated data 0 2009
 
BUZZMETRICS, LTD., AN ISRAEL CORPORATION (9)
7,725,414 Method for developing a classifier for classifying communications 5 2004
7,523,085 Topical sentiments in electronically stored communications 19 2005
8,271,316 Consumer to business data capturing system 2006
7,596,552 Method and system for extracting web data 9 2006
7,600,017 System and method for scoring electronic messages 19 2007
7,844,483 System and method for predicting external events from electronic author activity 4 2007
7,844,484 System and method for benchmarking electronic message activity 3 2007
7,877,345 Topical sentiments in electronically stored communications 4 2009
8,041,669 Topical sentiments in electronically stored communications 2 2010
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (7)
6,477,551 Interactive electronic messaging system 47 1999
6,519,576 Method and system for predicting transaction 11 2000
6,785,683 Categorization and presentation tool for code resources 24 2000
6,721,737 Method of ranking items using efficient queries 5 2001
6,609,124 Hub for strategic intelligence 16 2001
7,130,833 Classification method of labeled ordered trees using support vector machines 2 2003
7,565,369 System and method for mining time-changing data streams 1 2004
 
AT&T INTELLECTUAL PROPERTY I, L.P. (6)
7,664,812 Phonetic filtering of undesired email messages 7 2003
7,610,341 Filtered email differentiation 5 2003
7,451,184 Child protection from harmful email 7 2003
7,506,031 Filtering email messages corresponding to undesirable domains 7 2006
8,090,778 Foreign network SPAM blocker 0 2006
7,949,718 Phonetic filtering of undesired email messages 1 2009
 
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. (6)
6,823,323 Automatic classification method and apparatus 15 2001
7,415,445 Feature selection for two-class classification systems 7 2002
7,720,781 Feature selection method and apparatus 2 2003
7,185,008 Document classification method and apparatus 15 2003
7,593,903 Method and medium for feature selection of partially labeled data 2 2004
7,437,334 Preparing data for machine learning 3 2004
 
KOFAX, INC. (6)
7,386,527 Effective multi-class support vector machine classification 10 2003
7,958,067 Data classification methods using machine learning techniques 4 2007
7,937,345 Data classification methods using machine learning techniques 4 2007
7,761,391 Methods and systems for improved transductive maximum entropy discrimination classification 2 2007
8,374,977 Methods and systems for transductive data classification 0 2010
8,239,335 Data classification using machine learning techniques 1 2011
 
BDGB ENTERPRISE SOFTWARE S.A.R.L. (5)
7,509,578 Classification method and apparatus 5 2005
8,015,198 Method for automatically indexing documents 2 2008
7,908,430 Associative memory 0 2008
8,276,067 Classification method and apparatus 0 2008
8,209,481 Associative memory 0 2011
 
APPLE INC. (4)
7,076,527 Method and apparatus for filtering email 21 2001
7,991,720 Method and apparatus for organizing information in a computer system 3 2003
7,836,135 Method and apparatus for filtering email 0 2006
7,856,479 Method and apparatus for filtering email 1 2006
 
MCAFEE, INC. (4)
6,732,157 Comprehensive anti-spam system, method, and computer program product for filtering unwanted e-mail messages 149 2002
7,953,814 Stopping and remediating outbound messaging abuse 3 2006
7,680,890 Fuzzy logic voting method and system for classifying e-mail using inputs from multiple spam classifiers 6 2006
8,363,793 Stopping and remediating outbound messaging abuse 0 2011
 
SAS INSTITUTE INC. (4)
6,532,467 Method for selecting node variables in a binary decision tree structure 16 2000
6,996,575 Computer-implemented system and method for text-based document processing 19 2002
7,809,539 Method for selecting node variables in a binary decision tree structure 0 2002
7,127,466 Method for selecting node variables in a binary decision tree structure 2 2003
 
BELLSOUTH INTELLECTUAL PROPERTY CORPORATION (3)
7,996,470 Processing rules for digital messages 2 2003
7,930,351 Identifying undesired email messages having attachments 1 2003
7,844,678 Filtering email messages corresponding to undesirable domains 3 2008
 
GOOGLE INC. (3)
6,640,228 Method for detecting incorrectly categorized data 6 2000
7,734,627 Document similarity detection 11 2003
8,209,339 Document similarity detection 0 2010
 
LIMELIGHT NETWORKS, INC. (3)
8,204,891 Method and subsystem for searching media content within a content-search-service system 0 2008
7,917,492 Method and subsystem for information acquisition and aggregation to facilitate ontology and language-model generation within a content-search-service system 4 2008
8,396,878 Methods and systems for generating automated tags for video files 0 2011
 
NEC CORPORATION (3)
6,718,333 Structured document classification device, structured document search system, and computer-readable memory causing a computer to function as the same 17 1999
7,295,977 Extracting classifying data in music from an audio bitstream 11 2001
7,406,450 Spread kernel support vector machine 3 2006
 
STRAGENT, LLC (3)
8,204,945 Hash-based systems and methods for detecting and preventing transmission of unwanted e-mail 0 2008
8,272,060 Hash-based systems and methods for detecting and preventing transmission of polymorphic network worms and viruses 2010
8,166,549 Hash-based systems and methods for detecting and preventing transmission of polymorphic network worms and viruses 0 2010
 
SYMANTEC CORPORATION (3)
7,882,193 Apparatus and method for weighted and aging spam filtering rules 9 2002
7,831,667 Method and apparatus for filtering email spam using email noise reduction 4 2004
8,402,102 Method and apparatus for filtering email spam using email noise reduction 0 2010
 
WEST SERVICES, INC. (3)
7,593,920 System, method, and software for identifying historically related legal opinions 3 2002
7,620,626 System, method, and software for identifying historically related legal opinions 3 2006
7,984,053 System, method, and software for identifying historically related legal cases 1 2009
 
AURILAB, LLC (2)
8,331,656 Robust pattern recognition system and method using Socratic agents 0 2012
8,331,657 Robust pattern recognition system and method using Socratic agents 0 2012
 
CITIBANK, N.A. (2)
8,041,632 Method and system for using a Bayesian belief network to ensure data integrity 0 2000
8,341,075 Method and system for using a Bayesian belief network to ensure data integrity 0 2011
 
CLOUDMARK, INC. (2)
7,519,565 Methods and apparatuses for classifying electronic documents 14 2004
7,890,441 Methods and apparatuses for classifying electronic documents 2 2009
 
DTI OF WASHINGTON, LLC (2)
6,751,628 Process and system for sparse vector and matrix representation of document indexing and retrieval 21 2002
7,328,204 Process and system for sparse vector and matrix representation of document indexing and retrieval 5 2004
 
EMC CORPORATION (2)
8,380,696 Methods and apparatus for dynamically classifying objects 0 2006
8,375,020 Methods and apparatus for classifying objects 0 2006
 
NIELSEN COMPANY (US), LLC, THE (2)
7,660,783 System and method of ad-hoc analysis of data 5 2007
8,347,326 Identifying key media events and modeling causal relationships between key events and reported feelings 0 2007
 
Reynolds, Tom (2)
7,769,626 Determining strategies for increasing loyalty of a population to an entity 5 2004
8,301,482 Determining strategies for increasing loyalty of a population to an entity 0 2007
 
ROSKIND, JAMES A. (2)
7,714,712 Mobile surveillance 11 2007
8,049,615 Mobile surveillance 0 2010
 
SECURE COMPUTING CORPORATION (2)
7,903,549 Content-based policy compliance systems and methods 1 2006
8,214,497 Multi-dimensional reputation scoring 2007
 
STERN, JULIAN N. (2)
7,970,718 Method for feature selection and for evaluating features identified as significant for classifying data 1 2010
8,095,483 Support vector machine—recursive feature elimination (SVM-RFE) 0 2010
 
The United States of America as represented by the Secretary of the Navy (2)
6,560,582 Dynamic memory processor 2 2000
RE42255 Color sensor 0 2006
 
APR SMARTLOGIK LIMITED (1)
6,556,987 Automatic text classification system 49 2000
 
AT&T CORP. (1)
6,539,391 Method and system for squashing a large data set 14 1999
 
AXWAY INC. (1)
7,653,606 Dynamic message filtering 1 2006
 
BANK OF AMERICA, N.A. (1)
7,245,765 Method and apparatus for capturing paper-based information on a mobile computing device 1 2004
 
BARRACUDA NETWORKS, INC. (1)
6,778,941 Message and user attributes in a message filtering method and system 41 2001
 
Bayes Information Technology, Ltd. (1)
6,873,325 Visualization method and visualization system 5 2002
 
BRAINWARE, INC. (1)
8,321,357 Method and system for extraction 2009
 
DSPV, LTD. (1)
7,447,362 System and method of enabling a cellular/wireless device with imaging capabilities to decode printed alphanumeric characters 3 2005
 
DVPV, LTD. (1)
7,551,782 System and method of user interface and data entry from a video call 3 2006
 
GLOBAL EPROCURE (1)
8,429,098 Classification confidence estimating tool 0 2010
 
HEALTH DISCOVERY CORPORATION (1)
7,805,388 Method for feature selection in a support vector machine using feature ranking 4 2007
 
HUAWEI TECHNOLOGIES CO., LTD. (1)
8,082,263 Method, apparatus and system for multimedia model retrieval 2 2008
 
KONINKLIJKE PHILIPS ELECTRONICS N.V. (1)
6,798,912 Apparatus and method of program classification based on syntax of transcript information 3 2000
 
LEXIS-NEXIS Group (1)
6,772,149 System and method for identifying facts and legal discussion in court case law documents 12 1999
 
LEXMARK INTERNATIONAL, INC. (1)
7,532,755 Image classification using concentration ratio 0 2004
 
MOTOROLA MOBILITY LLC (1)
6,438,519 Apparatus and method for rejecting out-of-class inputs for pattern classification 8 2000
 
MX LOGIC (1)
7,051,077 Fuzzy logic voting method and system for classifying e-mail using inputs from multiple spam classifiers 49 2004
 
NUANCE COMMUNICATIONS, INC. (1)
6,925,433 System and method for context-dependent probabilistic modeling of words and documents 17 2001
 
PALO ALTO RESEARCH CENTER INCORPORATED (1)
7,577,654 Systems and methods for new event detection 1 2003
 
PROOFPOINT, INC. (1)
8,417,783 System and method for improving feature selection for a spam filtering model 0 2006
 
RAMP HOLDINGS, INC. (F/K/A EVERYZING, INC.) (1)
6,405,188 Information retrieval system 23 1998
 
REQUISITE SOFTWARE, INC. (1)
7,043,492 Automated classification of items using classification mappings 35 2002
 
RICOH COMPANY, LTD. (1)
6,826,724 Document processor, document classification device, document processing method, document classification method, and computer-readable recording medium for recording programs for executing the methods on a computer 7 1999
 
ROBERT BOSCH GMBH (1)
7,796,820 Method for evaluation and stabilization over time of classification results 0 2004
 
ROCKWELL AUTOMATION TECHNOLOGIES, INC. (1)
6,944,616 System and method for historical database training of support vector machines 28 2001
 
ROSKIND, JAMES (1)
7,818,317 Location-based tasks 3 2004
 
SER SYSTEME AG (1)
6,976,207 Classification method and apparatus 26 2000
 
SONICWALL, INC. (1)
7,739,253 Link-based content ratings of pages 8 2005
 
STANDARD & POOR'S FINANCIAL SERVICES LLC (1)
8,180,713 System and method for searching and identifying potential financial risks disclosed within a document 0 2008
 
Telestra New Wave Pty Ltd (1)
8,005,293 Gradient based training method for a support vector machine 0 2001
 
TEXTWISE LLC (1)
7,912,868 Advertisement placement method and system using semantic analysis 0 2005
 
The MathWorks, Inc. (1)
8,260,602 Timer analysis and identification 0 2007
 
TREND MICRO INCORPORATED (1)
6,622,134 Method of constructing data classifiers and classifiers constructed according to the method 38 1999
 
YAHOO! INC. (1)
7,440,944 Method and apparatus for efficient training of support vector machines 4 2004
 
ZYCUS INFOTECH PVT LTD. (1)
7,165,068 System and method for electronic catalog classification using a hybrid of rule based and statistical method 4 2002
 
Other [Check patent profile for assignment information] (3)
7,575,171 System and method for reliable content access using a cellular/wireless device with imaging capabilities 25 2006
8,226,418 Method and apparatus for personal awareness and growth 1 2011
8,449,300 Method and apparatus for personal awareness and growth 0 2012