US Patent No: 7,130,837

Number of patents in Portfolio can not be more than 2000

Systems and methods for determining the topic structure of a portion of text

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Systems and methods for determining the topic structure of a document including text utilize a Probabilistic Latent Semantic Analysis (PLSA) model and select segmentation points based on similarity values between pairs of adjacent text blocks. PLSA forms a framework for both text segmentation and topic identification. The use of PLSA provides an improved representation for the sparse information in a text block, such as a sentence or a sequence of sentences. Topic characterization of each text segment is derived from PLSA parameters that relate words to 'topics', latent variables in the PLSA model, and 'topics' to text segments. A system executing the method exhibits significant performance improvement. Once determined, the topic structure of a document may be employed for document retrieval and/or document summarization.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
XEROX CORPORATIONSTAMFORD, CT13828

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Brants, Thorsten H Palo Alto, CA 8 166
Chen, Francine R Menlo Park, CA 39 2086
Tsochantaridis, Ioannis Providence, RI 2 34

Cited Art Landscape

Patent Info (Count) # Cites Year
 
FUJI XEROX CO., LTD. (1)
5,943,669 Document retrieval device 53 1997
 
TECHNOLOGY LICENSING CORPORATION (1)
5,675,819 Document information retrieval using global word co-occurrence patterns 400 1994
 
XEROX CORPORATION (5)
5,606,643 Real-time audio recording system for automatic speaker indexing 22 1994
5,659,766 Method and apparatus for inferring the topical content of a document based upon its lexical content without supervision 50 1994
5,687,364 Method for learning to infer the topical content of documents based upon their lexical content 67 1994
6,128,634 Method and apparatus for facilitating skimming of text 28 1998
6,239,801 Method and system for indexing and controlling the playback of multimedia documents 49 1999
* Cited By Examiner

Patent Citation Ranking

Forward Cite Landscape

Patent Info (Count) # Cites Year
 
Other [Check patent profile for assignment information] (1)
* 2008/0114,737 METHOD AND SYSTEM FOR AUTOMATICALLY IDENTIFYING USERS TO PARTICIPATE IN AN ELECTRONIC CONVERSATION 38 2007
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (1)
* 2011/0202,484 ANALYZING PARALLEL TOPICS FROM CORRELATED DOCUMENTS 4 2010
 
NETWORKED INSIGHTS, LLC (1)
7,925,743 Method and system for qualifying user engagement with a website 21 2008
 
NEC CORPORATION (2)
* 9,015,161 Mismatch detection system, method, and program 1 2011
* 2013/0031,098 MISMATCH DETECTION SYSTEM, METHOD, AND PROGRAM 0 2011
 
PAYPAL, INC. (2)
* 8,631,005 Header-token driven automatic text segmentation 1 2006
9,053,091 Header-token driven automatic text segmentation 0 2013
 
SONY CORPORATION (1)
8,666,915 Method and device for information retrieval 0 2011
 
XEROX CORPORATION (2)
* 7,457,808 Method and apparatus for explaining categorization decisions 31 2004
* 2006/0136,410 Method and apparatus for explaining categorization decisions 1 2004
 
NETWORKED INSIGHTS, INC. (WISCONSIN CORPORATION) (1)
* 2009/0222,551 METHOD AND SYSTEM FOR QUALIFYING USER ENGAGEMENT WITH A WEBSITE 39 2008
 
MICROSOFT TECHNOLOGY LICENSING, LLC (6)
* 8,335,683 System for using statistical classifiers for spoken language understanding 4 2003
* 2004/0148,154 System for using statistical classifiers for spoken language understanding 24 2003
* 7,853,596 Mining geographic knowledge using a location aware topic model 3 2007
* 2008/0319,974 MINING GEOGRAPHIC KNOWLEDGE USING A LOCATION AWARE TOPIC MODEL 21 2007
* 2009/0119,284 METHOD AND SYSTEM FOR CLASSIFYING DISPLAY PAGES USING SUMMARIES 4 2008
* 8,924,391 Text classification using concept kernel 0 2010
 
APTIMA, INC. (2)
* 7,822,750 Method and system to compare data entities 8 2008
* 2008/0250,064 METHOD AND SYSTEM TO COMPARE DATA ENTITIES 1 2008
 
MITSUBISHI ELECTRIC RESEARCH LABORATORIES, INC. (2)
* 9,251,250 Method and apparatus for processing text with variations in vocabulary usage 0 2012
* 2013/0262,083 Method and Apparatus for Processing Text with Variations in Vocabulary Usage 0 2012
* Cited By Examiner

Maintenance Fees

Fee Large entity fee small entity fee micro entity fee due date
11.5 Year Payment $7400.00 $3700.00 $1850.00 Apr 30, 2018
Fee Large entity fee small entity fee micro entity fee
Surcharge - 11.5 year - Late payment within 6 months $160.00 $80.00 $40.00
Surcharge after expiration - Late payment is unavoidable $700.00 $350.00 $175.00
Surcharge after expiration - Late payment is unintentional $1,640.00 $820.00 $410.00