US Patent No: 7,996,208

Number of patents in Portfolio can not be more than 2000

Methods and systems for selecting a language for text segmentation

1 Status Updates

Stats

ALSO PUBLISHED AS: 20060074628
ATTORNEY / AGENT: (SPONSORED)
 

Importance

Loading Importance Indicators... loading....

Abstract

Methods and systems for selecting a language for text segmentation are disclosed. In one embodiment, at least a first candidate language and a second candidate language associated with a string of characters are identified, at least a first segmented result associated with the first candidate language and a second segmented result associated with the second candidate language are determined, a first frequency of occurrence for the first segmented result and a second frequency of occurrence for the second segmented result are determined, and an operable language is identified from the first candidate language and the second candidate language based at least in part on the first frequency of occurrence and the second frequency of occurrence.

Loading the Abstract Image... loading....

First Claim

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
GOOGLE INC.MOUNTAIN VIEW, CA6665

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Elbaz, Gilad Israel Los Angeles, CA 25 220
Mandelson, Jacob Leon Pasadena, CA 11 13

Cited Art

Patent Info (Count) # Cites Year
 
GOOGLE INC. (13)
5,845,278 Method for automatically selecting collections to search in full text searches 236 1997
6,493,702 System and method for searching and recommending documents in a collection using share bookmarks 270 1999
6,754,873 Techniques for finding related hyperlinked documents using link-based analysis 116 2000
6,615,209 Detecting query-specific duplicate documents 138 2000
6,529,903 Methods and apparatus for using a modified index to provide search results in response to an ambiguous search query 143 2000
2002/0133,481 Methods and apparatus for providing search results in response to an ambiguous search query 15 2000
6,658,423 Detecting duplicate and near-duplicate files 231 2001
6,526,440 Ranking search results by reranking the results based on local inter-connectivity 146 2001
2002/0123,988 Methods and apparatus for employing usage statistics in document retrieval 162 2001
2004/0059,708 Methods and apparatus for serving relevant advertisements 241 2002
2004/0119,740 Methods and apparatus for displaying and replying to electronic messages 84 2002
6,725,259 Ranking search results by reranking the results based on local inter-connectivity 50 2003
2005/0228,797 Suggesting and/or providing targeting criteria for advertisements 35 2003
 
MICROSOFT CORPORATION (10)
5,966,686 Method and system for computing semantic logical forms from syntax trees 86 1996
6,076,051 Information retrieval utilizing semantic representation of text 79 1997
5,933,822 Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision 340 1997
6,272,456 System and method for identifying the language of written text having a plurality of different length n-gram profiles 61 1998
6,640,006 Word segmentation in chinese text 7 1998
6,678,409 Parameterized word segmentation of unsegmented text 19 2000
6,910,003 System, method and article of manufacture for concept based information searching 108 2000
6,766,320 Search engine with natural language-based robust parsing for user query and relevance feedback learning 162 2000
6,968,308 Method for segmenting non-segmented text using syntactic parse 10 2000
2005/0131,872 Query recognizer 39 2003
 
IAC SEARCH & MEDIA, INC. (4)
6,006,222 Method for organizing information 133 1997
6,014,665 Method for organizing information 134 1997
6,078,916 Method for organizing information 194 1998
6,182,068 Personalized search methods 194 1999
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (4)
5,423,032 Method for extracting multi-word technical terms from text 58 1992
6,230,168 Method for automatically constructing contexts in a hypertext collection 57 1997
6,233,575 Multilevel taxonomy based on features derived from training documents classification using fisher values as discrimination values 286 1998
6,334,131 Method for cataloging, filtering, and relevance ranking frame-based hierarchical information structures 115 1998
 
HNC, INC. (2)
5,325,298 Methods for generating or revising context vectors for a plurality of word stems 202 1991
5,619,709 System and method of context vector generation and retrieval 385 1995
 
TELECOM PARTNERS INC. (2)
6,298,348 Consumer profiling system 201 1999
6,324,519 Advertisement auction system 254 1999
 
THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY (2)
6,285,999 Method for node ranking in a linked database 427 1998
6,678,681 Information extraction from a database 99 2000
 
VERITY, INC. (2)
5,778,364 Evaluation of content of a data set using multiple and/or complex queries 45 1996
6,738,764 Apparatus and method for adaptively ranking search results 77 2001
 
APPLE INC. (1)
6,826,559 Hybrid category mapping for on-line query tool 57 1999
 
ARIBA, INC. (1)
6,714,939 Creation of structured data from plain text 61 2001
 
BRITISH TELECOMMUNICATIONS PUBLIC LIMITED COMPANY (1)
7,107,218 Method and apparatus for processing queries 46 2000
 
CNET, INC. (1)
6,067,552 User interface system and method for browsing a hypertext database 120 1998
 
CONTENT ANALYST COMPANY, LLC (1)
4,839,853 Computer information retrieval using latent semantic structure 216 1988
 
ERICSSON INC. (1)
2002/0198,027 Convenient dialing of names and numbers from a phone without alpha keypad 1 2001
 
ESDR NETWORK SOLUTIONS LLC (1)
2008/0059,607 METHOD, PRODUCT, AND APPARATUS FOR PROCESSING A DATA REQUEST 26 2004
 
FFICIENCY SOFTWARE, INC. (1)
5,454,046 Universal symbolic handwriting recognition system 109 1993
 
FULL CIRCLE SOFTWARE, INC. (1)
6,119,164 Method and apparatus for distributing over a network unsolicited information to a targeted audience 56 1997
 
GOTO.COM. (1)
6,269,361 System and method for influencing a position on a search result list generated by a computer network search engine 567 1999
 
HAPAX LIMITED (1)
6,810,375 Method for segmentation of text 22 2000
 
HITACHI AMERICA, LTD. (1)
6,185,559 Method and apparatus for dynamically counting large itemsets 37 1997
 
HTC CORPORATION (1)
6,044,375 Automatic extraction of metadata using a neural network 86 1998
 
INTEL CORPORATION (1)
5,778,363 Method for measuring thresholded relevance of a document to a specified topic 69 1996
 
KUHURO INVESTMENTS AG, L.L.C. (1)
6,134,532 System and method for optimal adaptive matching of users to most relevant entity and information in real-time 466 1997
 
LOQUENDO S.P.A. (1)
2007/0118,356 Automatic segmentation of texts comprising chunks without separators 9 2003
 
MATSUSHITA ELECTRIC CORPORATION OF AMERICA (1)
5,499,360 Method for proximity searching with range testing and range adjustment 31 1994
 
MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD. (1)
6,542,401 SRAM device 10 2002
 
NATIONAL SECURITY AGENCY (1)
7,409,334 Method of text processing 8 2004
 
NEC RESEARCH, INC. (1)
6,289,342 Autonomous citation indexing and literature browsing using citation context 112 1998
 
NOKIA CORPORATION (1)
2005/0086,065 Automatic field completion in capacity-constrained media 5 2003
 
NOMURA PLATING CO., LTD. (1)
2005/0282,473 Surface treatment method for vacuum member 2 2005
 
OINGO, INC. (1)
6,453,315 Meaning-based information organization and retrieval 104 1999
 
ORACLE INTERNATIONAL CORPORATION (1)
6,314,419 Methods and apparatus for generating query feedback based on co-occurrence patterns 37 1999
 
S.L.I. SYSTEMS, INC. (1)
6,421,675 Search engine 408 1998
 
SUFFOLK TECHNOLOGIES, LLC (1)
6,178,419 Data access system 97 1998
 
VANTAGE TECHNOLOGY HOLDINGS, LLC (1)
5,890,103 Method and apparatus for improved tokenization of natural language text 69 1996
 
WEBMD, INC. (1)
6,289,353 Intelligent query system for automatically indexing in a database and automatically categorizing users 148 1999
 
WORDSTREAM, INC. (1)
2002/0002,452 Network-based text composition, translation, and document searching 36 2001
 
XEROX CORPORATION (1)
6,269,189 Finding selected character strings in text and providing information relating to the selected character strings 28 1998

Patent Citation Ranking

Forward Cites

Patent Info (Count) # Cites Year
 
GOOGLE INC. (1)
8,380,488 Identifying a property of a document 0 2007
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (1)
8,165,869 Learning word segmentation from non-white space languages corpora 0 2007
 
OTHER [CHECK PATENT PROFILE FOR ASSIGNMENT INFORMATION] (1)
8,442,965 Query language identification 0 2007

Maintenance Fees

Fee Large entity fee small entity fee micro entity fee due date
3.5 Year Payment $1600.00 $800.00 $400.00 Feb 9, 2015
7.5 Year Payment $3600.00 $1800.00 $900.00 Feb 9, 2019
11.5 Year Payment $7400.00 $3700.00 $1850.00 Feb 9, 2023
Fee Large entity fee small entity fee micro entity fee
Surcharge - 3.5 year - Late payment within 6 months $160.00 $80.00 $40.00
Surcharge - 7.5 year - Late payment within 6 months $160.00 $80.00 $40.00
Surcharge - 11.5 year - Late payment within 6 months $160.00 $80.00 $40.00
Surcharge after expiration - Late payment is unavoidable $700.00 $350.00 $175.00
Surcharge after expiration - Late payment is unintentional $1,640.00 $820.00 $410.00