US Patent No: 5,864,863

Number of patents in Portfolio can not be more than 2000

Method for parsing, indexing and searching world-wide-web pages

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A system indexes Web pages of the Internet. The pages are stored in computers distributively connected to each other by a communications network. Each page has a unique URL (universal record locator). Some of the pages can include URL links to other pages. A communication interface connected to the Internet is used for fetching a batch of Web pages from the computers in accordance with the URLs and URL links. The URLs are determined by an automated Web browser connected to the communications interface. A parser sequentially partitions the batch of specified pages into indexable words where each word represents an indexable portion of information of a specific page, or the word represents an attribute of one or more portions of the specific page. The parser sequentially assigns locations to the words as they are parsed. The locations indicates the unique occurrences of the word in the Web. The output of the parser is stored in a memory as an index. The index includes one index entry for each unique word. Each index entry also includes one or more location entries indicating where the unique word occurs in the Web. A query module parses a query into terms and operators. The operators relate the terms. A search engine uses object-oriented stream readers to sequentially read location of specified index entries, the specified index entries correspond to the terms of a query. A display module presents qualified pages located by the search engine to users of the Web.

Loading the Abstract Image... loading....

First Claim

See full text

all claims..

Related Publications

Loading Related Publications... loading....

Patent Owner(s)

Patent OwnerAddressTotal Patents
YAHOO! INC.SUNNYVALE, CA1944

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Burrows, Michael Palo Alto, CA 94 3167

Cited Art Landscape

Patent Info (Count) # Cites Year
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (3)
5,640,558 Identifying and analyzing multiple level class relationships in an object oriented system by parsing source code without compilation 29 1995
5,581,758 Computer program product for object specification, generation, and management in a distributed database 104 1995
5,668,988 Method for mining path traversal patterns in a web environment by converting an original log sequence into a set of traversal sub-sequences 38 1995
 
CISCO TECHNOLOGY, INC. (1)
5,551,027 Multi-tiered indexing method for partitioned data 326 1995
 
COOPER UNION FOR THE ADVANCEMENT OF SCIENCE AND ART (1)
5,678,041 System and method for restricting user access rights on the internet based on rating information stored in a relational database 533 1995
 
FINANCIAL SYSTEMS TECHNOLOGY (INTELLECTUAL PROPERTY) PTY LTD (1)
5,652,882 Data processing system and method for detecting mandatory relations violation in a relational database 41 1995
 
HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P. (1)
5,440,744 Methods and apparatus for implementing server functions in a distributed heterogeneous environment 298 1994
 
JPMORGAN CHASE BANK, N.A. (1)
5,652,880 Apparatus and method for storing, retrieving and presenting objects with rich links 46 1994
 
OPEN INVENTION NETWORK, LLC (1)
5,649,186 System and method for a computer-based dynamic information clipping service 552 1995
 
ORACLE INTERNATIONAL CORPORATION (1)
5,280,610 Methods and apparatus for implementing data bases to provide object-oriented invocation of applications 136 1990

Patent Citation Ranking

Forward Cite Landscape

Patent Info (Count) # Cites Year
 
INTERNATIONAL BUSINESS MACHINES CORPORATION (49)
6,643,657 Computer system 13 1997
6,101,503 Active markup--a system and method for navigating through text collections 88 1998
6,266,060 Menu management mechanism that displays menu items based on multiple heuristic factors 58 1998
6,665,838 Web page thumbnails and user configured complementary information provided from a server 100 1999
6,356,908 Automatic web page thumbnail generation 101 1999
6,529,939 User-initiated maintenance of document locators 8 1999
6,823,341 Method, system and program for providing indexed web page contents to a search engine database 22 1999
6,539,371 System and method for filtering query statements according to user-defined filters of query explain data 23 2000
7,099,859 System and method for integrating off-line ratings of businesses with search engines 18 2000
7,080,064 System and method for integrating on-line user ratings of businesses with search engines 40 2000
6,931,397 System and method for automatic generation of dynamic search abstracts contain metadata by crawler 58 2000
6,785,670 Automatically initiating an internet-based search from within a displayed document 72 2000
6,584,469 Automatically initiating a knowledge portal query from within a displayed document 60 2000
6,567,801 Automatically initiating a knowledge portal query from within a displayed document 8 2000
8,478,732 Database aliasing in information access system 0 2000
7,127,450 Intelligent discard in information access system 5 2000
6,745,181 Information access method 29 2000
6,714,905 Parsing ambiguous grammar 44 2000
6,711,561 Prose feedback in information access system 24 2000
6,847,387 Menu management mechanism that displays menu items based on multiple heuristic factors 91 2001
7,136,846 Wireless information retrieval 12 2001
7,054,855 Method and system for performing a pattern match search for text strings 11 2001
6,990,494 Identifying links of interest in a web page 15 2001
7,343,372 Direct navigation for information retrieval 3 2002
8,290,768 System and method for determining a set of attributes based on content of communications 0 2002
7,073,121 Web page thumbnails and user configured complementary information provided from a server 17 2002
6,718,324 Metadata search results ranking system 219 2003
7,167,875 Web page thumbnails and user configured complementary information provided from a server 8 2003
7,243,095 Prose feedback in information access system 13 2003
8,296,304 Method, system, and program for handling redirects in a search engine 0 2004
7,499,913 Method for handling anchor text 6 2004
7,424,467 Architecture for an indexer with fixed width sort and variable width sort 10 2004
7,376,641 Information retrieval from a collection of data 10 2004
8,495,002 Software tool for training and testing a knowledge base 1 2004
7,644,057 System and method for electronic communication management 8 2004
7,461,064 Method for searching documents for ranges of numeric values 33 2004
8,417,693 Enforcing native access control to indexed documents 0 2005
7,840,894 Web page thumbnails and user configured complementary information provided from a server 3 2006
7,447,684 Determining searchable criteria of network resources based on a commonality of content 6 2006
7,743,060 Architecture for an indexer 3 2007
7,756,810 Software tool for training and testing a knowledge base 1 2007
7,752,159 System and method for classifying text 9 2007
7,895,232 Object-oriented twig query evaluation 0 2007
7,783,643 Direct navigation for information retrieval 1 2008
7,702,677 Information retrieval from a collection of data 11 2008
8,346,759 Searching documents for ranges of numeric values 1 2008
8,271,498 Searching documents for ranges of numeric values 1 2008
8,285,724 System and program for handling anchor text 0 2008
8,655,888 Searching documents for ranges of numeric values 0 2011
 
MICROSOFT CORPORATION (24)
6,366,956 Relevance access of Internet information services 72 1997
6,654,741 URL mapping methods and systems 55 1999
7,451,075 Compressed speech lexicon and method and apparatus for creating and accessing the speech lexicon 1 2000
7,139,709 Middleware layer between speech related applications and engines 3 2000
6,957,184 Context free grammar engine for speech recognition system 13 2000
6,931,376 Speech-related event notification system 6 2001
7,266,553 Content data indexing 24 2002
7,548,858 System and method for selective audible rendering of data to a user based on user input 4 2003
6,882,999 URL mapping methods and systems 18 2003
7,293,016 Index partitioning based on document relevance for document indexes 22 2004
7,962,843 Browser session overview 15 2004
7,774,721 Intelligent backward resource navigation 13 2004
7,089,189 Speech-related event notification system 7 2004
7,162,425 Speech-related event notification system 1 2004
7,177,807 Middleware layer between speech related applications and engines 1 2004
7,177,813 Middleware layer between speech related applications and engines 0 2004
7,155,392 Context free grammar engine for speech recognition system 2 2005
7,849,093 Searches over a collection of items through classification and display of media galleries 1 2005
7,206,742 Context free grammar engine for speech recognition system 3 2005
7,379,874 Middleware layer between speech related applications and engines 50 2006
7,987,189 Content data indexing and result ranking 0 2007
7,970,768 Content data indexing with content associations 0 2007
8,082,258 Updating an inverted index in a real time fashion 27 2009
8,281,259 Intelligent backward resource navigation 1 2010
 
GOOGLE INC. (15)
7,565,363 Search engine with user activity memory 25 2002
7,293,005 Pipelined architecture for global analysis and index building 14 2004
7,631,310 Loadbalancing multiple files across computing devices 18 2004
8,166,021 Query phrasification 4 2007
8,166,045 Phrase extraction using subphrase scoring 12 2007
7,925,655 Query scheduling using hierarchical tiers of index servers 17 2007
7,702,614 Index updating using segment swapping 16 2007
7,693,813 Index server architecture using tiered and sharded phrase posting lists 21 2007
7,783,626 Pipelined architecture for global analysis and index building 0 2007
8,453,153 Loadbalancing multiple files across computing devices 0 2009
8,090,723 Index server architecture using tiered and sharded phrase posting lists 8 2010
8,402,033 Phrase extraction using subphrase scoring 3 2011
8,527,538 Using geographic data to identify correlated geographic synonyms 1 2011
8,682,901 Index server architecture using tiered and sharded phrase posting lists 1 2011
8,700,583 Dynamic tiermaps for large online databases 0 2012
 
FRISKIT, INC. (10)
7,281,034 System and method for media playback over a network using links that contain control signals and commands 57 2000
7,228,305 Rating system for streaming media playback system 32 2000
6,721,741 Streaming media search system 69 2000
6,389,467 Streaming media search and continuous playback system of media resources located by multiple network addresses 353 2000
6,519,648 Streaming media search and continuous playback of multiple media resources located on a network 103 2000
7,010,537 Method and system for visual network searching 35 2001
6,484,199 Streaming media search and playback system for continuous playback of media resources through a network 101 2002
6,735,628 Media search and continuous playback of multiple media resources distributed on a network 103 2002
6,725,275 Streaming media search and continuous playback of multiple media resources located on a network 35 2002
7,469,283 Streaming media search and playback system 33 2004
 
ORACLE OTC SUBSIDIARY LLC (8)
7,035,864 Hierarchical data-driven navigation system and method for information retrieval 56 2000
7,617,184 Scalable hierarchical data-driven navigation system and method for information retrieval 11 2001
7,325,201 System and method for manipulating content in a hierarchical data-driven search and navigation system 37 2002
7,428,528 Integrated application for manipulating content in a hierarchical data-driven search and navigation system 15 2004
7,567,957 Hierarchical data-driven search and navigation system and method for information retrieval 11 2006
8,676,802 Method and system for information retrieval with clustering 0 2006
7,912,823 Hierarchical data-driven navigation system and method for information retrieval 9 2007
7,856,434 System and method for filtering rules for manipulating search results in a hierarchical search and navigation system 6 2007
 
YAHOO! INC. (7)
6,073,135 Connectivity server for locating linkage information between Web pages 70 1998
6,876,997 Method and apparatus for indentifying related searches in a database search system 152 2000
6,701,317 Web page connectivity server construction 114 2000
6,598,051 Web page connectivity server 65 2000
6,711,568 Method for estimating coverage of web search engines 45 2000
7,308,445 Method for estimating coverage of web search engines 4 2004
7,657,555 Method and apparatus for identifying related searches in a database search system 10 2005
 
BATTELLE MEMORIAL INSTITUTE (5)
7,106,329 Methods and apparatus for displaying disparate types of information using an interactive surface map 4 1999
6,990,238 Data processing, analysis, and visualization system for use with disparate data types 51 1999
6,898,530 Method and apparatus for extracting attributes from sequence strings and biopolymer material 3 1999
6,940,509 Systems and methods for improving concept landscape visualizations as a data analysis tool 2 2000
6,718,336 Data import system for data analysis system 19 2000
 
INFOGIN LTD. (5)
8,140,111 Methods and apparatus for analyzing, processing and formatting network information such as web-pages 4 2006
8,739,027 Methods and apparatus for enabling use of web content on various types of devices 0 2007
8,694,680 Methods and apparatus for enabling use of web content on various types of devices 0 2007
7,877,677 Methods and apparatus for enabling use of web content on various types of devices 2 2007
7,920,894 Methods and apparatus for analyzing, processing and formatting network information such as web-pages 8 2007
 
DEEP WEB, LLC (4)
7,716,303 Accessing deep web information associated with transportation services using a search engine 2 2006
7,730,160 Accessing deep web information associated with hospitality services using a search engine 1 2007
8,321,529 Accessing deep web information associated with transportation services using a search engine 2010
8,645,493 Accessing deep web information using a search engine 0 2012
 
OLASS CONSULTING AG, LLC (4)
6,516,337 Sending to a central indexing site meta data or signatures from objects on a computer network 187 1999
7,133,870 Index cards on network hosts for searching, rating, and ranking 39 2000
6,983,322 System for discrete parallel processing of queries and updates 74 2000
RE42051 Peer-to-peer automated anonymous asynchronous file sharing 0 2008
 
BIAP SYSTEMS, INC. (3)
7,313,588 Locally executing software agent for retrieving remote content and method for creation and use of the agent 19 2000
7,152,058 Apparatus for and method of selectively retrieving information and enabling its subsequent display 11 2001
7,613,790 Apparatus for and method of executing customized interactive computing services in a broadband network environment 0 2002
 
EVRESEARCH, LTD. (3)
6,175,830 Information management, retrieval and display system and associated method 117 1999
6,484,166 Information management, retrieval and display system and associated method 75 2000
RE42167 Information management, retrieval and display systems and associated methods 1 2003
 
GENERAL INVENTIONS INSTITUTE, INC. (3)
6,295,530 Internet service of differently formatted viewable data signals including commands for browser execution 215 1996
6,961,737 Serving signals 5 2001
6,826,565 Method and apparatus for serving files to browsing clients 179 2002
 
IPF, Inc. (3)
7,143,055 INTERNET-BASED SYSTEM FOR COLLECTING, MANAGING AND SERVING CONSUMER PRODUCT-RELATED INFORMATION OVER THE INTERNET USING TRADEMARKS AND UNIVERSAL RESOURCE LOCATORS (URLS) SYMBOLICALLY-LINKED BY MANUFACTURERS OF CONSUMER PRODUCTS AND/OR THEIR AGENTS 31 1997
8,065,201 Internet-based method of and system for managing and delivering consumer product information at points along the world wide web using consumer product information (CPI) requesting and graphical user interface (GUI) displaying subsystems driven by server-side components and managed by consumer product manufactures and/or authorized parties 19 2009
8,458,055 Internet-based method of and system for managing and delivering consumer product information at points along the world wide web using consumer product information (CPI) requesting and graphical user interface (GUI) displaying subsystems driven by server-side objects and managed by consumer product manufacturers and/or authorized parties 2 2011
 
SONY PICTURES ENTERTAINMENT INC. (3)
7,516,215 Device, method, and network system for replacing content 1 2005
7,949,791 Device and method for replacing terms in documents with hyperlinks 8 2009
8,195,796 Observation device, method, and computer program product for replacing content 1 2011
 
UNOTE INFORMATION SYSTEMS, L.L.C. (3)
6,924,828 Method and apparatus for improved information representation 49 2000
6,836,768 Method and apparatus for improved information representation 50 2000
7,882,115 Method and apparatus for improved information representation 3 2005
 
CONTEXTWEB, INC. (2)
7,945,476 Internet contextual advertisement delivery system 4 2001
7,912,752 Internet contextual communication system 2 2001
 
GATEWAY, INC. (2)
7,174,513 System and method for advanced network viewing 6 1999
6,670,974 Persistent usage context 54 1999
 
KAYAK SOFTWARE CORPORATION (2)
7,349,894 Method and apparatus for dynamic information connection search engine 29 2004
8,458,162 Purchase handoff in a travel search engine 0 2011
 
POINTOFDATA CORPORATION (2)
7,523,130 Storing and retrieving objects on a computer network in a distributed database 11 2005
8,301,634 Storing and retrieving objects on a computer network in a distributed database 1 2009
 
RAICHUR REVOCABLE TRUST, ARVIND A. AND BECKY D. RAICHUR, AND THEIR SUCCESSORS, AS TRUSTEE, UNDER TRUST AGREEMENT DATED NOVEMBER 12, 2007 (2)
7,613,992 Dynamic index and search engine server 3 2000
8,504,554 Dynamic index and search engine server 0 2007
 
SONY CORPORATION (2)
6,912,571 Method of replacing content 46 2000
8,244,703 Apparatus and method for computation of a pricing index 0 2005
 
SUPERMEDIA LLC (2)
7,653,870 System and method of dynamically generating index information 4 1999
8,402,358 System and method of dynamically generating index information 0 2009
 
VIZIBILITY, LLC (2)
7,831,609 System and method for searching, formulating, distributing and monitoring usage of predefined internet search queries 3 2010
8,280,879 System and method for quantifying visibility within search engines 0 2011
 
WEB.COM HOLDING COMPANY, INC. (2)
5,999,941 Database access using active server pages 64 1997
6,363,398 Database access using active server pages 57 1999
 
BAYARD CHIMNEY ROCK LLC (1)
RE42413 Web search engine with graphic snapshots 3 2005
 
EBAY INC. (1)
6,237,035 System and method for preventing duplicate transactions in an internet browser/internet server environment 66 1997
 
ELECTRICAL SYSTEMS & NETWORKS (ESN) LLC (1)
7,627,598 User interface and search system for internet and other applications 111 2006
 
ENDECA TECHNOLOGIES, INC. (1)
8,019,752 System and method for information retrieval from object collections with complex interrelationships 3 2005
 
EXCELLERATE, LLC (1)
6,278,992 Search engine using indexing method for storing and retrieving data 209 1999
 
FUJITSU LIMITED (1)
6,725,251 Local-file-transfer method and local-filed-transfer system for client-server system 3 1997
 
GENESIS PARK LP (1)
6,490,577 Search engine with user activity memory 79 2000
 
INFOSPACE.COM, INC. (1)
6,442,544 System and method for organizing search categories for use in an on-line search query engine based on geographic descriptions 28 1998
 
INNOVATIVE SOLUTIONS, INC. (1)
7,979,453 Efficient search result update mechanism 2 2007
 
INTEL CORPORATION (1)
6,070,176 Method and apparatus for graphically representing portions of the world wide web 74 1997
 
KABUSHIKI KAISHA TOSHIBA (1)
6,055,535 Information retrieving method and apparatus 17 1998
 
LENOVO (SINGAPORE) PTE LTD. (1)
6,405,192 Navigation assistant-method and apparatus for providing user configured complementary information for data browsing in a viewer context 36 1999
 
LIAISON TECHNOLOGIES, INC. (1)
6,782,505 Method and system for generating structured data from semi-structured data sources 5 1999
 
MICHAEL Z. MORICZ (1)
7,693,956 Accessing deep web information using a search engine 2 2003
 
MISSIONTREK LTD. CO. (1)
7,249,315 System and method of creating and following URL tours 36 2001
 
MOETTELI, JOHN B. (1)
7,315,848 Web snippets capture, storage and retrieval system and method 29 2001
 
NOVELL INTELLECTUAL PROPERTY HOLDINGS, INC. (1)
6,216,123 Method and system for rapid retrieval in a full text indexing system 89 1998
 
NOVELL, INC. (1)
6,697,801 Methods of hierarchically parsing and indexing text 21 2000
 
Oldham Associates, L.L.P. (1)
6,694,333 Systems and methods of referencing uniform resource locators 5 2001
 
ORACLE INTERNATIONAL CORPORATION (1)
6,772,141 Method and apparatus for organizing and using indexes utilizing a search decision table 52 1999
 
PULSEPOINT, INC. (1)
8,751,302 Method and system for placement and pricing of internet-based advertisements or services 0 2006
 
Radix Holdings, LLC (1)
7,831,581 Enhanced search 5 2005
 
REPUTATION.COM, INC. (1)
8,290,941 System and method for detecting changes within search results 1 2010
 
SHARP KABUSHIKI KAISHA (1)
6,098,069 Data managing method and data managing device using the same for manipulating data independently from networks 19 1998
 
SIEMENS MEDICAL SOLUTIONS USA, INC. (1)
7,343,385 System for processing objects for storage in a document or other storage system 2 2005
 
Siemens Medical Systems Health Services Inc. (1)
6,999,972 System for processing objects for storage in a document or other storage system 7 2002
 
SOUND VIEW INNOVATIONS, LLC (1)
6,993,591 Method and apparatus for prefetching internet resources based on estimated round trip time 34 1998
 
UNIVERSITY OF SOUTHERN CALIFORNIA (1)
6,606,625 Wrapper induction by hierarchical data analysis 78 2000
 
WebCriteria, Inc. (1)
6,360,235 Objective measurement and graph theory modeling of web sites 28 1999
 
YOURAMIGO LIMITED (1)
7,987,165 Indexing system and method 2 2000
 
Other [Check patent profile for assignment information] (4)
6,629,092 Search engine 36 1999
8,676,667 Parallel computer network and method for real time financial resource management, inventory control, and online purchasing 0 2010
8,069,162 Enhanced search indexing 1 2010
8,396,856 Database system and method for data acquisition and perusal 0 2010