Collaborative team crawling:Large scale information gathering over the internet

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 6182085
SERIAL NO

09086379

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A distributed collection of web-crawlers to gather information over a large portion of the cyberspace. These crawlers share the overall crawling through a cyberspace partition scheme. They also collaborate with each other through load balancing to maximally utilize the computing resources of each of the crawlers. The invention takes advantage of the hierarchical nature of the cyberspace namespace and uses the syntactic components of the URL structure as the main vehicle for dividing and assigning crawling workload to individual crawler. The partition scheme is completely distributed in which each crawler makes the partitioning decision based on its own crawling status and a globally replicated partition tree data structure.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

Patent OwnerAddress
INTERNATIONAL BUSINESS MACHINES CORPORATIONNEW ORCHARD ROAD ARMONK NY 10504

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Eichstaedt, Matthias San Jose, CA 33 2052
Ford, Daniel Alexander Los Gatos, CA 20 2102
Lehman, Tobin Jon Los Gatos, CA 13 953
Lu, Qi San Jose, CA 179 7628
Teng, Shang-Hua Champaign, IL 12 1198

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation