Search and retrieval using document decomposition

United States of America Patent

PATENT NO 6397213
SERIAL NO 09311200

Abstract

Document query and search techniques in which documents to be searched are 'decomposed' into 'zones,' with each zone representing a grouping of text, a graphical image, or a combination thereof. The zones are defined within, and associated with, a document page. One or more zones in the documents are selected for annotation with text (e.g., keywords), image features, or a combination of both. Document query and search are based on a combination of text annotations and image features. In one implementation for operating a document retrieval system, an unindexed document (also referred to as a 'query' or 'search key' document) is captured into electronic form and decomposed into a number of zones. The zones can be segmented into text zones and image zones. Descriptors are formed for at least one of the zones. The descriptors can include text annotations for text zones, and text annotations and image features for image zones. Documents in a document database are searched based on the descriptors formed for the unindexed document and the descriptors for the documents in the database. At least one document in the database is identified as matching the unindexed document and reported as such.
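
The flow described in the abstract (zones, per-zone descriptors, and a database search over those descriptors) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the names `Zone`, `Document`, `zone_score`, `document_score`, and `search` are hypothetical, keyword overlap is measured with Jaccard similarity, image features are compared with cosine similarity, and the two are weighted equally for image zones. Capture and zone segmentation are not modeled; zones are supplied already annotated.

```python
from dataclasses import dataclass, field
from math import sqrt


@dataclass
class Zone:
    """One region of a page (hypothetical structure, not from the patent)."""
    kind: str                          # "text" or "image"
    keywords: frozenset = frozenset()  # text annotations
    features: tuple = ()               # image features; empty for text zones


@dataclass
class Document:
    doc_id: str
    zones: list = field(default_factory=list)


def jaccard(a, b):
    """Keyword overlap in [0, 1]; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cosine(u, v):
    """Cosine similarity; 0.0 for empty, zero-norm, or mismatched vectors."""
    if not u or not v or len(u) != len(v):
        return 0.0
    dot = sum(x * y for x, y in zip(u, v))
    norm = sqrt(sum(x * x for x in u)) * sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def zone_score(q, z):
    """Assumed scoring: zones of different kinds never match; image zones
    average keyword similarity and feature similarity."""
    if q.kind != z.kind:
        return 0.0
    text_sim = jaccard(q.keywords, z.keywords)
    if q.kind == "text":
        return text_sim
    return 0.5 * text_sim + 0.5 * cosine(q.features, z.features)


def document_score(query, candidate):
    """Average, over query zones, of the best-matching candidate zone."""
    if not query.zones or not candidate.zones:
        return 0.0
    best = (max(zone_score(q, z) for z in candidate.zones) for q in query.zones)
    return sum(best) / len(query.zones)


def search(query, database):
    """Return (doc_id, score) pairs for every database document, best first."""
    ranked = [(d.doc_id, document_score(query, d)) for d in database]
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)


# Illustrative data only: a two-document database and an unindexed query document.
database = [
    Document("invoice", [Zone("text", frozenset({"invoice", "total", "due"}))]),
    Document("report", [
        Zone("text", frozenset({"quarterly", "revenue"})),
        Zone("image", frozenset({"chart"}), (1.0, 0.0, 2.0)),
    ]),
]
query = Document("scan", [
    Zone("text", frozenset({"revenue", "quarterly"})),
    Zone("image", frozenset({"chart"}), (2.0, 0.0, 4.0)),
])
results = search(query, database)
```

In this toy data the query shares its keywords and the direction of its image feature vector with the "report" entry, so that entry ranks first and "invoice" scores zero. A real system would also need a reporting threshold and a zone segmentation step, both of which are left out here.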

Patent Owner(s)

Patent Owner        Address
RICOH COMPANY LTD   TOKYO

Inventor(s)

Inventor Name      Address             # of Filed Patents   Total Citations
Cullen, John F     Mountain View, CA   17                   1513
Hull, Jonathan J   San Carlos, CA      238                  13395
