Automatic separation of text from background in scanned images of complex documents

United States of America Patent

PATENT NO 5280367
SERIAL NO 07705838

Abstract

A system that converts a scanned image of a complex document into an image in which the text is preserved and separated from the background. The system first subdivides the scanned image into blocks, then examines each block pixel by pixel to construct a histogram of the pixels' gray scale values. The histogram is partitioned into first, middle, and last regions. If one or more peaks occur in the first and last regions and a single peak occurs in the middle region, the pixels are reexamined to determine how often pixels at the gray scale level of the middle peak occur near pixels at the level of a first-region peak. If this frequency is high, the middle peak is assumed to be background information. Once the threshold has been determined, the system rescans the block, applying the threshold to separate the text from the background information within the block.
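
The per-block procedure described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not state the region boundaries, how peaks are detected, what counts as a "high" frequency, or how the threshold is placed, so the `split`, `tol`, and `min_adjacency` parameters, the use of the most frequent level in each region as its peak, the midpoint thresholds, and the convention that low gray values are dark are all assumptions made for illustration. All function names are hypothetical.

```python
def histogram(block):
    """Count occurrences of each gray level (0..255) in a block (list of rows)."""
    hist = [0] * 256
    for row in block:
        for value in row:
            hist[value] += 1
    return hist


def region_peak(hist, lo, hi):
    """Most frequent level in [lo, hi), or None if the region is empty.
    Assumption: the dominant level stands in for the region's peak."""
    level = max(range(lo, hi), key=lambda g: hist[g])
    return level if hist[level] > 0 else None


def adjacency_frequency(block, level, neighbor_level, tol):
    """Fraction of pixels near `level` that have a 4-connected neighbor
    near `neighbor_level` (both within +/- tol)."""
    height, width = len(block), len(block[0])
    total = near = 0
    for y in range(height):
        for x in range(width):
            if abs(block[y][x] - level) > tol:
                continue
            total += 1
            for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ny, nx = y + dy, x + dx
                if (0 <= ny < height and 0 <= nx < width
                        and abs(block[ny][nx] - neighbor_level) <= tol):
                    near += 1
                    break
    return near / total if total else 0.0


def choose_threshold(block, split=(85, 170), tol=10, min_adjacency=0.5):
    """Pick a threshold for one block. All parameter values are assumed."""
    hist = histogram(block)
    first = region_peak(hist, 0, split[0])
    middle = region_peak(hist, split[0], split[1])
    last = region_peak(hist, split[1], 256)
    if first is None or middle is None or last is None:
        return None  # cases the abstract does not describe
    freq = adjacency_frequency(block, middle, first, tol)
    if freq >= min_adjacency:
        # Middle peak often touches first-region pixels: treat it as background.
        return (first + middle) // 2
    # Assumption: otherwise the middle peak is grouped with the first region.
    return (middle + last) // 2


def separate_text(block, threshold):
    """Rescan the block: levels below the threshold become 0, the rest 255."""
    return [[0 if value < threshold else 255 for value in row] for row in block]
```

With a block whose mid-gray pixels sit directly beside dark pixels, `choose_threshold` places the threshold between the first and middle peaks, so the mid-gray pixels end up on the background side after `separate_text`.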

Patent Owner(s)

Patent Owner               Address
HEWLETT-PACKARD COMPANY    PALO ALTO CA

Inventor(s)

Inventor Name      Address            # of filed Patents    Total Citations
Zuniga, Oscar A    Ft. Collins, CO    8                     267
