Systems and methods for retrieving tabular data from textual sources

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 5950196
SERIAL NO

08901234

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

Tables form an important kind of data element in text retrieval. Often, the gist of an entire news article or other exposition can be concisely captured in tabular form. Information other than the key words in a digital document can be exploited to provide the users with more flexible and powerful query capabilities. More specifically, the structural information in a document is exploited to identify tables and their component fields and let the users query based on these fields. Component fields can include table lines, caption lines, row headings, column headings, or other table components. Empirical results have demonstrated that heuristic method based table extraction and component tagging can be performed effectively and efficiently. Moreover, experiments in retrieval using the system of the present invention strongly indicate that such structural decomposition can facilitate better representation of user's information needs and hence more effective retrieval of tables.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

  • OPEN TEXT S.A.;DATAWARE TECHNOLOGIES, INC.

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Croft, W Bruce Amherst, MA 1 92
Pyreddy, Pallavi Chicago, IL 2 98

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation