Efficient fuzzy match for evaluating data records

United States of America Patent

PATENT NO 7296011
APP PUB NO 20040260694A1
SERIAL NO 10600083

Abstract

To help ensure high data quality, data warehouses validate and, if needed, clean incoming data tuples from external sources. In many situations, input tuples or portions of input tuples must match acceptable tuples in a reference table. For example, the product name and description fields in a sales record from a distributor must match the pre-recorded name and description fields in a product reference relation. A disclosed system implements an efficient and accurate approximate, or fuzzy, match operation that can effectively clean an incoming tuple if it fails to match any tuple in the reference relation exactly. A disclosed similarity function that utilizes token substrings, referred to as q-grams, overcomes limitations of prior art similarity functions while efficiently performing a fuzzy match process.
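The idea of matching an input tuple against a reference relation through q-grams can be illustrated with a minimal Python sketch. It is not the similarity function disclosed in the patent: it substitutes plain Jaccard similarity over sets of q-grams, and the names `qgrams`, `tuple_qgrams`, `fuzzy_match`, the threshold value, and the sample product data are all hypothetical choices made for illustration.

```python
def qgrams(token, q=3):
    """Return the set of length-q substrings of a token (lowercased).

    Tokens no longer than q are kept whole as a single gram.
    """
    token = token.lower()
    if len(token) <= q:
        return {token}
    return {token[i:i + q] for i in range(len(token) - q + 1)}


def tuple_qgrams(fields, q=3):
    """Collect (column index, q-gram) pairs for every token in every field."""
    grams = set()
    for col, field in enumerate(fields):
        for token in field.split():
            for gram in qgrams(token, q):
                grams.add((col, gram))
    return grams


def jaccard(a, b):
    """Jaccard similarity of two sets (an assumed stand-in measure)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def fuzzy_match(input_tuple, reference, q=3, threshold=0.5):
    """Return (best reference tuple or None, its similarity score).

    This scans the reference relation linearly, which is fine for a
    toy example but is not an efficient lookup strategy.
    """
    input_grams = tuple_qgrams(input_tuple, q)
    best, best_score = None, 0.0
    for ref in reference:
        score = jaccard(input_grams, tuple_qgrams(ref, q))
        if score > best_score:
            best, best_score = ref, score
    if best_score >= threshold:
        return best, best_score
    return None, best_score


# Hypothetical product reference relation: (name, description).
REFERENCE = [
    ("Widget Pro 2000", "stainless steel widget"),
    ("Gadget Mini", "plastic pocket gadget"),
]

# An incoming record with typos in both fields.
match, score = fuzzy_match(("Widgit Pro 2000", "stainles steel widget"), REFERENCE)
```

Here the misspelled record still shares most of its q-grams with the first reference tuple, so it is matched to that tuple, while a record sharing no q-grams with any reference tuple yields `None`. Tagging each q-gram with its column index keeps a token in the name field from matching the same token in the description field, which is one possible design choice among several.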

Patent Owner(s)

Patent Owner: MICROSOFT TECHNOLOGY LICENSING LLC
Address: ONE MICROSOFT WAY, REDMOND, WA 98052

Inventor(s)

Inventor Name        Address         # of Filed Patents   Total Citations
Chaudhuri, Surajit   Redmond, WA     188                  6858
Ganjam, Kris         Seattle, WA     22                   393
Ganti, Venkatesh     Redmond, WA     43                   2178
Motwani, Rajeev      Palo Alto, CA   17                   705
