Classification of audio as speech or non-speech using multiple threshold values

Number of patents in Portfolio can not be more than 2000

United States of America Patent

PATENT NO 7249015
APP PUB NO 20060136211A1
SERIAL NO

11276419

Stats

ATTORNEY / AGENT: (SPONSORED)

Importance

Loading Importance Indicators... loading....

Abstract

See full text

A portion of an audio signal is separated into multiple frames from which one or more different features are extracted. These different features are used, in combination with a set of rules, to classify the portion of the audio signal into one of multiple different classifications (for example, speech, non-speech, music, environment sound, silence, etc.). In one embodiment, these different features include one or more of line spectrum pairs (LSPs), a noise frame ratio, periodicity of particular bands, spectrum flux features, and energy distribution in one or more of the bands. The line spectrum pairs are also optionally used to segment the audio signal, identifying audio classification changes as well as speaker changes when the audio signal is speech.

Loading the Abstract Image... loading....

First Claim

See full text

Family

Loading Family data... loading....

Patent Owner(s)

  • MICROSOFT TECHNOLOGY LICENSING, LLC

International Classification(s)

  • [Classification Symbol]
  • [Patents Count]

Inventor(s)

Inventor Name Address # of filed Patents Total Citations
Jiang, Hao Beijing, CN 145 1005
Zhang, Hong-Jiang Beijing, CN 109 4221

Cited Art Landscape

Load Citation

Patent Citation Ranking

Forward Cite Landscape

Load Citation