End-to-End Speech Recognition Adapted for Multi-Speaker Applications

United States of America

APP PUB NO 20240153508A1
SERIAL NO 18049712

Abstract

A system for performing end-to-end automatic speech recognition (ASR). The system is configured to collect a sequence of acoustic frames associated with a mixture of speech from multiple speakers. A multi-head encoder encodes each frame of the sequence into a likelihood of a transcription output and a likelihood of a speaker identity. The multi-head encoder thus produces a sequence of transcription-output likelihoods and a sequence of speaker-identity likelihoods corresponding to the sequence of acoustic frames. A decoder decodes these sequences by performing an alignment operation, producing a sequence of transcription outputs annotated with the identities of the speakers and thereby performing speaker separation.
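
The data flow described in the abstract can be illustrated with a small, hedged sketch. It is not the patented implementation: the linear heads, the weights, the `BLANK` symbol, and the greedy CTC-style collapse used as the "alignment operation" are all assumptions made for illustration, and every name (`multi_head_encode`, `greedy_decode`, the toy vocabulary) is hypothetical. The toy input also contains only one active speaker per frame, whereas the abstract covers a mixture of speech.

```python
import math
from typing import List, Sequence, Tuple

BLANK = "<b>"  # hypothetical blank symbol, as in CTC-style alignment (assumption)


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw scores into a probability distribution."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear(frame: Sequence[float], weights: Sequence[Sequence[float]]) -> List[float]:
    """One score per weight row: the dot product of that row with the frame."""
    return [sum(w * x for w, x in zip(row, frame)) for row in weights]


def multi_head_encode(
    frame: Sequence[float],
    token_weights: Sequence[Sequence[float]],
    speaker_weights: Sequence[Sequence[float]],
) -> Tuple[List[float], List[float]]:
    """Encode one frame into token likelihoods and speaker likelihoods.

    Two heads over the same frame stand in for the multi-head encoder.
    """
    token_probs = softmax(linear(frame, token_weights))
    speaker_probs = softmax(linear(frame, speaker_weights))
    return token_probs, speaker_probs


def greedy_decode(
    token_probs: Sequence[Sequence[float]],
    speaker_probs: Sequence[Sequence[float]],
    vocab: Sequence[str],
    speakers: Sequence[str],
) -> List[Tuple[str, str]]:
    """Take the best token and speaker per frame, collapse repeats, drop blanks."""
    output: List[Tuple[str, str]] = []
    previous = None
    for tok_p, spk_p in zip(token_probs, speaker_probs):
        token = vocab[max(range(len(tok_p)), key=tok_p.__getitem__)]
        speaker = speakers[max(range(len(spk_p)), key=spk_p.__getitem__)]
        current = (token, speaker)
        if token != BLANK and current != previous:
            output.append(current)
        previous = current
    return output


# Toy setup: the first three frame dimensions carry token evidence,
# the last two carry speaker evidence (purely illustrative).
VOCAB = [BLANK, "hi", "yo"]
SPEAKERS = ["spk1", "spk2"]
TOKEN_W = [[1, 0, 0, 0, 0], [0, 1, 0, 0, 0], [0, 0, 1, 0, 0]]
SPEAKER_W = [[0, 0, 0, 1, 0], [0, 0, 0, 0, 1]]

frames = [
    [0, 5, 0, 5, 0],  # "hi" from spk1
    [0, 5, 0, 5, 0],  # repeated frame, collapsed by the decoder
    [0, 0, 5, 0, 5],  # "yo" from spk2
    [5, 0, 0, 0, 5],  # blank frame
    [0, 0, 5, 0, 5],  # "yo" from spk2 again, kept because a blank intervened
]

encoded = [multi_head_encode(f, TOKEN_W, SPEAKER_W) for f in frames]
token_seq = [tok for tok, _ in encoded]
speaker_seq = [spk for _, spk in encoded]
print(greedy_decode(token_seq, speaker_seq, VOCAB, SPEAKERS))
# [('hi', 'spk1'), ('yo', 'spk2'), ('yo', 'spk2')]
```

The point of the sketch is the shape of the pipeline: each frame yields two likelihood vectors, and a single decoding pass over both sequences yields speaker-annotated transcription outputs.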

Patent Owner(s)

  • Owner owned or assignment not recorded

Inventor(s)

Inventor Name      Address        # of Filed Patents  Total Citations
Hori, Takaaki      Lexington, US  24                  371
Le Roux, Jonathan  Cambridge, US  56                  568
Moritz, Niko       Allston, US    14                  93
