USING NON-PARALLEL VOICE CONVERSION FOR SPEECH CONVERSION MODELS

United States of America

APP PUB NO 20250095639A1
SERIAL NO 18962686

Abstract

A method includes receiving a set of training utterances each including a non-synthetic speech representation of a corresponding utterance, and for each training utterance, generating a corresponding synthetic speech representation by using a voice conversion model. The non-synthetic speech representation and the synthetic speech representation form a corresponding training utterance pair. At each of a plurality of output steps for each training utterance pair, the method also includes generating, for output by a speech recognition model, a first probability distribution over possible non-synthetic speech recognition hypotheses for the non-synthetic speech representation and a second probability distribution over possible synthetic speech recognition hypotheses for the synthetic speech representation. The method also includes determining a consistent loss term for the corresponding training utterance pair based on the first and second probability distributions and updating parameters of the speech recognition model based on the consistent loss term.
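The consistency term described in the abstract can be illustrated with a minimal sketch. The abstract does not state which divergence is used, so the KL divergence here is an assumption, and the names `kl_divergence` and `consistency_loss` are hypothetical. Each inner list stands for one output step's probability distribution over recognition hypotheses.

```python
import math
from typing import Sequence


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL(p || q) for two discrete distributions over the same hypotheses."""
    if len(p) != len(q):
        raise ValueError("distributions must cover the same hypotheses")
    # Terms with p_i == 0 contribute nothing; q_i is floored at eps to avoid log(0).
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


def consistency_loss(
    non_synthetic_steps: Sequence[Sequence[float]],
    synthetic_steps: Sequence[Sequence[float]],
) -> float:
    """Average per-step divergence for one training utterance pair.

    Assumption: the loss is the mean KL divergence across output steps.
    """
    if not non_synthetic_steps or len(non_synthetic_steps) != len(synthetic_steps):
        raise ValueError("both inputs need the same, non-zero number of output steps")
    total = sum(
        kl_divergence(p, q) for p, q in zip(non_synthetic_steps, synthetic_steps)
    )
    return total / len(non_synthetic_steps)


# Toy example: two output steps, three hypotheses per step.
first = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]   # from the non-synthetic representation
second = [[0.6, 0.3, 0.1], [0.1, 0.7, 0.2]]  # from the synthetic representation
loss = consistency_loss(first, second)
```

In a real training loop the two distributions would come from the speech recognition model itself, and the loss would be backpropagated to update the model's parameters; this sketch only computes the scalar value.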

Patent Owner(s)

Patent Owner  Address
GOOGLE LLC    MOUNTAIN VIEW CA 94043

Inventor(s)

Inventor Name         Address            # of Filed Patents  Total Citations
Biadsy, Fadi          Mountain View, US  48                  1175
Ramabhadran, Bhuvana  Mt. Kisco, US      125                 2541
Rosenberg, Andrew M   Brooklyn, US       20                  52
Wang, Gary            Mountain View, US  54                  730
