Espnet: Integrating waveglow with espnet fastspeech

Created on 24 Sep 2019 · 4Comments · Source: espnet/espnet

Hello, is anyone try this approach? This seems the fastest combination at this moment.
I've tried but it seems there is something wrong with my implementation, which generated inaudible audio (I can vaguely hear some words but it's far from wavenet option)
Sample
waveglow+fastspeech

Discussion

Source

enamoria

Most helpful comment

Did you use the exact same parameters for log-melspectrogram extraction? I guess the reason you got non intelligible speech is the feature mismatch between espnet and waveglow.

r9y9 on 24 Sep 2019

👍2

All 4 comments

Did you use the exact same parameters for log-melspectrogram extraction? I guess the reason you got non intelligible speech is the feature mismatch between espnet and waveglow.

r9y9 on 24 Sep 2019

👍2

Unfortunately, as @r9y9 said, hyperparamers of feature extraction is different.
It is necessary to fix the setting to use waveglow.

kan-bayashi on 24 Sep 2019

👍1

I integrated with the real-time neural vocoder ParallelWaveGAN.
You can try in Google Colab.
https://colab.research.google.com/github/espnet/notebook/blob/master/tts_realtime_demo.ipynb

kan-bayashi on 14 Nov 2019

Unfortunately, as @r9y9 said, hyperparamers of feature extraction is different.

> It is necessary to fix the setting to use waveglow.

Hi,
can you tell me, which parameters for waveglow should be changed?

EsmaeilFarhang on 15 Dec 2019

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Mandarin Chinese TTS with the input text in Chinese characters

vjdtao · 5Comments

RuntimeError: Error(s) in loading state_dict for Transformer: size mismatch for encoder.embed.0.weight: copying a param with shape torch.Size([43, 384]) from checkpoint, the shape in current model is torch.Size([37, 384]).

thrfdth · 4Comments

Reproduce SOTA TTS result on LJspeech

Syrup274 · 4Comments

Is it possible to reuse Librispeech LM without wordpieces model?

smolendawid · 4Comments

Implement TCEN for speech translation

CherrieWang97 · 4Comments