Wav2letter: [How can I use mel-spectrogram as features?]

Created on 28 Aug 2020  路  3Comments  路  Source: flashlight/wav2letter

Question

I have checked in Defines.cpp file, and just see wav2letter using mfsc, or mfcc feature, and don't have option for using mel-spectrogram as feature? I need use mel-spectrograms as feature?
And I wanna ask how can i use mel-spectrogram as feature in wav2letter, thanks you.

question

Most helpful comment

mfsc and mfcc are both variants of mel spec. mfcc is mfsc with a DCT. mfsc works fine.

All 3 comments

mfsc and mfcc are both variants of mel spec. mfcc is mfsc with a DCT. mfsc works fine.

Thanks lunixbochs
I know mfsc and mfcc are both variants of mel spec, but I don't deep understand the difference of each feature. My language has tone, so pitch feature is important. I have train my dataset with nemo(quartznet network) and transformer with espnet and both have fast converge and good result. But all feature they use is mel spectrogram, and I am not sure, problem when i train model with streaming convnet is feature or about difference about architecture of network.
Do you have any suggest?

Probably the issue with the architecture and its hyperparameters for your data. So you need to tweak model size, optimization to make it work with your data.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

abhinavkulkarni picture abhinavkulkarni  路  3Comments

gauenk picture gauenk  路  3Comments

JanX2 picture JanX2  路  5Comments

EdwinWenink picture EdwinWenink  路  4Comments

megharangaswamy picture megharangaswamy  路  5Comments