Hi
Currently wav2letter support spectrogram augmentation (SAUG) and dither noise. What if I want to apply more augmentation? What is the proposed way of doing some augmentation say adding modeled noise (pink, babble, etc.), adding tempo and gain perturbations?
I also read in SpecAugment.h that currently time warping is not supported. Is it a WIP or there is no plan to implement this source of noise?
Thank you in advance
Since my question is about the same topic, I will add it here. Looking at the recipes, the architecture file for the streaming convnets includes SpecAugment as suggested by the paper. However, the config-file does not have SAUG flags (--use_saug and --saug_start_update). Is either/both of these flags unnecessary if the architecture file already includes SAUG?
@AVSuni
In case of transformer AM model we need to have warmup stage during which we don't use augmentation. To support this for now we solved it by added flags use_saug and saug_start_update. So in case you want to train model with warmup (and no augmentation during warmup) you just operate with these flags and spec augment flags and don't include SAUG layer in the arch file. If you don't do warmup you can simply specify SAUG layer in the arch.
Also now one can use more complicated combinations, like use augmentation at some specific period of time during training.
@tlikhomanenko
Is time warping applied in SAUG or not? I'm a little bit confused with the documentation.
@hajix, sorry for delay in answering.
About SpecAugment:
About other augmentations:
And always pull requests are welcome and appreciated!
Most helpful comment
@AVSuni
In case of transformer AM model we need to have warmup stage during which we don't use augmentation. To support this for now we solved it by added flags
use_saugandsaug_start_update. So in case you want to train model with warmup (and no augmentation during warmup) you just operate with these flags and spec augment flags and don't include SAUG layer in the arch file. If you don't do warmup you can simply specify SAUG layer in the arch.Also now one can use more complicated combinations, like use augmentation at some specific period of time during training.