Deepspeech: Release model checkpoint files

Created on 6 Dec 2017  路  11Comments  路  Source: mozilla/DeepSpeech

We have some audios which we want to use for training on the top of pre-trained deepspeech model. Is it possible to get the links for the release model's checkpoint files ?

Most helpful comment

All 11 comments

I would like to second this issue :+1:

It's going to happen, as soon as we can :)

Is there a time estimate as to when those will be available?

Is it possible to add on more words to it, without loosing the capability to loose old ones?

I guess with the new checkpoints, we also need to train with the old data + new data, instead of new data alone to add more words. Is there any possible way to retain the old knowledge, while training on new data alone?. So that as a result it can recognize all sort of things from old and new data.
Same question as @kapursu, when can we expect those checkpoints to be made available?

Is there no way of importing the parameters from the provided .pb file in deepspeech-0.1.0-models.tar.gz ?

Yea is there a way to fine tune given trained model (.pb file)? any success with it?

@arunpatala @bernardpazio The .pb file is a frozen version of the tensorflow graph that's optimized for inference and cannot be reused for transfer learning (i.e. additional training). To be able to train an already trained model with your data, you need checkpoints that haven't been released yet.

More info on frozen models can be found in tensorflow documentation about freezing.

@saikishor in theory, you can use only new data to fine tune the model's performance but you need to be careful to not overfit it for your custom data. Quite a few blogs describe different techniques for transfer learning in tensorflow (transfer learning blog to name one).
However what method is going to work well for deepspeech and for your use case e.g. adding just a few specialized English terms (like medical vocabulary), or switching to another language (like reusing high level features and retraining the rest for Spanish) will need to be tested and hopefully shared by the community with all params like learning speed, number of frozen layers etc.

Appologies for the lengthy text, it's probably more suitable for forums.

Clearly there is a way to reuse the frozen graph as described in deepspeech doc so ignore my previous comment.

@pvanickova
Thanks for the update.

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

yoann1995 picture yoann1995  路  49Comments

MalikMahnoor picture MalikMahnoor  路  79Comments

breandan picture breandan  路  41Comments

stes picture stes  路  154Comments

aaronzira picture aaronzira  路  29Comments