Rasa: what is Actual working of epochs?

Created on 17 May 2018  路  1Comment  路  Source: RasaHQ/rasa

    agent.train(
            training_data,
            epochs=400,
            batch_size=100,
            validation_split=0.2
    )

Issue:
Here in Restaurant example, there is agent.train.

  1. Here what is the working of epochs? what if I will modify it to 400 to 2. what will be the difference of 400 vs 2 epchos
  2. What is batch_size? same what if I modify if from 100 to 3 or 100 to 500? what will be the difference?
  3. what is validation_split? what if I modify it from 0.2 to 0.5? what is the range of validation_split?

Most helpful comment

Epochs is the number of times the machine learning model will see each training example during the training. When training the model, we go through all the training stories and train the models, and we repeat that process for 400 times (or for 2 if you set it to that value).

Batch size is the number of examples we look at at the same time. So instead of looking at one example and updating the model, we will look at 100 examples and after that update the model once (instead of a hundred times) - this makes sure that outlier examples do not influence the model to much, as their influence as one of a hundred examples is a lot smaller as if we would have done a single update for that example.

Validation split is the percentage (so between 0 and 1, including both ends) of examples to put aside for testing the model. The percentage of examples put aside will not be used for training. Instead, once in a while during the training we evaluate the performance of the model on these examples. This is good practice, as it doesn't make to much sense to evaluate how good the model is on the training data, as the model already "knows" that data.

>All comments

Epochs is the number of times the machine learning model will see each training example during the training. When training the model, we go through all the training stories and train the models, and we repeat that process for 400 times (or for 2 if you set it to that value).

Batch size is the number of examples we look at at the same time. So instead of looking at one example and updating the model, we will look at 100 examples and after that update the model once (instead of a hundred times) - this makes sure that outlier examples do not influence the model to much, as their influence as one of a hundred examples is a lot smaller as if we would have done a single update for that example.

Validation split is the percentage (so between 0 and 1, including both ends) of examples to put aside for testing the model. The percentage of examples put aside will not be used for training. Instead, once in a while during the training we evaluate the performance of the model on these examples. This is good practice, as it doesn't make to much sense to evaluate how good the model is on the training data, as the model already "knows" that data.

Was this page helpful?
0 / 5 - 0 ratings

Related issues

nikhilcss97 picture nikhilcss97  路  3Comments

Poojan66 picture Poojan66  路  3Comments

johnson7788 picture johnson7788  路  3Comments

rayush7 picture rayush7  路  3Comments

nicolasfarina picture nicolasfarina  路  3Comments