Transformers: Accuracy on classification task is lower than the official TensorFlow version

Created on 30 Nov 2018 · 2 Comments · Source: huggingface/transformers

Hi, I am running the same task with the same hyperparameters as the official Google TensorFlow implementation of BERT, but I am getting around 1.5% lower accuracy. Could you give me any hint about the possible cause?

Thanks!

ejld

All 2 comments

Hi!
Could the difference be due to different random seeds?
See, e.g., https://github.com/huggingface/pytorch-pretrained-BERT/issues/53#issuecomment-441565229

davidefiocco on 30 Nov 2018

👍1

Hi @ejld, yes, BERT has a large variance on many fine-tuning tasks (see also the discussion in #64).
You should try several different seeds (10, for example) and compare the mean and standard deviation of the results.

thomwolf on 30 Nov 2018

👍2
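
For anyone who wants to try the multi-seed comparison suggested above, here is a minimal sketch of the bookkeeping. It is not taken from either BERT codebase: `summarize_over_seeds` and `placeholder_run` are hypothetical names, and `placeholder_run` only returns a made-up number so that the script executes. In practice it would be replaced by code that seeds the framework's random number generators, fine-tunes the model, and returns the dev-set accuracy.

```python
import random
import statistics
from typing import Callable, Dict, Iterable, List


def summarize_over_seeds(
    run_fn: Callable[[int], float], seeds: Iterable[int]
) -> Dict[str, object]:
    """Call `run_fn` once per seed and summarize the returned scores."""
    scores: List[float] = [run_fn(seed) for seed in seeds]
    return {
        "scores": scores,
        "mean": statistics.mean(scores),
        # Sample standard deviation; it is undefined for a single run,
        # so fall back to 0.0 in that case.
        "std": statistics.stdev(scores) if len(scores) > 1 else 0.0,
    }


def placeholder_run(seed: int) -> float:
    """Hypothetical stand-in for one fine-tuning + evaluation run.

    The value returned here is NOT a real result. Replace the body with
    code that seeds your framework, fine-tunes, and returns accuracy.
    """
    rng = random.Random(seed)
    return 0.80 + rng.uniform(-0.02, 0.02)


if __name__ == "__main__":
    summary = summarize_over_seeds(placeholder_run, range(10))
    print(
        f"mean={summary['mean']:.4f} "
        f"std={summary['std']:.4f} "
        f"over {len(summary['scores'])} runs"
    )
```

Running the same summary for both implementations makes it possible to check whether a gap of about 1.5% is larger than the seed-to-seed spread before looking for a bug.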
