Hello guys
I'm wondering how to fix the seed to get reproducible results in my experiments.
Right now I'm using this function before the start of training:
import os
import random
import numpy as np
import torch

def seed_everything(seed=42):
    # Seed Python, NumPy, and PyTorch (CPU and all GPUs)
    random.seed(seed)
    os.environ['PYTHONHASHSEED'] = str(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)
    # Make cuDNN deterministic (may slow training down)
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
But it doesn't work: runs with the same seed still produce different results.
I run training in DDP mode, in case that's relevant.
Thanks in advance!
I also have the same problem without DDP mode.
What's your environment?
Could you set num_workers to 0 to see if it is related to the data loading? I had this problem before with regular PyTorch, and I think I solved it by also setting the seed in the data loading, because each worker subprocess has its own seed.
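For reference, here is a minimal sketch of that idea using the standard PyTorch recipe: a worker_init_fn that derives each worker's seed from the main process, plus an explicit generator for the DataLoader. The dataset variable and the seed value are placeholders for your own setup.

import random
import numpy as np
import torch
from torch.utils.data import DataLoader

def seed_worker(worker_id):
    # Derive this worker's seed from the main process seed
    worker_seed = torch.initial_seed() % 2**32
    np.random.seed(worker_seed)
    random.seed(worker_seed)

g = torch.Generator()
g.manual_seed(42)

loader = DataLoader(
    dataset,                  # placeholder: your Dataset instance
    batch_size=32,
    num_workers=4,
    worker_init_fn=seed_worker,
    generator=g,
)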
@awaelchli tried and failed
Is there a chance you could share a Colab with a minimal example? If not, I will try to reproduce it with the pl_examples this weekend when I get to it.
In my case, it is caused by dropout.
Seeding everything again in the spawned process before training basically fixes the problem.
You can do this in the on_train_start hook.
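A minimal sketch of that, assuming the seed_everything function defined above (MyModel is a placeholder for your LightningModule):

import pytorch_lightning as pl

class MyModel(pl.LightningModule):
    def on_train_start(self):
        # Re-seed inside the DDP-spawned process so dropout etc. is deterministic
        seed_everything(42)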