Does every tower batch-norm its own part of the batch (in multi-GPU mode), or are the Wx+b outputs of all towers concatenated so that batch norm is computed over batch*num_GPU examples?
The latter may be much slower due to the synchronization.
Each tower performs batch_norm on its own part of the batch; there is no synchronization across towers for that.
@sun9700: Please reopen if that doesn't answer your question.
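For illustration, here is a minimal TF1-style sketch of the usual tower setup (the names `build_tower`, `num_gpus`, and the toy conv layer are hypothetical, not from this thread): the gamma/beta and moving-statistics variables are shared across towers via variable reuse, but each tower's `batch_normalization` computes its normalization statistics only over its own shard of the batch.

```python
import tensorflow as tf

num_gpus = 2
images = tf.placeholder(tf.float32, [None, 32, 32, 3])

def build_tower(x, is_training):
    # batch_normalization computes its batch statistics over the examples it
    # actually sees, i.e. only this tower's shard of the global batch.
    net = tf.layers.conv2d(x, 64, 3, padding='same', name='conv1')
    net = tf.layers.batch_normalization(net, training=is_training, name='bn1')
    return tf.nn.relu(net)

shards = tf.split(images, num_gpus)  # one shard of the batch per GPU
tower_outputs = []
with tf.variable_scope(tf.get_variable_scope()):
    for i, shard in enumerate(shards):
        with tf.device('/gpu:%d' % i), tf.name_scope('tower_%d' % i):
            tower_outputs.append(build_tower(shard, is_training=True))
            # Share gamma/beta/moving_mean/moving_variance (and conv weights)
            # across towers; the per-batch statistics stay per-tower.
            tf.get_variable_scope().reuse_variables()
```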
Does it mean the moving_mean and moving_variance on each tower will potentially be updated to different values even when the variables are shared across towers?
When we save the model, which tower's moving_mean/variance is saved?
Is there a way to handle this correctly?
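Not an official answer from this thread, just a common workaround seen in multi-GPU training code: since the moving_mean/moving_variance variables are shared, there is only one copy of them, and the saver simply stores whatever value the last applied update left behind. To make that update consistent, many setups run the batch-norm update ops from a single tower only. The sketch below assumes the hypothetical tower setup from the earlier snippet (`tower_0` name scope); `train_step` is a placeholder for your optimizer op.

```python
import tensorflow as tf

# Collect the assign ops for moving_mean/moving_variance that the first
# tower added to the UPDATE_OPS collection, and run them alongside the
# train op, so the shared variables get one consistent update per step.
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS, scope='tower_0')
train_op = tf.group(train_step, *update_ops)  # train_step: your optimizer op (placeholder)
```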