Hi :)
I have three questions:
1) I rescaled my images to [-1,1] because of the use of tanh in generator, is this not needed?
2) My images are 140x140. should I change the number of resnet's? And what is the intuition behind this?
3) I changed my batch_size to 8 to speed up and kept the InstanceNormalization. When making this change, should I change the learning rate also? And are there other things I should consider changing? Should I use batchNormalization instead?
Thanks in regards!
--netG resnet_6blocks.Thanks for your reply! Its really great that you are helping us out :)
I normalized my data to - 1,1 and set resnet to 6. I also tried to change real label to 0.9. But my discriminators loss is still going close to 0. Like 0.00011. Is this normal or did my system collapse?
Do you have any other implementation tips for the discriminator loss not going towards zero? Or some link to a place? Should i train generator more then discriminator?
Your training collapsed. You can remove 1-2 layers from D, or increase the learning rate of G.
Thanks ill try that out! Just one last question :)
Is it correct that you don't have any activation function in the last layer of the discriminator? And if understood correctly, why not? I thought it should have sigmoid?
It is implemented in the GANLoss class. See this line.
@kpagels : You have similar issue that I have before. Some comments may works
@junyanz
Instancenorm should be fine. You can slightly increase the learning rate. Some folks suggest sqrt(K) or K when the batch size is K. Using the same learning rate is also fine.
If my learning rate for batch size of 1 is 0.0002 then the batch size of 16 should use the learning rate of 0.0002 * sprt(16) = 0.0008? Am I right?
I am not sure what is best. Some folks suggest sqrt(K) or K when the batch size is K. Using the same learning rate is also fine.
Most helpful comment
--netG resnet_6blocks.