Hello,
Why define blobs_lr and weight_decay twice in the conv layer?
layers {
  name: "conv1"
  type: CONVOLUTION
  blobs_lr: 1
  blobs_lr: 2
  weight_decay: 1
  weight_decay: 0
  ...
}
The first blobs_lr is for the convolution filter weights; the second blobs_lr is for the bias parameter. The two weight_decay entries map the same way.
Cf. the MNIST tutorial for why people use two different strategies for the learning rate:
blobs_lr are the learning rate adjustments for the layer's learnable parameters. In this case, we will set the weight learning rate to be the same as the learning rate given by the solver during runtime, and the bias learning rate to be twice as large as that - this usually leads to better convergence rates.
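For concreteness, here is a minimal sketch of a fuller conv1 definition in the same legacy V1 prototxt syntax. The bottom/top names, convolution_param values, and fillers are illustrative placeholders in the style of the MNIST LeNet example, not taken from the question. Each blobs_lr multiplies the solver's base learning rate and each weight_decay multiplies the solver's global weight decay, so with base_lr: 0.01 in the solver, the filters train at 0.01 while the bias trains at 0.02, and regularization is applied to the filters only:

layers {
  name: "conv1"
  type: CONVOLUTION
  bottom: "data"    # illustrative input blob name
  top: "conv1"
  blobs_lr: 1       # filter weights: lr = 1 x solver base_lr
  blobs_lr: 2       # bias: lr = 2 x solver base_lr
  weight_decay: 1   # filter weights: full weight decay
  weight_decay: 0   # bias: no weight decay
  convolution_param {
    num_output: 20  # illustrative values
    kernel_size: 5
    stride: 1
    weight_filler { type: "xavier" }
    bias_filler { type: "constant" }
  }
}

The repeated fields are positional: they line up one-to-one with the layer's learnable blobs (filters first, then bias), which is why the same field name appears twice.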