Keras-retinanet: Pretrained models for other backbone models

Created on 24 Feb 2018 · 25 Comments · Source: fizyr/keras-retinanet

Hi,

Thank you for the great work!
Is there any chance you may release the pretrained models for other backbone models, e.g. resnet101, resnet152 or mobilenet128_1.0, mobilenet128_0.75, mobilenet160_1.0? Currently we only have pretrained models for resnet50.

That would be super helpful for transfer learning. Otherwise, I might need to train on COCO from scratch.

Thanks a lot!

help wanted

ChengshuLi

All 25 comments

Considering that our resources for this project are limited, we don't provide pretrained models for the other architectures. If we happen to train these architectures on COCO in the future, we will probably make them publicly available. For now, your best bet is to start with ImageNet-trained weights and then fine-tune on COCO or your own dataset.

hgaiser on 24 Feb 2018

Cool. Thanks for letting me know!

ChengshuLi on 26 Feb 2018

I assigned the label help wanted. We'd be happy to add pretrained (COCO/Pascal) networks to this repository if they are provided to us, but there is a risk that the architecture changes, which would make those models obsolete. If that happens, we likely won't update the pretrained models (except for ResNet50 on COCO).

hgaiser on 26 Feb 2018

What parameters were used for training on COCO? batch_size=1 and flip_x augmentation? (If that matters.)

Yes, models might change a bit. It might be a good idea to use the official Keras repository models (from applications), the ones from https://github.com/keras-team/keras-contrib, or copy them directly into this repository (but we would still need to link to the ImageNet weights).

lvaleriu on 27 Feb 2018

> What parameters were used for training on COCO? batch_size=1 and flip_x augmentation? (If that matters.)

Yeah, only those.

hgaiser on 27 Feb 2018
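
The settings discussed here can be written down as a training command. What follows is only a minimal sketch: the script path and the `--backbone` option appear in this thread and the project README, while `--batch-size`, `--image-min-side` and `--image-max-side` are assumed flag names that may differ between keras-retinanet versions. The image sizes 800 and 1333 are the values reported in the next comment, and the COCO path is a placeholder. The command is only assembled, not executed.

```python
import shlex


def build_train_command(backbone, coco_path, batch_size=1,
                        image_min_side=800, image_max_side=1333):
    """Assemble (but do not run) an argv list for keras-retinanet's train.py.

    Flag names other than --backbone are assumptions; check them against
    `train.py --help` for the installed version.
    """
    return [
        "python", "keras_retinanet/bin/train.py",
        "--backbone", backbone,
        "--batch-size", str(batch_size),          # assumed flag name
        "--image-min-side", str(image_min_side),  # assumed flag name
        "--image-max-side", str(image_max_side),  # assumed flag name
        "coco", coco_path,                        # dataset type and its root directory
    ]


# "/path/to/MS/COCO" is a placeholder, not a real location.
command = build_train_command("mobilenet224_1.0", "/path/to/MS/COCO")
print(" ".join(shlex.quote(arg) for arg in command))
```

The flip_x augmentation mentioned above is not represented by a flag in this sketch, because the thread does not say how it is switched on.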

Started training on COCO (train2017 + val2017) using mobilenet224_1.0 + batch_size=1 + flip_x + image_min_side=800, image_max_side=1333 + NMS performed per class + FPN correction.

I'll keep updating this post with training results.

Epoch 2:
10000/10000 [==============================] - 2939s 294ms/step - loss: 3.7631 - regression_loss: 2.8676 - classification_loss: 0.8955

Epoch 4:
10000/10000 [==============================] - 2810s 281ms/step - loss: 3.3364 - regression_loss: 2.5441 - classification_loss: 0.7923

Epoch 6:
10000/10000 [==============================] - 2757s 276ms/step - loss: 3.1225 - regression_loss: 2.3957 - classification_loss: 0.7268

Epoch 8:
10000/10000 [==============================] - 4962s 496ms/step - loss: 2.9642 - regression_loss: 2.2937 - classification_loss: 0.6706

Epoch 11:
10000/10000 [==============================] - 3330s 333ms/step - loss: 2.7947 - regression_loss: 2.1787 - classification_loss: 0.6160

Epoch 13:
10000/10000 [==============================] - 28566s 3s/step - loss: 2.7168 - regression_loss: 2.1301 - classification_loss: 0.5867

Epoch 16:
10000/10000 [==============================] - 8350s 835ms/step - loss: 2.6045 - regression_loss: 2.0449 - classification_loss: 0.5596

Epoch 18:
10000/10000 [==============================] - 21270s 2s/step - loss: 2.5862 - regression_loss: 2.0303 - classification_loss: 0.5559

Epoch 20:
10000/10000 [==============================] - 2790s 279ms/step - loss: 2.4398 - regression_loss: 1.9232 - classification_loss: 0.5166

lvaleriu on 28 Feb 2018
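
To compare runs like the one above, it can help to pull the numbers out of the Keras progress lines instead of reading them by eye. This is a small sketch, assuming the log lines have exactly the format shown in the comment above; the function name `parse_progress_line` is made up for illustration. In the posted lines, the total loss equals the sum of the regression and classification losses, which the example checks.

```python
import re

# Matches the tail of a Keras progress line such as:
# "... - 2939s 294ms/step - loss: 3.7631 - regression_loss: 2.8676 - classification_loss: 0.8955"
LINE_RE = re.compile(
    r"(?P<seconds>\d+)s\s+\S+/step"
    r"\s+-\s+loss:\s+(?P<loss>[\d.]+)"
    r"\s+-\s+regression_loss:\s+(?P<regression_loss>[\d.]+)"
    r"\s+-\s+classification_loss:\s+(?P<classification_loss>[\d.]+)"
)


def parse_progress_line(line):
    """Return the epoch duration and losses from one progress line, or None."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    return {
        "seconds": int(match.group("seconds")),
        "loss": float(match.group("loss")),
        "regression_loss": float(match.group("regression_loss")),
        "classification_loss": float(match.group("classification_loss")),
    }


# The "Epoch 2" line copied from the comment above.
sample = ("10000/10000 [==============================] - 2939s 294ms/step"
          " - loss: 3.7631 - regression_loss: 2.8676 - classification_loss: 0.8955")
parsed = parse_progress_line(sample)
print(parsed)

# The total loss is the sum of the two component losses (up to rounding).
total = parsed["regression_loss"] + parsed["classification_loss"]
print(abs(total - parsed["loss"]) < 1e-4)
```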

@lvaleriu How is your MobileNet training going? My training on COCO using densenet169 as the backbone gives a mAP of only 0.028 at epoch 24.

panda9095 on 5 Mar 2018

@panda9095 Very bad. So I'll start again using the FPN correction.

lvaleriu on 6 Mar 2018

Actually it got merged into master.

hgaiser on 6 Mar 2018

👍1

Could I use the initial MobileNet weights from here?
https://github.com/experiencor/basic-yolo-keras
Would that work?

smehdia on 7 Mar 2018

@panda9095 Started training mobilenet on coco again. I'll update the previous comment with the results after each epoch.

lvaleriu on 7 Mar 2018

@panda9095 It seems better now.
@hgaiser Can you take a look at the learning progression? I've never trained resnet50 from scratch on COCO until now, so I don't have a reference for the learning curve.

lvaleriu on 8 Mar 2018

Here are the results after training mobilenet224_1.0 for 140+ epochs (keras-retinanet 0.2, batch_size=1).
Every epoch takes 60 min on my single 1080 Ti. GPU utilization is 90%+.
The red line is mobilenet224_1.0 and the orange line is res50_retinanet. It seems that the loss decreases very slowly.
The learning rate changes because I resumed training from epoch 100 using the --weights option.
[Image: screenshot, 2018-03-10 at 10:45:52 AM]

jjiunlin on 10 Mar 2018

@lvaleriu, can you please explain why we get 6 values of precision and recall?

sujeet-gandhi on 30 Aug 2018

As defined in http://cocodataset.org/#detection-eval, here are the 12 metrics:

lvaleriu on 4 Sep 2018
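
For reference, the 12 numbers in a COCO detection evaluation are 6 average precision (AP) values and 6 average recall (AR) values, which is why 6 of each are printed. A minimal sketch, assuming the order that pycocotools' `COCOeval.summarize()` uses for its `stats` array; the helper `label_coco_stats` is a made-up name for illustration, and the zeros are placeholders rather than real results.

```python
# Labels for the 12 COCO detection metrics, in the assumed order of
# pycocotools' COCOeval.stats (6 AP values followed by 6 AR values).
COCO_STAT_NAMES = [
    "AP @ IoU=0.50:0.95 | area=all    | maxDets=100",
    "AP @ IoU=0.50      | area=all    | maxDets=100",
    "AP @ IoU=0.75      | area=all    | maxDets=100",
    "AP @ IoU=0.50:0.95 | area=small  | maxDets=100",
    "AP @ IoU=0.50:0.95 | area=medium | maxDets=100",
    "AP @ IoU=0.50:0.95 | area=large  | maxDets=100",
    "AR @ IoU=0.50:0.95 | area=all    | maxDets=1",
    "AR @ IoU=0.50:0.95 | area=all    | maxDets=10",
    "AR @ IoU=0.50:0.95 | area=all    | maxDets=100",
    "AR @ IoU=0.50:0.95 | area=small  | maxDets=100",
    "AR @ IoU=0.50:0.95 | area=medium | maxDets=100",
    "AR @ IoU=0.50:0.95 | area=large  | maxDets=100",
]


def label_coco_stats(stats):
    """Pair a 12-element stats sequence with human-readable metric names."""
    if len(stats) != len(COCO_STAT_NAMES):
        raise ValueError("expected %d values, got %d"
                         % (len(COCO_STAT_NAMES), len(stats)))
    return dict(zip(COCO_STAT_NAMES, stats))


# Placeholder values only; substitute the stats array from a real evaluation.
labelled = label_coco_stats([0.0] * 12)
for name, value in labelled.items():
    print("%s = %.3f" % (name, value))
```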

👍1

@lvaleriu Thanks.

sujeet-gandhi on 5 Sep 2018

Actually, I need support for more powerful backbones, such as ResNeXt or SE-ResNeXt. Of course I tried it myself, but the performance dropped a little, while I expected it to be higher. Maybe that's because I used a custom dataset that contains about 10K images. I will train on COCO. If anyone has ideas for getting higher performance, I would be grateful.

VCBE123 on 8 Oct 2018

👍1

PRs for those backbones would be very welcome.

Pretraining on COCO sounds like the right thing to do; it also gives you a better measure of how well the backbone works.

hgaiser on 8 Oct 2018

Hey all!
I just tried to train a net with mobilenet160_0.75 as the backbone. I just added "--backbone mobilenet160_0.75" to the command provided in the README.md for training on CSV datasets. It throws an error while creating MobileNet in the keras-applications site-package. Did I forget an argument?

TimoK93 on 13 Nov 2018

That's better suited for a separate issue (also, mention the error, it helps to find the cause).

hgaiser on 13 Nov 2018

@lvaleriu could you please share the pretrained models for the mobilenet128_1.0 backbone with me?

scstu on 10 Apr 2019

👍3

For mobilenet, I saw keras-retinanet is used in vehicle detection:
https://github.com/yangliupku/retinanet_detection
Can someone merge it?

liminghuiv on 2 Jul 2019

I'm closing this in favor of https://github.com/fizyr/keras-retinanet/issues/1161

hgaiser on 4 Nov 2019

> Actually, I need support for more powerful backbones, such as ResNeXt or SE-ResNeXt. Of course I tried it myself, but the performance dropped a little, while I expected it to be higher. Maybe that's because I used a custom dataset that contains about 10K images. I will train on COCO. If anyone has ideas for getting higher performance, I would be grateful.

Where did you get the pretrained weights for ResNeXt? Which implementation of ResNeXt did you follow?

hasan-mh-aziz on 1 Dec 2019

> Actually, I need support for more powerful backbones, such as ResNeXt or SE-ResNeXt. Of course I tried it myself, but the performance dropped a little, while I expected it to be higher. Maybe that's because I used a custom dataset that contains about 10K images. I will train on COCO. If anyone has ideas for getting higher performance, I would be grateful.

Did you train ResNeXt on COCO? If so, could you please provide the pretrained model?