Vision: Add ProxylessNAS models

Created on 4 Aug 2019 · 11 comments · Source: pytorch/vision

ProxylessNAS is a method proposed in https://arxiv.org/abs/1812.00332, which aims to _directly_ and _efficiently_ search CNN architectures on the target task (CIFAR/ImageNet) and target hardware (CPU/GPU/Mobile).

Because of this direct specialization, ProxylessNAS is 1.8x faster than MobileNetV2 (at ResNet-50-level accuracy) and also has a better accuracy/speed trade-off on different hardware.

The code/models are already released on GitHub. I would like to create a pull request to contribute this model to torchvision. Are there any points I need to take care of?

Labels: awaiting response, models, needs discussion, classification

Most helpful comment

Hi @Lyken17, do you have any plan for releasing the trained CIFAR-10 models as well? It looks like the repository (https://github.com/mit-han-lab/ProxylessNAS) only contains the ImageNet models.

All 11 comments

Hi,

Thanks for raising this issue.

More recently, MobileNetV3 has been implemented. Do you think ProxylessNAS has advantages over MobileNetV3?

@fmassa

  1. Performance: Compared with MobileNetV3, ProxylessNAS's accuracy is still competitive even when tricks like SE / Swish / a special training schedule are not used.
    MobileNetV3 (75.2) / Proxyless-CPU (75.3) / Proxyless-GPU (75.1) / Proxyless-Mobile (74.6)
  2. Hardware Specialization: MobileNetV3 does not consider the underlying hardware, but it is known that each platform has different preferences; for example, GPUs are better at parallel computing while CPUs run at higher frequency. MobileNetV3 tends to build one general CNN for all platforms, but the figure below shows that customization can bring a 20~35% speed boost without loss of accuracy.

@fmassa Any suggestions for making a PR?

Hi @Lyken17

Sorry for the delay in replying, I was on holidays for the last couple of weeks.

I think it might be better for now to add those models to TorchHub (https://pytorch.org/hub).
TorchHub has been built exactly for that, and I think it would be a great fit for ProxylessNAS for now.

The training code for ProxylessNAS seems to be different from what we have in torchvision/references, which we try to enforce as much as possible.

Thoughts?
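
To make the mechanics concrete: a TorchHub entrypoint is just a hubconf.py at the repository root that declares its dependencies and exposes builder functions. A minimal sketch is below; the module path, builder name, and weight URL are hypothetical placeholders, not the actual ProxylessNAS code layout.

    # hubconf.py -- minimal sketch; the module path, builder name, and weight URL
    # are hypothetical placeholders, not the actual ProxylessNAS layout.
    import torch

    dependencies = ["torch"]

    def proxyless_cpu(pretrained=False, **kwargs):
        from proxyless_models import build_proxyless_cpu  # hypothetical builder
        model = build_proxyless_cpu(**kwargs)
        if pretrained:
            state_dict = torch.hub.load_state_dict_from_url(
                "https://example.com/proxyless_cpu.pth",  # placeholder URL
                progress=True)
            model.load_state_dict(state_dict)
        return model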


Any plan to release the search pipeline for ProxylessNAS?

@zhanghang1989 Sure, we love open source. The search code is now released in the same git repo.

@fmassa Sorry for coming back late; I was rushing for ICLR in the last few weeks.

As for the training settings, we are using:

  1. The standard data preprocessing (color jitter, random crop, random flip); see the sketch after this list
  2. Learning rate warmup for the first 5 epochs (to better fit distributed training; can be removed on a single machine)
  3. Cosine learning rate scheduling (whereas torchvision/references uses StepLR)
  4. Optional label smoothing for better performance.
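
For concreteness, item 1 ("standard data preprocessing") looks roughly like the sketch below; the crop size and jitter strengths are illustrative assumptions, not the exact values from our code.

    from torchvision import transforms

    # "Standard" ImageNet training preprocessing: random crop, random flip, color jitter.
    # Crop size (224) and jitter strengths (0.4) are illustrative assumptions.
    train_transform = transforms.Compose([
        transforms.RandomResizedCrop(224),
        transforms.RandomHorizontalFlip(),
        transforms.ColorJitter(brightness=0.4, contrast=0.4, saturation=0.4),
        transforms.ToTensor(),
        transforms.Normalize(mean=[0.485, 0.456, 0.406],
                             std=[0.229, 0.224, 0.225]),
    ])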

The key difference is that we are using a cosine LR schedule instead of the multi-stage strategy. As mentioned in the MnasNet paper [1], they decay the learning rate by 0.97 every 2.4 epochs. This schedule is specially designed for MBConv modules and hurts other models' performance (e.g., ResNet). Therefore, we choose the more general cosine schedule. It (1) performs as well as StepLR on traditional models like VGG and ResNet and (2) fully reproduces the accuracy of MobileNetV2. I think it is a fair setting. Let me know if you have any comments.
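
A rough sketch of that schedule (linear warmup for the first 5 epochs, then cosine decay); the base LR and epoch counts are illustrative, not our exact hyperparameters.

    import math
    import torch

    model = torch.nn.Linear(10, 10)       # stand-in for the real network
    optimizer = torch.optim.SGD(model.parameters(), lr=0.05, momentum=0.9)

    warmup_epochs, total_epochs = 5, 300  # illustrative values

    def lr_lambda(epoch):
        if epoch < warmup_epochs:
            # linear warmup from ~0 up to the base learning rate
            return (epoch + 1) / warmup_epochs
        # cosine decay from the base learning rate down to 0
        progress = (epoch - warmup_epochs) / (total_epochs - warmup_epochs)
        return 0.5 * (1.0 + math.cos(math.pi * progress))

    scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)

    for epoch in range(total_epochs):
        # ... train one epoch, then step the scheduler once per epoch ...
        scheduler.step()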


[1] https://arxiv.org/pdf/1807.11626.pdf MnasNet: Platform-Aware Neural Architecture Search for Mobile
[2] https://arxiv.org/pdf/1812.01187.pdf Bag of Tricks for Image Classification with Convolutional Neural Networks

@Lyken17 sounds good, thanks a lot for the information!

Can you first send a PR to torchhub https://github.com/pytorch/hub adding your models? I think it would be a better fit for now, and then we can reconsider adding ProxylessNAS into torchvision in a few months.

Hi @Lyken17, do you have any plan for releasing the trained CIFAR-10 models as well? It looks like the repository (https://github.com/mit-han-lab/ProxylessNAS) only contains the ImageNet models.

@fmassa How does the hubconf.py look? I have tested it locally and it loads without error:

model = torch.hub.load('mit-han-lab/ProxylessNAS', 'proxyless_cpu')

and the PR is https://github.com/pytorch/hub/pull/61. Let me know if you have any suggestions.
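
In case it helps, a quick sanity check of the loaded model (the 224x224 input size is an assumption based on the ImageNet models):

    import torch

    model = torch.hub.load('mit-han-lab/ProxylessNAS', 'proxyless_cpu')
    model.eval()
    with torch.no_grad():
        out = model(torch.randn(1, 3, 224, 224))
    print(out.shape)  # expect a [1, 1000] tensor of ImageNet logits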

PS: hub is indeed a cool feature. Love it!

Hi @gyeongin, we have no plan to clean up and release the CIFAR codebase. If you are interested in searching on CIFAR, you can have a look at Path-level ENAS, from which our CIFAR experiment settings are adapted.

As for pretrained models, we have shared them on Google Drive.

ProxylessNAS is now available via TorchHub and can be found at https://pytorch.org/hub/pytorch_vision_proxylessnas/

Closing this issue for now, will revisit it in 6 months.

Thanks for the discussion!
