Is there any update on using TorchScript annotations (nn.Module -> torch.jit.ScriptModule, script_method, and trace) to load a transformer model without a Python interpreter, with end-to-end inference including beam search?
@myleott?
Something like this, end-to-end: https://twitter.com/Thom_Wolf/status/1151169470498582529
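For reference, a minimal sketch of the workflow being asked about, using the stock `nn.TransformerEncoder` as a stand-in (the actual fairseq model would need its own scripting/tracing work): trace the module, serialize it, and the saved artifact can then be loaded from C++ via `torch::jit::load` with no Python interpreter.

```python
import torch
import torch.nn as nn

# Toy encoder standing in for a real transformer model; dimensions
# are arbitrary examples.
model = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=512, nhead=8), num_layers=6
).eval()

example = torch.rand(10, 32, 512)  # (seq_len, batch, d_model)

# trace records the ops executed for this example input
traced = torch.jit.trace(model, example)

# the serialized module can be loaded from libtorch (C++),
# i.e. without a Python interpreter
traced.save("transformer_encoder.pt")
```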
@zhangguanheng66 I think the transformer is JIT-traceable, but does converting it actually decrease latency?
@gvskalyan Yes, the transformer module in the PyTorch core library is JIT-traceable, which should decrease latency, but I haven't benchmarked it yet.
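A rough micro-benchmark sketch for comparing eager vs. traced latency (toy shapes, CPU timing; the actual speedup depends on model size and hardware):

```python
import time
import torch
import torch.nn as nn

model = nn.Transformer(d_model=512, nhead=8).eval()
src = torch.rand(10, 32, 512)  # (src_len, batch, d_model)
tgt = torch.rand(20, 32, 512)  # (tgt_len, batch, d_model)

traced = torch.jit.trace(model, (src, tgt))

def bench(fn, n=100):
    # warm up, then average n forward passes
    with torch.no_grad():
        for _ in range(10):
            fn(src, tgt)
        start = time.perf_counter()
        for _ in range(n):
            fn(src, tgt)
    return (time.perf_counter() - start) / n

print(f"eager:  {bench(model):.4f}s per forward")
print(f"traced: {bench(traced):.4f}s per forward")
```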
Another fork: https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Translation/Transformer#changelog
See the June 2019 entry in the changelog: JIT support was added.
@gvskalyan @zhangguanheng66 I have tried BERT with JIT and benchmarked it: it runs about 25% faster on GPU.
Still working on fairseq JIT; beam search may be a big issue.
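The difficulty is that search has data-dependent control flow (a variable-length loop with an early exit on EOS), which `torch.jit.trace` cannot capture; it has to go through `torch.jit.script` instead. A minimal sketch of the scriptable pattern, with a toy projection matrix `proj` standing in for a real decoder (hypothetical names; greedy search, i.e. beam size 1, shown for brevity):

```python
import torch

@torch.jit.script
def greedy_decode(proj: torch.Tensor, start: int, eos: int,
                  max_len: int) -> torch.Tensor:
    tokens = torch.full([1], start, dtype=torch.long)
    for _ in range(max_len):
        # one-hot embed the last token and project to vocabulary logits
        one_hot = torch.zeros([proj.size(0)])
        one_hot[int(tokens[-1])] = 1.0
        next_tok = torch.argmax(one_hot @ proj).unsqueeze(0)
        tokens = torch.cat([tokens, next_tok])
        if int(next_tok) == eos:  # data-dependent break: scriptable, not traceable
            break
    return tokens

print(greedy_decode(torch.rand(10, 10), 0, 3, 20))
```

A real beam search additionally keeps the top-k partial hypotheses per step, but the scripting obstacle is the same control-flow pattern shown here.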
@Meteorix do you know where one might find a simple JIT transformer with beam search?
https://github.com/pytorch/translate I used this repo a couple of months ago.
@Meteorix have you seen a JIT-able LM with beam search?