Fairseq: Export a fairseq-trained transformer (base) model

Created on 21 Jan 2019 · 17 comments · Source: pytorch/fairseq

I trained a translation model based on Scaling Neural Machine Translation and wanted to convert it to Caffe2, but when I used the Docker image it threw an error:

[screenshot of the error]

According to https://github.com/pytorch/translate/issues/60, only RNN-trained models are supported?
What about transformer-trained models?
Should I retrain to export to Caffe2? @myleott

Is this in the works, or is there a problem with the command, i.e., is there some other command to export a transformer?
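
For context: the usual route from PyTorch to Caffe2 goes through ONNX. Below is a rough sketch of that generic path; model is a placeholder for a loaded fairseq transformer, and (as the rest of the thread explains) this naive export does not actually work for fairseq transformers.

    import torch

    # model is a placeholder for a trained fairseq transformer
    src_tokens = torch.randint(3, 100, (1, 10)).long()
    src_lengths = torch.LongTensor([10])
    prev_output_tokens = torch.LongTensor([[2]])  # 2 is the default fairseq EOS index

    # Standard PyTorch ONNX export; fails for fairseq transformers because
    # their forward() takes **kwargs (see the discussion below)
    torch.onnx.export(
        model,
        (src_tokens, src_lengths, prev_output_tokens),
        "transformer.onnx",
        input_names=["src_tokens", "src_lengths", "prev_output_tokens"],
    )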

Most helpful comment

hi @jmp84, I think you're the only one online who has succeeded in exporting a transformer to ONNX. Is there any chance of sharing the ONNX file on some drive, or some other way?
And maybe show your command and where you ran it. I'm not kidding when I say that I've seen no one, actually no one, succeed at this export!

All 17 comments

Hmm, I'm not sure of the latest status of ONNX export for Transformer models, but generally ONNX export is handled by pytorch/translate. Thoughts, @jmp84?

@gvskalyan, export for transformer is working for us. Have you tried training a transformer with pytorch/translate directly?

Yeah, I will train in translate and let you know. Closing for now.

It would be good to bring the whole translation domain under a single repository/code base (translate + fairseq); this would make it easier in the future to incorporate changes and representations like https://github.com/facebookresearch/LASER.

hi @jmp84, I think you're the only one online who has succeeded in exporting a transformer to ONNX. Is there any chance of sharing the ONNX file on some drive, or some other way?
And maybe show your command and where you ran it. I'm not kidding when I say that I've seen no one, actually no one, succeed at this export!

hi @myleott, I noticed that in pytorch_translate/ensemble_export.py there's a list of supported models:

https://github.com/pytorch/translate/blob/master/pytorch_translate/ensemble_export.py

and that's where I got stuck when trying to export my model.
Will more models be supported in the future, or does your team have such a plan?

@Charleshzhang, see https://github.com/pytorch/fairseq/pull/1620 for scripting transformer.
Please use fairseq instead of translate (some functionality in translate will be moved to fairseq and translate will be deprecated).

Thanks for the reply. I tried it today using the method in the export unit test and ran into this problem:

"Compiled functions can't take variable number of arguments "
"or use keyword-only arguments with defaults"

raised in https://github.com/pytorch/pytorch/blob/master/torch/jit/frontend.py at line 223

(py_args.kwarg is not supported)

So is this going to be supported in the future, or did I do something wrong in my code? Many thanks.

P.S. I am trying to export a 'transformer_vaswani_wmt_en_de_big' arch network to ONNX from a pre-built checkpoint file, and its base class has a forward function with the signature forward(..., **kwargs).
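
That error comes from the TorchScript frontend rejecting variadic arguments. A minimal standalone repro, with a hypothetical module whose forward takes **kwargs like the fairseq base class does:

    import torch

    class KwargsForward(torch.nn.Module):
        # a **kwargs parameter is what trips the TorchScript frontend
        def forward(self, x, **kwargs):
            return x

    # Raises NotSupportedError: "Compiled functions can't take variable
    # number of arguments or use keyword-only arguments with defaults"
    torch.jit.script(KwargsForward())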

@Charleshzhang ONNX will not be supported, we are only working with TorchScript (https://pytorch.org/docs/stable/jit.html).
You probably want to adapt the test_export_transformer function from https://github.com/pytorch/fairseq/pull/1620

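For reference, a minimal sketch of scripting a fairseq transformer along the lines of PR #1620, assuming a fairseq version where that PR has landed; the checkpoint and data paths are placeholders:

    import torch
    from fairseq.models.transformer import TransformerModel

    # Load a trained model; paths are placeholders
    hub = TransformerModel.from_pretrained(
        "/path/to/checkpoints",
        checkpoint_file="checkpoint_best.pt",
        data_name_or_path="/path/to/data-bin",
    )
    model = hub.models[0]
    model.eval()

    # Script and serialize; works only with the TorchScript-compatible transformer
    scripted = torch.jit.script(model)
    scripted.save("transformer_scripted.pt")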

Got it, thanks a lot!

Is there a way to export completely end-to-end, including the sequence generator, like in https://github.com/pytorch/translate/blob/master/pytorch_translate/ensemble_export.py#L1137?

@gvskalyan, we are working on an export solution for a simplified sequence generator that works with scripted models like the transformer. It's still under testing.

Hi!
@cndn how are you doing with these tests? Is there a way to try it myself? What would I have to do to export an MT model to TorchScript?

@jahutwb, hi, do you know how to export an MT model to TorchScript now? I still can't find a solution for this.

Look here: #2439

You can export an MT model with the following code: https://gist.github.com/jmp84/b98dfc5788b530d594220a552e4fdb27

@jmp84 In the gist, can you please tell me how to use the scripted_generator? It gets stuck here:

    import torch
    from fairseq.sequence_generator import SequenceGenerator

    # model, model_dict, args and eos are set up as in the gist above
    generator = SequenceGenerator([model], model_dict, beam_size=args.beam_size)
    src_tokens = torch.randint(3, 50, (2, 10)).long()
    src_tokens = torch.cat((src_tokens, torch.LongTensor([[eos], [eos]])), -1)
    src_lengths = torch.LongTensor([2, 10])
    sample = {
        "net_input": {"src_tokens": src_tokens, "src_lengths": src_lengths}
    }
    generator(sample)                                        # Working
    scripted_generator = torch.jit.script(generator)
    scripted_generator(sample)                               # Not Working

@gvskalyan could you open a separate issue with a repro (including model training and error messages, etc.)?
