Fairseq: Any performance comparison between pre-norm and post-norm for Transformer on Machine Translation

Created on 17 Jun 2020  路  3Comments  路  Source: pytorch/fairseq

As the name implies, can you provide any performance comparison between pre-norm and post-norm performance comparison using a transformer on Machine Translation Dataset?

question

Most helpful comment

As the name implies, can you provide any performance comparison between pre-norm and post-norm performance comparison using a transformer on Machine Translation Dataset?

You can refer to the ACL2019 paper https://arxiv.org/abs/1906.01787, which is based on fairseq.

All 3 comments

What is the pre-norm and post-norm?

As the name implies, can you provide any performance comparison between pre-norm and post-norm performance comparison using a transformer on Machine Translation Dataset?

You can refer to the ACL2019 paper https://arxiv.org/abs/1906.01787, which is based on fairseq.

Have a look at https://github.com/wangqiangneu/dlcl to reproduce the results.

Was this page helpful?
0 / 5 - 0 ratings