Tensor2tensor: Attention visualization

Created on 12 Jul 2017 · 7 Comments · Source: tensorflow/tensor2tensor

Hi,

In the appendix of "Attention Is All You Need" there are some very nice visualizations of the attention mechanism.

How can I create those visualizations for my own dataset/language pair?

Would be great if the authors can give some hints :)

stefan-it

All 7 comments

Llion, who wrote the scripts to do the visualizations, is on vacation this week. I'll leave this open and ping when he's back.

lukaszkaiser on 13 Jul 2017

👍4

@lukaszkaiser That would be great :)

stefan-it on 17 Jul 2017

@lukaszkaiser I too will wait for this. Thanks very much for the guidance.

vishalnus on 20 Jul 2017

It seems that dot_product_attention (which is used by multihead_attention, which in turn is used by the Transformer) calls attention_image_summary. That function adds an image summary op, so the attention images should be written out to TensorBoard during training. Have you checked the image summaries tab in TensorBoard during training?
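
To make the idea of an attention image concrete, here is a minimal plain-Python sketch of how attention weights can be turned into pixel values. It is not the tensor2tensor code: the function names (attention_weights, to_grayscale, render_ascii), the toy inputs, and the scaling by the square root of the key depth are assumptions chosen for illustration, and the real summary op may lay out heads and colors differently.

```python
import math


def attention_weights(queries, keys):
    """Return softmax(Q K^T / sqrt(depth)), one row of weights per query."""
    depth = len(keys[0])
    weights = []
    for q in queries:
        logits = [sum(a * b for a, b in zip(q, k)) / math.sqrt(depth) for k in keys]
        peak = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(x - peak) for x in logits]
        total = sum(exps)
        weights.append([e / total for e in exps])
    return weights


def to_grayscale(weights):
    """Map each weight in [0, 1] to a pixel intensity in [0, 255]."""
    return [[int(round(w * 255)) for w in row] for row in weights]


def render_ascii(weights, shades=" .:-=+*#%@"):
    """Render the weight matrix as text; denser characters mean larger weights."""
    last = len(shades) - 1
    return "\n".join(
        "".join(shades[min(int(w * len(shades)), last)] for w in row)
        for row in weights
    )


# Hypothetical toy inputs: three query positions attending over two key positions.
queries = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
keys = [[4.0, 0.0], [0.0, 4.0]]

weights = attention_weights(queries, keys)
pixels = to_grayscale(weights)
print(render_ascii(weights))
```

Each row of the result corresponds to one query position and each column to one key position, which is the same layout as a typical attention heat map. For a real model you would take the weights from the trained network rather than compute them from toy vectors.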