Hi,
in the appendix of Attention Is All You Need some very nice visualizations of the attention mechanism can be seen.
How can I create those visualizations for my own dataset/language pair?
Would be great if the authors can give some hints :)
LLion, who wrote the scripts to do the visualizations, is on vacation this week. I'll leave this open and ping when he's back.
@lukaszkaiser That would be great :)
@lukaszkaiser I too will wait for this. Thanks very much for the guidance
It seems that dot_product_attention (which is used in multihead_attention which is used in the Transformer) calls attention_image_summary which adds an image summary op, which should be written out to TensorBoard during training. Have you checked the image summaries tab in TensorBoard during training?
I meant these kind of visualizations:

:)
An iPython notebook with our viz code is included in the new release 1.2.0, give it a try!
Most helpful comment
LLion, who wrote the scripts to do the visualizations, is on vacation this week. I'll leave this open and ping when he's back.