Addons: Please add HashingMemory layer

Created on 24 Jul 2019 · 10Comments · Source: tensorflow/addons

System information

TensorFlow version (you are using): 2b1
TensorFlow Addons version: latest
Is it in the tf.contrib (if so, where): no
Are you willing to contribute it (yes/no): maybe
Are you willing to maintain it going forward? (yes/no): maybe

Describe the feature and the current behavior/state.
This paper introduces a structured memory which can be easily integrated into a neural
network. The memory is very large by design and therefore significantly increases the capacity of the architecture, by up to a billion parameters with a negligible computational overhead.
Its design and access pattern is based on product keys, which enable fast and exact nearest
neighbor search. The ability to increase the number of parameters while keeping the same
computational budget lets the overall system strike a better trade-off between prediction accuracy and computation efficiency both at training and test time. This memory layer allows us to tackle very large scale language modeling tasks. In our experiments we consider a dataset with up to 30 billion words, and we plug our memory layer in a state-of-the-art transformer-based
architecture. In particular, we found that a memory augmented model with only 12 layers
outperforms a baseline transformer model with 24 layers, while being twice faster at inference
time. We release our code for reproducibility purposes.
https://arxiv.org/pdf/1907.05242v1.pdf
https://github.com/facebookresearch/XLM/blob/master/src/model/memory/memory.py
Will this change the current api? How?
yeah, new layer with lots of memory for the model
Who will benefit with this feature?
people who use TFA + Keras api
Any Other info.
i like pie

Feature Request layers

Source

bionicles

Most helpful comment

I would wanna try understanding the paper and try implementing this layer if no one is working on this issue.

sayoojbk on 19 Aug 2019

👍3

All 10 comments

Hi @bionicles this looks like a really interesting paper/concept. It seems like it could be a fit as a TFA Layer.

You mentioned you may be interested in contributing what would that be dependent upon?

seanpmorgan on 25 Jul 2019

Just time, and my ability to understand the paper/code...
This one also looks good: https://arxiv.org/pdf/1907.09720v1.pdf
I’m definitely interested to contribute to TFA! Maybe some simpler stuff we already have working would be better in the short term

bionicles on 25 Jul 2019

👍3

I would wanna try understanding the paper and try implementing this layer if no one is working on this issue.

sayoojbk on 19 Aug 2019

👍3

Any update on this?

gaceladri on 26 Apr 2020

Sorry I was busy with some projects and could not finish the work on this. If you are looking to contribute to this go forward, it would be really helpful :D . If not I might try to pull out some time and look back to implementing it.

sayoojbk on 26 Apr 2020

Likewise, I can’t do this now, but it would be cool!

bionicles on 27 Apr 2020

Hi, @bionicles @sayoojbk @Squadrick I want to take up this issue if it's okay. Thanks

gaurav-singh1998 on 5 May 2020

🚀1 👍1

Hi, @bionicles @sayoojbk @Squadrick I want to take up this issue if it's okay. Thanks

Sure @gaurav-singh1998 you can move forward with this. If need any help ping anyone of us on gitter.

sayoojbk on 5 May 2020

Hello @sayoojbk as I am new in this repository I may take some time to get acquainted with the code base and finally come up with a PR. Is it okay?

gaurav-singh1998 on 5 May 2020

Yea take your time! If need any help ping on the official gitter channel for SIG-addons :P

sayoojbk on 5 May 2020

👍1

Was this page helpful?

0 / 5 - 0 ratings

Related issues

WeightNormalization data init fails with Keras experimental_run_tf_function

seanpmorgan · 4Comments

Add contribution guideline for moving from tf.contrib

seanpmorgan · 3Comments

CohenKappa reset_states distributed training setup errors.

n3011 · 4Comments

BeamSearchDecoder with non LSTM cells raises ValueError exception

jimthompson5802 · 3Comments

Merging tfa.callbacks.tqdm_progress_bar with tqdm.keras

shun-lin · 4Comments