Addons: Add "ReLU6, HardSwish, KAF, ELiSH, HardELiSH"

Created on 27 Jul 2020 · 7Comments · Source: tensorflow/addons

Describe the feature and the current behavior/state.

These activation functions are required by communities and are commonly used by up-to-date architectures.

ReLU6
HardSwish
KAF
ELiSH
HardELiSH

Relevant information

Are you willing to contribute it (yes/no): YES
Are you willing to maintain it going forward? (yes/no): YES
Is there a relevant academic paper? (if so, where): YES

ReLU6
HardSwish
KAF
ELiSH, HardELiSH

Is there already an implementation in another framework? (if so, where): torch, tf.nn
Was it part of tf.contrib? (if so, where): NO

Which API type would this fall under (layer, metric, optimizer, etc.)

tfa.activation

Who will benefit with this feature?

Researchers need up-to-date Activation

Any other info.

activations discussion needed

Source

marload

Most helpful comment

Yes what I meant is that if we could clear up this in the upstreaming process it could be easier to define for TFA (or the other way around).

bhack on 27 Jul 2020

👍2

All 7 comments

relu6 and swish are already in TF core. I haven't heard about any of the others. Can you point out any papers using them?

AakashKumarNain on 27 Jul 2020

As far as I know, HardSwish is not in Core. I think the link I attached will help you explain other Activation Function.

Thank You!

marload on 27 Jul 2020

HardSwish Is in Tensorflow lite

bhack on 27 Jul 2020

Thanks for the info. IMO, it is okay to add new activation functions but we also need to check the coverage, like how many times the paper for a corresponding activation has been cited. If the coverage for a any functionality is very low, it doesn't add much value to the ecosystem but rather increase the burden of maintenance.

Anyways, that's just my opinion. Let's hear what other maintainers have to say about this. cc: @seanpmorgan @WindQAQ

AakashKumarNain on 27 Jul 2020

👍1

@AakashKumarNain we was thinking something similar in https://github.com/tensorflow/community/pull/241/files#diff-24cf1a9d17b0988437751c2472f2c296R13 but it is still not expanded.
Probably we could expand on that and just lower the barrier for TFA.

bhack on 27 Jul 2020

👍1

@bhack I agree that the barrier for TFA should be low as compared to core TF. But we need to decide that factor IMO to make it more clear to the contributors

AakashKumarNain on 27 Jul 2020

👍1

Yes what I meant is that if we could clear up this in the upstreaming process it could be easier to define for TFA (or the other way around).

bhack on 27 Jul 2020

👍2

Was this page helpful?

0 / 5 - 0 ratings

Related issues

euclidean_distance_transform_op.cc build falied on GPU

facaiy · 3Comments

Implement Sparsemax Activation and Loss

seanpmorgan · 3Comments

How to use addons in Java/Scala

maziyarpanahi · 3Comments

WeightNormalization data init fails with Keras experimental_run_tf_function

seanpmorgan · 4Comments

Cannot compile with GPU Support

iskorini · 4Comments