For structured pruning as in FastFormers (https://github.com/microsoft/fastformers#pruning-models), we have to modify the transformers source code to support `attention_head_size`.
For example:
- configuration_bert.py: https://github.com/microsoft/fastformers/blob/main/src/transformers/configuration_bert.py#L128
- modeling_bert.py: https://github.com/microsoft/fastformers/blob/main/src/transformers/modeling_bert.py#L192 and https://github.com/microsoft/fastformers/blob/main/src/transformers/modeling_bert.py#L263
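For context, here is a minimal sketch of what that kind of modification could look like — not the exact FastFormers diff, just an illustration of turning `attention_head_size` into a first-class config attribute instead of always deriving it as `hidden_size // num_attention_heads`. The class names mirror transformers; the `attention_head_size` constructor argument is the assumed addition:

```python
from torch import nn
from transformers import PretrainedConfig


class BertConfig(PretrainedConfig):
    # sketch: attention_head_size is the assumed new argument
    def __init__(self, hidden_size=768, num_attention_heads=12,
                 attention_head_size=None, **kwargs):
        super().__init__(**kwargs)
        self.hidden_size = hidden_size
        self.num_attention_heads = num_attention_heads
        # fall back to the usual derivation when the key is absent,
        # so unpruned checkpoints keep working unchanged
        self.attention_head_size = (
            attention_head_size if attention_head_size is not None
            else hidden_size // num_attention_heads
        )


class BertSelfAttention(nn.Module):
    def __init__(self, config):
        super().__init__()
        self.num_attention_heads = config.num_attention_heads
        # read the (possibly pruned) head size from the config instead of
        # recomputing hidden_size // num_attention_heads
        self.attention_head_size = config.attention_head_size
        self.all_head_size = self.num_attention_heads * self.attention_head_size
        # Q/K/V projections are sized by all_head_size, so a smaller
        # attention_head_size actually shrinks the weight matrices
        self.query = nn.Linear(config.hidden_size, self.all_head_size)
        self.key = nn.Linear(config.hidden_size, self.all_head_size)
        self.value = nn.Linear(config.hidden_size, self.all_head_size)
        # (forward pass omitted; only the shape wiring changes)
```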
Is it possible to set `attention_head_size` from outside, e.g. via config.json?
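For instance, something like the following is the goal — a hedged sketch assuming the modified `BertConfig` above. The checkpoint path is hypothetical, and the extra JSON key is exactly the feature being asked about (note that stock transformers would store the key on the config object but ignore it; only modified modeling code would actually use it):

```python
from transformers import BertConfig, BertModel

# "path/to/pruned-model" is a hypothetical local checkpoint whose config.json
# carries the extra key, e.g.:
#   { "hidden_size": 768, "num_attention_heads": 8, "attention_head_size": 64 }
config = BertConfig.from_pretrained("path/to/pruned-model")
print(config.attention_head_size)  # 64, taken straight from config.json
model = BertModel.from_pretrained("path/to/pruned-model", config=config)
```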
@dsindex FYI, I am working on a PR that includes this feature as part of the effort in #8083.
@ykim362 Great! Closing the issue here.