Transformers: LXMERT visual feature extraction during training/fine-tuning phase

Created on 20 Sep 2020  路  6Comments  路  Source: huggingface/transformers

馃殌 Feature request

Thanks a lot for releasing LXMERT model. In the LXMERT model code samples, the visual feature extraction code (using generalized faster-rcnn: modeling_frcnn) only in the inference step is given. However, the visual feature extraction during the training phase is not given. For this reason if we use the same code for fine-tuning, it raises NotImplementedError as the visual feature extraction during training is not implemented. Is it possible to share the visual feature extraction during training?

Most helpful comment

Haha, yes we only added the FRCNN for evaluation to accommodate lxmert in the demo. I'll add the training code sometime this week, and then post back here once it is done, in the future it may be useable as a publicly available model following the HF api, but for the time being ill just push the changes to where it is now.

All 6 comments

Yes, I also came up with this error. It would be great if the feature gets published. TIA.

Tagging LXMERT's implementation author @eltoto1219

Haha, yes we only added the FRCNN for evaluation to accommodate lxmert in the demo. I'll add the training code sometime this week, and then post back here once it is done, in the future it may be useable as a publicly available model following the HF api, but for the time being ill just push the changes to where it is now.

Thanks for the prompt feedback.
Looking forward to it.
@eltoto1219

@eltoto1219 thanks, that will be quite a help.

@eltoto1219 Looking forward to it.

Was this page helpful?
0 / 5 - 0 ratings