Transformers: Fine tuned to Multi-choice dataset?

Created on 5 Dec 2018  路  7Comments  路  Source: huggingface/transformers

Is it posible to fine tuned to the multi choices problems , which usually has one passage, question and ABCD four options?

Most helpful comment

Yes it is, the code is not written yet but I'm planning to work on it. The idea is to format the input data the same way the authors of Improving Language Understanding with Unsupervised Learning

Multiple choice GPT

You run an inference (context, choice) for each choice, you compute the image of the [CLS] token by a linear layer with 1 output and then compute a softmax over the output of all choices.

I will try to create a PR with this code very soon.

hi锛孴he multi choices problem usually has one passage, question and ABCD four options銆侷n your model, dose context means passage&question ?

All 7 comments

Yes it is, the code is not written yet but I'm planning to work on it. The idea is to format the input data the same way the authors of Improving Language Understanding with Unsupervised Learning

Multiple choice GPT

You run an inference (context, choice) for each choice, you compute the image of the [CLS] token by a linear layer with 1 output and then compute a softmax over the output of all choices.

I will try to create a PR with this code very soon.

Thx for the reply.
Actually, I have the same plan. But I am not sure whether it will work. Anyway, I will have a try.

If it worked in the OpenAI paper, I don't really see why it wouldn't work with this architecture.

@Qzsl123 The code for multiple choice task is available in PR #96 if you want to test it.

@rodgzilla yeah, I am trying to run it. Thanks for the wonderful job!

Yes it is, the code is not written yet but I'm planning to work on it. The idea is to format the input data the same way the authors of Improving Language Understanding with Unsupervised Learning

Multiple choice GPT

You run an inference (context, choice) for each choice, you compute the image of the [CLS] token by a linear layer with 1 output and then compute a softmax over the output of all choices.

I will try to create a PR with this code very soon.

hi锛孴he multi choices problem usually has one passage, question and ABCD four options銆侷n your model, dose context means passage&question ?

Any update on this issue?

Was this page helpful?
0 / 5 - 0 ratings

Related issues

adigoryl picture adigoryl  路  3Comments

guanlongtianzi picture guanlongtianzi  路  3Comments

fabiocapsouza picture fabiocapsouza  路  3Comments

rsanjaykamath picture rsanjaykamath  路  3Comments

fyubang picture fyubang  路  3Comments