fastText: Which algorithm is being used for the classification task?

Created on 11 May 2018 · 3 Comments · Source: facebookresearch/fastText

I have followed the references provided and read the code, and I understand how the word embeddings are generated. What I have not been able to work out is:

How is a sentence/document vector created from the individual word embeddings?
Which algorithm is used for the classification task (when running ./fasttext supervised and then ./fasttext predict)?

a11apurva

👍3

Most helpful comment

Hi @a11apurva,

A sentence/document vector is obtained by averaging the word/ngram embeddings.
For the classification task, multinomial logistic regression is used, where the sentence/document vector corresponds to the features. When applying fastText to problems with a large number of classes, you can use the hierarchical softmax to speed up the computation (with the command line option -loss hs).

Please note that the two weight matrices (corresponding to the word/ngram embeddings and the classifiers) are learned jointly.

Best,
Edouard.

EdouardGrave on 8 Jun 2018

👍5 ❤2
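
For readers who want to see the answer above in concrete terms, here is a minimal sketch of the forward pass it describes: average the word/ngram embeddings, then apply a softmax over linear scores (multinomial logistic regression). This is not fastText's actual code. The function names, the toy matrices, and the plain-Python lists are all assumptions made for illustration, and training (where both matrices are learned jointly) is left out.

```python
import math

def sentence_vector(token_ids, embeddings):
    """Average the embedding rows of the given word/ngram ids (assumed non-empty)."""
    dim = len(embeddings[0])
    total = [0.0] * dim
    for i in token_ids:
        for d in range(dim):
            total[d] += embeddings[i][d]
    return [x / len(token_ids) for x in total]

def softmax(scores):
    """Numerically stable softmax: subtract the max before exponentiating."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def predict_proba(token_ids, embeddings, classifier):
    """Multinomial logistic regression with the averaged vector as features."""
    h = sentence_vector(token_ids, embeddings)
    # One linear score per label: dot product of the label's weight row with h.
    scores = [sum(w * x for w, x in zip(row, h)) for row in classifier]
    return softmax(scores)

# Hypothetical toy parameters: 4 word/ngram ids, dimension 2, 3 labels.
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.5]]
classifier = [[2.0, 0.0], [0.0, 2.0], [-1.0, -1.0]]

probs = predict_proba([0, 2], embeddings, classifier)
print(probs)
```

The full softmax here costs time proportional to the number of labels for every example, which is the cost the hierarchical softmax option mentioned above is meant to reduce.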

Hi, in which file can I see the implementation of multinomial logistic regression? I only saw a binary logistic function in a file.