Models: MobileNet models for Text classification

Created on 7 Dec 2017 · 6Comments · Source: tensorflow/models

Given the new tensorflow lite and new upcoming mobile cpu hardware support for it, it would great to add pre-trained MobileNet models for binary (spam,ham) or multi-label Text Classification (CNN or Word2Vec), Sentences Embedding (Word2Vec), RNN Sentiment Classification, etc.
Also more tutorial how to train a MobileNet like model for text based tasks would be helpful.

lite help wanted feature

Source

loretoparisi

Most helpful comment

More examples sounds like a good idea.

/CC @nealwu @tfboyd

drpngx on 9 Dec 2017

❤1 🎉1 👍1

All 6 comments

MobileNets are convolutional network architectures for image classification, IIRC. You can of course make your own small recurrent or fully connected network for those tasks and push them onto your phone, but I'd say Word2Vec by itself is already too large to fit into most phone memory.

nguyeho7 on 7 Dec 2017

@nguyeho7 thanks for the clarification, this is a very challenging task. In fact regarding Word2Vec, Mikolov et al. used a quantization approach for Word2Vec embedding models, that should fit at runtime in recent phone.
By example the FastText language identification model (172 languages) features a 98.5% of accuracy with the standard model (subwords embedding) that is 350MB, and less than 1MB model with an acceptable accuracy. See here.
Following this approach, could be interesting to create MobileNet models for text based tasks then as well.

loretoparisi on 7 Dec 2017

More examples sounds like a good idea.

/CC @nealwu @tfboyd

drpngx on 9 Dec 2017

❤1 🎉1 👍1

FYI in case you haven't already seen it, we do have a MobileNet implementation here (search the page for "MobileNet").

nealwu on 12 Dec 2017

@nealwu thanks! It would be awesome to have in TF-slim text based tasks, since it is not so easy to translate/port the image based cnn tasks into rnn/cnn based classification tasks to run on mobilenet.

loretoparisi on 12 Dec 2017

Thanks for the suggestions on pretrained models to release. We are prioritizing various models to train as we get time. Will close this for now.