Given the new tensorflow lite and new upcoming mobile cpu hardware support for it, it would great to add pre-trained MobileNet models for binary (spam,ham) or multi-label Text Classification (CNN or Word2Vec), Sentences Embedding (Word2Vec), RNN Sentiment Classification, etc.
Also more tutorial how to train a MobileNet like model for text based tasks would be helpful.
MobileNets are convolutional network architectures for image classification, IIRC. You can of course make your own small recurrent or fully connected network for those tasks and push them onto your phone, but I'd say Word2Vec by itself is already too large to fit into most phone memory.
@nguyeho7 thanks for the clarification, this is a very challenging task. In fact regarding Word2Vec, Mikolov et al. used a quantization approach for Word2Vec embedding models, that should fit at runtime in recent phone.
By example the FastText language identification model (172 languages) features a 98.5% of accuracy with the standard model (subwords embedding) that is 350MB, and less than 1MB model with an acceptable accuracy. See here.
Following this approach, could be interesting to create MobileNet models for text based tasks then as well.
More examples sounds like a good idea.
/CC @nealwu @tfboyd
FYI in case you haven't already seen it, we do have a MobileNet implementation here (search the page for "MobileNet").
@nealwu thanks! It would be awesome to have in TF-slim text based tasks, since it is not so easy to translate/port the image based cnn tasks into rnn/cnn based classification tasks to run on mobilenet.
Thanks for the suggestions on pretrained models to release. We are prioritizing various models to train as we get time. Will close this for now.
Most helpful comment
More examples sounds like a good idea.
/CC @nealwu @tfboyd