Bert: Serving fine-tuned Model - best solution

Created on 11 Jul 2019 · 7Comments · Source: google-research/bert

What is the best solution to serve a fine-tuned model and returning predictions?

Source

JimAva

Most helpful comment

Although it might not be the most efficient method I find wrapping the prediction in a Flask API to work quite well. It works something like this:

First export your model after training:
estimator.export_saved_model(model_dir, serving_input_receiver_fn)

Then load your model in the API using something in the lines of:

from tensorflow.contrib import predictor
predict_fn = predictor.from_saved_model(model_dir)
result = predict_fn(...)

Now you can use predict_fn to serve predictions. I have a rough implementation that I could share if you need it :)

sarnikowski on 7 Aug 2019

👍2

All 7 comments

this is a duplicate of issue #679

jaymody on 11 Jul 2019

Thanks Jay for the response, however, bert-as-service will only encode sentences. I've looked at their documentation and have it running and it does not do any prediction.

JimAva on 11 Jul 2019

👍1

I think this will help you. https://github.com/SunYanCN/BERT-chinese-text-classification-and-deployment
@JimAva

SunYanCN on 12 Jul 2019

Although it might not be the most efficient method I find wrapping the prediction in a Flask API to work quite well. It works something like this:

First export your model after training:
estimator.export_saved_model(model_dir, serving_input_receiver_fn)

Then load your model in the API using something in the lines of:

from tensorflow.contrib import predictor
predict_fn = predictor.from_saved_model(model_dir)
result = predict_fn(...)

Now you can use predict_fn to serve predictions. I have a rough implementation that I could share if you need it :)

sarnikowski on 7 Aug 2019

👍2

@sarnikowski I am actually using Flask to serve predictions however, it's very slow. Is it possible for you to share the code with me?
TIA!

Ayush-iitkgp on 26 Aug 2019

For anyone else interested in this, i wrote a rough implementation and made it available here: https://github.com/sarnikowski/bert_in_a_flask

sarnikowski on 25 Oct 2019

Maybe check this out if you are looking for serving BERT fine-tuned model.
BERT Serving and Inferencing from fine-tuned

jageshmaharjan on 9 Feb 2020

Was this page helpful?

0 / 5 - 0 ratings

Related issues

why need to change words to "###*"by apply tokenization?

waallf · 4Comments

What are the requirements of the language in order to included in the BERT?

sharavsambuu · 3Comments

How to extract the word embedding parameters from the pretrained files?

dzhao123 · 3Comments

Model become 3 times larger after finetune?

wangwei7175878 · 4Comments

run_classifier.py gets struck while saving checkpoint

santhoshkolloju · 3Comments