The documentation describing how create and serve a custom Estimator, how to serve a tensorflow model in general, and how to perform graph transforms is very helpful - each on its own - but it is unclear how all these components fit together in the same ecosystem. This google cloud documentation on deploying models seems to suggest this (create -> transform -> serve) is the intent, I just cannot seem to find any documentation on how to:
@gautamvasudevan - Hi, any update on this documentation ?
No. @lamberta any thoughts on this?
I've seen an internal doc that addresses some of this (b/116674557), but nothing that is ready to publish.
Might make a nice tutorial for the serving docs.
Looking for this as well.
@lamberta Anything on this you could share? The ideal scenario would be documentation on how to:
I am able to accomplish many of these steps independently:
but pruning / transformer the output of an estimator, and generating a valid servable from this, would be great.
The doc I saw became this blog post: Optimizing TensorFlow Models for Serving
We're building out the TFX section which will have more pipelines for serving: https://www.tensorflow.org/tfx/
For TFX, you can see the Chicago Taxi end-to-end example (and notebook).
We'll continue to build this out.
Thank you @lamberta.
@lamberta thanks for posting
Most helpful comment
The doc I saw became this blog post: Optimizing TensorFlow Models for Serving
We're building out the TFX section which will have more pipelines for serving: https://www.tensorflow.org/tfx/
For TFX, you can see the Chicago Taxi end-to-end example (and notebook).
We'll continue to build this out.