I want to deploy a model by tensorflowServing+nvidia-docker on GPU .
HOW CAN I limit the GPU's MEMORY .
cause it's tend to use all memory of GPU .
I want to limit the GPU memory used to below 5G (10G in total) .
Thanks for your reply, ANd I know how to solve this problem
if i use the following way to run my model serve , Is there a suitable environment variable to control GPU's memory usage
docker run --runtime=nvidia -p 8501:8501 \
--mount type=bind, \
source=/tmp/tfserving/serving/tensorflow_serving/servables/tensorflow/testdata/saved_model_half_plus_two_gpu, \
target=/models/half_plus_two \
-e NVIDIA_VISIBLE_DEVICES=3 \
-e MODEL_NAME=half_plus_two -t tensorflow/serving:latest-gpu &
@goodluck4s You can take a look at this comment where your question has been answered. Thanks!
thanks for you reply and i had solve this problem
@goodluck4s Thank you for the reply. I am closing this issue as it has been resolved.
@gowthamkpr how to solve this problem, thanks
Most helpful comment
if i use the following way to run my model serve , Is there a suitable environment variable to control GPU's memory usage
docker run --runtime=nvidia -p 8501:8501 \
--mount type=bind, \
source=/tmp/tfserving/serving/tensorflow_serving/servables/tensorflow/testdata/saved_model_half_plus_two_gpu, \
target=/models/half_plus_two \
-e NVIDIA_VISIBLE_DEVICES=3 \
-e MODEL_NAME=half_plus_two -t tensorflow/serving:latest-gpu &