Platform: Windows 10
TensorFlow version: 1.7.0 (GPU)
Cuda compilation tools, release 9.0, V9.0.176
CUDNN: 7.1.2
Graphics processor: NVIDIA GeForce GTX 1050
My code:
from keras.layers import CuDNNLSTM, Bidirectional

lstmsize = 6
lstm0 = CuDNNLSTM(lstmsize, return_sequences=True)
Error:
UnknownError (see above for traceback): Fail to find the dnn implementation.
[[Node: cu_dnngru_1/CudnnRNN = CudnnRNN[T=DT_FLOAT, direction="unidirectional", dropout=0, input_mode="linear_input", is_training=true, rnn_mode="gru", seed=87654321, seed2=0, _device="/job:localhost/replica:0/task:0/device:GPU:0"](cu_dnngru_1/transpose, cu_dnngru_1/ExpandDims_1, cu_dnngru_1/Const_1, cu_dnngru_1/concat)]]
[[Node: loss/mul/_73 = _Recvclient_terminated=false, recv_device="/job:localhost/replica:0/task:0/device:CPU:0", send_device="/job:localhost/replica:0/task:0/device:GPU:0", send_device_incarnation=1, tensor_name="edge_618_loss/mul", tensor_type=DT_FLOAT, _device="/job:localhost/replica:0/task:0/device:CPU:0"]]
Hoping for some help!
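For anyone trying to reproduce this: the snippet above is only a fragment, so here is a minimal self-contained sketch built around it (the Bidirectional wrapper, input shape, and dummy data are assumptions, not part of the original report). On an affected setup the fit call is what triggers the CudnnRNN error:

import numpy as np
from keras.models import Sequential
from keras.layers import CuDNNLSTM, Bidirectional, Dense

lstmsize = 6
model = Sequential()
model.add(Bidirectional(CuDNNLSTM(lstmsize, return_sequences=True), input_shape=(10, 8)))
model.add(Dense(1))
model.compile(optimizer='adam', loss='mse')

# dummy data just to force the CudnnRNN kernel to run
x = np.random.random((4, 10, 8)).astype('float32')
y = np.random.random((4, 10, 1)).astype('float32')
model.fit(x, y, epochs=1, batch_size=2)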
I have the same problem under Linux Ubuntu 16.04
Maybe this helps:
Same problem on Ubuntu 18.04.1 LTS running CUDA V9.0.176 and cuDNN 7.2.1. Ditto on RHEL 7.4 with CUDA V9.0.176 and cuDNN v7 for CUDA 9.0.
For CUDA 9.0 and cuDNN 7.1.1:
CuDNNLSTM and CuDNNGRU ran successfully, then after a few days started giving the same error.
Fixed by reinstalling CUDA and cuDNN.
There has to be a better solution; this way is too tiresome and lengthy!
I'm also seeing this error on Ubuntu 18.04, RTX 2070, CUDA 10, Keras, and tf-nightly-gpu. I cross-posted on NVIDIA but haven't seen much help there: https://devtalk.nvidia.com/default/topic/1046589/cuda-setup-and-installation/issues-with-tensorflow-on-cuda10-and-rtx2080/
I had the same issue when I updated TensorFlow to 1.12. The error got resolved after updating my cuDNN version from 7 to 7.5. I followed the steps in the URL below for updating cuDNN (note: the steps in the link are for installing cuDNN, but the same applies for an update as well):
https://jhui.github.io/2017/09/07/AWS-P2-CUDA-CuDNN-TensorFlow/
I ended up fixing this issue with the allow_growth = True comment on https://github.com/tensorflow/tensorflow/issues/24496
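For TF 1.x with standalone Keras, that fix boils down to something like the following sketch (run it before building the model; keras.backend.set_session assumes the TensorFlow backend):

import tensorflow as tf
import keras.backend as K

config = tf.ConfigProto()
config.gpu_options.allow_growth = True  # allocate GPU memory on demand instead of grabbing it all up front
K.set_session(tf.Session(config=config))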
Platform: Ubuntu 18.04
TensorFlow version: 1.13.1 (GPU)
CUDA: V10.0.130
CUDNN: 7.4.2
GPU: RTX 2080Ti
I got the same error. I had built the graph, and the error occurred while initializing variables. When I used tf-nightly-gpu (version 1.13) I didn't get this error.
I also set allow_growth = True, but it didn't work.
I got this error while running cuDNN LSTM layers. They worked for a while, then they quit working. I did "conda update tensorflow-gpu" and that fixed it. The problem must be in TensorFlow somewhere?
I got this error last night while working on the TensorFlow tutorial https://www.tensorflow.org/alpha/tutorials/load_data/text. I was using tensorflow-gpu 2.0 alpha on an Ubuntu 18.04 x64 machine with Python 3.6. I updated my cuDNN from 7.4 to 7.5.1 and tried to upgrade TensorFlow too, but that did not change anything. I was able to compile the cuDNN samples MNIST network, which is the usual test for a successful install. Just wanted to let you know about the continuing issue.
I also ran 'conda update tensorflow-gpu' and that fixed it. Thanks!
Reference: https://github.com/tensorflow/tensorflow/issues/20067#issuecomment-420245102
Have you made sure your GPU is available? If you have any other session running on the same GPU on Windows, you will want to halt and close it.
Try the following snippet to check whether you have a GPU available. The error occurs when there is no available device:
from tensorflow.python.client import device_lib

def get_available_gpus():
    # list every device TensorFlow can see and keep only the GPUs
    local_device_protos = device_lib.list_local_devices()
    return [x.name for x in local_device_protos if x.device_type == 'GPU']
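For example (the device string shown is just illustrative):

print(get_available_gpus())  # e.g. ['/device:GPU:0']; an empty list means TensorFlow cannot see a GPU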
I fixed this issue by upgrading cuDNN from 7.0 to 7.5. I am using CUDA 10.1 and tensorflow-gpu 1.14 on Ubuntu 16.04.
I ended up fixing this issue with the allow_growth = True comment on tensorflow/tensorflow#24496
Thanks! I solved this problem the same way.
In TensorFlow 2.0 I got the same error while running an RNN LSTM model. The reason was the lower version of my cuDNN. The TensorFlow GPU requirements page recommends cuDNN SDK >= 7.4.1; you can find more details at https://www.tensorflow.org/install/gpu
Also asked in the TensorFlow Reddit forum: https://www.reddit.com/r/tensorflow/comments/dxnnq2/i_am_getting_an_error_while_running_the_rnn_lstm/?utm_source=share&utm_medium=web2x
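As a quick sanity check, you can ask TensorFlow itself which CUDA/cuDNN versions it was built against. A sketch, with the caveat that tf.sysconfig.get_build_info() only exists in newer 2.x releases (hence the guard), and on TF 2.0 the device listing lives under tf.config.experimental:

import tensorflow as tf

print("built with CUDA:", tf.test.is_built_with_cuda())
build = getattr(tf.sysconfig, "get_build_info", lambda: {})()  # absent on older builds
print("cuda_version:", build.get("cuda_version"), "cudnn_version:", build.get("cudnn_version"))
print("GPUs visible:", tf.config.experimental.list_physical_devices("GPU"))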
Maybe you can solve it with tf.config.experimental.set_memory_growth().
Try this. It works:
import tensorflow as tf

gpus = tf.config.experimental.list_physical_devices('GPU')
for gpu in gpus:
    tf.config.experimental.set_memory_growth(gpu, True)  # allocate GPU memory on demand
# cap the first GPU at ~1 GB via a virtual device configuration
tf.config.experimental.set_virtual_device_configuration(gpus[0], [tf.config.experimental.VirtualDeviceConfiguration(memory_limit=1024)])
I got the same error after trying to train a model again, and I solved it with the same solution as @Shekhrozx.
I solved this problem this way:
import tensorflow as tf

physical_devices = tf.config.list_physical_devices('GPU')
tf.config.experimental.set_memory_growth(physical_devices[0], enable=True)
The recommended format straight from the TF docs in 2.0+ is:
physical_devices = tf.config.experimental.list_physical_devices('GPU')
try:
    tf.config.experimental.set_memory_growth(physical_devices[0], True)
except:
    # Invalid device or cannot modify virtual devices once initialized.
    pass
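In either form, these calls have to run before anything touches the GPU (i.e., before the model is built or any op executes). Once the device has been initialized, set_memory_growth fails with the "cannot modify virtual devices once initialized" error that the comment above refers to.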