Conv2d with a 1x1 kernel is not working on GPU, although it works fine on CPU:
net = nn.Conv2d(1, 6, kernel_size=(1,1))
net.cuda()
x = Variable(torch.randn(1, 1, 100, 100))
x.cuda()
net(x)
Error message:
TypeError: FloatSpatialConvolutionMM_updateOutput received an invalid combination of arguments - got (int, torch.FloatTensor, torch.FloatTensor, torch.cuda.FloatTensor, torch.cuda.FloatTensor, torch.FloatTensor, torch.FloatTensor, int, int, int, int, int, int), but expected (int state, torch.FloatTensor input, torch.FloatTensor output, torch.FloatTensor weight, [torch.FloatTensor bias or None], torch.FloatTensor finput, torch.FloatTensor fgradInput, int kW, int kH, int dW, int dH, int padW, int padH)
I tried disabling cudnn with torch.backends.cudnn.enabled = False
but still got the same error message.
I use Ubuntu 14.04, CUDA 7.5, cuDNN 5.1.5, Python 3.5.2, and PyTorch is installed from binaries.
If you look closely at the argument types that were given to conv, you'll see that some of the tensors are torch.cuda.FloatTensors, while the others are torch.FloatTensors. You probably forgot to send the input to the GPU.
To clarify, instead of:
x = Variable(torch.randn(1, 1, 100, 100))
x.cuda() # This creates a copy on the GPU and immediately discards it. "x" is still on the CPU
You should write:
x = Variable(torch.randn(1, 1, 100, 100).cuda())
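Putting it all together, a minimal working version of the original snippet might look like this (a sketch, assuming a recent PyTorch where `Variable` is no longer needed and tensors are used directly; the `device` guard lets it also run on a CPU-only machine):

```python
import torch
import torch.nn as nn

# Fall back to CPU when no GPU is available, so the snippet always runs.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

net = nn.Conv2d(1, 6, kernel_size=(1, 1))
net.to(device)    # moves the module's parameters in place

x = torch.randn(1, 1, 100, 100)
x = x.to(device)  # returns a new tensor; the result must be reassigned

out = net(x)
print(out.shape)  # torch.Size([1, 6, 100, 100])
```

The key fix is the same either way you spell it: the result of `x.cuda()` or `x.to(device)` has to be assigned back to `x`, whereas `net.cuda()` / `net.to(device)` mutate the module in place.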
I think it would be better to make model.cuda()
and x.cuda()
behave consistently to avoid confusion.
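The asymmetry being discussed can be demonstrated without a GPU, since dtype conversions follow the same rules as device moves: Module methods mutate the parameters in place (and return self), while Tensor methods return a new tensor. A small sketch:

```python
import torch
import torch.nn as nn

net = nn.Conv2d(1, 6, kernel_size=(1, 1))
net.double()              # in place: parameters are now float64
print(net.weight.dtype)   # torch.float64

x = torch.randn(1, 1, 100, 100)
x.double()                # result discarded; x itself is unchanged
print(x.dtype)            # torch.float32

x = x.double()            # reassigning is what actually converts x
print(x.dtype)            # torch.float64
```

This is exactly the trap in the original code: `x.cuda()` on its own line is a no-op as far as `x` is concerned.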