I am not sure if this is a bug or intended behavior, but I noticed that if I have data of size B x C x A x W x H, where A can differ from sample to sample, the DataLoader throws an error.
Example:
Training = MyDataset(VideosPath)
for i in range(3):
    sample = Training[i]
    print(i, sample['frames'].size())
0 torch.Size([1, 3, 10, 10, 10])
1 torch.Size([1, 3, 10, 10, 10])
2 torch.Size([1, 3, 10, 10, 10])
dataloader = DataLoader(Training, batch_size=2, shuffle=False, num_workers=4)
for i_batch, sample_batched in enumerate(dataloader):
    print(i_batch, sample_batched['frames'].size())
This works fine.
But if I have:
Training = MyDataset(VideosPath)
for i in range(3):
    sample = Training[i]
    print(i, sample['frames'].size())
0 torch.Size([1, 3, 90, 10, 10])
1 torch.Size([1, 3, 211, 10, 10])
2 torch.Size([1, 3, 370, 10, 10])
dataloader = DataLoader(Training, batch_size=2, shuffle=False, num_workers=4)
for i_batch, sample_batched in enumerate(dataloader):
    print(i_batch, sample_batched['frames'].size())
it does not work and throws an error:
RuntimeError                              Traceback (most recent call last)
<ipython-input-...> in <module>()
----> 1 for i_batch, sample_batched in enumerate(dataloader):
      2     print(i_batch, sample_batched['frames'].size())
      3

~/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py in __next__(self)
    284                 self.reorder_dict[idx] = batch
    285                 continue
--> 286             return self._process_next_batch(batch)
    287
    288     next = __next__  # Python 2 compatibility

~/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py in _process_next_batch(self, batch)
    305         self._put_indices()
    306         if isinstance(batch, ExceptionWrapper):
--> 307             raise batch.exc_type(batch.exc_msg)
    308         return batch
    309

RuntimeError: Traceback (most recent call last):
  File "/home/alireza/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 57, in _worker_loop
    samples = collate_fn([dataset[i] for i in batch_indices])
  File "/home/alireza/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 135, in default_collate
    return {key: default_collate([d[key] for d in batch]) for key in batch[0]}
  File "/home/alireza/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 135, in <dictcomp>
    return {key: default_collate([d[key] for d in batch]) for key in batch[0]}
  File "/home/alireza/anaconda3/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 115, in default_collate
    return torch.stack(batch, 0, out=out)
RuntimeError: invalid argument 0: Sizes of tensors must match except in dimension 0. Got 90 and 211 in dimension 3 at /opt/conda/conda-bld/pytorch_1524586445097/work/aten/src/TH/generic/THTensorMath.c:3586
The error message says that sizes of tensors must match except in dimension 0, which suggested that I could permute the dimensions to bring A to dimension 0 and keep the rest the same, i.e.
0 torch.Size([90, 1, 3, 10, 10])
1 torch.Size([211, 1, 3, 10, 10])
but even doing this gives me the same error.
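(The "dimension 0" in the message refers to the new batch dimension that torch.stack creates; the input tensors themselves must all have identical shapes, so permuting does not help. A minimal repro outside the DataLoader:

import torch

a = torch.randn(90, 1, 3, 10, 10)
b = torch.randn(211, 1, 3, 10, 10)
torch.stack([a, b], 0)  # fails: stacked tensors must have identical shapes
)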
That's right.
You need to write your own collate_fn and pass it to DataLoader so that you can have batches of different sizes (for example, by padding the images with zero so that they have the same size and can be concatenated).
It should be fairly easy to write your own collate_fn to handle your use case; a sketch follows. Let me know if that isn't the case.
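Here is a minimal sketch of such a collate_fn. The name pad_collate is hypothetical, and it assumes each sample is a dict whose 'frames' tensor has shape (1, C, A, H, W) with only A varying, and that zero-padding the temporal dimension is acceptable:

import torch
from torch.utils.data import DataLoader

def pad_collate(batch):
    # Hypothetical helper: assumes each sample['frames'] has shape
    # (1, C, A, H, W), where only the temporal dimension A varies.
    frames = [sample['frames'] for sample in batch]
    max_len = max(f.size(2) for f in frames)
    padded = []
    for f in frames:
        pad = max_len - f.size(2)
        if pad > 0:
            # Zero-pad the temporal dimension (dim 2) up to the batch maximum.
            zeros = f.new_zeros(f.size(0), f.size(1), pad, f.size(3), f.size(4))
            f = torch.cat([f, zeros], dim=2)
        padded.append(f)
    # All tensors now share one shape, so stacking succeeds.
    return {'frames': torch.stack(padded, 0)}

dataloader = DataLoader(Training, batch_size=2, shuffle=False,
                        num_workers=4, collate_fn=pad_collate)

In practice you would usually also return the original lengths alongside the padded frames, so the padding can be masked out downstream.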
It would be useful if you could consider this as an enhancement for a future release.
The tricky part is that it is not really clear what the default behavior should be; it is probably task-dependent.
For Detectron we are going to be releasing a collate_fn that handles the case of varying input sizes.
Sometimes we want to process images with different sizes. Padding with 0 is not ideal for this case.
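(One stopgap for that case, a sketch rather than an official API and with list_collate as a hypothetical name, is a collate_fn that skips stacking entirely and returns the variable-size tensors as a list:

def list_collate(batch):
    # No padding: keep each sample's frames at its native size.
    # Downstream code must then iterate over a list of tensors
    # instead of indexing one batched tensor.
    return {'frames': [sample['frames'] for sample in batch]}
)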
@LilySnow that's correct, and the NestedTensor work by @cpuhrsch (https://github.com/pytorch/nestedtensor) will give a unified and elegant way to solve this.
Thanks. That would be very helpful.