Darknet: image data augmentation

Created on 22 May 2018 · 9 Comments · Source: AlexeyAB/darknet

@AlexeyAB, I went through the issue log and understand that random cropping, multi-scaling, and the color and lighting aspects of data augmentation are already handled in this repo. Is there a plan to cover other types of on-the-fly augmentation as well, such as random rotations, flips, and perspective transforms? (Maybe at least rotations and flips.)

want enhancement

kmsravindra

Most helpful comment

@AlexeyAB
Has random rotation been merged into the code?

deimsdeutsch on 19 Jun 2019

👍2

All 9 comments

horizontal flips - already implemented and enabled by default. To disable them, set flip=0 in the [net] section of the cfg file.
vertical flips - I do not know where they would help.
rotations - planned, but I have not yet found datasets that require them, except satellite imagery.
perspective transforms - since these change the entire image, they create disproportionate distortions for near and far objects. I do not know of datasets for which this would help.
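
To illustrate what a flip means for the labels, here is a minimal Python sketch. It assumes YOLO-style labels of the form (class_id, x_center, y_center, width, height) with coordinates normalized to [0, 1]. The function names are hypothetical and this is not the repo's C implementation.

```python
def hflip_yolo_box(box):
    """Mirror one normalized YOLO box left-to-right (horizontal flip)."""
    class_id, x, y, w, h = box
    # Only the x center moves; width and height are unchanged.
    return (class_id, 1.0 - x, y, w, h)


def vflip_yolo_box(box):
    """Mirror one normalized YOLO box top-to-bottom (vertical flip)."""
    class_id, x, y, w, h = box
    # Only the y center moves.
    return (class_id, x, 1.0 - y, w, h)


flipped_h = hflip_yolo_box((0, 0.25, 0.5, 0.1, 0.2))
flipped_v = vflip_yolo_box((0, 0.5, 0.25, 0.1, 0.2))
```

The image itself has to be mirrored the same way, otherwise the labels no longer match the pixels.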

AlexeyAB on 22 May 2018

@AlexeyAB, I think rotations, vertical flips and perspective transforms could be helpful in medical imaging. The camera may be tilted at any angle, and access to training data is limited, which together make a good case for such augmentation.

kmsravindra on 23 May 2018

@kmsravindra, on a different note: can you point me to the multi-scaling part of the code, or to the GitHub issue discussing it? I am interested in learning more about multi-scale data augmentation, but cannot find any docs or issues that cover it.

Thanks!

xiaoyongzhu on 23 May 2018

@xiaoyongzhu

random=1 in the cfg file - changes the network input size during training: https://github.com/AlexeyAB/darknet/blob/6390a5a2ab61a0bdf6f1a9a6b4a739c16b36e0d7/src/detector.c#L132-L151
or the same using OpenCV (3.5x faster): https://github.com/AlexeyAB/darknet/blob/6390a5a2ab61a0bdf6f1a9a6b4a739c16b36e0d7/src/http_stream.cpp#L256

jitter=0.1 to jitter=0.49 - resizes the image: https://github.com/AlexeyAB/darknet/blob/6390a5a2ab61a0bdf6f1a9a6b4a739c16b36e0d7/src/data.c#L743-L793
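
The two settings can be pictured with a rough Python sketch. It is not a transcription of the linked C code. The size range (320 to 608 px in steps of 32), the per-edge jitter rule, and both function names are assumptions made for illustration.

```python
import random


def pick_network_size(rng, min_cells=10, max_cells=19, stride=32):
    """Pick a square network input size that is a multiple of `stride`.

    The 10..19 cell range (320..608 px) is an assumed example range,
    not a value read from the linked source.
    """
    return rng.randint(min_cells, max_cells) * stride


def jitter_crop(width, height, jitter, rng):
    """Return a jittered crop window (left, top, crop_w, crop_h) in pixels.

    Assumption: each edge moves independently by up to jitter * dimension.
    Negative offsets extend the window past the image, so a caller would
    need to pad before resizing the crop to the network size.
    """
    dw = int(width * jitter)
    dh = int(height * jitter)
    left = rng.randint(-dw, dw)
    right = rng.randint(-dw, dw)
    top = rng.randint(-dh, dh)
    bottom = rng.randint(-dh, dh)
    return left, top, width - left - right, height - top - bottom


rng = random.Random(0)  # fixed seed so the sketch is repeatable
size = pick_network_size(rng)
window = jitter_crop(640, 480, 0.3, rng)
```

In this sketch, keeping jitter below 0.5 ensures the crop window always has positive width and height, which fits the 0.1 to 0.49 range mentioned above.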

AlexeyAB on 23 May 2018

@AlexeyAB Very helpful! Thanks for the links!

xiaoyongzhu on 23 May 2018

@AlexeyAB, I have implemented a standalone rotation augmentation in Python, but it creates additional images in the training data and so takes up a lot of storage. I would like to incorporate it into the code so that rotation is applied on the fly at training time. Could you point me to the place in this repo where I could add my augmentation code to run on the fly?

kmsravindra on 27 Jun 2018

@kmsravindra

You should rotate both images and labels (bounding boxes).
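
As a minimal sketch of the label side of that advice, the Python below rotates one box about the image center and returns the axis-aligned box that encloses the rotated corners. It assumes normalized YOLO-style labels (class_id, x_center, y_center, width, height) and an image rotated about its center on an unchanged canvas. The function name is hypothetical and none of this comes from the repo.

```python
import math


def rotate_yolo_box(box, angle_deg, img_w, img_h):
    """Rotate a normalized YOLO box about the image center.

    Positive angles turn clockwise on screen because image y points down;
    the sign must match whatever routine rotates the pixels.
    Returns None when the rotated box falls entirely outside the image.
    """
    class_id, xc, yc, w, h = box
    cx, cy = img_w / 2.0, img_h / 2.0
    # Work in pixels so non-square images are handled correctly.
    px, py = xc * img_w, yc * img_h
    half_w, half_h = w * img_w / 2.0, h * img_h / 2.0
    corners = [
        (px - half_w, py - half_h),
        (px + half_w, py - half_h),
        (px + half_w, py + half_h),
        (px - half_w, py + half_h),
    ]
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    xs, ys = [], []
    for x, y in corners:
        dx, dy = x - cx, y - cy
        xs.append(cx + dx * cos_t - dy * sin_t)
        ys.append(cy + dx * sin_t + dy * cos_t)
    # Enclosing axis-aligned box, clipped to the image bounds.
    x_min, x_max = max(0.0, min(xs)), min(float(img_w), max(xs))
    y_min, y_max = max(0.0, min(ys)), min(float(img_h), max(ys))
    if x_max <= x_min or y_max <= y_min:
        return None
    return (
        class_id,
        (x_min + x_max) / 2.0 / img_w,
        (y_min + y_max) / 2.0 / img_h,
        (x_max - x_min) / img_w,
        (y_max - y_min) / img_h,
    )


rotated = rotate_yolo_box((0, 0.25, 0.25, 0.1, 0.2), 90, 100, 100)
```

Note that the enclosing box is usually looser than the object for angles that are not multiples of 90 degrees, so large rotations can degrade label quality.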