Py-faster-rcnn: Rotational Detection

Created on 7 Dec 2016 · 10Comments · Source: rbgirshick/py-faster-rcnn

Hello,

Having prepared the dataset properly, I tested an example image and got the following result,

car

Is there any clever way to improve the detection probabilities of rotated cars?

Source

eonurk

Most helpful comment

I have modified a version of labelimg, which can label rotation BBs.
https://github.com/cgvict/roLabelImg
Hope to be helpful.

cgvict on 25 Aug 2017

❤4 👍2

All 10 comments

@eonurk
I think you should provide rotated bounding boxes in your ground truth. So the system will learn that. Also in result visualization step, you have to modify the code to be able to show rotation BBs.

I am now trying to detect the same images with rotated bounding boxes, but no luck yet. First I have to prepare an annotation tool which allows rotated bounding boxes.

arasharchor on 14 Dec 2016

👍1

Actually I think I found the reason. I was taking the RoIs without a precised model, some of them were halves of car, some of then were the cars bottom parts. Yesterday, I deleted those RoIs from my dataset, and only took the cars that are very obvious in the picture. It kind of solved my problem. Still not perfect though! I am also writing a matlab code that rotates the image and the bboxes 45 degree so that the ground truth images are good to go. I will share it if I do it:)

eonurk on 15 Dec 2016

@eonurk Thanks. I really appreciate that if you could, but I think that would more efficient if we could change this annotation tool to support rotated BBoxes.Extension - BBox Angle (not an issue)
I have not started to change it so far, but I am so willing to start perhaps in this week.
I am working on big (5kx3k) images with very small(20x10) objects which appear in different rotation angles and I need to estimate their rotation angle as well.

arasharchor on 15 Dec 2016

@eonurk have you managed to add rotation classification?

arasharchor on 9 Mar 2017

I actually wrote a code for changing the rotations of the images and their bounding boxes by 45 degrees. Also my collegues wrote a simple GUI to implement my code. So in the end if you have 100 ground truth images, you will end up with 800 with bounding boxes. Also, I wrote a program that is compitible with the inria.py training example of Faster R-CNN. If I have some spare time, I will create a repo for these stuff.

eonurk on 9 Mar 2017

@eonurk
thanks. I also wrote a matlab code to rotate images by 90 for data augmentation along with their boudingboxes. As Fasterrcnn does flipping as another sort of data augmentation, rotating 90 was enough for me as by flipping I will cover four main directions. However, as far as I know faster rcnn takes ground truth with format of (xmin,ymin,xmax,ymax) and converts it to (centerx,centery,width,height). How is it possible to predict rotated bounding boxes in this case?

I am trying to divide 360 degrees into 8 classes and add another loss function besides other two ones for rotation prediction. The another way is to add angles to bounding box groundtuthes as a fifth array, but I am not sure the second approach would work better that the first one. So I have already asked other guys, but now response yet. #https://github.com/rbgirshick/py-faster-rcnn/issues/515

another way of predicting rotated bboxes is predicting the coordinates of 4 points of bboxes, but I think predicting center, width, height along angle will be better. I am not sure. As said I am trying to add a new loss function but I am getting errors.

arasharchor on 9 Mar 2017

@smajida I think you can have a look at this paper "https://arxiv.org/pdf/1703.01086.pdf"

helxsz on 28 Jun 2017

I have modified a version of labelimg, which can label rotation BBs.
https://github.com/cgvict/roLabelImg
Hope to be helpful.

cgvict on 25 Aug 2017

❤4 👍2

@helxsz thanks. a good paper indeed! :)
@cgvict great! unfortunately I tried this new feature, but it did not show any difference with no rotation bboxes features. I opened an issue regarding that.

arasharchor on 29 Aug 2017

does anyone tried to apply the modified version of @cgvict to an object detection api such as the one from Tensorflow ? is it even possible to change the bounding box from the api so it would be rotated !