Mask_rcnn: image_area in PyramidROIAlign

Created on 27 Jan 2018  路  1Comment  路  Source: matterport/Mask_RCNN

Hi,

Could you explain why we need (224.0 / tf.sqrt(image_area)), instead of just using 224.0 at the following line?

roi_level = log2_graph(tf.sqrt(h * w) / (224.0 / tf.sqrt(image_area))) // model.py line 364

screen shot 2018-01-26 at 5 50 47 pm

When I see the equation in FPN paper, I do not see tf.sqrt(image_area) part.

Thank you very much.

Most helpful comment

The equation in the paper assumes the width and height are in pixels. In the code, at that spot, the width and height are normalized (0 to 1). The additional division handles the difference in unit of measurement.

>All comments

The equation in the paper assumes the width and height are in pixels. In the code, at that spot, the width and height are normalized (0 to 1). The additional division handles the difference in unit of measurement.

Was this page helpful?
0 / 5 - 0 ratings