Mask_rcnn: image_area in PyramidROIAlign

Created on 27 Jan 2018 · 1 Comment · Source: matterport/Mask_RCNN

Hi,

Could you explain why we need (224.0 / tf.sqrt(image_area)), instead of just using 224.0 at the following line?

roi_level = log2_graph(tf.sqrt(h * w) / (224.0 / tf.sqrt(image_area)))  # model.py line 364


When I look at the equation in the FPN paper, I do not see the tf.sqrt(image_area) part.

Thank you very much.

Most helpful comment

The equation in the paper assumes the width and height are in pixels. In the code, at that spot, the width and height are normalized (0 to 1). The additional division handles the difference in unit of measurement.
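
To make the unit conversion concrete, here is a minimal sketch in plain Python (no TensorFlow) that compares the two forms of the expression inside the log. It assumes that normalized height and width mean pixel size divided by image height and width, and that image_area is image height times image width in pixels. The function names and the example sizes are hypothetical and are not taken from model.py.

```python
import math


def level_term_pixels(h_px, w_px):
    # Paper-style form: ROI height and width are measured in pixels.
    return math.log2(math.sqrt(h_px * w_px) / 224.0)


def level_term_normalized(h, w, image_area):
    # Form from the question: h and w are fractions of the image size (0 to 1),
    # so 224 is rescaled by sqrt(image_area) to put both sides in the same unit.
    return math.log2(math.sqrt(h * w) / (224.0 / math.sqrt(image_area)))


# Hypothetical example values, chosen only for illustration.
image_h, image_w = 1024, 1024
box_h_px, box_w_px = 448, 448

pixels = level_term_pixels(box_h_px, box_w_px)
normalized = level_term_normalized(
    box_h_px / image_h,   # normalized height
    box_w_px / image_w,   # normalized width
    image_h * image_w,    # image area in pixels
)

print(pixels, normalized)
```

Under those assumptions the two values agree, because sqrt(h * w) * sqrt(image_area) is the same as sqrt(h_px * w_px). Dividing 224.0 by tf.sqrt(image_area) is therefore equivalent to converting the normalized box size back to pixels before comparing it with 224.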
