Mask_rcnn: image_area in PyramidROIAlign

Created on 27 Jan 2018 · 1Comment · Source: matterport/Mask_RCNN

Hi,

Could you explain why we need (224.0 / tf.sqrt(image_area)), instead of just using 224.0 at the following line?

roi_level = log2_graph(tf.sqrt(h * w) / (224.0 / tf.sqrt(image_area))) // model.py line 364

screen shot 2018-01-26 at 5 50 47 pm

When I see the equation in FPN paper, I do not see tf.sqrt(image_area) part.

Thank you very much.

Source

TgithubJ

Most helpful comment

The equation in the paper assumes the width and height are in pixels. In the code, at that spot, the width and height are normalized (0 to 1). The additional division handles the difference in unit of measurement.

waleedka on 12 Feb 2018

👍6

>All comments

waleedka on 12 Feb 2018

👍6

Was this page helpful?

0 / 5 - 0 ratings

Related issues

the resnet50 backbone on mask rcnn model pretrained weight in h5 file

simonhandsome · 3Comments

Load Validation Dataset there is a mistake and how to solve it

chrispolo · 4Comments

Repeated anchors in generate_pyramid_anchors()

Mabinogiysk · 3Comments

What is the meaning of BACKBONE_STRIDES?

Mabinogiysk · 3Comments

TypeError: Axis must be specified when shapes of a and weights differ.

Mhaiyang · 4Comments