Incubator-mxnet: Custom Operator Seg Fault

Created on 27 Jul 2019 · 7 Comments · Source: apache/incubator-mxnet

After this PR: https://github.com/apache/incubator-mxnet/pull/15518, custom operators seg fault in symbolic mode.
I have created a script to reproduce the issue. If we set export MXNET_SUBGRAPH_BACKEND=NONE before running it, the script runs fine.

https://gist.github.com/Zha0q1/b89675763f49a63064ab1a37ea0174f2
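The workaround mentioned above (setting MXNET_SUBGRAPH_BACKEND=NONE) can also be applied to a single run without exporting the variable in the shell. The following is a minimal sketch, not part of the original report: the helper name run_without_subgraph_backend is hypothetical, and it assumes Python 3.7+ for subprocess.run(capture_output=...). It only sets the environment variable for a child process; it does not touch MXNet itself.

```python
import os
import subprocess
import sys


def run_without_subgraph_backend(args, python=sys.executable):
    """Run `python args...` with MXNET_SUBGRAPH_BACKEND=NONE in its environment.

    Hypothetical helper: same effect as `export MXNET_SUBGRAPH_BACKEND=NONE`
    in the shell, but scoped to the child process only.
    """
    # Copy the current environment so the parent process is left unchanged.
    env = dict(os.environ)
    env["MXNET_SUBGRAPH_BACKEND"] = "NONE"
    return subprocess.run(
        [python, *args],
        env=env,
        capture_output=True,
        text=True,
    )
```

For example, run_without_subgraph_backend(["../test.py"]) would run the reproduction script from the path shown in the error log below; a negative returncode on the result indicates the child was killed by a signal such as SIGSEGV.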

I built from source (latest) and ran the script on Deep Learning Base AMI (Ubuntu) Version 18.0 on a p2.8xlarge instance. I used cmake -DUSE_CUDA=0 -DUSE_MKLDNN=1 -GNinja .. to build.

Error Message:


ubuntu@ip-172-31-38-36:~/abc/incubator-mxnet$ python ../test.py 
[23:15:23] ../src/executor/graph_executor.cc:2018: Subgraph backend MKLDNN is activated.
[23:15:23] ../src/executor/graph_executor.cc:2018: Subgraph backend MKLDNN is activated.

Segmentation fault: 11

Stack trace:
  [bt] (0) /home/ubuntu/abc/incubator-mxnet/python/mxnet/../../build/libmxnet.so(+0x8e6149) [0x7ffa1d95b149]
  [bt] (1) /lib/x86_64-linux-gnu/libc.so.6(+0x354b0) [0x7ffa25ec14b0]
  [bt] (2) python() [0x4954ed]
  [bt] (3) python() [0x46a16d]
  [bt] (4) /usr/lib/python2.7/lib-dynload/_ctypes.x86_64-linux-gnu.so(+0xe719) [0x7ffa251b6719]
  [bt] (5) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_closure_unix64_inner+0x1bf) [0x7ffa24fa5c2f]
  [bt] (6) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_closure_unix64+0x46) [0x7ffa24fa5fa8]
  [bt] (7) /home/ubuntu/abc/incubator-mxnet/python/mxnet/../../build/libmxnet.so(+0xceb8a2) [0x7ffa1dd608a2]
  [bt] (8) /home/ubuntu/abc/incubator-mxnet/python/mxnet/../../build/libmxnet.so(std::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release()+0x46) [0x7ffa1d7d22d6]
Bug

All 7 comments

Hey, this is the MXNet Label Bot.
Thank you for submitting the issue! I will try and suggest some labels so that the appropriate MXNet community members can help resolve it.
Here are my recommended labels: Bug

@PatricZhao @ZhennanQin

@mxnet-label-bot add [Bug]

@Zha0q1 Thanks for reporting this. PR https://github.com/apache/incubator-mxnet/pull/15671 has been created to fix this, and the case you provided passes now. Please try it with your real use case to see if it works.

@Zha0q1 please help double-check if the bug is fixed.

@pengzhao-intel Thanks! I will check and get back to this issue.

@pengzhao-intel @ZhennanQin My use case works now. Thanks!
