Incubator-mxnet: Automatic Batching for Dynamic Graphs

Created on 23 May 2017 · 3 comments · Source: apache/incubator-mxnet

@piiswrong: This seems very relevant to MXNet's efforts to support dynamic graphs #5705.

@jekbradbury: any idea whether PyTorch is thinking about adding support for something like this?


All 3 comments

Yeah, we have been thinking about doing the same thing. Haven't gotten to it yet. Never thought you could write a paper about it...

DyNet's autobatching approach is predicated on its lazy execution semantics (build a graph imperatively but don't execute it until later -- the autobatching can happen in between), and this limits the applicability of autobatching in truly dynamic situations, where the graph structure depends on the runtime values of tensors, e.g. because of discrete samples from a stochastic policy network.
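
To make the idea concrete: a lazy-eval autobatcher records operations instead of running them, then groups recorded ops that share a signature and runs each group as one batched kernel. Below is a minimal sketch of that pattern (all names are illustrative -- this is not DyNet's actual API):

```python
# Hypothetical lazy-eval autobatching sketch: record ops on a tape,
# then group same-signature ops and execute them as one batched kernel.
from collections import defaultdict
import numpy as np

class LazyTape:
    def __init__(self):
        self.pending = []  # recorded (op name, weight, input, output slot)

    def matmul(self, w, x):
        slot = {}  # lazy handle; filled in when flush() runs
        self.pending.append(("matmul", w, x, slot))
        return slot

    def flush(self):
        # Group ops by (op name, weight identity): same signature means
        # the inputs can be stacked and run as a single batched matmul.
        groups = defaultdict(list)
        for op, w, x, slot in self.pending:
            groups[(op, id(w))].append((w, x, slot))
        for items in groups.values():
            w = items[0][0]
            xs = np.stack([x for _, x, _ in items])  # (batch, in)
            ys = xs @ w.T                            # one batched kernel
            for (_, _, slot), y in zip(items, ys):
                slot["value"] = y
        self.pending.clear()

tape = LazyTape()
w = np.random.randn(4, 3)
outs = [tape.matmul(w, np.random.randn(3)) for _ in range(5)]  # build lazily
tape.flush()                                                   # batch + run
print(outs[0]["value"].shape)  # (4,)
```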

PyTorch and Chainer currently use eager execution semantics (apart from the minimal laziness that comes from CUDA's async execution), which means they'd either need to implement lazy eval or apply a very different autobatching approach. The problems with choosing lazy eval are a) the runtime conditionals mentioned above and b) the fact that, as GPUs get faster and faster, graph building starts to cost more than execution.
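
For illustration, here is the kind of runtime conditional that defeats pure lazy eval: the branch taken depends on the value of a sampled tensor, so the framework must materialize that value before it can build the rest of the graph. A minimal PyTorch sketch, with hypothetical stand-in modules:

```python
import torch

# Hypothetical stand-ins: a tiny policy network and two branch modules.
state = torch.randn(1, 8)
policy_net = torch.nn.Linear(8, 2)
left_branch = torch.nn.Linear(8, 4)
right_branch = torch.nn.Linear(8, 4)

probs = torch.softmax(policy_net(state), dim=-1)
action = torch.multinomial(probs, num_samples=1)  # discrete sample

# The branch depends on the *value* of `action`: a lazy framework would
# have to execute the pending graph right here, defeating deferred
# autobatching across this control-flow boundary.
if action.item() == 0:
    out = left_branch(state)
else:
    out = right_branch(state)
```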

Ultimately what we need is either a magical static analyzer that can convert imperative-looking Python into an MXNet-like symbolic graph, or autobatching with lazy eval combined with annotations that say "this function can be statically compiled." MXNet is well-positioned to do that, since it has always had a mixed imperative-symbolic approach.
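
As it happens, MXNet's Gluon API (released later in 2017) realizes roughly this idea: a HybridBlock is written imperatively, and calling hybridize() is exactly the annotation that asks MXNet to compile it into a symbolic graph. A minimal sketch:

```python
import mxnet as mx
from mxnet.gluon import nn

class MLP(nn.HybridBlock):
    def __init__(self, **kwargs):
        super(MLP, self).__init__(**kwargs)
        with self.name_scope():
            self.dense0 = nn.Dense(64, activation='relu')
            self.dense1 = nn.Dense(10)

    def hybrid_forward(self, F, x):
        # F is mx.nd (imperative) or mx.sym (symbolic), depending on
        # whether the block has been hybridized.
        return self.dense1(self.dense0(x))

net = MLP()
net.initialize()
net.hybridize()  # the annotation: "this block can be statically compiled"
out = net(mx.nd.random.uniform(shape=(8, 32)))
```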

This issue is closed due to lack of activity in the last 90 days. Feel free to reopen if this is still an active issue. Thanks!
