Tvm: [RELAY][OP] Relay Operator Sprint

Created on 3 Oct 2018 · 52Comments · Source: apache/tvm

Now that the Relay RFC is being merged and we are stabilizing the type inference interface, we should sprint to add new operators to relay to make it on parity with NNVM.

1798 shows an example on how to do so for conv2d operator.

General Steps of Porting

Implement the TypeRelation function, when necessary
- The shapes represented by IndexExpr(symbolic integer)
  - When possible, support symbolic shape inference
  - You can, however, get the integer out from symbolic shape if it is a must, that will require the inference to work on concrete shapes.
- User reporter->Assign to set the inferred result
- Use reporter->AssertEQ to assert symbolic integer equivalence
  - It will return false if there is an unsatisfied constraint
Use tvm::Attrs to replace dmlc::Parameter
We switch to directly create python wrappers by calling into positional functions so that the operator signature is explicit in python

General Principles

Numpy consistency, always consistent with numpy
- All binary operators broadcast
- This means we will use add, subtract instead of broadcast_add, broadcast_sub ...
- elemwise_add version will not be supported for now as we can just use the broadcast version
Consistent with nnvm when possible
Fields in Attrs
- Use concrete types when possible(int, string, bool)
- If you need None, you can use IndexExpr, which gives you that

List of Operators to be covered

Generally, we need to cover everything we have so far https://docs.tvm.ai/nnvm_top.html
Please use this issue to coordinate what you will be working on. As we expect things to move quickly, try to do "fine grained locking" and only claim things that you are working on right now and aim to get things in a few days.

The List

Level 1: Common Basic Ops

Enough to get MLP

[x] nn.dense
[x] nn.relu
[x] tanh
[x] sigmoid
[x] exp
[x] log
[x] sqrt
[x] add
[x] subtract
[x] multiply
[x] divide
[x] mod
[x] nn.batch_flatten
[x] concatenate
[x] nn.softmax
[x] nn.log_softmax
[x] nn.batch_norm
[x] nn.dropout
[x] expand_dims

Level 2: Convolutions

Enough to get convnet

[x] nn.conv2d
[x] nn.conv2d_transpose
[x] nn.max_pool2d
[x] nn.avg_pool2d
[x] nn.global_max_pool2d
[x] nn.global_avg_pool2d
[x] nn.pad
[x] nn.lrn

Level 3: Additional Math And Transform Operators

[x] reshape
[x] copy
[x] negative
[x] floor
[x] ceil
[x] round
[x] trunc
[x] clip
[x] abs
[x] leaky_relu
[x] tranpose
[x] split
[x] squeeze
[x] take
[x] full
[x] zeros
[x] ones
[x] transpose
[x] zeros_like
[x] ones_like

Level 4: All broadcast and reduction functions that are not in previous level

[x] pow
[x] less
[x] greater
[x] less_than
[x] greater_than
[x] right_shift
[x] left_shift
[x] maximum
[x] minimum
[x] sum
[x] max
[x] prod
[x] argmax, argmin
[x] strided_slice
[x] broadcast_to
[x] where

Level 5: Vision Operators

[x] image.resize
[x] vision.multibox_prior
[x] vision.nms

Level 10: Backend Operators

Operators necessary as intermediate stage of optimizations, or gradient, can be influx

help wanted

Source

tqchen

All 52 comments

I had 'implemented' some elem wise function which I need for ad (negative, multiplication, division)
('implemented' because it isnt possible to lower code right now)
I can take some operators, and I dont have any particular preference.

MarisaKirisame on 3 Oct 2018

I could get through resize operator (will PR once #1798 is merged) to start with and proceeding with transforms.

srkreddy1238 on 3 Oct 2018

I strongly agree with the numpy consistency, e.g. nnvm.symbol.flatten should be renamed.

A good example could be that TensorFlow's API uses batch_xxx for batched operators.

junrushao1994 on 4 Oct 2018

I can work on some shape-related APIs.

junrushao1994 on 4 Oct 2018

@junrushao1994
Can you share the ops you are working on to avoid the duplication?

srkreddy1238 on 4 Oct 2018

@tqchen
How about relaying an assert on a condition which involve two variables ?
Ref. Transposed convolution where channels should divide groups and both are variables.

srkreddy1238 on 4 Oct 2018

@srkreddy1238
I am starting with expand_dims, the easiest one

1819

junrushao1994 on 4 Oct 2018

👍1

One note about int32 vs int64 when constructing constant was raised by @junrushao1994 @srkreddy1238 . This is an issue we should think about now. int32 will likely cause some regression on large arrays which need to be fixed. I think we should prefer int64 when possible for constants, and let the compiler to automatically detect and downgrade to int32.

A temporary workaround is always to keep the inferred shape type consistent with the input shape type, and we can make the switch more easily in one place later

tqchen on 4 Oct 2018

👍1

Another thing I am concerning is user friendliness.

First, examples provided by Python API docs should be at least runnable by copy-pasting, like PyTorch (https://pytorch.org/docs/stable/tensors.html) or NumPy (https://docs.scipy.org/doc/numpy/reference/generated/numpy.expand_dims.html).

Second, Python API docs should be self-contained, at least those designed for DL practitioners who may not take a good look at the C++ code.

It does not seem to be a big deal for now, but we should put more effort into it in the future.

junrushao1994 on 4 Oct 2018

+1 for API docs friendless, I would recommend we do it now than later. Maybe I am having a bad lead example in the conv2d docs as it was pretty minimum, I will send an updated PR to update that, and let us make sure the new ops are being well documented with examples, especially non-trival ones

tqchen on 4 Oct 2018

👍1

Expr like below is not getting simplified !!

TensorTypeNode(float32, [n, c, int32((float32(100)*2.000000f)), int32((float32(200)*2.000000f))])

any idea ?

srkreddy1238 on 4 Oct 2018

The eager CSE is done among integer expression only so far. For floating points, we still need to call explicitly simplification, or use as_const_int to get out and explicit simplify

tqchen on 4 Oct 2018

👍1

I am going to grab ~~transpose~~ (update: no, it does not exist in the list)

I am going to grab less, greater, less_equal, greater_equal...

junrushao1994 on 5 Oct 2018

@junrushao1994 transpose should be in the list, sorry the list was not complete

tqchen on 5 Oct 2018

I am taking/had taken multiply/divide/mod/relu/tanh/sigmoid/negative.

MarisaKirisame on 5 Oct 2018

@MarisaKirisame
I have covered multiply, divide, mod, tanh, sigmoid, negative already in #1813

srkreddy1238 on 5 Oct 2018

To keep all of us on same page #1813 covers
multiply, mod, tanh, sigmoid, negative, floor, ceil, trunc, abs, pow, resize, upsampling, batch_flatten, pool2d and global_pool2d

srkreddy1238 on 5 Oct 2018

expand_dims is in #1819
Comparisons greater greater_equal less less_equal not_equal equal are in #1824

junrushao1994 on 5 Oct 2018

ok I am going to take reshape, transpose, copy and concatenate.

concatenate seems to have been done, so what i need to do is only change the API a little bit to keep numpy consistency.

junrushao1994 on 5 Oct 2018

@srkreddy1238 you forgot to mention that you have done round and all pool2d-related operators in your pr #1813 as well.

To keep all of us on same page #1813 covers
multiply, mod, tanh, sigmoid, negative, floor, ceil, trunc, abs, pow, resize, upsampling, batch_flatten, pool2d and global_pool2d

junrushao1994 on 5 Oct 2018

take right_shift

tqchen on 5 Oct 2018

take left_shift

tmoreau89 on 5 Oct 2018

I need squeeze for ad with broadcast (right now it is assuming no broadcast). I will take it.

MarisaKirisame on 5 Oct 2018

I had done zeros_like and ones_like. I think I will take zeros and ones too for symmetricity

MarisaKirisame on 5 Oct 2018

attempting maximum

Luo-Liang on 5 Oct 2018

attempting minimum

Mutinifni on 5 Oct 2018

Attempting pad

slyubomirsky on 5 Oct 2018

take clip

joshpoll on 5 Oct 2018

take sigmoid

cowanmeg on 5 Oct 2018

take softmax

merrymercy on 5 Oct 2018

take full and full_like (the latter appears to be missing from the list even though it's an NNVM op)

slyubomirsky on 6 Oct 2018

take mutibox_prior and nms

kevinthesun on 6 Oct 2018

Some random notes: should we, or is there any tools could, do runtime type checking on the relay's Python API side?

junrushao1994 on 6 Oct 2018

@junrushao1994 can you elaborate ?

tqchen on 6 Oct 2018

@tqchen Sorry, I mean sanity check. It is not related to relay's type, just wondering if it is necessary to add some sanity checks to guard type safety around ffi calls.

For example, the signature of some function is def f(a: List[int], b: int), but python won't check actual type of a and b, and f directly passes a and b to ctypes, who doesn't seem to do sanity check either. In this case, if a and b were not of proper types, we would observe the code crashes without an informative error message, which would cause debug to be hard.

junrushao1994 on 6 Oct 2018

I think there is a python parser that does the similar thing but needs to confirm with @jroesch @joshpoll

tqchen on 6 Oct 2018

@tqchen Sounds cool!

junrushao1994 on 6 Oct 2018

You can do this with mypy types, see #1781 for an example. We could add type annotations to the operators, which will provide the sanity checks you want. Mypy is a static analyzer, so in order to get its benefits you need to run it separately or have it integrated into your IDE.

joshpoll on 6 Oct 2018

👍1

taking conv2d_transpose

srkreddy1238 on 8 Oct 2018

@tqchen how to handle multiple outputs? i was trying dropout and batchnorm, both have multiple outputs.
In RELAY_REGISTER_OP, like set_num_inputs, do we have something like set_num_outputs?
I think currently Array<Type>& types will have only (inputs + one output)

siju-samuel on 8 Oct 2018

multiple outputs is needed for split too.

srkreddy1238 on 8 Oct 2018

@siju-samuel @srkreddy1238 multiple output is possible only by wrapping all of them in a tuple type

MarisaKirisame on 8 Oct 2018

❤1

~I'll take batch_norm and dropout as well, to finish out level 1~

edit: Sorry, @siju-samuel, I didn't see your comment and didn't mean to snipe you with those! Tell me if you're still trying those, or else I could finish my own attempts. I don't mind either way.

slyubomirsky on 9 Oct 2018

i started with reduce ops, you can do with batch_norm & dropout

siju-samuel on 9 Oct 2018

broadcast_to, collapse_sum, broadcast_to_like, collapse_sum_like

MarisaKirisame on 9 Oct 2018

attempting where

zhiics on 9 Oct 2018

strided_slice

siju-samuel on 10 Oct 2018

Attempting split to conclude Level 3

srkreddy1238 on 10 Oct 2018

Will attempt prod

anijain2305 on 10 Oct 2018

Thanks to everyone for the hard work on getting 99% of the way there. I'm making a push to now add the compute and scheduling behavior for all of these operators which should enable users to use Relay for end-to-end inference tasks, enable new frontends and more. If you you would be interested in helping read more here: https://github.com/dmlc/tvm/issues/2051.