Tvm: Ask: TVM vs TensorRT

Created on 19 Dec 2017 · 5Comments · Source: apache/tvm

If I understand correctly, TensorRT and TVM search to accelerate prediction. .
TensorRT optimise prediction on GPU and TVM optimised prediction on almost all platform support GPU, ARM, Mobile ...

Is there a comparison between both on GPU ?

Source

edmBernard

Most helpful comment

so far tvm does not yet optimizes for int8 which TensorRT is optimized for. But there are some on going effort on this, so answer is TensorRT is faster currently and we are keep improving TVM to cover optimizations used in TensorRT for all platforms

tqchen on 19 Dec 2017

👍9

All 5 comments

tqchen on 19 Dec 2017

👍9

thanks for your answer

edmBernard on 19 Dec 2017

SO basically TVW will be a generic tensorrt, looking forward to see the new version

ouceduxzk on 22 Feb 2018

Hi @tqchen , curious if there's any updated perspective on TVM vs. TensorRT? Also how does ONNX relate to this project? Does it replace the need for an open exchange format?

austinmw on 15 Apr 2020

Some updates https://tvm.apache.org/2019/04/29/opt-cuda-quantized please feel free to followup on https://discuss.tvm.ai/

tqchen on 15 Apr 2020

👍2

Was this page helpful?

0 / 5 - 0 ratings

Related issues

[DOC] Documentation on Quantization

masahi · 4Comments

[WINDOWS][AutoTVM] OSError: [WinError 10048] Only one usage of each socket address (protocol/network address/port) is normally permitted and OSError: [WinError 10049] The requested address is not valid in its context

Coderx7 · 5Comments

Support Boundary Checking for Loop Dependent Iterators

yzh119 · 3Comments

[TEXPR][PASS] Loop distribution pass generates incorrect code

derisavi · 6Comments

[RELAY][RFC] Modify repr to return a valid Python AST

jroesch · 5Comments