Taichi: C/C++ bindings

Created on 5 Feb 2020 · 8 Comments · Source: taichi-dev/taichi

Concisely describe the proposed feature
A few users want to ship compiled Taichi kernels so that they can run them without Python.

Describe the solution you'd like
We can add a method like ti.export_all(lang='C'/'C++') to dump:

A header that contains all the interfaces
A shared object with all compiled kernels

Then users can do something like:

#include "mpm99_exported.h"

int main() {
  initialize_taichi();
  mpm99_substep();
  finalize_taichi();
}

... or use a more object-oriented C++ version.
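
To make the proposal a bit more concrete, here is a rough sketch of what the dumped header might declare. None of this exists in Taichi yet: only the three function names come from the example above, while the `extern "C"` guard, the `void` signatures, and the stand-in bodies (which take the place of the shared object so the snippet compiles on its own) are assumptions.

```c
/* Sketch of a hypothetical mpm99_exported.h, followed by stand-in bodies. */

#ifdef __cplusplus
extern "C" { /* C linkage, so C++, Rust, Go, etc. can bind to the same symbols */
#endif

/* What the exported header might declare (names taken from the example). */
void initialize_taichi(void); /* set up runtime state before any kernel runs */
void mpm99_substep(void);     /* one exported kernel, callable repeatedly */
void finalize_taichi(void);   /* release whatever initialize_taichi() set up */

#ifdef __cplusplus
}
#endif

/* Stand-in bodies. In the proposal these would live in the shared object;
   the two counters are purely illustrative and not part of any Taichi API. */
static int runtime_ready = 0; /* 1 between initialize and finalize */
static int substeps_run = 0;  /* number of substeps since initialize */

void initialize_taichi(void) {
  runtime_ready = 1;
  substeps_run = 0;
}

void mpm99_substep(void) {
  if (runtime_ready) {
    substeps_run++; /* a real kernel would advance the simulation here */
  }
}

void finalize_taichi(void) {
  runtime_ready = 0;
}
```

A plain C interface like this keeps the exported symbols unmangled, which is what would let other languages reuse the same shared object.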

Additional comments
If you also need this or have any suggestions, please feel free to comment! :-)

Labels: discussion, feature request, stale, welcome contribution

yuanming-hu

❤2 🎉1 👍1

Most helpful comment

For CPU code we can just dump LLVM IR and use llc to compile it into a .obj. Not sure what else we should do to make it a loadable shared object. The place where the optimized LLVM IR is emitted: https://github.com/taichi-dev/taichi/blob/abbf5b13537ab5e0f1d951fb3502b65a4f509bb1/taichi/backends/codegen_llvm_x86.cpp#L104

We should explore this direction, and a good starting point is to compile a simple Taichi kernel, such as

for i in range(n):
  a[i] += 1

For GPU code we can simply dump the compiled PTX and invoke the CUDA runtime to load and run the PTX code.

yuanming-hu on 15 Feb 2020

👍2

All 8 comments

I would really like the ability to do this, especially with Rust support.

samuela on 14 Feb 2020

👍1

Yeah, I think we can start with a standard C interface, and most other languages like C++/Rust/Go/Ruby can make use of it as well.

yuanming-hu on 14 Feb 2020

Makes sense. I'd be happy to help out with this but I'm personally not sure where to start.

samuela on 15 Feb 2020

👍2

Possible solution:
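First, dump the LLVM IR written to llvm::errs() into /tmp/a.ll.
Second, call llc -filetype=obj -relocation-model=pic /tmp/a.ll -o /tmp/a.o. Without -filetype=obj, llc emits assembly rather than an object file, and position-independent code is needed for linking into a shared object.
Third, call gcc -shared -o /tmp/a.so /tmp/a.o.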
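
Once a shared object like this exists, a C program could load it at run time with the POSIX dlopen/dlsym calls. The sketch below is only an illustration: the helper name run_kernel and the no-argument entry-point signature are assumptions, and a real Taichi kernel would most likely expect some context argument that is not modeled here.

```c
/* Load a kernel from a shared object and call it once (POSIX only). */
#include <dlfcn.h>
#include <stdio.h>

/* Assumed entry-point shape: no arguments, no return value. */
typedef void (*kernel_fn)(void);

/* Hypothetical helper: returns 0 on success, -1 if the library or the
   symbol cannot be found. */
int run_kernel(const char *lib_path, const char *symbol) {
  void *handle = dlopen(lib_path, RTLD_NOW);
  if (!handle) {
    fprintf(stderr, "dlopen failed: %s\n", dlerror());
    return -1;
  }

  dlerror(); /* clear any stale error state before the lookup */
  kernel_fn fn = (kernel_fn)dlsym(handle, symbol);
  if (!fn) {
    const char *err = dlerror();
    fprintf(stderr, "dlsym failed: %s\n", err ? err : "symbol is NULL");
    dlclose(handle);
    return -1;
  }

  fn(); /* run the compiled kernel */
  dlclose(handle);
  return 0;
}
```

A call would look like run_kernel("/tmp/a.so", "kernel_entry"), where kernel_entry is a placeholder for whatever symbol name the emitted IR actually defines. On older glibc versions the program needs to be linked with -ldl.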