Alpaka: Avoid device synchronization on cudaFree

Created on 26 Apr 2020 · 2Comments · Source: alpaka-group/alpaka

See this blog post: https://devblogs.nvidia.com/introducing-low-level-gpu-virtual-memory-management/

CUDA Help Wanted

Source

BenjaminW3

👍2

Most helpful comment

CUDA 11.2 introduces cudaMallocAsync and cudaFreeAsync with stream semantic:

fwyzard on 16 Dec 2020

👍2

All 2 comments

The disadvantage is that this function require the driver API. Currently only the tests in alpaka depends the driver API.
If we implement something like that we need to take care that we are able to build on a system without a CUDA driver. The CUDA driver ships cuda.so.
On a system without CUDA driver we need to build against lib/stubs/libcuda.so. At all it is no big deal but we need to check if our CMake can handle it correctly.

psychocoderHPC on 27 Apr 2020

CUDA 11.2 introduces cudaMallocAsync and cudaFreeAsync with stream semantic:

fwyzard on 16 Dec 2020

👍2

Was this page helpful?

0 / 5 - 0 ratings

Related issues

Travis: Stages

ax3l · 4Comments

Travis GitHub Marketplace

ax3l · 5Comments

Boost 1.67.0 beta1

ax3l · 5Comments

Clang still fails

tdd11235813 · 4Comments

bad performance with OpenMP

psychocoderHPC · 4Comments