Hi @guillaumeeb ,
I also contact the dask_cuda
team and they said using more than one thread per worker (default) is not officially support for LocalCUDACluster
at the moment. Please look at Dask LocalCudaCluster compute error when threads_per_worker
not equal to 1 · Issue #1262 · rapidsai/dask-cuda (github.com)
Thanks for your help anyway.