Dask cluster with large number of workers gives "asyncio.exceptions.TimeoutError: Nanny failed to start"

Hi @Arnaud, out of curiosity, do you see a timeout error also with LocalCluster when you launch a lot of worker processes on a single node, as in [Best practice] Deploy a cluster on an interactive compute node on a slurm cluster ?

There seems to be more timeout issues with Workers connecting to Scheduler lately.

1 Like