Nanny Forces Single Core Usage

Hi all,

I am running into an odd issue whenever my tasks call a compiled C++ executable via subprocess. With Nanny=True (the default), each task gets pinned to a single CPU core (not what I want), but as soon as I start workers with Nanny=False, the executable spreads across all cores as expected. Ideally, I'd like to keep Nanny=True for its benefits around worker restarts.
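
For context, here's a minimal sketch of the two startup modes I'm comparing, written against the programmatic Worker/Nanny API (the scheduler address is a placeholder; Nanny=False corresponds to starting workers with the --no-nanny flag):

import asyncio
from dask.distributed import Nanny, Worker

SCHEDULER = "tcp://scheduler:8786"  # placeholder address

async def start_worker(with_nanny: bool):
    # Nanny supervises the worker in a separate process that it spawns itself;
    # Worker runs directly in this process (the --no-nanny case).
    worker_cls = Nanny if with_nanny else Worker
    async with worker_cls(SCHEDULER) as w:
        await w.finished()

# asyncio.run(start_worker(with_nanny=True))   # tasks end up pinned to one core
# asyncio.run(start_worker(with_nanny=False))  # subprocess spreads across cores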

Minimal Code:

import subprocess
import textwrap

def run():
    # Build a small shell script that activates the conda environment and
    # runs the compiled binary inside a login shell.
    script = textwrap.dedent(
        """
        bash -l <<-'HEREDOC'
        conda activate my-kernel
        /code/compiledCppCodeHere
        HEREDOC
        """
    )
    process = subprocess.Popen(
        script, shell=True, stdout=subprocess.PIPE, stderr=subprocess.PIPE
    )
    stdout, stderr = process.communicate()
    return stdout

futures = [client.submit(run, pure=False) for _ in range(2)]
client.gather(futures)

CPU profile with Nanny=True (what I don't expect)

CPU profile with Nanny=False (what I expect)

Digging a bit deeper, I see that the Nanny spawns the worker process via the multiprocessing module. I'm not sure whether this is the issue.

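One check I'm planning (a sketch, assuming Linux, since os.sched_getaffinity is Linux-only) is whether the worker process itself is already restricted to a single core under Nanny=True, since any subprocess would inherit that affinity mask:

import os

def check_affinity():
    # Cores this worker process is allowed to run on; a subprocess launched
    # from within a task inherits this mask.
    return sorted(os.sched_getaffinity(0))

print(client.gather([client.submit(check_affinity, pure=False) for _ in range(2)]))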

So a few questions:

  1. How can I keep Nanny=True and allow the subprocess to use all cores as expected? (One idea I'm considering is sketched below.)
  2. If not, is there an alternative process-spawning method (other than multiprocessing) that I can configure for the Nanny workers? I'm not sure that's actually the problem, to be honest.
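
For question 1, one workaround I can imagine (an untested sketch, assuming Linux and that inherited CPU affinity really is the cause) would be to widen the affinity mask inside the task before calling the executable:

import os

def run_unpinned():
    # Widen this worker process's CPU affinity to all cores so that the
    # child shell (and the C++ binary it launches) inherits the full mask.
    os.sched_setaffinity(0, range(os.cpu_count()))
    return run()  # the task from the minimal example above

futures = [client.submit(run_unpinned, pure=False) for _ in range(2)]
client.gather(futures)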

Any pointers or background on this would be much appreciated! Thanks!