Changing Worker Resources At Runtime

freebie · May 24, 2022, 12:27pm

Hi,

I’m looking to see if I can change the assigned worker resources at runtime. So I can dynamically scale the number of workers that can handle handle certain tasks as workers go on/offline.

My thoughts were to currently do this by using a Plugin, either on the scheduler or workers. There does seem to be a Worker.set_resources function, though in that context, I don’t have access to the total number of other workers.

Making a SchedulerPlugin seems like a better place to do this. However, it doesn’t seem as though you can change the workers resources safely from here:

# scheduler_setup.py

class MyPlugin(SchedulerPlugin):
    def add_worker(self, scheduler=None, worker=None, **kwargs):
        state = scheduler.workers[worker]
        state.resources["new_resource"] = 1000
        print("Total workers:", len(scheduler.workers))

# main.py

def main():
    client = Client(asynchronous=False, scheduler_kwargs={"preload": "scheduler_setup"})
    sum_ = da.arange(10, chunks=2).sum()
    result = client.compute(sum_, resources={'new_resource': 1}).result()
    print(result)
    client.close()


if __name__ == '__main__':
    main()

This method will just hang, as the changes to resources from the scheduler don’t get populated out. Maybe there’s another function for doing this that I’m unaware of, or a way to get scheduler info from the WorkerPlugin instead.

Any thoughts on doing something like this?

pavithraes · May 25, 2022, 8:30pm

@freebie Thanks for your question!

’m looking to see if I can change the assigned worker resources at runtime.

I don’t think so, resources are set when workers are created.

Based on your example though, looks like you do want to set resources when you create new workers?

Would you be allocating the same resource to the workers you plan on scaling or different resources to different workers?

If it’s the same, you can set an environment variable or use a global config, as mentioned in the docs you linked. That way, when you create new workers, they will automatically have that resource.

If it’s different for different workers, it might get tricky because I believe resources weren’t intended to be used dynamically. We can look into some workarounds though. Let me know!

freebie · May 25, 2022, 8:36pm

Hi @pavithraes, yeah the second. I was looking at trying to scale them dynamically at run time. Reallocating resources around based on however many workers are available.

I guess this use case is not quite like saying ‘these workers have these resource e.g. gpu’, but more like ‘these workers can fulfil this role’. This is related to this other topic I posted before, and you had kindly responded to: High Availability and Resource Tracking - #2 by pavithraes

Best,
James

Topic		Replies	Views
High Availability and Resource Tracking	1	318	April 28, 2022
Set up local cluster with custom resource assignments Distributed distributed	3	324	April 8, 2022
Workers not scaling up despite tasks being locked by limited resource Distributed kubernetes , distributed	1	35	January 15, 2025
Change Number of Workers During Runtime Distributed kubernetes , distributed	4	1417	January 17, 2023
How to find count of idle workers from scheduler_info?	7	92	September 18, 2024

Changing Worker Resources At Runtime

Related topics