Stop FargateCluster from retiring workers

I have code that is similar to this:

from dask import bag
import pandas as pd

dfs = []

for task in tasks:
    subtasks = gen_subtasks(task)
    df = (
        bag
        .from_sequence(subtasks)
        .to_dataframe(...)
        .groupby(...)
        .agg(...)  # some aggregation (details elided) is needed before compute()
        .compute())
    dfs.append(df)  # without this append, dfs stays empty

pd.concat(dfs).to_parquet(...)

My problem is that whenever the loop finishes an iteration, FargateCluster seems to retire all the workers, and then start them all over again on the next iteration.

Is there a way to keep the cluster alive until the end of the program?

As an alternative, I also tried generating the subtasks as a dask task themselves, but I couldn’t figure out how to do it. How could I integrate the whole process into dask?
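For reference, this is roughly what I was attempting (just a sketch, and I’m not sure it’s the right approach; process_subtasks is a made-up name standing in for the pipeline above):

import dask
import pandas as pd

# Wrap subtask generation in dask.delayed so it runs on the cluster, then
# trigger everything with a single dask.compute() instead of one compute()
# per iteration. process_subtasks is a hypothetical helper that builds a
# pandas DataFrame from one task's subtasks.
lazy_dfs = []
for task in tasks:
    subtasks = dask.delayed(gen_subtasks)(task)
    lazy_dfs.append(dask.delayed(process_subtasks)(subtasks))

dfs = dask.compute(*lazy_dfs)
pd.concat(dfs).to_parquet(...)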

I’m not very familiar with dask-cloudprovider, but @jacobtomlinson or @guillaumeeb might have thoughts on this. 🙂

I’m not familiar with it either, but I don’t see any feature that would retire workers like this for no reason.

@ian.liu88, could you give us the code you use to build and configure your FargateCluster? Do you use adaptivity?
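If you are using adaptivity, that alone could explain it: an adaptive cluster is allowed to retire workers whenever the scheduler goes idle, which happens between the compute() calls in your loop. A rough sketch of the two setups (worker counts are made up, and this assumes a recent dask-cloudprovider where FargateCluster lives under dask_cloudprovider.aws):

from dask_cloudprovider.aws import FargateCluster
from distributed import Client

# Fixed-size cluster: workers stay up until cluster.close() is called.
cluster = FargateCluster(n_workers=4)

# Adaptive cluster: the scheduler may retire idle workers, e.g. between
# compute() calls. A nonzero minimum keeps some workers alive throughout.
# cluster.adapt(minimum=2, maximum=10)

client = Client(cluster)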


Yeah, it’s hard to help here without seeing an example of how you set up your FargateCluster.