Minimizing wall time in Dask Delayed chunking

I have a general question about the best way to troubleshoot a situation where the total CPU time is fairly short but the wall time is much longer. In the code example below, the total CPU time was about 1.5 hours, while the wall time was over 6 hours.

I wonder whether this is caused by how I implemented the chunking with Dask Delayed, or by something in the delayed function itself. For context, the process_tile function uses the Rasterio library to fetch image metadata from GeoTIFFs stored in Azure Blob Storage.

%%time
import dask

chunk_size = 20

for aoi in AOIs:
    aoi_s1_tiles = dataset_tree['s1'][aoi]
    
    # create chunks of tiles
    for i in range(0, len(aoi_s1_tiles), chunk_size):
        future_pool = []
        tile_chunk = aoi_s1_tiles[i:i+chunk_size]
        
        # loop over each sentinel-1 chip in chunk
        for tile in tile_chunk:
            future = dask.delayed(process_tile)(tile, aoi)
            future_pool.append(future)
        future_pool = dask.persist(*future_pool)
        dask.compute(*future_pool)

CPU times: user 1h 19min 18s, sys: 21min 49s, total: 1h 41min 8s
Wall time: 6h 23min 33s
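
For reference, process_tile boils down to something like this (simplified; the container path and returned fields here are illustrative, not my exact code):

import rasterio

def process_tile(tile, aoi):
    # Open the GeoTIFF on Azure Blob via GDAL's /vsiaz/ virtual
    # filesystem and read its metadata (illustrative path).
    path = f"/vsiaz/s1-tiles/{aoi}/{tile}.tif"
    with rasterio.open(path) as src:
        return {
            "tile": tile,
            "aoi": aoi,
            "crs": str(src.crs),
            "bounds": src.bounds,
            "shape": (src.height, src.width),
        }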

@KennSmith Thanks for the question! Could you please share some sample/synthetic data to help us reproduce this?

Some general notes about your code:

  • Calling .compute() right after .persist() isn’t really helpful; you can call .compute() directly
  • It’s generally a good idea to minimize for-loops; calling persist and compute inside a loop is also expensive, since each call blocks until that batch finishes
  • It looks like each chunk is executed sequentially: because compute is called inside the inner loop, each batch of 20 tasks has to finish before the next batch starts, so the outer loops run one batch at a time (see the sketch below)
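
Restructuring along these lines lets the scheduler see the whole graph at once and keep all workers busy (a minimal sketch, reusing AOIs, dataset_tree, and process_tile from your snippet):

import dask

# Build the full task graph first: one delayed call per tile across
# every AOI, with no persist or compute inside the loops.
tasks = []
for aoi in AOIs:
    for tile in dataset_tree['s1'][aoi]:
        tasks.append(dask.delayed(process_tile)(tile, aoi))

# A single compute at the end lets Dask schedule everything
# concurrently instead of blocking after every batch of 20 tiles.
results = dask.compute(*tasks)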

I’d also encourage you to look at the diagnostic dashboard; it might give some insight into where the time is going.
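
If you’re using the distributed scheduler, the dashboard URL is available on the client object (assuming a local cluster here; point Client at your own cluster address if you have one):

from dask.distributed import Client

client = Client()             # starts a local cluster by default
print(client.dashboard_link)  # open this URL to watch tasks in real time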