Hi,
I am working with the dask-image library and specifically the ndmeasure.label, for a geospatial analysis package I’m developing. My raster has shape 14040 x 25200 with chunksize 5120 x 5120. I am using a local distributed scheduler. The issue I have is that when I call compute/persist on the output of ‘label’, I get a warning “UserWarning: Sending large graph of size 337.43 MiB”. I checked out the amount of tasks in the task graph, but there are only 218 tasks.
The best practice page mentions to avoid large task graphs but seems to cover only the case where this is caused by too many tasks.
What could be the issue here? Is it likely to be related to my code, or inherent to the label function?