The problem here is that the workers are being sent large_object again and again for each batch. Which is wasting a lot of time. I know that combining the batches into one compute call would solve this, but I can’t do that as I need to save results in batches so that the process can be resumed if interrupted.
I’ve tried fiddling with worker memory thresholds to no avail. How can I get the workers to hang on to the large object?
Many thanks!
I am curious if there is a mechanism to specify during the scatter that the object needs to be preserved in the a local cache for future use and when the map calls are made the worker checks for the required key in local-cache before fetching it from other workers.
Just adding client.amm.stop() fixed this in the end! It seems that the default AMM is quite agressive - clearing out everything to a single copy every two seconds.