Linear-time blockwise layer fusion

martindurant · May 3, 2023, 6:59pm

See dask-awkward/optimize.py at main · dask-contrib/dask-awkward · GitHub for a function that fuses chains of blockwise layers in linear time, rather than with the N**2 algorithm dask currently uses. It is more limited in what it can fuse, but it could be an effective precursor stage to the standard fuse, as a way to cut down optimization time.

Since it was developed for dask-awkward, it might only apply when there is just one axis of partitioning (i.e., dataframes rather than arrays), because that is dask-awkward’s model.
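To make the idea concrete, here is a minimal sketch of single-pass fusion of linear chains. It is not the code in dask-awkward's optimize.py and it does not use dask's HighLevelGraph API; `Layer`, `compose`, `fuse_linear_chains` and `compute` are hypothetical names for a toy graph. It assumes one axis of partitioning, so partition i of a layer depends only on partition i of its inputs, and it only fuses a layer into its child when that child is its sole consumer and has no other input.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Layer:
    """Toy stand-in for a blockwise layer (hypothetical, not a dask class)."""
    func: Callable               # applied independently to each partition
    deps: tuple[str, ...] = ()   # names of the input layers


def compose(funcs):
    """Chain functions: the first takes the layer inputs, the rest take one value."""
    if len(funcs) == 1:
        return funcs[0]

    def fused(*args):
        out = funcs[0](*args)
        for f in funcs[1:]:
            out = f(out)
        return out

    return fused


def fuse_linear_chains(layers, keep):
    """Fuse runs of single-input, single-consumer layers; each layer is visited O(1) times."""
    # One pass to record the consumers of every layer.
    dependents = {name: [] for name in layers}
    for name, layer in layers.items():
        for dep in layer.deps:
            dependents[dep].append(name)

    def absorbs_child(name):
        """Return the only child that `name` can be merged into, else None."""
        kids = dependents[name]
        if name in keep or len(kids) != 1:
            return None
        child = kids[0]
        return child if layers[child].deps == (name,) else None

    def is_absorbed(name):
        deps = layers[name].deps
        return len(deps) == 1 and absorbs_child(deps[0]) == name

    fused = {}
    for name in layers:
        if is_absorbed(name):
            continue  # handled when walking down from the head of its chain
        funcs = [layers[name].func]
        tail = name
        while (child := absorbs_child(tail)) is not None:
            funcs.append(layers[child].func)
            tail = child
        # Name the fused layer after the chain's tail so downstream deps stay valid.
        fused[tail] = Layer(compose(funcs), layers[name].deps)
    return fused


def compute(layers, name, part):
    """Evaluate one partition; source layers receive the partition index."""
    layer = layers[name]
    if not layer.deps:
        return layer.func(part)
    return layer.func(*(compute(layers, d, part) for d in layer.deps))


layers = {
    "read": Layer(lambda i: [i, i + 1]),
    "double": Layer(lambda xs: [2 * x for x in xs], ("read",)),
    "inc": Layer(lambda xs: [x + 1 for x in xs], ("double",)),
    "other": Layer(lambda i: [10 * i]),
    "join": Layer(lambda a, b: a + b, ("inc", "other")),
    "total": Layer(sum, ("join",)),
}
fused = fuse_linear_chains(layers, keep={"total"})
print(sorted(fused))  # the six layers collapse to three
```

In this toy graph, read, double and inc collapse into one layer, and join and total into another. Anything with several consumers or several inputs is left alone, which is where a more general pass such as the standard fuse would still be needed.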
