Dask array with pytorch

zeroth · June 13, 2023, 3:52am

Hi All,
I have a question about the Dask array pytorch example
https://blog.dask.org/2021/03/29/apply-pretrained-pytorch-model

In Step 4

# Apply UNet featurization
out = da.map_blocks(unet_featurize, imgs, model, dtype=np.float32, chunks=(1, 1, imgs.shape[2], imgs.shape[3], 16), new_axis=-1)

why the chunk shape/size is (1, 1, imgs.shape[2], imgs.shape[3], 16)
I am confused why there is 16 at the end.

Thanks

guillaumeeb · June 13, 2023, 8:48pm

Hi @zeroth, welcome to Dask community!

In this example, we are applying a pretrained model to a Dask Array, using map_blocks to apply the model to each chunk of data. As explained in Step 2:

This UNet model takes in an 2D image and returns a 2D x 16 array

So we expect a new dimension of len 16 after applying the model to the Dask Array, which is why we are telling map_blocks that the output chunk shape is (1, 1, imgs.shape[2], imgs.shape[3], 16).

Does it make things clearer to you ?

zeroth · June 14, 2023, 9:14am

Hi @guillaumeeb ,
Thanks for the explanation.
This makes sense.

Topic		Replies	Views
Da.map_blocks introduces unexpected chunks? Dask Array dask-array	3	62	July 5, 2024
Map_blocks unexpected behavior adds rows to dim when specifying chunks Dask Array	2	202	August 3, 2023
Parallelize or map chunks of arrays with different sizes, shapes and number of blocks Dask Array dask-array	4	630	July 31, 2023
Using map_blocks() to predict 2D keras model on 3D dask array Dask Array	1	28	November 27, 2024
`map_blocks` with different `chunks` and `new_axis` collapses a dimension when summed Dask Array	8	45	July 17, 2024

Dask array with pytorch

Related topics