- Hmm … it seems the error is not `dask`-related, so I can't be sure from here what is causing it. A corrupted video file, perhaps?
- Correction: once you call `compute()`, `dask` will execute the task graph, not generate it. A task graph is built any time you call a `dask` function like `map_blocks()` or `imread()`.
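For example, here's a toy illustration with a plain `dask` array (nothing from your video pipeline):

```python
import dask.array as da

# Building the array and mapping a function over its blocks only
# constructs a task graph; no work is done yet.
x = da.ones((1000, 1000), chunks=(100, 100))
y = x.map_blocks(lambda block: block * 2)

# compute() is the call that actually executes the graph.
result = y.compute()
print(result.sum())  # 2000000.0
```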
- Since the chunksize (number of images in a chunk) has been changed to 100, be aware that `make_hogs()` receives a chunk of 100 images at once as input. So you should probably loop `hog()` over all those images and stack up the processed results, since `hog()` is unable to process more than one image at a time (right?). Alternatively, you could rewrite `hog()`'s source code to handle a whole chunk at once (without looping), like I did for `pims.as_grey()`. In fact, I encourage you to try that and see if it boosts performance. In the meantime, here's an untested code snippet:
```python
import numpy as np

# `hog` is assumed to be in scope already (e.g. skimage.feature.hog
# called with visualize=True, which returns a (descriptor, image) tuple).
def make_hogs(frames, coords):
    # Crop every frame in the chunk to the region of interest;
    # coords is assumed to be (x, y, width, height).
    new_frames = frames[
        :,
        coords[1]:coords[1] + coords[3],
        coords[0]:coords[0] + coords[2],
    ]
    nframes = new_frames.shape[0]
    # Run hog() once to learn the output shapes, then preallocate.
    first_frame = new_frames[0]
    hog_descriptor, hog_image = hog(first_frame)
    hog_images = np.empty((nframes,) + hog_image.shape,
                          dtype=hog_image.dtype)
    hog_descriptors = np.empty((nframes,) + hog_descriptor.shape,
                               dtype=hog_descriptor.dtype)
    # Loop hog() over the chunk, stacking the results.
    for i, image in enumerate(new_frames):
        hog_descriptor, hog_image = hog(image)
        hog_descriptors[i, ...] = hog_descriptor
        hog_images[i, ...] = hog_image
    return hog_descriptors, hog_images
```
I do not know exactly what `hog()` does; I've simply assumed above that it returns a tuple of two arrays whose sizes remain constant over all the images in the chunk.
- See this link for help on how to return more than one output.
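If it helps, here is an untested sketch of one way to get both outputs with `dask.delayed` instead of `map_blocks()`, assuming `frames` is your `dask` array of video frames chunked along axis 0 and `coords` is the same tuple as above (the variable names are placeholders, not code from your pipeline):

```python
import dask
import numpy as np
from dask import delayed

# Each chunk becomes one lazy make_hogs() call; indexing the Delayed
# result splits the returned tuple into its two outputs.
results = [delayed(make_hogs)(chunk, coords)
           for chunk in frames.to_delayed().ravel()]
descriptor_chunks = [r[0] for r in results]
image_chunks = [r[1] for r in results]

# One compute() call executes the whole graph and returns both lists.
descriptor_chunks, image_chunks = dask.compute(descriptor_chunks,
                                               image_chunks)
hog_descriptors = np.concatenate(descriptor_chunks)
hog_images = np.concatenate(image_chunks)
```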