I recently built a project that parallelizes multiple transformers on a single GPU using Dask, would love to hear your feedback

Rishikesh_gharat · July 17, 2025, 4:23pm

this project was faster than using minikube containers, and gave me a great throughput of 400k reviews in 3 minutes.

the projects aims to first categorize and then, summarize, these categorized reviews.
the core of this program is the architecture of using dask local cluster to parallelize multiple sentence transformers and summarizers on a single GPU / treating this GPU as a minicluster.

I would like some feedback on this project/architecture, as i have not seen any project do this.
drop by my github and maybe leave a star if you like the project. I am currently working on metrics to compare my architecture with other libaries/ technologies to see which is best for my usecase.

Topic		Replies	Views
Distributed multi-GPU Distributed distributed	4	365	November 24, 2023
Dask Arrays with TensorFlow Dask Array dask-array , distributed	3	1182	August 5, 2022
Using dask distributed scheduler with CuPy Distributed	10	1565	March 21, 2023
Run dask in parallel doesn't work as expected, in distributed kubernetes pods Distributed	11	547	March 17, 2023
Dask distributed performance issues Distributed kubernetes , future , distributed	1	269	December 7, 2022

I recently built a project that parallelizes multiple transformers on a single GPU using Dask, would love to hear your feedback

Related topics