Help with Dask Xgboost performance

tlee14 · February 24, 2023, 5:03pm

From my experience, I’ve found the Dask XGBoost classifier to be much slower than using the same number of cores on a single bigger machine. For example, a large machine with 96 cores running regular XGBoost is 2-4 times faster than a Dask cluster of 4 workers with 24 cores each running Dask XGBoost.

Is this supposed to be the case? Does anyone know how I can improve the speed of Dask XGBoost?

guillaumeeb · February 25, 2023, 7:33am

Hi @tlee14, welcome here!

Distributed processing always comes with a cost: serialization, data exchange between processes or over the network, synchronization… It is not surprising at all that code runs more efficiently on a single machine than when it is distributed across several processes or servers, especially if XGBoost is already optimized and parallelized when used on a single machine.
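To make the serialization part of that cost concrete, here is a rough, standard-library-only sketch. It is not how Dask moves data internally; `local_sum` and `shipped_sum` are hypothetical helpers, and `pickle` merely stands in for whatever serializer a distributed framework would use. It only shows that the same computation does extra work once the data has to be packed and unpacked first.

```python
import pickle
import time
from array import array


def local_sum(values):
    """Sum the data in place, as a single-process job would."""
    return sum(values)


def shipped_sum(values):
    """Serialize and deserialize the data before summing it.

    This mimics only the packing/unpacking step of sending data to
    another process; there is no real network transfer here.
    """
    payload = pickle.dumps(values, protocol=pickle.HIGHEST_PROTOCOL)
    received = pickle.loads(payload)
    return sum(received), len(payload)


def timed(fn, *args):
    """Return (result, elapsed_seconds) for a single call."""
    start = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - start


if __name__ == "__main__":
    # Illustrative size only; pick whatever fits your machine.
    data = array("d", (i * 0.5 for i in range(1_000_000)))

    total, t_local = timed(local_sum, data)
    (total_shipped, n_bytes), t_shipped = timed(shipped_sum, data)

    print(f"local:   {t_local:.4f} s")
    print(f"shipped: {t_shipped:.4f} s ({n_bytes} bytes serialized)")
```

On a real cluster the payload also has to cross the network and workers have to synchronize after each boosting round, so the gap is usually larger than what this toy shows.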

Anyway, it would be interesting to understand where the efficiency is lost when using Dask. For that, we would need a minimal reproducible example of some XGBoost code on a smaller case, something that could run on a standard laptop.

mrocklin · February 28, 2023, 2:48pm

Yes, as Guillaume says above, memory bandwidth is faster than network bandwidth. If you can fit your problem on one large machine then you should certainly do so. You should only use Dask when you need to switch to multiple machines. You should avoid this move as long as possible.
