Meta-Estimators with Multiple Models

mattalhonte-srm · June 6, 2022, 10:46pm

What’s a good way to try multiple model types with the meta-estimators (like RandomizedSearchCV) in dask-ml? map?

pavithraes · June 7, 2022, 2:09pm

Could you please share some more details (maybe pseudocode, scikit-learn code) of your intended workflow?

mattalhonte-srm · June 9, 2022, 12:32am

Something to the effect of

with worker_client() as client:
    clf = dcv.RandomizedSearchCV(
        model,
        parameters,
        n_iter=10,
        scheduler=client,
        scoring="f1",
        refit=False,
        return_train_score=True,
    )
    clf.fit(
        train_x,
        loadedtrain_y,
    )

With multiple model types (so like, XGBoost and RandomForest).
Right now I’m just wrapping the above in different Prefect tasks (running on an ephemeral Dask cluster).

Topic		Replies	Views
Basic question about parallelizing different model fits with dask(-ml) Distributed dask-ml	0	190	November 17, 2022
DaskLGBMClassifier and Hypertuning using RandomizedSearchCV with DASK ECS Fargate Cluster Dask DataFrame future , distributed , dask-ml	2	570	March 23, 2023
Using GridsearchCV on Multi-GPU with RF Dask DataFrame distributed , gpu	4	922	April 19, 2023
Save and Load XGBoost model with mutl label Distributed dask-ml , xgboost	2	1030	March 29, 2022
Clarification on Distributed Dask ML (Is ML really Distributed?) Distributed dask-ml	1	236	August 23, 2022

Meta-Estimators with Multiple Models

Related topics