Is it necessary to call compute() before calling to_parquet()?

So calling to_parquet without calling compute() is writing to a number of parquet files within a folder, whereas after calling compute() it is writing to a single file (presumably because it is saving a Pandas dataframe in that case and not a Dask one).

Apologies - I didn’t notice the folders that were being created, I was only looking at the files.

1 Like