I’m encountering the following error in code that performs heavy groupby operations on a Dask DataFrame:
ValueError: cannot reindex on an axis with duplicate labels
Hi @mihir,
Could you share some of your code and what your DataFrames look like?
From a quick internet search, this kind of error usually comes from duplicate values in an index or in the column names.
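To check whether that applies here, something like the following might help. It is only a minimal sketch in plain pandas, not your actual code: the frame `df`, its column `x`, and the labels are made up for illustration. The same checks could be run on a small sample of the Dask DataFrame, for example one pulled with `.head()`.

```python
import pandas as pd

# Hypothetical frame with a duplicated index label ("a" appears twice).
df = pd.DataFrame({"x": [1, 2, 3]}, index=["a", "a", "b"])

# Check both axes for duplicate labels.
has_dup_index = df.index.has_duplicates
has_dup_columns = df.columns.has_duplicates

# Reindexing an axis that carries duplicate labels raises a ValueError.
try:
    df.reindex(["a", "b"])
    error_message = None
except ValueError as exc:
    error_message = str(exc)

# One possible workaround (an assumption, it may not suit your data):
# replace the index with a fresh, unique integer index.
fixed = df.reset_index(drop=True)

print("duplicate index labels:", has_dup_index)
print("duplicate column names:", has_dup_columns)
print("reindex error:", error_message)
print("index unique after reset:", fixed.index.is_unique)
```

If either check reports duplicates on your real data, that would be a good place to start looking.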
I was somehow able to get around this error by removing a repartition statement from the code.