Given this Dask DataFrame :
Dask DataFrame Structure:
date value symbol
npartitions=2
object int64 category[known]
... ...
... ...
Dask Name: from-delayed, 6 tasks2130
How can I set_index on 'symbol' column (which is category[known)?
df = df.set_index('symbol')
Traceback (most recent call last):
[...]
TypeError: Categorical is not ordered for operation max
you can use .as_ordered() to change the Categorical to an ordered one