I have a Koalas / pandas-on-Spark DataFrame named df.
When I run the expression below, I get a TypeError: str object is not callable
df[~(df.time.eq('00:00:00').groupby(df.vehicle_id).transform('sum')>=2)]
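To make the intent of that expression concrete, here is a minimal plain-Python sketch of the same logic: drop every row belonging to a vehicle_id that has two or more rows with time equal to '00:00:00'. The sample rows and the names rows, midnight_counts and kept are made up for illustration and are not taken from the real df.

```python
from collections import Counter

# Hypothetical sample rows as (vehicle_id, time); not the real data.
rows = [
    ("A", "00:00:00"),
    ("A", "00:00:00"),
    ("A", "08:15:00"),
    ("B", "00:00:00"),
    ("B", "09:30:00"),
    ("C", "10:45:00"),
]

# Per vehicle_id, count the rows whose time equals '00:00:00'.
# This plays the role of .eq('00:00:00').groupby(vehicle_id).transform('sum').
midnight_counts = Counter(vid for vid, t in rows if t == "00:00:00")

# Keep only rows whose vehicle has fewer than 2 such rows,
# which is what the ~(... >= 2) mask is meant to select.
kept = [(vid, t) for vid, t in rows if midnight_counts[vid] < 2]

print(kept)
```

With this sample, all rows for vehicle "A" are dropped, while the rows for "B" and "C" are kept.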
When I check the datatypes of both columns I get:
print(df.time.dtype)
<U0
print(df.vehicle_id.dtype)
<U0
Could that have something to do with it?