I'd like to take a subset of rows of a Dask dataframe based on a set of index keys. (Specifically, I want to find rows of ddf1 whose index is not in the index of ddf2.)
I have tried both
cache.drop([overlap_list])
and
diff = cache[should_keep_bool_array]
but each either raises a NotImplementedError or otherwise doesn't work.
What is the best way to do this?