I have a large arrow file with 14 million rows. In my app I select two columns and bin them using the count/binby functionality in Vaex.
df.count(
binby=axes,
limits=limits,
shape=(binnum,)*len(axes),
delay=True
)
Some of my columns act as a mask and have either a 0 or 1. Here's an example
# x y mask
1 1.5 4.7 0
2 0.3 2.3 1
3 2.6 9.4 1
4 5.0 3.7 0
I wish to bin the points in the x and y axes that only have a 1 in the mask column. How do I do this?