Hi, I am wondering whether there is a Vaex equivalent to the pandas join below. Essentially, I am trying to join a dataframe onto itself so that rows are paired where the values in column 1 match and the values in column 2 differ.
My dataframe has ~550 million rows, so I'm trying to use Vaex for the speed.
df[['col1','col2']].join(df[['col1','col2']].set_index('col1'), on='col1', rsuffix='_y').query('col2 != col2_y')
Example input (Column A corresponds to col1, Column B to col2):
Column A | Column B |
---|---|
1 | a |
1 | b |
1 | c |
2 | d |
2 | e |
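
To make the result I'm after concrete, here is a minimal pure-Python sketch of the same logic applied to the example input. The function name `self_join_mismatch` is only illustrative, and it assumes Column A plays the role of col1 and Column B the role of col2. It is not meant to scale to 550 million rows; it only shows which pairs I expect back.

```python
from collections import defaultdict
from itertools import permutations

# Example input from the table above: (Column A, Column B) pairs.
rows = [(1, "a"), (1, "b"), (1, "c"), (2, "d"), (2, "e")]


def self_join_mismatch(rows):
    """Pair up rows that share the first value but differ in the second.

    Hypothetical helper for illustration only; not part of pandas or Vaex.
    """
    # Group the second-column values by the first-column key.
    groups = defaultdict(list)
    for key, value in rows:
        groups[key].append(value)

    result = []
    for key, values in groups.items():
        # Ordered pairs of distinct rows within one key, as a self-join gives.
        for left, right in permutations(values, 2):
            # Drop pairs whose second-column values are equal.
            if left != right:
                result.append((key, left, right))
    return result


for row in self_join_mismatch(rows):
    print(row)
```

For the example input this yields 8 rows: six for key 1 (every ordered pair of a, b, c) and two for key 2 (d with e, and e with d).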