I cant resolve this since this morning,
I have a dataset with 2 columns : id_campaign and id_client
I need to know the number of client in a couple of campaign, he an exemple :
I want a dataset like this :
But with this command I have 0's or I want the number of crossed campaign
Have you got any ideas please ? I can explain more if necessary
Edit : I can do it in PySpark or in Python
And I tried multiple combinations of aggregation with groupBy() ; pivot_table() ; and partitionBy() without satisfying results