I've been spending most of yesterdays time on the following problem and haven't found a solution yet to the following problem:
I have a dataframe with categorical data: say category1: has values A and B; Antother column category2 has values C, D, F, G; category3 has values H and so on...
I want to make a Sankey diagram showing how many (through the widths of the bands from node to node) from category1 A are in C, D, F, G. And this for all other combinations in the grouped dataframe as well.
It's basically a tree with the width of the branches showing how many counts are in the particular branch.
Is there a way on how to do this in a flexible way so that it works for most groupings in categorical DF's?