There are a lot of packages for Sankey diagrams. However, these packages assume the data is already structured. I'm looking at a transaction dataset where I would like to pull out the first sequence of products in a time series. Assume the time series is already ordered.
Here is the dataset:
structure(list(date = structure(c(1546300800, 1546646400, 1547510400, 1547596800, 1546387200, 1546646400, 1546732800), class = c("POSIXct", "POSIXt"), tzone = "UTC"),
client = c("a", "a", "a", "a", "b", "b", "b"),
product = c("butter", "cheese", "cheese", "butter", "milk", "garbage bag", "candy"),
qty = c(2, 3, 4, 1, 3, 4, 6)), row.names = c(NA, -7L), class = c("tbl_df", "tbl", "data.frame"))
Here is the desired output: