Lets say I have flight data (from Foundry Academy).
Starting dataset:
Date | flight_id | origin_state | carrier_name |
---|---|---|---|
jan | 000000001 | California | delta air |
jan | 000000002 | Alabama | delta air |
jan | 000000003 | California | southwest |
feb | 000000004 | California | southwest |
... | ... | ... | ... |
I'm doing monthly data aggregation by state and by carrier. Header of my aggregated data looks like this:
origin state | carrier name | jan | feb | ... |
---|---|---|---|---|
Alabama | delta air | 1 | 0 | ... |
California | delta air | 1 | 0 | ... |
California | southwest | 1 | 1 | ... |
I need to get subtotals for each state; I need to sort by most flights; and I want it to be sorted by states, then by carrier.
desired output
origin state | carrier name | jan | feb | ... |
---|---|---|---|---|
California | null | 2 | 1 | ... |
California | delta air | 1 | 0 | ... |
California | southwest | 1 | 1 | ... |
Alabama | null | 1 | 0 | ... |
Alabama | delta air | 1 | 0 | ... |
PIVOT - doesn't provide subtotals for categories;
EXPRESSION - doesn't offer possibility to split date column into columns.