I hope I'm not missing an easy solution am still getting used to DAX and can't yet find an appropriate logic.
I have a large dataset, >10m rows which I want to test. An identifier column "DocumentNumber" might occur on multiple rows and I want to find where the sum of "Value" over these rows for a given "DocumentNumber" is non-zero.
Tried to use
PowerQuery > removed all
but these twocolumns > Group By > DocumentNumber > Sum of Value
. However my 32 bit version of Excel appears to run out of memory performing this stepExpression.Error: Evaluation ran out of memory and can't continue.
Wrote a DAX
measure > Sum of Values
and dropped into a pivot table with a view to filtering out the zero values but when I try to drag in theDocumentNumber
to rows there are more than a million rows so the table won't render.
Is there a logic I should follow in DAX that would achieve step 2 before bringing it to the pivot table? Can DAX actually create a new table in the data model which is the aggregated and filtered data rather than using a pivot? I believe this is possible in PowerBI but not sure about Excel evironment.