
So, my question is: I have a bronze-to-silver ingestion, and this data is later consumed by gold notebooks. I want to verify that all the tables a gold notebook uses are up to date. My first thought was to use DLT expectations: in my golds, create expectations that check whether the upstream silver tables' timestamps are outdated. However, DLT expectations only support checks against columns of the current table...

Other approaches I considered:

  1. Add a check block inside each notebook (bad for maintenance: every new notebook needs someone to remember to add the check).

  2. Run a test notebook in the pipeline before each gold notebook that verifies the sources and raises an error if any table is stale or missing (this clutters the test notebook).

  3. Determine in code which notebooks can run and iterate over the available ones (nothing inside the notebooks changes and the pipeline stays scalable, but visibility is lost).

  4. Block overwrites and updates of Delta tables that should not be updated, so such writes raise an error (not great, since appends still go through).

  5. Ideal: the silver raises an error and the gold does not update (this does not work in our scenario).
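A minimal sketch of the freshness check behind option 2, in pure Python. The table names and the 24-hour threshold are hypothetical; in a real pipeline the last-commit timestamps would come from Delta's `DESCRIBE HISTORY` (or `DeltaTable.forName(spark, name).history(1)`) rather than a hard-coded dict:

```python
from datetime import datetime, timedelta, timezone

def stale_tables(last_commit, max_age=timedelta(hours=24), now=None):
    """Return names of source tables whose latest commit is older than max_age.

    last_commit: mapping of table name -> datetime of its most recent Delta
    commit (in a real job, fetched via DESCRIBE HISTORY).
    """
    now = now or datetime.now(timezone.utc)
    return [name for name, ts in last_commit.items() if now - ts > max_age]

# Hypothetical last-commit timestamps for two silver tables.
commits = {
    "silver.orders": datetime(2023, 8, 11, 6, 0, tzinfo=timezone.utc),
    "silver.customers": datetime(2023, 8, 9, 6, 0, tzinfo=timezone.utc),
}
outdated = stale_tables(commits, now=datetime(2023, 8, 11, 12, 0, tzinfo=timezone.utc))
# In the pre-check notebook you would raise an error here if `outdated` is
# non-empty, which stops the downstream gold notebook from running.
```

The same helper can be shared by all gold pre-checks, so option 1's "remember to add a check block" problem shrinks to a one-line call per notebook.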

Ismael
  • What do you want to achieve, if we discuss it not from the implementation but from the business standpoint? – Alex Ott Aug 11 '23 at 07:03
  • Thanks for the answer; I'm trying to think about it differently: create expectations in the golds to certify that the data will be consistent (it can be outdated, but consistent). Later, a job will report the outdated silvers and stop the refresh of the Power BI datasets that use those tables. Then, after the outdated silvers are fixed, we trigger those golds and the BI refresh job again. What do you think? The silver failure rate is very low. – Ismael Aug 11 '23 at 13:36
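The flow described in that comment could be sketched as follows. All names here are hypothetical, and the actual Power BI refresh (which would go through the Power BI REST API) is left out; the sketch only shows the gating decision of which datasets are safe to refresh:

```python
def refreshable_datasets(dataset_sources, outdated):
    """Return datasets whose silver source tables are all up to date.

    dataset_sources: mapping of Power BI dataset name -> set of silver
    tables it consumes (a hypothetical mapping you would maintain).
    outdated: tables reported as stale by the freshness check.
    """
    stale = set(outdated)
    return [ds for ds, sources in dataset_sources.items() if not sources & stale]

# Hypothetical mapping of datasets to the silver tables they consume.
sources = {
    "sales_dashboard": {"silver.orders", "silver.customers"},
    "ops_dashboard": {"silver.orders"},
}
ok = refreshable_datasets(sources, outdated=["silver.customers"])
# Datasets with only fresh sources get refreshed now; the others wait until
# the outdated silvers are fixed and the golds are re-triggered.
```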

0 Answers