I have a table in SQL Server that has a few columns: ViewName
, ExternalTableDefinition_Synapse
, ExternalTableDefinition_ADB
, ViewDefinition_Synapse
, ViewDefinition_ADB
There may be hundreds of thousands of records here.
The task is to iterate over this table and get the external tables and views created on Synapse (Serverless) and Azure DataBricks. However,
- The process must be as fast as possible.
- There must be logging in case any of failed scripts.
Solutions tried till now –
- Looping thru the table in ADF and executing it via notebook (ADB) and Script Activity (Synapse) – ADF brings in parallelism but causes queueing. Also, expensive.
- Copy this table into ADLS and create an external table over it. Have some logic to concatenate the scripts and execute it – it runs as an atomic operation, so if out of 1000, 1 script has issues, the rest of them also won’t run.
Right now, I have no option but to lean towards the first one as it performs logging, albeit being slow overall.
Looking for better solutions.