The commonly recommended pattern for inserting new data into an existing table in Azure SQL Data Warehouse seems to be a CTAS that unions the existing rows with the new ones:
create table dbo.MyTable_New
with (distribution = round_robin)
as
select
Col1
,Col2
from dbo.MyTable
union all
select
Col1
,Col2
from dbo.MyNewStuff;
What we are seeing is that on really large tables this degrades over time, and it's obvious why: every load reads everything already in the table and re-inserts it. This seems sub-optimal to me.
Does anyone have alternative approaches that have worked for them? I am thinking of things like partition switching, for example.
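To make the partition switching idea concrete, here is a rough, untested sketch of what I have in mind. It assumes things that are not in the tables above: a hypothetical integer LoadDate column on both dbo.MyTable and dbo.MyNewStuff, dbo.MyTable already partitioned on that column with the same boundaries, a made-up staging table name (dbo.MyNewStuff_Stage), and an empty target partition.

```sql
-- Assumption: dbo.MyTable is already partitioned on a hypothetical LoadDate
-- column using the same boundary values as the staging table below.

-- 1. Stage only the new rows, matching the target's distribution and partitions.
create table dbo.MyNewStuff_Stage
with (
    distribution = round_robin,
    partition (LoadDate range right for values (20240101, 20240201))
)
as
select
Col1
,Col2
,LoadDate
from dbo.MyNewStuff;

-- 2. Switch the staged partition into the main table.
--    With the boundaries above, partition 3 holds LoadDate >= 20240201.
--    The target partition is assumed to be empty.
alter table dbo.MyNewStuff_Stage switch partition 3 to dbo.MyTable partition 3;
```

The appeal is that only the new rows are written, and the switch itself should be a metadata operation rather than a rewrite of the existing data.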