0

my company wants to build a Data warehouse in Redshift. We have an OLTP database running in Amazon Aurora and we are thinking of using the DMS (data migration service). I am trying to get my head around the capabilities of CDC (change data capture). The thing is that CDC (over DMS) replicates and stores changes (in our case in Redshift) and I was wondering if it is possible to select specific columns which I want to store (this should be possible to do with table mapping - include) and based on which I want to store? As far as I understand it, if any columns of a row are updated, then the replication is triggered, which could mean a replication that is useless (e.g. if somebody updates a column that I do not want to follow)

E.g. I have a table with leads, which has some 30 columns. Now I am interested in the DW purposes only in 5 columns and I want to get a new line to the redshift table only if any of those 5 columns changes (is updated)... like if the stage of lead is changed, I will get a new line. On the other hand, I am not interested in the column 'Salesmans_comment' so if the salesman updates a comment, I do not want to have a new line, because I am not interested in it...Cheers!

I have run through most of the available yt tutorials and read through the documentation, but I haven't found a clear answer...

Thanks

John Rotenstein
  • 241,921
  • 22
  • 380
  • 470
DavidZ
  • 1
  • 1
  • Might be relevant: [mysql - AWS DMS replicate only selected columns - Stack Overflow](https://stackoverflow.com/questions/70459163/aws-dms-replicate-only-selected-columns) – John Rotenstein Jan 25 '23 at 00:19

0 Answers0