I have a scenario where I'm fetching data from one database (Postgres) and loading it into a table in a different database (Redshift).
Is there any way in Kettle to schedule this job?
It's a simple insert into Redshift, select * from Postgres.
Using a Table Output step can be painfully slow, because Redshift is optimized for bulk loads rather than row-by-row inserts. AFAIK, there are no steps/plugins in Kettle for bulk inserts into Redshift. What you can do is write a script in a Shell step that: