
The Data Pipeline job runs on a schedule and calls a shell script that ultimately invokes pg_dump. I'd like to keep generating the pg_dump backups since they are useful, but move away from Data Pipeline, since I notice AWS is soon removing console access to it. I was thinking of doing the work in Glue or Lambda instead. Any ideas for a good approach to take?
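For context, the scheduled script boils down to something like this (a minimal sketch in Python rather than shell; the host, database, user, and bucket names are placeholders, and the S3 destination is my assumption about where the backups land, not something stated above):

    import subprocess
    import boto3

    # Placeholder connection details and bucket name -- not the real values.
    HOST, USER, DB = "mydb.example.com", "backup_user", "mydb"
    BUCKET, KEY = "my-backup-bucket", "backups/mydb.sql"
    DUMP_FILE = "/tmp/mydb.sql"

    # Dump the database to a local file (password supplied via PGPASSWORD or ~/.pgpass).
    subprocess.check_call(["pg_dump", "-h", HOST, "-U", USER, "-d", DB, "-f", DUMP_FILE])

    # Upload the dump to S3.
    boto3.client("s3").upload_file(DUMP_FILE, BUCKET, KEY)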

I've so far tried to write a Glue job (a PySpark script) that builds the pg_dump command line as a string and runs it with subprocess:

    import subprocess
    subprocess.check_call(cmdString, shell=True)

where cmdString holds the full pg_dump command (since it is a single string rather than an argument list, it needs shell=True).

But of course that does not work, since the job has no way to locate the pg_dump program: it is not installed on the Glue workers.
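For what it's worth, this is easy to confirm from inside the job (a minimal sketch; checking PATH with shutil.which is just one way to see that the binary is absent):

    import shutil

    # On a Glue worker pg_dump is not on PATH, so this prints None;
    # the subprocess call above fails for the same reason.
    print(shutil.which("pg_dump"))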
