I am loading large .csv files into a PostgreSQL database and want to use pgloader because it can skip and log erroneous records. However, I frequently get the following error:
ERROR PostgreSQL Database error 22021: invalid byte sequence for encoding "UTF8": 0x00
I first thought this was due to 0x00 characters in my data. I tried different ways of removing them, to no avail, until I noticed that some values, e.g. timestamps containing three or more 0 characters such as 17:46:30 +0000, seem to be causing the problem. Does anyone know how to handle this without changing my data? About 2/3 of my records are affected. Ideally, I am looking for a fix that lets me import the data using the same procedure.