I'm trying to move my project into a kedro pipeline but I'm struggling with the following step:
my prediction pipeline is being run by a scheduler. The scheduler supplies all the necessary parameters (dates, country codes etc.). Up until now I had a CLI which would get input parameters such as below
python predict --date 2022-01-03 --country UK
The code would then read the input dataset for a given date and for a given country, so the query would be something like:
SELECT *
FROM input_data_{country}
WHERE date = {date}
and this would be formatted using the input variables passed in the CLI.
Important note: the code has to run on any arbitary date passed by the scheduler, and not only on "today".
How would I parametrize Kedro's data catalog using CLI arguments?
I tried the examples in the documentation of Kedro but it seems that they are mainly geared towards using templates from config in reading the data. The key issue I'm struggling with is passing CLI arguments to the data catalog and haven't found a working solution. I looked into PartitionedDataSet
but I don't see an option to have CLI arguments as inputs there