I use Airflow for various ETL work, but I've also started using UDFs heavily.
I'd like to organize my UDFs in a dataset my_project.my_udfs
, and I was hoping to be able to utilize Airflow for this purpose. Is there a way to do so?
I ultimately want to be able to schedule queries like this, simple example:
CREATE FUNCTION `my_project.my_udfs.normalize`(s STRING)
RETURNS STRING
AS TRIM(LOWER(s));
A couple of answers to questions you may be thinking of:
- I'm part of a broader organization that uses Airflow, and the main benefit I want to leverage here is to have source control over these functions.
- The example is not such a case, but many of these functions are ones that will be updated periodically (monthly/quarterly).
Thanks in advance!