I am trying to build a service that lets notebook users set automation parameters in a cell, such as the time at which the notebook should start executing. The service would read that time, execute the notebook at the requested time, and store the executed notebook in S3. I have looked into papermill, but I believe it offers no way to specify automation parameters such as a start time. Is there any way to achieve this, either with papermill or with another tool?

Deesha

2 Answers

Papermill only handles parameterizing and executing notebooks, not scheduling them. For that, you need another tool. You can build something yourself on top of Apache Airflow, which seems to be the most widely used scheduler for this kind of use case. It has native support for Papermill (see here). Or you can use a ready-made tool like Paperboy.

For an in-depth look at scheduling notebooks, take a look at the article by Netflix.
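If you do end up building the service yourself, the split described above (papermill executes, something else decides when) could look roughly like the sketch below. It is only an illustration under several assumptions: the cell tag `automation`, the variable names `start_time` and `output_path`, and the helper names are all made up for this example and are not papermill features. The only papermill call used is `execute_notebook`, which can write to an `s3://` path when papermill is installed with its S3 extra.

```python
import ast
import json
from datetime import datetime, timezone


def read_automation_params(notebook_json: str, tag: str = "automation") -> dict:
    """Collect simple `name = literal` assignments from code cells carrying `tag`.

    The tag name is a hypothetical convention for this sketch.
    """
    nb = json.loads(notebook_json)
    params = {}
    for cell in nb.get("cells", []):
        if cell.get("cell_type") != "code":
            continue
        if tag not in cell.get("metadata", {}).get("tags", []):
            continue
        source = cell.get("source", "")
        if isinstance(source, list):  # source may be a string or a list of lines
            source = "".join(source)
        for node in ast.parse(source).body:
            if (isinstance(node, ast.Assign) and len(node.targets) == 1
                    and isinstance(node.targets[0], ast.Name)):
                try:
                    params[node.targets[0].id] = ast.literal_eval(node.value)
                except ValueError:
                    pass  # ignore values that are not plain literals
    return params


def seconds_until(start_time_iso: str, now: datetime) -> float:
    """Delay until the requested start; zero if the time has already passed."""
    start = datetime.fromisoformat(start_time_iso)
    if start.tzinfo is None:
        start = start.replace(tzinfo=timezone.utc)  # assumption: naive times are UTC
    return max(0.0, (start - now).total_seconds())


def run_notebook(input_path: str, output_path: str) -> None:
    """Execute the notebook with papermill and write the result to output_path."""
    import papermill as pm  # imported lazily so the helpers above work without it
    pm.execute_notebook(input_path, output_path)
```

A scheduler (an Airflow task, cron, or a plain worker loop) would call `read_automation_params` when the notebook is submitted, wait `seconds_until(...)`, and then call `run_notebook` with an output path such as `s3://my-bucket/out.ipynb` (a placeholder bucket name).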

Tomáš Linhart

Take a look at the code here and here for a wrapper that will schedule notebook execution.

The shell scripts above create a VM, run the notebook, save the output, and destroy the instance.

In Google Cloud AI Platform Notebooks, we provide a scheduling service, which is currently in Beta.

gogasca