PyData Global 2022

Teaching papermill new tricks: creating custom engines for flexible notebook execution
12-01, 12:00–12:30 (UTC), Talk Track I

This talk will show you how to build papermill plugins. As motivating examples, we'll describe how to customize papermill for notebook debugging and profiling.


Papermill is a widely used library for executing notebooks programmatically; however, most users are unaware of its plugin mechanism that allow us to customize notebook execution.

This talk will explain how we can create papermill plugins; furthermore, as motivating examples, we'll show how to implement two plugins: one for notebook profiling and another for notebook debugging. By the end of the talk, attendees will be able to implement papermill plugins. We'll also provide example code so they can build on top of the two example use cases.

Outline
[0 - 2 minute] Introduction to papermill
[2 - 6] papermill's plugin system
[6 - 10] Creating a new engine
[10 - 18] Use case: notebook profiling
[18 - 26] Use case: notebook debugging
[26 - 28] Summary and conclusions
[28 - 30] Q&A


Prior Knowledge Expected

Previous knowledge expected

Eduardo Blancas is the Co-Founder and CEO of Ploomber, a Y Combinator-backed company developing tools to bridge the gap between interactive data work and production. Before that, he was a Data Scientist at Fidelity Investments, where he deployed the first customer-facing Machine Learning model for asset management. Eduardo holds an M.S. in Data Science from Columbia University and a B.S. in Mechatronics Engineering from Tecnológico de Monterrey.

This speaker also appears in: