PyData Global 2022

Revolutionizing the Big Data Age With Compute over Data
12-01, 22:00–22:30 (UTC), Talk Track II

Introducing a new project, Compute over Data (Bacalhau), to run any computation on decentralized data. No need to move large datasets & all languages/data are supported. If you can run Docker/WASM, you're in the game!
Bacalhau is a decentralized public computation network that takes a job and moves it near where the data stored, including across a decentralized server network that stores data and runs jobs inside it. Bacalhau runs the job near where data lives and eliminates data management for the user.


speaker: David Aronchick


Prior Knowledge Expected

No previous knowledge expected

David leads Compute over Data at Protocol Labs, helping, deploying and organizing the community building the next generation of the Internet.
Previously, he led Open Source Machine Learning Strategy at Azure, product management for Kubernetes on behalf of Google, launched Google Kubernetes Engine, and co-founded the Kubeflow project and the SAME project. He has also worked at Amazon, Chef and co-founded three startups.
When not spending too much time in service of electrons, he can be found on a mountain (on skis), traveling the world (via restaurants) or participating in kid activities, of which there are a lot more than he remembers than when he was that age.