PyData Global 2022

Katrina Riehl

Katrina is the Head of the Streamlit Data Team at Snowflake. She is joining Georgetown University as adjunct faculty this Spring. She also volunteers as the President of the Board of Directors at NumFOCUS, a non-profit supporting the PyData open source ecosystem. For almost two decades, Katrina has worked extensively in the fields of scientific computing, machine learning, data mining, and visualization. Most notably, she has helped lead data science efforts at the University of Texas Austin Applied Research Laboratory, Apple, HomeAway (now, Vrbo), and Cloudflare. Katrina received MS and PhD degrees in Computer Science from the University of Texas at Dallas.

The speaker's profile picture

Sessions

12-02
16:00
120min
Too much data? When big data starts to become a bad idea
Cheuk Ting Ho, Jesper Dramsch, Alexander CS Hendorf, Katrina Riehl, John Sandall

Nowadays we know the social media and tech giants are honesting tons of data from their users and most of us agree that the capability of these companies to deliver their suggestions and customization for you is driven by big data.

However, this brings a question: Is more data always better? Do more data equal to more accurate model? When do you need big data and when does it start becoming a bad idea? Let's find out in this panel session.

Workshop/Tutorial II