Linear Digressions

Procella: YouTube's super-system for analytics data storage

Linear Digressions

This is a re-release of an episode that originally ran in October 2019. If you’re trying to manage a project that serves up analytics data for a few very distinct uses, you’d be wise to consider having custom solutions for each use case that are optimized for the needs and constraints of that use cases. You also wouldn’t be YouTube, which found themselves with this problem (gigantic data needs and several very different use cases of what they needed to do with that data) and went a different way: they built one analytics data system to serve them all. Procella, the system they built, is the topic of our episode today: by deconstructing the system, we dig into the four motivating uses of this system, the complexity they had to introduce to service all four uses simultaneously, and the impressive engineering that has to go into building something that “just works.”

Next Episodes

Linear Digressions

The Data Science Open Source Ecosystem @ Linear Digressions

📆 2020-06-29 04:34 / 00:23:06


Linear Digressions

Rock the ROC Curve @ Linear Digressions

📆 2020-06-22 01:34 / 00:15:52


Linear Digressions

Criminology and Data Science @ Linear Digressions

📆 2020-06-15 03:26 / 00:30:57



Linear Digressions

An interstitial word from Ben @ Linear Digressions

📆 2020-06-05 03:38 / 00:05:59