Linear Digressions

Organizing Google's Datasets

Linear Digressions

If you're a data scientist, there's a good chance you're used to working with a lot of data. But there's a lot of data, and then there's Google-scale amounts of data. Keeping all that data organized is a Google-sized task, and as it happens, they've built a system for that organizational challenge. This episode is all about that system, called Goods, and in particular we'll dig into some of the details of what makes this so tough. Relevant links: http://static.googleusercontent.com/media/research.google.com/en//pubs/archive/45390.pdf

Next Episodes

Linear Digressions

Fighting Cancer with Data Science: Followup @ Linear Digressions

📆 2016-10-24 03:58 / 00:25:48


Linear Digressions

The 19-year-old determining the US election @ Linear Digressions

📆 2016-10-17 03:01 / 00:12:28


Linear Digressions

How to Steal a Model @ Linear Digressions

📆 2016-10-10 00:57 / 00:13:36


Linear Digressions

Regularization @ Linear Digressions

📆 2016-10-03 04:13 / 00:17:27


Linear Digressions

The Cold Start Problem @ Linear Digressions

📆 2016-09-26 04:24 / 00:15:37