Linear Digressions

Optimized Web Crawling

Linear Digressions

Got a fun optimization problem for you this week! It’s a two-for-one: how do you optimize the web crawling logic of an operation like Google search so that the results are, on average, as up-to-date as possible, and how do you optimize your solution of choice so that it’s maintainable by software engineers in a huge distributed system? We’re following an excellent post from the Unofficial Google Data Science blog going through this problem. Relevant links: http://www.unofficialgoogledatascience.com/2018/07/by-bill-richoux-critical-decisions-are.html

Next Episodes

Linear Digressions

Better Know a Distribution: The Poisson Distribution @ Linear Digressions

πŸ“† 2018-10-22 02:53 / βŒ› 00:31:51


Linear Digressions

Searching for Datasets with Google @ Linear Digressions

πŸ“† 2018-10-15 03:11 / βŒ› 00:19:54


Linear Digressions

It's our fourth birthday @ Linear Digressions

πŸ“† 2018-10-08 04:33 / βŒ› 00:22:06


Linear Digressions

Gigantic Searches in Particle Physics @ Linear Digressions

πŸ“† 2018-09-30 20:52 / βŒ› 00:24:46


Linear Digressions

Gigantic Searches in Particle Physics @ Linear Digressions

πŸ“† 2018-09-30 20:51 / βŒ› 00:24:46