Linear Digressions

Reinforcement Learning Gone Wrong

Last week’s episode on artificial intelligence gets a huge payoff this week: we’ll explore a wonderful couple of papers about all the ways that artificial intelligence can go wrong. Malevolent actors? You bet. Collateral damage? Of course. Reward hacking? Naturally! It’s fun to think about, and the discussion starting now will have reverberations for decades to come. https://www.technologyreview.com/s/601519/how-to-create-a-malevolent-artificial-intelligence/ http://arxiv.org/abs/1605.02817 https://arxiv.org/abs/1606.06565

Next Episodes

Reinforcement Learning for Artificial Intelligence @ Linear Digressions

📆 2016-07-03 20:28 / ⌛ 00:18:30



How the sausage gets made @ Linear Digressions

📆 2016-06-20 04:25 / ⌛ 00:29:13


SMOTE: makin' yourself some fake minority data @ Linear Digressions

📆 2016-06-13 05:06 / ⌛ 00:14:37


Conjoint Analysis: like AB testing, but on steroids @ Linear Digressions

📆 2016-06-06 04:13 / ⌛ 00:18:27