Linear Digressions

Inside a Data Analysis: Fraud Hunting at Enron

Linear Digressions

It's storytime this week--the story, from beginning to end, of how Katie designed and built the main project for Udacity's Intro to Machine Learning class, when she was developing the course. The project was to use email and financial data to hunt for signatures of fraud at Enron, one of the biggest cases of corporate fraud in history; that description makes the project sound pretty clean but getting the data into the right shape, and even doing some dataset merging (that hadn't ever been done before), made this project much more interesting to design than it might appear. Here's the story of what a data analysis like this looks like...from the inside.

Next Episodes

Linear Digressions

What's the biggest #bigdata? @ Linear Digressions

📆 2016-05-09 03:28 / 00:25:31


Linear Digressions

Data Contamination @ Linear Digressions

📆 2016-05-02 04:24 / 00:20:58


Linear Digressions

Model Interpretation (and Trust Issues) @ Linear Digressions

📆 2016-04-25 02:45 / 00:16:57


Linear Digressions

Updates! Political Science Fraud and AlphaGo @ Linear Digressions

📆 2016-04-18 04:48 / 00:31:43


Linear Digressions

Ecological Inference and Simpson's Paradox @ Linear Digressions

📆 2016-04-11 04:43 / 00:18:32