Linear Digressions

What's *really* so hard about feature engineering?

Feature engineering is ubiquitous but gets surprisingly difficult surprisingly fast. What could be so complicated about just keeping track of what data you have, and how you made it? A lot, as it turns out: most data science platforms at this point include explicit features (in the product sense, not the data sense) just for keeping track of and sharing features (in the data sense, not the product sense). Just like a good library needs a catalogue, a city needs a map, and a home chef needs a cookbook to stay organized, modern data scientists need feature libraries, data dictionaries, and a general discipline around generating and caring for their datasets.

Next Episodes

Data storage for analytics: stars and snowflakes @ Linear Digressions

📆 2019-09-30 13:22 / ⌛ 00:15:22


Data storage: transactions vs. analytics @ Linear Digressions

📆 2019-09-23 03:49 / ⌛ 00:16:08



Data science teams as innovation initiatives @ Linear Digressions

📆 2019-09-09 04:24 / ⌛ 00:15:21


Can Fancy Running Shoes Cause You To Run Faster? @ Linear Digressions

📆 2019-09-02 01:44 / ⌛ 00:30:15