Linear Digressions

The Lottery Ticket Hypothesis

Recent research into neural networks suggests that not all parts of a network are equally responsible for its overall performance. Instead, it seems that (in some neural nets, at least) there are smaller subnetworks where most of the predictive power resides. The fascinating thing is that, for some of these subnetworks (so-called “winning lottery tickets”), it’s not the training process that makes them good at their classification or regression tasks: they just happened to be initialized in a way that was very effective. This changes the way we think about what training might be doing, in a pretty fundamental way. Sometimes, instead of crafting a good fit from whole cloth, training might be finding the parts of the network that had predictive power to begin with, then isolating and strengthening them. This research is pretty recent, having only come to prominence in the last year, but it nonetheless challenges our notions of what it means to train a machine learning model.
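For listeners who like to see the idea in code, here is a minimal sketch of the usual recipe for looking for a winning ticket: train the full model, prune the smallest-magnitude weights, reset the survivors to their original initial values, and retrain only those. Everything in it is an assumption made for illustration: the function names `train` and `magnitude_mask`, the toy data, and the use of a tiny linear model in place of a real neural net. It shows the mechanics of the procedure, not evidence for the hypothesis itself.

```python
import random

def train(weights, mask, data, lr=0.1, epochs=200):
    """Gradient descent on squared error for a linear model.
    Weights whose mask entry is 0 are pruned and stay at zero."""
    w = [wi * mi for wi, mi in zip(weights, mask)]
    for _ in range(epochs):
        for x, y in data:
            err = sum(wi * xi for wi, xi in zip(w, x)) - y
            w = [(wi - lr * err * xi) * mi for wi, xi, mi in zip(w, x, mask)]
    return w

def magnitude_mask(trained, keep_fraction):
    """Build a 0/1 mask that keeps only the largest-magnitude weights."""
    k = max(1, round(len(trained) * keep_fraction))
    order = sorted(range(len(trained)), key=lambda i: abs(trained[i]), reverse=True)
    keep = set(order[:k])
    return [1.0 if i in keep else 0.0 for i in range(len(trained))]

rng = random.Random(0)

# Hypothetical toy data: only the first two of eight inputs affect the target.
data = []
for _ in range(50):
    x = [rng.uniform(-1, 1) for _ in range(8)]
    data.append((x, 2.0 * x[0] - 3.0 * x[1]))

initial = [rng.uniform(-0.5, 0.5) for _ in range(8)]  # the original initialization

dense = train(initial, [1.0] * 8, data)           # 1. train the full model
mask = magnitude_mask(dense, keep_fraction=0.25)  # 2. prune the small weights
ticket = train(initial, mask, data)               # 3. reset survivors to init, retrain

print("mask:  ", mask)
print("ticket:", [round(w, 3) for w in ticket])
```

In a real experiment the model would be a deep network, pruning would typically be repeated over several rounds, and the interesting comparison would be between the reset subnetwork and the same subnetwork with a fresh random initialization.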

Next Episodes




Running experiments when there are network effects @ Linear Digressions

📆 2020-01-27 01:13 / ⌛ 00:24:45


Zeroing in on what makes adversarial examples possible @ Linear Digressions

📆 2020-01-20 03:41 / ⌛ 00:22:51