Building the HowTo100M Video Corpus

Data Skeptic

Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. Machine-transcribed explainer videos offer a unique opportunity to rapidly build a useful, if dirty, corpus of "self-annotating" videos, in which hosts explain the actions they are taking on screen.
