Cast List: Building the howto100m Video Corpus

Data Skeptic

Episode: Building the howto100m Video Corpus
Website: Data Skeptic
Feed URL: https://dataskeptic.com/api/blog/rss
Published: 2019-08-19 02:00

Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. The availability of machine-transcribed explainer videos offers a unique opportunity to rapidly develop a useful, if dirty, corpus of videos that are "self-annotating", as hosts explain the actions they are taking on screen. This episode is a discussion of the HowTo100M dataset, a project which has assembled a video corpus of 136M video clips with captions covering 23k activities.

Related Links

The paper will be presented at ICCV 2019

@antoine77340

Antoine on GitHub

Antoine's homepage

Next Episodes

BERT @ Data Skeptic

📆 2019-07-29 08:42

BERT @ Data Skeptic

📆 2019-07-29 02:00

Onnx @ Data Skeptic

📆 2019-07-22 09:52

Onnx @ Data Skeptic

📆 2019-07-22 02:00

Catastrophic Forgetting @ Data Skeptic

📆 2019-07-15 10:40