Diffusion Models for Video Generation: Imagen Video

Imagen Video

Transformers

Attention Is All You Need Paper

Transfer Learning

Using Transfer Learning for Musical Instrument Classification

MERLOT: Multimodal Neural Script Knowledge Models

MERLOT: Multimodal Neural Script Knowledge Models Presentation

Learning Temporal Video-Language Grounding for Egocentric Videos
