Three hour lecture that introduces some core topics in Vision & Language research.
LXMLS 2024: slides.