New algorithm discovers language just by watching videos. DenseAV, developed at MIT, learns to parse and understand the meaning of language just by watching videos of people talking, with potential applications in multimedia search, language learning, and robotics.

https://news.mit.edu/2024/denseav-algorithm-discovers-language-just-watching-videos-0611