Audio-visual self-supervised baby learning

Video Link: https://www.youtube.com/watch?v=HKa17CupqhE



Duration: 48:06


Andrew Zisserman (Oxford University)
https://simons.berkeley.edu/talks/andrew-zisserman-oxford-university-2024-06-04
Understanding Lower-Level Intelligence from AI, Psychology, and Neuroscience Perspectives

Lesson 1 from the classic paper "The Development of Embodied Cognition: Six Lessons from Babies" is "Be Multimodal". This talk explores how recent work in the computer vision literature on audio-visual self-supervised learning addresses this challenge. The aim is to learn audio and visual representations and capabilities directly from the audio-visual data stream of a video, without any manual supervision of the data, much as an infant could learn from the correspondence and synchronization between what they see and hear. It is shown that a neural network that simply learns to synchronize the audio and visual streams is able to localize the faces that are speaking (active speaker detection) and the objects that sound.
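To make the synchronization idea concrete, below is a minimal sketch (in PyTorch) of the kind of two-stream training the abstract describes: a video encoder and an audio encoder are trained with a contrastive objective so that embeddings of frames and audio taken from the same synchronized clip agree, while mismatched pairs within the batch are pushed apart. The architectures, sizes, and toy data are illustrative assumptions, not the networks presented in the talk.

```python
# Illustrative sketch of audio-visual synchronization learning (assumptions only).
# Two small encoders map video frames and audio spectrograms into a shared
# embedding space; an InfoNCE-style loss treats the synchronized pair as the
# positive and all other pairings in the batch as negatives.

import torch
import torch.nn as nn
import torch.nn.functional as F

class VideoEncoder(nn.Module):
    """Embed a short stack of frames (B, 3, T, H, W) into a D-dim vector."""
    def __init__(self, dim=128):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv3d(3, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv3d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool3d(1),
        )
        self.fc = nn.Linear(64, dim)

    def forward(self, x):
        return F.normalize(self.fc(self.conv(x).flatten(1)), dim=-1)

class AudioEncoder(nn.Module):
    """Embed a log-mel spectrogram (B, 1, F, T) into the same D-dim space."""
    def __init__(self, dim=128):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),
        )
        self.fc = nn.Linear(64, dim)

    def forward(self, x):
        return F.normalize(self.fc(self.conv(x).flatten(1)), dim=-1)

def sync_contrastive_loss(v_emb, a_emb, temperature=0.07):
    """InfoNCE over the batch: the matching (synchronized) audio clip is the
    positive for each video clip; every other clip is a negative."""
    logits = v_emb @ a_emb.t() / temperature      # (B, B) similarity matrix
    targets = torch.arange(v_emb.size(0))         # diagonal = true pairs
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

if __name__ == "__main__":
    video_net, audio_net = VideoEncoder(), AudioEncoder()
    opt = torch.optim.Adam(list(video_net.parameters()) +
                           list(audio_net.parameters()), lr=1e-4)

    # Toy batch standing in for synchronized clips from unlabelled video:
    # 8 clips of 8 frames at 64x64, paired with 64-bin, 32-step spectrograms.
    frames = torch.randn(8, 3, 8, 64, 64)
    spectrograms = torch.randn(8, 1, 64, 32)

    loss = sync_contrastive_loss(video_net(frames), audio_net(spectrograms))
    loss.backward()
    opt.step()
    print(f"contrastive sync loss: {loss.item():.3f}")
```

Localization can then fall out of the same features: if the video encoder's global pooling is replaced by a spatial feature map, the per-location similarity with the audio embedding yields a heatmap over speaking faces or sounding objects.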

Tags:
Simons Institute
theoretical computer science
UC Berkeley
Computer Science
Theory of Computation
Theory of Computing
Andrew Zisserman
Understanding Lower-Level Intelligence from AI, Psychology, and Neuroscience Perspectives