Skip to yearly menu bar Skip to main content


Timezone: America/Denver
Filter Events
Invited Talk

The remarkable progress of modern computer vision has been propelled by the relentless logic of scaling laws: bigger models, more data, more compute, predictably better performance. On benchmarks like ImageNet, deep networks now match or even surpass human accuracy. Yet beneath these headline results, the alignment with human vision is fragile: on deceptively simple probes from the cognitive sciences, even the largest models drop to near-chance, and on ImageNet itself, models that reach human accuracy do so by strikingly different visual strategies — a divergence that, troublingly, widens with scale.

In this plenary, I will argue that the path to more natural artificial vision lies not in pushing scaling laws further, but in a deeper engagement with the neural laws of biological vision: developmental principles that shape how brains learn to see, and architectural constraints that impose strong inductive biases on cortical processing. I will share recent work from my lab on two such laws. On the learning side, I will present preliminary evidence that pairing the right learning objectives with naturalistic video — sequences of object transformations like those the developing brain encounters — can pull deep networks toward markedly more human-like visual strategies. On the architectural side, I will show how recent advances in state space models can scale cortical recurrent feedback into a brain-inspired alternative to transformer self-attention, one that closes the gap on cognitive probes where transformers fail, and on ImageNet traces more favorable scaling laws than transformers.

Together, these results point toward a future in which scaling laws and neural laws are in agreement rather than in tension, and in which computer vision, in dialogue with the brain sciences, helps build AI systems that are not only more capable but more aligned with the kind of intelligence we ultimately seek to understand and emulate.

... more
Speaker Bio
Thomas Serre
Dr. Serre is a Professor of Cognitive and Psychological Sciences as well as Computer Science. He received his Ph.D. in Neuroscience from MIT in 2006 and his M.Sc. in Electrical Engineering and Computer Science from Télécom Bretagne in France in 2000. His research focuses on understanding the neural computations that support visual perception, and it has been featured in various media outlets, including the BBC, The Economist, New Scientist, Scientific American, Technology Review, and Slashdot. Dr. Serre serves as the Faculty Director of the Center for Computation and Visualization and the Associate Director of the Center for Computational Brain Science. He is also an affiliate of the Carney Institute for Brain Science and the Data Science Institute at Brown University. Additionally, he holds an International Chair in Artificial Intelligence at the Artificial and Natural Intelligence Toulouse Institute in France. He has actively participated as an area chair and senior program committee member for prestigious machine learning and computer vision conferences, such as AAAI, CVPR, ICML, ICLR, and NeurIPS. Dr. Serre is a Neuroscience section editor for the journal PLOS Computational Biology. He has received several awards, including the NSF Early Career Award, DARPA’s Young Faculty Award, and the Director's Award. Along with his team, he was awarded the 2021 PAMI Helmholtz Prize and the 2022 PAMI Mark Everingham Prize for their work on human action recognition.
... more