Founding offer · lifetime membership for a single £24, exclusive to our first members · closes 20 June Claim your place →
Global Research Partnerships £24 Lifetime Log inCreate free account

Funded Projects › HORIZON

SOTA · Scalar-Order Theories of Deep Kernel Adaptation

HORIZONStatus: SIGNED1 April 202731 March 2029EU funding €193,643Call HORIZON-MSCA-2025-PF

The AI revolution is under way, yet we still lack a thermodynamic understanding of deep learning, that is explaining what a network can learn in the limit of large dataset and large network width.The central proposal of SOTA is that, the output of a deep network on real data can be predicted by an effective kernel - a similarity measure between data points - that shows simple low-dimensional adaptation to the dataset. This reconnects two regimes often seen as completely different: “lazy” learning (where a fixed kernel predictor explains outputs) and “rich” learning (where network features are plastic and little understood).Indeed, SOTA argues that rich learning in fully-connected networks effectively reduces predictor variance, while almost no adaptation of the mean predictor is predicted and observed. Convolutional networks instead show local adaptations of the kernel that also change the mean predictor.This is where the number of data samples is proportional to width, the classical limit in statistical physics of learning. Due to the abundance (or augmentation) of data in recent deep learning practice, also the quadratic sample-width limit will need to be investigated.Based on these consequential observations, a three-pronged investigation is proposed: - Comparison of predictions to large-scale Monte-Carlo experiments on real data. - Mechanistic explanation through a proposed mechanism of auto-ensembling. - Analysis of corrections to the effective action that are sub-leading in the proportional limit, but become relevant when data is more abundant.SOTA disrupts widely held assumptions about rich learning, and makes major progress on a long-standing challenge: Explaining the performance of deep, nonlinear networks in the feature learning regime, on real data, through a simple, effective theory.

Consortium · 1 organisation

coordinator

UNIVERSITA DEGLI STUDI DI PARMA

IT · €193,643

Research fields

View the official record on CORDIS →

← Find collaborators and more funded projects

Source: CORDIS, Publications Office of the European Union. Global Research Partnerships surfaces open EU research data to help you find collaborators; we are not affiliated with the European Union.