1W-MINDS, Feb. 5: Jonas Latz (University of Manchester), Losing momentum in continuous-time...

Автор: Mark Iwen

Загружено: 2026-02-06

Просмотров: 81

Описание: Losing momentum in continuous-time stochastic optimization

The training of modern machine learning models often consists in solving high-dimensional non-convex optimisation problems that are subject to large-scale data. In this context, momentum-based stochastic optimisation algorithms have become particularly widespread. The stochasticity arises from data subsampling which reduces computational cost. Both, momentum and stochasticity help the algorithm to converge globally. In this work, we propose and analyse a continuous-time model for stochastic gradient descent with momentum. This model is a piecewise-deterministic Markov process that represents the optimiser by an underdamped dynamical system and the data subsampling through a stochastic switching. We investigate longtime limits, the subsampling-to-no-subsampling limit, and the momentum-to-no-momentum limit. We are particularly interested in the case of reducing the momentum over time. Under convexity assumptions, we show convergence of our dynamical system to the global minimiser when reducing momentum over time and letting the subsampling rate go to infinity. We then propose a stable, symplectic discretisation scheme to construct an algorithm from our continuous-time dynamical system. In experiments, we study our scheme in convex and non-convex test problems. Additionally, we train a convolutional neural network in an image classification problem. Our algorithm attains competitive results compared to stochastic gradient descent with momentum.

Joint work with Kexin Jin, Chenguang Liu, and Alessandro Scagliotti.

Associated paper: Jin et al. 2025: Journal of Machine Learning Research 26(148):1-55 (https://jmlr.org/papers/v26/23-1396.html [jmlr.org])

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

1W-MINDS, Feb. 5: Jonas Latz (University of Manchester), Losing momentum in continuous-time...

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

1W-MINDS, Jan 8: Stephen Becker (University of Colorado Boulder), Randomization methods for big-data

1W-MINDS, Jan 8: Stephen Becker (University of Colorado Boulder), Randomization methods for big-data

1W-MINDS, Jan. 29: Akram Aldroubi (Vanderbilt University), Dynamical sampling: source term recov...

1W-MINDS, Jan. 29: Akram Aldroubi (Vanderbilt University), Dynamical sampling: source term recov...

1W-MINDS, Jan. 22: Stephan Wojtowytsch (University of Pittsburgh), ‘Accelerated' Optimization in ML

1W-MINDS, Jan. 22: Stephan Wojtowytsch (University of Pittsburgh), ‘Accelerated' Optimization in ML

1W-MINDS, Feb. 12: Weilin Li (City University of New York), A nonconvex optimization approach to...

1W-MINDS, Feb. 12: Weilin Li (City University of New York), A nonconvex optimization approach to...

The Carbon at Risk Measure Can Unlock Financial Markets for Large-Scale Carbon Removal

The Carbon at Risk Measure Can Unlock Financial Markets for Large-Scale Carbon Removal

1W-MINDS, Nov 6: Bohan Chen (Caltech) Learning Enhanced Ensemble Filters

1W-MINDS, Nov 6: Bohan Chen (Caltech) Learning Enhanced Ensemble Filters

1W-MINDS, March 5: Anastasis Kratsios (McMaster University), Neural Networks as Universal Circuits

1W-MINDS, March 5: Anastasis Kratsios (McMaster University), Neural Networks as Universal Circuits

Discrete Probability Distributions

Discrete Probability Distributions

Как Иран стал ПРОБЛЕМОЙ

Как Иран стал ПРОБЛЕМОЙ

1W-MINDS, Jan 15: Nicholas Dexter (Florida State University), Recent progress on sparse approx...

1W-MINDS, Jan 15: Nicholas Dexter (Florida State University), Recent progress on sparse approx...

1W-MINDS, Oct. 23: Petar Nizić-Nikolac (ETH Zurich), Matrix Chaos Inequalities and Chaos of...

1W-MINDS, Oct. 23: Petar Nizić-Nikolac (ETH Zurich), Matrix Chaos Inequalities and Chaos of...

1W-MINDS, Oct. 16: Alex Cloninger (University of California, San Diego), From Local Views to...

1W-MINDS, Oct. 16: Alex Cloninger (University of California, San Diego), From Local Views to...

Музыка для работы за компьютером | Фоновая музыка для концентрации и продуктивности

Музыка для работы за компьютером | Фоновая музыка для концентрации и продуктивности

Массовый забой скота. Протестам в России быть? Зачем Трампу Иран. Максим Шевченко: Особое мнение

Массовый забой скота. Протестам в России быть? Зачем Трампу Иран. Максим Шевченко: Особое мнение

Лучший Гайд по Kafka для Начинающих За 1 Час

Лучший Гайд по Kafka для Начинающих За 1 Час

1W-MINDS, Feb. 26: Matthieu Dolbeault (Brown University), Constructive discretization and approx...

1W-MINDS, Feb. 26: Matthieu Dolbeault (Brown University), Constructive discretization and approx...

1W-MINDS, Oct. 30: Ethan Epperly (UC Berkeley)Column subset selection, active learning, and...

1W-MINDS, Oct. 30: Ethan Epperly (UC Berkeley)Column subset selection, active learning, and...

Лекция от легенды ИИ в Стэнфорде

Лекция от легенды ИИ в Стэнфорде

1W-MINDS, Dec. 4: Minxin Zhang (UCLA), Structure-Aware Adaptive Nonconvex Optimization for Deep...

1W-MINDS, Dec. 4: Minxin Zhang (UCLA), Structure-Aware Adaptive Nonconvex Optimization for Deep...

1W-MINDS, March 12: Ryan LaRose (Michigan State University), Towards experimentally realizable...

1W-MINDS, March 12: Ryan LaRose (Michigan State University), Towards experimentally realizable...