ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

1W-MINDS, Dec. 4: Minxin Zhang (UCLA), Structure-Aware Adaptive Nonconvex Optimization for Deep...

Автор: Mark Iwen

Загружено: 2025-12-04

Просмотров: 108

Описание: Structure-Aware Adaptive Nonconvex Optimization for Deep Learning and Scientific Computing

Modern machine learning and scientific computing pose optimization challenges of unprecedented scale and complexity, demanding fundamental advances in both theory and algorithmic design for nonconvex optimization. This talk presents recent advances that address these challenges by exploiting matrix and tensor structures, integrating adaptivity, and leveraging sampling techniques. In the first part, I introduce AdaGO, a new optimizer that combines orthogonalized momentum updates with adaptive learning rates. Building on the recent success of the Muon optimizer in large language model training, AdaGO incorporates an AdaGrad-type stepsize that scales orthogonalized update directions by accumulated past gradient norms. This design preserves the structural advantage of orthogonalized updates while adapting stepsizes to noise and the optimization landscape. We establish optimal convergence rates for smooth nonconvex functions and demonstrate improved performance over Muon and Adam on classification and regression tasks. The second part focuses on zeroth-order global optimization. We develop a theoretical framework for inexact proximal point (IPP) methods for global optimization, establishing convergence guarantees when proximal operators are estimated either deterministically or stochastically. The quadratic regularization in the proximal operator induces a concentrated Gibbs measure landscape that facilitates effective sampling. We propose two sampling-based algorithms: TT-IPP, which constructs a low-rank tensor-train (TT) approximation using a randomized TT-cross algorithm, and MC-IPP, which employs Monte Carlo integration. Both IPP algorithms adaptively balance efficiency and accuracy in proximal operator estimation, achieving strong performance across diverse benchmark functions and applications. Together, these works advance structure-aware adaptive first-order optimization for deep learning and zeroth-order global optimization in scientific computing.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
1W-MINDS, Dec. 4:  Minxin Zhang (UCLA), Structure-Aware Adaptive Nonconvex Optimization for Deep...

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

1W-MINDS, Jan 8: Stephen Becker (University of Colorado Boulder), Randomization methods for big-data

1W-MINDS, Jan 8: Stephen Becker (University of Colorado Boulder), Randomization methods for big-data

1W-MINDS, Feb. 5:  Jonas Latz (University of Manchester), Losing momentum in continuous-time...

1W-MINDS, Feb. 5: Jonas Latz (University of Manchester), Losing momentum in continuous-time...

Terence Tao - Machine assistance and the future of research mathematics - IPAM at UCLA

Terence Tao - Machine assistance and the future of research mathematics - IPAM at UCLA

1W-MINDS, Nov 6:  Bohan Chen (Caltech) Learning Enhanced Ensemble Filters

1W-MINDS, Nov 6: Bohan Chen (Caltech) Learning Enhanced Ensemble Filters

Dario Lorenzoni - Primordial Black Holes from Inflation with a Spectator Field

Dario Lorenzoni - Primordial Black Holes from Inflation with a Spectator Field

1W-MINDS, Jan. 29:  Akram Aldroubi (Vanderbilt University), Dynamical sampling: source term recov...

1W-MINDS, Jan. 29: Akram Aldroubi (Vanderbilt University), Dynamical sampling: source term recov...

1W-MINDS, Oct. 9:  Anna Veselovska (Technical University of Munich),  Gradient Descent and...

1W-MINDS, Oct. 9: Anna Veselovska (Technical University of Munich), Gradient Descent and...

Массовый забой скота. Протестам в России быть? Зачем Трампу Иран. Максим Шевченко: Особое мнение

Массовый забой скота. Протестам в России быть? Зачем Трампу Иран. Максим Шевченко: Особое мнение

Сергей Дацюк. ИИ скоро осознает себя, но мы этого не заметим.

Сергей Дацюк. ИИ скоро осознает себя, но мы этого не заметим.

1W-MINDS, Jan. 22:  Stephan Wojtowytsch (University of Pittsburgh), ‘Accelerated' Optimization in ML

1W-MINDS, Jan. 22: Stephan Wojtowytsch (University of Pittsburgh), ‘Accelerated' Optimization in ML

КЛАССИЧЕСКАЯ МУЗЫКА ДЛЯ ВОССТАНОВЛЕНИЯ НЕРВНОЙ СИСТЕМЫ🌿 Нежная музыка успокаивает нервную систему 22

КЛАССИЧЕСКАЯ МУЗЫКА ДЛЯ ВОССТАНОВЛЕНИЯ НЕРВНОЙ СИСТЕМЫ🌿 Нежная музыка успокаивает нервную систему 22

1W-MINDS, Oct. 30:   Ethan Epperly (UC Berkeley)Column subset selection, active learning, and...

1W-MINDS, Oct. 30: Ethan Epperly (UC Berkeley)Column subset selection, active learning, and...

Chiara Tocchini: Three-qubit Quantum Refrigerator - theory and implementation

Chiara Tocchini: Three-qubit Quantum Refrigerator - theory and implementation

1W-MINDS, March 5:  Anastasis Kratsios (McMaster University), Neural Networks as Universal Circuits

1W-MINDS, March 5: Anastasis Kratsios (McMaster University), Neural Networks as Universal Circuits

Sebastien Bubeck - A Combinatorics Problem - IPAM at UCLA

Sebastien Bubeck - A Combinatorics Problem - IPAM at UCLA

1W-MINDS, Oct. 16:  Alex Cloninger (University of California, San Diego),  From Local Views to...

1W-MINDS, Oct. 16: Alex Cloninger (University of California, San Diego), From Local Views to...

134th Faculty Research Lecture

134th Faculty Research Lecture

Почему даже противники Путина критикуют этот фильм?

Почему даже противники Путина критикуют этот фильм?

Врач-гастроэнтеролог объясняет что происходит с организмом во время Рамадана

Врач-гастроэнтеролог объясняет что происходит с организмом во время Рамадана

1W-MINDS, Oct. 23:  Petar Nizić-Nikolac (ETH Zurich),  Matrix Chaos Inequalities and Chaos of...

1W-MINDS, Oct. 23: Petar Nizić-Nikolac (ETH Zurich), Matrix Chaos Inequalities and Chaos of...

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]