ycliper

Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
Скачать

DLS: Peter Bartlett • Gradient Optimization Methods: The Benefits of a Large Step-size

Автор: Faculty of Mathematics, University of Waterloo

Загружено: 2026-01-30

Просмотров: 86

Описание: Deep learning, the technology underlying the recent progress in AI, has revealed some major surprises from the perspective of theory. Optimization in deep learning relies on simple gradient descent algorithms that are traditionally viewed as a time discretization of gradient flow. However, in practice, large step sizes — large enough to cause oscillation of the loss — exhibit performance advantages.

This talk will review recent results on gradient descent with logistic loss with a step size large enough that the optimization trajectory is at the “edge of stability.” We show the benefits of this initial oscillatory phase for linear functions and for multi-layer networks, and identify an asymptotic implicit bias that gradient descent imposes for a large family of deep networks.

Based on joint work with Yuhang Cai, Michael Lindsey, Song Mei, Matus Telgarsky, Jingfeng Wu, Bin Yu and Kangjie Zhou.

Bio: Peter Bartlett is Professor of Statistics and Computer Science at UC Berkeley and Principal Scientist at Google DeepMind. At Berkeley, he is the Machine Learning Research Director at the Simons Institute for the Theory of Computing, Director of the Foundations of Data Science Institute, and Director of the Collaboration on the Theoretical Foundations of Deep Learning, and he has served as Associate Director of the Simons Institute. He is President of the Association for Computational Learning, Honorary Professor of Mathematical Sciences at the Australian National University, and co-author with Martin Anthony of the book Neural Network Learning: Theoretical Foundations.

He was awarded the Malcolm McIntosh Prize for Physical Scientist of the Year in Australia, and has been an Institute of Mathematical Statistics Medallion Lecturer, an IMS Fellow and Australian Laureate Fellow, a Fellow of the ACM, a recipient of the UC Berkeley Chancellor’s Distinguished Service Award, and a Fellow of the Australian Academy of Science.

🩷💛 Engage with us online! 🖤🩷
Instagram: instagram.com/waterloomath
LinkedIn: www.linkedin.com/showcase/faculty-of-math
Facebook: facebook.com/waterloomath

--

As North America's only dedicated Faculty of Math, we are nationally and internationally recognized as one of the top schools for Mathematics and Computer Science.

With nearly $30 million in research funding (2019/20) and an alumni network of over 45,000 across more than 100 countries, our students, faculty, and graduates continue to push the boundaries of research to discover new ways to harness the power of mathematics, computer science, and statistics.

Visit our website at uwaterloo.ca/math

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...
DLS: Peter Bartlett • Gradient Optimization Methods: The Benefits of a Large Step-size

Поделиться в:

Доступные форматы для скачивания:

Скачать видео

  • Информация по загрузке:

Скачать аудио

Похожие видео

Моделирование Монте-Карло

Моделирование Монте-Карло

Why I Left Quantum Computing Research

Why I Left Quantum Computing Research

Визуализация скрытого пространства: PCA, t-SNE, UMAP | Глубокое обучение с анимацией

Визуализация скрытого пространства: PCA, t-SNE, UMAP | Глубокое обучение с анимацией

Terry Tao:

Terry Tao: "LLMs Are Simpler Than You Think – The Real Mystery Is Why They Work!"

We still don't understand magnetism

We still don't understand magnetism

В чем разница между матрицами и тензорами?

В чем разница между матрицами и тензорами?

Беззубчатые шестерни развивают гораздо больший крутящий момент, чем обычные, вот почему. Циклоида...

Беззубчатые шестерни развивают гораздо больший крутящий момент, чем обычные, вот почему. Циклоида...

Trump’s Name in Epstein Files “More Than ONE MILLION

Trump’s Name in Epstein Files “More Than ONE MILLION" Times & MAGA Explodes with Rage Over Bad Bunny

Теренс Тао о том, как Григорий Перельман решил гипотезу Пуанкаре | Лекс Фридман

Теренс Тао о том, как Григорий Перельман решил гипотезу Пуанкаре | Лекс Фридман

Может ли у ИИ появиться сознание? — Семихатов, Анохин

Может ли у ИИ появиться сознание? — Семихатов, Анохин

Trump in Epstein Files

Trump in Epstein Files "a Million Times" & Lutnick Admits Lunch with Epstein | The Daily Show

Математическая тревожность, нейросети, задачи тысячелетия / Андрей Коняев

Математическая тревожность, нейросети, задачи тысячелетия / Андрей Коняев

The Most Misunderstood Concept in Physics

The Most Misunderstood Concept in Physics

Лучший документальный фильм про создание ИИ

Лучший документальный фильм про создание ИИ

The Hairy Ball Theorem

The Hairy Ball Theorem

Feynman Explains Why Does the Universe Obey Math?

Feynman Explains Why Does the Universe Obey Math?

The problem with pretending quantum mechanics makes sense | Sean Carroll

The problem with pretending quantum mechanics makes sense | Sean Carroll

This New Pyramid Theory Explains the Missing Evidence

This New Pyramid Theory Explains the Missing Evidence

Управление поведением LLM без тонкой настройки

Управление поведением LLM без тонкой настройки

Trump Defends Racist Obama Meme & MAGA Rages Over Bad Bunny’s Spanish Halftime Show | The Daily Show

Trump Defends Racist Obama Meme & MAGA Rages Over Bad Bunny’s Spanish Halftime Show | The Daily Show

© 2025 ycliper. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]