
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Author: Neel Nanda

Uploaded: 2023-04-10

Views: 7298

Description: Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic Interpretability. I'm joined by my co-author Lawrence Chan. In this part, we give an overview of the paper and discuss the key takeaways.

Part 2: A Walkthrough of Progress Measures for Gro...
Part 3: A Walkthrough of Progress Measures for Gro...

If you want to learn more about mechanistic interpretability, check out https://neelnanda.io/getting-started

Our paper: https://arxiv.org/abs/2301.05217
Original grokking paper: https://arxiv.org/abs/2201.02177
AdamW: https://pytorch.org/docs/stable/gener...
Walkthrough of toy models of superposition: A Walkthrough of Toy Models of Superpositi...
Danny Hernandez paper on scaling laws for repeated data: https://arxiv.org/abs/2205.10487
Jermyn & Schlegeris on S-Shaped Curves: https://www.alignmentforum.org/posts/...
Unifying Grokking and Double Descent: https://arxiv.org/abs/2303.06173
Omnigrok: https://arxiv.org/abs/2210.01117

0:00 - Intro
0:50 - What is grokking?
9:53 - Mechanistic interpretability
11:47 - Paper overview, modular addition algorithm
15:08 - Progress measures
21:41 - Why this work is bullshit
29:30 - Predicting when it will grok?
33:45 - Why does grokking happen?
40:27 - Lottery ticket hypothesis
42:43 - Conclusion
