Видео с ютуба Interpretability
Interpretability: Understanding how AI models think
Интерпретируемое и объяснимое машинное обучение
The Dark Matter of AI [Mechanistic Interpretability]
What is interpretability?
How Reasoning Models Break Mechanistic Interpretability Techniques
25. Interpretability
What Matters Right Now In Mechanistic Interpretability?
Eliezer Yudkowsky explains AI interpretability | Lex Fridman Podcast Clips
Mechanistic Interpretability explained | Chris Olah and Lex Fridman
Нил Нанда – Механистическая интерпретируемость: Вихревой тур
Open Problems in Mechanistic Interpretability: A Whirlwind Tour
The problem of model “interpretability” defined 🗃️& Golden Gate Claude 🌉 #machinelearning
Hacking LLMs: An Introduction to Mechanistic Interpretability — Jenny Vega
Atticus Geiger - State of Interpretability & Ideas for Scaling Up [Alignment Workshop]
Чтение мыслей ИИ: объяснение механистической интерпретируемости [антропные исследования]
Mechanistic Interpretability - NEEL NANDA (DeepMind)
Chris Olah - Looking Inside Neural Networks with Mechanistic Interpretability
Масштабируемость интерпретируемости
The Utility of Interpretability — Emmanuel Amiesen
What is mechanistic interpretability? Neel Nanda explains.