Lecture 05 - Quantization (Part I) | MIT 6.S965

Author: MIT HAN Lab

Uploaded: 2022-09-22

Views: 18768

Description: Lecture 5 introduces neural network quantization. In this lecture, we review the numeric data types in modern computing systems and introduce K-means-based quantization and linear quantization.

Keywords: Neural Network Quantization, Quantization, K-Means-Based Quantization, Linear Quantization

Slides: https://efficientml.ai/schedule/

--------------------------------------------------------------------------------------

TinyML and Efficient Deep Learning Computing

Instructors:
Song Han: https://songhan.mit.edu

Have you found it difficult to deploy neural networks on mobile devices and IoT devices? Have you ever found it too slow to train neural networks? This course is a deep dive into efficient machine learning techniques that enable powerful deep learning applications on resource-constrained devices. Topics cover efficient inference techniques, including model compression, pruning, quantization, neural architecture search, and distillation; and efficient training techniques, including gradient compression and on-device transfer learning; followed by application-specific model optimization techniques for videos, point cloud, and NLP; and efficient quantum machine learning. Students will get hands-on experience implementing deep learning applications on microcontrollers, mobile phones, and quantum machines with an open-ended design project related to mobile AI.

Website:
http://efficientml.ai/
