2020 LLVM Developers’ Meeting: N. Hähnle “Evolving convergent: Lessons from Control Flow in AMDGPU”

Author: LLVM

Uploaded: 2020-10-26

Views: 1541

Description: https://llvm.org/devmtg/2020-09/
—
Evolving "convergent": Lessons from Control Flow in AMDGPU - Nicolai Hähnle

Slides: https://llvm.org/devmtg/2020-09/slide...
—
GPUs execute many threads of a program in lock-step by mapping them to lanes of a SIMD vector that we call a “wave”. Modern GPU programming languages have cross-lane operations such as shuffles, ballots, and barriers that exchange data between the lanes of a wave. When such operations execute in divergent control flow (lanes of a wave following different paths through the CFG), only a subset of lanes participates in this data exchange. A key part of defining the semantics of cross-lane operations is defining how this subset is determined.
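To make the participation rule concrete, here is a minimal Python sketch that simulates a ballot over a toy wave. The wave size of 8, the `ballot` helper, and the explicit `active` mask are illustrative assumptions, not an API from LLVM or any GPU runtime; real hardware tracks the set of active lanes implicitly.

```python
from typing import List

WAVE_SIZE = 8  # assumption for readability; real waves are typically wider


def ballot(predicate: List[bool], active: List[bool]) -> int:
    """Return a bitmask with bit i set when lane i is active and its predicate holds."""
    mask = 0
    for lane in range(WAVE_SIZE):
        if active[lane] and predicate[lane]:
            mask |= 1 << lane
    return mask


lane_ids = list(range(WAVE_SIZE))
is_even = [lane % 2 == 0 for lane in lane_ids]

# Uniform control flow: every lane of the wave reaches the ballot.
all_active = [True] * WAVE_SIZE
uniform_result = ballot(is_even, all_active)

# Divergent control flow: only lanes 0..3 took the branch containing the ballot,
# so only those lanes participate in the data exchange.
branch_taken = [lane < 4 for lane in lane_ids]
divergent_result = ballot(is_even, branch_taken)

print(f"uniform:   {uniform_result:08b}")
print(f"divergent: {divergent_result:08b}")
```

The same predicate yields different masks because the set of participating lanes differs, which is why the rule that determines that set is part of the operation's semantics.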

In LLVM, the only tool available today to help in this definition is the `convergent` attribute. We argue that its definition is subtly broken and insufficient for expressing and preserving the desired behavior of cross-lane operations. We propose a new definition of `convergent` as well as the concept of “convergence tokens” and related intrinsics that allow frontends to describe the desired semantics of cross-lane operations in IR in a way that is easy for generic transforms to maintain. We also briefly touch on how these intrinsics are used by a new “wave transform” (whole-program vectorization that lowers from thread-level CFG to wave-level CFG) in the AMDGPU backend.
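As a rough illustration of why a generic transform can break a cross-lane operation, the Python sketch below compares a ballot executed before a divergent branch with the same ballot after it has been sunk into the branch. This is a toy simulation under stated assumptions: the 4-lane wave and the functions `ballot`, `before_sinking`, and `after_sinking` are hypothetical names for this example and do not model LLVM IR, the proposed convergence tokens, or any actual pass.

```python
from typing import List, Optional

WAVE_SIZE = 4  # assumption: a tiny wave keeps the example readable


def ballot(predicate: List[bool], active: List[bool]) -> int:
    """Bitmask of active lanes whose predicate holds."""
    mask = 0
    for lane in range(WAVE_SIZE):
        if active[lane] and predicate[lane]:
            mask |= 1 << lane
    return mask


def before_sinking(cond: List[bool], pred: List[bool]) -> List[Optional[int]]:
    # The ballot runs before the branch, so all lanes participate.
    b = ballot(pred, [True] * WAVE_SIZE)
    # Only lanes that take the branch go on to use the result.
    return [b if cond[lane] else None for lane in range(WAVE_SIZE)]


def after_sinking(cond: List[bool], pred: List[bool]) -> List[Optional[int]]:
    # The ballot was moved into the branch, so only lanes with cond participate.
    b = ballot(pred, cond)
    return [b if cond[lane] else None for lane in range(WAVE_SIZE)]


cond = [True, False, True, False]   # divergent branch condition per lane
pred = [True, True, False, True]    # value each lane contributes to the ballot

original = before_sinking(cond, pred)
transformed = after_sinking(cond, pred)

print("original:   ", original)
print("transformed:", transformed)
```

For single-threaded code, sinking a computation into the only branch that uses it is harmless. Here it changes the observed value, which is the kind of change an attribute such as `convergent` is meant to rule out.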
—

