Masked Autoencoder for Self-Supervised Pre-training on Lidar Point Clouds [WACV 2023 PLV workshop]

Author: Georg Hess

Uploaded: 2022-12-09

Views: 561

Description: Code: https://github.com/georghess/voxel-mae
Project page: https://georghess.github.io/projects/...

Masked autoencoding has become a successful pre-training paradigm for Transformer models for text, images, and, recently, point clouds. Raw automotive datasets are suitable candidates for self-supervised pre-training, as they are generally cheap to collect compared to annotations for tasks like 3D object detection (OD). However, the development of masked autoencoders for point clouds has focused solely on synthetic and indoor data. Consequently, existing methods have tailored their representations and models toward small and dense point clouds with homogeneous point density. In this work, we study masked autoencoding for point clouds in an automotive setting, where the point clouds are sparse and the point density can vary drastically among objects in the same scene. To this end, we propose Voxel-MAE, a simple masked autoencoding pre-training scheme designed for voxel representations. We pre-train the backbone of a Transformer-based 3D object detector to reconstruct masked voxels and to distinguish between empty and non-empty voxels. Our method improves the 3D OD performance by 1.75 mAP points and 1.05 NDS on the challenging nuScenes dataset. Further, we show that by pre-training with Voxel-MAE, we require only 40% of the annotated data to outperform a randomly initialized equivalent. Code available at https://github.com/georghess/voxel-mae.
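
To make the pre-training setup in the description more concrete, here is a minimal NumPy sketch of the data preparation it implies: voxelize a point cloud, hide a random subset of the occupied voxels, and keep an occupancy grid as the empty vs. non-empty target. This is not the code from the voxel-mae repository. The function names, the 0.5 voxel size, and the 0.7 mask ratio are all assumptions chosen for illustration, and the Transformer encoder and decoder themselves are left out.

```python
import numpy as np


def voxelize(points, voxel_size):
    """Return the unique occupied voxel indices and, per point, the row of its voxel."""
    coords = np.floor(points / voxel_size).astype(np.int64)
    occupied, inverse = np.unique(coords, axis=0, return_inverse=True)
    # reshape(-1) keeps the inverse one-dimensional across NumPy versions
    return occupied, inverse.reshape(-1)


def mask_voxels(num_voxels, mask_ratio, rng):
    """Pick a random subset of occupied voxels to hide (True = masked)."""
    num_masked = int(round(mask_ratio * num_voxels))
    masked = np.zeros(num_voxels, dtype=bool)
    masked[rng.permutation(num_voxels)[:num_masked]] = True
    return masked


def build_pretraining_sample(points, voxel_size=0.5, mask_ratio=0.7, seed=0):
    """Hypothetical helper: voxel_size and mask_ratio are illustrative values only."""
    rng = np.random.default_rng(seed)
    occupied, point_to_voxel = voxelize(points, voxel_size)
    masked = mask_voxels(len(occupied), mask_ratio, rng)

    # Dense occupancy grid over the bounding box of the occupied voxels.
    # True = non-empty voxel, False = empty voxel.
    origin = occupied.min(axis=0)
    extent = occupied.max(axis=0) - origin + 1
    occupancy = np.zeros(tuple(int(e) for e in extent), dtype=bool)
    idx = occupied - origin
    occupancy[idx[:, 0], idx[:, 1], idx[:, 2]] = True

    return {
        "visible_voxels": occupied[~masked],  # what an encoder would see
        "masked_voxels": occupied[masked],    # what a decoder would reconstruct
        "point_to_voxel": point_to_voxel,     # maps each point to its voxel row
        "occupancy": occupancy,               # empty vs. non-empty target
    }


if __name__ == "__main__":
    demo_points = np.random.default_rng(1).uniform(-10.0, 10.0, size=(1000, 3))
    sample = build_pretraining_sample(demo_points)
    print(len(sample["visible_voxels"]), "visible voxels,",
          len(sample["masked_voxels"]), "masked voxels")
```

In a full pipeline, the points inside each masked voxel would serve as reconstruction targets, and the occupancy grid would supervise the empty vs. non-empty prediction. A dense grid is used here only for readability; a sparse representation would be the more natural fit for large lidar scenes.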
