Red Teaming Large Language Models - Armin Buescher - NDC Security 2024

Автор: NDC Conferences

Загружено: 2024-03-18

Просмотров: 1929

Описание: This talk was recorded at NDC Security in Oslo, Norway. #ndcsecurity #ndcconferences #security #ai #developer #softwaredeveloper

Attend the next NDC conference near you:
https://ndcconferences.com
https://ndc-security.com/

Subscribe to our YouTube channel and learn every day:
/‪@NDC‬

Follow our Social Media!

  / ndcconferences
  / ndc_conferences
  / ndc_conferences

As machine learning models become increasingly integrated into our digital infrastructure, evaluating their vulnerabilities is essential for both security and ethical reasons. Large language models (LLMs) are no exception. While they represent a revolutionary leap in natural language tasks, LLMs pose unique security and ethical challenges, including the potential to generate misleading, harmful, or biased content as well as leak confidential data, denial of service, or even cause remote code execution.

This talk provides an in-depth look into red-teaming LLMs as an evaluation methodology to expose these vulnerabilities. By focusing on case studies and practical examples, we will differentiate between structured red team exercises and isolated adversarial attacks, such as model jailbreaks. Attendees will gain insights into the types of vulnerabilities that red teaming can reveal in LLMs, as well as potential strategies for mitigating these risks. The session aims to equip professionals with the knowledge to better evaluate the security and ethical dimensions of deploying Large Language Models in their organizations.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Red Teaming Large Language Models - Armin Buescher - NDC Security 2024

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Breaking Barriers: Empowering Women to Thrive in Cyber Security - Katie McMillan & Samia Durrani

Breaking Barriers: Empowering Women to Thrive in Cyber Security - Katie McMillan & Samia Durrani

HiddenLayer Webinar: A Guide to AI Red Teaming

HiddenLayer Webinar: A Guide to AI Red Teaming

The Past, Present, and Future of Cross-Site/Cross-Origin Request Forgery - Philippe de Ryck

The Past, Present, and Future of Cross-Site/Cross-Origin Request Forgery - Philippe de Ryck

Квантование против обрезки против дистилляции: оптимизация нейронных сетей для вывода

Квантование против обрезки против дистилляции: оптимизация нейронных сетей для вывода

How to Train LLMs to

How to Train LLMs to "Think" (o1 & DeepSeek-R1)

Post compromise: Uncovering Clouds and Assessing at Lightning Speed Karim El Melhaoui

Post compromise: Uncovering Clouds and Assessing at Lightning Speed Karim El Melhaoui

No Size Fits All: Empowering Engineers with Custom Application Security tests - Michal Kamensky

No Size Fits All: Empowering Engineers with Custom Application Security tests - Michal Kamensky

Роскомнадзор рубит Telegram, Россия вырезает скот, Куба во тьме. Мартынов, Дунцова, Ступин

Роскомнадзор рубит Telegram, Россия вырезает скот, Куба во тьме. Мартынов, Дунцова, Ступин

What is Retrieval-Augmented Generation (RAG)?

What is Retrieval-Augmented Generation (RAG)?

Design Patterns - The Most Common Misconceptions (1 of N) - Klaus Iglberger - NDC TechTown. 2023

Design Patterns - The Most Common Misconceptions (1 of N) - Klaus Iglberger - NDC TechTown. 2023

Secure development with C++ - Lessons and techniques - Helge Penne - NDC TechTown 2023

Secure development with C++ - Lessons and techniques - Helge Penne - NDC TechTown 2023

Why Large Language Models Hallucinate

Why Large Language Models Hallucinate

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI Systems

PyRIT: A Framework for Security Risk Identification and Red Teaming in Generative AI Systems

Илон Маск про орбитальные дата‑центры и будущее ИИ

Илон Маск про орбитальные дата‑центры и будущее ИИ

LLM Security Risks and Mitigation Strategies [Cloud Masters #117]

LLM Security Risks and Mitigation Strategies [Cloud Masters #117]

Testing to Red Teaming: What’s Wrong with My AI?

Testing to Red Teaming: What’s Wrong with My AI?

Linux user namespaces: a blessing and a curse - Ignat Korchagin - NDC TechTown 2024

Linux user namespaces: a blessing and a curse - Ignat Korchagin - NDC TechTown 2024

Unlocking The Secrets Of TLS - Scott Helme - NDC Security 2024

Unlocking The Secrets Of TLS - Scott Helme - NDC Security 2024

Gaspard Baye - Hacking GenAI with LLM Red Teaming and Beyond

Gaspard Baye - Hacking GenAI with LLM Red Teaming and Beyond

Quantum-Resilient AI: How Confidential Computing and OpenSSL Secure the Future by Paul Yang

Quantum-Resilient AI: How Confidential Computing and OpenSSL Secure the Future by Paul Yang