Agreement and Alignment for Human-AI Collaborative Decision Making

Автор: Simons Institute for the Theory of Computing

Загружено: 2026-01-13

Просмотров: 352

Описание: Aaron Roth (University of Pennsylvania)
https://simons.berkeley.edu/talks/aar...
Bridging Prediction and Intervention Problems in Social Systems

As AI models become increasingly powerful, it is an attractive proposition to use them in important decision making pipelines, in collaboration with human decision makers. But how should a human being and a machine learning model collaborate to reach decisions that are better than either of them could achieve on their own? If the human and the AI model were perfect Bayesians, operating in a setting with a commonly known and correctly specified prior, Aumann’s classical agreement theorem would give us one answer: they could engage in conversation about the task at hand, and their conversation would be guaranteed to converge to (accuracy-improving) agreement. This classical result however would require making many implausible assumptions, both about the knowledge and computational power of both parties. We show how to recover similar (and more general) results using only computationally and statistically tractable assumptions, which substantially relax full Bayesian rationality. In the second part of the talk, we go on to consider a more difficult problem: that the AI model might be acting at least in part to advance the interests of its designer, rather than the interests of its user, which might be in tension. We show how market competition between different AI providers can mitigate this problem assuming only a mild “market alignment” assumption — that the user’s utility function lies in the convex hull of the AI providers utility functions — even when no single provider is well aligned. In particular, we show that in all Nash equilibria of the AI providers under this market alignment condition, the user is able to advance her own goals as well as she could have in collaboration with a perfectly aligned AI model.

This talk describes the results of three papers, which are joint works with Natalie Collina, Ira Globus-Harris, Surbhi Goel, Varun Gupta, Emily Ryu, and Mirah Shi:

Tractable Agreement Protocols: https://arxiv.org/abs/2411.19791 (STOC 2025)

Collaborative Prediction: Tractable Information Aggregation via
Agreement: https://arxiv.org/abs/2504.06075 (SODA 2026)

Emergent Alignment from Competition: https://arxiv.org/abs/2509.15090

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Agreement and Alignment for Human-AI Collaborative Decision Making

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Research on sensitive data

Research on sensitive data

Scheduling with Time-Evolving Uncertainty for Content Review Prioritization in Social Media

Scheduling with Time-Evolving Uncertainty for Content Review Prioritization in Social Media

Alignment Problems in AI GovernanceLocation

Alignment Problems in AI GovernanceLocation

Triage Score: A Counterfactual Risk Assessment Instrument

Triage Score: A Counterfactual Risk Assessment Instrument

Zhijing Jin | Testing LLM Cooperation in Multi-Agent Simulation | CAIRF Retreat

Zhijing Jin | Testing LLM Cooperation in Multi-Agent Simulation | CAIRF Retreat

Veridical Data Science for Healthcare in the Age of AI

Veridical Data Science for Healthcare in the Age of AI

Computational Challenges and Opportunities in DNA Methylation Analysis

Computational Challenges and Opportunities in DNA Methylation Analysis

Knowledge Tracing (Dr. Ryan S. Baker)

Knowledge Tracing (Dr. Ryan S. Baker)

The genetic architecture of autism: from medicine to neurodiversity

The genetic architecture of autism: from medicine to neurodiversity

Enough is enough! We need to talk about how the future can be GOOD!

Enough is enough! We need to talk about how the future can be GOOD!

Distinguished Lecture: Epidemiology is Easy – Anyone Can Do It

Distinguished Lecture: Epidemiology is Easy – Anyone Can Do It

Opening Remarks

Opening Remarks

Foundation models in health AI

Foundation models in health AI

What and how in algorithmic fairness

What and how in algorithmic fairness

Short Talks

Manfred Morari (University of Pennsylvania):

Manfred Morari (University of Pennsylvania): "A Practitioner's Perspective"

Human Values in Clinical Algorithms, Old and New

Human Values in Clinical Algorithms, Old and New

On the interplay of accuracy and fairness in computational healthcare

On the interplay of accuracy and fairness in computational healthcare

Predicting the future: A lecture by Philip Tetlock

Predicting the future: A lecture by Philip Tetlock

Bower Award Ceremony 2021: Jürgen Schmidhuber lauds Kunihiko Fukushima

Bower Award Ceremony 2021: Jürgen Schmidhuber lauds Kunihiko Fukushima