Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Автор: USC Information Sciences Institute

Загружено: 2025-12-02

Просмотров: 34

Описание: Date Presented: 11/4/2025
Speaker: Amita Kamath, USC

Visit links below to subscribe and for details on upcoming seminars:
https://www.isi.edu/isi-seminar-series
https://www.isi.edu/events

Abstract:
The lack of reasoning capabilities in Vision-Language Models (VLMs) has remained at the forefront of research discourse. We posit that this behavior stems from a reporting bias in their training data. That is, how people communicate about visual content by default omits tacit information needed to supervise some types of reasoning. We investigate the data underlying popular VLMs through the lens of theories from pragmatics, and find that reporting bias results in insufficient representation of various reasoning skills, despite the corpora being of web-scale, and/or synthetically generated. My talk will cover the impacts of reporting bias on vision-language model reasoning at various model+data scales, as well as a potential path forward: specifically, focusing on more intentional training data curation methods, rather than counting on scale for emergence of reasoning capabilities.

Speaker's Bio:
Amita Kamath is a PhD candidate in Computer Science co-advised by Kai-Wei Chang at UCLA and Ranjay Krishna at the University of Washington. She frequently collaborates with the Allen Institute for AI (AI2). Previously, Amita was a Pre-doctoral Young Investigator (PYI) at AI2, where she worked with the PRIOR team on general purpose vision-language systems. Before that, she completed her MS in Computer Science at Stanford University, where she worked with Percy Liang's group on distribution shift in NLP. She has published papers at top NLP and CV conferences such as EMNLP and CVPR.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Collapse of Dense Retrievers & Steering MoE LLMs

Collapse of Dense Retrievers & Steering MoE LLMs

Terry Tao - Machine assistance and the future of research mathematics - IPAM at UCLA

Terry Tao - Machine assistance and the future of research mathematics - IPAM at UCLA

Terence Tao | UCLA Connects: Bruin Talks

Terence Tao | UCLA Connects: Bruin Talks

Improving Human-AI Collaboration by Adapting to User Trust

Improving Human-AI Collaboration by Adapting to User Trust

Auditing Language Models for Hidden Objectives with Sam Marks

Auditing Language Models for Hidden Objectives with Sam Marks

Q2B25 Кремниевая долина | Джон Прескилл, профессор Калифорнийского технологического института

Q2B25 Кремниевая долина | Джон Прескилл, профессор Калифорнийского технологического института

Benchmarking MLLMs for Embodied Decision Making and Cognitive World Modeling

Benchmarking MLLMs for Embodied Decision Making and Cognitive World Modeling

Фейнман в Калифорнийском технологическом институте — Джон Прескилл и Кип Торн — 11 мая 2018 г.

Фейнман в Калифорнийском технологическом институте — Джон Прескилл и Кип Торн — 11 мая 2018 г.

#706 Opcje Trumpa ataku na Iran. Iran-strategia przetrwania.Rosyjska broń. Rebelia karteli w Meksyku

#706 Opcje Trumpa ataku na Iran. Iran-strategia przetrwania.Rosyjska broń. Rebelia karteli w Meksyku

On Minds: Reflections of a Cognitive Architect

On Minds: Reflections of a Cognitive Architect

The tariff toll: How tariff uncertainty could impact businesses

The tariff toll: How tariff uncertainty could impact businesses

"You can get lucky in research, but you can't get lucky if you are not prepared"

Będziemy płacić za emerytury Ukraińców!

Będziemy płacić za emerytury Ukraińców!

An Effective Theory of Bias Amplification

An Effective Theory of Bias Amplification

Large Language Models in the Age of Misinformation

Large Language Models in the Age of Misinformation

Ideas for Tomorrow | Gentry Lee, Chief Engineer for the Solar System Exploration Directorate at JPL

Ideas for Tomorrow | Gentry Lee, Chief Engineer for the Solar System Exploration Directorate at JPL

CELESTIAL CELEBRATIONS: 90 YEARS OF GRIFFITH OBSERVATORY AND THE STORIES IN THE SKY

CELESTIAL CELEBRATIONS: 90 YEARS OF GRIFFITH OBSERVATORY AND THE STORIES IN THE SKY

Terry Tao, Ph.D. Small and Large Gaps Between the Primes

Terry Tao, Ph.D. Small and Large Gaps Between the Primes

Vintage NASA: Voyager’s 1990 ‘Family Portrait’ News Conference

Vintage NASA: Voyager’s 1990 ‘Family Portrait’ News Conference

OpenAI Could be Bankrupt by 2027

OpenAI Could be Bankrupt by 2027