AI-Assisted Peer Review at Scale The AAAI-26 AI Review Pilot with Joydeep Biswas

Автор: NSF-Simons AI Institute for Cosmic Origins

Загружено: 2026-02-20

Просмотров: 19

Описание: AI-Assisted Peer Review at Scale: The AAAI-26 AI Review Pilot

Presenter: Joydeep Biswas, Associate Professor, Department of Computer Science, The University of Texas at Austin, and Associate Director of Texas Robotics.

Abstract: Frontier multimodal language models are rapidly reshaping how we conduct and evaluate science. This talk presents the AAAI-26 AI review pilot, which explored a specific role for AI in the scientific process: peer review. In response to the explosive growth of AI publishing (over 30,000 initial submissions for AAAI-26) and the increasing technical capabilities of state-of-the-art language models, AAAI-26 ran a pilot in which every paper received one clearly labeled AI-generated review. No human reviewers were replaced, and final decisions remained entirely under human control.

I will describe what we built: a thorough multi-stage AI reviewing system that integrates multiple tools and techniques, with explicit criteria at each step, along with the infrastructure required to generate AI reviews for the full submission set in under 24 hours.

We also conducted an extensive voluntary survey of authors, reviewers, senior program committee members, and area chairs to assess and compare them with human reviews. Overall, respondents found the AI reviews helpful, and on average, they were preferred to human reviews across 6 of 9 criteria, including overall impressions, review focus, technical accuracy, and research suggestions. We also learned about the current limitations of AI in peer review.

I will close with lessons learned, opportunities for effective human-AI teaming in peer review, and open challenges in building and evaluating AI assistance for scientific reviewing.

Bio: Joydeep Biswas is an associate professor in the Department of Computer Science at the University of Texas at Austin and Associate Director of Texas Robotics. He leads the Autonomous Mobile Robotics Laboratory (AMRL), where he directs research focused on perception and planning for long-term autonomy in open-world settings. He is a recipient of the NSF CAREER award, the Amazon Research Award, and the JP Morgan Faculty Research Award, and serves as a Trustee of the RoboCup Federation and a Councilor of AAAI. He was an Associate Program Chair for AAAI-26 and led its AI-assisted peer review pilot.

Summary
Astronomical Data Processing Challenges
Joydeep talked about creating an article on a published work and the challenges of processing large astronomical datasets. Joydeep explained the role of CosmicAI members NRA and War Lab in handling data from sources like ALMA and DESI, and suggested contacting Stephanie Juneau for more information on using LLMs in this context. The conversation touched on the difficulties of incremental data processing and the need for efficient methods to handle large datasets in astronomy.
AI Peer Review Pilot Update
Joydeep introduced the third talk in CosmicAI's Spring 2026 Hybrid Seminar Series and discussed the AI-assisted peer review pilot at AAAI 26. He acknowledged the efforts of the team, including Chad, Matt, Kiri, and Jessy, in making this project possible. Joydeep also shared his interest in astrophotography and briefly mentioned the invited speaker Daniel Byteson for the Cosmic AI talk.
AI Peer Review Study Findings
Joydeep presented a study on using AI for scientific peer review, highlighting two main discoveries: a new planetary nebula candidate and challenges in AI-assisted peer review. He discussed the technical details of their AI review system, including preprocessing steps, model capabilities, and quality control measures. The study found that AI reviews were preferred to human reviews on 6 out of 9 criteria, though there were concerns about the environmental impact of running such models. Joydeep concluded that while AI reviews showed promise, they should be used as a supplement to human reviews rather than a replacement.
AI Review Capabilities and Development
Joydeep presented findings on AI review capabilities, noting that AI reviews were preferred by authors for 6 out of 9 criteria with lower variance compared to human reviews, though they were limited in understanding novelty and ranking severity. He emphasized that there is no single optimal review format, as different stakeholders have different needs.

Не удается загрузить Youtube-плеер. Проверьте блокировку Youtube в вашей сети.
Повторяем попытку...

AI-Assisted Peer Review at Scale The AAAI-26 AI Review Pilot with Joydeep Biswas

Доступные форматы для скачивания:

Скачать видео

Информация по загрузке:

Скачать аудио

Похожие видео

Finding Exotic Transients in the Era of Big Data with Sebastian Gomez (UT Austin)

Finding Exotic Transients in the Era of Big Data with Sebastian Gomez (UT Austin)

SpaceX's Flight 12 Starship Has A Secret They Didn't Tell You About!

SpaceX's Flight 12 Starship Has A Secret They Didn't Tell You About!

2026 Canada Tax Filing Essentials: Documents, Deadlines & New Rules

2026 Canada Tax Filing Essentials: Documents, Deadlines & New Rules

Accelerating (Astro)chemical discovery with machine learned atomistic models

Accelerating (Astro)chemical discovery with machine learned atomistic models

Parking Lots to Parks | A Documentary About Adaptive Reuse

Parking Lots to Parks | A Documentary About Adaptive Reuse

Rymanowski, Rakowski: Ryczący Lew Izraela

Rymanowski, Rakowski: Ryczący Lew Izraela

Alcalde Docs | Songs in the James Turrell Skyspace

Alcalde Docs | Songs in the James Turrell Skyspace

RI Seminar: Yuke Zhu : Toward Generalist Humanoid Robots

RI Seminar: Yuke Zhu : Toward Generalist Humanoid Robots

A Building Shaped by Light Austin Central Library, 2019 (Third Place)

A Building Shaped by Light Austin Central Library, 2019 (Third Place)

Trade-offs in LLM-Supported Research Ideation with Houjiang Liu (PhD student)

Trade-offs in LLM-Supported Research Ideation with Houjiang Liu (PhD student)

Alejo José Sison Galsim,

Alejo José Sison Galsim, "Designing a Human-Centered, Virtue-Focused Approach to AI in Business”

MD: How to Make Healthy Living Really Simple | Dr. Vimal George | TEDxNorthwesternU

MD: How to Make Healthy Living Really Simple | Dr. Vimal George | TEDxNorthwesternU

Improving Bass Concert Hall for You

Improving Bass Concert Hall for You

🔴 EXPRESS BIEDRZYCKIEJ | ANDRZEJ BYRT, PROF. RADOSŁAW MARKOWSKI [NA ŻYWO]

🔴 EXPRESS BIEDRZYCKIEJ | ANDRZEJ BYRT, PROF. RADOSŁAW MARKOWSKI [NA ŻYWO]

BAP Nu Sigma Chapter Spring Semester 2026 Meeting III - Forensic Accounting

BAP Nu Sigma Chapter Spring Semester 2026 Meeting III - Forensic Accounting

A Professor’s Secret Weapon | Michael Pyrcz | TEDxUTAustin

A Professor’s Secret Weapon | Michael Pyrcz | TEDxUTAustin

Foundation Models for Survey Astronomy with Francois Lanusse (CNRS)

Foundation Models for Survey Astronomy with Francois Lanusse (CNRS)

How to track calories in an economy-wide framework: from farm to table

How to track calories in an economy-wide framework: from farm to table

Christy Ten Eyck: Harsh Beauty

Christy Ten Eyck: Harsh Beauty

Olmo 3: State-of-the-art in fully open models with Kyle Lo, Lead Research Scientist, (AI2)

Olmo 3: State-of-the-art in fully open models with Kyle Lo, Lead Research Scientist, (AI2)