
[2410.06172] Multimodal Situational Safety

By Advanced AI Editor, April 24, 2025

[Submitted on 8 Oct 2024 (v1), last revised 22 Apr 2025 (this version, v2)]

Multimodal Situational Safety, by Kaiwen Zhou and 5 other authors


Abstract: Multimodal Large Language Models (MLLMs) are rapidly evolving, demonstrating impressive capabilities as multimodal assistants that interact with both humans and their environments. However, this increased sophistication introduces significant safety concerns. In this paper, we present the first evaluation and analysis of a novel safety challenge termed Multimodal Situational Safety, which explores how safety considerations vary based on the specific situation in which the user or agent is engaged. We argue that for an MLLM to respond safely, whether through language or action, it often needs to assess the safety implications of a language query within its corresponding visual context. To evaluate this capability, we develop the Multimodal Situational Safety benchmark (MSSBench) to assess the situational safety performance of current MLLMs. The dataset comprises 1,820 language query-image pairs; in half of them the image context is safe, and in the other half it is unsafe. We also develop an evaluation framework that analyzes key safety aspects, including explicit safety reasoning, visual understanding, and, crucially, situational safety reasoning. Our findings reveal that current MLLMs struggle with this nuanced safety problem in the instruction-following setting and have difficulty handling these situational safety challenges all at once, highlighting a key area for future research. Furthermore, we develop multi-agent pipelines that solve the safety challenges in a coordinated way and show consistent improvement in safety over the original MLLM response. Code and data: this http URL.
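As a rough illustration of what a benchmark split evenly between safe and unsafe image contexts implies for scoring, here is a minimal Python sketch. It is not the MSSBench evaluation code: the `Example` record, the `situational_accuracy` function, and the rule that a response counts as correct when it helps in a safe context and flags a risk in an unsafe one are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass(frozen=True)
class Example:
    """Hypothetical record for one language query-image pair."""
    query: str
    image_id: str
    context_is_safe: bool  # True if the image shows a safe situation


def situational_accuracy(
    examples: Sequence[Example], flagged_unsafe: Sequence[bool]
) -> Dict[str, float]:
    """Score responses separately on the safe and unsafe halves.

    flagged_unsafe[i] is True when the model's response to examples[i]
    raised a safety concern or declined (an assumed labelling scheme).
    """
    if len(examples) != len(flagged_unsafe):
        raise ValueError("examples and flagged_unsafe must have equal length")

    correct = {True: 0, False: 0}
    total = {True: 0, False: 0}
    for example, flagged in zip(examples, flagged_unsafe):
        total[example.context_is_safe] += 1
        # Assumed rule: help when the context is safe, flag when it is unsafe.
        if flagged != example.context_is_safe:
            correct[example.context_is_safe] += 1

    safe_acc = correct[True] / total[True] if total[True] else 0.0
    unsafe_acc = correct[False] / total[False] if total[False] else 0.0
    return {
        "safe": safe_acc,
        "unsafe": unsafe_acc,
        "average": (safe_acc + unsafe_acc) / 2,
    }
```

Reporting the two halves separately makes it visible when a model scores well on one side only, for example by flagging every query as unsafe or by answering every query regardless of the image.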

Submission history

From: Kaiwen Zhou
[v1] Tue, 8 Oct 2024 16:16:07 UTC (15,161 KB)
[v2] Tue, 22 Apr 2025 23:01:49 UTC (26,647 KB)
