ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models

Published in NeurIPS 2025


To appear in the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.

Recommended citation: Weifei Jin, Yuxin Cao, Junjie Su, Minhui Xue, Jie Hao, Ke Xu, Jin Song Dong, and Derui Wang. 2025. "ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models." To appear in the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.