ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models
Published in NeurIPS 2025, 2025
ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models.
To appear in the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.
Recommended citation: Weifei Jin, Yuxin Cao, Junjie Su, Minhui Xue, Jie Hao, Ke Xu, Jin Song Dong, and Derui Wang. 2025. "ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models." To appear in the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.
Download Paper