ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models
Weifei Jin, Yuxin Cao, Junjie Su, Minhui Xue, Jie Hao, Ke Xu, Jin Song Dong, and Derui Wang. 2025. "ALMGuard: Safety Shortcuts and Where to Find Them as Guardrails for Audio–Language Models." To appear in the Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), 2025.