SudoBench: A Contextual Authorization Benchmark for LLM Agents

The International Conference on Machine Learning (ICML 2026 AIWILD Workshop), 2025-05-31