(1)
Quentin Larsen; TaoLi Tian. Risk-Aware Reinforcement Learning for Safe Strategic Reasoning in Large Language Model Agents. aimls 2026, 1.