(1) Nathan R. Lawrence; Kaihui Shao. Hierarchical Dual-System Reinforcement Learning for Long-Horizon Autonomous Planning With Large Language Models. aimls 2026, 1.