Hierarchical World-Model Reinforcement Learning for Long-Horizon Reasoning in Large Language Model Agents