1. Frederick Ellsworth. Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies. IJAIR [Internet]. 2026 May 19 [cited 2026 May 28];1(2). Available from: https://isipress.org/index.php/IJAIR/article/view/157