1. Frederick Ellsworth. Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies. IJAIR [Internet]. 2026 May 14 [cited 2026 May 17];1(2). Available from: https://www.isipress.org/index.php/IJAIR/article/view/157