[1] Frederick Ellsworth, “Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies”, IJAIR, vol. 1, no. 2, May 2026.