Frederick Ellsworth. “Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies”. International Journal of Artificial Intelligence Research, vol. 1, no. 2, May 2026, doi:10.66280/ijair.v1i2.157.