[1] Frederick Ellsworth 2026. Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies. International Journal of Artificial Intelligence Research. 1, 2 (May 2026). DOI:https://doi.org/10.66280/ijair.v1i2.157.