(1)
Frederick Ellsworth. Improving Exploration Efficiency in Complex Reasoning Tasks via Guided Reinforcement Learning and Large Language Model Heuristic Search Strategies. IJAIR 2026, 1.