[1]
Philip Kensington and Scott Whitfield, “Enhancing Logical Reasoning Depth via Monte Carlo Tree Search Integrated Reinforcement Learning for Advanced Large Language Model Thinking Processes”, IJAIR, vol. 1, no. 2, May 2026.