[1] Ethan Thornton, “Refining Reasoning Chains through Self Correcting Reinforcement Learning Architectures for Mitigating Logical Hallucinations in Large Language Models”, IJAIR, vol. 1, no. 2, May 2026.