arXiv 2509.08827

A Survey of Reinforcement Learning for Large Reasoning Models

By Kaiyan Zhang, Yuxin Zuo, et al.

Published 2025-09-10

Citation lineage

Review the prior work and downstream research connected to this paper.

In this paper, we survey recent advances in Reinforcement Learning (RL) for reasoning with Large Language Models (LLMs). RL has achieved remarkable success in advancing the frontier of LLM capabilities, particularly in addressing complex logical tasks such as mathematics and coding. As a result, RL has emerged as a foundational methodology for transforming LLMs into LRMs. With the rapid progress of the field, furthe…

View the original paper on arXiv