arXiv 2509.08827

A Survey of Reinforcement Learning for Large Reasoning Models

By Kaiyan Zhang, Yuxin Zuo, et al.

Published 2025-09-10

In this paper, we survey recent advances in Reinforcement Learning (RL) for reasoning with Large Language Models (LLMs). RL has achieved remarkable success in advancing the frontier of LLM capabilities, particularly in addressing complex logical tasks such as mathematics and coding. As a result, RL has emerged as a foundational methodology for transforming LLMs into Large Reasoning Models (LRMs). With the rapid progress of the field, furthe…
