arXiv 2501.11223
Reasoning Language Models: A Blueprint
By Maciej Besta, Julia Barth, et al.
Published 2025-01-20
Citation lineage
Review the prior work and downstream research connected to this paper.
Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning mechanisms. Yet, their high costs, proprietary nature, and complex architectures - uniquely combining reinforcement learning (RL), search heuristics, and LLMs - present accessibility and sc…