arXiv 2501.11223

Reasoning Language Models: A Blueprint

By Maciej Besta, Julia Barth, et al.

Published 2025-01-20

Reasoning language models (RLMs), also known as Large Reasoning Models (LRMs), such as OpenAI's o1 and o3, DeepSeek-R1, and Alibaba's QwQ, have redefined AI's problem-solving capabilities by extending LLMs with advanced reasoning mechanisms. Yet, their high costs, proprietary nature, and complex architectures - uniquely combining reinforcement learning (RL), search heuristics, and LLMs - present accessibility and sc…
