arXiv 2402.03300

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

By Zhihong Shao, Peiyi Wang, et al.

Published 2024-02-05

Wiki summary

Mathematical reasoning poses a significant challenge for language models due to its complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which continues pre-training DeepSeek-Coder-Base-v1.5 7B with 120B math-related tokens sourced from Common Crawl, together with natural language and code data. DeepSeekMath 7B has achieved an impressive score of 51.7% on the competition-level MATH benchmark w…
