arXiv 2508.09932
Mathematical Computation and Reasoning Errors by Large Language Models
By Liang Zhang and Edith Aurora Graf
Published 2025-08-13
Citation lineage
Review the prior work and downstream research connected to this paper.
Large Language Models (LLMs) are increasingly utilized in AI-driven educational instruction and assessment, particularly within mathematics education. The capability of LLMs to generate accurate answers and detailed solutions for math problem-solving tasks is foundational for ensuring reliable and precise feedback and assessment in math education practices. Our study focuses on evaluating the accuracy of four LLMs (…