arXiv 2503.06029

SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

By Xudong Lu, Haohao Gao, et al.

Published 2025-03-08

Large Language Models (LLMs) have become integral to daily life, particularly as intelligent assistants deployed on-device on smartphones. However, existing LLM evaluation benchmarks focus predominantly on objective tasks such as mathematics and coding in English, which do not necessarily reflect the practical use cases of on-device LLMs in real-world mobile scenarios, especially for Chinese users. To…
