arXiv 2601.15727
Towards Automated Kernel Generation in the Era of LLMs
By Yang Yu, Peiyu Zang, et al.
Published 2026-01-22
The performance of modern AI systems is fundamentally constrained by the quality of their underlying kernels, which translate high-level algorithmic semantics into low-level hardware operations. Achieving near-optimal kernels requires expert-level understanding of hardware architectures and programming models, making kernel engineering a critical but notoriously time-consuming and non-scalable process. Recent advanc…