arXiv 2511.08892

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

By Weihao Tan, Xiangyang Li, et al.

Published 2025-11-12

We introduce Lumine, the first open recipe for developing generalist agents capable of completing hours-long complex missions in real time within challenging 3D open-world environments. Lumine adopts a human-like interaction paradigm that unifies perception, reasoning, and action in an end-to-end manner, powered by a vision-language model. It processes raw pixels at 5 Hz to produce precise 30 Hz keyboard-mouse actio…
