arXiv 2503.15421
Probing the topology of the space of tokens with structured prompts
By Michael Robinson, Sourya Dey, et al.
Published 2025-03-19
Mindmap
Browse the paper's core ideas, clusters, and relationships in a structured outline.
This article presents a general and flexible method for prompting a large language model (LLM) to reveal its (hidden) token input embedding up to homeomorphism. Moreover, this article provides strong theoretical justification -- a mathematical proof for generic LLMs -- for why this method should be expected to work. With this method in hand, we demonstrate its effectiveness by recovering the token subspace of Llemma…