arXiv 2503.15421

Probing the topology of the space of tokens with structured prompts

By Michael Robinson, Sourya Dey, et al.

Published 2025-03-19

Mindmap

Browse the paper's core ideas, clusters, and relationships in a structured outline.

This article presents a general and flexible method for prompting a large language model (LLM) to reveal its (hidden) token input embedding up to homeomorphism. Moreover, this article provides strong theoretical justification -- a mathematical proof for generic LLMs -- for why this method should be expected to work. With this method in hand, we demonstrate its effectiveness by recovering the token subspace of Llemma…

View the original paper on arXiv