arXiv 2503.04721

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

By Guan-Ting Lin, Jiachen Lian, et al.

Published 2025-03-06

Spoken dialogue modeling poses challenges beyond text-based language modeling, requiring real-time interaction, turn-taking, and backchanneling. While most Spoken Dialogue Models (SDMs) operate in half-duplex mode, processing one turn at a time, emerging full-duplex SDMs can listen and speak simultaneously, enabling more natural conversations. However, current evaluations remain limited, focusing mainly on turn-base…
