arXiv 2510.00050

Object-AVEdit: An Object-level Audio-Visual Editing Model

By Youquan Fu, Ruiyang Si, et al.

Published 2025-09-27

Wiki summary

Audio-visual editing is in high demand in video post-production and filmmaking. While numerous models have explored audio editing and video editing, they struggle with object-level audio-visual operations. Specifically, object-level audio-visual editing requires the ability to add, replace, and remove objects across both the audio and visual modalities, while preserving the structural inform…
