Abstract

A system for generative audio replacement via temporal and spectral matching is described. The system extracts a Semantic Audio Profile, comprising temporal features and spectral/tonal features, from a reference audio track; generates replacement audio using a generative model conditioned on that profile, with temporal control signals for beat alignment; performs source separation to preserve dialogue and sound effects; and mixes the generated audio with the preserved stems to produce synchronized replacement audio.
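
The first and last stages of the pipeline described above can be illustrated with a minimal sketch. It is not the disclosed implementation: the class `SemanticAudioProfile` and the functions `profile_from_beats` and `mix_stems` are hypothetical names, the profile carries only temporal features here, and the beat times, the generated audio, and the preserved stems are assumed to come from a beat tracker, a generative model, and a source-separation step that are not shown.

```python
from dataclasses import dataclass
from statistics import median
from typing import List


@dataclass
class SemanticAudioProfile:
    """Hypothetical container; only the temporal part of the profile is modeled."""
    tempo_bpm: float
    beat_times: List[float]  # seconds; usable as a temporal control signal


def profile_from_beats(beat_times: List[float]) -> SemanticAudioProfile:
    """Estimate tempo from the median inter-beat interval of the reference track."""
    intervals = [b - a for a, b in zip(beat_times, beat_times[1:])]
    if not intervals:
        raise ValueError("need at least two beat times")
    return SemanticAudioProfile(60.0 / median(intervals), list(beat_times))


def mix_stems(generated: List[float],
              preserved_stems: List[List[float]],
              music_gain: float = 0.8) -> List[float]:
    """Sum the generated audio with preserved stems (e.g. dialogue, effects).

    All inputs are mono sample lists of equal length. If the sum would
    exceed full scale, the whole mix is scaled down to peak at 1.0.
    """
    n = len(generated)
    if any(len(stem) != n for stem in preserved_stems):
        raise ValueError("all stems must match the generated audio length")
    mixed = [music_gain * g + sum(stem[i] for stem in preserved_stems)
             for i, g in enumerate(generated)]
    peak = max((abs(x) for x in mixed), default=0.0)
    if peak > 1.0:
        mixed = [x / peak for x in mixed]
    return mixed
```

In this sketch the median interval is used rather than the mean so that a single missed or doubled beat does not skew the tempo estimate; peak normalization stands in for whatever loudness handling a real mixer would apply.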

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
