Abstract
Media content with an extreme dynamic range forces uncomfortable transitions between quiet dialogue and loud action sequences. Conventional reactive audio limiters introduce latency and unnatural pumping artifacts. This disclosure describes using subtitle metadata as a lookahead control signal to adjust audio levels preemptively. Text in the subtitle stream is parsed for semantic indicators of high-intensity or low-intensity audio events. Based on these indicators, the volume is proactively attenuated ahead of high-intensity events or amplified ahead of low-intensity events, before the transient spike or dip occurs. The baseline volume level is smoothly restored after the event. This preserves audio fidelity and maintains a consistent listening experience.
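The parsing and gain-scheduling steps outlined in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the disclosure: the keyword lists, the gain offsets of -6 dB and +4 dB, the 0.5 s lookahead, the 1.0 s release, the linear ramps, and the names `Cue`, `classify`, and `gain_db`.

```python
from dataclasses import dataclass

# Hypothetical indicator words; the disclosure does not list specific terms.
LOUD_WORDS = {"explosion", "gunfire", "screams", "crash"}
QUIET_WORDS = {"whispers", "whispering", "murmurs", "sighs"}


@dataclass
class Cue:
    start: float  # cue start time in seconds
    end: float    # cue end time in seconds
    text: str     # subtitle text, e.g. "[explosion]"


def classify(text):
    """Return an assumed gain offset in dB for a cue (0.0 means no change)."""
    words = {w.strip("[]().,!?").lower() for w in text.split()}
    if words & LOUD_WORDS:
        return -6.0  # attenuate ahead of a high-intensity event
    if words & QUIET_WORDS:
        return 4.0   # amplify ahead of a low-intensity event
    return 0.0


def gain_db(t, cues, lookahead=0.5, release=1.0):
    """Gain offset in dB at time t.

    The gain ramps linearly toward the target during the lookahead window,
    holds for the duration of the cue, then ramps back to baseline.
    """
    gain = 0.0
    for cue in cues:
        target = classify(cue.text)
        if target == 0.0:
            continue
        if cue.start - lookahead <= t < cue.start:
            weight = (t - (cue.start - lookahead)) / lookahead
        elif cue.start <= t <= cue.end:
            weight = 1.0
        elif cue.end < t <= cue.end + release:
            weight = 1.0 - (t - cue.end) / release
        else:
            continue
        candidate = target * weight
        # When flagged cues overlap, keep the larger adjustment.
        if abs(candidate) > abs(gain):
            gain = candidate
    return gain


cues = [
    Cue(10.0, 12.0, "[explosion]"),
    Cue(20.0, 22.0, "(whispers) Over here."),
]
```

With these cues, `gain_db(9.75, cues)` falls halfway through the lookahead ramp before the explosion, and `gain_db(12.5, cues)` falls halfway through the release back to baseline. A real player would apply the resulting offset to the audio samples and would likely use a smoother ramp shape than the linear one assumed here.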
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Agarwal, Nikita, "Subtitle Lookahead Audio Leveling", Technical Disclosure Commons, ()
https://www.tdcommons.org/dpubs_series/10451