Abstract

Many long-form video and audio files lack transcripts, which reduces content accessibility for individuals who prefer reading, are hearing impaired, or have constrained time for media consumption. Existing automated captioning services are often restricted to specific platforms and are not natively integrated into general web browsers. This disclosure describes a browser-integrated method that detects media on a webpage and offers background transcription. When initiated, the audio is processed by or routed to a voice-to-text service while the media plays in the background. Transcription is performed either locally using on-device models or via cloud-based processing. Once the process is complete, a notification is provided and the transcript is displayed or saved. If a media file is fully downloadable, transcription may occur silently without active playback. This method facilitates efficient content consumption by enabling the generation of text versions for any browser-accessible media.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Azose, Benjamin Albert, "Automated Background Transcription for Audio and Video Content in a Web Browser", Technical Disclosure Commons, (February 27, 2026)
https://www.tdcommons.org/dpubs_series/9430

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Automated Background Transcription for Audio and Video Content in a Web Browser

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Automated Background Transcription for Audio and Video Content in a Web Browser

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information