Hao Wu


Automatic speech-to-text software sometimes produces transcripts containing errors, which users correct manually, e.g., by inserting or deleting words. It is important that the edited transcript bear word-level timestamps that are faithful to the raw (unedited) transcript. This disclosure describes techniques that can be applied in real time to automatically and accurately recalculate the timestamps of words in a transcript even as a user edits it. With immediate, real-time timestamp recalculation as described herein, users can play a recording and edit the corresponding transcript while still using transcript-editing utilities such as karaoke-style highlighting; word, sentence, or paragraph insertion, replacement, or deletion; paragraph break insertion; paragraph merging; and playing or sharing selected audio segments (sentences, paragraphs, etc.) of the transcript.
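
The abstract does not spell out the recalculation procedure, so the following is only an illustrative sketch of one plausible approach, not the method of the disclosure. It assumes the raw transcript is available as (word, start, end) tuples in seconds, aligns the edited word list against the raw words with Python's standard `difflib.SequenceMatcher`, keeps the raw timestamps of unchanged words, and spreads replaced or inserted words evenly over the time span they occupy. The function name `retime` and the even-spreading rule are assumptions made for illustration.

```python
from difflib import SequenceMatcher


def retime(raw, edited_words):
    """Assign timestamps to edited words (hypothetical helper).

    raw: list of (word, start, end) tuples from the unedited transcript.
    edited_words: list of words after the user's edits.
    Returns a list of (word, start, end) tuples for the edited words.
    """
    raw_words = [word for word, _, _ in raw]
    matcher = SequenceMatcher(a=raw_words, b=edited_words, autojunk=False)
    out = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged words keep their raw timestamps.
            out.extend(raw[i1:i2])
            continue
        if tag == "delete":
            # Deleted words simply drop out of the edited transcript.
            continue
        if tag == "replace":
            # New words share the span of the raw words they replace.
            lo, hi = raw[i1][1], raw[i2 - 1][2]
        else:
            # "insert": use the gap between the neighbouring raw words
            # (assumption: the gap may be zero-width).
            lo = raw[i1 - 1][2] if i1 > 0 else 0.0
            hi = raw[i1][1] if i1 < len(raw) else lo
        count = j2 - j1
        step = (hi - lo) / count
        for k in range(count):
            word = edited_words[j1 + k]
            out.append((word, lo + k * step, lo + (k + 1) * step))
    return out


raw = [("the", 0.0, 0.5), ("quick", 0.5, 1.0), ("fox", 1.5, 2.0)]
result = retime(raw, ["the", "quick", "brown", "fox"])
# "brown" lands in the 1.0 to 1.5 s gap between "quick" and "fox".
```

Re-running an alignment like this after each edit is cheap for paragraph-sized inputs, which is one way the real-time behaviour described above could be approximated; a production system would likely align only the edited region.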

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.