Abstract

In educational videos, the speaker often presents a set of slides that serve as logical cues and content markers. Currently, video platforms do not use the strong cues already available in the form of slides. This disclosure describes techniques that enable video viewers to more naturally navigate the video. With user permission, computer vision techniques are applied to video content to detect whether it includes a presentation, to track individual slide changes, and to recognize content displayed on each slide. Automatic understanding of slide content is utilized to improve speech recognition and content captioning, to enhance video search, and to provide improved user interfaces, e.g., a table-of-contents for the video.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Sharifi, Matthew, "Slide-based Navigation and Understanding for Video Content", Technical Disclosure Commons, (February 26, 2021)
https://www.tdcommons.org/dpubs_series/4106

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Slide-based Navigation and Understanding for Video Content

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Slide-based Navigation and Understanding for Video Content

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information