Abstract

It is difficult to make a note of a fact, look up an entity, or perform other actions on audio content in the moment to enable remembering things for later. This disclosure describes techniques to create bookmarks for audio content such as podcasts, audiobooks, etc. with easy to perform gestures without interrupting the listening session. In response to the user performing the gesture, a capture flow is executed to transcribe, save, and make the audio content available for later search, reference, consumption, sharing, or browsing, e.g., via a bookmark. A large language model (LLM) can be utilized for various purposes such as to help the user search and revisit saved bookmarks via natural language queries to a conversational agent; to automatically generate titles for the bookmarked audio snippet; to summarize the audio snippet; to extract entities from the audio snippet; etc.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Peiffer, Pol Henri Adrien, "Gesture-driven Audio Bookmarks Powered by Large Language Model", Technical Disclosure Commons, (July 18, 2023)
https://www.tdcommons.org/dpubs_series/6064

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Gesture-driven Audio Bookmarks Powered by Large Language Model

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Gesture-driven Audio Bookmarks Powered by Large Language Model

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information