Defensive Publications Series

SUGGESTING TITLES FOR AUDIO RECORDINGS

Itay InbarFollow
Amit PitaruFollow
Isaac BlankensmithFollow
Dror AyalonFollow
Tiago CamolesiFollow
Guilherme SantosFollow
James LemieuxFollow
Sherry LinFollow
Jason ChoFollow

Abstract

Techniques of this disclosure may enable a computing device to suggest one or more titles based on the content of audio being recorded or audio that was previously recorded, and other data such as time and location. Rather than applying a general default title or audio file name, the computing device may request authorization from a user to analyze the contents of a recorded audio file and, after receiving explicit authorization from the user, analyze the audio, including speech, and automatically suggest titles that are indicative of the content of the audio and/or other data. The computing device may convert speech included in the audio into text and extract a plurality of terms from the text based on various factors, such as word classes (e.g., convert audio that includes “this meatball recipe adds parmesan cheese” into text and extract a plurality of nouns such as “meatball,” “recipe,” “parmesan,” and “cheese” from the text). Based on various factors, such as term frequency in the text and the relative uniqueness of the terms in the spoken language, the computing device may identify a plurality of words from the plurality of terms to represent the overall content of the audio (e.g., identify “meatball” and “recipe” from “meatball,” “recipe,” “parmesan,” and “cheese” based on term frequency in the text). The computing device may also classify non-speech audio (e.g. applause, dog barking, music) and use the classification, including metadata associated with the classified audio object, such as song titles, to identify a plurality of words to represent the overall content of the audio. The speech terms, non-speech audio classification, classified audio object metadata, and other data may be combined to identify a plurality of words to represent the overall content of the audio. The computing device may display the identified words as suggested words to be included in the title of the audio file. The user may select one or more of the identified words as the title or combine one or more of the identified words with one or more other words entered by the user. The computing device may use the selected and/or entered words as the title for the audio and/or for the name of the audio file.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Inbar, Itay; Pitaru, Amit; Blankensmith, Isaac; Ayalon, Dror; Camolesi, Tiago; Santos, Guilherme; Lemieux, James; Lin, Sherry; and Cho, Jason, "SUGGESTING TITLES FOR AUDIO RECORDINGS", Technical Disclosure Commons, (December 23, 2019)
https://www.tdcommons.org/dpubs_series/2823

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

SUGGESTING TITLES FOR AUDIO RECORDINGS

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

SUGGESTING TITLES FOR AUDIO RECORDINGS

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information