Abstract
Techniques of this disclosure may enable a computing device to suggest one or more titles based on the content of audio being recorded or audio that was previously recorded, and other data such as time and location. Rather than applying a general default title or audio file name, the computing device may request authorization from a user to analyze the contents of a recorded audio file and, after receiving explicit authorization from the user, analyze the audio, including speech, and automatically suggest titles that are indicative of the content of the audio and/or other data. The computing device may convert speech included in the audio into text and extract a plurality of terms from the text based on various factors, such as word classes (e.g., convert audio that includes “this meatball recipe adds parmesan cheese” into text and extract a plurality of nouns such as “meatball,” “recipe,” “parmesan,” and “cheese” from the text). Based on various factors, such as term frequency in the text and the relative uniqueness of the terms in the spoken language, the computing device may identify a plurality of words from the plurality of terms to represent the overall content of the audio (e.g., identify “meatball” and “recipe” from “meatball,” “recipe,” “parmesan,” and “cheese” based on term frequency in the text). The computing device may also classify non-speech audio (e.g. applause, dog barking, music) and use the classification, including metadata associated with the classified audio object, such as song titles, to identify a plurality of words to represent the overall content of the audio. The speech terms, non-speech audio classification, classified audio object metadata, and other data may be combined to identify a plurality of words to represent the overall content of the audio. The computing device may display the identified words as suggested words to be included in the title of the audio file. The user may select one or more of the identified words as the title or combine one or more of the identified words with one or more other words entered by the user. The computing device may use the selected and/or entered words as the title for the audio and/or for the name of the audio file.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Inbar, Itay; Pitaru, Amit; Blankensmith, Isaac; Ayalon, Dror; Camolesi, Tiago; Santos, Guilherme; Lemieux, James; Lin, Sherry; and Cho, Jason, "SUGGESTING TITLES FOR AUDIO RECORDINGS", Technical Disclosure Commons, (December 23, 2019)
https://www.tdcommons.org/dpubs_series/2823