Abstract

Many applications used for listening to audio provide mechanisms to adjust the speed of audio playback. Due to variation in speaking speeds of speakers in the audio, users often need to keep adjusting the playback speed during playback to achieve a desired speed or have to tolerate listening to the audio at a speed different from their preferred listening speed. Both situations result in a suboptimal user experience. This disclosure describes techniques to standardize the rate of speech in audio to improve a user’s audio listening experience. The standardization of speech can be performed to achieve audio playback based on one or more listening parameters such as words per minute, speech speed multiplier, etc. The appropriate audio playback speed is then determined based on the metrics extracted from analyzing the audio. This enables the audio to be sped up or slowed down automatically during playback to provide the user-preferred audio listening user experience.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS