Abstract

Speech recognition enables users to interact with devices via their voice. However, errors in speech recognition during a user’s interaction with such devices can be problematic and lead to a less than satisfactory user experience. This disclosure describes the use of language modeling to recover from automatic speech recognition (ASR) errors by identifying broken queries. The full natural language understanding (NLU) stack is executed to obtain a coherent, alternative, speech recognition. The alternative recognition (or query) runs in parallel to the original, misrecognized query. The potential actions triggered by the misrecognized and the NLU-augmented queries are compared to pick the query interpretation that is more likely to be correct.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Janus, Pawel; Weisz, Agoston; Boffy, Aurelien; and Michalski, Miroslaw, "Use of Language Models to Improve Automatic Speech Recognition", Technical Disclosure Commons, (March 09, 2022)
https://www.tdcommons.org/dpubs_series/4953

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Use of Language Models to Improve Automatic Speech Recognition

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Use of Language Models to Improve Automatic Speech Recognition

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information