Defensive Publications Series

Improved Contextual Grounding by Combining Multiple Speech Transcription Hypotheses

Abstract

Processing a user’s voice command includes parsing the command to identify the components it refers to. These identified components, or arguments, are then mapped to items or objects in the real world in a process known as “grounding.” In some cases, transcription inaccuracies prevent a virtual assistant or other application from grounding the command accurately, which makes it impossible to service the command. This disclosure describes techniques to improve grounding by taking into account the N highest-likelihood transcriptions of a user’s voice command, along with contextual information accessed with the user’s permission. Improved query interpretation enables a virtual assistant or other application to interpret the command accurately and thereby improves the user experience.
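As a rough illustration of the idea in the abstract, the sketch below grounds a command against a small set of context entities by scanning several transcription hypotheses instead of only the top one. It is not the method from the disclosure: the function name `ground`, the `threshold` parameter, the use of `difflib` string similarity, and the likelihood-times-similarity score are all assumptions chosen for illustration, and the contact list stands in for contextual information accessed with the user's permission.

```python
from difflib import SequenceMatcher


def ground(hypotheses, context_entities, threshold=0.8):
    """Pick the context entity best supported by any transcription hypothesis.

    hypotheses: list of (text, likelihood) pairs, e.g. an N-best list (assumed format).
    context_entities: list of strings, e.g. contact names (assumed context source).
    Returns (score, entity, hypothesis_text), or None if nothing matches well enough.
    """
    best = None
    for text, likelihood in hypotheses:
        words = text.lower().split()
        for entity in context_entities:
            n = len(entity.split())
            # Compare the entity with every word window of the same length.
            for i in range(len(words) - n + 1):
                span = " ".join(words[i:i + n])
                similarity = SequenceMatcher(None, span, entity.lower()).ratio()
                # Hypothetical scoring rule: likelihood weighted by string similarity.
                score = likelihood * similarity
                if similarity >= threshold and (best is None or score > best[0]):
                    best = (score, entity, text)
    return best


# Hypothetical N-best list: the top hypothesis mishears the contact name.
n_best = [("call and", 0.5), ("call anne", 0.3), ("call ann", 0.2)]
contacts = ["Anne", "Bob"]

top_1_only = ground(n_best[:1], contacts)  # no contact matches "call and"
with_n_best = ground(n_best, contacts)     # the second hypothesis grounds to "Anne"
```

In this toy case the top hypothesis alone cannot be grounded, while the lower-ranked hypothesis "call anne" matches an entity in the context, which is the kind of recovery the abstract describes. A real system would likely use a more robust matcher and a learned scoring function.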

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Faruqui, Manaal; Verma, Vishal; and Gupta, Aditya, "Improved Contextual Grounding by Combining Multiple Speech Transcription Hypotheses", Technical Disclosure Commons, (February 19, 2021)
https://www.tdcommons.org/dpubs_series/4082
