Abstract

This disclosure relates to contextualization transcription of non-verbal communication. All transcription applications today take inputs from audio capturing endpoints like microphones and apply voice/speech recognition algorithms and then speech-to-text transformations to transcribe the user’s speech. This transcription can further be enhanced by adding non-verbal inputs (inferred through video AI) to the transcription which can add contextual value to the transcription. This disclosure proposes methods to combine both video and audio inputs and transcribe it to solve this issue. With the use camera AI to add non-verbal context to transcription. This would complement audio transcription. Camera AI would look for common gestures, motions, activities and add to transcription. AI would learn new behaviors of participants.

Creative Commons License

This work is licensed under a Creative Commons Attribution-Share Alike 4.0 License.

Recommended Citation

INC, HP, "CONTEXTUALIZATION TRANSCRIPTION OF NON-VERBAL COMMUNICATION", Technical Disclosure Commons, (September 30, 2022)
https://www.tdcommons.org/dpubs_series/5413

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

CONTEXTUALIZATION TRANSCRIPTION OF NON-VERBAL COMMUNICATION

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

CONTEXTUALIZATION TRANSCRIPTION OF NON-VERBAL COMMUNICATION

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information