Abstract

Techniques are described for a low-bandwidth call mode that replaces camera video transmission with streaming of audio-derived avatar animation parameters. During a real-time call, endpoints exchange capability information and negotiate an operating tier specifying parameter formats, update rates, and synchronization expectations. In response to user input or automatically upon degraded network conditions (e.g., low uplink bandwidth, packet loss, or jitter), a sender disables video frame transmission and processes speech audio in short windows using an on-device model to produce a time-stamped parameter stream including visemes and optionally expression and gesture parameters with confidence scores. The stream is adapted in rate, precision, or tier, and may fall back to neutral or lip-sync-only when confidence is low. A receiver renders an avatar locally, synchronizes animation to audio using timestamps and buffering, and applies packet-loss concealment behaviors.
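As a rough illustration of the sender-side behavior summarized above, the following Python sketch shows one way the mode trigger and the confidence-based fallback might be expressed. It is not taken from the disclosure: the type names (`NetworkStats`, `ParameterFrame`, `RenderMode`), the function names, the `"sil"` silence viseme label, and every threshold value are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, Optional, Tuple


class RenderMode(Enum):
    FULL = "full"                # visemes plus expression and gesture
    LIP_SYNC_ONLY = "lip_sync"   # visemes only
    NEUTRAL = "neutral"          # neutral pose, no speech-driven animation


@dataclass
class NetworkStats:
    uplink_kbps: float
    loss_fraction: float  # 0.0 to 1.0
    jitter_ms: float


@dataclass
class ParameterFrame:
    timestamp_ms: int            # lets the receiver align animation with audio
    viseme: str
    viseme_confidence: float
    expression: Dict[str, float] = field(default_factory=dict)
    expression_confidence: float = 0.0
    gesture: Optional[str] = None
    gesture_confidence: float = 0.0


# Hypothetical thresholds; the abstract does not specify any values.
MIN_UPLINK_KBPS = 150.0
MAX_LOSS_FRACTION = 0.05
MAX_JITTER_MS = 60.0
MIN_VISEME_CONFIDENCE = 0.3
MIN_AUX_CONFIDENCE = 0.6


def should_enter_low_bandwidth_mode(stats: NetworkStats,
                                    user_requested: bool = False) -> bool:
    """Enter the mode on user input or when any network metric degrades."""
    if user_requested:
        return True
    return (stats.uplink_kbps < MIN_UPLINK_KBPS
            or stats.loss_fraction > MAX_LOSS_FRACTION
            or stats.jitter_ms > MAX_JITTER_MS)


def apply_confidence_fallback(
        frame: ParameterFrame) -> Tuple[RenderMode, ParameterFrame]:
    """Pick a render mode and strip parameters the model is unsure about."""
    if frame.viseme_confidence < MIN_VISEME_CONFIDENCE:
        # Assumed rule: unreliable visemes fall back to a neutral face.
        neutral = ParameterFrame(frame.timestamp_ms, "sil",
                                 frame.viseme_confidence)
        return RenderMode.NEUTRAL, neutral
    if (frame.expression_confidence < MIN_AUX_CONFIDENCE
            or frame.gesture_confidence < MIN_AUX_CONFIDENCE):
        # Assumed rule: keep lip sync, drop expression and gesture together.
        lips = ParameterFrame(frame.timestamp_ms, frame.viseme,
                              frame.viseme_confidence)
        return RenderMode.LIP_SYNC_ONLY, lips
    return RenderMode.FULL, frame
```

In this sketch a sender would call `should_enter_low_bandwidth_mode` on each network measurement and, once in the mode, pass every model output through `apply_confidence_fallback` before transmission. Dropping expression and gesture together when either is uncertain is a simplification; a per-channel decision would be equally consistent with the abstract.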

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Anonymous, "Low-Bandwidth Call Mode Using Audio-Derived Expression and Gesture Parameter Streaming", Technical Disclosure Commons, (June 30, 2026)
https://www.tdcommons.org/dpubs_series/10658

Technical Disclosure Commons

Defensive Publications Series

Low-Bandwidth Call Mode Using Audio-Derived Expression and Gesture Parameter Streaming
