Abstract

Techniques are provided for real-time speech recognition using a semi-supervised model based on wav2vec 2.0. Only minimal training data is required, which makes it possible to serve under-represented, low-resource languages at a quality level comparable to that of more widely available languages.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Le Groux, Sylvain and Huang, Zili, "PRODUCTION-GRADE ONLINE SPEECH RECOGNITION FOR LOW-RESOURCE LANGUAGES", Technical Disclosure Commons, (November 14, 2022)
https://www.tdcommons.org/dpubs_series/5500

Technical Disclosure Commons

Defensive Publications Series

PRODUCTION-GRADE ONLINE SPEECH RECOGNITION FOR LOW-RESOURCE LANGUAGES
