Abstract

This disclosure presents a system and method to select unsupervised data for speech processing using the distribution of supervised data in an embedded space. The data sets are represented by different colors to differentiate between supervised and unsupervised utterances. The system samples a set of utterances from the unsupervised data, such that the distribution of the unsupervised sample matches with the distribution of the supervised utterances. The sampling method converts the data sets into bins in a two-dimensional histogram, which is then normalized using the size of the data set for each bin. The data is then manipulated and selected so that the distribution of the data selected would closely match the distribution of the supervised data set. The system and method generates useful unsupervised data sets that could help train speech recognition models effectively.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Bardin, Benjami Alexander, "A System For Creating Network Of Businesses To Grant And Manage Consumer Incentives Collectively", Technical Disclosure Commons, (October 10, 2016)
https://www.tdcommons.org/dpubs_series/288

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

A System For Creating Network Of Businesses To Grant And Manage Consumer Incentives Collectively

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

A System For Creating Network Of Businesses To Grant And Manage Consumer Incentives Collectively

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information