Abstract

This paper describes a technique for jointly training multiple neural network models to handle complex and diverse data distributions. The technique partitions the data into separate classes, with each model specializing on a specific subset, and uses a classifier-based gating architecture with a temperature-controlled softmax to assign inputs to models. The training process gradually transitions from uniform model contribution to specialized model selection, combining the models' predictions through weighted summation. This distributes different data patterns across multiple smaller neural networks while maintaining inference efficiency, since only a single model is selected at inference time. The technique is particularly valuable when the data exhibits significant variation that would otherwise require an extremely large single model.
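As a rough illustration of the mechanism described above (not the paper's implementation), a PyTorch sketch might look like the following. The module names, layer sizes, and annealing schedule are assumptions; only the general structure, several small expert models, a classifier gate with a temperature-controlled softmax, and weighted summation of predictions, reflects the abstract.

```python
# Minimal sketch, assuming a gating classifier over several small expert networks.
import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedEnsemble(nn.Module):
    def __init__(self, in_dim, out_dim, num_experts=4, hidden=64):
        super().__init__()
        # Several small expert networks, each intended to specialize on a data subset.
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU(), nn.Linear(hidden, out_dim))
            for _ in range(num_experts)
        )
        # Classifier that scores which expert should handle each input.
        self.gate = nn.Linear(in_dim, num_experts)

    def forward(self, x, temperature=1.0):
        # Temperature-controlled softmax: high temperature gives near-uniform weights,
        # low temperature approaches one-hot selection of a single expert.
        weights = F.softmax(self.gate(x) / temperature, dim=-1)        # (B, E)
        preds = torch.stack([expert(x) for expert in self.experts], dim=1)  # (B, E, out)
        # Combine expert predictions through weighted summation.
        return (weights.unsqueeze(-1) * preds).sum(dim=1)

# Lowering the temperature over training moves the system from uniform model
# contribution toward specialized model selection; at inference a single expert
# can be picked via argmax over the gate scores.
model = GatedEnsemble(in_dim=16, out_dim=3)
x = torch.randn(8, 16)
for step in range(100):
    temperature = max(0.1, 5.0 * (0.97 ** step))  # assumed annealing schedule
    y_hat = model(x, temperature=temperature)
```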

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.