Abstract

A machine learning (ML) model often comprises many floating point numbers (model weights) and the operations applied on them (the computation graph). If a model is derived from another, the derived model often has similar weights and is of the same size. It is a waste of storage to store copies of the similar weights and computation graphs for two related models. This document describes techniques to obtain a sparse representation, referred to herein as thresholded model diff, that can be applied to a base model to reconstruct a version of a derived model. Differences between weights of the base model and a derived model obtained by fine-tuning the base model are identified and reduced with reversible operations. The reduced differences are (optionally) subjected to thresholding to obtain the thresholded model diff. A reconstructed model is obtained by selectively applying the thresholded model diff to the base model. The reconstructed model is evaluated to ensure that it can adequately perform the task that the derived model is fine-tuned for. The thresholding and application of the diff to the base model is adjusted based on the evaluation.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Nguyen, Nam; Hui, Jeffrey; and Mahendru, Aroma, "Reconstructing a Machine Learning Model Based on Thresholded Model Differences", Technical Disclosure Commons, (December 05, 2023)
https://www.tdcommons.org/dpubs_series/6476

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Reconstructing a Machine Learning Model Based on Thresholded Model Differences

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Reconstructing a Machine Learning Model Based on Thresholded Model Differences

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information