Abstract
The present disclosure relates to watermarking techniques for neural network models that embed verifiable ownership information into model parameters. In one aspect, floating-point weight parameters are quantized and least significant bits are biased during rounding such that the bits collectively encode a pseudorandom error-correcting code derived from a digital signature of a model owner. In another aspect, a pseudorandom codeword generated from the digital signature is directly embedded by replacing least significant bits of quantized weight parameters. In both cases, the watermark is cryptographically verifiable and remains detectable even after model perturbations, fine-tuning, or partial retraining. Ownership verification is performed by extracting least significant bits, reconstructing the encoded sequence, and decoding the error-correcting code to recover the digital signature. The disclosed techniques provide efficient, architecture-agnostic, and tamper-resistant watermarking suitable for large-scale neural networks, enabling model owners to demonstrate authorship and deter misappropriation.
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Kumaresam, Ranjit; Raghuraman, Srinivasan; and Ng, Ka Log, "NEURAL NETWORK MODEL WATERMARKING USING QUANTIZED LEAST SIGNIFICANT BITS AND PSEUDORANDOM ERROR-CORRECTING CODES", Technical Disclosure Commons, ()
https://www.tdcommons.org/dpubs_series/10523