Inventor(s)

Abstract

Techniques are described for temporally refining per-frame portrait segmentation masks in video. A raw mask is processed to form an eroded foreground confidence core and a three-zone partition including core, boundary, and background regions. Motion is obtained from the core region, for example using confidence-aware block matching that applies a motion-magnitude bias and produces a confidence derived from matching cost. Motion vectors are propagated from the core into boundary pixels using confidence- and distance-weighted aggregation. A prior refined mask is warped using the resulting motion field and fused with the raw mask using confidence-gated temporal blending with zone-specific rules, including forcing background pixels to zero and falling back to the raw mask when confidence is low. The framework can accept motion from block matching, a neural motion estimator, or codec motion metadata.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS