Inventor(s)

Abstract

A hardware trap positioned between processing elements and a memory router and/or last-level cache intercepts tagged embedding read requests. The trap includes a dual-table structure comprising a low-frequency table that tracks access frequency for addresses without storing full embedding data and a high-frequency table that stores embedding data for addresses promoted as high-frequency. When an address exceeds a configurable promotion threshold, the trap initiates a fill request and promotes the address for local servicing. During the fill window, a configurable blocking threshold stalls additional requests for the promoted address to limit backpressure, and stalled requests are serviced upon data arrival. An aging mechanism periodically adjusts tracking state and supports eviction of stale entries. Parameters including table sizes, thresholds, and aging rates are configurable to adapt to workload characteristics exhibiting skewed access distributions.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS