Defensive Publications Series

Low-Overhead In-Kernel Telemetry for ML Accelerators using Shared Memory FIFO

Abstract

Existing profiling tools for machine learning accelerators often lack the granular visibility to diagnose performance bottlenecks within collective communication kernels, typically identifying only that a kernel is slow without pinpointing the specific cause. A method for in-kernel telemetry uses a shared memory first-in-first-out (FIFO) queue established between a host central processing unit (CPU) and an accelerator device. The accelerator kernel is instrumented with lightweight hooks that write fine-grained telemetry events, such as data arrival timestamps from peer accelerators, directly to the FIFO without blocking execution. A dedicated process on the host asynchronously polls the FIFO to consume the telemetry data. This approach provides real-time, logic-level visibility into sub-kernel events, allowing for the precise identification of performance anomalies like straggler devices or congested interconnects. The use of a lock-free shared memory buffer allows for telemetry collection with minimal performance impact on the accelerator’s high-bandwidth data path.

Keywords: In-kernel telemetry, ML accelerators, shared memory FIFO, asynchronous data extraction, sub-kernel event measurement, low-overhead profiling

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

N/A and N/A, "Low-Overhead In-Kernel Telemetry for ML Accelerators using Shared Memory FIFO", Technical Disclosure Commons, (May 13, 2026)
https://www.tdcommons.org/dpubs_series/10113

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Low-Overhead In-Kernel Telemetry for ML Accelerators using Shared Memory FIFO

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Low-Overhead In-Kernel Telemetry for ML Accelerators using Shared Memory FIFO

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information