Abstract
Proposed herein is a system to facilitate fault detection and root cause analysis (RCA) in distributed systems using a network of artificial intelligence (AI) agents that can work together to perform distributed debugging. The proposed system employs a hybrid architecture of centralized and localized AI agents, each co-located with network devices, enabling near-real-time, context-aware analysis of logs, traces, ring-buffers and other telemetry at the point of generation.
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.
Recommended Citation
Jajoo, Akshay; Koehler, Timo; Halley, Josh; Kompella, Ramana; and Lee, Myungjin, "NETWORK OF AI-AGENTS FOR MULTI-STEP ITERATIVE DEBUGGING", Technical Disclosure Commons, ()
https://www.tdcommons.org/dpubs_series/8925