Abstract

Large, complex software systems are critical in many contexts. Outages and security incidents in such systems can cause financial losses and user dissatisfaction. Root cause analysis for system outages is performed manually, which is costly and time-consuming. It relies on time series data and logs from the monitored systems, records of changes to those systems, outages of critical infrastructure that those systems depend on, and similar signals. This disclosure describes techniques that use the ability of a large language model (LLM) to ingest large amounts of data and perform reasoning tasks in response to prompts. Per the techniques, relevant data about a monitored system is provided to an LLM along with a prompt that instructs the LLM to perform root cause analysis. Engineering teams use the LLM output to determine and execute mitigation strategies. The prompt is updated, and the LLM is further trained, based on how well the LLM performs the root cause analysis.
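
As a rough illustration of the step in which monitored-system data and a prompt are provided to an LLM, the following minimal sketch assembles the data sources named above into a single prompt. All names here (`IncidentContext`, `build_rca_prompt`, `analyze_outage`, the section titles, and the prompt wording) are assumptions made for illustration and do not come from the disclosure. The LLM itself is passed in as a plain callable, so no specific model or API is implied.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class IncidentContext:
    """Hypothetical container for the data sources listed in the abstract."""
    system_name: str
    metric_anomalies: List[str] = field(default_factory=list)    # summarized time series data
    log_excerpts: List[str] = field(default_factory=list)        # logs from the monitored system
    recent_changes: List[str] = field(default_factory=list)      # records of changes to the system
    dependency_outages: List[str] = field(default_factory=list)  # critical infrastructure outages


def _section(title: str, items: List[str]) -> str:
    """Render one data source as a titled bullet list; mark empty sources explicitly."""
    body = "\n".join(f"- {item}" for item in items) if items else "- (none reported)"
    return f"## {title}\n{body}"


def build_rca_prompt(ctx: IncidentContext) -> str:
    """Combine the incident data with an instruction to perform root cause analysis."""
    parts = [
        f"You are assisting with root cause analysis for an outage of {ctx.system_name}.",
        _section("Metric anomalies", ctx.metric_anomalies),
        _section("Log excerpts", ctx.log_excerpts),
        _section("Recent changes", ctx.recent_changes),
        _section("Dependency outages", ctx.dependency_outages),
        "Identify the most likely root cause, cite the evidence above that supports it, "
        "and suggest mitigation steps.",
    ]
    return "\n\n".join(parts)


def analyze_outage(ctx: IncidentContext, llm: Callable[[str], str]) -> str:
    """Send the prompt to an injected LLM callable (an assumption) and return its text output."""
    return llm(build_rca_prompt(ctx))
```

Keeping the prompt builder separate from the model call is one possible design choice: the prompt text can then be revised on its own as the LLM's root cause analysis performance is reviewed, without touching the data collection code.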

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

NA, "LLM-powered Anomaly Detection for Determining Root Cause of Outages", Technical Disclosure Commons, (June 17, 2025)
https://www.tdcommons.org/dpubs_series/8239


Technical Disclosure Commons

Defensive Publications Series

LLM-powered Anomaly Detection for Determining Root Cause of Outages
