Abstract

Using large language models (LLMs) to query large-scale, graph-structured data can present challenges, as encoding an entire graph into a single prompt may exceed context window limits and can lead to reduced accuracy on multi-hop queries. Systems and methods can partition a graph into a collection of smaller, human-readable documents, or shards. An agentic orchestration process may use a global symbolic index, contained within a summary shard, to locate and retrieve relevant shards on demand. This process can decompose a complex query into a sequence of single-hop LLM calls, where each call operates on a context comprising one or more shards. This approach can enable an LLM to function as a query engine for graphs that are substantially larger than its context capacity, which can improve scalability and may reduce the need for a dedicated graph database system for certain analytical workloads.
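The orchestration loop described above can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the names partition_graph, stub_llm_single_hop, and answer_multi_hop, the "src -[rel]-> dst" shard text format, and the first-letter sharding rule are all hypothetical, and the LLM call is replaced by a simple string-matching stub so the example runs without a model. A plain dictionary stands in for the symbolic index that the summary shard would hold.

```python
from collections import defaultdict


def partition_graph(edges, shard_of):
    """Split (src, rel, dst) edges into human-readable text shards.

    Returns the shards plus a node -> shard id index. The index is an
    assumed stand-in for the symbolic index held in the summary shard.
    """
    lines_by_shard = defaultdict(list)
    index = {}
    for src, rel, dst in edges:
        shard_id = shard_of(src)
        lines_by_shard[shard_id].append(f"{src} -[{rel}]-> {dst}")
        index[src] = shard_id
    shards = {sid: "\n".join(lines) for sid, lines in lines_by_shard.items()}
    return shards, index


def stub_llm_single_hop(shard_text, node, relation):
    """Hypothetical stand-in for one single-hop LLM call over one shard."""
    prefix = f"{node} -[{relation}]-> "
    return [
        line[len(prefix):]
        for line in shard_text.splitlines()
        if line.startswith(prefix)
    ]


def answer_multi_hop(start, relations, shards, index, llm=stub_llm_single_hop):
    """Resolve a multi-hop query as a sequence of single-hop calls."""
    frontier = [start]
    for relation in relations:
        next_frontier = []
        for node in frontier:
            shard_id = index.get(node)  # locate the relevant shard via the index
            if shard_id is None:
                continue  # node has no outgoing edges in any shard
            for found in llm(shards[shard_id], node, relation):
                if found not in next_frontier:
                    next_frontier.append(found)
        frontier = next_frontier
    return frontier


# Toy graph; sharding by first letter is an arbitrary illustrative choice.
edges = [
    ("alice", "manages", "bob"),
    ("alice", "manages", "carol"),
    ("bob", "works_on", "atlas"),
    ("carol", "works_on", "atlas"),
]
shards, index = partition_graph(edges, lambda node: f"shard_{node[0]}")
result = answer_multi_hop("alice", ["manages", "works_on"], shards, index)
print(result)
```

In this sketch each hop only ever places one shard in the call's context, so the prompt size depends on the shard size rather than on the size of the whole graph. A real system would replace the stub with a model call and would need to handle shards that hold many nodes or edges that cross shard boundaries.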

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Kommuri, Sai Charan Tej, "Agentic Graph Querying on a Sharded Architecture Using a Large Language Model", Technical Disclosure Commons, (June 07, 2026)
https://www.tdcommons.org/dpubs_series/10361

Technical Disclosure Commons

Defensive Publications Series

Agentic Graph Querying on a Sharded Architecture Using a Large Language Model
