Abstract

Using large language models (LLMs) to query large, scale, graph, structured data can present challenges, as encoding an entire graph into a single prompt may exceed context window limits and can lead to reduced accuracy on multi-hop queries. Systems and methods can partition a graph into a collection of smaller, human-readable documents, or shards. An agentic orchestration process may use a global symbolic index, contained within a summary shard, to locate and retrieve relevant shards on demand. This process can decompose a complex query into a sequence of single-hop LLM calls, where each call can operate on a context comprising one or more shards. This approach can enable an LLM to function as a query engine for graphs that are substantially larger than its context capacity, which can improve scalability and may mitigate a need for a dedicated graph database system for certain analytical workloads.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS