Abstract

Generative artificial intelligence models may face challenges when modifying components within large software systems, as they can lack the project-specific context to link a visual user interface element to its corresponding source code files. A system can utilize a multimodal retrieval-augmented generation approach where historical code revisions, which may include associated user interface images and textual descriptions, are processed with an embedding model to create a searchable vector index. At execution time, a user query, which can comprise a screenshot and a text prompt, may be used to search the index and retrieve semantically similar historical changes. This retrieved information, for example, file paths and code examples from past modifications, can then be used to augment a prompt for a downstream generative model, which may improve the accuracy and relevance of automated code modifications.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Liang, Zheng; Omogbai, Aileme; and Altenhof, Erin, "Multimodal Retrieval-Augmented Generation For Context-Aware Source Code Modification", Technical Disclosure Commons, (January 29, 2026)
https://www.tdcommons.org/dpubs_series/9240

Download

COinS

Technical Disclosure Commons

Defensive Publications Series

Multimodal Retrieval-Augmented Generation For Context-Aware Source Code Modification

Abstract

Creative Commons License

Recommended Citation

Browse

Search

Submit

Additional Information

Technical Disclosure Commons

Defensive Publications Series

Multimodal Retrieval-Augmented Generation For Context-Aware Source Code Modification

Inventor(s)

Abstract

Creative Commons License

Recommended Citation

Share

Browse

Search

Submit

Additional Information