Abstract

Spatial applications require accurate geometric measurements of physical spaces. Room geometry can typically be determined from the positions of the room's corners, and a room keypoint model can locate those corners in an image of the room. However, there is a dearth of suitably diverse images with labeled keypoints for training such models, and manual labeling does not scale because it is tedious, slow, and expensive. This disclosure describes an LLM-based agent that automates the labeling of keypoints, such as corners, in room images, enabling rapid generation of ground truth training data at scale. Given an image and a prompt specifying the task, the agent can be fine-tuned to output the pixel coordinates of the keypoints in that image. The techniques can be further enhanced by incorporating Reinforcement Learning from Human Feedback (RLHF).
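
As an illustration of how pixel-coordinate output from such an agent might be consumed, the following is a minimal sketch, not the disclosed implementation. It assumes the agent replies with a JSON list of objects carrying "label", "x", and "y" fields; that schema, the function name parse_keypoints, and the label strings are hypothetical and do not come from the disclosure. The sketch parses the reply and discards any keypoint that falls outside the image bounds, a basic sanity check before the labels are used as training data.

```python
import json
from typing import List, NamedTuple


class Keypoint(NamedTuple):
    label: str
    x: int
    y: int


def parse_keypoints(response_text: str, width: int, height: int) -> List[Keypoint]:
    """Parse a (hypothetical) JSON reply and keep only in-bounds keypoints."""
    items = json.loads(response_text)
    keypoints = []
    for item in items:
        x, y = int(item["x"]), int(item["y"])
        # Pixel coordinates must lie inside the image to be usable labels.
        if 0 <= x < width and 0 <= y < height:
            keypoints.append(Keypoint(str(item["label"]), x, y))
    return keypoints


# Assumed example reply; the second point lies outside a 640x480 image.
reply = (
    '[{"label": "floor_corner", "x": 120, "y": 455},'
    ' {"label": "ceiling_corner", "x": 700, "y": 30}]'
)
print(parse_keypoints(reply, width=640, height=480))
```

Filtering out-of-bounds points is only one possible check; a real pipeline would likely add further validation suited to its own label format.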

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Zu, Adam and Fan, Fengtao, "Large Language Model Based Automated Labeling of Keypoints in Images of Rooms", Technical Disclosure Commons, (June 04, 2025)
https://www.tdcommons.org/dpubs_series/8194

Technical Disclosure Commons

Defensive Publications Series

Large Language Model Based Automated Labeling of Keypoints in Images of Rooms
