Abstract

A system related to measuring entity perception from user-generated video content using multimodal large language models. The system implements a multi-stage data ingestion pipeline using Knowledge Graph ID filtering to identify relevant videos, processes videos through a multimodal language model to generate attribute-specific perception scores with rationales, aggregates scores across temporal intervals with coverage metrics, and generates comparative visualizations with automated statistical analysis.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Share

COinS