Abstract

The systems and methods described herein provide a natural-language-to-generative-video process that uses language classification and knowledge-graph identifiers to find frame segments, within a codex of videos, that depict the described scene. The mechanics needed to convert speech into meaningful entities that can be matched to non-text images and videos serve as the underlying basis for generative video.
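
The matching step can be illustrated with a minimal sketch. It is not the described system's implementation: the vocabulary table `ENTITY_IDS`, the `kg:` identifier strings, the `FrameSegment` record, and the overlap-count ranking are all hypothetical stand-ins, chosen only to show how text might be reduced to knowledge-graph identifiers and compared against identifiers attached to frame segments. A real system would presumably replace the lookup table with a language classifier and entity linker.

```python
from dataclasses import dataclass

# Hypothetical word -> knowledge-graph identifier table (assumption, not from the source).
ENTITY_IDS = {
    "dog": "kg:dog",
    "beach": "kg:beach",
    "runs": "kg:run",
    "running": "kg:run",
}


@dataclass(frozen=True)
class FrameSegment:
    """A span of frames in one video, tagged with the entities it depicts."""
    video: str
    start_s: float
    end_s: float
    entity_ids: frozenset


def extract_entity_ids(text):
    """Map each known word in the text to its (hypothetical) identifier."""
    tokens = (tok.strip(".,!?").lower() for tok in text.split())
    return {ENTITY_IDS[tok] for tok in tokens if tok in ENTITY_IDS}


def find_segments(text, library):
    """Return segments sharing at least one entity with the text, best overlap first."""
    wanted = extract_entity_ids(text)
    scored = [(len(wanted & seg.entity_ids), seg) for seg in library]
    matches = [(score, seg) for score, seg in scored if score > 0]
    # Stable sort: segments with equal overlap keep their library order.
    matches.sort(key=lambda pair: pair[0], reverse=True)
    return [seg for _, seg in matches]


# Toy library of pre-tagged segments (illustrative data only).
LIBRARY = [
    FrameSegment("a.mp4", 0.0, 4.0, frozenset({"kg:dog", "kg:beach"})),
    FrameSegment("b.mp4", 10.0, 12.0, frozenset({"kg:run"})),
    FrameSegment("c.mp4", 3.0, 9.0, frozenset({"kg:dog", "kg:beach", "kg:run"})),
    FrameSegment("d.mp4", 0.0, 5.0, frozenset({"kg:cat"})),
]

if __name__ == "__main__":
    for seg in find_segments("A dog runs on the beach.", LIBRARY):
        print(seg.video, seg.start_s, seg.end_s)
```

Ranking by the number of shared identifiers is the simplest possible choice here; it ignores word order, temporal continuity between segments, and any weighting of entities, all of which a generative video pipeline would likely need to address.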
