Abstract

Visual stories are a popular format for online storytelling, and visualizing text often helps readers understand a story. Existing tools can generate multimedia from user-provided text, but the generated media does not always match the input, and the images may be inconsistent in style. This disclosure describes techniques that use generative artificial intelligence to automatically generate images, animation, and audio based on user-provided text and preferences. The generated assets are combined into a visual story that has a coherent visual theme and helps viewers better understand text-based content.

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 License.

Recommended Citation

Peng, Jinyue and Anuar, Nizam, "Automatically Generating a Video Based on User Provided Text Input", Technical Disclosure Commons, (March 24, 2023)
https://www.tdcommons.org/dpubs_series/5762

Technical Disclosure Commons

Defensive Publications Series

Automatically Generating a Video Based on User Provided Text Input
