Create Animated Stories using GPT-4o-mini, Midjourney, Kling, and Creatomate API

This workflow achieves a fully automated process from text story creation to animated video generation. Users only need to input basic parameters, and the system will intelligently generate story prompts, illustrations, and dynamic videos, ultimately synthesizing a complete animated story video. This process significantly reduces the complexity and time costs associated with traditional animation production, making it suitable for the rapid generation of multimedia content such as children's stories and brand promotional videos, helping content creators and educators efficiently produce high-quality animated materials.

Tags

AnimationAutomation

Workflow Name

Create Animated Stories using GPT-4o-mini, Midjourney, Kling, and Creatomate API

Key Features and Highlights

This workflow enables a fully automated process from text-based story creation to animated video production. It leverages GPT-4o-mini to intelligently generate story scene prompts, integrates Midjourney to create multiple high-quality illustrations, utilizes the Kling API to transform static images into dynamic videos, and finally employs the Creatomate API for video composition, producing a complete animated story video. The workflow is highly automated, supporting multi-round asynchronous task status monitoring and waiting to ensure the quality and completeness of generated content.

Core Problems Addressed

Traditional animation production is complex and time-consuming. This workflow integrates multiple AI services to automate story ideation, image generation, video production, and composition, significantly lowering the barriers and time costs of content creation. It is ideal for rapid generation of children’s stories, brand promotional videos, and other multimedia content.

Application Scenarios

  • Children’s story animation production
  • Fast generation of brand marketing videos
  • Social media content creation
  • Automated production of educational and training video materials

Main Process Steps

  1. Basic Parameter Setup: Users input story characters, visual style, and contextual keywords.
  2. Story Prompt Generation: GPT-4o-mini generates segmented story text along with corresponding image prompts.
  3. Illustration Generation: Midjourney API is called to produce three key scene illustrations for the story.
  4. Illustration Status Monitoring and Retrieval: Polling is performed to check image generation status, ensuring images are complete before obtaining temporary URLs.
  5. Video Generation: Using the three illustrations, the Kling API generates three dynamic video clips.
  6. Video Status Monitoring and Retrieval: Polling checks the video generation status to confirm completion.
  7. Video Composition: The Creatomate API merges the three video clips into a complete animated story video, embedding the story title text.
  8. Final Output: The URL of the composed video is obtained, achieving fully automated end-to-end animated story generation.

Involved Systems or Services

  • GPT-4o-mini: Generates story text and image description prompts
  • Midjourney (via piapi.ai interface): Creates story illustrations
  • Kling (via piapi.ai interface): Converts static illustrations into dynamic videos
  • Creatomate: Video template composition and final video generation
  • n8n: Workflow orchestration and task status management

Target Users and Value

  • Content creators and animators seeking rapid generation of high-quality animated story assets
  • Marketing and branding teams aiming to improve video content production efficiency
  • Educators automating the creation of instructional animations
  • AI enthusiasts and developers exploring automated creative workflows integrating multiple AI tools

By deeply integrating various AI generation services, this workflow greatly simplifies the production pipeline from story conception to finished animation, enabling users to efficiently produce creative and professional animated content.

Recommend Templates

Dsp Agent

This workflow is triggered by Telegram messages and provides intelligent voice-to-text functionality, combined with advanced language models for signal processing and learning assistance. It can answer theoretical questions, assist with calculations, and query Wikipedia, offering a personalized learning experience. Additionally, it tracks users' learning progress, integrates with an Airtable database, supports content creation and email management, helping students and professionals efficiently solve challenges in their learning process, thereby enhancing comprehension and learning outcomes.

Intelligent Q&ASpeech to Text

Image-Based Data Extraction API using Gemini AI

This workflow utilizes a Webhook interface to intelligently extract information from images. Users only need to provide the image URL, which will be automatically downloaded and converted to Base64 format, allowing for efficient text recognition using Google Gemini AI. The extracted content can be flexibly configured and is ultimately output in a structured JSON format, facilitating subsequent system integration. This solution simplifies the traditional image text extraction process, enhancing accuracy and automation, and is suitable for data processing of various types of documents, financial receipts, and forms.

OCRData Extraction API

French Text-to-Speech and English Audio Generation Workflow

This workflow automatically converts French text into French speech, transcribes the generated audio into text, then translates it into English, and finally generates an English audio file. By combining high-quality text-to-speech and speech-to-text services, it automates the processing of multilingual content, enhancing the efficiency of language learning, content creation, and cross-national communication. It is suitable for various scenarios, including education, creative work, and translation.

Speech SynthesisMultilingual Translation

Vector DB Loader from Google Drive

This workflow is designed to automatically download and process PDF, plain text, and JSON files from Google Drive. It converts these files into vector data using OpenAI's text embedding model and stores them in the PGVector vector database within a Postgres database. This process enables efficient management and retrieval of documents, while automatically archiving processed files, thereby enhancing work efficiency and automation. It is suitable for data engineers, knowledge management teams, and research institutions.

Vector ManagementGoogle Drive Automation

My workflow 6

This workflow implements an intelligent AI chatbot through Slack's Slash commands, capable of receiving user requests and invoking the OpenAI GPT-4o-mini model to generate real-time responses. It supports the handling of multiple commands simultaneously, automating responses to reduce manual workload, while integrating Webhook and LangChain technologies to enhance contextual understanding in conversations. It is suitable for internal communication within enterprises, customer support, and other scenarios, aiming to improve communication efficiency and provide a flexible intelligent interaction experience.

Smart ChatbotSlack Integration

Travel Planning Agent with Couchbase Vector Search, Gemini 2.0 Flash, and OpenAI

This workflow is an intelligent travel planning assistant that combines large language models and vector search technology to quickly provide personalized travel recommendations to users. Users can interact with the AI agent through chat to obtain precise travel suggestions based on points of interest data. The workflow supports batch data insertion and efficient retrieval, addressing the issues of information fragmentation and low query efficiency commonly found in traditional travel planning. It is suitable for travel service platforms, travel agencies, and related application scenarios.

Smart TravelVector Search

AI Agent for Realtime Insights on Meetings

This workflow automatically joins online meetings through an intelligent assistant, enabling real-time voice transcription to accurately capture and organize meeting dialogues. By leveraging AI technology, it can perform intelligent analysis and generate notes based on keywords, while storing structured data for easy retrieval later. This solution significantly enhances the efficiency and accuracy of meeting records, making it suitable for remote teams, project management, and automatic generation of meeting minutes across various industries, thereby facilitating team collaboration and information transparency.

Smart MeetingReal-time Transcription

Image Generation API

This workflow receives user text prompts in real-time through a Webhook interface and utilizes OpenAI's image generation API to create corresponding images. Users simply need to paste the URL with the prompt into their browser to quickly obtain the AI-generated image. The entire process is automated and responsive. It simplifies the complex traditional image generation process, allowing users to easily create without writing code, making it suitable for various scenarios such as designers, content creators, and developers.

AI Image GenWebhook API