← Back to Personal Project

IA Automation Project Overview

🚀 Try the application here: https://video-automation.littlenuage.com/

This project automates the creation of narrated videos from a script, leveraging AI for image generation, voice synthesis, and video assembly. The workflow is fully automated: you provide a script, and the system generates images, audio, and assembles everything into a video.

Project Context

Developed as a personal project to streamline video content creation, this system enables rapid production of professional-quality narrated videos without manual intervention. The application is deployed online and accessible via a web interface, allowing anyone to generate videos from text descriptions.

Overview

The pipeline takes a written script, splits it into scenes, generates images for each scene using Stable Diffusion, creates narration with Coqui TTS, and then assembles everything into an animated video (slow zoom on each image, audio synchronization).

What I Developed

Core Features

1. Automated Video Generation Pipeline

2. Web Application (Flask)

3. Advanced Processing Tools

Script Generation & Verification:

Image Processing:

Audio Processing:

Video Processing:

4. Integration & Deployment

5. Music & Audio Enhancement

Technical Architecture

Backend Stack:

Key Integrations:

Processing Categories:

Innovation & Features

Intelligent Scene Management:

Quality Control System:

Scalability:

Customization:

File Organization

.
├── generate-script.py
├── generate-image.py
├── generate-audio.py
├── generate-animation.py
├── generate-video-image.py
├── input/
│   └── Audio/
│       └── Enregistrement-francais.wav
├── output/
│   ├── Script/
│   ├── Image/
│   ├── Audio/
│   │   └── scenes/
│   ├── Video/
└── stable-diffusion-webui/

Workflow

1. Script Preparation

[
  {
    "scene": 1,
    "texte": "Le soleil se lève sur la ville.",
    "prompt_image": "A beautiful sunrise over a city skyline, vibrant colors."
  }
]

2. Image Generation

3. Audio Generation

4. Animation & Video Editing

5. Final Assembly

Scripts

Stable Diffusion WebUI

Prerequisites

Installation:

pip install -r stable-diffusion-webui/requirements.txt
pip install moviepy requests TTS

Troubleshooting

Development Challenges & Solutions

Challenge 1: Audio-Visual Synchronization

Challenge 2: AI Model Integration

Challenge 3: Quality Consistency

Challenge 4: Scalability

Use Cases & Applications

Future Enhancements

Performance Metrics

Resources & Credits

License

See stable-diffusion-webui/LICENSE.txt for the WebUI license. Other scripts are under your chosen license.


Try It Yourself!

Access the application: https://video-automation.littlenuage.com/

  1. Write your script with scene descriptions
  2. Provide image prompts for each scene
  3. Upload a voice reference (optional)
  4. Click generate and wait for your video!

IA Automation Workflow