Project · 2026

Viral Faceless Shorts Generator

A fully containerized AI-assisted media pipeline that turns trending topics or custom prompts into rendered short-form videos through script generation, human approval, TTS, subtitle alignment and FFmpeg composition.

DockerGeminiPiper TTS

Why it exists

The project explores how far an automated content pipeline can go while still keeping human control before final rendering. It is not just a video generator: it is an orchestration experiment that connects trend detection, script generation, voice synthesis, forced alignment and video composition.

Project media

Architecture

  • A web interface lets the user choose a Google Trends category, custom topic or custom prompt.
  • A Gemini-powered service generates title, description and script.
  • The script is manually reviewed and optionally edited before rendering.
  • Piper TTS generates the voice track.
  • Aeneas aligns subtitles with the produced speech.
  • FFmpeg combines background footage, voiceover and subtitles into the final video.
  • Nginx fronts the web interface and reverse proxies the internal services.

Key features

  • Trending topic detection via Google Trends.
  • Prompt-based script generation.
  • Manual approval before rendering.
  • TTS with Piper.
  • Forced subtitle alignment with Aeneas.
  • Selectable output orientation.
  • SQLite-backed settings and configurable server-side output folder.

Design decisions

The most important design choice is the human-in-the-loop step. Rendering starts only after the generated script is approved, which makes the pipeline safer, more controllable and more useful than a fully blind generator.

Links