ContentMachine
๐ฅ An all-in-one AI pipeline for creating cinematic, documentary-style videos from a single topic to a fully packaged (almost)YouTube-ready project.
Install / Use
/learn @Saganaki22/ContentMachineREADME
ContentMachine
<div align="center">โค๏ธ Support This Project
An all-in-one AI pipeline for creating cinematic, documentary-style videos โ
from a single topic to a fully packaged YouTube-ready project.
The Pipeline ยท Models & APIs ยท Real-World Cost ยท Getting Started ยท Features
</div>๐ฅ Watch the Example
<p align="center"> <a href="https://www.youtube.com/watch?v=3BC8OXMzeF4"> <img src="https://img.shields.io/badge/โถ_Watch_Full_Demo-YouTube-FF0000?style=for-the-badge&logo=youtube" alt="Watch Demo on YouTube"> </a> </p>UI Demo
https://github.com/user-attachments/assets/2e440220-d31d-41e7-9adc-242fc97bd06b
API Status: I've personally tested this with Replicate and Gemini APIs โ those are the battle-tested paths. fal.ai & elevenlabs support is implemented but not fully verified โ it may have rough edges. PRs welcome!
What is ContentMachine?
ContentMachine automates the entire documentary video production workflow using state-of-the-art AI. Give it a topic, and it handles everything: researching real historical stories, planning scenes, generating images, creating video clips, writing narration scripts, generating voiceover audio, YouTube metadata, and thumbnails โ all packaged into a clean ZIP ready for your video editor.
Built for content creators, documentarians, educators, and hobbyists who want to produce high-quality, cinematic content without a full production team.
I built this as a personal all-in-one pipeline โ easy enough to run locally, flexible enough to swap AI providers, and powerful enough to produce publish-ready assets in one session.
The Pipeline
ContentMachine runs a step-by-step pipeline with a clean UI to monitor, pause, and resume at any stage.
Topic Input
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 1. STORY GENERATION โ
โ LLM finds 4 real, documented historical stories โ
โ with cinematic potential โ you pick one โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 2. SCENE PLANNING โ
โ LLM builds a full cinematic shot list with โ
โ smart pacing: durations adapt per video model โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 3. IMAGE GENERATION โ
โ 4 variations per scene (establishing, intimate, โ
โ detail, atmospheric) โ select the best one โ
โ All images saved as real PNG/JPG files in ZIP โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 4. VIDEO GENERATION โ
โ Image-to-video, 2 scenes at a time โ
โ Multiple models available โ select best clip โ
โ Browse previous versions with โ โ arrows โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 5. AUDIO (optional) โ
โ ElevenLabs TTS narration + SFX per scene โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ 6. EXPORT โ
โ YouTube metadata ยท multi-select thumbnails โ
โ Full ZIP: videos + images/selected + images/all โ
โ + audio + script + restorable project.json โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
Visual Style
The default aesthetic uses seamless glossy porcelain mannequins โ figures always fully clothed in period-accurate outfits including explicitly named footwear (e.g. "iron-buckled brown leather knee boots"), no visible joints, stands, or supports. Photorealistic environments, ray tracing, cinematic lighting. A great starting point for YouTube-focused creators since it avoids depicting realistic scenes that may have been altered.
The visual style is fully customisable: expand Advanced โ Customize System Prompts on the start page to edit the image prompt rules for any character type. Pair this with the Character Base Images feature (see below) to lock in a consistent look across every scene.
Supported Models & APIs
Note: Replicate and Gemini are the tested providers. fal.ai is a work in progress โ contributions welcome.
LLM โ Story, Scene Planning, Scripts, Metadata
| Provider | Models | |---|---| | fal.ai (WIP) | Claude 3.5 Sonnet | | Gemini (direct) | Gemini 3 Flash (recommended), Gemini 3.1 Pro, Gemini 3 Pro, Gemini 2.5 Flash, Gemini 2.5 Pro | | Replicate | Gemini 2.5 Flash, Gemini 3 Flash, Gemini 3.1 Pro, Claude 3.5 Sonnet |
Image Generation
| Provider | Models | |---|---| | fal.ai (WIP) | Flux Pro, Flux 2 Pro, Flux Schnell, Nano Banana Pro, Qwen Image 2512, Z-Image Base, Ideogram V3, SD 3.5 Large | | Replicate | Flux 2 Pro, Flux 1.1 Pro, Nano Banana Pro (Gemini), Imagen 4 | | Gemini (direct) | Gemini 3 Pro Image Preview (2K native output) |
Video Generation
| Provider | Model | Notes | |---|---|---| | fal.ai (WIP) | LTX-2 image-to-video | Not fully verified | | Replicate | LTX-2 Pro | With generated audio, 6โ10s | | Replicate | LTX-2 Fast | 6โ20s in 2s steps, favours 12โ20s | | Replicate | Kling v3 | 3โ15s, standard/pro mode, AI audio | | Replicate | Kling v2.5 Turbo Pro | 5s or 10s only |
Audio / TTS
| Provider | Capability | |---|---| | ElevenLabs | Scene-by-scene narration voiceover + SFX generation | | Local TTS | Bring your own (QWEN TTS, Kokoro, etc.) โ zero cost |
Real-World Cost
A 4:30 minute documentary video produced with ContentMachine cost me approximately $28 USD.
| Component | Provider / Model Used | Notes | |---|---|---| | Story + Scene Planning + Scripts | Gemini 3 Flash Preview (Gemini API) | Very cheap | | Scene Images + Thumbnail | Nano Banana Pro / gemini-3-image-preview (Replicate) | Medium | | Video Clips | LTX-2 Pro (Replicate) | Largest cost driver | | Narrator TTS | QWEN TTS (local) | Free |
Tips to reduce cost:
- Use
gemini-2.5-flash(non-preview) for LLM โ higher quota, fewer rate limits - Use fal.ai LTX-2 instead of Replicate LTX-2 Pro for cheaper video (once fal.ai is fully verified)
- Use Flux Schnell for faster, cheaper image generation
- Use a free local TTS tool for zero audio cost
- Use LTX-2 Fast on Replicate for longer scenes at a similar price point
Getting Started
Prerequisites
- Node.js 18+
- API keys for at least one LLM provider and one image provider
Install & Run
# Clone
git clone https://github.com/Saganaki22/ContentMachine
cd ContentMachine
# Install all dependencies
npm install
# Start both backend and frontend
npm run dev
App runs at http://localhost:5173. Backend API at http://localhost:3000.
Configure API Keys
Open the Settings panel (gear icon, top right). Paste your API keys โ they are saved in your browser's localStorage and automatically pushed to the backend on each session startup. No .env file required for local use.
| Provider | Link | |---|---| | fal.ai | fal.ai/dashboard/keys | | Replicate | replicate.com/account/api-tokens | | Gemini | aistudio.google.com/api-keys | | ElevenLabs | elevenlabs.io/app/settings/api-keys |
Build for Production
npm run build
npm run start
Project Structure
ContentMachine/
โโโ backend/
โ โโโ server.js Express API server (200mb body limit)
โ โโโ routes/
โ โโโ claude.js LLM: stories, scene plans, prompts, scripts, metadata
โ โโโ images.js Image generation: fal.ai / Replicate / Gemini
โ โโโ videos.js Video generation + status polling
โ โโโ elevenlabs.js TTS narration + SFX generation
โ โโโ thumbnail.js Thumbnail image generation
โ โโโ export.js ZIP packaging (streams to browser)
โ โโโ session.js Auto-save sessions to output/ folder
โ โโโ settings.js
Related Skills
bluebubbles
340.2kUse when you need to send or manage iMessages via BlueBubbles (recommended iMessage integration). Calls go through the generic message tool with channel="bluebubbles".
async-pr-review
99.4kTrigger this skill when the user wants to start an asynchronous PR review, run background checks on a PR, or check the status of a previously started async PR review.
ci
99.4kCI Replicate & Status This skill enables the agent to efficiently monitor GitHub Actions, triage failures, and bridge remote CI errors to local development. It defaults to automatic replication
code-reviewer
99.4kCode Reviewer This skill guides the agent in conducting professional and thorough code reviews for both local development and remote Pull Requests. Workflow 1. Determine Review Target

