The pipeline

Twelve steps from idea to upload.

Every video runs through the same durable, retryable pipeline orchestrated by Trigger.dev. Each step is a separate AI model chosen because it’s the best one for that job.

01

Ideation

Llama 3.3 70B (Replicate)

We feed your channel theme into an LLM tuned for hook-first YouTube ideas. It returns titles that sound like they earned their click.

02

Research

Perplexity sonar-pro

Before the script gets written, we pull a real research dossier — quotes, dates, deal sizes, sources. No LLM-hallucinated 'fun facts.'

03

Script

Llama 3.3 70B

An 8+ minute long-form script with a strong cold-open, 12+ scene markers, and a paid-off ending. Tagged for shotting and voiceover chunking.

04

Storyboard

Llama 3.3 70B

Each scene gets an image prompt, a duration, and the voiceover line that plays over it. This is the manifest the rest of the pipeline runs against.

05

Images

gpt-image-2 (fal.ai)

Twelve+ cinematic key frames at 16:9, each anchored to its scene. We retry with sanitized prompts if a frame refuses.

06

Clips

Seedance 2.0 image-to-video (fal.ai)

Each storyboard frame is animated into a ~5 second motion shot. Camera moves, parallax, subject motion — all driven from the image.

07

Transitions

Veo 3.1 first-last-frame-to-video (fal.ai)

Between every pair of clips we generate a tied transition that takes the last frame of one shot and the first frame of the next. No hard cuts.

08

Voiceover

ElevenLabs v3 (fal.ai)

The script is voiced in the channel's chosen voice. Per-scene chunks so we can sync timing with the storyboard.

09

Compose

ffmpeg on Fly.io

Clips, transitions, and voiceover are stitched into a single MP4 with loudness normalization and a clean master audio bus.

10

Thumbnails

gpt-image-2

Long-form thumbnail in 16:9 and a portrait variant for shorts — each with hook text designed to earn the click, not describe the video.

11

Shorts

Llama 3.3 + ffmpeg

The script's most replayable beats are identified, cut from the rendered assets in portrait, and captioned. 3–5 shorts per long-form, every time.

12

Upload & A/B

YouTube Data API

Long-form and shorts go up on your channel. A scheduled job rotates thumbnails every 48h for the first two weeks and crowns the winner by CTR.