Seven gated stages from raw Tella footage to verified final video. Every stage has a hard gate — no skipping the storyboard, no shipping without QA.
Seven stages. Each has a gate that must pass before advancing. The storyboard is mandatory — you cannot build overlays without one.
Four skills with clear responsibilities. The orchestrator loads dependencies in the right order and delegates to the right skill at each stage.
Single entry point for all video editing. Gates each stage, loads dependencies in order (brand.md → DESIGN.md → motion-editing → video-use), tracks pipeline state per project. Prevents skipping storyboard.
Defines HOW to edit — cut cadence, zoom behavior, B-roll strategy, overlay choreography. Runs the 5-checkpoint storyboard flow. Prescribes creative direction.
MEASURES and scores rendered video against 26 quality checks. Produces structured pass/fail reports with timestamps and fix recommendations. Audits, not prescribes.
Token LOOKUP for exact hex codes, fonts, type scale, spacing, easing, shadows. The reference book — you ask it what color ember is, not whether a composition passes QA.
What each stage does, what gates it, and who it delegates to.
Identify source material, set energy level, check prerequisites. Creates the pipeline state file.
Analyze transcript, propose visual treatment per phase — hook, sections, transitions, payoff. User approves each phase.
Extract every B-roll reference from storyboard. Classify as captured, needs agent-browser, needs screen record, or needs design work.
Design each overlay with exact brand tokens — content, style, animation type, timing verified against SRT timestamps.
Assemble approved storyboard + B-roll + overlay specs into a complete Edit Decision List with sync-verified timestamps.
src_start = AUDIO_START + out_start). User reviewed segment map.
Build HyperFrames compositions, render to MP4/PNG, composite onto base video with ffmpeg, mix audio.
amix normalize=0 — voice 100%, music 6%, SFX 40%Run the full 26-point quality checklist. Flag failures with timestamps. Fix → re-render → re-check until all pass.
23 production-validated editorial rules. Each came from a real correction or confirmed approach.
Motion graphics are fullscreen transitions that replace the frame. Never floating cards on active demos.
Every fullscreen uses pixel emblems + whiteboard sketches + eyebrow pills. Missing one = incomplete.
Exactly one ember element. The stat, OR the CTA, OR the highlight word. Never multiple.
SFX fires only on motion graphic transitions. No clicks on static assets, no SFX on TH cuts.
HyperFrames WebM has no alpha channel. Render to PNG sequence, feed to ffmpeg.
Extract and check frames at transitions. Exit code 0 is not verification.
Every card gets SVG illustration areas — pixel art, whiteboard sketches. Text-only = rejected.
Inline :root CSS vars in every composition. External tokens.css can fail to load.
Every composition start matches word-level SRT timestamp. Never guess timing.
Hard-won lessons from production. Ignore at your peril.
--format=webm outputs yuv420p despite docs claiming transparency. Use --format=png-sequence for overlays, then feed PNGs directly into ffmpeg.
amix divides volume by input count. Always use normalize=0. Set levels explicitly: voice 100%, music 6%, SFX 40%.
src_start must equal AUDIO_START + out_start. Using raw SRT timestamps diverges as soon as B-roll shifts the output timeline.
raw/ until project is archived.