The Scroll
April 3, 2026 · Corey Segall · 12 min read · Behind the Scenes

Kill the Grind, Keep the Magic

AI is not creating our content. It is extracting, organizing, and unlocking the content we already made. Every word in this system was spoken by a creator. AI just made sure none of them were lost. This is the story of how we built it, why we built it the way we did, and what it means for every actual play production sitting on hundreds of hours of unorganized content.

88 Episodes · 200+ Hours · 401 Output Files · $131 Total Cost

The Critical Distinction

Before anything else, we need to be clear about what ethical AI use in creative production looks like, and what it does not.

Ethical AI is not writing your story for you. It is not inventing lore that doesn't exist. It is not replacing the creator's voice or making creative decisions. What it is doing: extracting what you already created, organizing it into a universe bible, correcting what machines mangled, and making it searchable and reusable. The result is that fans can browse your world, new audiences can discover it, your team can build on it, and your IP becomes an asset instead of a memory.

This distinction guided every decision we made in building the pipeline. The creators spoke every word. The GM built every world detail. The players made every choice. AI processed the volume. Humans ensured the quality. At no point did the system generate fiction, invent characters, or make narrative decisions. It listened to what was already there and organized it so nothing would be lost.

Your Content Is Trapped

Gold, Green and Red is an 88-episode live-play D&D campaign spanning three years and 200+ hours of improvised storytelling. Six to eight players plus a GM, every session different, lore generated organically across hundreds of hours. None of it was written down. The entire world existed in YouTube recordings and the memories of the people at the table.

Character names were mangled in every auto-caption: "Juramentum" became "Gerimentum," "Merick" became "America," "Quaylithon" became "Queen Lathon." YouTube achieved only 38% speaker attribution. Lore was buried across seasons, unsurfaced and unsearchable. World-building lived in the memories of the cast. Sound familiar?

Manual processing at the old rate required 2.5 to 3.5 hours per episode. For 88 episodes, that meant 220 to 308 hours of human labor. That is 6 to 9 months of a creator's life spent on production grind instead of creative work.
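The arithmetic behind that range is easy to check. The short sketch below uses only the figures quoted above; the function and variable names are illustrative.

```python
# Figures quoted in the article.
EPISODES = 88
HOURS_PER_EPISODE = (2.5, 3.5)  # manual processing, low and high estimate


def manual_hours(episodes, per_episode_range):
    """Return the (low, high) total hours of manual work."""
    low, high = per_episode_range
    return episodes * low, episodes * high


low, high = manual_hours(EPISODES, HOURS_PER_EPISODE)
print(f"{low:.0f} to {high:.0f} hours of manual work")
```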

From Raw Content to Source of Truth

The pipeline we built chains four tools into a nine-step repeatable workflow. AI extracts and organizes what creators already built. One source of truth then feeds every format your audience touches: the Universe Hub, the World Setting Book, draft animation scripts, episode summaries, fan-facing content. Same canon, same spelling, same truth, different formats for different audiences.

The creator's content is the source. AI is the librarian. The audience is the beneficiary.

Steps 1-2: Acquisition and Transcription

Each episode is downloaded from YouTube using yt-dlp, preserving original audio quality. The MP4 is submitted to AssemblyAI's transcription engine with speaker diarization enabled. For a typical three-hour episode with eight distinct voices, the engine produces 1,200 to 1,900 attributed utterance blocks. Cost: $0.51 per episode. Compare that to YouTube auto-captions, which managed only 38% speaker attribution and mangled proper nouns.
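For readers who want to see what the transcription step might look like in code, here is a minimal sketch using AssemblyAI's Python SDK. It is not the production script, which this article does not reproduce. It assumes the file has already been downloaded, that the `assemblyai` package is installed, and that you supply your own API key. The `to_blocks` helper and its dictionary layout are hypothetical, chosen only to show the shape of an attributed utterance block.

```python
def to_blocks(utterances):
    """Flatten utterance objects into plain dicts for the later steps.

    Each utterance is expected to expose .speaker, .start (milliseconds)
    and .text. The dict layout is an assumption for illustration.
    """
    return [
        {"speaker": u.speaker, "start_ms": u.start, "text": u.text}
        for u in utterances
    ]


def transcribe_with_speakers(media_path, api_key):
    """Submit one local file to AssemblyAI with speaker diarization on."""
    # Imported here so the helper above works without the package installed.
    import assemblyai as aai  # third-party: pip install assemblyai

    aai.settings.api_key = api_key
    config = aai.TranscriptionConfig(speaker_labels=True)
    transcript = aai.Transcriber().transcribe(media_path, config=config)
    if transcript.status == aai.TranscriptStatus.error:
        raise RuntimeError(transcript.error)
    return to_blocks(transcript.utterances)
```

At this stage the speaker labels are still anonymous letters. Turning them into names is the job of the next step.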

Step 3: Speaker Mapping (Human Gate)

This is the most critical gate in the entire pipeline. AI proposes a speaker map based on evidence: "Speaker A = Titus, evidence: 'my grandfather founded...'" The human confirms, corrects, or flags. This takes approximately two minutes per episode. It catches split speakers, absent players, and NPC absorption into player labels. No downstream deliverable works if this step is wrong, so a human does it every time.
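One way to picture the gate is as a small data structure: the AI fills in proposals, and nothing reaches the final map without a human decision. The sketch below is an illustration, not the pipeline's actual code. The `Proposal` class, the `review` function, and the three action names are assumptions.

```python
from dataclasses import dataclass


@dataclass
class Proposal:
    label: str     # anonymous diarization label, e.g. "A"
    name: str      # name the AI proposes for that label
    evidence: str  # transcript quote supporting the proposal


def review(proposals, decisions):
    """Apply human decisions to AI proposals.

    decisions maps label -> ("confirm", None), ("correct", new_name),
    or ("flag", None). Labels with no decision are flagged, so nothing
    slips through unreviewed.
    """
    speaker_map, flagged = {}, []
    for p in proposals:
        action, value = decisions.get(p.label, ("flag", None))
        if action == "confirm":
            speaker_map[p.label] = p.name
        elif action == "correct":
            speaker_map[p.label] = value
        else:
            flagged.append(p.label)
    return speaker_map, flagged
```

Defaulting to "flag" is the design choice that matters here: an unreviewed label stops the episode instead of passing a guess downstream.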

Step 4: AI Processing

With validated speaker mapping, Claude processes each transcript through multiple parallel passes. Every utterance is classified into one of eight content types: in-character dialogue, in-character action, narration, lore exposition, out-of-character table talk, stream content, game mechanics, or transitional. A canonical spelling dictionary with 200+ correction patterns is applied automatically, turning "America" back into "Merick" and "Gerimentum" back into "Juramentum." Gold nuggets are extracted: journal entries preserved verbatim, key reveals, character-defining moments. New lore is flagged for Bible update.
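The spelling dictionary is the easiest part to illustrate. The sketch below assumes the dictionary is a plain mapping from mangled form to canonical form and applies it with whole-word regular expression matching. The three entries are the examples quoted earlier in this article; the function names are hypothetical, and the real dictionary holds 200+ patterns.

```python
import re

# Mangled caption spelling -> canonical spelling (examples from this article).
CORRECTIONS = {
    "Gerimentum": "Juramentum",
    "America": "Merick",
    "Queen Lathon": "Quaylithon",
}


def build_pattern(corrections):
    """Compile one whole-word pattern covering every mangled form."""
    # Longest keys first so multi-word forms win over shorter overlaps.
    keys = sorted(corrections, key=len, reverse=True)
    return re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")


def apply_corrections(text, corrections=CORRECTIONS):
    """Return (corrected_text, number_of_replacements)."""
    pattern = build_pattern(corrections)
    return pattern.subn(lambda m: corrections[m.group(1)], text)
```

A rule like "America" to "Merick" is only safe because the setting has no real-world America in it. That is the kind of call a human makes when approving a pattern.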

The OOC classifier started at a 27% false positive rate and improved to 0% through iterative learning. Each false positive became a permanent learned exception. "The table" means in-world furniture, not the game table. "Thank you" is spoken between characters, not between players. "Candle" refers to the in-world Candle of Tales, not the stream prop. The system gets smarter with every episode processed.
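The learned-exception idea can be shown with a deliberately tiny model. The actual classification is done by Claude, not by keyword matching, so treat the sketch below as a toy: the cue list, the function names, and the phrase-level exception set are all assumptions made for illustration.

```python
# Hypothetical cue phrases that might suggest out-of-character talk.
OOC_CUES = ("the table", "thank you", "candle")


def flag_ooc(text, learned_exceptions):
    """Return the cue phrases that still mark this utterance as possible OOC."""
    lowered = text.lower()
    return [
        cue
        for cue in OOC_CUES
        if cue in lowered and cue not in learned_exceptions
    ]


def confirm_false_positive(cue, learned_exceptions):
    """Record an exception. Call this only after a human confirms the flag was wrong."""
    learned_exceptions.add(cue)
```

Each confirmed false positive shrinks the set of phrases that can trigger a flag, which is how the false positive rate falls as more episodes are processed.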

Steps 5-6: Validation and Lore Bible (Human Gates)

Every OOC-flagged block is reviewed by the human validator. Every lore discovery is cross-referenced against the Master Lore Bible and either confirmed, corrected, or flagged for arbitration by the show creator. The Bible evolved through nine major versions, growing from a simple correction dictionary into a 108KB, 13-section world-setting reference: 88 NPCs with descriptions, 14 deities with domains, 63 locations with history, 171 lore entries cataloged, 75 journal entries indexed, and 200+ spelling correction patterns. Single source of truth.

Steps 7-8: Universe Hub and Summaries

The Universe Hub is the fan-facing output: an interactive universe compendium that opens in any browser with zero dependencies. 88 NPCs, 171 lore entries, 63 locations, 75 journals, 88 episode pages. Search, filter by arc, click through characters. A new team member onboards in minutes instead of weeks.
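The Hub itself runs in the browser, but its search-and-filter behavior can be sketched in a few lines of Python. Everything here is an assumption for illustration: the `Entry` fields, the `search` function, and the matching rule (case-insensitive substring match on name and text, with optional arc and kind filters).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    kind: str  # e.g. "npc", "lore", "location", "journal", "episode"
    name: str
    arc: str   # story arc the entry belongs to
    text: str  # description shown on the entry's page


def search(entries, query="", arc=None, kind=None):
    """Return entries matching the query, optionally limited by arc and kind."""
    q = query.casefold()
    return [
        e
        for e in entries
        if (arc is None or e.arc == arc)
        and (kind is None or e.kind == kind)
        and (q in e.name.casefold() or q in e.text.casefold())
    ]
```

An empty query with only an arc filter returns everything in that arc, which covers the browsing case as well as the search case.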

Narrative summaries are written for 86 episodes. Not recaps. Original prose that captures the emotional weight of each session. They serve triple duty: reference for the production team, marketing content to attract new viewers, and foundation material for a potential novelization.

Step 9: The Correction Cascade

When new canonical information is confirmed in episode 86, corrections cascade backwards through every previously processed file. The project executed 567 retroactive corrections across 165 files in a single automated pass. A stack of printed scripts could never be corrected this way. The digital-first approach means the entire corpus improves as knowledge grows.
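A cascade of this kind can be sketched as a single pass over every processed file that counts both the replacements made and the files touched. The version below works on an in-memory mapping of filename to text so it stays self-contained. The `cascade` function and its return shape are assumptions, not the project's actual tooling.

```python
import re


def cascade(files, corrections):
    """Apply every correction to every file in one pass.

    files maps filename -> text; corrections maps old spelling -> canonical.
    Returns (updated_files, total_corrections, files_changed).
    """
    if not corrections:
        return dict(files), 0, 0
    # Longest keys first so multi-word forms win over shorter overlaps.
    keys = sorted(corrections, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")

    updated, total, changed = {}, 0, 0
    for name, text in files.items():
        new_text, count = pattern.subn(lambda m: corrections[m.group(1)], text)
        updated[name] = new_text
        total += count
        if count:
            changed += 1
    return updated, total, changed
```

Returning the two counts alongside the updated text is what makes a figure like "567 corrections across 165 files" reportable after each run.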

AI Proposes. Human Decides.

Three human gates in every episode. Speaker mapping, OOC validation, lore arbitration. Approximately 5 to 7 minutes of human work per episode. Everything else is automated. But those 5 to 7 minutes are where creative authority lives. The system improves, but the human directs the improvement. Each false positive becomes a permanent learned exception, but only after a human confirms it.

AI processes the volume. You ensure the quality. AI remembers every word across 200 hours. You know which words matter.

The Compounding Effect

This is what most people miss about the pipeline. It does not just save time on one task. Each step feeds the next, and the value compounds.

Raw recordings become transcripts. Transcripts become a Master Lore Bible. The Bible becomes a 176-page, 48,000-word World Setting Book, written in days instead of months. The transcripts become 42 draft animation scripts representing 126 hours of production-ready dialogue for animation houses. The Bible feeds the Universe Hub, which fans can browse and new audiences can discover.

Without the pipeline, producing all of these outputs manually would require approximately 1,200 hours of human labor, roughly 15 months of a creator's life. The pipeline compressed that into weeks.

97% Time Reduction · 5 min Human Work / Episode · 1,200 hr Manual Equivalent · 0% Final Error Rate

What the Pipeline Built

42 draft animation scripts: 126 hours of speaker-attributed, timestamped dialogue ready for animation houses. 86 narrative episode summaries. The Master Lore Bible v9.0: 108KB, single source of truth. A 176-page World Setting Book at 48,000+ words. The Darkeport Universe Hub: 88 NPCs, 171 lore entries, 63 locations, 75 journals, fully searchable. A spelling dictionary with 200+ correction patterns that propagates retroactively. 401 total files, 44MB of structured content.

Start This Weekend

You do not need to be a developer. You do not need to surrender creative authority. You need three episodes, a free transcription tier, and a willingness to spend two minutes validating speaker maps.

Pick three episodes. Download them with yt-dlp (free). Transcribe with AssemblyAI ($50 free tier). Validate speaker maps: two minutes each. Process with Claude: classify, correct, extract. Start your Bible. Even v1.0 is better than zero. It compounds with every episode you process.

The full framework and live walkthrough will be presented at GenCon 2026. If you want to see it in action, find us there.

AI does not replace the creator. It kills the grind. And keeps the magic.

Explore the Result

The Universe Hub is live. Search every character, location, and event across 88 episodes of Gold, Green and Red.