edytlab logoedytlabGitHub
All posts
podcast editor
AI podcast
transcription
Whisper

AI Podcast Production: Record, Edit, Transcribe, and Export in One Session

May 13, 20268 min read

The typical podcast post-production workflow takes 4–6 hours per episode. Here is how to use an AI audio editor to collapse that to under 30 minutes without sacrificing quality.

Podcast post-production is one of the most repetitive audio editing tasks that exists. Every episode involves the same operations: noise removal, silence trimming, level normalization, music bed mixing, chapter markers, and export. AI audio editors can automate each of these without requiring you to learn a DAW.

The Traditional Podcast Workflow (and Why It Takes So Long)

A typical solo-host podcast episode runs 30–60 minutes. Editing a raw recording to a publishable episode in a traditional DAW involves:

  • Manual review of the waveform to find and cut long pauses.
  • Noise gate or spectral repair to remove room noise and HVAC hum.
  • Loudness normalization to LUFS broadcast targets (-16 LUFS for Spotify, -19 LUFS for Apple Podcasts).
  • Music intro/outro mixing with level automation.
  • Export to MP3 at appropriate bitrate with ID3 tags.
  • Show notes generation from timestamps.

Each of these steps requires different tools, different knowledge, and careful listening. It adds up to 4–6 hours of editing for a 1-hour episode for most solo producers.

The AI-Augmented Workflow

An AI audio editor with natural language control can replace most of this with a single session. Here is a realistic workflow using edytlab:

Step 1: Load the Raw Recording

Drag your WAV file into the session or type "load episode-045-raw.wav". The agent adds it as the first track. If you have a separate music bed file, load that too.

Step 2: Transcribe and Review

Type "transcribe track 1". The agent calls Whisper locally — no upload, no API key for transcription needed — and returns a word-level transcript with timestamps. You can now see exactly where filler words, long silences, and retakes are without scrubbing the waveform.

Whisper large-v3 runs entirely on-device in edytlab. A 60-minute audio file transcribes in approximately 4–8 minutes on a modern laptop, depending on hardware. The transcript is word-level timestamped and stored in the session.

Step 3: Describe the Edits

With the transcript in hand, describe what you want: "Cut all silences longer than 1.5 seconds. Remove the section between 12:30 and 13:45 — that was an off-topic tangent. Normalize to -16 LUFS." The agent executes each operation as a tool call against the session DAG.

Step 4: Mix Music Beds

Load your intro/outro music: "Add intro.wav to track 2, crossfade into the speech at 0:08, and duck the music under the speech to -18 dB". The agent handles the volume automation and crossfade geometry. You can preview immediately.

Step 5: Export

Type "export as MP3 192kbps with title Episode 45, author My Podcast". Done. The session state is saved as a DAG, so you can branch it, revert any edit, or export different versions (clean edit vs. explicit version) without re-doing work.

What AI Cannot Replace (Yet)

Automated workflows do not replace critical listening. AI can normalize to a target LUFS, but it does not know if your interview guest had an unusually nasally recording environment that day. Ums and filler words can be removed automatically, but rhythm editing — making the conversation flow more naturally — still benefits from a human ear. Use AI to handle the mechanical 80% and spend your time on the creative 20%.

Multi-Guest Podcast Editing

For interviews with multiple speakers, load each recording as a separate track. edytlab's stem separation can help when you only have a mixed recording — separate the louder and quieter voices, normalize each independently, then re-mix. This is not a perfect substitute for separate track recording, but it is production-viable for remote interviews recorded on a single channel.

edytlab is an open-source, local-first AI audio editor. Download the latest release or star it on GitHub.