Skip to main content

Automatic Dialogue Editor

Alt text describing the image Strip silence can’t tell a pause from a new take. Dialogue editing in Reagent transcribes your audio and uses LLM to identify each take, then splits them into separate items — trimmed, renamed to match the spoken text, and color-coded by quality. This means it can:
  • Distinguish takes from pauses — a breath or hesitation within a take won’t cause a false split
  • Detect retakes — recognizes when the speaker starts the same line over, even without a long silence
  • Handle meta-commentary — filters out things like “let me try that again” between takes
  • Classify take quality — marks each take as good (complete reading) or incomplete (cut off or interrupted), color-coding incomplete takes for quick visual identification
Edit VO
Edit the dialogue on the VO track and remove the incomplete takes

Analysis Modes

When you ask Reagent to analyze dialogue, it will ask you to choose between two modes:
  • Voice-takes — For studio recordings where a speaker reads lines with multiple takes per line. Reagent identifies individual takes, detects retakes, and classifies quality.
  • Podcast — For long-form recordings with multiple speakers. Reagent aligns your audio against a reference script and identifies sections to keep or cut.
You can also specify the mode directly in your message:
Edit VO using voice-takes mode
Edit podcast recording using the attached script

Podcast Mode

Podcast mode is designed for editing long-form recordings like podcasts, audio books, interviews, or narration sessions. When using podcast mode with a reference script, Reagent:
  • Aligns audio to your script — matches spoken sections to their corresponding lines in the script
  • Shows suggested edits — gaps and off-script sections appear as red (incomplete) items in REAPER, so you can preview what will be cut before committing
Edit this podcast using the attached script as reference
Review the red items in REAPER to see what Reagent suggests cutting, then ask it to remove the incomplete takes when you’re ready.

Pipeline Stages

Under the hood, dialogue editing runs through three stages. You can let Reagent handle all three automatically, or run them individually for more control:
  1. Transcribe — Listens to your voice recordings and generates word-level transcriptions with timestamps. Results are stored on each item for use in the next stages.
  2. Analyze — Reads the transcriptions and detects take boundaries, computes trim points, and classifies each take as good or incomplete. You can also provide custom analysis instructions instead of standard take detection — for example, finding all speaker names, marking questions vs. statements, or identifying topic changes.
  3. Apply Edits — Splits items at the detected take boundaries, trims silence, and renames each take to match the spoken text.
Running stages individually is useful when you want to inspect transcriptions before applying edits, adjust settings between steps, or use the analysis stage with custom instructions for non-standard workflows.
Transcribe the dialogue on the VO track
Now analyze it and apply the edits with 50ms padding

Script Reference Files

If you have a script, cue sheet, or line list for your recording, you can provide it as a reference file to improve take classification accuracy. Reagent compares the transcribed audio against your script to better identify which takes are complete readings and which are partial or off-script. Supported formats: CSV, TSV, TXT, XLSX, PDF, DOCX You can attach reference files directly in the chat input using the + button or by pasting the file path:
Edit the voice takes on this item using the attached script as reference
When a script reference is provided, takes that don’t match any line in the script are automatically removed.

Rename from Text

By default, edited media items will be renamed to match what was actually said. This makes it easy to identify takes at a glance in REAPER.

Volume Leveling

Automatically level dialogue or vocal recordings by generating a take volume envelope that normalizes dynamic range. Works on any voice, dialogue, or vocal items.
Level the volume on the selected items
Apply volume leveling to selected items targeting -23 LUFS
You can control the following parameters through natural language:
ParameterDefaultWhat it controls
Target level-18 dBIf you want to use LUFS specifically say “lufs” in your prompt otherwise this defaults to peak loudness.
Gain range-12 to +12 dBMin/max gain adjustment — limits how much the leveler can cut or boost
Attack20 msHow quickly gain increases — lower values react faster to quiet sections
Release80 msHow quickly gain decreases — lower values react faster to loud sections
Lookahead50 msShifts the envelope earlier to anticipate level changes
Window size50 msAnalysis window — larger windows produce smoother, less reactive leveling
Smoothing passes2Number of smoothing passes on the envelope to reduce artifacts
Level the selected items with faster attack (10ms) for more aggressive leveling
Apply gentle volume leveling with only 6dB of gain range
Level these items targeting -14dB with a 100ms window for smoother results