How to Speed Up Speaker Diarization

Speaker diarization is computationally intensive — but enabling GPU acceleration in Offline Transcriber can make it up to ~7× faster. This guide shows you how to turn on GPU support, which hardware is supported, and how to pick the right graphics adapter for your machine.

Why GPU Acceleration Matters

Diarization runs the same kind of neural-network workload as transcription itself, and graphics cards are built to run that workload many times faster than a CPU.

On our reference hardware (NVIDIA RTX 1080 Ti) a 12-minute audio file finishes diarization in about 1 minute on GPU vs ~7 minutes on CPU — roughly a 7× speedup. Newer or higher-tier GPUs can deliver even larger gains.

Heads up: The Graphics Adapter setting is global — it affects transcription, diarization, and other AI-powered features in Offline Transcriber. Switching it changes performance and behavior across the whole app, not just diarization.

Supported Hardware

Windows
Offline Transcriber uses DirectML for GPU acceleration. Any graphics card supported by DirectML works — this covers most modern NVIDIA, AMD, and Intel GPUs. DirectML ships with modern Windows 10 and Windows 11; you just need up-to-date GPU drivers.
macOS
Offline Transcriber uses the Apple Neural Engine (ANE), which is built into Apple Silicon chips (M1 and later). Intel-based Macs are not supported for GPU acceleration and will run on CPU.

How to Enable GPU Acceleration

  1. Open Settings

    Launch Offline Transcriber and open the Settings panel.

  2. Go to Graphics Adapter

    Navigate to Settings → Graphics Adapter. You'll see a list of every compatible GPU detected on your system, plus a "Not use GPU" option to fall back to CPU.

  3. Choose Your Best Graphics Card

    Select the most powerful GPU available. If your machine has both a discrete card (e.g. NVIDIA RTX, AMD Radeon RX) and an integrated GPU (Intel UHD, AMD Vega), always pick the discrete card — it will deliver substantially better performance.

  4. Run Diarization as Usual

    Once selected, GPU acceleration is active for all subsequent diarization (and transcription) runs. No per-job toggle is needed.

Choosing Between Multiple GPUs

If you have more than one supported GPU installed, Offline Transcriber lists each one separately under Graphics Adapter. A few rules of thumb when picking:

  • Discrete over integrated: Always prefer your dedicated card over an integrated GPU.
  • Newer generation beats older flagship: A newer mid-tier card often outperforms an older high-end card on ML workloads.
  • More VRAM gives headroom: Diarization itself doesn't need much VRAM, but more headroom means you can keep other GPU apps running alongside.

When to Disable GPU

There are a few legitimate reasons to switch back to "Not use GPU":

  • The GPU is needed elsewhere: Gaming, video editing, or other ML workloads on the same card.
  • Driver issues: If transcription or diarization throws errors that disappear on CPU, fall back to CPU while you update drivers.
  • Battery life on laptops: CPU may be more efficient on battery for short jobs.

Troubleshooting

My GPU isn't listed

On Windows, make sure your graphics driver is up to date and that you're on a modern version of Windows 10 or Windows 11 (DirectML requires modern Windows). On macOS, GPU acceleration requires an Apple Silicon Mac — Intel-based Macs are not supported and will fall back to CPU automatically.

Diarization isn't noticeably faster on GPU

First, confirm you actually selected a GPU under Settings → Graphics Adapter and didn't leave it on "Not use GPU". If multiple GPUs are listed, verify you've chosen the most powerful one (typically the discrete card, not integrated graphics).

I switched GPU and now my transcriptions behave differently

This is expected. The Graphics Adapter setting is global — it changes which device handles transcription, diarization, and other AI features. You don't need to switch it per task.

Privacy Note

GPU acceleration runs 100% locally, exactly like the CPU path. No audio data leaves your computer regardless of which device you select.