DeLive

🎙️

Twelve ASR Backends, One UI

Soniox, Volcengine, ElevenLabs, Mistral AI, Gladia, Deepgram, AssemblyAI, Cloudflare Workers AI, SiliconFlow, Groq, local OpenAI-compatible, and local whisper.cpp — three execution modes in one app.

🧠

AI Review Desk

Full-page workspace with four tabs — Transcript (with AI side panel), Summary (overview, action items, keywords), multi-thread Chat with streaming output, and Markmap mind maps. AI Correction with smart text-source selection.

💬

Floating Caption Overlay

Always-on-top, draggable subtitle window with source, translated, and dual-line bilingual modes. Fully customizable font, color, shadow, and background.

🔒

Local-First & Secure

Sessions in IndexedDB, secrets in Electron safeStorage, context isolation, trusted-window IPC, CSP injection, and navigation guards. Your data never leaves your machine.

🌐

Open API & MCP Ecosystem

Local REST API (8 endpoints), real-time WebSocket streaming, standalone MCP server for Claude Desktop and Cursor, Agent Skill definition, and Agent Skills for one-call transcription inside any agent.

📁

File Transcription

Upload audio or video files and transcribe them offline using ten cloud ASR engines. Supports speaker diarization, word-level timestamps, and multi-language detection.

🎨

Eight Themes, Light & Dark

Violet, Cyan, Rose, Green, Amber, Pink, Slate, and Orange accent palettes — each with full light and dark mode. New persistent sidebar navigation and Command Palette (Ctrl+K).

What's New

Premium UI overhaul, eight color themes, AI streaming chat, real-time waveform, and more

📁

File Transcription

Upload audio or video files and transcribe them offline. Ten cloud ASR engines supported — Soniox, Volcengine, ElevenLabs, Mistral, Gladia, Deepgram, AssemblyAI, Cloudflare, SiliconFlow, and Groq — with speaker diarization and word-level timestamps.

☁️

Cloudflare Workers AI

New ASR provider — Whisper-based transcription via Cloudflare Workers AI. Low cost with generous free tier, VAD filter, and anti-hallucination.

🎙️

Gladia Realtime ASR

Solaria-1 real-time streaming with sub-300ms latency and 100+ language support. Embedded proxy handles session init and authentication.

🤖

AI Correction Enhancements

Persisted correction streaming text across tab switches, real-time progress display (character count, elapsed time), and improved AI analysis status tracking.

🧠

Smart Text-Source Selection

AI post-processing now auto-selects corrected transcript when available. Configurable preference (Auto / Always Original / Always Corrected) with real-time status banners.

📋

Reordered Provider List

Provider selection reordered to: Soniox, Volcengine, ElevenLabs, Mistral AI, Gladia, Deepgram, AssemblyAI, Cloudflare, SiliconFlow, Groq, Local OpenAI, whisper.cpp.

🧪

314 Tests Passing

Expanded test suite with 314 tests across 32 files, including AI correction, hypothesis buffer, PCM/WAV encoding, and all previous coverage areas.

🔄

Refactored Windowed Batch

WindowedBatchTranscriptionProvider base class extracted — shared logic for interval-based retranscription, silence detection, and hypothesis buffer management.

🌍

Electron Main Process i18n

Main process tray menu, dialog titles, and system notifications now respect the user's language setting.

DeLiveDesktop Transcription Workspace

Twelve ASR Backends, One UI

AI Review Desk

Floating Caption Overlay

Local-First & Secure

Open API & MCP Ecosystem

File Transcription

Eight Themes, Light & Dark

See It In Action

What's New

File Transcription

Cloudflare Workers AI

Gladia Realtime ASR

AI Correction Enhancements

Smart Text-Source Selection

Reordered Provider List

314 Tests Passing

Refactored Windowed Batch

Electron Main Process i18n

Cross-Platform