Skip to content

What is DeLive?

DeLive is a desktop transcription workspace that captures system audio, routes it through the ASR backend that fits the job, keeps everything on your machine, and turns completed transcripts into a searchable AI-powered review workspace.

Key Capabilities

Multi-Provider ASR

DeLive supports twelve ASR backends behind a single unified interface. Cloud providers require an API key — see the API Key Guide for setup instructions.

ProviderTypeModeHighlights
Soniox V4CloudReal-time streamingToken-level transcription, real-time translation, bilingual captions, speaker diarization
VolcengineCloudReal-time streamingChinese-oriented; embedded proxy handles required headers
ElevenLabsCloudReal-time streamingScribe v2 Realtime; 99 languages including Chinese
Mistral AICloudReal-time streamingVoxtral Realtime; embedded proxy handles authorization
GladiaCloudReal-time streamingSolaria-1; 100+ languages; <300ms latency; proxy handles session init
DeepgramCloudReal-time streamingNova-3 / Nova-2 streaming ASR; best for English and multilingual
AssemblyAICloudReal-time streamingUniversal-3 Pro streaming; optimized for English
Cloudflare Workers AICloudWindowed batchWhisper-based; low cost with free tier; VAD filter
SiliconFlowCloudWindowed batchSenseVoice, TeleSpeech, and Qwen Omni models
GroqCloudWindowed batchWhisper large-v3-turbo with quasi-realtime updates
Local OpenAI-compatibleLocalWindowed batchWorks with Ollama or any /v1/audio/transcriptions endpoint
Local whisper.cppLocalElectron-managed runtimeFully offline; DeLive manages the binary and model lifecycle

AI Review Desk

After recording, sessions open in a full-page review workspace with six tabs:

  • Transcript — Timestamped segments with speaker badges, export to TXT/Markdown/SRT/VTT
  • AI Correction — Quick fix (streaming rewrite) or review & fix (per-issue accept/ignore); smart text-source selection for downstream AI
  • Overview — AI briefing with summary, action items, keywords, chapters, title/tag suggestions
  • AI Analysis — Deep analysis powered by the configured AI model
  • Chat — Multi-thread AI conversation with GFM Markdown rendering and code highlighting
  • Mind Map — Generate and edit Markmap-compatible Markdown, export SVG/PNG

Review History

Open Ecosystem

DeLive exposes its data through a local API:

  • REST API — 8 endpoints for sessions, topics, tags, and recording status
  • WebSocket — Real-time transcript streaming at /ws/live
  • MCP Server — Standalone stdio server for Claude Desktop, Cursor, and other AI agents
  • Agent Skill — Structured guidance for AI agents to use DeLive's capabilities
  • Agent Skills — Install the DeLive Skill and any agent can transcribe in one call, returning transcript, summary, mind map & keywords

Local-First Architecture

  • Sessions stored in IndexedDB with an in-memory cache
  • API keys encrypted via Electron safeStorage
  • Context isolation, trusted IPC verification, CSP injection
  • Open API disabled by default with optional Bearer token authentication

Platforms

DeLive runs on Windows, macOS, and Linux.

PlatformFormats
Windows.exe installer, portable .exe
macOS.dmg (Intel x64 and Apple Silicon arm64)
Linux.AppImage, .deb

Released under the Apache 2.0 License.