Changelog
Every release, on the record
Versions match the public release tags. Auto-updates ship to all tiers silently, so whatever you read here is already on your machine.
v1.7.0latest
The stabilization release
- Language setting now fully honored end to end: Auto-detect runs Whisper language detection, and all 10 language choices apply on the next recording without an engine restart.
- Long dictations no longer risk a false "engine stuck" error: the final full-session accuracy pass now budgets itself against your GPU's measured speed and steps aside when the streaming transcript is already the right answer.
- Short utterances like a lone "Yes." are no longer swallowed by the hallucination filter - real speech energy now overrides the phantom-phrase list.
- Voice commands got a precision overhaul: saying "go" or "copy" alone no longer triggers "go ahead" or "copy that". Match threshold raised, partial matches now require most of the phrase.
- GPU safety on both sides: loading Whisper now checks free VRAM and evicts the AI model when the card is full, on cold start and resume alike. No more cryptic CUDA errors when a big model is pinned.
- Plan rebalance: the free tier now includes the flagship Casual Clean cleanup tone, Pro covers every everyday template with models up to 14B, and Studio owns the AI Prompt + Code packs plus the full 24B to 70B model garage.
- Switching AI models or toggling AI off mid-generation now cancels the run cleanly instead of leaving it wedged.
v1.6.8
- Silent updates: per-user installs mean no UAC prompt on first install or any auto-update from here on.
- Keep-both is now the default GPU mode - Whisper and your AI model stay warm together, and the models panel mirrors what is actually resident in VRAM.
- Refreshed installer wizard art and branding.
v1.6.7
- Theme parity with voxmelt.com: five dark themes including Candy Red and Mono, each with a matching titlebar logo.
- Re-rendered the full icon set natively at every size - crisp from the taskbar to the Store tile.
v1.6.6
- Rebindable global hotkeys: trigger recording from any app, with defaults chosen to never steal system-wide combos.
v1.6.5
- Mini Mode now mirrors the main window's recording state perfectly across monitors.
- Closing to tray keeps voice commands listening when Mini Mode is active - "go ahead" works with the main window hidden.
v1.6.4
- The Mini Mode pill now runs its own audio analyser, so its mic animation matches the main record button frame for frame.
v1.6.3
- App-wide zoom: Ctrl + mouse wheel or Ctrl +/- /0, from 50% to 200%, with a momentary zoom HUD.
- Visual polish pass: refreshed call-to-action buttons and consistent plan badge colors everywhere.
v1.6.2
- AI stall watchdog: a generation that stops producing tokens is auto-recovered with a retryable error instead of hanging on "Processing".
- Accuracy stack tuning: chunk-overlap dedup, repetition collapse, and the hallucination filter now work as one contract.
v1.6.1
- Honest tokens-per-second: AI speed now reports pure generation rate, with model load time shown separately.
- Eliminated up to two hidden model reloads before the first token in the auto-process path.
v1.6.0
GPU-only, guaranteed
- New invariant: Voxmelt never runs a model partially on the CPU. Every load path now checks the fit before a single byte hits VRAM, and over-budget models get a clear popup instead of a 2 tok/s crawl.
- Smart swap: when Whisper must yield to a big AI model, it suspends in place and resumes in 2 to 3 seconds instead of paying a full cold start.
- Co-residency warmup: the AI model starts loading during the final transcription pass, so cleanup begins the instant your transcript is ready.
v1.5.1
- Fixed: switching the Whisper model size in settings now actually loads the new model on your next recording.
v1.5.0
- VRAM numbers you can trust: one source of truth for every model footprint, corrected by real measurements from your own card after first load.
- New int8_float16 precision option: half the Whisper VRAM with near-identical accuracy, for running both engines on tighter cards.
v1.4.0
The model garage
- Curated AI model catalog from 3B to 70B: Gemma, Qwen, Llama, DeepSeek R1, Mistral, and MoE flagships, each with measured VRAM and strengths.
- The picker filters to what your card can actually run, and downloads stream in-app with live progress.
v1.3.0
No-Hands Mode
- Voice commands: start, stop, copy, clear, and load or unload models entirely by voice, powered by an always-on tiny model that coexists with the main engine.
- Mini Mode: an always-on-top pill with live waveform, perfect for dictating into another app on a second monitor.
v1.2.0
The template library
- AI templates organized by persona: Voice Polisher, Ghostwriter, Data Architect, Polyglot, and more, with multiple tones per template.
- Custom templates: write your own system prompt, pick a persona, and it runs 100% on your GPU like every built-in.
- Template discovery page with search and per-tone live samples.
v1.1.0
Free for everyone, fairly
- Freemium launch: 15-day full trial of everything, then a free tier that stays genuinely useful - no card required.
- Pro and Studio subscriptions via Razorpay, with cryptographically signed entitlements verified in compiled code.
- Trial is anchored to your device, so a reinstall picks up where it left off.
v0.3.0
- Accounts: email + password and one-click Google sign-in, used only to manage a paid license. Recording never requires an account.
- Session history: every dictation saved locally, searchable, with optional AI output alongside.
v0.2.0
- The second brain: local AI post-processing through Ollama - cleanup, summarize, reformat - streamed token by token, on the same GPU.
- Dark theme system with live accent switching.
v0.1.2
- System tray with close-to-tray: the X button hides the window and keeps your hotkey armed and models warm.
- Branded installer, mic permission handled silently.
v0.1.1
- Zero-terminal first-run wizard: Ollama, Whisper, and your first AI model installed with clicks, not commands.
- Clear, actionable errors for driver and CUDA version issues.
v0.1.0
First public build
- Whisper large-v3 dictation on your own NVIDIA GPU: press record, speak, get text in near real time.
- Continuous dictation with overlapping chunks and boundary dedup for unlimited session length.
- Global hotkey with auto-copy and auto-paste into whatever app has focus.
- Zero cloud calls, zero telemetry, works with the network cable unplugged. The whole point, from day one.