Changelog

Every release, on the record

Versions match the public release tags. Auto-updates ship to all tiers silently, so whatever you read here is already on your machine.

v1.7.0 (latest)

The stabilization release

  • Language setting now fully honored end to end: Auto-detect runs Whisper language detection, and all 10 language choices apply on the next recording without an engine restart.
  • Long dictations no longer risk a false "engine stuck" error: the final full-session accuracy pass now budgets itself against your GPU's measured speed and steps aside when the streaming transcript is already the right answer.
  • Short utterances like a lone "Yes." are no longer swallowed by the hallucination filter - real speech energy now overrides the phantom-phrase list.
  • Voice commands got a precision overhaul: saying "go" or "copy" alone no longer triggers "go ahead" or "copy that". The match threshold is higher, and partial matches now require most of the phrase.
  • GPU safety on both sides: loading Whisper now checks free VRAM and evicts the AI model when the card is full, on cold start and resume alike. No more cryptic CUDA errors when a big model is pinned.
  • Plan rebalance: the free tier now includes the flagship Casual Clean cleanup tone, Pro covers every everyday template with models up to 14B, and Studio owns the AI Prompt + Code packs plus the full 24B to 70B model garage.
  • Switching AI models or toggling AI off mid-generation now cancels the run cleanly instead of leaving it wedged.
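
For readers curious how a stricter voice-command match could behave, here is a minimal sketch. It is not Voxmelt's actual matcher: the command list, the `MATCH_THRESHOLD` and `MIN_COVERAGE` values, and the `match_command` helper are all assumptions for illustration. It shows a coverage gate (most of the phrase's words must be spoken) combined with a similarity threshold.

```python
from difflib import SequenceMatcher
from typing import Optional

# Hypothetical command phrases and thresholds, chosen for illustration only.
COMMANDS = ["go ahead", "copy that", "stop recording"]
MATCH_THRESHOLD = 0.85  # assumed minimum similarity for a match
MIN_COVERAGE = 0.75     # assumed share of the phrase's words that must be spoken


def match_command(utterance: str) -> Optional[str]:
    """Return the matched command phrase, or None when nothing is close enough."""
    spoken = utterance.lower().strip(" .!?").split()
    if not spoken:
        return None
    best, best_score = None, 0.0
    for phrase in COMMANDS:
        words = phrase.split()
        # Coverage gate: a lone "go" covers only half of "go ahead".
        covered = sum(1 for w in words if w in spoken) / len(words)
        if covered < MIN_COVERAGE:
            continue
        # Similarity gate: the whole utterance must resemble the phrase.
        score = SequenceMatcher(None, " ".join(spoken), phrase).ratio()
        if score >= MATCH_THRESHOLD and score > best_score:
            best, best_score = phrase, score
    return best
```

With these assumed values, a lone "go" or "copy" is rejected by the coverage gate, while the full phrases still match.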
v1.6.8
  • Silent updates: per-user installs mean no UAC prompt on first install or any auto-update from here on.
  • Keep-both is now the default GPU mode - Whisper and your AI model stay warm together, and the models panel mirrors what is actually resident in VRAM.
  • Refreshed installer wizard art and branding.
v1.6.7
  • Theme parity with voxmelt.com: five dark themes including Candy Red and Mono, each with a matching titlebar logo.
  • Re-rendered the full icon set natively at every size - crisp from the taskbar to the Store tile.
v1.6.6
  • Rebindable global hotkeys: trigger recording from any app, with defaults chosen to never steal system-wide combos.
v1.6.5
  • Mini Mode now mirrors the main window's recording state perfectly across monitors.
  • Closing to tray keeps voice commands listening when Mini Mode is active - "go ahead" works with the main window hidden.
v1.6.4
  • The Mini Mode pill now runs its own audio analyser, so its mic animation matches the main record button frame for frame.
v1.6.3
  • App-wide zoom: Ctrl + mouse wheel, or Ctrl with +, -, or 0, from 50% to 200%, with a momentary zoom HUD.
  • Visual polish pass: refreshed call-to-action buttons and consistent plan badge colors everywhere.
v1.6.2
  • AI stall watchdog: a generation that stops producing tokens is auto-recovered with a retryable error instead of hanging on "Processing".
  • Accuracy stack tuning: chunk-overlap dedup, repetition collapse, and the hallucination filter now work as one contract.
v1.6.1
  • Honest tokens-per-second: AI speed now reports pure generation rate, with model load time shown separately.
  • Eliminated up to two hidden model reloads before the first token in the auto-process path.
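
As a rough illustration of the difference between a pure generation rate and one that folds in load time, the sketch below computes both. It assumes the timing fields that Ollama returns with a finished generation (`eval_count`, `eval_duration`, `load_duration`, with durations in nanoseconds). Whether Voxmelt reads exactly these fields is an assumption, and the `generation_stats` helper is hypothetical.

```python
NANOS_PER_SECOND = 1_000_000_000


def generation_stats(response: dict) -> dict:
    """Split a finished generation into pure token rate and model load time."""
    eval_seconds = response["eval_duration"] / NANOS_PER_SECOND
    load_seconds = response.get("load_duration", 0) / NANOS_PER_SECOND
    tokens = response["eval_count"]
    # Pure rate: tokens divided by the time spent generating them.
    pure_rate = tokens / eval_seconds if eval_seconds > 0 else 0.0
    # Blended rate: what the number looks like when load time is mixed in.
    total_seconds = eval_seconds + load_seconds
    blended_rate = tokens / total_seconds if total_seconds > 0 else 0.0
    return {
        "tokens_per_second": pure_rate,
        "load_seconds": load_seconds,
        "blended_tokens_per_second": blended_rate,
    }
```

For example, 300 tokens generated in 6 seconds after a 4-second load is 50 tok/s of pure generation, but only 30 tok/s if the load time is counted in.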
v1.6.0

GPU-only, guaranteed

  • New invariant: Voxmelt never runs a model partially on the CPU. Every load path now checks the fit before a single byte hits VRAM, and over-budget models get a clear popup instead of a 2 tok/s crawl.
  • Smart swap: when Whisper must yield to a big AI model, it suspends in place and resumes in 2 to 3 seconds instead of paying a full cold start.
  • Co-residency warmup: the AI model starts loading during the final transcription pass, so cleanup begins the instant your transcript is ready.
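
The fit check and smart swap described above can be pictured as one decision made before any load begins. The sketch below is a simplified illustration, not the shipped logic: the `plan_load` helper, the 512 MiB headroom, and the three outcome names are assumptions, and the free-VRAM figure would in practice come from the GPU driver.

```python
def plan_load(model_mib: int, free_mib: int, whisper_mib: int,
              headroom_mib: int = 512) -> str:
    """Decide how to load an AI model without ever spilling onto the CPU.

    Returns "load" when the model fits as-is, "swap" when it fits only
    after Whisper is suspended, and "refuse" when it cannot fit at all.
    """
    needed = model_mib + headroom_mib  # assumed safety margin on top of the model
    if needed <= free_mib:
        return "load"
    if needed <= free_mib + whisper_mib:
        # Suspending Whisper frees its VRAM for the larger model.
        return "swap"
    # Over budget: report it clearly instead of offloading layers to the CPU.
    return "refuse"
```

On a card with 8000 MiB free and a 3000 MiB Whisper model resident, a 6000 MiB model loads directly, a 9000 MiB model triggers a swap, and a 20000 MiB model is refused.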
v1.5.1
  • Fixed: switching the Whisper model size in settings now actually loads the new model on your next recording.
v1.5.0
  • VRAM numbers you can trust: one source of truth for every model footprint, corrected by real measurements from your own card after first load.
  • New int8_float16 precision option: half the Whisper VRAM with near-identical accuracy, for running both engines on tighter cards.
v1.4.0

The model garage

  • Curated AI model catalog from 3B to 70B: Gemma, Qwen, Llama, DeepSeek R1, Mistral, and MoE flagships, each with measured VRAM and strengths.
  • The picker filters to what your card can actually run, and downloads stream in-app with live progress.
v1.3.0

No-Hands Mode

  • Voice commands: start, stop, copy, clear, and load or unload models entirely by voice, powered by an always-on tiny model that coexists with the main engine.
  • Mini Mode: an always-on-top pill with live waveform, perfect for dictating into another app on a second monitor.
v1.2.0

The template library

  • AI templates organized by persona: Voice Polisher, Ghostwriter, Data Architect, Polyglot, and more, with multiple tones per template.
  • Custom templates: write your own system prompt, pick a persona, and it runs 100% on your GPU like every built-in.
  • Template discovery page with search and per-tone live samples.
v1.1.0

Free for everyone, fairly

  • Freemium launch: 15-day full trial of everything, then a free tier that stays genuinely useful - no card required.
  • Pro and Studio subscriptions via Razorpay, with cryptographically signed entitlements verified in compiled code.
  • Trial is anchored to your device, so a reinstall picks up where it left off.
v0.3.0
  • Accounts: email + password and one-click Google sign-in, used only to manage a paid license. Recording never requires an account.
  • Session history: every dictation saved locally, searchable, with optional AI output alongside.
v0.2.0
  • The second brain: local AI post-processing through Ollama - cleanup, summarize, reformat - streamed token by token, on the same GPU.
  • Dark theme system with live accent switching.
v0.1.2
  • System tray with close-to-tray: the X button hides the window and keeps your hotkey armed and models warm.
  • Branded installer, mic permission handled silently.
v0.1.1
  • Zero-terminal first-run wizard: Ollama, Whisper, and your first AI model installed with clicks, not commands.
  • Clear, actionable errors for driver and CUDA version issues.
v0.1.0

First public build

  • Whisper large-v3 dictation on your own NVIDIA GPU: press record, speak, get text in near real time.
  • Continuous dictation with overlapping chunks and boundary dedup for unlimited session length.
  • Global hotkey with auto-copy and auto-paste into whatever app has focus.
  • Zero cloud calls, zero telemetry, works with the network cable unplugged. The whole point, from day one.
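
To make "overlapping chunks and boundary dedup" concrete, here is a minimal word-level sketch. It assumes each audio chunk is transcribed separately and that the tail of one transcript repeats at the head of the next. The `merge_chunks` and `_normalize` helpers and the 12-word overlap cap are hypothetical, not the engine's real implementation.

```python
from typing import List


def _normalize(words: List[str]) -> List[str]:
    """Lowercase and strip trailing punctuation so "Speak," equals "speak"."""
    return [w.lower().strip(".,!?;:") for w in words]


def merge_chunks(previous: List[str], current: List[str],
                 max_overlap: int = 12) -> List[str]:
    """Append `current` to `previous`, dropping words repeated at the boundary."""
    limit = min(max_overlap, len(previous), len(current))
    # Try the longest possible overlap first, then shrink.
    for size in range(limit, 0, -1):
        if _normalize(previous[-size:]) == _normalize(current[:size]):
            return previous + current[size:]
    # No shared boundary words: plain concatenation.
    return previous + current
```

For example, merging "press record and speak" with "and speak to get text" yields "press record and speak to get text", with the repeated "and speak" kept only once.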