Changelog

Every release, on the record

Name: Voxmelt
Availability: InStock

Versions match the public release tags. Auto-updates ship to all tiers silently, so whatever you read here is already on your machine.

v1.7.0latest

The stabilization release

Language setting now fully honored end to end: Auto-detect runs Whisper language detection, and all 10 language choices apply on the next recording without an engine restart.
Long dictations no longer risk a false "engine stuck" error: the final full-session accuracy pass now budgets itself against your GPU's measured speed and steps aside when the streaming transcript is already the right answer.
Short utterances like a lone "Yes." are no longer swallowed by the hallucination filter - real speech energy now overrides the phantom-phrase list.
Voice commands got a precision overhaul: saying "go" or "copy" alone no longer triggers "go ahead" or "copy that". Match threshold raised, partial matches now require most of the phrase.
GPU safety on both sides: loading Whisper now checks free VRAM and evicts the AI model when the card is full, on cold start and resume alike. No more cryptic CUDA errors when a big model is pinned.
Plan rebalance: the free tier now includes the flagship Casual Clean cleanup tone, Pro covers every everyday template with models up to 14B, and Studio owns the AI Prompt + Code packs plus the full 24B to 70B model garage.
Switching AI models or toggling AI off mid-generation now cancels the run cleanly instead of leaving it wedged.

v1.6.8

Silent updates: per-user installs mean no UAC prompt on first install or any auto-update from here on.
Keep-both is now the default GPU mode - Whisper and your AI model stay warm together, and the models panel mirrors what is actually resident in VRAM.
Refreshed installer wizard art and branding.

v1.6.7

Theme parity with voxmelt.com: five dark themes including Candy Red and Mono, each with a matching titlebar logo.
Re-rendered the full icon set natively at every size - crisp from the taskbar to the Store tile.

v1.6.6

Rebindable global hotkeys: trigger recording from any app, with defaults chosen to never steal system-wide combos.

v1.6.5

Mini Mode now mirrors the main window's recording state perfectly across monitors.
Closing to tray keeps voice commands listening when Mini Mode is active - "go ahead" works with the main window hidden.

v1.6.4

The Mini Mode pill now runs its own audio analyser, so its mic animation matches the main record button frame for frame.

v1.6.3

App-wide zoom: Ctrl + mouse wheel or Ctrl +/- /0, from 50% to 200%, with a momentary zoom HUD.
Visual polish pass: refreshed call-to-action buttons and consistent plan badge colors everywhere.

v1.6.2

AI stall watchdog: a generation that stops producing tokens is auto-recovered with a retryable error instead of hanging on "Processing".
Accuracy stack tuning: chunk-overlap dedup, repetition collapse, and the hallucination filter now work as one contract.

v1.6.1

Honest tokens-per-second: AI speed now reports pure generation rate, with model load time shown separately.
Eliminated up to two hidden model reloads before the first token in the auto-process path.

v1.6.0

GPU-only, guaranteed

New invariant: Voxmelt never runs a model partially on the CPU. Every load path now checks the fit before a single byte hits VRAM, and over-budget models get a clear popup instead of a 2 tok/s crawl.
Smart swap: when Whisper must yield to a big AI model, it suspends in place and resumes in 2 to 3 seconds instead of paying a full cold start.
Co-residency warmup: the AI model starts loading during the final transcription pass, so cleanup begins the instant your transcript is ready.

v1.5.1

Fixed: switching the Whisper model size in settings now actually loads the new model on your next recording.

v1.5.0

VRAM numbers you can trust: one source of truth for every model footprint, corrected by real measurements from your own card after first load.
New int8_float16 precision option: half the Whisper VRAM with near-identical accuracy, for running both engines on tighter cards.

v1.4.0

The model garage

Curated AI model catalog from 3B to 70B: Gemma, Qwen, Llama, DeepSeek R1, Mistral, and MoE flagships, each with measured VRAM and strengths.
The picker filters to what your card can actually run, and downloads stream in-app with live progress.

v1.3.0

No-Hands Mode

Voice commands: start, stop, copy, clear, and load or unload models entirely by voice, powered by an always-on tiny model that coexists with the main engine.
Mini Mode: an always-on-top pill with live waveform, perfect for dictating into another app on a second monitor.

v1.2.0

The template library

AI templates organized by persona: Voice Polisher, Ghostwriter, Data Architect, Polyglot, and more, with multiple tones per template.
Custom templates: write your own system prompt, pick a persona, and it runs 100% on your GPU like every built-in.
Template discovery page with search and per-tone live samples.

v1.1.0

Free for everyone, fairly

Freemium launch: 15-day full trial of everything, then a free tier that stays genuinely useful - no card required.
Pro and Studio subscriptions via Razorpay, with cryptographically signed entitlements verified in compiled code.
Trial is anchored to your device, so a reinstall picks up where it left off.

v0.3.0

Accounts: email + password and one-click Google sign-in, used only to manage a paid license. Recording never requires an account.
Session history: every dictation saved locally, searchable, with optional AI output alongside.

v0.2.0

The second brain: local AI post-processing through Ollama - cleanup, summarize, reformat - streamed token by token, on the same GPU.
Dark theme system with live accent switching.

v0.1.2

System tray with close-to-tray: the X button hides the window and keeps your hotkey armed and models warm.
Branded installer, mic permission handled silently.

v0.1.1

Zero-terminal first-run wizard: Ollama, Whisper, and your first AI model installed with clicks, not commands.
Clear, actionable errors for driver and CUDA version issues.

v0.1.0

First public build

Whisper large-v3 dictation on your own NVIDIA GPU: press record, speak, get text in near real time.
Continuous dictation with overlapping chunks and boundary dedup for unlimited session length.
Global hotkey with auto-copy and auto-paste into whatever app has focus.
Zero cloud calls, zero telemetry, works with the network cable unplugged. The whole point, from day one.