simple monthly pricing

No per-minute fees. No API keys. No CUDA tax.

Free to run for 15 days, then $9.99/mo Pro or $14.99/mo Studio. Cloud dictation meters you per minute and uploads every word; Voxmelt is unmetered by design - one flat price on hardware you already own. Cancel anytime - privacy by architecture, not by terms of service no one reads.

Free
$0
15 day full trial

Prove it on your GPU. 15-day trial of everything, then 30 min/day dictation plus a daily AI taste: the starter tone of every template pack. Free forever, no card.

Download
Pro
$9.99
per month

Dictate all day, polished. Unlimited dictation, unlimited AI runs on every everyday template, models up to 14B, 5 custom templates, every tone shift, and read-aloud in any voice with WAV export.

Get Pro
Most Popular
Studio
$14.99
per month
Recommended · Best value

The AI power-user studio. Everything in Pro, plus the AI Prompt and Code packs, every model up to 70B, 30 custom templates, direct line.

Get Studio
Full comparison
Hover a group to focus
FeatureFreeProStudio
Recording
Whisper large-v3 accuracy
All Whisper model sizes (tiny → large-v3)
Daily dictation30 min/dayUnlimitedUnlimited
Continuous dictation (overlapping chunks)
Global hotkey · auto paste · voice commands · Mini Mode
AI post processing
AI cleanup runs10/dayUnlimitedUnlimited
AI template packs · email, social, rewrite, recaps, prompts & moreEvery pack, 2-3 free tonesAll everyday tonesEvery tone
Polyglot translation · 6 languages
Pro tones · the styled + advanced variants in each pack
Studio tones · technical specs, few-shot, worked examples, deepest formats
Custom templates (your own system prompts)1 slot5 slots30 slots
AI model catalogUp to 4BUp to 14BEverything · up to 70B
One-click tone shifts on AI output (funnier, shorter, sharper…)3 of 5 per toneAll 5All 5
Voice edits · re-style the output by speaking ("make it shorter")Set phrasesAny phrasingAny phrasing
AI Studio · run any template on pasted text (no recording)
Streaming output · cancel mid run
Read aloud
Read AI output aloud · local Kokoro voices, 0 cloud10 min/dayUnlimitedUnlimited
Voice library2 voicesAll voicesAll voices
Playback speed control
Export narration to WAV
Privacy
Local processing only · 0 cloud
Air-gapped friendly · HIPAA/GDPR ready
No telemetry · no account required to record
GPU orchestrator
Live VRAM telemetry
Dual-model orchestration (Whisper + LLM on one card)
Free GPU · idle release
Updates & support
Auto updates
Priority email support
Direct line to the maintainer
why local beats cloud

Same AI. Zero cloud dependency.

Cloud dictation tools meter you per minute and upload every word you say. Voxmelt runs the same Whisper model locally on your GPU - unmetered, private, and often faster because there is no network round-trip.

Privacy

Audio never leaves your machine

Not "encrypted in transit" - never transmitted at all. Run any network monitor while you dictate and watch zero bytes leave. HIPAA and GDPR friendly by architecture, not by contract.

Cost

Flat price, no per-minute meter

Wispr Flow charges $15/mo with word limits. Otter AI meters per minute. Voxmelt Pro is $9.99/mo flat - dictate all day with zero overage fees.

Speed

No network latency - GPU-direct

20 to 100+ tokens/second on consumer GPUs. Your RTX card processes text faster than most cloud APIs can respond - and works offline, on planes, in air-gapped labs.

Freedom

Any Ollama model, instantly

Gemma 4, Qwen 3, Llama 4, DeepSeek R1, Mistral - new models release monthly and you get them the same day. No vendor lock-in, no waiting for a provider to add support.

GPU orchestration

Smart VRAM swap - no other app does this

Voxmelt juggles two GPU models (Whisper STT + an LLM) on one card with auto-eviction. A 12 GB card runs both; a 24 GB card keeps both warm. You never manage VRAM manually.

Proof

Proof, not promises

Cloud tools ask you to trust a privacy policy. Voxmelt lets you check: open a network monitor, dictate, and see for yourself that no audio leaves. No telemetry, no analytics, no tracking pixels.

See the full comparison: Voxmelt vs Wispr Flow, Superwhisper, Otter AI, and Dragon →

FAQ

Questions, answered straight

How does billing work?

Pro ($9.99/month) and Studio ($14.99/month) are monthly subscriptions billed through Razorpay. In India, plans are billed in INR (Pro ₹899/month, Studio ₹1,399/month) - Indian cards and UPI can only be charged in rupees, so pick INR at checkout if your card was issued in India. No per-minute metering, no API keys. Subscriptions are cancellation-only and non-refundable - cancel anytime and you keep access through the end of the cycle you already paid for, then revert to the free tier.

Do you offer refunds?

No - all sales are final and purchases are non-refundable. The 15-day free Studio trial (no card required) lets you fully evaluate Voxmelt on your own hardware before you pay, so once a billing cycle is charged it is not refunded, whether it is your first payment or a renewal. Only cancellation is supported: it stops future renewals, you keep full access through the cycle you have already paid for, then your account reverts to the free tier with no further charges. Full details are in the Refund & Cancellation Policy at voxmelt.com/privacy#refund.

What do I get on the free trial?

15 days of everything - the full Studio pipeline. After the trial, the free tier keeps 30 minutes/day of dictation, 10 AI cleanup runs/day on the starter tone of every template pack, 10 minutes/day of Read Aloud with 2 voices (Clara and Marcus in 3 free tones), the 6-language translator, models up to 4B, and 1 custom template slot. Compose and Voiceover work within the same daily limits. You are never locked out of your own transcripts.

Do I need an account to use it?

No account is required to record. An account only matters for managing your Pro/Studio license across machines.

What's the difference between Pro and Studio?

Pro is the everyday plan: unlimited dictation, unlimited AI runs on the everyday template set (cleanup vibes, chat replies, meeting recaps, note distilling, signal extraction, translation), all 4 Voiceover personas and 8 delivery tones with speed control and WAV export, the Compose workspace, models up to 14B, 5 custom templates, and priority email support. Studio is the power-user plan: everything in Pro, plus the AI Prompt Architect and Code Prompt packs built for Cursor and Copilot workflows, every model in the catalog up to the 70B flagships, 30 custom template slots, and a direct line to the maintainer.

Does my data ever leave my machine?

Never. Recording, transcription, and AI cleanup all run on your GPU - no cloud calls, no telemetry, no API keys. The only network traffic is license checks, billing, and update checks, and none of it carries audio or text. Turn off WiFi and dictation keeps working. That is the whole point.

Can I cancel anytime?

Yes. Cancel from Account → Billing in one click. Cancellation stops future renewals only - you keep full Pro/Studio access through the end of the billing cycle you already paid for, then your account reverts to the free tier.

How is Voxmelt different from Wispr Flow?

Wispr Flow sends your voice to the cloud and charges $15/month. Voxmelt does the same job on your own GPU for $9.99/month, and your audio never leaves your machine. You pick the AI model that cleans your text, it keeps working with WiFi off, and you can check the privacy claim yourself: open any network monitor, dictate, and watch zero audio leave your PC.

How does Voxmelt compare to Superwhisper?

Superwhisper offers cloud-tier processing that uploads your audio. Voxmelt is 100% local across all tiers - even the AI post-processing runs on your GPU via Ollama. You also get smart GPU orchestration (auto VRAM management between Whisper and the LLM), which Superwhisper lacks. Plus full Windows support from day one.

Is Voxmelt a good alternative to Otter AI or Dragon NaturallySpeaking?

Yes. Otter AI is cloud-only and bills per minute. Dragon is expensive and legacy. Voxmelt gives you the same Whisper large-v3 model that powers most modern cloud transcription, plus a local LLM that rewrites your output - all on your own hardware, for a flat monthly fee with no per-minute meter.

Can I use Voxmelt for HIPAA/GDPR-sensitive work?

Yes. Since audio and text never leave your machine, there is no data processor to sign a BAA with - the data simply never transmits. Lawyers, doctors, and researchers use Voxmelt specifically because the privacy guarantee is architectural (local processing), not contractual (a ToS promise that could change).

How accurate is it really?

Measured, not claimed: on our published benchmark, Voxmelt's local Whisper large-v3 averaged 7.8 percent word error rate across five scripted real-world clips on an RTX 3080 Ti, including 3.1 percent on fast speech with filler words and on background noise, while transcribing 10 to 15 times faster than real time. The per-clip tables, the method, and the scoring script are all public on the benchmark page, and the 15-day trial lets you test it on your own voice, which is the only benchmark that really matters.

What GPU do I need?

Any NVIDIA GPU with 6 GB+ VRAM (RTX 3060 and up). An 8 GB card runs the small Whisper model comfortably. A 12 GB card (RTX 3080 Ti, 4070) handles large-v3 plus a 4B LLM. A 24 GB card (RTX 4090) keeps both models warm simultaneously for instant responses. Workstation cards (A6000, L40S, A100) run the 70B models.