Features — DictDrop Voice-to-Text for Developers

Transcription engine

Sub-300ms — fast enough to feel instant

DictDrop uses Groq's fastest speech recognition model, purpose-built for real-time inference, which is why its latency is dramatically lower than that of cloud APIs built on shared infrastructure. Most dictations complete in 200–350ms, measured from when you stop speaking to when text appears.

Accuracy on technical vocabulary (variable names, framework names, domain jargon) is significantly better than with Windows Speech Recognition or browser dictation.

Groq speech recognition

< 300ms end-to-end

Your key · Your account · Zero middleman

Text injection

Direct-to-cursor — no clipboard, no paste flicker

Most "voice-to-text" tools copy the transcript to the clipboard and send Ctrl+V. This overwrites whatever you had copied, creates a visible paste flash, and fails in rich text editors that intercept paste events.

DictDrop delivers text directly to wherever your cursor is — at the OS level, not the clipboard level. The receiving app cannot tell the difference from physical keyboard input.
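The page does not say which OS mechanism DictDrop uses. As an illustration only, one common Windows approach is to send each character as a Unicode key-down/key-up pair through the Win32 SendInput call. The minimal sketch below shows how a transcript could be split into such events; the function name text_to_key_events is hypothetical, and the two flag values are the documented Win32 constants.

```python
# Win32 keyboard-event flags (values from the Windows SDK headers).
KEYEVENTF_KEYUP = 0x0002
KEYEVENTF_UNICODE = 0x0004


def text_to_key_events(text):
    """Return (scan_code, flags) pairs for a transcript.

    Hypothetical helper: emits one key-down and one key-up event per
    UTF-16 code unit, which is the unit Windows expects when the
    KEYEVENTF_UNICODE flag is set. Characters outside the Basic
    Multilingual Plane become two code units (a surrogate pair).
    """
    events = []
    data = text.encode("utf-16-le")
    for i in range(0, len(data), 2):
        unit = int.from_bytes(data[i:i + 2], "little")
        events.append((unit, KEYEVENTF_UNICODE))                    # key down
        events.append((unit, KEYEVENTF_UNICODE | KEYEVENTF_KEYUP))  # key up
    return events
```

On Windows, each pair would fill the wScan and dwFlags fields of a KEYBDINPUT structure passed to SendInput. Nothing in this path reads or writes the clipboard.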

✗ Clipboard paste — overwrites your copy buffer

✓ DictDrop — text appears at cursor, clipboard untouched

Privacy architecture

BYOK — your audio, your key, your Groq account

DictDrop has no backend server. When you dictate, audio goes: your mic → Groq API (authenticated with your personal key) → transcript returned → injected at cursor. We are not in that chain at any step.
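To make that chain concrete, here is a minimal sketch of the single outbound request such a client might build, using only the Python standard library. It is not DictDrop's actual code. The endpoint is Groq's OpenAI-compatible transcription URL; the model name, the file name dictation.wav, and the function build_transcription_request are assumptions for illustration, since the page does not name the model.

```python
import urllib.request
import uuid

# Groq's OpenAI-compatible speech-to-text endpoint.
GROQ_URL = "https://api.groq.com/openai/v1/audio/transcriptions"


def build_transcription_request(api_key, wav_bytes, model="whisper-large-v3-turbo"):
    """Build (but do not send) a multipart POST carrying recorded audio.

    Hypothetical helper. The model default is an assumption; the source
    only says "Groq's fastest speech recognition model".
    """
    boundary = uuid.uuid4().hex
    body = b"".join([
        f'--{boundary}\r\nContent-Disposition: form-data; name="model"'
        f"\r\n\r\n{model}\r\n".encode(),
        f'--{boundary}\r\nContent-Disposition: form-data; name="file"; '
        f'filename="dictation.wav"\r\nContent-Type: audio/wav\r\n\r\n'.encode(),
        wav_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        GROQ_URL,
        data=body,
        method="POST",
        headers={
            # The user's own key authenticates the call directly with Groq.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

Passing the result to urllib.request.urlopen would send the audio straight to api.groq.com. The only host in the request is Groq's, which is the point of the BYOK design.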

Your Groq account's free tier covers hundreds of hours of dictation per month at zero marginal cost. If you go heavy, Groq's pay-as-you-go pricing is a fraction of a cent per minute.

🎙 Your mic → Groq (your key) → 📝 Cursor

No DictDrop servers in this chain

Status indicator (V1.2)

Always visible, never intrusive

V1.2 introduced a lightweight floating status dot — a small circle that shows recording state without creating a taskbar entry or stealing focus from your work. It runs silently in the background at all times.

Green = actively recording. White = idle. The dot appears at a fixed corner position you can configure in settings.json. At rest, it consumes zero CPU.
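The settings.json schema is not documented on this page, so the sketch below is purely illustrative. It assumes a hypothetical key, status_dot_corner, with four allowed values and an assumed default of bottom-right, and shows how a reader of the file could fall back safely when the value is missing or invalid.

```python
import json

# Hypothetical values: the real key name, options, and default are not
# given in the source.
VALID_CORNERS = {"top-left", "top-right", "bottom-left", "bottom-right"}
DEFAULT_CORNER = "bottom-right"


def dot_corner(settings_text):
    """Return the configured corner, or the default if unset or invalid."""
    try:
        settings = json.loads(settings_text)
    except json.JSONDecodeError:
        return DEFAULT_CORNER
    if not isinstance(settings, dict):
        return DEFAULT_CORNER
    corner = settings.get("status_dot_corner", DEFAULT_CORNER)
    if isinstance(corner, str) and corner in VALID_CORNERS:
        return corner
    return DEFAULT_CORNER
```

Under these assumptions, a file containing {"status_dot_corner": "top-left"} would move the dot to the top-left corner, and a malformed file would leave it at the default.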

Recording — speak now

Idle — ready to record

All features

Everything in the box

One purchase. All of this. Today.

Sub-300ms latency

Groq's fastest inference — faster than any cloud GPU-based transcription service.

BYOK privacy model

Bring your own Groq key. We are not in the audio pipeline. Zero data retention on our end.

Direct keyboard injection

Text lands at your cursor — no clipboard overwrite, no paste, no flicker. Indistinguishable from keyboard input.

Universal app compatibility

Browser, Electron, native desktop, legacy apps — any window that accepts keyboard input.

Configurable hotkey

Default Ctrl+Shift. Change it to any modifier combo in settings.json.
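As a rough illustration of what "any modifier combo" could mean in practice, the sketch below parses a hotkey string such as ctrl+shift and rejects anything that is not a modifier key. The string format, the list of accepted modifier names, and the function parse_hotkey are all assumptions; the page does not show how settings.json stores the hotkey.

```python
# Assumed set of modifier names; the real accepted spellings are not
# documented in the source.
MODIFIERS = {"ctrl", "shift", "alt", "win"}


def parse_hotkey(spec="ctrl+shift"):
    """Parse a '+'-separated modifier combo into a set of key names.

    Hypothetical helper. Raises ValueError if any part is not a
    modifier, since the hotkey is described as a modifier-only combo.
    """
    keys = [part.strip().lower() for part in spec.split("+")]
    unknown = [key for key in keys if key not in MODIFIERS]
    if unknown:
        raise ValueError(f"not a modifier key: {unknown}")
    return frozenset(keys)
```

With this sketch, "Alt + Win" parses to the alt and win keys, while "ctrl+k" is rejected because k is not a modifier.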

Floating status indicator

Lightweight floating dot. No taskbar entry. Runs silently — zero CPU at rest.

30 MB installer

Fully self-contained — bundled runtime, zero dependencies to manage, no PATH changes needed.

Lifetime updates

All future version updates are included. No upgrade pricing, no annual renewal.

Any microphone

Uses your system default audio device. Works with headsets, USB mics, and built-in mics: anything Windows recognises.

Roadmap

Coming next

These are actively in design. Sign up at the bottom to get notified when they ship.

Multi-monitor support

Status dot follows active window across monitors automatically.

Voice sensitivity control

Adjustable detection threshold — fewer false triggers in noisy environments.

Offline mode

Local speech recognition — transcription without any internet connection.

All of this.
$49 once.

No tiers, no trials, no monthly fee. One installer, every feature, forever.

🎙 Get DictDrop — $49

Windows 10 / 11 · Instant download · No account required

Get notified when roadmap items ship

One email per release — no newsletters, no promotions.

Every feature. No fluff.